Bioinformatics Advance Access originally published online on September 24, 2007
Bioinformatics 2007 23(21):2942-2944; doi:10.1093/bioinformatics/btm451
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Extending assembly of short DNA sequences to handle error
1Department of Biology, University of Carolina—Chapel Hill, Chapel Hill, NC 27599, 2Department of Genetics, Washington University School of Medicine, St. Louis, MO 63108 and 3Carolina Center for Genome Sciences, University of Carolina—Chapel Hill, Chapel Hill, NC 27599, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Inexpensive de novo genome sequencing, particularly in organisms with small genomes, is now possible using several new sequencing technologies. Some of these technologies such as that from Illumina's Solexa Sequencing, produce high genomic coverage by generating a very large number of small reads (
30 bp). While prior work shows that partial assembly can be performed by k-mer extension in error-free reads, this algorithm is unsuccessful with the sequencing error rates found in practice. We present VCAKE (Verified Consensus Assembly by K-mer Extension), a modification of simple k-mer extension that overcomes error by using high depth coverage. Though it is a simple modification of a previous approach, we show significant improvements in assembly results on simulated and experimental datasets that include error.
Availability: http://152.2.15.114/~labweb/VCAKE
Contact: william.jeck{at}gmail.com
Associate Editor: Alex Bateman
Received on July 23, 2007; revised on August 21, 2007; accepted on August 24, 2007
This article has been cited by other articles:
![]() |
M. Imelfort and D. Edwards De novo sequencing of plant genomes using second-generation technologies Brief Bioinform, November 1, 2009; 10(6): 609 - 618. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Suchland, K. M. Sandoz, B. M. Jeffrey, W. E. Stamm, and D. D. Rockey Horizontal Transfer of Tetracycline Resistance among Chlamydia spp. In Vitro Antimicrob. Agents Chemother., November 1, 2009; 53(11): 4604 - 4611. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S. Horner, G. Pavesi, T. Castrignano, P. D. De Meo, S. Liuni, M. Sammeth, E. Picardi, and G. Pesole Bioinformatics approaches for genomics and post genomics applications of next-generation sequencing Brief Bioinform, October 27, 2009; (2009) bbp046v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Schmidt, R. Sinha, B. Beresford-Smith, and S. J. Puglisi A fast hybrid short read fragment assembly algorithm Bioinformatics, September 1, 2009; 25(17): 2279 - 2280. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pop Genome assembly reborn: recent computational challenges Brief Bioinform, July 1, 2009; 10(4): 354 - 366. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. T. Simpson, K. Wong, S. D. Jackman, J. E. Schein, S. J.M. Jones, and I. Birol ABySS: A parallel assembler for short read sequence data Genome Res., June 1, 2009; 19(6): 1117 - 1123. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Warren, B. H. Nelson, and R. A. Holt Profiling model T-cell metagenomes with short reads Bioinformatics, February 15, 2009; 25(4): 458 - 464. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Reinhardt, D. A. Baltrus, M. T. Nishimura, W. R. Jeck, C. D. Jones, and J. L. Dangl De novo assembly using low-coverage short read sequence data from the rice pathogen Pseudomonas syringae pv. oryzae Genome Res., February 1, 2009; 19(2): 294 - 305. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Chaisson, D. Brinza, and P. A. Pevzner De novo fragment assembly with short mate-paired reads: Does the read length matter? Genome Res., February 1, 2009; 19(2): 336 - 346. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Suchland, B. M. Jeffrey, M. Xia, A. Bhatia, H. G. Chu, D. D. Rockey, and W. E. Stamm Identification of Concomitant Infection with Chlamydia trachomatis IncA-Negative Mutant and Wild-Type Strains by Genomic, Transcriptional, and Biological Characterizations Infect. Immun., December 1, 2008; 76(12): 5438 - 5446. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Holt and S. J.M. Jones The new paradigm of flow cell sequencing Genome Res., June 1, 2008; 18(6): 839 - 846. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Butler, I. MacCallum, M. Kleber, I. A. Shlyakhter, M. K. Belmonte, E. S. Lander, C. Nusbaum, and D. B. Jaffe ALLPATHS: De novo assembly of whole-genome shotgun microreads Genome Res., May 1, 2008; 18(5): 810 - 820. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Zerbino and E. Birney Velvet: Algorithms for de novo short read assembly using de Bruijn graphs Genome Res., May 1, 2008; 18(5): 821 - 829. [Abstract] [Full Text] [PDF] |
||||




