Bioinformatics Advance Access originally published online on April 1, 2004
Bioinformatics 20(13) © Oxford University Press 2004; all rights reserved.
Fragment assembly with short reads
1 Bioinformatics Program and 2 Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
Received on October 8, 2003; revised on February 3, 2004; accepted on February 9, 2004
Advance Access Publication April 1, 2004
Motivation: Current DNA sequencing technology produces reads of about 500–750 bp, with typical coverage under 10x. New sequencing technologies are emerging that produce shorter reads (length 80–200 bp) but allow one to generate significantly higher coverage (30x and higher) at low cost. Modern assembly programs and error correction routines have been tuned to work well with current read technology but were not designed for assembly of short reads.
Results: We analyze the limitations of assembling reads generated by these new technologies and present a routine for base-calling in reads prior to their assembly. We demonstrate that while it is feasible to assemble such short reads, the resulting contigs will require significant (if not prohibitive) finishing efforts.
Availability: Available from the web at http://www.cse.ucsd.edu/groups/bioinformatics/software.html
Contact: mchaisso{at}bioinf.ucsd.edu; ppevzner{at}cs.ucsd.edu; htangg{at}cs.ucsd.edu
* To whom correspondence should be addressed.
This article has been cited by other articles:
I. Hajirasouliha, F. Hormozdiari, S. C. Sahinalp, and I. Birol. Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies. Bioinformatics, July 1, 2008; 24(13): i32-i40.
D. R. Zerbino and E. Birney. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res., May 1, 2008; 18(5): 821-829.
M. J. Chaisson and P. A. Pevzner. Short read fragment assembly of bacterial genomes. Genome Res., February 1, 2008; 18(2): 324-330.
C. P. Fredlake, D. G. Hert, C.-W. Kan, T. N. Chiesl, B. E. Root, R. E. Forster, and A. E. Barron. Ultrafast DNA sequencing on a microchip by a hybrid separation mechanism that gives 600 bases in 6.5 minutes. PNAS, January 15, 2008; 105(2): 476-481.
R. L. Warren, G. G. Sutton, S. J. M. Jones, and R. A. Holt. Assembling millions of short DNA sequences using SSAKE. Bioinformatics, February 15, 2007; 23(4): 500-501.
M. L. Metzker. Emerging technologies in DNA sequencing. Genome Res., December 1, 2005; 15(12): 1767-1776.
N. Whiteford, N. Haslam, G. Weber, A. Prugel-Bennett, J. W. Essex, P. L. Roach, M. Bradley, and C. Neylon. An analysis of the feasibility of short read sequencing. Nucleic Acids Res., November 7, 2005; 33(19): e171.



