Skip Navigation

Bioinformatics 2005 21(24):4320-4321; doi:10.1093/bioinformatics/bti769
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (12)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Salzberg, S. L.
Right arrow Articles by Yorke, J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Salzberg, S. L.
Right arrow Articles by Yorke, J. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Beware of mis-assembled genomes

Steven L. Salzberg 1,* and James A. Yorke 2

1Center for Bioinformatics and Computational Biology, University of Maryland College Park, MD 20742, USA
2Institute for Physical Sciences and Technology, University of Maryland College Park, MD 20742, USA

*To whom correspondence should be addressed. E-mail: salzberg@umd.edu

The first 10% of the full text of this article appears below.

With hundreds of genomes now in GenBank, researchers might be forgiven for assuming that genome sequence data are correct, at least at a large scale. Certainly there might be errors at some small rate, perhaps 1 in 50 000 or 100 000 bases (Schmutz et al., 2004; Read et al., 2002), but at a large scale these genomes are put together correctly, are not they? Well, not always.

We have been looking at the assemblies of large genomes for several years now, and for every ‘draft’ genome we look at, we find hundreds—and sometimes thousands—of mis-assemblies. These include regions where a genome is incorrectly re-arranged as well as places where large chunks of DNA sequence are simply . . . [Full Text of this Article]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
J.-H. Choi, S. Kim, H. Tang, J. Andrews, D. G. Gilbert, and J. K. Colbourne
A machine-learning approach to combined evidence validation of genome assemblies
Bioinformatics, March 15, 2008; 24(6): 744 - 750.
[Abstract] [Full Text] [PDF]