Skip Navigation


Bioinformatics Advance Access originally published online on March 5, 2009
Bioinformatics 2009 25(9):1118-1124; doi:10.1093/bioinformatics/btp131
This Article
Right arrow Full Text
Right arrow Full Text (Print PDF)
Right arrow All Versions of this Article:
25/9/1118    most recent
btp131v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Rausch, T.
Right arrow Articles by Reinert, K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rausch, T.
Right arrow Articles by Reinert, K.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

A consistency-based consensus algorithm for de novo and reference-guided sequence assembly of short reads

Tobias Rausch 1,2,*, Sergey Koren 3, Gennady Denisov 3, David Weese 2, Anne-Katrin Emde 1,2, Andreas Döring 2 and Knut Reinert 2

1International Max Planck Research School for Computational Biology and Scientific Computing, Ihnestr. 63 - 73, 2Algorithmische Bioinformatik, Institut für Informatik, Takustr. 9, 14195 Berlin, Germany and 3J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA

*To whom correspondence should be addressed.


   Abstract

Motivation: Novel high-throughput sequencing technologies pose new algorithmic challenges in handling massive amounts of short-read, high-coverage data. A robust and versatile consensus tool is of particular interest for such data since a sound multi-read alignment is a prerequisite for variation analyses, accurate genome assemblies and insert sequencing.

Results: A multi-read alignment algorithm for de novo or reference-guided genome assembly is presented. The program identifies segments shared by multiple reads and then aligns these segments using a consistency-enhanced alignment graph. On real de novo sequencing data obtained from the newly established NCBI Short Read Archive, the program performs similarly in quality to other comparable programs. On more challenging simulated datasets for insert sequencing and variation analyses, our program outperforms the other tools.

Availability: The consensus program can be downloaded from http://www.seqan.de/projects/consensus.html. It can be used stand-alone or in conjunction with the Celera Assembler. Both application scenarios as well as the usage of the tool are described in the documentation.

Contact: rausch{at}inf.fu-berlin.de

Associate Editor: Limsoon Wong


Received on November 3, 2008; revised on January 23, 2009; accepted on March 2, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.