Skip Navigation


Bioinformatics Advance Access originally published online on June 19, 2009
Bioinformatics 2009 25(17):2157-2163; doi:10.1093/bioinformatics/btp379
This Article
Right arrow Full Text
Right arrow Full Text (Print PDF)
Right arrow All Versions of this Article:
25/17/2157    most recent
btp379v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Schröder, J.
Right arrow Articles by Schmidt, B.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Schröder, J.
Right arrow Articles by Schmidt, B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

SHREC: a short-read error correction method

Jan Schröder 1,3,*, Heiko Schröder 2, Simon J. Puglisi 2, Ranjan Sinha 3 and Bertil Schmidt 4

1 Institut für Informatik, Christian-Albrecht-Universität Kiel, Herman-Rodewald-Strasse 3, 24118 Kiel, Germany, 2 School of Computer Science and Information Technology, RMIT University, Victoria 3000, 3 Department of Computer Science and Software Engineering, University of Melbourne, Victoria 3010, Australia and 4 School of Computer Engineering, Nanyang Technological University, Singapore 639798

* To whom correspondence should be addressed.


   Abstract

Motivation: Second-generation sequencing technologies produce a massive amount of short reads in a single experiment. However, sequencing errors can cause major problems when using this approach for de novo sequencing applications. Moreover, existing error correction methods have been designed and optimized for shotgun sequencing. Therefore, there is an urgent need for the design of fast and accurate computational methods and tools for error correction of large amounts of short read data.

Results: We present SHREC, a new algorithm for correcting errors in short-read data that uses a generalized suffix trie on the read data as the underlying data structure. Our results show that the method can identify erroneous reads with sensitivity and specificity of over 99% and 96% for simulated data with error rates of up to 3% as well as for real data. Furthermore, it achieves an error correction accuracy of over 80% for simulated data and over 88% for real data. These results are clearly superior to previously published approaches. SHREC is available as an efficient open-source Java implementation that allows processing of 10 million of short reads on a standard workstation.

Availability: SHREC source code in JAVA is freely available at http://www.informatik.uni-kiel.de/~jasc/Shrec/

Contact: jasc{at}informatik.uni-kiel.de

Associate Editor: Joaquin Dopazo


Received on January 15, 2009; revised on May 18, 2009; accepted on June 15, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.