Skip Navigation



Bioinformatics Advance Access published online on October 18, 2005

Bioinformatics, doi:10.1093/bioinformatics/bti709
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
21/24/4414    most recent
bti709v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Hayden, C. A.
Right arrow Articles by Jorgensen, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hayden, C. A.
Right arrow Articles by Jorgensen, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2005). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
Received May 27, 2005
Revised September 1, 2005
Accepted October 6, 2005

Applications note

Evaluating and improving cDNA sequence quality with cQC

Celine A. Hayden 1, Travis J. Wheeler 2, and Richard A. Jorgensen 1*

1 Department of Plant Sciences, University of Arizona, Tucson, AZ 85721-0036, USA
2 Department of Computer Science, University of Arizona, Tucson, AZ 85721-0077, USA

* To whom correspondence should be addressed.
Richard A. Jorgensen, E-mail: raj{at}ag.arizona.edu


   Abstract

Summary: Errors are prevalent in cDNA sequences but the extent to which sequence collections differ in frequencies and types of errors has not been investigated systematically. cDNA Quality Control, or cQC, was developed to evaluate the quality of cDNA sequence collections and revise those sequences that differ from a higher quality genomic sequence. After removing rRNA, vector, bacterial IS, and chimeric cDNA contaminants, small-scale nucleotide discrepancies were found in 51% of cDNA sequences from one Arabidopsis cDNA collection, 89% from a second Arabidopsis collection, and 75% from a rice collection. These errors created premature termination codons in 4% and 42% of cDNA sequences in the respective Arabidopsis collections and in 7% of the rice cDNA sequences.

Availability: A web-based version of cQC, source code and revised cDNA collections are available at http://genomics.arizona.edu/software/cQC/.

Supplementary data: Further text, tables and figures are available at the above website or on Bioinformatics online.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.