Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (11)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Scheetz, T. E.
Right arrow Articles by Casavant, T. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Scheetz, T. E.
Right arrow Articles by Casavant, T. L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 19 no. 11 2003
Pages 1318-1324
© 2003 Oxford University Press

ESTprep: preprocessing cDNA sequence reads

Todd E. Scheetz 1,2,*, Nishank Trivedi 3, Chad A. Roberts 3, Tamara Kucaba 4, Brian Berger 4, Natalie L. Robinson 3, Clayton L. Birkett 3, Allen J. Gavin 3, Brian O’Leary 3, Terry A. Braun 1,3,5, Maria F. Bonaldo 4, John P. Robinson 3, Val C. Sheffield 4,8, Marcelo B. Soares 4,6,7 and Thomas L. Casavant 1,3,5

1 Center for Bioinformatics and Computational Biology
2 Department of Ophthalmology
3 Department of Electrical and Computer Engineering
4 Department of Pediatrics
5 Department of Biomedical Engineering
6 Department of Biochemistry
7 Department of Physiology
8 Howard Hughes Medical Institute, The University of Iowa, Iowa City, IA 52242, USA

Received on October 27, 2002 ; revised on November 3, 2002 ; accepted on February 6, 2003

Motivation: High accuracy of data always governs the large-scale gene discovery projects. The data should not only be trustworthy but should be correctly annotated for various features it contains. Sequence errors are inherent in single-pass sequences such as ESTs obtained from automated sequencing. These errors further complicate the automated identification of EST-related sequencing. A tool is required to prepare the data prior to advanced annotation processing and submission to public databases.

Results: This paper describes ESTprep, a program designed to preprocess expressed sequence tag (EST) sequences. It identifies the location of features present in ESTs and allows the sequence to pass only if it meets various quality criteria. Use of ESTprep has resulted in substantial improvement in accurate EST feature identification and fidelity of results submitted to GenBank.

Availability: The program is freely available for download from http://genome.uiowa.edu/pubsoft/software.html

Contact: tscheetz{at}eng.uiowa.edu

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
C. Liang, G. Wang, L. Liu, G. Ji, Y. Liu, J. Chen, J. S. Webb, G. Reese, and J. F. D. Dean
WebTraceMiner: a web service for processing and mining EST sequence trace files
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W137 - W142.
[Abstract] [Full Text] [PDF]


Home page
ReproductionHome page
C. Agca, J. E Ries, S. J Kolath, J.-H. Kim, L. J Forrester, E. Antoniou, K. M Whitworth, N. Mathialagan, G. K Springer, R. S Prather, et al.
Luteinization of porcine preovulatory follicles leads to systematic changes in follicular gene expression
Reproduction, July 1, 2006; 132(1): 133 - 145.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
K. A.T. Silverstein, M. A. Graham, T. D. Paape, and K. A. VandenBosch
Genome Organization of More Than 300 Defensin-Like Genes in Arabidopsis
Plant Physiology, June 1, 2005; 138(2): 600 - 610.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
J. J.S. Laffin, T. E. Scheetz, M. de Fatima Bonaldo, R. S. Reiter, S. Chang, M. Eyestone, H. Abdulkawy, B. Brown, C. Roberts, D. Tack, et al.
A comprehensive nonredundant expressed sequence tag collection for the developing Rattus norvegicus heart
Physiol Genomics, April 13, 2004; 17(2): 245 - 252.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. E. Scheetz, J. J. Laffin, B. Berger, S. Holte, S. A. Baumes, R. Brown II, S. Chang, J. Coco, J. Conklin, K. Crouch, et al.
High-Throughput Gene Discovery in the Rat
Genome Res., April 1, 2004; 14(4): 733 - 741.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
T. E. Scheetz, J. Zabner, M. J. Welsh, J. Coco, M. Eyestone, M. de Fatima Bonaldo, T. Kucaba, T. L. Casavant, M. B. Soares, and P. B. McCray Jr.
Large-scale gene discovery in human airway epithelia reveals novel transcripts
Physiol Genomics, March 12, 2004; 17(1): 69 - 77.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.