Skip Navigation

Bioinformatics 2007 23(13):i418-i423; doi:10.1093/bioinformatics/btm177
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Saeys, Y.
Right arrow Articles by Van de Peer, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Saeys, Y.
Right arrow Articles by Van de Peer, Y.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Translation initiation site prediction on a genomic scale: beauty in simplicity

Yvan Saeys 1,2,*, Thomas Abeel 1,2, Sven Degroeve 3 and Yves Van de Peer 1,2

1Department of Plant Systems Biology, VIB, Technologiepark 927, B-9052 Ghent, Belgium, 2Department of Molecular Genetics, Ghent University, Ghent, Belgium and 3Pronota, Technologiepark - Zwijnaarde 927, B-9052 Ghent, Belgium

*To whom correspondence should be addressed.


   Abstract

Motivation: The correct identification of translation initiation sites (TIS) remains a challenging problem for computational methods that automatically try to solve this problem. Furthermore, the lion's share of these computational techniques focuses on the identification of TIS in transcript data. However, in the gene prediction context the identification of TIS occurs on the genomic level, which makes things even harder because at the genome level many more pseudo-TIS occur, resulting in models that achieve a higher number of false positive predictions.

Results: In this article, we evaluate the performance of several ‘simple’ TIS recognition methods at the genomic level, and compare them to state-of-the-art models for TIS prediction in transcript data. We conclude that the simple methods largely outperform the complex ones at the genomic scale, and we propose a new model for TIS recognition at the genome level that combines the strengths of these simple models. The new model obtains a false positive rate of 0.125 at a sensitivity of 0.80 on a well annotated human chromosome (chromosome 21). Detailed analyses show that the model is useful, both on its own and in a simple gene prediction setting.

Availability: Datafiles and a web interface for the StartScan program are available at http://bioinformatics.psb.ugent.be/supplementary_data/

Contact: yvan.saeys{at}psb.ugent.be



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
S. Sonnenburg, A. Zien, P. Philips, and G. Ratsch
POIMs: positional oligomer importance matrices--understanding support vector machine-based signal detectors
Bioinformatics, July 1, 2008; 24(13): i6 - i14.
[Abstract] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.