Bioinformatics Advance Access originally published online on April 4, 2006
Bioinformatics 2006 22(10):1239-1244; doi:10.1093/bioinformatics/btl076
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans
Department of Genetics, Washington University School of Medicine 660 S. Euclid, Campus Box 8232, St. Louis, MO 63110, USA
*To whom correspondence should be addressed.
Motivation: Computational gene prediction methods are an important component of whole genome analyses. While ab initio gene finders have demonstrated major improvements in accuracy, the most reliable methods are evidence-based gene predictors. These algorithms can rely on several different sources of evidence including predictions from multiple ab initio gene finders, matches to known proteins, sequence conservation and partial cDNAs to predict the final product. Despite the success of these algorithms, prediction of complete gene structures, especially for alternatively spliced products, remains a difficult task.
Results: LOCUS (Length Optimized Characterization of Unknown Spliceforms) is a new evidence-based gene finding algorithm which integrates a length-constraint into a dynamic programming-based framework for prediction of gene products. On a Caenorhabditis elegans test set of alternatively spliced internal exons, its performance exceeds that of current ab initio gene finders and in most cases can accurately predict the correct form of all the alternative products. As the length information used by the algorithm can be obtained in a high-throughput fashion, we propose that integration of such information into a gene-prediction pipeline is feasible and doing so may improve our ability to fully characterize the complete set of mRNAs for a genome.
Availability: LOCUS is available from http://ural.wustl.edu/software.html
Contact: stormo{at}genetics.wustl.edu
Received on November 3, 2005; revised on February 18, 2006; accepted on February 27, 2006