Bioinformatics Advance Access published online on January 20, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti283
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Computing, City University, London, United Kingdom
* To whom correspondence should be addressed.
Motivation: Gene finding remains an open problem well after the sequencing of the human genome. The low gene sensitivity of current methods is a problem for divergent protein families, because fairly accurate exon assemblies are required before sensitive fold recognition algorithms can be applied. This paper presents a new genomic threading algorithm which integrates the gene finding and fold recognition steps into a single process. The method is applicable to evolutionarily divergent protein families that have retained some trace of their common ancestry; number and phase of introns, sizes of exons, and placement of structural elements on specific exons. Such conserved structural signals may be visible despite dramatic evolution of protein sequence. Results: The method is evaluated on the family of helical cytokines by cross-validation sensitivity analysis. The method has also been applied to all intergenic regions of the human genome, and an expression and cloning approach has been coupled with the predictions of the method. Two genes discovered by the method are discussed. Supplementary information: All data used and the results obtained in the cross-validation analysis are available at http://www.soi.city.ac.uk/~conklin/papers/GT/.
Received September 20, 2004
Revised January 7, 2005
Accepted January 18, 2005
Article
Gene finding for the helical cytokines
2 ZymoGenetics Inc., Seattle, USA
Darrell Conklin, E-mail: conklin{at}city.ac.uk
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?