Bioinformatics Advance Access originally published online on June 24, 2004
Bioinformatics 2004 20(17):3064-3079; doi:10.1093/bioinformatics/bth368
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics vol. 20 issue 17 © Oxford University Press 2004; all rights reserved.
A comparative method for identification of gene structures and alternatively spliced variants
1 Genomics Research Center and 2 Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
Received on April 11, 2004; revised on May 27, 2004; accepted on June 12, 2004
Advance Access Publication June 24, 2004
Motivation: Alternative splicing (AS) serves as a mechanism to create diversity among functional proteins. Increasing evidence indicates that a large portion of genes have AS forms. Hence AS variants should be considered while analyzing gene structures.
Results: A new cross-species gene identification and AS analysis system, PSEP, has been developed. The system is based on expressed sequence tag (EST)-to-genome and genome-to-genome comparisons and is implemented in two steps: sequence alignment and a series of post-alignment processes, including progressive signal extraction and patching. For gene identification, these post-alignment processes serve as noise filters and enable PSEP to eliminate
88% of potential overprediction. The overall accuracy of PSEP is better than or comparable to that of other well-known cross-species gene prediction programs, including the ROSETTA program, TWINSCAN, SGP-1/-2 and SLAM, when tested on three benchmark datasets (the ELN gene region, the HoxA cluster and the ROSETTA set). In addition, 76.2 and 76.0% of multiple-exon genes in the ROSETTA dataset and human chromosome 20, respectively, are found to have AS forms. Approximately 23% of the 210 elementary alternatives identified in the ROSETTA dataset are not conserved between the human and mouse genomes, and none of the 210 transcripts is found in the RefSeq annotation. With its dual functions in cross-species conserved sequence analysis and AS analysis, PSEP is highly suitable for studying the evolution of AS patterns and for finding unidentified gene expression features.
Availability: The programs of PESP as well as the visualization tool required to build the proposed annotation scheme are available at ttp://www.sinica.edu.tw/~trees/PSEP/.
Contact: trees{at}gate.sinica.edu.tw
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y.-T. Huang, F.-C. Chen, C.-J. Chen, H.-L. Chen, and T.-J. Chuang Identification and analysis of ancestral hominoid transcriptome inferred from cross-species transcript and processed pseudogene comparisons Genome Res., July 1, 2008; 18(7): 1163 - 1170. [Abstract] [Full Text] [PDF] |
||||
![]() |
F.-C. Chen, S.-S. Wang, S.-M. Chaw, Y.-T. Huang, and T.-J. Chuang Plant Gene and Alternatively Spliced Variant Annotator. A Plant Genome Annotation Pipeline for Rice Gene and Alternatively Spliced Variant Identification with Cross-Species Expressed Sequence Tag Conservation from Seven Plant Species Plant Physiology, March 1, 2007; 143(3): 1086 - 1095. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Kan, P. W. Garrett-Engele, J. M. Johnson, and J. C. Castle Evolutionarily conserved and diverged alternative splicing events show different expression and functional profiles Nucleic Acids Res., September 29, 2005; 33(17): 5659 - 5666. [Abstract] [Full Text] [PDF] |
||||
![]() |
F.-C. Chen and T.-J. Chuang ESTviewer: a web interface for visualizing mouse, rat, cattle, pig and chicken conserved ESTs in human genes and human alternatively spliced variants Bioinformatics, May 15, 2005; 21(10): 2510 - 2513. [Abstract] [Full Text] [PDF] |
||||



