Bioinformatics Advance Access published online on September 30, 2004
Bioinformatics, doi:10.1093/bioinformatics/bti045
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Assays and Bioinformatics, Celera Genomics, 45 West Gude Drive, Rockville, Maryland 20850, USA
* To whom correspondence should be addressed. E-mail: richard.mural{at}celera.com.
Motivation: The identification of orthologous gene pairs is generally based on sequence similarity. Gene pairs that are mutually "best hits" between the genomes being compared are asserted to be orthologs. While this method identifies most orthologous gene pairs with high confidence, it will miss a fraction of ortholog pairs, especially genes in duplicated gene families. In addition, the approach depends heavily on the completeness and the quality of gene annotation. When the gene sequences are not correctly represented the approach is unlikely to find the correct ortholog. To overcome these limitations, we have developed an approach to identify orthologous gene pairs using shared chromosomal synteny and the annotation of protein function. Results: Assembled mouse and human genomes (Mural et al., 2002; Venter et al., 2001) were used to identify the regions of conserved synteny between these genomes. "Syntenic anchors" are conserved non-repetitive locations between mouse and human genomes. Using these anchors, we identified blocks of sequences that contain consistently ordered anchors between the two genomes (syntenic blocks). The synteny information has been used to help us identify orthologous gene pairs between mouse and human. The approach combines selecting mutually best tBlastX hits among human and mouse transcripts, and inferring gene orthologous relationships based on sharing syntenic anchors, co-locating in the same syntenic blocks, and sharing the same annotated protein function. Using this approach, we are able to find 19357 orthologous gene pairs between human and mouse, a 20% increase over the number of orthologs identified by conventional approaches. Supplementary information: A table containing human mouse orthologs on mouse chromosome 16 can be found in THIS URL (editor, please provide a URL).
Revised September 20, 2004
Accepted September 21, 2004
Article
Using shared genomic synteny and shared protein functions to enhance the identification of orthologous gene pairs
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Hachiya, Y. Osana, K. Popendorf, and Y. Sakakibara Accurate identification of orthologous segments among multiple genomes Bioinformatics, April 1, 2009; 25(7): 853 - 860. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Eckhart, L. D. Valle, K. Jaeger, C. Ballaun, S. Szabo, A. Nardi, M. Buchberger, M. Hermann, L. Alibardi, and E. Tschachler From the Cover: Identification of reptilian genes encoding hair keratin-like proteins suggests a new scenario for the evolutionary origin of hair PNAS, November 25, 2008; 105(47): 18419 - 18423. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Eckhart, C. Ballaun, M. Hermann, J. L. VandeBerg, W. Sipos, A. Uthman, H. Fischer, and E. Tschachler Identification of Novel Mammalian Caspases Reveals an Important Role of Gene Loss in Shaping the Human Caspase Repertoire Mol. Biol. Evol., May 1, 2008; 25(5): 831 - 841. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Nolan, L. Wu, H. J. Bang, S. A. Jelinsky, K. P. Roberts, T. T. Turner, G. S. Kopf, and D. S. Johnston Identification of Rat Cysteine-Rich Secretory Protein 4 (Crisp4) as the Ortholog to Human CRISP1 and Mouse Crisp4 Biol Reprod, May 1, 2006; 74(5): 984 - 991. [Abstract] [Full Text] [PDF] |
||||



