Bioinformatics Advance Access originally published online on February 15, 2005
Bioinformatics 2005 21(11):2596-2603; doi:10.1093/bioinformatics/bti325
Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases
1INRIA Rhône-Alpes 38334 Montbonnot, Saint Ismier Cedex, France
2Laboratoire de Biométrie et Biologie Évolutive, UMR CNRS 5558, Université Claude Bernard Lyon 1, 43 bd. du 11 Novembre 1918, 69622 Villeurbanne Cedex, France
*To whom correspondence should be addressed.
Motivation: Comparative sequence analysis is widely used to study genome function and evolution. This approach first requires the identification of homologous genes and then the interpretation of their homology relationships (orthology or paralogy). To provide help in this complex task, we developed three databases of homologous genes containing sequences, multiple alignments and phylogenetic trees: HOBACGEN, HOVERGEN and HOGENOM. In this paper, we present two new tools for automating the search for orthologs or paralogs in these databases.
Results: First, we have developed and implemented an algorithm that infers speciation and duplication events by comparing gene and species trees (tree reconciliation). Second, we have developed a general method to search our databases for gene families whose tree topology matches a particular tree pattern. This unordered tree pattern matching algorithm has been implemented in the FamFetch graphical interface. Using a graphical editor, the user specifies the topology of the tree pattern and sets constraints on its nodes and leaves. The pattern is then compared with all the phylogenetic trees in the database to retrieve the families in which one or several occurrences of the pattern are found. By specifying ad hoc patterns, it is therefore possible to identify orthologs in our databases.
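To make the idea of unordered tree pattern matching concrete, the following minimal Python sketch may help. It is not the FamFetch algorithm: it assumes trees are encoded as nested tuples whose leaves are species names, it uses a hypothetical wildcard leaf `"*"` as the only kind of constraint, and it tries every child ordering, which is only practical for nodes with few children. The names `matches`, `occurrences` and `search`, and the family identifiers, are illustrative assumptions.

```python
from itertools import permutations

WILDCARD = "*"  # hypothetical pattern leaf that matches any subtree


def matches(pattern, tree):
    """Return True if `pattern` matches `tree` at its root, ignoring child order."""
    if pattern == WILDCARD:
        return True
    # If either side is a leaf, both must be the same species name.
    if isinstance(pattern, str) or isinstance(tree, str):
        return pattern == tree
    if len(pattern) != len(tree):
        return False
    # Unordered matching: accept any ordering of the tree's children.
    return any(
        all(matches(p, t) for p, t in zip(pattern, ordering))
        for ordering in permutations(tree)
    )


def occurrences(pattern, tree):
    """Yield every subtree of `tree` at which `pattern` matches."""
    if matches(pattern, tree):
        yield tree
    if not isinstance(tree, str):
        for child in tree:
            yield from occurrences(pattern, child)


def search(pattern, families):
    """Return the names of families whose tree contains at least one occurrence."""
    return [
        name
        for name, tree in families.items()
        if next(occurrences(pattern, tree), None) is not None
    ]


# Toy database: family name -> gene tree (leaves labelled by species).
families = {
    "FAM1": (("human", "mouse"), "chicken"),
    "FAM2": (("human", "human"), ("mouse", "chicken")),
}

# A human/mouse sister pair, written in either child order.
print(search(("mouse", "human"), families))
# Two human genes as sisters, next to any other subtree.
print(search((("human", "human"), WILDCARD), families))
```

In this toy database the first pattern retrieves only `FAM1` and the second only `FAM2`. Richer node constraints, such as requiring a node to be a duplication or a speciation, would need additional annotations on the internal nodes, which this sketch leaves out.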
Availability: The tree reconciliation program and the FamFetch interface are available from the Pôle Bioinformatique Lyonnais Web server at the following addresses: http://pbil.univ-lyon1.fr/software/RAP/RAP.htm and http://pbil.univ-lyon1.fr/software/famfetch.html
Contact: perriere{at}biomserv.univ-lyon1.fr