Skip Navigation


Bioinformatics Advance Access originally published online on February 15, 2005
Bioinformatics 2005 21(11):2596-2603; doi:10.1093/bioinformatics/bti325
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/11/2596    most recent
bti325v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (48)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Dufayard, J.-F.
Right arrow Articles by Perrière, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dufayard, J.-F.
Right arrow Articles by Perrière, G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oupjournals.org

Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases

Jean-François Dufayard 1, Laurent Duret 2, Simon Penel 2, Manolo Gouy 2, François Rechenmann 1 and Guy Perrière 2,*

1INRIA Rhône-Alpes 38334 Montbonnot, Saint Ismier Cedex, France
2Laboratoire de Biométrie et Biologie Évolutive, UMR CNRS 5558, Université Claude Bernard—Lyon 1 43 bd. du 11 Novembre 1918, 69622 Villeurbanne Cedex, France

*To whom correspondence should be addressed.

Motivation: Comparative sequence analysis is widely used to study genome function and evolution. This approach first requires the identification of homologous genes and then the interpretation of their homology relationships (orthology or paralogy). To provide help in this complex task, we developed three databases of homologous genes containing sequences, multiple alignments and phylogenetic trees: HOBACGEN, HOVERGEN and HOGENOM. In this paper, we present two new tools for automating the search for orthologs or paralogs in these databases.

Results: First, we have developed and implemented an algorithm to infer speciation and duplication events by comparison of gene and species trees (tree reconciliation). Second, we have developed a general method to search in our databases the gene families for which the tree topology matches a peculiar tree pattern. This algorithm of unordered tree pattern matching has been implemented in the FamFetch graphical interface. With the help of a graphical editor, the user can specify the topology of the tree pattern, and set constraints on its nodes and leaves. Then, this pattern is compared with all the phylogenetic trees of the database, to retrieve the families in which one or several occurrences of this pattern are found. By specifying ad hoc patterns, it is therefore possible to identify orthologs in our databases.

Availability: The tree reconciliation program and the FamFetch interface are available from the Pôle Bioinformatique Lyonnais Web server at the following addresses: http://pbil.univ-lyon1.fr/software/RAP/RAP.htm and http://pbil.univ-lyon1.fr/software/famfetch.html

Contact: perriere{at}biomserv.univ-lyon1.fr


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief Funct Genomic ProteomicHome page
M. Naville and D. Gautheret
Transcription attenuation in bacteria: theme and variations
Brief Funct Genomic Proteomic, November 1, 2009; 8(6): 482 - 492.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. S. Datta, C. Meacham, B. Samad, C. Neyer, and K. Sjolander
Berkeley PHOG: PhyloFacts orthology group prediction web server
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W84 - W89.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. J. Vilella, J. Severin, A. Ureta-Vidal, L. Heng, R. Durbin, and E. Birney
EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates
Genome Res., February 1, 2009; 19(2): 327 - 335.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Hulsen, P. M. A. Groenen, J. de Vlieg, and W. Alkema
PhyloPat: an updated version of the phylogenetic pattern database contains gene neighborhood
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D731 - D737.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
N. Galtier and V. Daubin
Dealing with incongruence in phylogenomic analyses
Phil Trans R Soc B, December 27, 2008; 363(1512): 4023 - 4029.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. V. Koonin and Y. I. Wolf
Genomics of bacteria and archaea: the emerging dynamic view of the prokaryotic world
Nucleic Acids Res., December 1, 2008; 36(21): 6688 - 6719.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Si Quang, O. Gascuel, and N. Lartillot
Empirical profile mixture models for phylogenetic reconstruction
Bioinformatics, October 15, 2008; 24(20): 2317 - 2323.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
R. A. Studer, S. Penel, L. Duret, and M. Robinson-Rechavi
Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes
Genome Res., September 1, 2008; 18(9): 1393 - 1402.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
E. M. Pasini, M. Kirkegaard, D. Salerno, P. Mortensen, M. Mann, and A. W. Thomas
Deep Coverage Mouse Red Blood Cell Proteome: A First Comparison with the Human Red Blood Cell
Mol. Cell. Proteomics, July 1, 2008; 7(7): 1317 - 1330.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Matsuya, R. Sakate, Y. Kawahara, K. O. Koyanagi, Y. Sato, Y. Fujii, C. Yamasaki, T. Habara, H. Nakaoka, F. Todokoro, et al.
Evola: Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D787 - D792.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Ruan, H. Li, Z. Chen, A. Coghlan, L. J. M. Coin, Y. Guo, J.-K. Heriche, Y. Hu, K. Kristiansen, R. Li, et al.
TreeFam: 2008 Update
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D735 - D740.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. D. Rasmussen and M. Kellis
Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes
Genome Res., December 1, 2007; 17(12): 1932 - 1942.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
I. Wapinski, A. Pfeffer, N. Friedman, and A. Regev
Automatic genome-wide reconstruction of phylogenetic gene trees
Bioinformatics, July 1, 2007; 23(13): i549 - i558.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Wu, F. Mao, V. Olman, and Y. Xu
Hierarchical classification of functionally equivalent genes in prokaryotes
Nucleic Acids Res., April 1, 2007; 35(7): 2125 - 2140.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
B. P. Cusack and K. H. Wolfe
Not Born Equal: Increased Rate Asymmetry in Relocated and Retrotransposed Rodent Gene Duplicates
Mol. Biol. Evol., March 1, 2007; 24(3): 679 - 686.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Semon and K. H. Wolfe
Rearrangement Rate following the Whole-Genome Duplication in Teleosts
Mol. Biol. Evol., March 1, 2007; 24(3): 860 - 867.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
R. K. Deka, C. A. Brautigam, F. L. Tomson, S. B. Lumpkins, D. R. Tomchick, M. Machius, and M. V. Norgard
Crystal Structure of the Tp34 (TP0971) Lipoprotein of Treponema pallidum: IMPLICATIONS OF ITS METAL-BOUND STATE AND AFFINITY FOR HUMAN LACTOFERRIN
J. Biol. Chem., February 23, 2007; 282(8): 5944 - 5958.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. J. P. Hubbard, B. L. Aken, K. Beal, B. Ballester, M. Caccamo, Y. Chen, L. Clarke, G. Coates, F. Cunningham, T. Cutts, et al.
Ensembl 2007
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D610 - D617.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Semon and L. Duret
Evolutionary Origin and Maintenance of Coexpressed Gene Clusters in Mammals
Mol. Biol. Evol., September 1, 2006; 23(9): 1715 - 1723.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Jothi, E. Zotenko, A. Tasneem, and T. M. Przytycka
COCO-CL: hierarchical clustering of homology relations based on evolutionary correlations
Bioinformatics, April 1, 2006; 22(7): 779 - 788.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
P. Ternes, P. Sperling, S. Albrecht, S. Franke, J. M. Cregg, D. Warnecke, and E. Heinz
Identification of Fungal Sphingolipid C9-methyltransferases by Phylogenetic Profiling
J. Biol. Chem., March 3, 2006; 281(9): 5582 - 5592.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Li, A. Coghlan, J. Ruan, L. J. Coin, J.-K. Heriche, L. Osmotherly, R. Li, T. Liu, Z. Zhang, L. Bolund, et al.
TreeFam: a curated database of phylogenetic trees of animal gene families
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D572 - D580.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.