Skip Navigation

Bioinformatics 2007 23(13):i549-i558; doi:10.1093/bioinformatics/btm193
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Wapinski, I.
Right arrow Articles by Regev, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wapinski, I.
Right arrow Articles by Regev, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Automatic genome-wide reconstruction of phylogenetic gene trees

Ilan Wapinski 1,2,3, Avi Pfeffer 2, Nir Friedman 5 and Aviv Regev 3,4,*

1Broad Institute of MIT and Harvard,2School of Engineering and Applied Sciences, Harvard University,3FAS Center for Systems Biology,4Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA and5School of Computer Science & Engineering, Hebrew University, Jerusalem, Israel

*To whom correspondence should be addressed.


   Abstract

Gene duplication and divergence is a major evolutionary force. Despite the growing number of fully sequenced genomes, methods for investigating these events on a genome-wide scale are still in their infancy. Here, we present SYNERGY, a novel and scalable algorithm that uses sequence similarity and a given species phylogeny to reconstruct the underlying evolutionary history of all genes in a large group of species. In doing so, SYNERGY resolves homology relations and accurately distinguishes orthologs from paralogs. We applied our approach to a set of nine fully sequenced fungal genomes spanning 150 million years, generating a genome-wide catalog of orthologous groups and corresponding gene trees. Our results are highly accurate when compared to a manually curated gold standard, and are robust to the quality of input according to a novel jackknife confidence scoring. The reconstructed gene trees provide a comprehensive view of gene evolution on a genomic scale. Our approach can be applied to any set of sequenced eukaryotic species with a known phylogeny, and opens the way to systematic studies of the evolution of individual genes, molecular systems and whole genomes.

Contact: aregev{at}broad.mit.edu

Supplementary information: Supplementary data are available at Bioinformatics online.



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
J. Muller, D. Szklarczyk, P. Julien, I. Letunic, A. Roth, M. Kuhn, S. Powell, C. von Mering, T. Doerks, L. J. Jensen, et al.
eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations
Nucleic Acids Res., November 9, 2009; (2009) gkp951v1.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
G. M. Nava, D. Y. Lee, J. H. Ospina, S.-Y. Cai, and H. R. Gaskins
Genomic analyses reveal a conserved glutathione homeostasis pathway in the invertebrate chordate Ciona intestinalis
Physiol Genomics, November 1, 2009; 39(3): 183 - 194.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. B. Arnaud, M. C. Chibucos, M. C. Costanzo, J. Crabtree, D. O. Inglis, A. Lotia, J. Orvis, P. Shah, M. S. Skrzypek, G. Binkley, et al.
The Aspergillus Genome Database, a curated comparative genomics resource for gene, protein and sequence information for the Aspergillus research community
Nucleic Acids Res., September 22, 2009; (2009) gkp751v1.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
O. Akerborg, B. Sennblad, L. Arvestad, and J. Lagergren
Simultaneous Bayesian gene tree reconstruction and reconciliation analysis
PNAS, April 7, 2009; 106(14): 5714 - 5719.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. Bedford, I. Wapinski, and D. L. Hartl
Overdispersion of the Molecular Clock Varies Between Yeast, Drosophila and Mammals
Genetics, June 1, 2008; 179(2): 977 - 984.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
O. Sakarya, K. S. Kosik, and T. H. Oakley
Reconstructing ancestral genome content based on symmetrical best alignments and Dollo parsimony
Bioinformatics, March 1, 2008; 24(5): 606 - 612.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. J. Jensen, P. Julien, M. Kuhn, C. von Mering, J. Muller, T. Doerks, and P. Bork
eggNOG: automated construction and annotation of orthologous groups of genes
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D250 - D254.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.