Automatic genome-wide reconstruction of phylogenetic gene trees
1Broad Institute of MIT and Harvard,2School of Engineering and Applied Sciences, Harvard University,3FAS Center for Systems Biology,4Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA and5School of Computer Science & Engineering, Hebrew University, Jerusalem, Israel
*To whom correspondence should be addressed.
| Abstract |
|---|
Gene duplication and divergence is a major evolutionary force. Despite the growing number of fully sequenced genomes, methods for investigating these events on a genome-wide scale are still in their infancy. Here, we present SYNERGY, a novel and scalable algorithm that uses sequence similarity and a given species phylogeny to reconstruct the underlying evolutionary history of all genes in a large group of species. In doing so, SYNERGY resolves homology relations and accurately distinguishes orthologs from paralogs. We applied our approach to a set of nine fully sequenced fungal genomes spanning 150 million years, generating a genome-wide catalog of orthologous groups and corresponding gene trees. Our results are highly accurate when compared to a manually curated gold standard, and are robust to the quality of input according to a novel jackknife confidence scoring. The reconstructed gene trees provide a comprehensive view of gene evolution on a genomic scale. Our approach can be applied to any set of sequenced eukaryotic species with a known phylogeny, and opens the way to systematic studies of the evolution of individual genes, molecular systems and whole genomes.
Contact: aregev{at}broad.mit.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
This article has been cited by other articles:
![]() |
J. Muller, D. Szklarczyk, P. Julien, I. Letunic, A. Roth, M. Kuhn, S. Powell, C. von Mering, T. Doerks, L. J. Jensen, et al. eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations Nucleic Acids Res., November 9, 2009; (2009) gkp951v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. M. Nava, D. Y. Lee, J. H. Ospina, S.-Y. Cai, and H. R. Gaskins Genomic analyses reveal a conserved glutathione homeostasis pathway in the invertebrate chordate Ciona intestinalis Physiol Genomics, November 1, 2009; 39(3): 183 - 194. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. B. Arnaud, M. C. Chibucos, M. C. Costanzo, J. Crabtree, D. O. Inglis, A. Lotia, J. Orvis, P. Shah, M. S. Skrzypek, G. Binkley, et al. The Aspergillus Genome Database, a curated comparative genomics resource for gene, protein and sequence information for the Aspergillus research community Nucleic Acids Res., September 22, 2009; (2009) gkp751v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Akerborg, B. Sennblad, L. Arvestad, and J. Lagergren Simultaneous Bayesian gene tree reconstruction and reconciliation analysis PNAS, April 7, 2009; 106(14): 5714 - 5719. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Bedford, I. Wapinski, and D. L. Hartl Overdispersion of the Molecular Clock Varies Between Yeast, Drosophila and Mammals Genetics, June 1, 2008; 179(2): 977 - 984. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Sakarya, K. S. Kosik, and T. H. Oakley Reconstructing ancestral genome content based on symmetrical best alignments and Dollo parsimony Bioinformatics, March 1, 2008; 24(5): 606 - 612. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. J. Jensen, P. Julien, M. Kuhn, C. von Mering, J. Muller, T. Doerks, and P. Bork eggNOG: automated construction and annotation of orthologous groups of genes Nucleic Acids Res., January 11, 2008; 36(suppl_1): D250 - D254. [Abstract] [Full Text] [PDF] |
||||




