Vol. 20 no. 2 2004, pages 170-179
Bioinformatics © Oxford University Press 2004; all rights reserved.
PAG Meeting Original Paper |
Phylogenomic inference of protein molecular function: advances and challenges
Berkeley Phylogenomics Group, Department of Bioengineering, University of California, 473 Evans Hall #1762, Berkeley, CA 94720-1762, USA
Received on November 9, 2003
; accepted on November 26, 2003
Motivation: Protein families evolve a multiplicity of functions through gene duplication, speciation and other processes. As a number of studies have shown, standard methods of protein function prediction produce systematic errors on these data. Phylogenomic analysiscombining phylogenetic tree construction, integration of experimental data and differentiation of orthologs and paralogshas been proposed to address these errors and improve the accuracy of functional classification. The explicit integration of structure prediction and analysis in this framework, which we call structural phylogenomics, provides additional insights into protein superfamily evolution.
Results: Results of protein functional classification using phylogenomic analysis show fewer expected false positives overall than when pairwise methods of functional classification are employed. We present an overview of the motivations and fundamental principles of phylogenomic analysis, new methods developed for the key tasks, benchmark datasets for these tasks (when available) and suggest procedures to increase accuracy. We also discuss some of the methods used in the Celera Genomics high-throughput phylogenomic classification of the human genome.
Availability: Software tools from the Berkeley Phylogenomics Group are available at http://phylogenomics.berkeley.edu
Contact: kimmen{at}uclink.berkeley.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Yosef, R. Sharan, and W. S. Noble Improved network-based identification of protein orthologs Bioinformatics, August 15, 2008; 24(16): i200 - i206. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wanchana, S. Thongjuea, V. J. Ulat, M. Anacleto, R. Mauleon, M. Conte, M. Rouard, M. Ruiz, N. Krishnamurthy, K. Sjolander, et al. The Generation Challenge Programme comparative plant stress-responsive gene catalogue Nucleic Acids Res., January 11, 2008; 36(suppl_1): D943 - D946. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Masso and I. I. Vaisman Accurate prediction of enzyme mutant activity based on a multibody statistical potential Bioinformatics, December 1, 2007; 23(23): 3155 - 3161. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Glanville, D. Kirshner, N. Krishnamurthy, and K. Sjolander Berkeley Phylogenomics Group web servers: resources for structural phylogenomic analysis Nucleic Acids Res., July 13, 2007; 35(suppl_2): W27 - W32. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Friedberg Automated protein function prediction--the genomic challenge Brief Bioinform, September 1, 2006; 7(3): 225 - 242. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bandyopadhyay, R. Sharan, and T. Ideker Systematic identification of functional orthologs based on protein network comparison Genome Res., March 1, 2006; 16(3): 428 - 435. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Bordner and R. Abagyan REVCOM: a robust Bayesian method for evolutionary rate estimation Bioinformatics, May 15, 2005; 21(10): 2315 - 2321. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. R. Marshall, J. A. Fox, S. L. Butland, B. F. F. Ouellette, F. S. L. Brinkman, and G. F. Tibbits Phylogeny of Na+/Ca2+ exchanger (NCX) genes from genomic data identifies new gene duplications and a new family member in fish species Physiol Genomics, April 14, 2005; 21(2): 161 - 173. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bandyopadhyay, R. Sharan, and T. Ideker Systematic identification of functional orthologs based on protein network comparison Genome Res., March 1, 2006; 16(3): 428 - 435. [Abstract] [Full Text] [PDF] |
||||




