Bioinformatics Advance Access originally published online on January 29, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(6) © Oxford University Press 2004; all rights reserved.
Mapping gene ontology to proteins based on proteinprotein interaction data
Department of Biological Sciences, Molecular and Computational Biology Program, University of Southern California, 1042 West 36th Place, Los Angeles, CA 90089-1113, USA
Received on July 29, 2003
; revised on October 12, 2003
; accepted on October 13, 2003
Motivation: Gene Ontology (GO) consortium provides structural description of protein function that is used as a common language for gene annotation in many organisms. Large-scale techniques have generated many valuable proteinprotein interaction datasets that are useful for the study of protein function. Combining both GO and proteinprotein interaction data allows the prediction of function for unknown proteins.
Result: We apply a Markov random field method to the prediction of yeast protein function based on multiple proteinprotein interaction datasets. We assign function to unknown proteins with a probability representing the confidence of this prediction. The functions are based on three general categories of cellular component, molecular function and biological process defined in GO. The yeast proteins are defined in the Saccharomyces Genome Database (SGD). The proteinprotein interaction datasets are obtained from the Munich Information Center for Protein Sequences (MIPS), including physical interactions and genetic interactions. The efficiency of our prediction is measured by applying the leave-one-out validation procedure to a functional path matching scheme, which compares the prediction with the GO description of a protein's function from the abstract level to the detailed level along the GO structure. For biological process, the leave-one-out validation procedure shows 52% precision and recall of our method, much better than that of the simple guilty-by-association methods.
Supplementary material: http://www.cmb.usc.edu/~msms/gomapping
Contact: fsun{at}hto.usc.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J.-H. Kim, A. Mitchell, T. K. Attwood, and M. Hilario Learning to extract relations for protein annotation Bioinformatics, July 1, 2007; 23(13): i256 - i263. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Carroll and V. Pavlovic Protein classification using probabilistic chain graphs and the Gene Ontology structure Bioinformatics, August 1, 2006; 22(15): 1871 - 1878. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Massjouni, C. G. Rivera, and T. M. Murali VIRGO: computational prediction of gene functions. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W340 - W344. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. N. Chua, W.-K. Sung, and L. Wong Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions Bioinformatics, July 1, 2006; 22(13): 1623 - 1630. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Espadaler, O. Romero-Isart, R. M. Jackson, and B. Oliva Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships Bioinformatics, August 15, 2005; 21(16): 3360 - 3368. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. McDermott, R. Bumgarner, and R. Samudrala Functional annotation from predicted protein interaction networks Bioinformatics, August 1, 2005; 21(15): 3217 - 3226. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. McDermott, M. Guerquin, Z. Frazier, A. N. Chang, and R. Samudrala BIOVERSE: enhancements to the framework for structural, functional and contextual modeling of proteins and proteomes Nucleic Acids Res., July 1, 2005; 33(suppl_2): W324 - W325. [Abstract] [Full Text] [PDF] |
||||

