Bioinformatics Vol. 17 no. 5 2001
Pages 455-460
© 2001 Oxford University Press
Predicting proteinprotein interactions from primary structure
Department of Bioengineering, 9500 Gilman Drive, University of California, San Diego, La Jolla, CA 92093-0412, USA
Received on August 22, 2000
; revised on November 22, 2000
; accepted on January 4, 2001
Motivation: An ambitious goal of proteomics is to elucidate the structure, interactions and functions of all proteins within cells and organisms. The expectation is that this will provide a fuller appreciation of cellular processes and networks at the protein level, ultimately leading to a better understanding of disease mechanisms and suggesting new means for intervention. This paper addresses the question: can proteinprotein interactions be predicted directly from primary structure and associated data? Using a diverse database of known protein interactions, a Support Vector Machine (SVM) learning system was trained to recognize and predict interactions based solely on primary structure and associated physicochemical properties.
Results: Inductive accuracy of the trained system, defined here as the percentage of correct protein interaction predictions for previously unseen test sets, averaged 80% for the ensemble of statistical experiments. Future proteomics studies may benefit from this research by proceeding directly from the automated identification of a cells gene products to prediction of protein interaction pairs.
Contact: dgough{at}bioeng.ucsd.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S.-E. Schelhorn, T. Lengauer, and M. Albrecht An integrative approach for predicting interactions of protein regions Bioinformatics, August 15, 2008; 24(16): i35 - i41. [Abstract] [PDF] |
||||
![]() |
Y. Guo, L. Yu, Z. Wen, and M. Li Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences Nucleic Acids Res., May 1, 2008; 36(9): 3025 - 3030. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-L. Faulon, M. Misra, S. Martin, K. Sale, and R. Sapra Genome scale enzyme metabolite and drug target interaction predictions using the signature molecular descriptor Bioinformatics, January 15, 2008; 24(2): 225 - 233. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. S. Negi, C. H. Schein, N. Oezguen, T. D. Power, and W. Braun InterProSurf: a web server for predicting interacting sites on protein surfaces Bioinformatics, December 15, 2007; 23(24): 3397 - 3399. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Nagamine and Y. Sakakibara Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data Bioinformatics, August 1, 2007; 23(15): 2004 - 2012. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-R. Xu, J.-X. Zhang, B.-C. Han, L. Liang, and Z.-L. Ji CytoSVM: an advanced server for identification of cytokine-receptor interactions Nucleic Acids Res., July 13, 2007; 35(suppl_2): W538 - W542. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Shen, J. Zhang, X. Luo, W. Zhu, K. Yu, K. Chen, Y. Li, and H. Jiang Predicting protein-protein interactions based only on sequences information PNAS, March 13, 2007; 104(11): 4337 - 4341. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. R. Jefferson, T. P. Walsh, T. J. Roberts, and G. J. Barton SNAPPI-DB: a database and API of Structures, iNterfaces and Alignments for Protein-Protein Interactions Nucleic Acids Res., January 12, 2007; 35(suppl_1): D580 - D589. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Ferraro, A. Via, G. Ausiello, and M. Helmer-Citterich A novel structure-based encoding for machine-learning applied to the inference of SH3 domain specificity Bioinformatics, October 1, 2006; 22(19): 2333 - 2339. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zheng, Z. Liu, C. Xue, W. Zhu, K. Chen, X. Luo, and H. Jiang Mutagenic probability estimation of chemical compounds by a novel molecular electrophilicity vector and support vector machine Bioinformatics, September 1, 2006; 22(17): 2099 - 2106. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. R. Li, H. H. Lin, L. Y. Han, L. Jiang, X. Chen, and Y. Z. Chen PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W32 - W37. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Eramian, M.-y. Shen, D. Devos, F. Melo, A. Sali, and M. A. Marti-Renom A composite score for predicting errors in protein structure models Protein Sci., July 1, 2006; 15(7): 1653 - 1666. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. H. Lin, L. Y. Han, H. L. Zhang, C. J. Zheng, B. Xie, and Y. Z. Chen Prediction of the functional class of lipid binding proteins from sequence-derived properties irrespective of sequence similarity J. Lipid Res., April 1, 2006; 47(4): 824 - 831. [Abstract] [Full Text] [PDF] |
||||
![]() |
X.-W. Chen and M. Liu Prediction of protein-protein interactions using random decision forest framework Bioinformatics, December 15, 2005; 21(24): 4394 - 4400. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. Kasson, J. B. Huppa, M. M. Davis, and A. T. Brunger A hybrid machine-learning approach for segmentation of protein localization data Bioinformatics, October 1, 2005; 21(19): 3778 - 3786. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Nasim and R. C. Trembath A dual-light reporter system to determine the efficiency of protein-protein interactions in mammalian cells Nucleic Acids Res., April 11, 2005; 33(7): e66 - e66. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Droit, G. G Poirier, and J. M Hunter Experimental and bioinformatic approaches for interrogating protein-protein interactions to determine protein function J. Mol. Endocrinol., April 1, 2005; 34(2): 263 - 280. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Martin, D. Roe, and J.-L. Faulon Predicting protein-protein interactions using signature products Bioinformatics, January 15, 2005; 21(2): 218 - 226. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Y. Han, C. Z. Cai, Z. L. Ji, Z. W. Cao, J. Cui, and Y. Z. Chen Predicting functional family of novel enzymes irrespective of sequence similarity: a statistical learning approach Nucleic Acids Res., December 7, 2004; 32(21): 6437 - 6444. [Abstract] [Full Text] [PDF] |
||||
![]() |
D.-S. Han, H.-S. Kim, W.-H. Jang, S.-D. Lee, and J.-K. Suh PreSPI: a domain combination based prediction system for protein-protein interaction Nucleic Acids Res., December 1, 2004; 32(21): 6312 - 6320. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Wang, J. Yang, G.-P. Liu, Z.-J. Xu, and K.-C. Chou Weighted-support vector machines for predicting membrane protein types based on pseudo-amino acid composition Protein Eng. Des. Sel., June 1, 2004; 17(6): 509 - 516. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Y. HAN, C. Z. CAI, S. L. LO, M. C.M. CHUNG, and Y. Z. CHEN Prediction of RNA-binding proteins from primary sequence by a support vector machine approach RNA, March 1, 2004; 10(3): 355 - 368. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.Z. Cai, L.Y. Han, Z.L. Ji, X. Chen, and Y.Z. Chen SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence Nucleic Acids Res., July 1, 2003; 31(13): 3692 - 3697. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Lu, M. Hallett, S. Pollock, and D. Thomas DePIE: Designing Primers for Protein Interaction Experiments Nucleic Acids Res., July 1, 2003; 31(13): 3755 - 3757. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Halperin, H. Wolfson, and R. Nussinov SiteLight: Binding-site prediction using phage display libraries Protein Sci., July 1, 2003; 12(7): 1344 - 1359. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-D. Cai, G.-P. Zhou, and K.-C. Chou Support Vector Machines for Predicting Membrane Protein Types by Using Functional Domain Composition Biophys. J., May 1, 2003; 84(5): 3257 - 3263. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-C. Chou and Y.-D. Cai Using Functional Domain Composition and Support Vector Machines for Prediction of Protein Subcellular Location J. Biol. Chem., November 22, 2002; 277(48): 45765 - 45769. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Bock and D. A. Gough A New Method to Estimate Ligand-Receptor Energetics Mol. Cell. Proteomics, November 1, 2002; 1(11): 904 - 910. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Deng, S. Mehta, F. Sun, and T. Chen Inferring Domain-Domain Interactions From Protein-Protein Interactions Genome Res., October 1, 2002; 12(10): 1540 - 1548. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Saito, H. Suzuki, and Y. Hayashizaki Interaction generality, a measurement to assess the reliability of a protein-protein interaction Nucleic Acids Res., March 1, 2002; 30(5): 1163 - 1168. [Abstract] [Full Text] [PDF] |
||||











