Bioinformatics Advance Access originally published online on August 19, 2004
Bioinformatics 2005 21(2):218-226; doi:10.1093/bioinformatics/bth483
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics vol. 21 issue 2 © Oxford University Press 2005; all rights reserved.
Predicting proteinprotein interactions using signature products
1 Sandia National Laboratories, Computational Biology 9212, P.O. Box 5800, MS 310, Albuquerque, NM, 87185, USA
2 Biosystems Research 9212, P.O. Box 969, MS 9951, Livermore, CA, 94551, USA
3 Computational Biology 9212, P.O. Box 969, MS 9951, Livermore, CA, 94551, USA
*To whom correspondence should be addressed.
Motivation: Proteome-wide prediction of proteinprotein interaction is a difficult and important problem in biology. Although there have been recent advances in both experimental and computational methods for predicting proteinprotein interactions, we are only beginning to see a confluence of these techniques. In this paper, we describe a very general, high-throughput method for predicting proteinprotein interactions. Our method combines a sequence-based description of proteins with experimental information that can be gathered from any type of proteinprotein interaction screen. The method uses a novel description of interacting proteins by extending the signature descriptor, which has demonstrated success in predicting peptide/protein binding interactions for individual proteins. This descriptor is extended to protein pairs by taking signature products. The signature product is implemented within a support vector machine classifier as a kernel function.
Results: We have applied our method to publicly available yeast, Helicobacter pylori, human and mouse datasets. We used the yeast and H.pylori datasets to verify the predictive ability of our method, achieving from 70 to 80% accuracy rates using 10-fold cross-validation. We used the human and mouse datasets to demonstrate that our method is capable of cross-species prediction. Finally, we reused the yeast dataset to explore the ability of our algorithm to predict domains.
Contact: smartin{at}sandia.gov.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Guo, L. Yu, Z. Wen, and M. Li Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences Nucleic Acids Res., May 1, 2008; 36(9): 3025 - 3030. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-L. Faulon, M. Misra, S. Martin, K. Sale, and R. Sapra Genome scale enzyme metabolite and drug target interaction predictions using the signature molecular descriptor Bioinformatics, January 15, 2008; 24(2): 225 - 233. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.D.J. van Dijk, C.J.F. ter Braak, R.G. Immink, G.C. Angenent, and R.C.H.J. van Ham Predicting and understanding transcription factor interactions based on sequence level determinants of combinatorial control Bioinformatics, January 1, 2008; 24(1): 26 - 33. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Nagamine and Y. Sakakibara Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data Bioinformatics, August 1, 2007; 23(15): 2004 - 2012. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Ferraro, A. Via, G. Ausiello, and M. Helmer-Citterich A novel structure-based encoding for machine-learning applied to the inference of SH3 domain specificity Bioinformatics, October 1, 2006; 22(19): 2333 - 2339. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Zhang, C. Shao, D. Zheng, and Y. Gao An Integrated Machine Learning System to Computationally Screen Protein Databases for Protein Binding Peptide Ligands Mol. Cell. Proteomics, July 1, 2006; 5(7): 1224 - 1232. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Nanni and A. Lumini An ensemble of K-local hyperplanes for predicting protein-protein interactions Bioinformatics, May 15, 2006; 22(10): 1207 - 1210. [Abstract] [Full Text] [PDF] |
||||


