Bioinformatics Advance Access published online on August 19, 2004
Bioinformatics, doi:10.1093/bioinformatics/bth483
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Sandia National Laboratories, Computational Biology, 9212, PO Box 5800, MS 310, Albuquerque, NM, 87185, USA
* To whom correspondence should be addressed. E-mail: smartin{at}sandia.gov.
Motivation: Proteome wide prediction of protein-protein interaction is a difficult and important problem in biology. Although there have been recent advances in both experimental and computational methods for predicting protein-protein interactions, we are only beginning to see a confluence of these techniques. In this paper we describe a very general, high-throughput method for predicting protein-protein interactions. Our method combines a sequence-based description of proteins with experimental information that can be gathered from any type of protein-protein interaction screen. The method uses a novel description of interacting proteins by extending the signature descriptor, which has demonstrated success in predicting peptide/protein binding interactions for individual proteins. This descriptor is extended to protein pairs by taking signature products. The signature product is implemented within a Support Vector Machine classifier as a kernel function. Results: We have applied our method to publicly available yeast, Helicobacter pylori, human, and mouse datasets. We used the yeast and H. pylori datasets to verify the predictive ability of our method, achieving from 70% to 80% accuracy rates using ten-fold cross-validation. We used the human and mouse datasets to demonstrate that our method is capable of cross species prediction. Finally, we re-used the yeast dataset to explore the ability of our algorithm to predict domains.
Revised August 12, 2004
Accepted August 13, 2004
Article
Predicting protein-protein interactions using signature products
2 Biosystems Research, PO Box 969, MS 9951, Livermore, CA, 94551, USA
3 Computational Biology, 9212, PO Box 969, MS 9951, Livermore, CA, 94551, USA
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Guo, L. Yu, Z. Wen, and M. Li Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences Nucleic Acids Res., May 1, 2008; 36(9): 3025 - 3030. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-L. Faulon, M. Misra, S. Martin, K. Sale, and R. Sapra Genome scale enzyme metabolite and drug target interaction predictions using the signature molecular descriptor Bioinformatics, January 15, 2008; 24(2): 225 - 233. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.D.J. van Dijk, C.J.F. ter Braak, R.G. Immink, G.C. Angenent, and R.C.H.J. van Ham Predicting and understanding transcription factor interactions based on sequence level determinants of combinatorial control Bioinformatics, January 1, 2008; 24(1): 26 - 33. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Nagamine and Y. Sakakibara Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data Bioinformatics, August 1, 2007; 23(15): 2004 - 2012. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Ferraro, A. Via, G. Ausiello, and M. Helmer-Citterich A novel structure-based encoding for machine-learning applied to the inference of SH3 domain specificity Bioinformatics, October 1, 2006; 22(19): 2333 - 2339. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Zhang, C. Shao, D. Zheng, and Y. Gao An Integrated Machine Learning System to Computationally Screen Protein Databases for Protein Binding Peptide Ligands Mol. Cell. Proteomics, July 1, 2006; 5(7): 1224 - 1232. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Nanni and A. Lumini An ensemble of K-local hyperplanes for predicting protein-protein interactions Bioinformatics, May 15, 2006; 22(10): 1207 - 1210. [Abstract] [Full Text] [PDF] |
||||


