Bioinformatics Advance Access originally published online on October 18, 2005
Bioinformatics 2005 21(24):4394-4400; doi:10.1093/bioinformatics/bti721
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Prediction of proteinprotein interactions using random decision forest framework
Bioinformatics and Computational Life-Sciences Laboratory, ITTC, Department of Electrical Engineering and Computer Science, The University of Kansas 1520 West 15th Street, Lawrence, KS 66045, USA
*To whom correspondence should be addressed.
Motivation: Protein interactions are of biological interest because they orchestrate a number of cellular processes such as metabolic pathways and immunological recognition. Domains are the building blocks of proteins; therefore, proteins are assumed to interact as a result of their interacting domains. Many domain-based models for protein interaction prediction have been developed, and preliminary results have demonstrated their feasibility. Most of the existing domain-based methods, however, consider only single-domain pairs (one domain from one protein) and assume independence between domaindomain interactions.
Results: In this paper, we introduce a domain-based random forest of decision trees to infer protein interactions. Our proposed method is capable of exploring all possible domain interactions and making predictions based on all the protein domains. Experimental results on Saccharomyces cerevisiae dataset demonstrate that our approach can predict proteinprotein interactions with higher sensitivity (79.78%) and specificity (64.38%) compared with that of the maximum likelihood approach. Furthermore, our model can be used to infer interactions not only for single-domain pairs but also for multiple domain pairs.
Contact: xwchen{at}ku.edu
Availability: Source code is written in Java and is available upon request from the authors.
Supplementary information: http://www.ittc.ku.edu/~xwchen/PPI/random_forest_PPI
Received on August 14, 2005; revised on October 6, 2005; accepted on October 14, 2005
This article has been cited by other articles:
![]() |
S.-E. Schelhorn, T. Lengauer, and M. Albrecht An integrative approach for predicting interactions of protein regions Bioinformatics, August 15, 2008; 24(16): i35 - i41. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Guo, X. Wu, D.-Y. Zhang, and K. Lin Genome-wide inference of protein interaction sites: lessons from the yeast high-quality negative protein-protein interaction dataset Nucleic Acids Res., April 1, 2008; 36(6): 2002 - 2011. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Raghavachari, A. Tasneem, T. M. Przytycka, and R. Jothi DOMINE: a database of protein domain interactions Nucleic Acids Res., January 11, 2008; 36(suppl_1): D656 - D661. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.D.J. van Dijk, C.J.F. ter Braak, R.G. Immink, G.C. Angenent, and R.C.H.J. van Ham Predicting and understanding transcription factor interactions based on sequence level determinants of combinatorial control Bioinformatics, January 1, 2008; 24(1): 26 - 33. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Shen, J. Zhang, X. Luo, W. Zhu, K. Yu, K. Chen, Y. Li, and H. Jiang Predicting protein-protein interactions based only on sequences information PNAS, March 13, 2007; 104(11): 4337 - 4341. [Abstract] [Full Text] [PDF] |
||||


