Bioinformatics Advance Access published online on December 21, 2004
Bioinformatics, doi:10.1093/bioinformatics/bti242
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 School Of Biochemistry and Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
* To whom correspondence should be addressed.
Motivation: Structural genomics projects are beginning to produce protein structures with unknown function therefore accurate, automated predictors of protein function are required if all these structures are to be properly annotated in reasonable time. Identifying the interface between two interacting proteins provides important clues as to the function of a protein, and can reduce the search space required by docking algorithms to predict the structures of complexes. Results: We have combined a support vector machine (SVM) approach with surface patch analysis to predict protein-protein binding sites. Using a leave-one-out cross validation procedure, we were able to successfully predict the location of the binding site on 76% of our data set made up proteins with both transient and obligate interfaces. With heterogeneous cross-validation, where we trained the SVM on transient complexes to predict on obligate complexes (and vice versa), we still achieved comparable success rates to the leave-one-out cross validation suggesting that sufficient properties are shared between transient and obligate interfaces. Availability: A web application based on the method can be found at http://www.bioinformatics.leeds.ac.uk/ppi_pred. The data set of 180 proteins used in this study is also available via the same web site. Supplementary information: Included.
Received September 14, 2004
Revised November 18, 2004
Accepted December 16, 2004
Article
Improved prediction of protein-protein binding sites using a support vector machines approach
David R. Westhead, E-mail: westhead{at}bmb.leeds.ac.uk
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Tuncbag, G. Kar, O. Keskin, A. Gursoy, and R. Nussinov A survey of available tools and web servers for analysis of protein-protein interactions and interfaces Brief Bioinform, May 1, 2009; 10(3): 217 - 232. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Ezkurdia, L. Bartoli, P. Fariselli, R. Casadio, A. Valencia, and M. L. Tress Progress and challenges in predicting protein-protein interaction sites Brief Bioinform, May 1, 2009; 10(3): 233 - 246. [Abstract] [Full Text] [PDF] |
||||
![]() |
X.-w. Chen and J. C. Jeong Sequence-based prediction of protein interaction sites with an integrative method Bioinformatics, March 1, 2009; 25(5): 585 - 591. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. C. Chen and C. Lim Common physical basis of macromolecule-binding sites in proteins Nucleic Acids Res., December 1, 2008; 36(22): 7078 - 7087. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. C. Dukka Bahadur and D. R. Livesay Improving position-specific predictions of protein functional sites using phylogenetic motifs Bioinformatics, October 15, 2008; 24(20): 2308 - 2316. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Song, H. Tan, K. Takemoto, and T. Akutsu HSEpred: predict half-sphere exposure from protein sequences Bioinformatics, July 1, 2008; 24(13): 1489 - 1497. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Madaoui and R. Guerois Coevolution at protein complex interfaces can be detected by the complementarity trace with important impact for predictive docking PNAS, June 3, 2008; 105(22): 7708 - 7713. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Bernauer, R. P. Bahadur, F. Rodier, J. Janin, and A. Poupon DiMoVo: a Voronoi tessellation-based method for discriminating crystallographic and biological protein-protein interactions Bioinformatics, March 1, 2008; 24(5): 652 - 658. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Song, Z. Yuan, H. Tan, T. Huber, and K. Burrage Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure Bioinformatics, December 1, 2007; 23(23): 3147 - 3154. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-X. Zhou and S. Qin Interaction-site prediction for protein complexes: a critical assessment Bioinformatics, September 1, 2007; 23(17): 2203 - 2209. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Neuvirth, U. Heinemann, D. Birnbaum, N. Tishby, and G. Schreiber ProMateus--an open research approach to protein-binding sites analysis Nucleic Acids Res., July 13, 2007; 35(suppl_2): W543 - W548. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Tjong, S. Qin, and H.-X. Zhou PI2PE: protein interface/interior prediction engine Nucleic Acids Res., July 13, 2007; 35(suppl_2): W357 - W362. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Kim, S. C. Oh, K. J. Lim, J. Kasamatsu, J. Y. Heo, B. S. Park, H. Lee, O. J. Yoo, M. Kasahara, and J.-O. Lee Structural Diversity of the Hagfish Variable Lymphocyte Receptors J. Biol. Chem., March 2, 2007; 282(9): 6726 - 6732. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-H. Li, L. Lin, X.-L. Wang, and T. Liu Protein protein interaction site prediction based on conditional random fields Bioinformatics, March 1, 2007; 23(5): 597 - 604. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Friedrich, B. Pils, T. Dandekar, J. Schultz, and T. Muller Modelling interaction sites in protein domains with interaction profile hidden Markov models Bioinformatics, December 1, 2006; 22(23): 2851 - 2857. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. de Vries and A. M. J. J. Bonvin Intramolecular surface contacts contain information about protein-protein interface regions Bioinformatics, September 1, 2006; 22(17): 2094 - 2098. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Liang, C. Zhang, S. Liu, and Y. Zhou Protein binding site prediction using an empirical scoring function Nucleic Acids Res., August 7, 2006; 34(13): 3698 - 3707. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Burgoyne and R. M. Jackson Predicting protein interaction sites: binding hot-spots in protein-protein and protein-ligand interfaces Bioinformatics, June 1, 2006; 22(11): 1335 - 1342. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Mihalek, I. Res, and O. Lichtarge A structure and evolution-guided Monte Carlo sequence selection strategy for multiple alignment-based analysis of proteins Bioinformatics, January 15, 2006; 22(2): 149 - 156. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-J. Park, M. M. Gromiha, P. Horton, and M. Suwa Discrimination of outer membrane proteins using support vector machines Bioinformatics, December 1, 2005; 21(23): 4223 - 4229. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. C. Kulkarni, R. Vigneshwar, V. K. Jayaraman, and B. D. Kulkarni Identification of coding and non-coding sequences using local Holder exponent formalism Bioinformatics, October 15, 2005; 21(20): 3818 - 3823. [Abstract] [Full Text] [PDF] |
||||




