Skip Navigation


Bioinformatics Advance Access originally published online on January 5, 2006
Bioinformatics 2006 22(5):532-540; doi:10.1093/bioinformatics/bti804
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
22/5/532    most recent
bti804v2
bti804v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (7)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Lehrach, W. P.
Right arrow Articles by Williams, C. K. I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lehrach, W. P.
Right arrow Articles by Williams, C. K. I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

A regularized discriminative model for the prediction of protein–peptide interactions

Wolfgang P. Lehrach 1,2,*, Dirk Husmeier 2 and Christopher K. I. Williams 1

1University of Edinburgh Edinburgh EH1 2QL, UK
2Biomathematics and Statistics Scotland Edinburgh EH9 3JZ, UK

*To whom correspondence should be addressed.

Motivation: Short well-defined domains known as peptide recognition modules (PRMs) regulate many important protein–protein interactions involved in the formation of macromolecular complexes and biochemical pathways. Since high-throughput experiments like yeast two-hybrid and phage display are expensive and intrinsically noisy, it would be desirable to more specifically target or partially bypass them with complementary in silico approaches. In the present paper, we present a probabilistic discriminative approach to predicting PRM-mediated protein–protein interactions from sequence data. The model is motivated by the discriminative model of Segal and Sharan as an alternative to the generative approach of Reiss and Schwikowski. In our evaluation, we focus on predicting the interaction network. As proposed by Williams, we overcome the problem of susceptibility to over-fitting by adopting a Bayesian a posteriori approach based on a Laplacian prior in parameter space.

Results: The proposed method was tested on two datasets of protein–protein interactions involving 28 SH3 domain proteins in Saccharmomyces cerevisiae, where the datasets were obtained with different experimental techniques. The predictions were evaluated with out-of-sample receiver operator characteristic (ROC) curves. In both cases, Laplacian regularization turned out to be crucial for achieving a reasonable generalization performance. The Laplacian-regularized discriminative model outperformed the generative model of Reiss and Schwikowski in terms of the area under the ROC curve on both datasets. The performance was further improved with a hybrid approach, in which our model was initialized with the motifs obtained with the method of Reiss and Schwikowski.

Availability: Software and supplementary material is available from http://lehrach.com/wolfgang/dmf

Contact: wlehrach{at}ed.ac.uk


Received on August 4, 2005; revised on November 23, 2005; accepted on November 25, 2005

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
S. Pitre, C. North, M. Alamgir, M. Jessulat, A. Chan, X. Luo, J. R. Green, M. Dumontier, F. Dehne, and A. Golshani
Global investigation of protein-protein interactions in yeast Saccharomyces cerevisiae using re-occurring short polypeptide sequences
Nucleic Acids Res., August 1, 2008; 36(13): 4286 - 4294.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. Ferraro, D. Peluso, A. Via, G. Ausiello, and M. Helmer-Citterich
SH3-Hunter: discovery of SH3 domain interaction sites in proteins
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W451 - W454.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. C. Cawley and N. L. C. Talbot
Gene selection in cancer classification using sparse logistic regression with Bayesian regularization
Bioinformatics, October 1, 2006; 22(19): 2348 - 2355.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. Ferraro, A. Via, G. Ausiello, and M. Helmer-Citterich
A novel structure-based encoding for machine-learning applied to the inference of SH3 domain specificity
Bioinformatics, October 1, 2006; 22(19): 2333 - 2339.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.