Bioinformatics Advance Access published online on April 8, 2009
Bioinformatics, doi:10.1093/bioinformatics/btp238
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ModLink+: Improving fold recognition by using protein-protein interactions
1Structural Bioinformatics Lab (GRIB-IMIM), Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona (PRBB), Barcelona, Catalonia, Spain.
2Present address: AB-Biotics S.L, Masia Can Fatjó del Molí s/n, 08290 Cerdanyola del Vallès, Catalonia, Spain.
3Structural Genomics Unit, Bioinformatics & Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain.
4Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, California, United States of America.
5Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, California, United States of America.
6California Institute for Quantitative Biosciences, University of California San Francisco, San Francisco, California, United States of America.
*To whom correspondence should be addressed. Dr. Baldomero Oliva, E-mail: baldo.oliva{at}upf.edu
| Abstract |
|---|
Motivation: Several strategies have been developed to predict the fold of a target protein sequence, most of which are based on aligning the target sequence to other sequences of known structure. Previously, we demonstrated that the consideration of protein-protein interactions significantly increases the accuracy of fold assignment compared to PSI-BLAST sequence comparisons. A drawback of our method was the low number of proteins to which a fold could be assigned. Here, we present an improved version of the method that addresses this limitation. We also compare our method to other state-of-the-art fold assignment methodologies.
Results: Our approach (ModLink+) has been tested on 3,716 proteins with domain folds classified in the Structural Classification Of Proteins (SCOP) as well as known interacting partners in the Database of Interacting Proteins (DIP). For this test set, the ratio of success (PPV) on fold assignment increases from 75% for PSI-BLAST, 83% for HHSearch and 81% for PRC to more than 90% for ModLink+ at the e-value cutoff of 10-3. Under this e-value, ModLink+ can assign a fold to 30-45% of the proteins in the test set, while our previous method could cover less than 25%. When applied to 6,384 proteins with unknown fold in the yeast proteome, ModLink+ combined with PSI-BLAST assigns a fold for domains in 3,738 proteins, while PSI-BLAST alone only covers 2,122 proteins, HHSearch 2,969 and PRC 2,826 proteins, using a threshold e-value that would represent a PPV higher than 82% for each method in the test set.
Availability: The ModLink+ server is freely accessible in the World Wide Web at http://sbi.imim.es/modlink/.
Contact: boliva{at}imim.es.
Associate Editor: Prof. Alfonso Valencia
Received on December 23, 2008; revised on March 12, 2009; accepted on April 4, 2009