Bioinformatics Advance Access originally published online on April 8, 2009
Bioinformatics 2009 25(12):1506-1512; doi:10.1093/bioinformatics/btp238
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ModLink+: improving fold recognition by using protein–protein interactions

1Structural Bioinformatics Lab (GRIB-IMIM), Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona (PRBB), Barcelona, Catalonia, 2Structural Genomics Unit, Bioinformatics & Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain, 3Department of Bioengineering and Therapeutic Sciences, 4Department of Pharmaceutical Chemistry and 5California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation:Several strategies have been developed to predict the fold of a target protein sequence, most of which are based on aligning the target sequence to other sequences of known structure. Previously, we demonstrated that the consideration of protein–protein interactions significantly increases the accuracy of fold assignment compared with PSI-BLAST sequence comparisons. A drawback of our method was the low number of proteins to which a fold could be assigned. Here, we present an improved version of the method that addresses this limitation. We also compare our method to other state-of-the-art fold assignment methodologies.
Results: Our approach (ModLink+) has been tested on 3716 proteins with domain folds classified in the Structural Classification Of Proteins (SCOP) as well as known interacting partners in the Database of Interacting Proteins (DIP). For this test set, the ratio of success [positive predictive value (PPV)] on fold assignment increases from 75% for PSI-BLAST, 83% for HHSearch and 81% for PRC to >90% for ModLink+at the e-value cutoff of 10–3. Under this e-value, ModLink+can assign a fold to 30–45% of the proteins in the test set, while our previous method could cover <25%. When applied to 6384 proteins with unknown fold in the yeast proteome, ModLink+combined with PSI-BLAST assigns a fold for domains in 3738 proteins, while PSI-BLAST alone covers only 2122 proteins, HHSearch 2969 and PRC 2826 proteins, using a threshold e-value that would represent a PPV >82% for each method in the test set.
Availability: The ModLink+server is freely accessible in the World Wide Web at http://sbi.imim.es/modlink/.
Contact: boliva{at}imim.es.
Supplementary information: Supplementary data are available at Bioinformatics online.
Present address: AB-Biotics S.L, Masia Can Fatjó Del Molí s/n, 08290 Cerdanyola del vallès, Catalonia, Spain.
Associate editor: Alfonso Valencia
Received on December 23, 2008; revised on March 12, 2009; accepted on April 4, 2009