Bioinformatics Advance Access published online on February 25, 2009
Bioinformatics, doi:10.1093/bioinformatics/btp114
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Predicting helix-helix interactions from residue contacts in membrane proteins
1Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei, 2Institute of Bioinformatics and Structural Biology, Department of Life Sciences, National Tsing Hua University, Hsinchu, 3Bioinformatics Lab., Institute of Information Science, Academia Sinica, Taipei, Taiwan, 4Centre for Cancer Biomedicine, University of Oslo, NO-0027 Oslo, and 5SAMBA, Norwegian Computing Center, P.O. Box 114 Blindern, NO-0314 Oslo, Norway
*To whom correspondence should be addressed., E-mail: tsung{at}iis.sinica.edu.tw; hsu{at}iis.sinica.edu.tw
| Abstract |
|---|
Motivation: Helix-helix interactions play a critical role in the structure assembly, stability, and function of membrane proteins. On the molecular level, the interactions are mediated by one or more residue contacts. Although previous studies focused on helix-packing patterns and sequence motifs, few of them developed methods specifically for contact prediction.
Results: We present a new hierarchical framework for contact prediction, with an application in membrane proteins. The hierarchical scheme consists of two levels: in the first level, contact residues are predicted from the sequence and their pairing relationships are fur-ther predicted in the second level. Statistical analyses on contact propensities are combined with other sequence and structural infor-mation for training the support vector machine classifiers. Evaluated on 52 protein chains using leave-one-out cross validation and an independent test set of 14 protein chains, the two-level approach consistently improves the conventional direct approach in prediction accuracy, with 80% reduction of input for prediction. Furthermore, the predicted contacts are then used to infer interactions between pairs of helices. When at least three predicted contacts are required for an inferred interaction, the accuracy, sensitivity, and specificity rates are 56%, 40%, and 89%, respectively. Our results demon-strate that a hierarchical framework can be applied to eliminate false positives while reducing computational complexity in predicting con-tacts. Together with the estimated contact propensities, this method can be used to gain insights into helix-packing in membrane proteins.
Availability: http://bio-cluster.iis.sinica.edu.tw/~bioapp/TMhit/
Contact: tsung{at}iis.sinica.edu.tw; hsu{at}iis.sinica.edu.tw
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Prof. Ivo Hofacker
Received on August 26, 2008; revised on February 20, 2009; accepted on February 23, 2009