Bioinformatics Advance Access originally published online on February 1, 2008
Bioinformatics 2008 24(6):775-782; doi:10.1093/bioinformatics/btm618
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Prediction of zinc-binding sites in proteins from sequence
Structural Chemistry, Arrhenius Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Motivated by the abundance, importance and unique functionality of zinc, both biologically and physiologically, we have developed an improved method for the prediction of zinc-binding sites in proteins from their amino acid sequences.
Results: By combining support vector machine (SVM) and homology-based predictions, our method predicts zinc-binding Cys, His, Asp and Glu with 75% precision (86% for Cys and His only) at 50% recall according to a 5-fold cross-validation on a non-redundant set of protein chains from the Protein Data Bank (PDB) (2727 chains, 235 of which bind zinc). Consequently, our method predicts zinc-binding Cys and His with 10% higher precision at different recall levels compared to a recently published method when tested on the same dataset.
Availability: The program is available for download at www.fos.su.se/~nanjiang/zincpred/download/
Contact: svenh{at}struc.su.se
Supplementary information: All Supplementary Data can be accessed at www.fos.su.se/~nanjiang/zincpred/suppliment
Associate Editor: Burkhard Rost
Received on October 18, 2007; revised on December 9, 2007; accepted on December 11, 2007