Bioinformatics Advance Access originally published online on February 19, 2008
Bioinformatics 2008 24(7):901-907; doi:10.1093/bioinformatics/btn055
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ParCrys: a Parzen window density estimation approach to protein crystallization propensity prediction
1School of Life Sciences Research, University of Dundee, Dow Street, Dundee, DD1 5EH and 2Department of Computing Science, University of Glasgow, Glasgow, GL12 8QQ, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
The ability to rank proteins by their likely success in crystallization is useful in current Structural Biology efforts and in particular in high-throughput Structural Genomics initiatives. We present ParCrys, a Parzen Window approach to estimate a protein's propensity to produce diffraction-quality crystals. The Protein Data Bank (PDB) provided training data whilst the databases TargetDB and PepcDB were used to define feature selection data as well as test data independent of feature selection and training. ParCrys outperforms the OB-Score, SECRET and CRYSTALP on the data examined, with accuracy and Matthews correlation coefficient values of 79.1% and 0.582, respectively (74.0% and 0.227, respectively, on data with a real-world ratio of positive:negative examples). ParCrys predictions and associated data are available from www.compbio.dundee.ac.uk/parcrys.
Contact: geoff{at}compbio.dundee.ac.uk
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: John Quackenbush
Received on June 1, 2007; revised on January 21, 2008; accepted on February 6, 2008
This article has been cited by other articles:
![]() |
I. M. Overton, C. A. J. van Niekerk, L. G. Carter, A. Dawson, D. M. A. Martin, S. Cameron, S. A. McMahon, M. F. White, W. N. Hunter, J. H. Naismith, et al. TarO: a target optimisation system for structural biology Nucleic Acids Res., July 1, 2008; 36(suppl_2): W190 - W196. [Abstract] [Full Text] [PDF] |
||||
