Skip Navigation


Bioinformatics Advance Access originally published online on January 2, 2008
Bioinformatics 2008 24(5):613-620; doi:10.1093/bioinformatics/btm626
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
24/5/613    most recent
btm626v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Fischer, J. D.
Right arrow Articles by Söding, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fischer, J. D.
Right arrow Articles by Söding, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Prediction of protein functional residues from sequence by probability density estimation

J. D. Fischer {dagger}, C. E. Mayer and J. Söding *,{ddagger}

Department for Protein Evolution, Max Planck Institute for Developmental Biology, Spemannstr. 35, 72076 Tübingen, Germany

*To whom correspondence should be addressed.


   Abstract

Motivation: The prediction of ligand-binding residues or catalytically active residues of a protein may give important hints that can guide further genetic or biochemical studies. Existing sequence-based prediction methods mostly rank residue positions by evolutionary conservation calculated from a multiple sequence alignment of homologs. A problem hampering more wide-spread application of these methods is the low per-residue precision, which at 20% sensitivity is around 35% for ligand-binding residues and 20% for catalytic residues.

Results: We combine information from the conservation at each site, its amino acid distribution, as well as its predicted secondary structure (ss) and relative solvent accessibility (rsa). First, we measure conservation by how much the amino acid distribution at each site differs from the distribution expected for the predicted ss and rsa states. Second, we include the conservation of neighboring residues in a weighted linear score by analytically optimizing the signal-to-noise ratio of the total score. Third, we use conditional probability density estimation to calculate the probability of each site to be functional given its conservation, the observed amino acid distribution, and the predicted ss and rsa states.

We have constructed two large data sets, one based on the Catalytic Site Atlas and the other on PDB SITE records, to benchmark methods for predicting functional residues. The new method FRcons predicts ligand-binding and catalytic residues with higher precision than alternative methods over the entire sensitivity range, reaching 50% and 40% precision at 20% sensitivity, respectively.

Availability: Server: http://frpred.tuebingen.mpg.de. Data sets: ftp://ftp.tuebingen.mpg.de/pub/protevo/FRpred/

Contact: soeding{at}lmb.uni-muenchen.de

Supplementary information: Supplementary data are available at Bioinformatics Online.

Associate Editor: Burkhard Rost

{dagger}Present address: EMBL-EBI, Hinxton, Cambridge, CB10 1SD, UK.

{ddagger}Present address: Gene Center Munich, University of Munich (LMU), 81377 Munich, Germany.


Received on October 10, 2007; revised on November 20, 2007; accepted on December 14, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
T. Zhang, H. Zhang, K. Chen, S. Shen, J. Ruan, and L. Kurgan
Accurate sequence-based prediction of catalytic residues
Bioinformatics, October 15, 2008; 24(20): 2329 - 2338.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. A. Capra and M. Singh
Characterization and prediction of residues determining protein functional specificity
Bioinformatics, July 1, 2008; 24(13): 1473 - 1480.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.