Skip Navigation


Bioinformatics Advance Access originally published online on September 24, 2007
Bioinformatics 2007 23(22):3001-3008; doi:10.1093/bioinformatics/btm470
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary data
Right arrow All Versions of this Article:
23/22/3001    most recent
btm470v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (6)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Davies, J.R.
Right arrow Articles by Taylor, C.C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Davies, J.R.
Right arrow Articles by Taylor, C.C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

The Poisson Index: a new probabilistic model for protein–ligand binding site similarity

J.R. Davies 1, R.M. Jackson 2,*, K.V. Mardia 1 and C.C. Taylor 1

1School of Mathematics and 2Institute of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9JT, UK

*To whom correspondence should be addressed.


   Abstract

Motivation: The large-scale comparison of protein–ligand binding sites is problematic, in that measures of structural similarity are difficult to quantify and are not easily understood in terms of statistical similarity that can ultimately be related to structure and function. We present a binding site matching score the Poisson Index (PI) based upon a well-defined statistical model. PI requires only the number of matching atoms between two sites and the size of the two sites—the same information used by the Tanimoto Index (TI), a comparable and widely used measure for molecular similarity. We apply PI and TI to a previously automatically extracted set of binding sites to determine the robustness and usefulness of both scores.

Results: We found that PI outperforms TI; moreover, site similarity is poorly defined for TI at values around the 99.5% confidence level for which PI is well defined. A difference map at this confidence level shows that PI gives much more meaningful information than TI. We show individual examples where TI fails to distinguish either a false or a true site paring in contrast to PI, which performs much better. TI cannot handle large or small sites very well, or the comparison of large and small sites, in contrast to PI that is shown to be much more robust. Despite the difficulty of determining a biological ‘ground truth’ for binding site similarity we conclude that PI is a suitable measure of binding site similarity and could form the basis for a binding site classification scheme comparable to existing protein domain classification schema.

Availability: PI is implemented in SitesBase www.modelling.leeds.ac.uk/sb/

Contact: r.m.jackson{at}leeds.ac.uk

Associate Editor: Burkhard Rost


Received on June 13, 2007; revised on September 7, 2007; accepted on September 10, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Stat Methods Med ResHome page
T. Hamelryck
Probabilistic models and machine learning in structural bioinformatics
Statistical Methods in Medical Research, October 1, 2009; 18(5): 505 - 526.
[Abstract] [PDF]


Home page
BioinformaticsHome page
L. Xie, L. Xie, and P. E. Bourne
A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery
Bioinformatics, June 15, 2009; 25(12): i305 - i312.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.