Skip Navigation



Bioinformatics Advance Access published online on September 24, 2007

Bioinformatics, doi:10.1093/bioinformatics/btm470
This Article
Right arrow Advance Access manuscript (PDF)
Right arrow Supplementary data
Right arrow All Versions of this Article:
23/22/3001    most recent
btm470v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Davies, J.R.
Right arrow Articles by Taylor, C.C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Davies, J.R.
Right arrow Articles by Taylor, C.C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

The Poisson Index: a new probabilistic model for protein-ligand binding site similarity

J.R. Davies a, R.M. Jackson b,*, K.V. Mardia a and C.C. Taylor a

aSchool of Mathematics, University of Leeds, Leeds LS2 9JT, UK, bInstitute of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9JT, UK

*To whom correspondence should be addressed. Dr. Richard Jackson, E-mail: r.m.jackson{at}leeds.ac.uk


   Abstract

Motivation: The large scale comparison of protein-ligand binding sites is problematic, in that measures of structural similarity are difficult to quantify and are not easily understood in terms of statistical similarity that can ultimately be related to structure and function. We present a binding site matching score the Poisson Index (PI) based upon a well defined statistical model. PI requires only the number of matching atoms between two sites and the size of the two sites —the same information used by the Tanimoto Index (TI), a comparable and widely used measure for molecular similarity. We apply PI and TI to a previously automatically extracted set of binding sites to determine the robustness and usefulness of both scores.

Results: We found that PI outperforms TI; moreover site similarity is poorly defined for TI at values around the 99.5% confidence level for which PI is well defined. A difference map at this confidence level shows that PI gives much more meaningful information than TI. We show individual examples where TI fails to distinguish either a false or a true site paring in contrast to PI, which performs much better. TI cannot handle large or small sites very well, or the comparison of large and small sites, in contrast to PI which is shown to be much more robust. Despite the difficulty of determining a biological "ground truth" for binding site similarity we conclude that PI is a suitable measure of binding site similarity and could form the basis for a binding site classification scheme comparable to existing protein domain classification schema.

Availability: PI is implemented in SitesBase www.modelling.leeds.ac.uk/sb/.

Contact: r.m.jackson{at}leeds.ac.uk

Keywords: Tanimoto Index, SitesBase, maximum likelihood estimation, protein structure, structural similarity.

Associate Editor: Prof. Burkhard Rost


Received on June 13, 2007; revised on September 7, 2007; accepted on September 10, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.