Bioinformatics Advance Access originally published online on January 9, 2009
Bioinformatics 2009 25(4):443-450; doi:10.1093/bioinformatics/btn664
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Masking residues using context-specific evolutionary conservation significantly improves short linear motif discovery
1UCD Complex and Adaptive Systems Laboratory, UCD Conway Institute of Biomolecular and Biomedical Sciences, University College Dublin, Dublin 4, Ireland and 2School of Biological Sciences, University of Southampton, Boldrewood Campus, Southampton SO16 7PX, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Short linear motifs (SLiMs) are important mediators of protein–protein interactions. Their short and degenerate nature presents a challenge for computational discovery. We sought to improve SLiM discovery by incorporating evolutionary information, since SLiMs are more conserved than surrounding residues.
Results: We have developed a new method that assesses the evolutionary signal of a residue in its sequence and structural context. Under-conserved residues are masked out prior to SLiM discovery, allowing incorporation into the existing statistical model employed by SLiMFinder. The method shows considerable robustness in terms of both the conservation score used for individual residues and the size of the sequence neighbourhood. Optimal parameters significantly improve return of known functional motifs from benchmarking data, raising the return of significant validated SLiMs from typical human interaction datasets from 20% to 60%, while retaining the high level of stringency needed for application to real biological data. The success of this regime indicates that it could be of general benefit to computational annotation and prediction of protein function at the sequence level.
Availability: All data and tools in this article are available at http://bioware.ucd.ie/~slimdisc/slimfinder/conmasking/.
Contact: r.edwards{at}southampton.ac.uk
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Dmitrij Frishman
Received on September 4, 2008; revised on December 1, 2008; accepted on December 27, 2008
This article has been cited by other articles:
![]() |
D. T.-H. Chang, T.-Y. Chien, and C.-Y. Chen seeMotif: exploring and visualizing sequence motifs in 3D structures Nucleic Acids Res., July 1, 2009; 37(suppl_2): W552 - W558. [Abstract] [Full Text] [PDF] |
||||
