Skip Navigation



Bioinformatics Advance Access published online on November 13, 2008

Bioinformatics, doi:10.1093/bioinformatics/btn580
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
25/1/22    most recent
btn580v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Persikov, A. V.
Right arrow Articles by Singh, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Persikov, A. V.
Right arrow Articles by Singh, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2008). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Predicting DNA recognition by Cys2His2 zinc finger proteins

Anton V. Persikov 1, Robert Osada 2 and Mona Singh 1,2,*

1Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, 08544, USA.
2Department of Computer Science, Princeton University, Princeton, New Jersey, 08544, USA.

*To whom correspondence should be addressed. Prof. Mona Singh, E-mail: mona{at}cs.princeton.edu


   Abstract

Motivation: Cys2His2 zinc finger (ZF) proteins represent the largest class of eukaryotic transcription factors. Their modular structure and well-conserved protein–DNA interface allow the development of computational approaches for predicting their DNA binding preferences even when no binding sites are known for a particular protein. The "canonical model" for ZF protein–DNA interaction consists of only four amino acid-nucleotide contacts per zinc finger domain.

Results: We present an approach for predicting ZF binding based on support vector machines (SVMs). While most previous computational approaches have been based solely on examples of known ZF protein-DNA interactions, ours additionally incorporates information about protein-DNA pairs known to bind weakly or not at all. Moreover, SVMs with a linear kernel can naturally incorporate constraints about the relative binding affinities of protein-DNA pairs; this type of information has not been used previously in predicting ZF protein-DNA binding. Here, we build a high-quality literature-derived experimental database of ZF–DNA binding examples and utilize it to test both linear and polynomial kernels for predicting ZF protein–DNA binding on the basis of the canonical binding model. The polynomial SVM outperforms previously published prediction procedures as well as the linear SVM. This may indicate the presence of dependencies between contacts in the canonical binding model and suggests that modification of the underlying structural model may result in further improved performance in predicting ZF protein-DNA binding. Overall, this work demonstrates that methods incorporating information about non-binding and relative binding of protein-DNA pairs have great potential for effective prediction of protein–DNA interactions.

Availability: An online tool for predicting ZF DNA binding is available at http://compbio.cs.princeton.edu/zf/.

Contact: mona{at}cs.princeton.edu

Supplementary information: Available at Bioinformatics online

Associate Editor: Prof. Alfonso Valencia


Received on July 14, 2008; revised on October 14, 2008; accepted on November 6, 2008

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.