Skip Navigation

Bioinformatics 2007 23(13):i347-i353; doi:10.1093/bioinformatics/btm174
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Ofran, Y.
Right arrow Articles by Rost, B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ofran, Y.
Right arrow Articles by Rost, B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Prediction of DNA-binding residues from sequence

Yanay Ofran 1,2,*, Venkatesh Mysore 1,3 and Burkhard Rost 1,2,4

1Department of Biochemistry and Molecular Biophysics, Columbia University, 630 West 168th Street, 2Columbia University Center for Computational Biology and Bioinformatics (C2B2), 1130 St Nicholas Ave. Rm. 802, 3D.E.Shaw Research, 120 West Forty Fifth Street (current affiliation) and 4NorthEast Structural Genomics Consortium (NESG), Columbia University, 1130 St Nicholas Ave. Rm. 802, New York, NY 10032, USA

*To whom correspondence should be addressed.


   Abstract

Motivation: Thousands of proteins are known to bind to DNA; for most of them the mechanism of action and the residues that bind to DNA, i.e. the binding sites, are yet unknown. Experimental identification of binding sites requires expensive and laborious methods such as mutagenesis and binding essays. Hence, such studies are not applicable on a large scale. If the 3D structure of a protein is known, it is often possible to predict DNA-binding sites in silico. However, for most proteins, such knowledge is not available.

Results: It has been shown that DNA-binding residues have distinct biophysical characteristics. Here we demonstrate that these characteristics are so distinct that they enable accurate prediction of the residues that bind DNA directly from amino acid sequence, without requiring any additional experimental or structural information. In a cross-validation based on the largest non-redundant dataset of high-resolution protein–DNA complexes available today, we found that 89% of our predictions are confirmed by experimental data. Thus, it is now possible to identify DNA-binding sites on a proteomic scale even in the absence of any experimental data or 3D-structural information.

Availability: http://cubic.bioc.columbia.edu/services/disis

Contact: yo135{at}columbia.edu



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
W.-Y. Chu, Y.-F. Huang, C.-C. Huang, Y.-S. Cheng, C.-K. Huang, and Y.-J. Oyang
ProteDNA: a sequence-based predictor of sequence-specific DNA-binding residues in transcription factors
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W396 - W401.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Wu, H. Liu, X. Duan, Y. Ding, H. Wu, Y. Bai, and X. Sun
Prediction of DNA-binding residues in proteins from amino acid sequences using a random forest model with a hybrid feature
Bioinformatics, January 1, 2009; 25(1): 30 - 35.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Song, H. Tan, K. Takemoto, and T. Akutsu
HSEpred: predict half-sphere exposure from protein sequences
Bioinformatics, July 1, 2008; 24(13): 1489 - 1497.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.