Bioinformatics Advance Access published online on January 22, 2004
Bioinformatics, doi:10.1093/bioinformatics/btg432
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Biochemical Science and Engineering, Kyushu Institute of Technology, Iizuka 820 8502, Fukuoka, Japan
* To whom correspondence should be addressed. E-mail: shandar{at}bse.kutech.ac.jp.
Motivation: Though vitally important to cell function, the mechanism of protein-DNA binding has not yet been completely understood. We therefore analyzed the relationship between DNA-binding and protein sequence composition, solvent accessibility and secondary structure. Using non-redundant databases of transcription factors and Protein-DNA complexes, neural network models were developed to utilize the information present in this relationship to predict DNA-binding proteins and their binding residues. Results: Sequence composition was found to provide sufficient information to predict the probability of its binding to DNA with nearly 69% sensitivity at 64% accuracy for the considered proteins; sequence neighborhood and solvent accessibility information were sufficient to make binding site predictions with 40% sensitivity at 79% accuracy. Detailed analysis of binding residues shows that some three- and five-residue segments frequently bind to DNA and that solvent accessibility plays a major role in binding. Although, binding behavior was not associated with any particular secondary structure, there were interesting exceptions at the residue level. Over-representation of some residues in the binding sites was largely lost at the total sequence level, but a different kind of compositional preference was observed in DNA-binding proteins. Availability: Online predictions of DNA binding proteins and binding sites are available at http://www.netasa.org/dbs-pred/
Revised May 26, 2003
Accepted July 26, 2003
Article
Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
2 Computational Biology Research Center (CBRC), AIST, 2-41-6, Aomi, Koto-ku, Tokyo 135 0064, Japan
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
B. Li, V. G. Krishnan, M. E. Mort, F. Xin, K. K. Kamati, D. N. Cooper, S. D. Mooney, and P. Radivojac Automated inference of molecular mechanisms of disease from amino acid substitutions Bioinformatics, November 1, 2009; 25(21): 2744 - 2750. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Xiong, T. Li, K. Chen, and K. Tang Local combinational variables: an approach used in DNA-binding helix-turn-helix motif prediction with sequence information Nucleic Acids Res., September 1, 2009; 37(17): 5632 - 5640. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wu, H. Liu, X. Duan, Y. Ding, H. Wu, Y. Bai, and X. Sun Prediction of DNA-binding residues in proteins from amino acid sequences using a random forest model with a hybrid feature Bioinformatics, January 1, 2009; 25(1): 30 - 35. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. C. Chen and C. Lim Common physical basis of macromolecule-binding sites in proteins Nucleic Acids Res., December 1, 2008; 36(22): 7078 - 7087. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ahmad, O. Keskin, A. Sarai, and R. Nussinov Protein-DNA interactions: structural, thermodynamic and clustering patterns of conserved residues in DNA-binding proteins Nucleic Acids Res., October 1, 2008; 36(18): 5922 - 5932. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu and G. D. Stormo Context-dependent DNA recognition code for C2H2 zinc-finger transcription factors Bioinformatics, September 1, 2008; 24(17): 1850 - 1857. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gao and J. Skolnick DBD-Hunter: a knowledge-based method for the prediction of DNA-protein interactions Nucleic Acids Res., July 1, 2008; 36(12): 3978 - 3992. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-X. Zhou and S. Qin Interaction-site prediction for protein complexes: a critical assessment Bioinformatics, September 1, 2007; 23(17): 2203 - 2209. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Neuvirth, U. Heinemann, D. Birnbaum, N. Tishby, and G. Schreiber ProMateus--an open research approach to protein-binding sites analysis Nucleic Acids Res., July 13, 2007; 35(suppl_2): W543 - W548. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ofran, V. Mysore, and B. Rost Prediction of DNA-binding residues from sequence Bioinformatics, July 1, 2007; 23(13): i347 - i353. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Tjong and H.-X. Zhou DISPLAR: an accurate method for predicting DNA-binding sites on protein surfaces Nucleic Acids Res., March 12, 2007; 35(5): 1465 - 1477. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hwang, Z. Gou, and I. B. Kuznetsov DP-Bind: a web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins Bioinformatics, March 1, 2007; 23(5): 634 - 636. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wang and S. J. Brown BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W243 - W248. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Bhardwaj, R. E. Langlois, G. Zhao, and H. Lu Kernel-based machine learning protocol for predicting DNA-binding proteins Nucleic Acids Res., November 10, 2005; 33(20): 6486 - 6493. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Alonso, K. Baptista, T. Ngo, and D. E. Taylor Transcriptional organization of the temperature-sensitive transfer system from the IncHI1 plasmid R27 Microbiology, November 1, 2005; 151(11): 3563 - 3573. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pellegrini-Calace and J. M. Thornton Detecting DNA-binding helix-turn-helix structural motifs using sequence and structure information Nucleic Acids Res., April 14, 2005; 33(7): 2129 - 2140. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Liu, F. Mao, J.-t. Guo, B. Yan, P. Wang, Y. Qu, and Y. Xu Quantitative evaluation of protein-DNA interactions using an optimized knowledge-based potential Nucleic Acids Res., January 26, 2005; 33(2): 546 - 558. [Abstract] [Full Text] [PDF] |
||||


