Bioinformatics Advance Access originally published online on January 22, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(4) © Oxford University Press 2004; all rights reserved.
Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
1 Department of Biochemical Science and Engineering, Kyushu Institute of Technology, Fukuoka, Iizuka 820 8502, Japan, 2 Department of Biosciences, Jamia Mila Islamia, New Delhi 110025, India and 3 Computational Biology Research Center (CBRC), AIST, 2-41-6, Aomi, Koto-ku, Tokyo 135 0064, Japan
Received on February 26, 2003
; revised on May 26, 2003
; accepted on July 26, 2003
Advance Access Publication January 22, 2004
Motivation: Though vitally important to cell function, the mechanism of proteinDNA binding has not yet been completely understood. We therefore analysed the relationship between DNA binding and protein sequence composition, solvent accessibility and secondary structure. Using non-redundant databases of transcription factors and proteinDNA complexes, neural network models were developed to utilize the information present in this relationship to predict DNA-binding proteins and their binding residues.
Results: Sequence composition was found to provide sufficient information to predict the probability of its binding to DNA with nearly 69% sensitivity at 64% accuracy for the considered proteins; sequence neighbourhood and solvent accessibility information were sufficient to make binding site predictions with 40% sensitivity at 79% accuracy. Detailed analysis of binding residues shows that some three- and five-residue segments frequently bind to DNA and that solvent accessibility plays a major role in binding. Although, binding behaviour was not associated with any particular secondary structure, there were interesting exceptions at the residue level. Over-representation of some residues in the binding sites was largely lost at the total sequence level, but a different kind of compositional preference was observed in DNA-binding proteins.
Availability: Online predictions of DNA-binding proteins and binding sites are available at http://www.netasa.org/dbs-pred/
Contact: shandar{at}bse.kyutech.ac.jp
* To whom correspondence should be addressed at present address: Department of Biochemical Science, Engineering, Kyushu Institute of Technology, Fukuoka, Iizuka 820 8502, Japan.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Ahmad, O. Keskin, A. Sarai, and R. Nussinov Protein-DNA interactions: structural, thermodynamic and clustering patterns of conserved residues in DNA-binding proteins Nucleic Acids Res., October 1, 2008; 36(18): 5922 - 5932. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu and G. D. Stormo Context-dependent DNA recognition code for C2H2 zinc-finger transcription factors Bioinformatics, September 1, 2008; 24(17): 1850 - 1857. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gao and J. Skolnick DBD-Hunter: a knowledge-based method for the prediction of DNA-protein interactions Nucleic Acids Res., July 1, 2008; 36(12): 3978 - 3992. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-X. Zhou and S. Qin Interaction-site prediction for protein complexes: a critical assessment Bioinformatics, September 1, 2007; 23(17): 2203 - 2209. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Neuvirth, U. Heinemann, D. Birnbaum, N. Tishby, and G. Schreiber ProMateus--an open research approach to protein-binding sites analysis Nucleic Acids Res., July 13, 2007; 35(suppl_2): W543 - W548. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ofran, V. Mysore, and B. Rost Prediction of DNA-binding residues from sequence Bioinformatics, July 1, 2007; 23(13): i347 - i353. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Tjong and H.-X. Zhou DISPLAR: an accurate method for predicting DNA-binding sites on protein surfaces Nucleic Acids Res., March 12, 2007; 35(5): 1465 - 1477. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hwang, Z. Gou, and I. B. Kuznetsov DP-Bind: a web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins Bioinformatics, March 1, 2007; 23(5): 634 - 636. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Rossi, M. A. Marti-Renom, and A. Sali Localization of binding sites in protein structures by optimization of a composite scoring function Protein Sci., October 1, 2006; 15(10): 2366 - 2380. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wang and S. J. Brown BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W243 - W248. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Bhardwaj, R. E. Langlois, G. Zhao, and H. Lu Kernel-based machine learning protocol for predicting DNA-binding proteins Nucleic Acids Res., November 10, 2005; 33(20): 6486 - 6493. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Alonso, K. Baptista, T. Ngo, and D. E. Taylor Transcriptional organization of the temperature-sensitive transfer system from the IncHI1 plasmid R27 Microbiology, November 1, 2005; 151(11): 3563 - 3573. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pellegrini-Calace and J. M. Thornton Detecting DNA-binding helix-turn-helix structural motifs using sequence and structure information Nucleic Acids Res., April 14, 2005; 33(7): 2129 - 2140. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Liu, F. Mao, J.-t. Guo, B. Yan, P. Wang, Y. Qu, and Y. Xu Quantitative evaluation of protein-DNA interactions using an optimized knowledge-based potential Nucleic Acids Res., January 26, 2005; 33(2): 546 - 558. [Abstract] [Full Text] [PDF] |
||||



