Bioinformatics Advance Access originally published online on July 10, 2007
Bioinformatics 2007 23(18):2449-2454; doi:10.1093/bioinformatics/btm348
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
An approach to predict transcription factor DNA binding site specificity based upon gene and transcription factor functional categorization


1CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 2Graduate School of the Chinese Academy of Sciences, 19 Yuquan Road, Beijing 100039, 3Bioinformatics Center, Key Lab of Molecular Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai 200031, China, 4Department of Mathematics, University of Manchester, Institute of Science and Technology, P.O. Box 88, Manchester M60 1QD, UK, 5Shanghai Center for Bioinformation Technology, 100 Qinzhou Road, 200235 Shanghai, 6College of Life Science & Biotechnology, Shanghai Jiao Tong University, China and 7Molecular Physiology Laboratory, Centre for Cardiovascular Science Queen's Medical Research Institute, 47 Little France Crescent, Edinburgh, EH16 4TJ, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: To understand transcription regulatory mechanisms, it is indispensable to investigate transcription factor (TF) DNA binding preferences. We noted that the generally acknowledged information of functional annotations of TFs as well as that of their target genes should provide useful hints in determining TF DNA binding preferences.
Results: In this contribution, we developed an integrative method based on the Nearest Neighbor Algorithm, to predict DNA binding preferences through integrating both the functional/structural information of TFs and the interaction between TFs and their targets. The accuracy of cross-validation tests on the dataset consisting of 3430 positive samples and 7000 negative samples reaches 87.0 % for 10-fold cross-validation and 87.9 % for jackknife cross-validation test, which is a much better result than that in our previous work. The prediction result indicates that the improved method we developed could be a powerful approach to infer the TF DNA preference in silico.
Contact: cyd{at}picb.ac.cn
Supplementary information: Supplementary data are available at Bioinformatics online
Associate Editor: Alfonso Valencia
These authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.
Received on December 24, 2007; revised on June 4, 2007; accepted on June 27, 2007
This article has been cited by other articles:
![]() |
T. M. Alleyne, L. Pena-Castillo, G. Badis, S. Talukder, M. F. Berger, A. R. Gehrke, A. A. Philippakis, M. L. Bulyk, Q. D. Morris, and T. R. Hughes Predicting the binding preference of transcription factors to individual DNA k-mers Bioinformatics, April 15, 2009; 25(8): 1012 - 1018. [Abstract] [Full Text] [PDF] |
||||
