Bioinformatics, Vol 15, 622-630, Copyright © 1999 by Oxford University Press
T Tsunoda and T Takagi
MOTIVATION: Precise analysis of the genetic network, gene function and
transcription regulation requires accurate prediction of transcription
factor (TF) bindability on DNA. For calculating the matching score between
an input sequence and a set of known TF binding sites, we use positional
weight matrices (PWMs) and Bucher's calculating method (Bucher, J. Mol.
Biol., 212, 563-578, 1990). Since estimating TF binding sites requires
cut-off values, we propose a robust cut-off value determining algorithm.
RESULTS: We generalize the concept of local overrepresentation with
statistics, and propose a new algorithm for determining the cut-off value
using the background rate estimated on non-promoters. The algorithm
iteratively determines parameters separating instances into
phenomena-dependent and phenomena-independent subsets. Our system includes
the method of re-estimating cut-off values of TFs that mis-recognize other
TF preferred regions. Our data source comprised 433 non-redundant
vertebrate promoters including viral promoters, from Eukaryotic Promoter
Database (EPD) R.50. The method is applied to 205 vertebrate TFs that have
frequency matrices in TRANSFAC Ver.3. 4 and the cut-off values of all of
them can be determined. AVAILABILITY: The cut-off values and TF binding
site predicting tool are available at http://www.hgc.ims.u-tokyo.ac.
jp/service/tooldoc/TFBIND. We also provide the cut-off value estimating
programs.
ARTICLES
Estimating transcription factor bindability on DNA
Genome Data Base, Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan. tatsu@ims.u-tokyo.ac.jp
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Fonseca, G. E. Lindahl, M. Ponticos, P. Sestini, E. A. Renzoni, A. M. Holmes, P. Spagnolo, P. Pantelidis, P. Leoni, N. McHugh, et al. A Polymorphism in the CTGF Promoter Region Associated with Systemic Sclerosis N. Engl. J. Med., September 20, 2007; 357(12): 1210 - 1220. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tsuritani, T. Irie, R. Yamashita, Y. Sakakibara, H. Wakaguri, A. Kanai, J. Mizushima-Sugano, S. Sugano, K. Nakai, and Y. Suzuki Distinct class of putative "non-conserved" promoters in humans: Comparative studies of alternative promoters of human and mouse genes Genome Res., July 1, 2007; 17(7): 1005 - 1014. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Tomovic and E. J. Oakeley Position dependencies in transcription factor binding sites Bioinformatics, April 15, 2007; 23(8): 933 - 941. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Arai, N. Funatsu, K. Numayama-Tsuruta, T. Nomura, S. Nakamura, and N. Osumi Role of Fabp7, a Downstream Gene of Pax6, in the Maintenance of Neuroepithelial Cells during Early Embryonic Development of the Rat Cortex J. Neurosci., October 19, 2005; 25(42): 9752 - 9761. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tsuchihara, V. Lapin, C. Bakal, H. Okada, L. Brown, M. Hirota-Tsuchihara, K. Zaugg, A. Ho, A. Itie-YouTen, M. Harris-Brandts, et al. Ckap2 Regulates Aneuploidy, Cell Cycling, and Cell Death in a p53-Dependent Manner Cancer Res., August 1, 2005; 65(15): 6685 - 6691. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kitagawa, T. Washio, S. Kosugi, T. Yamashita, K. Higashi, H. Yanagawa, K. Higo, K. Satoh, Y. Ohtomo, T. Sunako, et al. Computational analysis suggests that alternative first exons are involved in tissue-specific transcription in rice (Oryza sativa) Bioinformatics, May 1, 2005; 21(9): 1758 - 1763. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. I. Gershenzon, G. D. Stormo, and I. P. Ioshikhes Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites Nucleic Acids Res., April 22, 2005; 33(7): 2290 - 2301. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. KUTLU, N. NAAMANE, L. BERTHOU, and D. L. EIZIRIK New Approaches for in Silico Identification of Cytokine-Modified {beta} Cell Gene Networks Ann. N.Y. Acad. Sci., December 1, 2004; 1037(1): 41 - 58. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Marino-Ramirez, J. L. Spouge, G. C. Kanga, and D. Landsman Statistical analysis of over-represented words in human promoter sequences Nucleic Acids Res., February 12, 2004; 32(3): 949 - 958. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Suzuki, T. Tsunoda, J. Sese, H. Taira, J. Mizushima-Sugano, H. Hata, T. Ota, T. Isogai, T. Tanaka, Y. Nakamura, et al. Identification and Characterization of the Potential Promoter Regions of 1031 Kinds of Human Genes Genome Res., May 1, 2001; 11(5): 677 - 684. [Abstract] [Full Text] |
||||






