Bioinformatics Vol. 16 no. 1 2000
Pages 16-23
© 2000 Oxford University Press
DNA binding sites: representation and discovery
Department of Genetics, Washington University Medical School, St. Louis, MO 63110, USA
The purpose of this article is to provide a brief history of the development and application of computer algorithms for the analysis and prediction of DNA binding sites. This problem can be conveniently divided into two subproblems. The first is, given a collection of known binding sites, to develop a representation of those sites that can be used to search new sequences and reliably predict where additional binding sites occur. The second is, given a set of sequences known to contain binding sites for a common factor but without knowledge of where the sites are, to discover the location of the sites in each sequence and a representation of the specificity of the protein.
Contact: stormo{at}ural.wustl.edu
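
As an illustration of the first subproblem only, the sketch below builds one widely used representation, a log-odds position weight matrix, from a handful of aligned sites and uses it to score every window of a new sequence. The abstract does not name a specific representation, so the choice of a weight matrix, the function names `build_pwm` and `scan`, the pseudocount, the uniform background frequency, and the toy sequences are all assumptions made for this example rather than anything taken from the article.

```python
import math


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log-odds position weight matrix from aligned, equal-length sites.

    Assumptions for this sketch: a uniform background frequency and a simple
    additive pseudocount, so that unseen bases still get a finite score.
    """
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + 4 * pseudocount
    pwm = []
    for pos in range(length):
        column = [site[pos] for site in sites]
        pwm.append({
            base: math.log2((column.count(base) + pseudocount) / total / background)
            for base in "ACGT"
        })
    return pwm


def scan(pwm, sequence):
    """Return (offset, score) for every window of the sequence."""
    width = len(pwm)
    return [
        (start, sum(pwm[j][sequence[start + j]] for j in range(width)))
        for start in range(len(sequence) - width + 1)
    ]


# Toy, made-up "known sites" used purely for illustration.
known_sites = ["TATAAT", "TATGAT", "TACAAT", "TATACT"]
pwm = build_pwm(known_sites)

# Score a new sequence and report the highest-scoring window.
hits = scan(pwm, "GGCTATAATGCC")
best_offset, best_score = max(hits, key=lambda hit: hit[1])
print(best_offset, round(best_score, 2))
```

The second subproblem, discovering sites whose locations are unknown, would need an iterative search over candidate alignments and is not covered by this sketch.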