Bioinformatics Vol. 17 no. 12 2001
Pages 1113-1122
© 2001 Oxford University Press
A higher-order background model improves the detection of promoter regulatory elements by Gibbs sampling
1 ESAT-SISTA/COSIC, KULeuven, Kasteelpark
Arenberg 10, 3001 Leuven-Heverlee, Belgium
2 Department of Plant Genetics, VIB, UGent,
Ledeganckstraat 35, 9000 Gent, Belgium
3 INRA Associated Laboratory, VIB, UGent,
Ledeganckstraat 35, 9000 Gent, Belgium
Received on February 6, 2001; revised on June 4, 2001; accepted on June 6, 2001
Motivation: Transcriptome analysis allows detection and clustering of genes that are coexpressed under various biological circumstances. Under the assumption that coregulated genes share cis-acting regulatory elements, it is important to investigate the upstream sequences controlling the transcription of these genes. To improve the robustness of the Gibbs sampling algorithm to noisy data sets, we propose an extension of this motif-finding algorithm that uses a higher-order background model.
Results: Simulated data and real biological data sets with well-described regulatory elements are used to test the influence of the different background models on the performance of the motif detection algorithm. We show that the use of a higher-order model considerably enhances the performance of our motif finding algorithm in the presence of noisy data. For Arabidopsis thaliana, a reliable background model based on a set of carefully selected intergenic sequences was constructed.
Availability: Our implementation of the Gibbs sampler, called the Motif Sampler, can be used through a web interface: http://www.esat.kuleuven.ac.be/~thijs/Work/MotifSampler.html.
Contact: gert.thijs{at}esat.kuleuven.ac.be; yves.moreau{at}esat.kuleuven.ac.be
* To whom correspondence should be addressed.
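To make the notion of a higher-order background model concrete, the following is a minimal Python sketch, not the Motif Sampler implementation. It assumes the background is a Markov chain of a chosen order estimated from a set of background (for example intergenic) sequences, with a pseudocount for unseen contexts. The function names `train_background` and `background_log_prob`, the pseudocount value, and the toy sequence are illustrative assumptions and do not come from the abstract.

```python
from collections import defaultdict
from itertools import product
import math

ALPHABET = "ACGT"

def train_background(sequences, order=3, pseudocount=1.0):
    """Estimate P(base | previous `order` bases) from background sequences.

    Hypothetical helper: the smoothing scheme here is an assumption.
    """
    counts = defaultdict(lambda: dict.fromkeys(ALPHABET, 0.0))
    for seq in sequences:
        seq = seq.upper()
        for i in range(order, len(seq)):
            context, base = seq[i - order:i], seq[i]
            # Skip positions with ambiguous symbols such as N.
            if base in ALPHABET and all(c in ALPHABET for c in context):
                counts[context][base] += 1
    model = {}
    for context in map("".join, product(ALPHABET, repeat=order)):
        row = counts[context]
        total = sum(row.values()) + pseudocount * len(ALPHABET)
        model[context] = {b: (row[b] + pseudocount) / total for b in ALPHABET}
    return model

def background_log_prob(model, seq, start, width, order=3):
    """Log-probability of seq[start:start+width] given its preceding context."""
    if start < order:
        raise ValueError("start must be >= order so every base has a full context")
    seq = seq.upper()
    log_p = 0.0
    for i in range(start, start + width):
        log_p += math.log(model[seq[i - order:i]][seq[i]])
    return log_p

# Toy usage with a first-order model.
model = train_background(["ACGTACGTACGT"], order=1)
score = background_log_prob(model, "ACGT", start=1, width=2, order=1)
```

In a Gibbs sampler for motif finding, a score of this kind would typically be compared with the probability of the same segment under the motif model. Setting `order=0` reduces the sketch to a background built from single-nucleotide frequencies.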













