Bioinformatics Vol. 16 no. 4 2000
Pages 326-333
© 2000 Oxford University Press
Automatic discovery of regulatory patterns in promoter regions based on whole cell expression data and functional annotation
1 Center for Biological Sequence Analysis, Department of Biotechnology, Building 208, The Technical University of Denmark, DK-2800 Lyngby, Denmark
Received on May 13, 1999
; revised on November 3, 1999
; accepted on November 3, 1999
Motivation: The whole genomes submitted to GenBank contain valuable information about the function of genes as well as the upstream sequences and whole cell expression provides valuable information on gene regulation. To utilize these large amounts of data for a biological understanding of the regulation of gene expression, new automatic methods for pattern finding are needed.
Results: Two word-analysis algorithms for automatic discovery of regulatory sequence elements have been developed. We show that sequence patterns correlated to whole cell expression data can be found using KolmogorovSmirnov tests on the raw data, thereby eliminating the need for clustering co-regulated genes. Regulatory elements have also been identified by systematic calculations of the significance of correlations between words found in the functional annotation of genes and DNA words occuring in their promoter regions. Application of these algorithms to the Saccharomyces cerevisiae genome and publicly available DNA array data sets revealed a highly conserved 9-mer occuring in the upstream regions of genes coding for proteasomal subunits. Several other putative and known regulatory elements were also found.
Availability: Upon request.
Contact: steen{at}cbs.dtu.dk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C.-Y. Chen, H.-K. Tsai, C.-M. Hsu, M.-J. May Chen, H.-G. Hung, G. T.-W. Huang, and W.-H. Li Discovering gapped binding sites of yeast transcription factors PNAS, February 19, 2008; 105(7): 2527 - 2532. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. C. M. Leung and F. Y. L. Chin Finding motifs from all sequences with and without binding sites Bioinformatics, September 15, 2006; 22(18): 2217 - 2223. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Trindade, R. van Berloo, M. Fiers, and R. G. F. Visser PRECISE: Software for Prediction of cis-Acting Regulatory Elements J. Hered., September 1, 2005; 96(5): 618 - 622. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Boorsma, B. C. Foat, D. Vis, F. Klis, and H. J. Bussemaker T-profiler: scoring the activity of predefined groups of genes using gene expression data Nucleic Acids Res., July 1, 2005; 33(suppl_2): W592 - W595. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. B. Fogel, D. G. Weekes, G. Varga, E. R. Dow, H. B. Harlow, J. E. Onyia, and C. Su Discovery of sequence motifs related to coexpression of genes using evolutionary computation Nucleic Acids Res., July 20, 2004; 32(13): 3826 - 3835. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-D. Huang, J.-T. Horng, Y.-M. Sun, A.-P. Tsou, and S.-L. Huang Identifying transcriptional regulatory sites in the human genome using an integrated system Nucleic Acids Res., March 29, 2004; 32(6): 1948 - 1956. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Hudson and P. H. Quail Identification of Promoter Motifs Involved in the Network of Phytochrome A-Regulated Gene Expression by Combined Analysis of Genomic Sequence and Microarray Data Plant Physiology, December 1, 2003; 133(4): 1605 - 1616. [Abstract] [Full Text] |
||||
![]() |
S. Knudsen, C. Workman, T. Sicheritz-Ponten, and C. Friis GenePublisher: automated analysis of DNA microarray data Nucleic Acids Res., July 1, 2003; 31(13): 3471 - 3476. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. B. Nielsen, R. Wernersson, and S. Knudsen Design of oligonucleotides for microarrays and perspectives for design of multi-transcriptome arrays Nucleic Acids Res., July 1, 2003; 31(13): 3491 - 3496. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes Plant Physiology, July 1, 2003; 132(3): 1162 - 1176. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Meiners, D. Heyken, A. Weller, A. Ludwig, K. Stangl, P.-M. Kloetzel, and E. Kruger Inhibition of Proteasome Activity Induces Concerted Expression of Proteasome Genes and de Novo Formation of Mammalian Proteasomes J. Biol. Chem., June 6, 2003; 278(24): 21517 - 21525. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sudarsanam, Y. Pilpel, and G. M. Church Genome-wide Co-occurrence of Promoter Elements Reveals a cis-Regulatory Cassette of rRNA Transcription Motifs in Saccharomyces cerevisiae Genome Res., November 1, 2002; 12(11): 1723 - 1731. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Halfon and A. M. Michelson Exploring genetic regulatory networks in metazoan development: methods and models Physiol Genomics, September 3, 2002; 10(3): 131 - 143. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Halfon, Y. Grad, G. M. Church, and A. M. Michelson Computation-Based Discovery of Related Transcriptional Regulatory Modules and Motifs Using an Experimentally Validated Combinatorial Model Genome Res., July 1, 2002; 12(7): 1019 - 1028. [Abstract] [Full Text] [PDF] |
||||







