Methods for discovering novel motifs in nucleic acid sequences
MRC Laboratory of Molecular Biology Hills Road, Cambridge CB2 2QH, UK
We describe a computer tool to aid the discovery of new motifs in nucleic acid sequences. A typical use would be to analyse a set of upstream regions from a family of related genes in order to find possible control sequences. The heart of the method is the creation of dictionaries of related subsequences. These dictionaries can then be analysed to look for the commonest or best-defined subsequences, those that occur in the highest number of different sequences, or for those in equivalent positions within the family. We show the application of the method to a set of E. coli promoter sequences.
Received on May 9, 1989; accepted on July 27, 1989
This article has been cited by other articles:
![]() |
S. R. Davies, L.-W. Chang, D. Patra, X. Xing, K. Posey, J. Hecht, G. D. Stormo, and L. J. Sandell Computational identification and functional validation of regulatory motifs in cartilage-expressed genes Genome Res., October 1, 2007; 17(10): 1438 - 1447. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Kann, S. L. Sheetlin, Y. Park, S. H. Bryant, and J. L. Spouge The identification of complete domains within protein sequences using accurate E-values for semi-global alignment Nucleic Acids Res., July 9, 2007; 35(14): 4678 - 4685. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-W. Chang, R. Nagarajan, J. A. Magee, J. Milbrandt, and G. D. Stormo A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles Genome Res., March 1, 2006; 16(3): 405 - 413. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Sinha and M. Tompa Discovery of novel transcription factor binding sites by statistical overrepresentation Nucleic Acids Res., December 15, 2002; 30(24): 5549 - 5560. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Esposito, J. S. Thrower, and J. J. Scocca Protein and DNA requirements of the bacteriophage HP1 recombination system: a model for intasome formation Nucleic Acids Res., October 1, 2001; 29(19): 3955 - 3964. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Besemer, A. Lomsadze, and M. Borodovsky GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions Nucleic Acids Res., June 15, 2001; 29(12): 2607 - 2618. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Jacobs Anderson and R. Parker Computational identification of cis-acting elements affecting post-transcriptional control of gene expression in Saccharomyces cerevisiae Nucleic Acids Res., April 1, 2000; 28(7): 1604 - 1617. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Lawrence, S. Altschul, M. Boguski, J. Liu, A. Neuwald, and J. Wootton Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment Science, October 8, 1993; 262(5131): 208 - 214. [Abstract] [PDF] |
||||


