Methods for calculating the probabilities of finding patterns in sequences
MRC Laboratory of Molecular Biology Hills Road, Cambridge CB2 2QH, UK
This paper describes the use of probability-generating functions for calculating the probabilities of finding motifs in nucleic acid and protein sequences. Equations and algorithms are given for calculating the probabilities associated with nine different ways of defining motifs. Comparisons are made with searches of random sequences. A higher level structure-the pattern-is defined as a list of motifs. A pattern also specifies the permitted ranges of spacing allowed between its constituent motifs. Equations for calculating the expected numbers of matches to patterns are given.
Received on March 1, 1988; accepted on September 30, 1988
This article has been cited by other articles:
![]() |
U. J. Pape, S. Rahmann, and M. Vingron Natural similarity measures between position frequency matrices with an application to clustering Bioinformatics, February 1, 2008; 24(3): 350 - 357. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Zhao, L. A. Schriefer, and G. D. Stormo Identification of muscle-specific regulatory modules in Caenorhabditis elegans Genome Res., March 1, 2007; 17(3): 348 - 357. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Fiedler and M. Rehmsmeier jPREdictor: a versatile tool for the prediction of cis-regulatory elements. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W546 - W550. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Yu, M. J. Palumbo, C. E. Lawrence, and R. H. Morse Contribution of the Histone H3 and H4 Amino Termini to Gcn4p- and Gcn5p-mediated Transcription in Yeast J. Biol. Chem., April 7, 2006; 281(14): 9755 - 9764. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Conlan, C. Lawrence, and L. A. McCue Rhodopseudomonas palustris Regulons Detected by Cross-Species Analysis of Alphaproteobacterial Genomes Appl. Envir. Microbiol., November 1, 2005; 71(11): 7442 - 7452. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Huppert and S. Balasubramanian Prevalence of quadruplexes in the human genome Nucleic Acids Res., May 24, 2005; 33(9): 2908 - 2916. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. I. Gershenzon, G. D. Stormo, and I. P. Ioshikhes Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites Nucleic Acids Res., April 22, 2005; 33(7): 2290 - 2301. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta, L. A. Schriefer, R. H. Waterston, and G. D. Stormo Novel transcription regulatory elements in Caenorhabditis elegans muscle genes Genome Res., December 1, 2004; 14(12): 2457 - 2468. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Thompson, E. C. Rouchka, and C. E. Lawrence Gibbs Recursive Sampler: finding transcription factor binding sites Nucleic Acids Res., July 1, 2003; 31(13): 3580 - 3585. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Frith, J. L. Spouge, U. Hansen, and Z. Weng Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences Nucleic Acids Res., July 15, 2002; 30(14): 3214 - 3224. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta, L. Palomar, G. D. Stormo, P. Tedesco, T. E. Johnson, D. W. Walker, G. Lithgow, S. Kim, and C. D. Link Identification of a Novel cis-Regulatory Element Involved in the Heat Shock Response in Caenorhabditis elegans Using Microarray Gene Expression and Computational Methods Genome Res., May 1, 2002; 12(5): 701 - 712. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Mackey, T. A. J. Haystead, and W. R. Pearson Getting More from Less: Algorithms for Rapid Protein Identification with Multiple Short Peptide Sequences Mol. Cell. Proteomics, February 1, 2002; 1(2): 139 - 147. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Neuwald, L. Aravind, J. L. Spouge, and E. V. Koonin AAA+: A Class of Chaperone-Like ATPases Associated with the Assembly, Operation, and Disassembly of Protein Complexes Genome Res., January 1, 1999; 9(1): 27 - 43. [Abstract] [Full Text] |
||||





