The statistical significance of nucleotide position-weight matrix matches
Structural and Genetic Information Laboratory, CNRS-E.P. 91, Institute of Structural Biology and Microbiology, 31 Chemin Joseph Aiguier, Marseille 13402, France
MOTIVATION: To improve the detection of nucleotide sequence signals (e.g. promoter elements) by position-weight matrices (PWM) using the concept of statistically significant matches.
RESULTS: The Mksite program was originally developed for analyzing protein sequences. We report NMksite, a new version adapted to the processing of nucleotide sequences. NMksite creates PWMs from nucleotide sequence block alignments or occurrence tables using three weight computation schemes. An original feature of NMksite is the numerical computation of the statistical significance of PWM matches. The utility of this concept is demonstrated in the context of the prediction of splice sites and promoter regions.
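To make the idea of a statistically significant PWM match concrete, the following is a minimal Python sketch, not the NMksite implementation (which is written in C and whose three weighting schemes are not described here). It assumes a log-odds weighting with a background-proportional pseudocount, an independent and identically distributed background model, and scores discretized to a fixed precision; the function names (`build_pwm`, `discretize`, `score_distribution`, `p_value`) and the example sites are hypothetical. The exact distribution of match scores under the background is obtained by dynamic programming over the matrix columns, and the significance of a match is the probability of reaching at least its score by chance.

```python
import math
from collections import defaultdict

BASES = "ACGT"
UNIFORM = {b: 0.25 for b in BASES}  # assumed background composition


def build_pwm(block, background=UNIFORM, pseudocount=1.0):
    """Log-odds PWM (in bits) from a block of equal-length aligned sites.

    Assumed weighting scheme: log2(frequency / background), with a
    pseudocount distributed in proportion to the background.
    """
    pwm = []
    for pos in range(len(block[0])):
        counts = {b: pseudocount * background[b] for b in BASES}
        for site in block:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2(counts[b] / total / background[b])
                    for b in BASES})
    return pwm


def discretize(pwm, precision=0.01):
    """Round every weight to an integer number of `precision` units."""
    return [{b: round(col[b] / precision) for b in BASES} for col in pwm]


def score(int_pwm, window):
    """Integer score of one sequence window of the same width as the PWM."""
    return sum(col[base] for col, base in zip(int_pwm, window))


def score_distribution(int_pwm, background=UNIFORM):
    """Exact score distribution under an i.i.d. background.

    Column by column, every reachable partial score is extended by each
    base, weighted by that base's background probability.
    """
    dist = {0: 1.0}
    for col in int_pwm:
        nxt = defaultdict(float)
        for partial, prob in dist.items():
            for b in BASES:
                nxt[partial + col[b]] += prob * background[b]
        dist = dict(nxt)
    return dist


def p_value(dist, observed):
    """Probability that a random window scores at least `observed`."""
    return sum(prob for s, prob in dist.items() if s >= observed)


# Hypothetical block of six aligned sites (illustrative data only).
sites = ["TATAAT", "TATAAT", "TATGAT", "TACAAT", "TATACT", "GATAAT"]
int_pwm = discretize(build_pwm(sites))
dist = score_distribution(int_pwm)
for window in ("TATAAT", "TACGAT", "GGCCGG"):
    print(window, score(int_pwm, window), p_value(dist, score(int_pwm, window)))
```

In this toy block every column has a single most frequent base, so the consensus TATAAT is the only top-scoring hexamer and its p-value is 4^-6 (about 2.4e-4) under the uniform background. A match threshold can then be chosen from a target p-value rather than from a raw score, which is what allows matches from different matrices to be compared.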
AVAILABILITY: Mksite and other components of the MODEST (Motif DEsign and Search Tool) package (written in C/Unix) are available at http://igs-server.cnrs-mrs.fr
CONTACT: E-mail: jmc{at}igs.cnrs-mrs.fr


