Applicability of the multiple alignment algorithm for detection of weak patterns: periodically distributed DNA pattern as a study case
1Department of Membranes Research and Biophysics, Genome Project Rehovot 76100, Israel
2Department of Structural Biology, Weizmann Institute of Science Rehovot 76100, Israel
3To whom correspondence should be addressed
MOTIVATION: The nucleosome DNA positioning pattern is known to be one of the weakest (most highly degenerate) sequence patterns. The alignment procedure recently developed to extract such a pattern is based on statistical matching of the sequences, and its success depends on the pattern/background ratio in the individual sequences and in the generated pattern. The heuristic nature of the method and the distinctive properties of the pattern raise the question of how efficient and sensitive the procedure is. This paper presents a method for verifying this multiple sequence alignment algorithm.
RESULTS: To verify the applicability of the multiple alignment approach, we constructed a set of sequences carrying a hidden pattern. The pattern was represented by weak (signal) oscillations in the occurrence of AA and TT dinucleotides along otherwise random sequences. Only a few dinucleotides in any given 145-base-long sequence correspond to the signal, appearing in about the same phase within the simulated periodic pattern. The novelty of our simulation approach is that we simulated the database as a whole, rather than simulating each sequence separately. The correlation between the hidden pattern and a sequence from the database is negligible on average, yet our statistical multicycle alignment procedure produced a pattern with attributes very close to the simulated ones. The accuracy of the procedure was tested and calibrated. The presence in a typical sequence of as few as three dinucleotides corresponding to the signal is sufficient to generate (detect) the pattern hidden in a collection of 204 sequences.
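The database-level simulation described above can be illustrated with a minimal Python sketch. It is not the authors' program. The function names, the cosine weighting, the period of 10 bases and the random seed are all assumptions made for illustration; only the sequence length (145), the collection size (204) and the three signal dinucleotides per sequence come from the text. The sketch plants the signal in a common phase and omits both the unknown offsets between sequences and the alignment procedure itself.

```python
import math
import random


def simulate_database(n_seqs=204, length=145, n_signal=3, period=10.0, seed=0):
    """Return n_seqs random DNA strings, each carrying n_signal AA/TT
    dinucleotides planted at positions drawn from one shared periodic weighting.

    The cosine weighting and the period are illustrative assumptions.
    """
    rng = random.Random(seed)
    positions = range(length - 1)  # valid start positions of a dinucleotide
    # One weighting for the whole database: it peaks every `period` bases.
    weights = [1.0 + math.cos(2.0 * math.pi * p / period) for p in positions]
    database = []
    for _ in range(n_seqs):
        # Uniformly random background sequence.
        seq = [rng.choice("ACGT") for _ in range(length)]
        # Overwrite a few positions with signal dinucleotides.
        for p in rng.choices(positions, weights=weights, k=n_signal):
            seq[p:p + 2] = rng.choice(("AA", "TT"))
        database.append("".join(seq))
    return database


def dinucleotide_profile(database, targets=("AA", "TT")):
    """Count, per start position, how many sequences carry a target dinucleotide."""
    length = len(database[0])
    return [
        sum(seq[p:p + 2] in targets for seq in database)
        for p in range(length - 1)
    ]


if __name__ == "__main__":
    db = simulate_database()
    profile = dinucleotide_profile(db)
    print(len(db), len(db[0]), len(profile))
```

Because the weighting is shared by all sequences, the oscillation is a property of the summed profile rather than of any single sequence, which carries only a handful of signal dinucleotides among the AA and TT occurrences that arise by chance.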
AVAILABILITY: The programs of the multiple sequence alignment algorithm and database simulation are available from the authors free of charge. Requests should be accompanied by a 3.5'' diskette.
CONTACT: E-mail: bmbolsho{at}dapsasI.weizmann.ac.il