Bioinformatics, Vol 14, 498-507, Copyright © 1998 by Oxford University Press
E Coward and F Drablos
MOTIVATION: The search for repeated patterns in DNA and protein sequences
is important in sequence analysis. The rapid increase in available
sequences, in particular from large-scale genome sequencing projects, makes
it relevant to develop sensitive automatic methods for the identification
of repeats. RESULTS : A new method for finding periodic patterns in
biological sequences is presented. The method is based on evolutionary
distance and 'phase shifts' corresponding to insertions and deletions. A
given sequence is aligned to itself in a certain sense, trying to minimize
a distance to periodicity. Relationships between different such periodicity
measures are discussed. An iterative algorithm is used, and the running
time is nearly proportional to the sequence length. The alignment produces
a periodic consensus pattern. A 'phase score' is used to indicate a
statistical significance of the periodicity. Three examples using both DNA
and protein sequences illustrate how the method can be used to find
patterns. AVAILABILITY: On request from the authors. CONTACT: evindc@mat
nu.no; finn.drablos@unimed.sintef.no
ARTICLES
Detecting periodic patterns in biological sequences
Department of Mathematical Sciences, Norwegian University of Science and Technology, Norway. evindc@math.ntnu.no
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Biegert and J. Soding De novo identification of highly diverged protein repeats by probabilistic consistency Bioinformatics, March 15, 2008; 24(6): 807 - 814. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Soding, M. Remmert, and A. Biegert HHrep: de novo protein repeat detection and the origin of TIM barrels. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W137 - W142. [Abstract] [Full Text] [PDF] |
||||

