Bioinformatics Vol. 18 no. 90001 2002
Pages S31-S37
© 2002 Oxford University Press
Beyond tandem repeats: complex pattern structures and distant regions of similarity
Department of Computer Sciences, University of Wisconsin - Madison, 1210 W. Dayton Ave., Madison, WI 53706, USA
Motivation: Tandem repeats (TRs) are associated with human disease, play a role in evolution and are important in regulatory processes. Despite their importance, locating and characterizing these patterns within anonymous DNA sequences remains a challenge. In part, the difficulty is due to imperfect conservation of patterns and complex pattern structures. We study recognition algorithms for two complex pattern structures: variable length tandem repeats (VLTRs) and multi-period tandem repeats (MPTRs).
Results: We extend previous algorithmic research to a class of regular tandem repeats (RegTRs). We formally define RegTRs, as well as two important subclasses: VLTRs and MPTRs. We present algorithms for identification of TRs in these classes. Furthermore, our algorithms identify degenerate VLTRs and MPTRs: repeats containing substitutions, insertions and deletions. To illustrate our work, we present results of our analysis for two difficult regions in cattle and human data which reflect practical occurrences of these subclasses in GenBank sequence data.
In addition, we show the applicability of our algorithmic techniques for identifying Alu sequences, gene clusters and other distant regions of similarity. We illustrate this with an example from yeast chromosome I.
Availability: Algorithms can be accessed at http://www.cs.wisc.edu/areas/theory
Contact: kryder{at}cs.wisc.edu joseph{at}cs.wisc.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. K. Brodzik Quaternionic periodicity transform: an algebraic solution to the tandem repeat detection problem Bioinformatics, March 15, 2007; 23(6): 694 - 700. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Reneker, C.-R. Shyu, P. Zeng, J. C. Polacco, and W. Gassmann ACMES: fast multiple-genome searches for short repeat sequences with concurrent cross-species information retrieval Nucleic Acids Res., July 1, 2004; 32(suppl_2): W649 - W653. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Kolpakov, G. Bana, and G. Kucherov mreps: efficient and flexible detection of tandem repeats in DNA Nucleic Acids Res., July 1, 2003; 31(13): 3672 - 3678. [Abstract] [Full Text] [PDF] |
||||

