Bioinformatics Advance Access originally published online on January 10, 2006
Bioinformatics 2006 22(6):676-684; doi:10.1093/bioinformatics/btk032
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression
1Department of Bioengineering and Bioinformatics, Moscow State University Moscow, Russia
2INRIA Rocquencourt France
3University of California Berkeley, USA
4State Research Center GosNIIGenetika Moscow, Russia
5Engelhardt Institute of Molecular Biology, Russian Academy of Sciences Moscow, Russia
*To whom correspondence should be addressed.
Motivation: Genomic sequences are highly redundant and contain many types of repetitive DNA. Fuzzy tandem repeats (FTRs) are of particular interest. They are found in regulatory regions of eukaryotic genes and are reported to interact with transcription factors. However, accurate assessment of FTR occurrences in different genome segments requires specific algorithm for efficient FTR identification and classification.
Results: We have obtained formulas for P-values of FTR occurrence and developed an FTR identification algorithm implemented in TandemSWAN software. Using TandemSWAN we compared the structure and the occurrence of FTRs with short period length (up to 24 bp) in coding and non-coding regions including UTRs, heterochromatic, intergenic and enhancer sequences of Drosophila melanogaster and Drosophila pseudoobscura. Tandems with period three and its multiples were found in coding segments, whereas FTRs with periods multiple of six are overrepresented in all non-coding segment. Periods equal to 57 and 1114 were characteristic of the enhancer regions and other non-coding regions close to genes.
Availability: TandemSWAN web page, stand-alone version and documentation can be found at http://bioinform.genetika.ru/projects/swan/www/
Contacts: valeyo{at}imb.ac.ru
Supplementary information: Supplementary data are available at Bioinformatics online.
Received on October 14, 2005; revised on December 22, 2005; accepted on December 28, 2005
This article has been cited by other articles:
![]() |
I. V. Kulakovskiy, A. V. Favorov, and V. J. Makeev Motif discovery and motif finding from genome-mapped DNase footprint data Bioinformatics, September 15, 2009; 25(18): 2318 - 2325. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Papatsenko, Y. Goltsev, and M. Levine Organization of developmental enhancers in the Drosophila embryo Nucleic Acids Res., September 1, 2009; 37(17): 5665 - 5677. [Abstract] [Full Text] [PDF] |
||||
![]() |
G.-F. Richard, A. Kerrest, and B. Dujon Comparative Genomics and Molecular Dynamics of DNA Repeats in Eukaryotes Microbiol. Mol. Biol. Rev., December 1, 2008; 72(4): 686 - 727. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Merkel and N. Gemmell Detecting short tandem repeats from genome data: opening the software black box Brief Bioinform, September 1, 2008; 9(5): 355 - 366. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. B. Mudunuri and H. A. Nagarajaram IMEx: Imperfect Microsatellite Extractor Bioinformatics, May 15, 2007; 23(10): 1181 - 1187. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Archak, E. Meduri, P. S. Kumar, and J. Nagaraju InSatDb: a microsatellite database of fully sequenced insect genomes Nucleic Acids Res., January 12, 2007; 35(suppl_1): D36 - D39. [Abstract] [Full Text] [PDF] |
||||



