Bioinformatics Advance Access originally published online on September 16, 2004
Bioinformatics 2005 21(5):582-588; doi:10.1093/bioinformatics/bti039
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
RAP: a new computer program for de novo identification of repeated sequences in whole genomes
CRIBI, Università degli Studi di Padova via Ugo Bassi 58b, I-35121 Padova, Italy
*To whom correspondence should be addressed.
Motivation: DNA repeats are a common feature of most genomic sequences. Their de novo identification is still difficult despite being a crucial step in genomic analysis and oligonucleotides design. Several efficient algorithms based on word counting are available, but too short words decrease specificity while long words decrease sensitivity, particularly in degenerated repeats.
Results: The Repeat Analysis Program (RAP) is based on a new word-counting algorithm optimized for high resolution repeat identification using gapped words. Many different overlapping gapped words can be counted at the same genomic position, thus producing a better signal than the single ungapped word. This results in better specificity both in terms of low-frequency detection, being able to identify sequences repeated only once, and highly divergent detection, producing a generally high score in most intron sequences.
Availability: The program is freely available for non-profit organizations, upon request to the authors.
Contact: giorgio.valle{at}unipd.it
Supplementary information: The program has been tested on the Caenorhabditis elegans genome using word lengths of 12, 14 and 16 bases. The full analysis has been implemented in the UCSC Genome Browser and is accessible at http://genome.cribi.unipd.it.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
X. Li, T. Kahveci, and A. M. Settles A novel genome-scale repeat finder geared towards transposons Bioinformatics, February 15, 2008; 24(4): 468 - 476. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Bergman and H. Quesneville Discovering and detecting transposable elements in genome sequences Brief Bioinform, November 1, 2007; 8(6): 382 - 392. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hou, P. Berman, C.-H. Hsu, and R. S. Harris HomologMiner: looking for homologous genomic groups in whole genomes Bioinformatics, April 15, 2007; 23(8): 917 - 925. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Morgulis, E. M. Gertz, A. A. Schaffer, and R. Agarwala WindowMasker: window-based masker for sequenced genomes Bioinformatics, January 15, 2006; 22(2): 134 - 141. [Abstract] [Full Text] [PDF] |
||||

