Bioinformatics Advance Access originally published online on January 19, 2007
Bioinformatics 2007 23(6):694-700; doi:10.1093/bioinformatics/btl674
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Quaternionic periodicity transform: an algebraic solution to the tandem repeat detection problem
The MITRE Corporation, Bedford MA 01730
| Abstract |
|---|
Motivation: One of the main tasks of DNA sequence analysis is identification of repetitive patterns. DNA symbol repetitions play a key role in a number of applications, including prediction of gene and exon locations, identification of diseases, reconstruction of human evolutionary history and DNA forensics.
Results: A new approach towards identification of tandem repeats in DNA sequences is proposed. The approach is a refinement of previously considered method, based on the complex periodicity transform. The refinement is obtained, among others, by mapping of DNA symbols to pure quaternions. This mapping results in an enhanced, symbol-balanced sensitivity of the transform to DNA patterns, and an unambiguous threshold selection criterion. Computational efficiency of the transform is further improved, and coupling of the computation with the period value is removed, thereby facilitating parallel implementation of the algorithm. Additionally, a post-processing stage is inserted into the algorithm, enabling unambiguous display of results in a convenient graphical format. Comparison of the quaternionic periodicity transform with two well-known pattern detection techniques shows that the new approach is competitive with these two techniques in detection of exact and approximate repeats.
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Thomas Lengauer
Received on August 25, 2006; revised on December 10, 2006; accepted on January 3, 2007