Bioinformatics Advance Access originally published online on January 18, 2007
Bioinformatics 2007 23(6):680-686; doi:10.1093/bioinformatics/btl669
A 3D pattern matching algorithm for DNA sequences
LIMSI-CNRS, Univ. Paris-Sud, 91403 Orsay, France
*To whom correspondence should be addressed.
Abstract
Motivation: Biologists usually work with textual DNA sequences (successions of the letters A, C, G and T). This representation allows them to study the syntax and other linguistic properties of DNA sequences. Nevertheless, such a linear coding offers only a local, one-dimensional view of the molecule. The 3D structure of DNA is known to be very important in many essential biological mechanisms. Using 3D conformation models, one can construct the 3D trajectory of a naked DNA molecule. The various studies that we performed showed that two very different textual DNA sequences can have similar 3D structures.
Results: In this article, we present new research on 3D pattern matching for DNA sequences. The aim of this work is to enhance conventional pattern matching analyses with 3D-augmented criteria. We have developed an algorithm, based on 3D trajectories, that compares the angles formed by these trajectories and thus quantifies the difference between two 3D DNA sequences. The analysis proceeds from a global scale to a local one.
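The idea of comparing the angles formed by two trajectories can be illustrated with a minimal Python sketch. It is not the algorithm of the article, whose details the abstract does not give. It assumes that each trajectory is a list of (x, y, z) points with no repeated consecutive points, that both trajectories have the same number of points, and that the score is the mean absolute difference between bend angles. The function names `bend_angles`, `angle_distance` and `local_distances` are hypothetical.

```python
import math


def bend_angles(points):
    """Angles (radians) between consecutive segments of a 3D polyline.

    Assumption: consecutive points are distinct, so no segment has zero length.
    """
    angles = []
    for p0, p1, p2 in zip(points, points[1:], points[2:]):
        u = [b - a for a, b in zip(p0, p1)]  # segment p0 -> p1
        v = [b - a for a, b in zip(p1, p2)]  # segment p1 -> p2
        dot = sum(x * y for x, y in zip(u, v))
        norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(x * x for x in v))
        # Clamp to [-1, 1] to protect acos from floating-point rounding.
        angles.append(math.acos(max(-1.0, min(1.0, dot / norm))))
    return angles


def _mean_abs_diff(a, b):
    """Mean absolute difference between two equal-length lists of angles."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def angle_distance(traj_a, traj_b):
    """Global score: mean absolute bend-angle difference over the whole trajectories."""
    a, b = bend_angles(traj_a), bend_angles(traj_b)
    if len(a) != len(b) or not a:
        raise ValueError("trajectories need the same length and at least 3 points")
    return _mean_abs_diff(a, b)


def local_distances(traj_a, traj_b, window):
    """Local scores: the same measure over each sliding window of `window` angles."""
    a, b = bend_angles(traj_a), bend_angles(traj_b)
    if len(a) != len(b) or not 1 <= window <= len(a):
        raise ValueError("trajectories need the same length and a valid window size")
    return [
        _mean_abs_diff(a[i:i + window], b[i:i + window])
        for i in range(len(a) - window + 1)
    ]


straight = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0)]
bent = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 2, 0)]
global_score = angle_distance(straight, bent)
local_scores = local_distances(straight, bent, window=1)
```

Because the measure depends only on the angles between segments, it does not change when a trajectory is translated or rotated. In this sketch, the windowed variant stands in for the move from a global scale to a local one: it shows where along the molecule two structures differ.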
Availability: Available on request from the authors.
Contact: herisson{at}epigenomique.genopole.fr
Associate Editor: Keith Crandall
Received on September 11, 2006; revised on November 26, 2006; accepted on December 30, 2006