Random walk and gap plots of DNA sequences
Nestle Research Centre, Vers-chez-les-Blancs 1000 Lausanne 26, 1015 Lausanne, Switzerland
1Ecole Polytechnique Federate, Departement de Mathematiques 1015 Lausanne, Switzerland
2To whom correspondence should be addressed. E-mail. morgi{at}masg26.epfl.ch
Genomic sequence analysis is usually performed with the help of specialized software packages written for molecular biologists. The scope of such pre-programmed techniques is quite limited. Because DNA sequences contain a large amount of information, analysis of such sequences without underlying assumptions may provide additional insights. The present article proposes two new graphical representations as examples of such methods. The random walk plot is designed to show the base composition in a compact form, whereas the gap plot visualizes positional correlations. The random walk plot represents the DNA sequence as a curve, a random walk, in a plane. The four possible moves, left/right and up/down, are used to encode the four possible bases. Gap plots provide a tool to exhibit various features in a sequence. They visualize the periodic patterns within a sequence, both with regard to a single type of base or between two types of bases.
Received on December 17, 1994; revised on July 19, 1995; accepted on August 11, 1995