Bioinformatics Vol. 18 no. 90002 2002
Pages S44-S53
© 2002 Oxford University Press
A new distance measure for comparing sequence profiles based on path lengths along an entropy surface
Department of Biomathematical Sciences, Mount Sinai School of Medicine, New York, USA
Received on April 8, 2002
; accepted on June 15, 2002
We describe a new distance measure for comparing DNA sequence profiles. For this measure, columns in a multiple alignment are treated as character frequency vectors (sum of the frequencies equal to one). The distance between two vectors is based on minimum path length along an entropy surface. Path length is estimated using a random graph generated on the entropy surface and Dijkstra's algorithm for all shortest paths to a source. We use the new distance measure to analyze similarities within familes of tandem repeats in the C. elegans genome and show that this new measure gives more accurate refinement of family relationships than a method based on comparing consensus sequences.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Boby, A.-M. Patch, and S. J. Aves TRbase: a database relating tandem repeats to disease genes for the human genome Bioinformatics, March 15, 2005; 21(6): 811 - 816. [Abstract] [Full Text] [PDF] |
||||
