Bioinformatics Advance Access published online on April 29, 2004
Bioinformatics, doi:10.1093/bioinformatics/bth279
Bioinformatics © Oxford University Press 2004; all rights reserved
1 Department of Computer Science, Iowa State University, Ames, IA 50011-1040, USA
* To whom correspondence should be addressed. E-mail: xqhuang{at}cs.iastate.edu.
Motivation: Studies of efficient and sensitive sequence comparison methods are driven by a need to find homologous regions of weak similarity between large genomes.
Results: We describe an improved method for finding similar regions between two sets of DNA sequences. The new method generalizes existing methods by locating word matches between sequences under two or more word models and extending word matches into high-scoring segment pairs (HSPs). The method is implemented as a computer program named DDS2. Experimental results show that DDS2 can find more HSPs by using several word models than by using one word model.
Availability: The DDS2 program is freely available for academic use in binary code form at http://bioinformatics.iastate.edu/aat/align/align.html and in source code form from the corresponding author.
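The idea of combining word models can be illustrated with a minimal sketch. It is not the DDS2 implementation, and several details are assumptions made only for illustration: a word model is represented as a string of '1' (checked position) and '0' (unchecked position), word matches are extended without gaps using a simple X-drop rule, and the scoring values (`match`, `mismatch`, `xdrop`, `min_score`) and all function names are hypothetical.

```python
from collections import defaultdict

def word_key(seq, pos, model):
    """Bases of seq at the positions where the model has '1' (assumed mask form)."""
    return "".join(seq[pos + i] for i, c in enumerate(model) if c == "1")

def find_word_matches(a, b, models):
    """Sorted (i, j) start pairs where a and b agree under at least one model."""
    hits = set()
    for model in models:
        span = len(model)
        index = defaultdict(list)
        # Index every word of a under this model.
        for i in range(len(a) - span + 1):
            index[word_key(a, i, model)].append(i)
        # Look up every word of b in that index.
        for j in range(len(b) - span + 1):
            for i in index.get(word_key(b, j, model), ()):
                hits.add((i, j))
    return sorted(hits)

def extend(a, b, i, j, match=1, mismatch=-1, xdrop=3):
    """Ungapped X-drop extension around (i, j).

    Returns (score, a_start, b_start, length). The scoring is illustrative.
    """
    # Extend to the right, starting at (i, j) itself.
    best_r = score = 0
    right = k = 0
    while i + k < len(a) and j + k < len(b):
        score += match if a[i + k] == b[j + k] else mismatch
        k += 1
        if score > best_r:
            best_r, right = score, k
        elif best_r - score > xdrop:
            break
    # Extend to the left, starting one position before (i, j).
    best_l = score = 0
    left = 0
    k = 1
    while i - k >= 0 and j - k >= 0:
        score += match if a[i - k] == b[j - k] else mismatch
        if score > best_l:
            best_l, left = score, k
        elif best_l - score > xdrop:
            break
        k += 1
    return (best_r + best_l, i - left, j - left, left + right)

def find_hsps(a, b, models, min_score=5):
    """Extend every word match and keep distinct segment pairs above min_score."""
    hsps = set()
    for i, j in find_word_matches(a, b, models):
        hsp = extend(a, b, i, j)
        if hsp[0] >= min_score:
            hsps.add(hsp)
    return sorted(hsps, reverse=True)
```

For the toy sequences `ACGTTGCA` and `ACGATGCA`, which differ at one position, the contiguous model `11111` alone yields no word match, while adding the spaced model `110111` yields a match that extends across both sequences. This mirrors, on a very small scale, the claim that several word models can find segment pairs that one model misses.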
Revised April 17, 2004
Accepted April 18, 2004
Article
Efficient combination of multiple word models for improved sequence comparison
2 Department of Computer Science, Iowa State University, Ames, IA 50011-1040, USA; Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50011-1040, USA
3 Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
