Bioinformatics Advance Access published online on June 9, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl277
1 Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr. 11, 07745 Jena, Germany
* To whom correspondence should be addressed.
Summary: The program tuple_plot identifies and visualizes local similarities between two genomic sequences, typically 100 kbp or longer, by applying the well-known dotplot principle. A dictionary of sequence words built from the input sequences serves to construct a task-specific expectancy model, which is used to assign significance values to pairwise word hits. The dictionary-based approach allows fast computation, with computation time scaling as O(N log N), where N is the size of the input sequences. The proposed scoring scheme appreciably increases the signal-to-noise ratio and may help to improve other word-based sequence comparison approaches.
Availability: tuple_plot is available at http://genome.fli-leibniz.de/software.html and may be used under the GNU public license.
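To make the word-dictionary idea concrete, the following is a minimal sketch of a word-based dotplot in Python. It is not the tuple_plot implementation: the function names `build_dictionary` and `word_hits`, the word length `k`, and above all the scoring rule are assumptions introduced for illustration. The score here simply down-weights words that occur often in either sequence, as a stand-in for the expectancy model, whose actual form is not given in the summary.

```python
from collections import defaultdict
from math import log


def build_dictionary(seq, k):
    """Map each k-letter word to the list of start positions where it occurs in seq."""
    index = defaultdict(list)
    for i in range(len(seq) - k + 1):
        index[seq[i:i + k]].append(i)
    return index


def word_hits(seq_a, seq_b, k):
    """Return sorted (i, j, score) tuples for every word shared by both sequences.

    i and j are the word's start positions in seq_a and seq_b, i.e. the
    coordinates of a dot in the dotplot.
    """
    dict_a = build_dictionary(seq_a, k)
    dict_b = build_dictionary(seq_b, k)
    n_a = len(seq_a) - k + 1  # number of word positions in seq_a
    n_b = len(seq_b) - k + 1  # number of word positions in seq_b
    hits = []
    for word, pos_a in dict_a.items():
        pos_b = dict_b.get(word)
        if not pos_b:
            continue
        # Assumed, illustrative weighting (not the published model):
        # p is the product of the word's relative frequencies in the two
        # sequences, so frequent (repetitive) words receive low scores.
        p = (len(pos_a) / n_a) * (len(pos_b) / n_b)
        score = -log(p)
        for i in pos_a:
            for j in pos_b:
                hits.append((i, j, score))
    return sorted(hits)
```

For example, `word_hits("ACGTACGT", "TTACGTTT", 4)` yields three hits; the word that occurs twice in the first sequence receives a lower score than the word that occurs once, which is the kind of noise suppression a frequency-aware score is meant to provide.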
Received March 14, 2006
Revised May 19, 2006
Accepted May 29, 2006
Applications note
tuple_plot: fast pairwise nucleotide sequence comparison with noise suppression
Karol Szafranski 1 *, Niels Jahn 1, and Matthias Platzer 1
Karol Szafranski, E-mail: szafrans{at}fli-leibniz.de
Associate Editor: Martin Bishop
This article has been cited by other articles:
J. Krumsiek, R. Arnold, and T. Rattei. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics, April 15, 2007; 23(8): 1026-1028.