Skip Navigation



Bioinformatics Advance Access published online on June 9, 2006

Bioinformatics, doi:10.1093/bioinformatics/btl277
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
22/15/1917    most recent
btl277v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Szafranski, K.
Right arrow Articles by Platzer, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Szafranski, K.
Right arrow Articles by Platzer, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2006). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
Received March 14, 2006
Revised May 19, 2006
Accepted May 29, 2006

Applications note

tuple_plot: fast pairwise nucleotide sequence comparison with noise suppression

Karol Szafranski 1 *, Niels Jahn 1, and Matthias Platzer 1

1 Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr. 11, 07745 Jena, Germany

* To whom correspondence should be addressed.
Karol Szafranski, E-mail: szafrans{at}fli-leibniz.de


   Abstract

Summary: The program tuple_plot identifies and visualizes local similarities between two genomic sequences, typically 100 kbp or longer, by applying the well-known dotplot principle. A dictionary of sequence words built from the input sequences serves to construct a task-specific expectancy model that is used to attribute significance values to pairwise word hits. The dictionary-based approach allows fast computation, the computation time scaling to O(N log N), depending on the size of the input sequences. The proposed scoring scheme appreciably increases the signal-to-noise ratio and may help to improve other word-based sequence comparison approaches.

Availability: tuple_plot is available at http://genome.fli-leibniz.de/software.html and may be used under GNU public license.


Associate Editor: Martin Bishop
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
J. Krumsiek, R. Arnold, and T. Rattei
Gepard: a rapid and sensitive tool for creating dotplots on genome scale
Bioinformatics, April 15, 2007; 23(8): 1026 - 1028.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.