Skip Navigation

This Article
Right arrow Full Text (Print PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Chao, K.-M.
Right arrow Articles by Miller, W.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Chao, K.-M.
Right arrow Articles by Miller, W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© Oxford University Press

A local alignment tool for very long DNA sequences

Kun-Mao Chao , Jinghui Zhang 1, James Ostell 1 and Webb Miller

Department of Computer Science and Engineering, The Pennsylvania State University University Park, PA 16802 USA
1National Center for Biotechnology Information, National Library of Medicine NIH, Bethesda, MD 20894, USA

This paper presents a practical program, called sim2, for building local alignments of two sequences, each of which may be hundreds of kilobases long. sim2 first constructs n best non-intersecting chains of ‘fragments’, such as all occurrences of identical 5-tuples in each of two DNA sequences,for any specified n ≥ 1. Each chain is then refined by delivering an optimal alignment in a region delimited by the chain. sim2 requires only space proportional to the size of the input sequences and the output alignments, and the same source code runs on Unix machines, on Macintoshes, on PCs, and on DEC Alpha PCs. We also describe an application of sim2 for aligning long DNA sequences from Escherichia coli. sim2 facilitates contig-building by providing a complete view of the related sequences, so difference can be analyzed and inconsistencies resolved. Examples are shown using the alignment display and editing functions from the software tool ChromoScope.


Received on June 24, 1994; accepted on November 13, 1994

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
T.-J. Chuang, W.-C. Lin, H.-C. Lee, C.-W. Wang, K.-L. Hsiao, Z.-H. Wang, D. Shieh, S. C. Lin, and L.-Y. Ch'ang
A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences
Genome Res., February 1, 2003; 13(2): 313 - 322.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. L. Delcher, A. Phillippy, J. Carlton, and S. L. Salzberg
Fast algorithms for large-scale genome alignment and comparison
Nucleic Acids Res., June 1, 2002; 30(11): 2478 - 2483.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
P. M. Kuehl, J. M. Weisemann, J. W. Touchman, E. D. Green, and M. S. Boguski
An Effective Approach for Analyzing "Prefinished" Genomic Sequence Data
Genome Res., February 1, 1999; 9(2): 189 - 194.
[Abstract] [Full Text]


Home page
Genome ResHome page
J. Zhang and T. L. Madden
PowerBLAST: A New Network BLAST Application for Interactive or Automated Sequence Analysis and Annotation
Genome Res., June 1, 1997; 7(6): 649 - 656.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
W Makalowski, J Zhang, and M S Boguski
Comparative analysis of 1196 orthologous mouse and human full-length mRNA and protein sequences.
Genome Res., September 1, 1996; 6(9): 846 - 857.
[Abstract] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.