A local alignment tool for very long DNA sequences
Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA 16802, USA
¹National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
This paper presents a practical program, called sim2, for building local alignments of two sequences, each of which may be hundreds of kilobases long. sim2 first constructs n best non-intersecting chains of fragments, such as all occurrences of identical 5-tuples in each of two DNA sequences, for any specified n ≥ 1. Each chain is then refined by delivering an optimal alignment in a region delimited by the chain. sim2 requires only space proportional to the size of the input sequences and the output alignments, and the same source code runs on Unix machines, on Macintoshes, on PCs, and on DEC Alpha PCs. We also describe an application of sim2 for aligning long DNA sequences from Escherichia coli. sim2 facilitates contig-building by providing a complete view of the related sequences, so that differences can be analyzed and inconsistencies resolved. Examples are shown using the alignment display and editing functions from the software tool ChromoScope.
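The first stage described above (collect fragments, then chain them) can be illustrated with a minimal Python sketch. It is not sim2's implementation. The function names `find_fragments` and `best_chain` are hypothetical. The sketch builds a single best chain rather than n, scores each fragment as k with no gap penalty, and uses a simple quadratic-time chaining loop. These are assumptions made for brevity. The refinement stage, an optimal alignment within the region delimited by the chain, is omitted.

```python
from collections import defaultdict


def find_fragments(a, b, k=5):
    """Return all (i, j) with a[i:i+k] == b[j:j+k], i.e. identical k-tuples."""
    index = defaultdict(list)
    for j in range(len(b) - k + 1):
        index[b[j:j + k]].append(j)
    return [(i, j)
            for i in range(len(a) - k + 1)
            for j in index.get(a[i:i + k], ())]


def best_chain(fragments, k=5):
    """Return one highest-scoring chain of fragments.

    Assumed scoring: each fragment is worth k and gaps are free. A fragment
    may follow another only if it starts at or after the end of the previous
    one in both sequences, so fragments in a chain never overlap.
    """
    frags = sorted(fragments)
    if not frags:
        return []
    score = [k] * len(frags)   # best chain score ending at each fragment
    prev = [-1] * len(frags)   # back-pointer to the preceding fragment
    for t, (i, j) in enumerate(frags):
        for s in range(t):
            pi, pj = frags[s]
            if pi + k <= i and pj + k <= j and score[s] + k > score[t]:
                score[t] = score[s] + k
                prev[t] = s
    # Trace back from the best-scoring end point.
    t = max(range(len(frags)), key=score.__getitem__)
    chain = []
    while t != -1:
        chain.append(frags[t])
        t = prev[t]
    return chain[::-1]


if __name__ == "__main__":
    a = "AAAAACCCCCGGGGG"
    b = "AAAAATTCCCCCTGGGGG"
    print(best_chain(find_fragments(a, b)))
```

The quadratic loop is only practical for small inputs. Sequences hundreds of kilobases long would need a more efficient chaining method and the linear-space behaviour the abstract describes.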
Received on June 24, 1994; accepted on November 13, 1994

