Segment-based multiple sequence alignment
¹International Max Planck Research School for Computational Biology and Scientific Computing, Ihnestr. 63-73, 14195 Berlin, Germany, ²Algorithmische Bioinformatik, Institut für Informatik, Takustr. 9, 14195 Berlin, Germany and ³Comparative Bioinformatics Group, Center for Genomic Regulation, Dr Aiguader 88, 08003 Barcelona, Spain
*To whom correspondence should be addressed.
Abstract
Motivation: Many multiple sequence alignment tools have been developed in the past, each advancing either speed or alignment accuracy. Given the importance and widespread use of alignment tools, progress in both categories is a contribution to the community and has driven research in the field so far.
Results: We introduce a graph-based extension to the consistency-based, progressive alignment strategy. We apply the consistency notion to segments instead of single characters. The main problem we solve in this context is to define segments of the sequences in such a way that a graph-based alignment is possible. We implemented the algorithm using the SeqAn library and report results on amino acid and DNA sequences. The benefit of our approach is threefold: (1) sequences with conserved blocks can be aligned rapidly, (2) the implementation is conceptually simple, generic and fast, and (3) the consistency idea can be extended to align multiple genomic sequences.
Availability: The segment-based multiple sequence alignment tool can be downloaded from http://www.seqan.de/projects/msa.html. A novel version of T-Coffee interfaced with the tool is available from http://www.tcoffee.org. Usage of the tool is described in the documentation of both packages.
Contact: rausch{at}inf.fu-berlin.de
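The segment definition problem mentioned under Results can be illustrated with a small sketch. The abstract does not spell out the algorithm, so the following is only one simplified way to picture it, not the SeqAn implementation: it assumes ungapped pairwise matches given as hypothetical tuples `(seq_a, start_a, seq_b, start_b, length)`, and the function names `refine_cuts` and `refined_matches` are invented for illustration. The idea is to project segment boundaries across matches until every match maps whole segments onto whole segments, so that the segments can serve as vertices of an alignment graph.

```python
from collections import defaultdict


def refine_cuts(matches):
    """Return, per sequence, the sorted cut positions after refinement.

    Assumption: each match is an ungapped tuple
    (seq_a, start_a, seq_b, start_b, length) with half-open intervals.
    """
    matches = list(matches)
    cuts = defaultdict(set)
    # Every match boundary is an initial cut position.
    for a, sa, b, sb, n in matches:
        cuts[a].update((sa, sa + n))
        cuts[b].update((sb, sb + n))

    # Project cuts through each match in both directions until stable.
    changed = True
    while changed:
        changed = False
        for a, sa, b, sb, n in matches:
            for src, s_src, dst, s_dst in ((a, sa, b, sb), (b, sb, a, sa)):
                for c in list(cuts[src]):
                    if s_src < c < s_src + n:
                        projected = s_dst + (c - s_src)
                        if projected not in cuts[dst]:
                            cuts[dst].add(projected)
                            changed = True
    return {seq: sorted(pos) for seq, pos in cuts.items()}


def refined_matches(matches):
    """Split each match at the refined cuts into segment-to-segment edges."""
    matches = list(matches)
    cuts = refine_cuts(matches)
    edges = set()
    for a, sa, b, sb, n in matches:
        inner = [c for c in cuts[a] if sa <= c <= sa + n]
        for lo, hi in zip(inner, inner[1:]):
            edges.add(((a, lo, hi), (b, sb + lo - sa, sb + hi - sa)))
    return sorted(edges)


if __name__ == "__main__":
    example = [("s0", 0, "s1", 0, 10), ("s1", 4, "s2", 0, 10)]
    for edge in refined_matches(example):
        print(edge)
```

In this toy input the two matches overlap only partially on `s1`. After refinement, the segment `("s1", 4, 10)` is an endpoint of edges to both `s0` and `s2`, which is the kind of shared vertex a segment-level consistency step could then exploit.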
