Bioinformatics Advance Access originally published online on November 22, 2006
Bioinformatics 2007 23(2):162-168; doi:10.1093/bioinformatics/btl590
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
QOMA: quasi-optimal multiple alignment of protein sequences
Computer and Information Science and Engineering, University of Florida Gainesville, FL 32611, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: We consider the problem of multiple alignment of protein sequences with the goal of achieving a large SP (Sum-of-Pairs) score.
Results: We introduce a new graph-based method. We name our method QOMA (Quasi-Optimal Multiple Alignment). QOMA starts with an initial alignment. It represents this alignment using a K-partite graph. It then improves the SP score of the initial alignment through local optimizations within a window that moves greedily on the alignment. QOMA uses two parameters to permit flexibility in time/accuracy trade off: (1) The size of the window for local optimization. (2) The sparsity of the K-partite graph. Unlike traditional progressive methods, QOMA is independent of the order of sequences. The experimental results on BAliBASE benchmarks show that QOMA produces higher SP score than the existing tools including ClustalW, Probcons, Muscle, T-Coffee and DCA. The difference is more significant for distant proteins.
Availability: The software is available from the authors upon request.
Contact: tamer{at}cise.ufl.edu
Supplementary information: Supplementary material is available at Bioinformatics online.
Associate Editor: John Quackenbush
Received on August 8, 2006; revised on November 15, 2006; accepted on November 16, 2006
This article has been cited by other articles:
![]() |
K. Katoh and H. Toh Recent developments in the MAFFT multiple sequence alignment program Brief Bioinform, July 1, 2008; 9(4): 286 - 298. [Abstract] [Full Text] [PDF] |
||||
