Bioinformatics Advance Access published online on February 22, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti335
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Institute for Molecular Bioscience, ARC Centre in Bioinformatics, The University of Queensland, Brisbane, Australia
* To whom correspondence should be addressed.
Motivation: Multiple sequence alignment at the level of whole proteomes requires a high degree of automation, precluding the use of traditional validation methods such as manual curation. Since evolutionary models are too general to describe the history of each residue in a protein family, there is no single algorithm/model combination that can yield a biologically or evolutionarily optimal alignment. We propose a "shotgun" strategy where many different algorithms are used to align the same family, and the best of these alignments is then chosen with a reliable objective function. We present WOOF, a novel "word-oriented objective function" that relies on the identification and scoring of conserved amino acid patterns (words) between pairs of sequences. Results: Tests on a subset of reference protein alignments from BAliBASE showed that WOOF tended to rank the (manually curated) reference alignment highest among 1060 alternative (automatically generated) alignments for a majority of protein families. Among the automated alignments, there was a strong positive relationship between the WOOF score and similarity to the reference alignment. The speed of WOOF and its independence from explicit considerations of three-dimensional structure make it an excellent tool for analysing large numbers of protein families. Availability: On request from the authors.
Received July 14, 2004
Revised January 27, 2005
Accepted February 16, 2005
Article
A word-oriented approach to alignment validation
Mark A. Ragan, E-mail: m.ragan{at}imb.uq.edu.au
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
K. Burrage, L. Hood, and M. A. Ragan Advanced computing for systems biology Brief Bioinform, December 1, 2006; 7(4): 390 - 398. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. G. Beiko, T. J. Harlow, and M. A. Ragan Highways of gene sharing in prokaryotes PNAS, October 4, 2005; 102(40): 14332 - 14337. [Abstract] [Full Text] [PDF] |
||||

