Bioinformatics Advance Access originally published online on February 12, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(8) © Oxford University Press 2004; all rights reserved.
A comparison of scoring functions for protein sequence profile alignment
1 195 Roque Moraes Drive, Mill Valley, CA 94941, USA and 2 Department of Bioengineering, University of California, Berkeley, CA 94720, USA
Received on May 14, 2003; revised on August 11, 2003; accepted on November 5, 2003
Advance Access Publication February 12, 2004
Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequencesequence methods (e.g. BLAST) and profilesequence methods (e.g. PSI-BLAST). Profileprofile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profileprofile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profileprofile scoring functions by comparing alignments of 488 pairs of sequences with identity
30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequencesequence and sequenceprofile methods, including BLAST and PSI-BLAST.
Results: We find that profileprofile alignment gives an average improvement over our test set of typically 23% over profilesequence alignment and
40% over sequencesequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.
Availability: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/
Contact: bob{at}drive5.com
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Y. Lee and J. Skolnick Benchmarking of TASSER_2.0: An Improved Protein Structure Prediction Algorithm with More Accurate Predicted Contact Restraints Biophys. J., August 15, 2008; 95(4): 1956 - 1964. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Glanville, D. Kirshner, N. Krishnamurthy, and K. Sjolander Berkeley Phylogenomics Group web servers: resources for structural phylogenomic analysis Nucleic Acids Res., July 13, 2007; 35(suppl_2): W27 - W32. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Papadopoulos and R. Agarwala COBALT: constraint-based alignment tool for multiple protein sequences Bioinformatics, May 1, 2007; 23(9): 1073 - 1079. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Rangwala and G. Karypis Incremental window-based protein sequence alignment algorithms Bioinformatics, January 15, 2007; 23(2): e17 - e23. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.S. Madhusudhan, M. A. Marti-Renom, R. Sanchez, and A. Sali Variable gap penalty for protein sequence-structure alignment Protein Eng. Des. Sel., March 1, 2006; 19(3): 129 - 133. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. A. Simossis and J. Heringa PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information Nucleic Acids Res., July 1, 2005; 33(suppl_2): W289 - W294. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Frenkel-Morgenstern, H. Voet, and S. Pietrokovski Enhanced statistics for local alignment of multiple alignments improves prediction of protein function and structure Bioinformatics, July 1, 2005; 21(13): 2950 - 2956. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Schuster-Bockler and A. Bateman Visualizing profile-profile alignment: pairwise HMM logos Bioinformatics, June 15, 2005; 21(12): 2912 - 2913. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Y. Kahsay, G. Wang, G. Gao, L. Liao, and R. Dunbrack Quasi-consensus-based comparison of profile hidden Markov models for protein sequences Bioinformatics, May 15, 2005; 21(10): 2287 - 2293. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Soding Protein homology detection by HMM-HMM comparison Bioinformatics, April 1, 2005; 21(7): 951 - 960. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. E. Crooks and S. E. Brenner An alternative model of amino acid replacement Bioinformatics, April 1, 2005; 21(7): 975 - 980. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. A. Simossis, J. Kleinjung, and J. Heringa Homology-extended sequence alignment Nucleic Acids Res., February 7, 2005; 33(3): 816 - 824. [Abstract] [Full Text] [PDF] |
||||



