Bioinformatics Vol. 18 no. 9 2002
Pages 1243-1249
© 2002 Oxford University Press
The use of structure information to increase alignment accuracy does not aid homologue detection with profile HMMs
1 The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
Received on November 15, 2001
; revised on February 18, 2002
; accepted on March 14, 2002
Motivation: The best quality multiple sequence alignments are generally considered to derive from structural superposition. However, no previous work has studied the relative performance of profile hidden Markov models (HMMs) derived from such alignments. Therefore several alignment methods have been used to generate multiple sequence alignments from 348 structurally aligned families in the HOMSTRAD database. The performance of profile HMMs derived from the structural and sequence-based alignments has been assessed for homologue detection.
Results: The best alignment methods studied here correctly align nearly 80% of residues with respect to structure alignments. Alignment quality and model sensitivity are found to be dependent on average number, length, and identity of sequences in the alignment. The striking conclusion is that, although structural data may improve the quality of multiple sequence alignments, this does not add to the ability of the derived profile HMMs to find sequence homologues.
Supplementary information: A list of HOMSTRAD families used in this study and the corresponding Pfam families is available at http://www.sanger.ac.uk/Users/sgj/alignments/map.html
Contact: sgj{at}sanger.ac.uk
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Heger, S. Mallick, C. Wilton, and L. Holm The global trace graph, a novel paradigm for searching protein sequence databases Bioinformatics, September 15, 2007; 23(18): 2361 - 2367. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cheng and P. Baldi A machine learning information retrieval approach to protein fold recognition Bioinformatics, June 15, 2006; 22(12): 1456 - 1463. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Lassmann and E. L. L. Sonnhammer Automatic assessment of alignment quality Nucleic Acids Res., December 16, 2005; 33(22): 7120 - 7128. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Sillitoe, M. Dibley, J. Bray, S. Addou, and C. Orengo Assessing strategies for improved superfamily recognition Protein Sci., July 1, 2005; 14(7): 1800 - 1810. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. K. Shah, P. Aloy, P. Bork, and R. B. Russell Structural similarity to bridge sequence space: Finding new families on the bridges Protein Sci., May 1, 2005; 14(5): 1305 - 1314. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Goldsmith-Fischman and B. Honig Structural genomics: Computational methods for structure analysis Protein Sci., September 1, 2003; 12(9): 1813 - 1821. [Abstract] [Full Text] [PDF] |
||||


