Bioinformatics Advance Access originally published online on December 13, 2005
Bioinformatics 2006 22(4):413-422; doi:10.1093/bioinformatics/bti828
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions
Center for Bioinformatics, Department of Biology, Rensselaer Polytechnic Institute Troy, NY 12180, USA
*To whom correspondence should be addressed.
Motivation: In recent years, advances have been made in the ability of computational methods to discriminate between homologous and non-homologous proteins in the twilight zone of sequence similarity, where the percent sequence identity is a poor indicator of homology. To make these predictions more valuable to the protein modeler, they must be accompanied by accurate alignments. Pairwise sequence alignments are inferences of orthologous relationships between sequence positions. Evolutionary distance is traditionally modeled using global amino acid substitution matrices. But real differences in the likelihood of substitutions may exist for different structural contexts within proteins, since structural context contributes to the selective pressure.
Results: HMMSUM (HMMSTR-based substitution matrices) is a new model for structural context-based amino acid substitution probabilities consisting of a set of 281 matrices, each for a different sequencestructure context. HMMSUM does not require the structure of the protein to be known. Instead, predictions of local structure are made using HMMSTR, a hidden Markov model for local structure. Alignments using the HMMSUM matrices compare favorably to alignments carried out using the BLOSUM matrices or structure-based substitution matrices SDM and HSDM when validated against remote homolog alignments from BAliBASE. HMMSUM has been implemented using local Dynamic Programming and with the Bayesian Adaptive alignment method.
Availability: Matrices, source codes and programs are available at http://www.bioinfo.rpi.edu/~bystrc/downloads.html.
Contact: bystrc{at}rpi.edu, huangy2{at}rpi.edu
Received on March 9, 2005; revised on November 2, 2005; accepted on December 8, 2005
This article has been cited by other articles:
![]() |
A. Biegert and J. Soding Sequence context-specific profiles for homology searching PNAS, March 10, 2009; 106(10): 3770 - 3775. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Fariselli, I. Rossi, E. Capriotti, and R. Casadio The WWWH of remote homolog detection: The state of the art Brief Bioinform, March 1, 2007; 8(2): 78 - 87. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei and N. V. Grishin MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information Nucleic Acids Res., September 11, 2006; 34(16): 4364 - 4374. [Abstract] [Full Text] [PDF] |
||||


