Bioinformatics Advance Access published online on December 13, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti828
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Center for Bioinformatics, Dept of Biology, Rensselaer Polytechnic Institute, Troy, New York, 12180, USA
* To whom correspondence should be addressed.
Motivation: In recent years, advances have been made in the ability of computational methods to discriminate between homologous and non-homologous proteins in the "Twilight Zone" of sequence similarity, where the percent sequence identity is a poor indicator of homology. To make these predictions more valuable to the protein modeler, they must be accompanied by accurate alignments. Pairwise sequence alignments are inferences of orthologous relationships between sequence positions. Evolutionary distance is traditionally modeled using global amino acid substitution matrices. But real differences in the likelihood of substitutions may exist for different structural contexts within proteins, since structural context contributes to the selective pressure. Results: HMMSUM (HMMSTR-based SUbstitution Matrices) is a new model for structural context-based amino acid substitution probabilities consisting of a set of 281 matrices, each for a different sequence-structure context. HMMSUM does not require the structure of the protein to be known. Instead, predictions of local structure are made using HMMSTR, a hidden Markov model for local structure. Alignments using the HMMSUM matrices compare favorably to alignments carried out using the BLOSUM matrices or structure-based substitution matrices SDM and HSDM when validated against remote homolog alignments from BAliBASE. HMMSUM has been implemented using local Dynamic Programming and with the Bayesian Adaptive alignment method. Availability: Matrices, source codes and programs are available at http://www.bioinfo.rpi.edu/~bystrc/downloads.html.
Received March 9, 2005
Revised November 2, 2005
Accepted December 8, 2005
Article
Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions
Yao-ming Huang 1
and
Christopher Bystroff 1 *
Christopher Bystroff, E-mail: bystrc{at}rpi.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Biegert and J. Soding Sequence context-specific profiles for homology searching PNAS, March 10, 2009; 106(10): 3770 - 3775. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Fariselli, I. Rossi, E. Capriotti, and R. Casadio The WWWH of remote homolog detection: The state of the art Brief Bioinform, March 1, 2007; 8(2): 78 - 87. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei and N. V. Grishin MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information Nucleic Acids Res., September 11, 2006; 34(16): 4364 - 4374. [Abstract] [Full Text] [PDF] |
||||


