Bioinformatics Advance Access published online on May 6, 2004
Bioinformatics, doi:10.1093/bioinformatics/bth297
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, 411-8540, Japan
* To whom correspondence should be addressed. E-mail: knishika{at}genes.nig.ac.jp.
The pattern of amino acid substitutions and sequence conservation over many structure-based alignments of protein sequences was analyzed as a function of percent sequence identity. The statistics of the amino acid substitutions were converted into the form of log-odds amino acid substitution matrices to which eigenvalue decomposition was applied. It was found that the most important component of the substitution matrices exhibited a sharp transition at the sequence identity of 30-35% which coincides with the twilight zone. Above the transition point, the most dominant component is related to the mutability of amino acids and it acts to disfavor any substitutions, whereas below the transition point, the most dominant component is related to the hydrophobicity of amino acids and substitutions between residues of similar hydrophobic character are positively favored. Implications for protein evolution and sequence analysis are discussed. Supplementary Information: http://maccl01.genes.nig.ac.jp/~akinjo/aasm/
Revised April 15, 2004
Accepted April 22, 2004
Discovery note
Eigenvalue analysis of amino acid substitution matrices reveals a sharp transition of the mode of sequence conservation in proteins
2 Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, 411-8540, Japan; Department of Genetics, The Graduate University for Advanced Studies (SOKENDAI), Mishima, 411-8540, Japan
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
E. Krissinel On the relationship between sequence and structure similarities in proteomics Bioinformatics, March 15, 2007; 23(6): 717 - 723. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Friedberg, T. Harder, R. Kolodny, E. Sitbon, Z. Li, and A. Godzik Using an alignment of fragment strings for comparing protein structures Bioinformatics, January 15, 2007; 23(2): e219 - e224. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Goldstein and D. D. Pollock Observations of Amino Acid Gain and Loss during Protein Evolution Are Explained by Statistical Bias Mol. Biol. Evol., July 1, 2006; 23(7): 1444 - 1449. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Porto, H. E. Roman, M. Vendruscolo, and U. Bastolla Prediction of Site-Specific Amino Acid Distributions and Limits of Divergent Evolutionary Changes in Protein Sequences Mol. Biol. Evol., March 1, 2005; 22(3): 630 - 638. [Abstract] [Full Text] [PDF] |
||||

