Bioinformatics Advance Access originally published online on February 26, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(10) © Oxford University Press 2004; all rights reserved.
Protein secondary structure: entropy, correlations and prediction
Department of Plant and Microbial Biology, University of California, 111 Koshland Hall No. 3102, Berkeley, CA 94720-3102, USA
Received on November 4, 2003; accepted on November 26, 2003
Advance Access Publication February 26, 2004
Motivation: Is protein secondary structure primarily determined by local interactions between residues closely spaced along the amino acid backbone or by non-local tertiary interactions? To answer this question, we measure the entropy densities of primary and secondary structure sequences, and the local inter-sequence mutual information density.
Results: We find that the important inter-sequence interactions are short ranged, that correlations between neighboring amino acids are essentially uninformative and that only one-fourth of the total information needed to determine the secondary structure is available from local inter-sequence correlations. These observations support the view that the majority of most proteins fold via a cooperative process where secondary and tertiary structure form concurrently. Moreover, existing single-sequence secondary structure prediction algorithms are almost optimal, and we should not expect a dramatic improvement in prediction accuracy.
Availability: Both the data sets and analysis code are freely available from our Web site at http://compbio.berkeley.edu/
Contact: gec{at}compbio.berkeley.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. F. Pereira de Araujo and J. N. Onuchic A sequence-compatible amount of native burial information is sufficient for determining the structure of small globular proteins PNAS, November 10, 2009; 106(45): 19001 - 19004. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Song, H. Tan, K. Takemoto, and T. Akutsu HSEpred: predict half-sphere exposure from protein sequences Bioinformatics, July 1, 2008; 24(13): 1489 - 1497. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Birzele and S. Kramer A new representation for protein secondary structure prediction based on frequent patterns Bioinformatics, November 1, 2006; 22(21): 2628 - 2634. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. E. Crooks, R. E. Green, and S. E. Brenner Pairwise alignment incorporating dipeptide covariation Bioinformatics, October 1, 2005; 21(19): 3704 - 3710. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. E. Crooks and S. E. Brenner An alternative model of amino acid replacement Bioinformatics, April 1, 2005; 21(7): 975 - 980. [Abstract] [Full Text] [PDF] |
||||

