Bioinformatics Vol. 15 no. 11 1999
Pages 937-946
© 1999 Oxford University Press
Exploiting the past and the future in protein secondary structure prediction
1 Department of Information and Computer
Science, and Department of Biological Chemistry, College of
Medicine, University of California, Irvine, Irvine, CA 92697-3425,
USA
2 Center for Biological Sequence Analysis,
The Technical University of Denmark, DK-2800 Lyngby, Denmark
3 Department of Informatics and Systems,
University of Florence, 50139 Florence, Italy
4 Department of Information and Computer
Science, University of California, Irvine, Irvine, CA 92697-3425,
USA
Pierre Baldi
Motivation: Predicting the secondary structure of a protein (alpha-helix, beta-sheet, coil) is an important step towards elucidating its three-dimensional structure, as well as its function. Presently, the best predictors are based on machine learning approaches, in particular neural network architectures with a fixed, and relatively short, input window of amino acids, centered at the prediction site. Although a fixed small window avoids overfitting problems, it does not permit capturing variable long-rang information.
Results: We introduce a family of novel architectures which can learn to make predictions based on variable ranges of dependencies. These architectures extend recurrent neural networks, introducing non-causal bidirectional dynamics to capture both upstream and downstream information. The prediction algorithm is completed by the use of mixtures of estimators that leverage evolutionary information, expressed in terms of multiple alignments, both at the input and output levels. While our system currently achieves an overall performance close to 76% correct prediction at least comparable to the best existing systems the main emphasis here is on the development of new algorithmic ideas.
Availability: The executable program for predicting protein secondary structure is available from the authors free of charge.
Contact: pfbaldi{at}ics.uci.edu, gpollast{at}ics.uci.edu, brunak{at}cbs.dtu.dk, paolo{at}dsi.unifi.it
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Hochreiter, M. Heusel, and K. Obermayer Fast model-based protein homology detection without alignment Bioinformatics, July 15, 2007; 23(14): 1728 - 1736. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Bodjo, O. Kwiatek, A. Diallo, E. Albina, and G. Libeau Mapping and structural analysis of B-cell epitopes on the morbillivirus nucleoprotein amino terminus J. Gen. Virol., April 1, 2007; 88(4): 1231 - 1242. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Poetsch, R. J. Berzborn, J. Heberle, T. A. Link, N. A. Dencher, and H. Seelert Biophysics and Bioinformatics Reveal Structural Differences of the Two Peripheral Stalk Subunits in Chloroplast ATP Synthase J. Biochem., March 1, 2007; 141(3): 411 - 420. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ceroni, A. Passerini, A. Vullo, and P. Frasconi DISULFIND: a disulfide bonding state and cysteine connectivity prediction server. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W177 - W181. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Fossen, V. Wray, K. Bruns, J. Rachmat, P. Henklein, U. Tessmer, A. Maczurek, P. Klinger, and U. Schubert Solution Structure of the Human Immunodeficiency Virus Type 1 p6 Protein J. Biol. Chem., December 30, 2005; 280(52): 42515 - 42527. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Boden and J. Hawkins Prediction of subcellular localization using sequence-biased recurrent networks Bioinformatics, May 15, 2005; 21(10): 2279 - 2286. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Touitou, J. O'Nions, J. Heaney, and M. J. Allday Epstein-Barr virus EBNA3 proteins bind to the C8/{alpha}7 subunit of the 20S proteasome and are degraded by 20S proteasomes in vitro, but are very stable in latently infected B cells J. Gen. Virol., May 1, 2005; 86(5): 1269 - 1277. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Hammer, A. Micheli, and A. Sperduti Universal Approximation Capability of Cascade Correlation for Structures Neural Comput., May 1, 2005; 17(5): 1109 - 1159. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Pollastri and A. McLysaght Porter: a new, accurate server for protein secondary structure prediction Bioinformatics, April 15, 2005; 21(8): 1719 - 1720. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Sokolchik, T. Tanabe, P. F. Baldi, and J. Y. Sze Polymodal Sensory Function of the Caenorhabditis elegans OCR-2 Channel Arises from Distinct Intrinsic Determinants within the Protein and Is Selectively Conserved in Mammalian TRPV Proteins J. Neurosci., January 26, 2005; 25(4): 1015 - 1023. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. V. Stahelin, B. Ananthanarayanan, N. R. Blatner, S. Singh, K. S. Bruzik, D. Murray, and W. Cho Mechanism of Membrane Binding of the Phospholipase D1 PX Domain J. Biol. Chem., December 24, 2004; 279(52): 54918 - 54926. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. R. Blatner, R. V. Stahelin, K. Diraviyam, P. T. Hawkins, W. Hong, D. Murray, and W. Cho The Molecular Basis of the Differential Subcellular Localization of FYVE Domains J. Biol. Chem., December 17, 2004; 279(51): 53818 - 53827. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Caicedo and B. A. Schaal Heterogeneous evolutionary processes affect R gene diversity in natural populations of Solanum pimpinellifolium PNAS, December 14, 2004; 101(50): 17444 - 17449. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Y. Pasta, B. Raman, T. Ramakrishna, and Ch. M. Rao Role of the Conserved SRLFDQFFG Region of {alpha}-Crystallin, a Small Heat Shock Protein: EFFECT ON OLIGOMERIC SIZE, SUBUNIT EXCHANGE, AND CHAPERONE-LIKE ACTIVITY J. Biol. Chem., December 19, 2003; 278(51): 51159 - 51166. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Colon-Ramos, P. M. Irusta, E. C. Gan, M. R. Olson, J. Song, R. I. Morimoto, R. M. Elliott, M. Lombard, R. Hollingsworth, J. M. Hardwick, et al. Inhibition of Translation and Induction of Apoptosis by Bunyaviral Nonstructural Proteins Bearing Sequence Similarity to Reaper Mol. Biol. Cell, October 1, 2003; 14(10): 4162 - 4172. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Borges, H. Fischer, A. F. Craievich, L. D. Hansen, and C. H. I. Ramos Free Human Mitochondrial GrpE Is a Symmetric Dimer in Solution J. Biol. Chem., September 12, 2003; 278(37): 35337 - 35344. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Jiang Prediction of protein secondary structure with a reliability score estimated by local sequence clustering Protein Eng. Des. Sel., September 1, 2003; 16(9): 651 - 657. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Singh and D. Murray Molecular modeling of the membrane targeting of phospholipase C pleckstrin homology domains Protein Sci., September 1, 2003; 12(9): 1934 - 1953. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kim and H. Park Protein secondary structure prediction based on an improved support vector machines approach Protein Eng. Des. Sel., August 1, 2003; 16(8): 553 - 560. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. A. Eyrich and B. Rost META-PP: single interface to crucial prediction servers Nucleic Acids Res., July 1, 2003; 31(13): 3308 - 3310. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Albrecht, S. C.E. Tosatto, T. Lengauer, and G. Valle Simple consensus procedures are effective and sufficient in secondary structure prediction Protein Eng. Des. Sel., July 1, 2003; 16(7): 459 - 462. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Yudt, C. M. Jewell, R. J. Bienstock, and J. A. Cidlowski Molecular Origins for the Dominant Negative Function of Human Glucocorticoid Receptor Beta Mol. Cell. Biol., June 15, 2003; 23(12): 4319 - 4330. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Lesso and R. A. Li Helical Secondary Structure of the External S3-S4 Linker of Pacemaker (HCN) Channels Revealed by Site-dependent Perturbations of Activation Phenotype J. Biol. Chem., June 13, 2003; 278(25): 22290 - 22297. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Hatta, G. Mukerjee-Dhar, J. Damborsky, H. Kiyohara, and K. Kimbara Characterization of a Novel Thermostable Mn(II)-dependent 2,3-Dihydroxybiphenyl 1,2-Dioxygenase from a Polychlorinated Biphenyl- and Naphthalene-degrading Bacillus sp. JF8 J. Biol. Chem., June 6, 2003; 278(24): 21483 - 21492. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Shepard and M. D. Purugganan Molecular Population Genetics of the Arabidopsis CLAVATA2 Region: The Genomic Scale of Variation and Selection in a Selfing Species Genetics, March 1, 2003; 163(3): 1083 - 1095. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Bienstock, M. Skorvaga, B. S. Mandavilli, and B. Van Houten Structural and Functional Characterization of the Human DNA Repair Helicase XPD by Comparative Molecular Modeling and Site-directed Mutagenesis of the Bacterial Repair Protein UvrB J. Biol. Chem., February 7, 2003; 278(7): 5309 - 5316. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Papazisi, S. Frasca Jr., M. Gladd, X. Liao, D. Yogev, and S. J. Geary GapA and CrmA Coexpression Is Essential for Mycoplasma gallisepticum Cytadherence and Virulence Infect. Immun., December 1, 2002; 70(12): 6839 - 6845. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ovechkina, M. Wagenbach, and L. Wordeman K-loop insertion restores microtubule depolymerizing activity of a "neckless" MCAK mutant J. Cell Biol., November 25, 2002; 159(4): 557 - 562. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Segall, L. K. Lane, and R. Blostein New Insights into the Role of the N Terminus in Conformational Transitions of the Na,K-ATPase J. Biol. Chem., September 13, 2002; 277(38): 35202 - 35209. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Mondragon-Palomino, B. C. Meyers, R. W. Michelmore, and B. S. Gaut Patterns of Positive Selection in the Complete NBS-LRR Gene Family of Arabidopsis thaliana Genome Res., September 1, 2002; 12(9): 1305 - 1315. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Fu and J. J-N Liang Detection of Protein-Protein Interactions among Lens Crystallins in a Mammalian Two-hybrid System Assay J. Biol. Chem., February 1, 2002; 277(6): 4255 - 4260. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Bultynck, D. Rossi, G. Callewaert, L. Missiaen, V. Sorrentino, J. B. Parys, and H. De Smedt The Conserved Sites for the FK506-binding Proteins in Ryanodine Receptors and Inositol 1,4,5-Trisphosphate Receptors Are Structurally and Functionally Different J. Biol. Chem., December 7, 2001; 276(50): 47715 - 47724. [Abstract] [Full Text] [PDF] |
||||















