Bioinformatics Advance Access originally published online on August 29, 2006
Bioinformatics 2006 22(21):2628-2634; doi:10.1093/bioinformatics/btl453
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© The Author 2006. Published by Oxford University Press. All rights reserved
A new representation for protein secondary structure prediction based on frequent patterns
1 Practical Informatics and Bioinformatics Group, Department of Informatics, Ludwig-Maximilians-University Amalienstrasse 17, D-80333 München, Germany
2 Technische Universität München, Institut für Informatik Boltzmannstrasse 3, D-85748 Garching b. München, Germany
*To whom correspondence should be addressed.
Motivation: A new representation for protein secondary structure prediction based on frequent amino acid patterns is described and evaluated. We discuss in detail how to identify frequent patterns in a protein sequence database using a level-wise search technique, how to define a set of features from those patterns and how to use those features in the prediction of the secondary structure of a protein sequence using support vector machines (SVMs).
Results: Three different sets of features based on frequent patterns are evaluated in a blind testing setup using 150 targets from the EVA contest and compared to predictions of PSI-PRED, PHD and PROFsec. Despite being trained on only 940 proteins, a simple SVM classifier based on this new representation yields results comparable to PSI-PRED and PROFsec. Finally, we show that the method contributes significant information to consensus predictions.
Availability: The method is available from the authors upon request.
Contact: kramer{at}in.tum.de
Received on April 12, 2006; revised on August 1, 2006; accepted on August 18, 2006
This article has been cited by other articles:
![]() |
K. Chen and L. Kurgan PFRES: protein fold classification by using evolutionary information and predicted secondary structure Bioinformatics, November 1, 2007; 23(21): 2843 - 2850. [Abstract] [Full Text] [PDF] |
||||
