Bioinformatics Vol. 19 no. 13 2003
Pages 1650-1655
© 2003 Oxford University Press
Secondary structure prediction with support vector machines
Bioinformatics Group, Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, UK
Received on November 25, 2002
; revised on February 6, 2003
; accepted on February 10, 2003
Motivation: A new method that uses support vector machines (SVMs) to predict protein secondary structure is described and evaluated. The study is designed to develop a reliable prediction method using an alternative technique and to investigate the applicability of SVMs to this type of bioinformatics problem.
Methods: Binary SVMs are trained to discriminate between two structural classes. The binary classifiers are combined in several ways to predict multi-class secondary structure.
Results: The average three-state prediction accuracy per protein (Q3) is estimated by cross-validation to be 77.07 ± 0.26% with a segment overlap (Sov) score of 73.32 ± 0.39%. The SVM performs similarly to the 'state-of-the-art' PSIPRED prediction method on a non-homologous test set of 121 proteins despite being trained on substantially fewer examples. A simple consensus of the SVM, PSIPRED and PROFsec achieves significantly higher prediction accuracy than the individual methods.
Availability: The SVM classifier is available from the authors. Work is in progress to make the method available on-line and to integrate the SVM predictions into the PSIPRED server.
Contact: dtj{at}cs.ucl.ac.uk
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
O. Zimmermann and U. H. E. Hansmann Support vector machines for prediction of dihedral angle regions Bioinformatics, December 15, 2006; 22(24): 3009 - 3015. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Birzele and S. Kramer A new representation for protein secondary structure prediction based on frequent patterns Bioinformatics, November 1, 2006; 22(21): 2628 - 2634. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Vullo, O. Bortolami, G. Pollastri, and S. C. E. Tosatto Spritz: a server for the prediction of intrinsically disordered regions in protein sequences using kernel machines. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W164 - W168. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. C. Kulkarni, R. Vigneshwar, V. K. Jayaraman, and B. D. Kulkarni Identification of coding and non-coding sequences using local Holder exponent formalism Bioinformatics, October 15, 2005; 21(20): 3818 - 3823. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Zhang, S. Yoon, and W. J. Welsh Improved method for predicting {beta}-turn using support vector machine Bioinformatics, May 15, 2005; 21(10): 2370 - 2374. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bhasin and G. P. S. Raghava Classification of Nuclear Receptors Based on Amino Acid Composition and Dipeptide Composition J. Biol. Chem., May 28, 2004; 279(22): 23262 - 23266. [Abstract] [Full Text] [PDF] |
||||


