Bioinformatics Advance Access originally published online on January 29, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(5) © Oxford University Press 2004; all rights reserved.
Bio-support vector machines for computational proteomics
1 Department of Computer Science, Exeter University, Exeter EX4 4PT, UK, 2 Gordon Life Science Institute, 13784 Torrey Del Mar Drive, San Diego, CA 92130, USA and 3 Tianjin Institute of Bioinformatics & Drug Discovery (TIBDD), Tianjin, China
Received on June 25, 2003
; revised on August 7, 2003
; accepted on August 19, 2003
Advance Access Publication January 29, 2004
Motivation: One of the most important issues in computational proteomics is to produce a prediction model for the classification or annotation of biological function of novel protein sequences. In order to improve the prediction accuracy, much attention has been paid to the improvement of the performance of the algorithms used, few is for solving the fundamental issue, namely, amino acid encoding as most existing pattern recognition algorithms are unable to recognize amino acids in protein sequences. Importantly, the most commonly used amino acid encoding method has the flaw that leads to large computational cost and recognition bias.
Results: By replacing kernel functions of support vector machines (SVMs) with amino acid similarity measurement matrices, we have modified SVMs, a new type of pattern recognition algorithm for analysing protein sequences, particularly for proteolytic cleavage site prediction. We refer to the modified SVMs as bio-support vector machine. When applied to the prediction of HIV protease cleavage sites, the new method has shown a remarkable advantage in reducing the model complexity and enhancing the model robustness.
Contact: Z.R.Yang{at}exeter.ac.uk
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Hye, S. Lynham, M. Thambisetty, M. Causevic, J. Campbell, H. L. Byers, C. Hooper, F. Rijsdijk, S. J. Tabrizi, S. Banner, et al. Proteome-based plasma biomarkers for Alzheimer's disease. Brain, November 1, 2006; 129(Pt 11): 3042 - 3050. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. You, D. Garwicz, and T. Rognvaldsson Comprehensive Bioinformatic Analysis of the Specificity of Human Immunodeficiency Virus Type 1 Protease J. Virol., October 1, 2005; 79(19): 12477 - 12486. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bhasin and G. P. S. Raghava GPCRsclass: a web tool for the classification of amine type of G-protein-coupled receptors Nucleic Acids Res., July 1, 2005; 33(suppl_2): W143 - W147. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. R. Yang Mining SARS-CoV protease cleavage data using non-orthogonal decision trees: a novel method for decisive template selection Bioinformatics, June 1, 2005; 21(11): 2644 - 2650. [Abstract] [Full Text] [PDF] |
||||



