Bioinformatics Vol. 16 no. 9 2000
Pages 799-807
© 2000 Oxford University Press
Engineering support vector machine kernels that recognize translation initiation sites
1 GMD.SCAI, Schloss Birlinghoven, 53754
Sankt Augustin, Germany
2 GMD.FIRST,
Kekuléstraße 7, 12489 Berlin,
Germany
3 Microsoft Research, 1 Guildhall Street,
Cambridge CB2 3NH, UK
Received on December 17, 1999
; revised on March 22, 2000
; accepted on March 29, 2000
Motivation: In order to extract protein sequences from nucleotide sequences, it is an important step to recognize points at which regions start that code for proteins. These points are called translation initiation sites (TIS).
Results: The task of finding TIS can be modeled as a classification problem. We demonstrate the applicability of support vector machines for this task, and show how to incorporate prior biological knowledge by engineering an appropriate kernel function. With the described techniques the recognition performance can be improved by 26% over leading existing approaches. We provide evidence that existing related methods (e.g. ESTScan) could profit from advanced TIS recognition.
Contact: {Alexander.Zien,Gunnar.Raetsch,Sebastian. Mika}@gmd.de; bsc{at}microsoft.com
To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Charaniya, S. Mehra, W. Lian, K. P. Jayapal, G. Karypis, and W.-S. Hu Transcriptome dynamics-based operon prediction and verification in Streptomyces coelicolor Nucleic Acids Res., December 18, 2007; 35(21): 7222 - 7236. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Saeys, T. Abeel, S. Degroeve, and Y. Van de Peer Translation initiation site prediction on a genomic scale: beauty in simplicity Bioinformatics, July 1, 2007; 23(13): i418 - i423. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Z. Kelemen, A. Kertesz-Farkas, A. Kocsor, and L. G. Puskas Kalman filtering for disease-state estimation from microarray data Bioinformatics, December 15, 2006; 22(24): 3047 - 3053. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. SUGIYAMA and H. OGAWA Constructing Kernel Functions for Binary Regression IEICE Trans D: Information, July 1, 2006; E89-D(7): 2243 - 2249. [Abstract] [PDF] |
||||
![]() |
F. Ferre and P. Clote DiANNA 1.1: an extension of the DiANNA web server for ternary cysteine classification. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W182 - W185. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. H. Lin, L. Y. Han, H. L. Zhang, C. J. Zheng, B. Xie, and Y. Z. Chen Prediction of the functional class of lipid binding proteins from sequence-derived properties irrespective of sequence similarity J. Lipid Res., April 1, 2006; 47(4): 824 - 831. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Idicula-Thomas, A. J. Kulkarni, B. D. Kulkarni, V. K. Jayaraman, and P. V. Balaji A support vector machine-based method for predicting the propensity of a protein to be soluble or to form inclusion body on overexpression in Escherichia coli Bioinformatics, February 1, 2006; 22(3): 278 - 284. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Liu, G. Cutler, W. Li, Z. Pan, S. Peng, T. Hoey, L. Chen, and X. B. Ling Multiclass cancer classification and biomarker discovery using GA-based algorithms Bioinformatics, June 1, 2005; 21(11): 2691 - 2697. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tsuda, S. Akaho, M. Kawanabe, and K.-R. Muller Asymptotic Properties of the Fisher Kernel Neural Comput., January 1, 2004; 16(1): 115 - 137. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. H-F. Zhang, K. A. Heller, I. Hefter, C. S. Leslie, and L. A. Chasin Sequence Information for the Splicing of Human Pre-mRNA Identified by Support Vector Machine Classification Genome Res., December 1, 2003; 13(12): 2637 - 2650. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Mathe, M.-F. Sagot, T. Schiex, and P. Rouze Current methods of gene prediction, their strengths and weaknesses Nucleic Acids Res., October 1, 2002; 30(19): 4103 - 4117. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Mjolsness and D. DeCoste Machine Learning for Science: State of the Art and Future Prospects Science, September 14, 2001; 293(5537): 2051 - 2055. [Abstract] [Full Text] [PDF] |
||||






