Bioinformatics Advance Access originally published online on December 14, 2007
Bioinformatics 2008 24(3):358-366; doi:10.1093/bioinformatics/btm611
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Efficient peptide–MHC-I binding prediction for alleles with few known binders
Centre for Computational Biology, École des Mines de Paris, 35 rue Saint Honoré, 77305 Fontainebleau Cedex, France
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: In silico methods for the prediction of antigenic peptides binding to MHC class I molecules play an increasingly important role in the identification of T-cell epitopes. Statistical and machine learning methods in particular are widely used to score candidate binders based on their similarity with known binders and non-binders. The genes coding for the MHC molecules, however, are highly polymorphic, and statistical methods have difficulties building models for alleles with few known binders. In this context, recent work has demonstrated the utility of leveraging information across alleles to improve the performance of the prediction.
Results: We design a support vector machine algorithm that is able to learn peptide–MHC-I binding models for many alleles simultaneously, by sharing binding information across alleles. The sharing of information is controlled by a user-defined measure of similarity between alleles. We show that this similarity can be defined in terms of supertypes, or more directly by comparing key residues known to play a role in the peptide–MHC binding. We illustrate the potential of this approach on various benchmark experiments where it outperforms other state-of-the-art methods.
Availability: The method is implemented on a web server: http://cbio.ensmp.fr/kiss. All data and codes are freely and publicly available from the authors.
Contact: laurent.jacob{at}ensmp.fr
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Thomas Lengauer
Received on July 23, 2007; revised on November 6, 2007; accepted on December 7, 2007
This article has been cited by other articles:
![]() |
L. Jacob and J.-P. Vert Protein-ligand interaction prediction: an improved chemogenomics approach Bioinformatics, October 1, 2008; 24(19): 2149 - 2156. [Abstract] [Full Text] [PDF] |
||||
