Bioinformatics Advance Access originally published online on March 24, 2007
Bioinformatics 2007 23(8):942-949; doi:10.1093/bioinformatics/btm061
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
POPI: predicting immunogenicity of MHC class I binding peptides by mining informative physicochemical properties
1Institute of Bioinformatics and 2Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 300, Taiwan
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Both modeling of antigen-processing pathway including major histocompatibility complex (MHC) binding and immunogenicity prediction of those MHC-binding peptides are essential to develop a computer-aided system of peptide-based vaccine design that is one goal of immunoinformatics. Numerous studies have dealt with modeling the immunogenic pathway but not the intractable problem of immunogenicity prediction due to complex effects of many intrinsic and extrinsic factors. Moderate affinity of the MHC–peptide complex is essential to induce immune responses, but the relationship between the affinity and peptide immunogenicity is too weak to use for predicting immunogenicity. This study focuses on mining informative physicochemical properties from known experimental immunogenicity data to understand immune responses and predict immunogenicity of MHC-binding peptides accurately.
Results: This study proposes a computational method to mine a feature set of informative physicochemical properties from MHC class I binding peptides to design a support vector machine (SVM) based system (named POPI) for the prediction of peptide immunogenicity. High performance of POPI arises mainly from an inheritable bi-objective genetic algorithm, which aims to automatically determine the best number m out of 531 physicochemical properties, identify these m properties and tune SVM parameters simultaneously. The dataset consisting of 428 human MHC class I binding peptides belonging to four classes of immunogenicity was established from MHCPEP, a database of MHC-binding peptides (Brusic et al., 1998). POPI, utilizing the m = 23 selected properties, performs well with the accuracy of 64.72% using leave-one-out cross-validation, compared with two sequence alignment-based prediction methods ALIGN (54.91%) and PSI-BLAST (53.23%). POPI is the first computational system for prediction of peptide immunogenicity based on physicochemical properties.
Availability: A web server for prediction of peptide immunogenicity (POPI) and the used dataset of MHC class I binding peptides (PEPMHCI) are available at http://iclab.life.nctu.edu.tw/POPI
Contact: syho{at}mail.nctu.edu.tw
Associate Editor: Limsoon Wong
Received on October 28, 2006; revised on February 14, 2007; accepted on February 14, 2007
This article has been cited by other articles:
![]() |
L. Jacob and J.-P. Vert Efficient peptide-MHC-I binding prediction for alleles with few known binders Bioinformatics, February 1, 2008; 24(3): 358 - 366. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kawashima, P. Pokarowski, M. Pokarowska, A. Kolinski, T. Katayama, and M. Kanehisa AAindex: amino acid index database, progress report 2008 Nucleic Acids Res., January 11, 2008; 36(suppl_1): D202 - D205. [Abstract] [Full Text] [PDF] |
||||

