Skip Navigation


Bioinformatics Advance Access originally published online on December 13, 2005
Bioinformatics 2006 22(4):453-459; doi:10.1093/bioinformatics/bti826
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
22/4/453    most recent
bti826v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (4)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Duprat, E.
Right arrow Articles by Gascuel, O.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Duprat, E.
Right arrow Articles by Gascuel, O.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

A simple method to predict protein-binding from aligned sequences—application to MHC superfamily and ß2-microglobulin

Elodie Duprat 1, Marie-Paule Lefranc 1,2 and Olivier Gascuel 3,*

1Laboratoire d'ImmunoGénétique Moléculaire IGH (UPR CNRS 1142), 141 rue de la Cardonille, 34396 Montpellier Cedex 5, France
2Institut Universitaire de France 103 Boulevard Saint-Michel, 75005 Paris, France
3Projet Méthodes et Algorithmes pour la Bioinformatique LIRMM (UMR CNRS-UM2 5506), 161 rue Ada, 34392 Montpellier Cedex 5, France

*To whom correspondence should be addressed.

Motivation: The MHC superfamily (MhcSF) consists of immune system MHC class I (MHC-I) proteins, along with proteins with a MHC-I-like structure that are involved in a large variety of biological processes. ß2-Microglobulin (B2M) non-covalent binding to MHC-I proteins is required for their surface expression and function, whereas MHC-I-like proteins interact, or not, with B2M. This study was designed to predict B2M binding (or non-binding) of newly identified MhcSF proteins, in order to decipher their function, understand the molecular recognition mechanisms and identify deleterious mutations. IMGT standardization of MhcSF protein domains provides a unique numbering of the multiple alignment positions, and conditions to develop such predictive tool.

Method: We combine a simple-Bayes classifier with IMGT unique numbering. Our method involves two steps: (1) selection of discriminant binary features, which associate an alignment position with an amino acid group; and (2) learning of the classifier by estimating the frequencies of selected features, conditionally to the B2M binding property.

Results: Our dataset contains aligned sequences of 806 allelic forms of 47 MhcSF proteins, corresponding to 9 receptor types and 4 mammalian species. Eighteen discriminant features are selected, belonging to B2M contact sites, or stabilizing the molecular structure required for this contact. Three leave-one-out procedures are used to assess classifier performance, which corresponds to B2M binding prediction for: (1) new proteins, (2) species not represented in the dataset and (3) new receptor types. The prediction accuracy is high, i.e. 98, 94 and 70%, respectively. Application of our classifier to lower vertebrate MHC-I proteins indicates that these proteins bind to B2M and should then be expressed on the cellular surface by a process similar to that of mammalian MHC-I proteins. These results demonstrate the usefulness and accuracy of our (simple) approach, which should apply to other function or interaction prediction problems.

Availability: Data and MhcSF multiple alignments are available on the IMGT website (http://imgt.cines.fr).

Contact: gascuel{at}lirmm.fr, duprat{at}ligm.igh.cnrs.fr, lefranc{at}ligm.igh.cnrs.fr

Supplementary information: Supplementary material is downloadable at http://imgt.igh.cnrs.fr/MhcSF-B2M.html.


Received on August 3, 2005; revised on December 7, 2005; accepted on December 7, 2005

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
F. Ehrenmann, Q. Kaas, and M.-P. Lefranc
IMGT/3Dstructure-DB and IMGT/DomainGapAlign: a database and a tool for immunoglobulins or antibodies, T cell receptors, MHC, IgSF and MhcSF
Nucleic Acids Res., November 17, 2009; (2009) gkp946v2.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M.-P. Lefranc, V. Giudicelli, L. Regnier, and P. Duroux
IMGT, a system and an ontology that bridge biological and computational spheres in bioinformatics
Brief Bioinform, July 1, 2008; 9(4): 263 - 275.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
Q. Kaas, F. Ehrenmann, and M.-P. Lefranc
IG, TR and IgSF, MHC and MhcSF: what do we learn from the IMGT Colliers de Perles?
Brief Funct Genomic Proteomic, January 21, 2008; (2008) elm032v1.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.