Bioinformatics Advance Access originally published online on April 29, 2004
Bioinformatics 2004 20(16):2572-2578; doi:10.1093/bioinformatics/bth286
Bioinformatics vol. 20 issue 16 © Oxford University Press 2004; all rights reserved.
Predicting allergenic proteins using wavelet transform
Bioinformatics Institute, 30 Biopolis Street, Singapore 138671, Singapore
Received on December 5, 2003; revised on April 7, 2004; accepted on April 13, 2004
Motivation: With many transgenic proteins being introduced today, the ability to predict their potential allergenicity has become an important issue. Previous studies were based either on sequence similarity or on protein motifs identified from known allergen databases. Similarity-based approaches can achieve high recall but usually have low precision. Motif-based approaches have been shown to improve precision in cross-validation experiments. This study describes a system that combines the advantages of similarity-based and motif-based prediction.
Results: The new prediction system uses a clustering algorithm to group the known allergenic proteins into clusters. Proteins within each cluster are assumed to carry one or more common motifs. After multiple sequence alignment, the proteins in each cluster are passed through a wavelet analysis program that identifies conserved motifs. A hidden Markov model (HMM) profile is then prepared for each identified motif. Allergens that do not appear to carry detectable allergen motifs are saved in a small database. The allergenicity of an unknown protein is predicted by comparing it against the HMM profiles and, if no matching profile is found, against the small allergen database using BLASTP. Recall of over 70% and precision of over 90% were observed in cross-validation experiments. Using the entire Swiss-Prot database as the query set, we predicted about 2000 potential allergens.
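The two-stage decision described above (motif profiles first, similarity search as a fallback) and the recall and precision measures can be sketched as follows. This is a minimal illustration, not the authors' software: the function names, the scorer callables and the thresholds are all hypothetical stand-ins, and in a real pipeline the scorers would wrap an HMM profile search and a BLASTP search respectively.

```python
from typing import Callable, Iterable, Sequence, Tuple

# Hypothetical scorer type: takes a protein sequence, returns a match score.
Scorer = Callable[[str], float]


def predict_allergen(
    sequence: str,
    profile_scorers: Sequence[Scorer],
    similarity_scorer: Scorer,
    profile_threshold: float,
    similarity_threshold: float,
) -> Tuple[bool, str]:
    """Return (is_predicted_allergen, stage_that_decided)."""
    # Stage 1: a hit against any motif profile is enough to call an allergen.
    for scorer in profile_scorers:
        if scorer(sequence) >= profile_threshold:
            return True, "profile"
    # Stage 2: no profile matched, so fall back to the similarity search
    # against the allergens that carry no detectable motif.
    if similarity_scorer(sequence) >= similarity_threshold:
        return True, "similarity"
    return False, "none"


def precision_recall(
    predicted: Iterable[bool], actual: Iterable[bool]
) -> Tuple[float, float]:
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall
```

Passing the scorers in as callables keeps the sketch independent of any particular HMM or BLAST tool; the order of the two stages is the only part taken directly from the description above.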
Availability: The software is available upon request from the authors.
Contact: kuobin{at}bii.a-star.edu.sg
* To whom correspondence should be addressed.