Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (44)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Ooi, C. H.
Right arrow Articles by Tan, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ooi, C. H.
Right arrow Articles by Tan, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 19 no. 1 2003
Pages 37-44
© 2003 Oxford University Press

Genetic algorithms applied to multi-class prediction for the analysis of gene expression data

C. H. Ooi 1 and Patrick Tan 2,*

1 Nanyang Technological University, School of Mechanical and Production Engineering, 50 Nanyang Avenue, Singapore 639798
2 Division of Cellular and Molecular Research, National Cancer Center/Defence Medical Research Institute, 11 Hospital Drive, Singapore 169610, Republic of Singapore

Received on April 17, 2002 ; revised on July 3, 2002 ; accepted on July 15, 2002

Motivation: An important challenge in the use of large-scale gene expression data for biological classification occurs when the expression dataset being analyzed involves multiple classes. Key issues that need to be addressed under such circumstances are the efficient selection of good predictive gene groups from datasets that are inherently ‘noisy’, and the development of new methodologies that can enhance the successful classification of these complex datasets.

Methods: We have applied genetic algorithms (GAs) to the problem of multi-class prediction. A GA-based gene selection scheme is described that automatically determines the members of a predictive gene group, as well as the optimal group size, that maximizes classification success using a maximum likelihood (MLHD) classification method.

Results: The GA/MLHD-based approach achieves higher classification accuracies than other published predictive methods on the same multi-class test dataset. It also permits substantial feature reduction in classifier genesets without compromising predictive accuracy. We propose that GA-based algorithms may represent a powerful new tool in the analysis and exploration of complex multi-class gene expression data.

Availability: Supplementary information, data sets and source codes are available at http://www.omniarray.com/bioinformatics/GA

Contact: cmrtan{at}nccs.com.sg

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
G. B. Fogel
Computational intelligence approaches for pattern discovery in biological systems
Brief Bioinform, July 1, 2008; 9(4): 307 - 316.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F. Zhao, F. Zhao, T. Li, and D. A. Bryant
A new pheromone trail-based genetic algorithm for comparative genome assembly
Nucleic Acids Res., June 1, 2008; 36(10): 3455 - 3462.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Saeys, I. Inza, and P. Larranaga
A review of feature selection techniques in bioinformatics
Bioinformatics, October 1, 2007; 23(19): 2507 - 2517.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
M. Dolled-Filhart, L. Ryden, M. Cregger, K. Jirstrom, M. Harigopal, R. L. Camp, and D. L. Rimm
Classification of breast cancer using genetic algorithms and tissue microarrays.
Clin. Cancer Res., November 1, 2006; 12(21): 6459 - 6468.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Zhou and K. Z. Mao
The ties problem resulting from counting-based error estimators and its impact on gene selection algorithms
Bioinformatics, October 15, 2006; 22(20): 2507 - 2515.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. Trevino and F. Falciani
GALGO: an R package for multivariate variable selection using genetic algorithms
Bioinformatics, May 1, 2006; 22(9): 1154 - 1156.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
C. F. Basil, Y. Zhao, K. Zavaglia, P. Jin, M. C. Panelli, S. Voiculescu, S. Mandruzzato, H. M. Lee, B. Seliger, R. S. Freedman, et al.
Common cancer biomarkers.
Cancer Res., March 15, 2006; 66(6): 2953 - 2961.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
P. Larranaga, B. Calvo, R. Santana, C. Bielza, J. Galdiano, I. Inza, J. A. Lozano, R. Armananzas, G. Santafe, A. Perez, et al.
Machine learning in bioinformatics
Brief Bioinform, March 1, 2006; 7(1): 86 - 112.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
D. W. Mount and R. Pandey
Using bioinformatics and genome analysis for new therapeutic interventions
Mol. Cancer Ther., October 1, 2005; 4(10): 1636 - 1643.
[Abstract] [Full Text] [PDF]


Home page
J. Mol. Diagn.Home page
J. Stec, J. Wang, K. Coombes, M. Ayers, S. Hoersch, D. L. Gold, J. S Ross, K. R. Hess, S. Tirrell, G. Linette, et al.
Comparison of the Predictive Accuracy of DNA Array-Based Multigene Classifiers across cDNA Arrays and Affymetrix GeneChips
J. Mol. Diagn., August 1, 2005; 7(3): 357 - 367.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Liang, B. Tayo, X. Cai, and A. Kelemen
Differential and trajectory methods for time course gene expression data
Bioinformatics, July 1, 2005; 21(13): 3009 - 3016.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. J. Liu, G. Cutler, W. Li, Z. Pan, S. Peng, T. Hoey, L. Chen, and X. B. Ling
Multiclass cancer classification and biomarker discovery using GA-based algorithms
Bioinformatics, June 1, 2005; 21(11): 2691 - 2697.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. M. Jarvis and R. Goodacre
Genetic algorithm optimization for pre-processing and variable selection of spectroscopic data
Bioinformatics, April 1, 2005; 21(7): 860 - 868.
[Abstract] [Full Text] [PDF]


Home page
J. Gerontol. B Psychol. Sci. Soc. Sci.Home page
M. J. Shanahan and S. M. Hofer
Social Context in Gene-Environment Interactions: Retrospect and Prospect
J. Gerontol. B. Psychol. Sci. Soc. Sci., March 1, 2005; 60(suppl_Special_Issue_1): 65 - 76.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.