Bioinformatics Vol. 19 no. 1 2003
Pages 37-44
© 2003 Oxford University Press
Genetic algorithms applied to multi-class prediction for the analysis of gene expression data
1 Nanyang Technological University,
School of Mechanical and Production Engineering, 50 Nanyang Avenue,
Singapore 639798
2 Division of Cellular and Molecular Research,
National Cancer Center/Defence Medical Research Institute,
11 Hospital Drive, Singapore 169610, Republic of Singapore
Received on April 17, 2002; revised on July 3, 2002; accepted on July 15, 2002
Motivation: An important challenge in the use of large-scale gene expression data for biological classification occurs when the expression dataset being analyzed involves multiple classes. Key issues that need to be addressed under such circumstances are the efficient selection of good predictive gene groups from datasets that are inherently noisy, and the development of new methodologies that can enhance the successful classification of these complex datasets.
Methods: We have applied genetic algorithms (GAs) to the problem of multi-class prediction. A GA-based gene selection scheme is described that automatically determines the members of a predictive gene group, as well as the optimal group size, that maximizes classification success using a maximum likelihood (MLHD) classification method.
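As a rough illustration of the scheme described above, the following minimal Python sketch evolves variable-size gene subsets and scores each one by the leave-one-out accuracy of a Gaussian maximum likelihood classifier. It is not the authors' implementation. The function names (`mlhd_predict`, `fitness`, `ga_select`), the diagonal pooled variance, the equal class priors, the tournament selection, the union-based crossover and all parameter defaults are assumptions made for this example.

```python
import random


def mlhd_predict(train_X, train_y, x, genes):
    """Assign x to the class with the highest Gaussian log-likelihood.

    Assumption: equal priors and a diagonal pooled variance (a simplified
    stand-in for a full maximum likelihood discriminant).
    """
    classes = sorted(set(train_y))
    means = {}
    for c in classes:
        rows = [r for r, label in zip(train_X, train_y) if label == c]
        means[c] = [sum(r[g] for r in rows) / len(rows) for g in genes]
    dof = max(len(train_X) - len(classes), 1)
    var = []
    for j, g in enumerate(genes):
        ss = sum((r[g] - means[label][j]) ** 2
                 for r, label in zip(train_X, train_y))
        var.append(ss / dof + 1e-9)  # small floor avoids division by zero

    def loglik(c):
        return -0.5 * sum((x[g] - means[c][j]) ** 2 / var[j]
                          for j, g in enumerate(genes))

    return max(classes, key=loglik)


def fitness(X, y, genes):
    """Leave-one-out classification accuracy for one gene subset."""
    hits = 0
    for i in range(len(X)):
        pred = mlhd_predict(X[:i] + X[i + 1:], y[:i] + y[i + 1:], X[i], genes)
        hits += pred == y[i]
    return hits / len(X)


def crossover(a, b, min_size, max_size, rng):
    """Child draws a random-size subset from the union of both parents."""
    pool = sorted(set(a) | set(b))
    size = rng.randint(min_size, min(max_size, len(pool)))
    return rng.sample(pool, size)


def mutate(genes, n_genes, rate, rng):
    """Swap each gene for an unused one with probability `rate`."""
    out = list(genes)
    for j in range(len(out)):
        if rng.random() < rate:
            unused = [g for g in range(n_genes) if g not in out]
            if unused:
                out[j] = rng.choice(unused)
    return out


def ga_select(X, y, min_size=1, max_size=5, pop_size=20,
              generations=15, mutation_rate=0.1, seed=0):
    """Return (best gene subset, its fitness) found by the GA."""
    rng = random.Random(seed)
    n_genes = len(X[0])
    pop = [rng.sample(range(n_genes), rng.randint(min_size, max_size))
           for _ in range(pop_size)]
    best, best_fit = None, -1.0
    for _ in range(generations):
        scored = [(fitness(X, y, g), g) for g in pop]
        for f, g in scored:
            # Prefer higher accuracy, then smaller groups on ties.
            if f > best_fit or (f == best_fit and len(g) < len(best)):
                best, best_fit = g, f

        def pick():  # binary tournament selection
            return max(rng.sample(scored, 2), key=lambda t: t[0])[1]

        # Elitism: carry the best subset over unchanged.
        pop = [best] + [
            mutate(crossover(pick(), pick(), min_size, max_size, rng),
                   n_genes, mutation_rate, rng)
            for _ in range(pop_size - 1)
        ]
    return sorted(best), best_fit
```

Here `X` is a list of samples (each a list of expression values) and `y` the matching class labels. Because group size is part of each candidate solution, the search can settle on a smaller gene group whenever it classifies as well as a larger one.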
Results: The GA/MLHD-based approach achieves higher classification accuracies than other published predictive methods on the same multi-class test dataset. It also permits substantial feature reduction in classifier genesets without compromising predictive accuracy. We propose that GA-based algorithms may represent a powerful new tool in the analysis and exploration of complex multi-class gene expression data.
Availability: Supplementary information, data sets and source code are available at http://www.omniarray.com/bioinformatics/GA
Contact: cmrtan{at}nccs.com.sg
* To whom correspondence should be addressed.








