Bioinformatics Vol. 19 no. 5 2003
Pages 563-570
© 2003 Oxford University Press
Effective dimension reduction methods for tumor classification using gene expression data
Laboratoire IMAG-LMC, University Joseph Fourier, BP 53, 38041 Grenoble Cedex 9, France
Received on April 21, 2002
; revised on October 8, 2002
; accepted on November 10, 2002
Motivation: One particular application of microarray data, is to uncover the molecular variation among cancers. One feature of microarray studies is the fact that the number n of samples collected is relatively small compared to the number p of genes per sample which are usually in the thousands. In statistical terms this very large number of predictors compared to a small number of samples or observations makes the classification problem difficult. An efficient way to solve this problem is by using dimension reduction statistical techniques in conjunction with nonparametric discriminant procedures.
Results: We view the classification problem as a regression problem with few observations and many predictor variables. We use an adaptive dimension reduction method for generalized semi-parametric regression models that allows us to solve the curse of dimensionality problem arising in the context of expression data. The predictive performance of the resulting classification rule is illustrated on two well know data sets in the microarray literature: the leukemia data that is known to contain classes that are easy separable and the colon data set.
Availability: Software that implements the procedures on which this paper focus are freely available at http://www-lmc.imag.fr/SMS/software/microarrays/
Contact: anestis.antoniadis{at}imag.fr
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
P. Qiu, Z. J. Wang, K. J. R. Liu, Z.-Z. Hu, and C. H. Wu Dependence network modeling for biomarker identification Bioinformatics, January 15, 2007; 23(2): 198 - 206. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Qiu, Z. J. Wang, and K. J. R. Liu Ensemble dependence model for classification and prediction of cancer and normal gene expression data Bioinformatics, July 15, 2005; 21(14): 3114 - 3121. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Fort and S. Lambert-Lacroix Classification using partial least squares with penalized logistic regression Bioinformatics, April 1, 2005; 21(7): 1104 - 1111. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Nambiar, A. Mirmohammadsadegh, R. Doroudi, A. Gustrau, A. Marini, G. Roeder, T. Ruzicka, and U. R. Hengge Signaling Networks in Cutaneous Melanoma Metastasis Identified by Complementary DNA Microarrays Arch Dermatol, February 1, 2005; 141(2): 165 - 173. [Abstract] [Full Text] [PDF] |
||||

