Bioinformatics Vol. 19 no. 1 2003
Pages 90-97
© 2003 Oxford University Press
Gene selection: a Bayesian variable selection approach
1 Department of Statistics, Texas A&M University,
College Station, TX 77843-3143, USA
2 Department of Electrical Engineering, Texas A&M University,
College Station, TX 77840, USA
3 Department of Pathology, University of Texas,
M. D. Anderson Cancer Center, USA
4 Mathematical Sciences Department, University of Texas at El Paso,
USA
Received on January 24, 2002
; revised on April 29, 2002
; accepted on June 17, 2002
Selection of significant genes via expression patterns is an important problem in microarray experiments. Owing to small sample size and the large number of variables (genes), the selection process can be unstable. This paper proposes a hierarchical Bayesian model for gene (variable) selection. We employ latent variables to specialize the model to a regression setting and uses a Bayesian mixture prior to perform the variable selection. We control the size of the model by assigning a prior distribution over the dimension (number of significant genes) of the model. The posterior distributions of the parameters are not in explicit form and we need to use a combination of truncated sampling and Markov Chain Monte Carlo (MCMC) based computation techniques to simulate the parameters from the posteriors. The Bayesian model is flexible enough to identify significant genes as well as to perform future predictions. The method is applied to cancer classification via cDNA microarrays where the genes BRCA1 and BRCA2 are associated with a hereditary disposition to breast cancer, and the method is used to identify a set of significant genes. The method is also applied successfully to the leukemia data.
Supplementary information: http://stat.tamu.edu/people/faculty/bmallick.html
Contact: bmallick{at}stat.tanu.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Saeys, I. Inza, and P. Larranaga A review of feature selection techniques in bioinformatics Bioinformatics, October 1, 2007; 23(19): 2507 - 2517. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sykacek, R. Clarkson, C. Print, R. Furlong, and G. Micklem Bayesian modelling of shared gene function Bioinformatics, August 1, 2007; 23(15): 1936 - 1944. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.G. Liao and K.-V. Chin Logistic regression for disease classification using microarray data: model selection in a large p and small n case Bioinformatics, August 1, 2007; 23(15): 1945 - 1951. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Huang and T. W. S. Chow Identifying the biologically relevant gene categories based on gene expression and biological data: an example on prostate cancer Bioinformatics, June 15, 2007; 23(12): 1503 - 1510. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Zhou and D. P. Tuck MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data Bioinformatics, May 1, 2007; 23(9): 1106 - 1114. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Jiang On the consistency of bayesian variable selection for high dimensional binary regression and classification. Neural Comput., November 1, 2006; 18(11): 2762 - 2776. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-Q. Wang and K. Li A New Algorithm Based on Support Vectors and Penalty Strategy for Identifying Key Genes Related with Cancer Transactions of the Institute of Measurement and Control, August 1, 2006; 28(3): 263 - 273. [Abstract] [PDF] |
||||
![]() |
M. G. Kolonin, J. Sun, K.-A. Do, C. I. Vidal, Y. Ji, K. A. Baggerly, R. Pasqualini, and W. Arap Synchronous selection of homing peptides for multiple tissues by in vivo phage display FASEB J, May 1, 2006; 20(7): 979 - 981. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. H. Zhang, J. Ahn, X. Lin, and C. Park Gene selection using support vector machines with non-convex penalty Bioinformatics, January 1, 2006; 22(1): 88 - 95. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Y. Yeung, R. E. Bumgarner, and A. E. Raftery Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data Bioinformatics, May 15, 2005; 21(10): 2394 - 2402. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Datta and S. Datta Empirical Bayes screening of many p-values with applications to microarray studies Bioinformatics, May 1, 2005; 21(9): 1987 - 1994. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Floyd and T. M. Mcshane Development and Use of Biomarkers in Oncology Drug Development Toxicol Pathol, January 1, 2004; 32(1_suppl): 106 - 115. [Abstract] [PDF] |
||||
![]() |
B. Walsh and D. Henderson Microarrays and beyond: What potential do current and future genomics tools have for breeders? J Anim Sci, January 1, 2004; 82(13_suppl): E292 - 299. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Zhou, X. Wang, and E. R. Dougherty Binarization of Microarray Data on the Basis of a Mixture Model Mol. Cancer Ther., July 1, 2003; 2(7): 679 - 684. [Abstract] [Full Text] [PDF] |
||||






