Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (64)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Lee, K. E.
Right arrow Articles by Mallick, B. K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lee, K. E.
Right arrow Articles by Mallick, B. K.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 19 no. 1 2003
Pages 90-97
© 2003 Oxford University Press

Gene selection: a Bayesian variable selection approach

Kyeong Eun Lee 1, Naijun Sha 4, Edward R. Dougherty 2,3, Marina Vannucci 1 and Bani K. Mallick 1,*

1 Department of Statistics, Texas A&M University, College Station, TX 77843-3143, USA
2 Department of Electrical Engineering, Texas A&M University, College Station, TX 77840, USA
3 Department of Pathology, University of Texas, M. D. Anderson Cancer Center, USA
4 Mathematical Sciences Department, University of Texas at El Paso, USA

Received on January 24, 2002 ; revised on April 29, 2002 ; accepted on June 17, 2002

Selection of significant genes via expression patterns is an important problem in microarray experiments. Owing to small sample size and the large number of variables (genes), the selection process can be unstable. This paper proposes a hierarchical Bayesian model for gene (variable) selection. We employ latent variables to specialize the model to a regression setting and uses a Bayesian mixture prior to perform the variable selection. We control the size of the model by assigning a prior distribution over the dimension (number of significant genes) of the model. The posterior distributions of the parameters are not in explicit form and we need to use a combination of truncated sampling and Markov Chain Monte Carlo (MCMC) based computation techniques to simulate the parameters from the posteriors. The Bayesian model is flexible enough to identify significant genes as well as to perform future predictions. The method is applied to cancer classification via cDNA microarrays where the genes BRCA1 and BRCA2 are associated with a hereditary disposition to breast cancer, and the method is used to identify a set of significant genes. The method is also applied successfully to the leukemia data.

Supplementary information: http://stat.tamu.edu/people/faculty/bmallick.html

Contact: bmallick{at}stat.tanu.edu

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
Y. Saeys, I. Inza, and P. Larranaga
A review of feature selection techniques in bioinformatics
Bioinformatics, October 1, 2007; 23(19): 2507 - 2517.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
P. Sykacek, R. Clarkson, C. Print, R. Furlong, and G. Micklem
Bayesian modelling of shared gene function
Bioinformatics, August 1, 2007; 23(15): 1936 - 1944.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J.G. Liao and K.-V. Chin
Logistic regression for disease classification using microarray data: model selection in a large p and small n case
Bioinformatics, August 1, 2007; 23(15): 1945 - 1951.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Huang and T. W. S. Chow
Identifying the biologically relevant gene categories based on gene expression and biological data: an example on prostate cancer
Bioinformatics, June 15, 2007; 23(12): 1503 - 1510.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Zhou and D. P. Tuck
MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data
Bioinformatics, May 1, 2007; 23(9): 1106 - 1114.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
W. Jiang
On the consistency of bayesian variable selection for high dimensional binary regression and classification.
Neural Comput., November 1, 2006; 18(11): 2762 - 2776.
[Abstract] [Full Text] [PDF]


Home page
Transactions of the Institute of Measurement and ControlHome page
H.-Q. Wang and K. Li
A New Algorithm Based on Support Vectors and Penalty Strategy for Identifying Key Genes Related with Cancer
Transactions of the Institute of Measurement and Control, August 1, 2006; 28(3): 263 - 273.
[Abstract] [PDF]


Home page
FASEB J.Home page
M. G. Kolonin, J. Sun, K.-A. Do, C. I. Vidal, Y. Ji, K. A. Baggerly, R. Pasqualini, and W. Arap
Synchronous selection of homing peptides for multiple tissues by in vivo phage display
FASEB J, May 1, 2006; 20(7): 979 - 981.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. H. Zhang, J. Ahn, X. Lin, and C. Park
Gene selection using support vector machines with non-convex penalty
Bioinformatics, January 1, 2006; 22(1): 88 - 95.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Y. Yeung, R. E. Bumgarner, and A. E. Raftery
Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data
Bioinformatics, May 15, 2005; 21(10): 2394 - 2402.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Datta and S. Datta
Empirical Bayes screening of many p-values with applications to microarray studies
Bioinformatics, May 1, 2005; 21(9): 1987 - 1994.
[Abstract] [Full Text] [PDF]


Home page
Toxicol PatholHome page
E. Floyd and T. M. Mcshane
Development and Use of Biomarkers in Oncology Drug Development
Toxicol Pathol, January 1, 2004; 32(1_suppl): 106 - 115.
[Abstract] [PDF]


Home page
J ANIM SCIHome page
B. Walsh and D. Henderson
Microarrays and beyond: What potential do current and future genomics tools have for breeders?
J Anim Sci, January 1, 2004; 82(13_suppl): E292 - 299.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
X. Zhou, X. Wang, and E. R. Dougherty
Binarization of Microarray Data on the Basis of a Mixture Model
Mol. Cancer Ther., July 1, 2003; 2(7): 679 - 684.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.