Vol. 20 no. 2 2004, pages 235-242
Bioinformatics © Oxford University Press 2004; all rights reserved.
An empirical Bayes adjustment to increase the sensitivity of detecting differentially expressed genes in microarray experiments



1 Department of Mathematics and Statistics, Georgia State University, Atlanta, GA 30303, USA, 2 Centers for Disease Control and Prevention, Atlanta, GA 30333, USA, 3 Department of Physiology and Biophysics, 4 Department of Surgery, University of Alabama at Birmingham, Birmingham, AL 35294, USA and 5 Department of Statistics, University of Georgia, Athens, GA 30602, USA
Received on March 24, 2003; revised on July 3, 2003; accepted on July 31, 2003
Motivation: Detection of differentially expressed genes is one of the major goals of microarray experiments. Pairwise comparison for each gene is not appropriate without controlling the overall (experimentwise) type I error rate. Dudoit et al. have advocated the use of permutation-based step-down P-value adjustments to correct the observed significance levels for the individual (i.e. per-gene) two-sample t-tests.
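As an illustration of the kind of permutation-based step-down adjustment mentioned above, the following is a minimal sketch of a Westfall-Young style step-down maxT procedure. It is not the authors' R code: the function names `welch_t` and `stepdown_maxt`, the choice of a Welch two-sample statistic, the number of permutations and the simulated data are all assumptions made for this example.

```python
import numpy as np

def welch_t(x, labels):
    """Welch two-sample t-statistic for each row of x (genes x samples).

    labels is a 0/1 array giving the group of each sample (an assumed layout).
    """
    a, b = x[:, labels == 0], x[:, labels == 1]
    se = np.sqrt(a.var(axis=1, ddof=1) / a.shape[1]
                 + b.var(axis=1, ddof=1) / b.shape[1])
    return (a.mean(axis=1) - b.mean(axis=1)) / se

def stepdown_maxt(x, labels, n_perm=1000, seed=0):
    """Step-down maxT adjusted P-values from random label permutations."""
    rng = np.random.default_rng(seed)
    t_obs = np.abs(welch_t(x, labels))
    order = np.argsort(-t_obs)               # most extreme statistic first
    counts = np.zeros(x.shape[0])
    for _ in range(n_perm):
        t_perm = np.abs(welch_t(x, rng.permutation(labels)))[order]
        # Successive maxima, built up from the least extreme gene.
        u = np.maximum.accumulate(t_perm[::-1])[::-1]
        counts += u >= t_obs[order]
    # Plain proportion count / n_perm; enforce monotone adjusted P-values.
    p_sorted = np.maximum.accumulate(counts / n_perm)
    p_adj = np.empty_like(p_sorted)
    p_adj[order] = p_sorted                   # back to the original gene order
    return p_adj

# Simulated example (hypothetical data): 50 genes, 6 + 6 samples,
# with the first three genes strongly shifted in the second group.
rng = np.random.default_rng(1)
labels = np.array([0] * 6 + [1] * 6)
x = rng.normal(size=(50, 12))
x[:3, labels == 1] += 10.0
p_adj = stepdown_maxt(x, labels, n_perm=500)
```

Because each gene is compared against the maximum permuted statistic over all genes ranked at or below it, the adjusted P-values control the experimentwise error rate while being less conservative than a single-step adjustment.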
Results: In this paper, we consider an ANOVA formulation of the gene expression levels corresponding to multiple tissue types. We provide resampling-based step-down adjustments to correct the observed significance levels for the individual ANOVA t-tests for each gene and for each pairwise comparison of tissue types. More importantly, we introduce a novel empirical Bayes adjustment to the t-test statistics that can be incorporated into the step-down procedure. Using simulated data, we show that the empirical Bayes adjustment improved the sensitivity of detecting differentially expressed genes by up to 16%, while maintaining a high level of specificity. This adjustment also reduces the false non-discovery rate to some degree at the cost of a modest increase in the false discovery rate. We illustrate our approach using a human colon cancer dataset consisting of oligonucleotide arrays of normal, adenoma and carcinoma cells. The number of genes whose differential expression was declared statistically significant was about 50 when comparing normal to adenoma cells and about five when comparing adenoma to carcinoma cells. This list includes genes previously known to be associated with colon cancer as well as some novel genes.
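The performance measures named above can be made concrete with a short sketch. It assumes a simulation in which the true status of every gene is known, and it computes the per-dataset proportions; the false discovery and false non-discovery rates proper are expectations of these proportions over repeated simulations. The function name `detection_metrics` and the toy vectors are hypothetical and do not come from the paper.

```python
import numpy as np

def detection_metrics(truth, called):
    """Sensitivity, specificity, and false (non-)discovery proportions.

    truth  : 1/True where a gene is truly differentially expressed
    called : 1/True where the procedure declared the gene significant
    """
    truth = np.asarray(truth, dtype=bool)
    called = np.asarray(called, dtype=bool)
    tp = int(np.sum(truth & called))      # true positives
    fp = int(np.sum(~truth & called))     # false positives
    fn = int(np.sum(truth & ~called))     # false negatives
    tn = int(np.sum(~truth & ~called))    # true negatives

    def ratio(num, den):
        # Convention assumed here: an empty denominator gives 0.
        return num / den if den else 0.0

    return {
        "sensitivity": ratio(tp, tp + fn),   # detected among truly changed
        "specificity": ratio(tn, tn + fp),   # not called among unchanged
        "fdr": ratio(fp, tp + fp),           # false among the called genes
        "fnr": ratio(fn, fn + tn),           # missed among the uncalled genes
    }

# Toy example: 4 truly changed genes, 6 unchanged.
truth = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
called = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
metrics = detection_metrics(truth, called)
```

In this toy example, three of the four truly changed genes are detected and one unchanged gene is called, which shows how a gain in sensitivity can coincide with a non-zero false discovery proportion.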
Availability: R code for the empirical Bayes adjustment and for the step-down P-value calculation via resampling is available from the supplementary website.
Supplementary information: http://www.mathstat.gsu.edu/~matsnd/EB/supp.htm
Contact: datta{at}stat.uga.edu
* To whom correspondence should be addressed.
Susmita Datta, Glen Satten and Somnath Datta participated in the development of the statistical models used in this paper and were not involved in the overall study design or contact with human subjects.
