Bioinformatics Advance Access originally published online on August 16, 2005
Bioinformatics 2005 21(20):3865-3872; doi:10.1093/bioinformatics/bti626
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bias in the estimation of false discovery rate in microarray studies
1Department of Medical Epidemiology and Biostatistics, Karolinska Institutet 17177 Stockholm, Sweden
2Genome Institute of Singapore Singapore
3Unit of Biostatistics and Epidemiology, Institute Gustave Roussy Villejuif, France
*To whom correspondence should be adderessed.
Motivation: The false discovery rate (FDR) provides a key statistical assessment for microarray studies. Its value depends on the proportion
0 of non-differentially expressed (non-DE) genes. In most microarray studies, many genes have small effects not easily separable from non-DE genes. As a result, current methods often overestimate
0 and FDR, leading to unnecessary loss of power in the overall analysis.
Methods: For the common two-sample comparison we derive a natural mixture model of the test statistic and an explicit bias formula in the standard estimation of
0. We suggest an improved estimation of
0 based on the mixture model and describe a practical likelihood-based procedure for this purpose.
Results: The analysis shows that a large bias occurs when
0 is far from 1 and when the non-centrality parameters of the distribution of the test statistic are near zero. The theoretical result also explains substantial discrepancies between non-parametric and model-based estimates of
0. Simulation studies indicate mixture-model estimates are less biased than standard estimates. The method is applied to breast cancer and lymphoma data examples.
Availability: An R-package OCplus containing functions to compute
0 based on the mixture model, the resulting FDR and other operating characteristics of microarray data, is freely available at http://www.meb.ki.se/~yudpaw
Contact: yudi.pawitan{at}meb.ki.se and alexander.ploner{at}meb.ki.se
Received on June 8, 2005; revised on July 11, 2005; accepted on August 10, 2005
This article has been cited by other articles:
![]() |
M. Zhang, C. Yao, Z. Guo, J. Zou, L. Zhang, H. Xiao, D. Wang, D. Yang, X. Gong, J. Zhu, et al. Apparently low reproducibility of true differential expression discoveries in microarray studies Bioinformatics, September 15, 2008; 24(18): 2057 - 2063. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Pawitan, S. Calza, and A. Ploner Estimation of false discovery proportion under general dependence Bioinformatics, December 15, 2006; 22(24): 3025 - 3031. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. S. Mehta, S. O. Zakharkin, G. L. Gadbury, and D. B. Allison Epistemological issues in omics and high-dimensional biology: give the people what they want Physiol Genomics, December 13, 2006; 28(1): 24 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ploner, S. Calza, A. Gusnanto, and Y. Pawitan Multidimensional local false discovery rate for microarray studies Bioinformatics, March 1, 2006; 22(5): 556 - 565. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wolfram, E. Haas, and P. Bauerfeind Nickel Represses the Synthesis of the Nickel Permease NixA of Helicobacter pylori J. Bacteriol., February 15, 2006; 188(4): 1245 - 1250. [Abstract] [Full Text] [PDF] |
||||


