Bioinformatics Advance Access published online on April 27, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl161
1 Department of Biostatistics and Bioinformatics, Duke University, NC 27710, USA
* To whom correspondence should be addressed.
Summary: We evaluate the performance of two FDR-based multiple testing procedures, those of Benjamini and Hochberg (1995) and Storey (2002), in analyzing real microarray data. These procedures commonly require independence or weak dependence of the test statistics. However, expression levels of different genes on the same array are usually correlated because of coexpressed genes and various sources of error from experiment-specific and subject-specific conditions that are not adjusted for in the data analysis. Because of the high dimensionality of microarray data, it is usually impossible to check whether the weak dependence condition is met for a given data set. We propose to generate a large number of test statistics from a simulation model that has asymptotically (in the number of arrays) the same correlation structure as the test statistics calculated from the given data, and to investigate how accurately the FDR-based testing procedures control the FDR on the simulated data. Our approach directly checks the performance of these procedures for a given data set, rather than checking the weak dependence requirement. We illustrate the proposed method with real microarray data sets, one in which the clinical endpoint is disease group and another in which it is survival.
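The idea in the summary, checking FDR control empirically on simulated correlated test statistics, can be illustrated with a minimal Python sketch. It is not the authors' simulation model: the summary describes a model matched to the correlation structure of the given data, whereas this sketch assumes a simple equicorrelated normal model driven by one shared factor. The function names and the parameters `m`, `m1`, `effect`, `rho`, `n_sim`, and `seed` are hypothetical and chosen only for illustration. Only the Benjamini and Hochberg (1995) step-up rule is shown.

```python
import math
import random


def benjamini_hochberg(p_values, q=0.05):
    """Benjamini-Hochberg (1995) step-up rule; returns one rejection flag per test."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    k_max = 0
    # Find the largest rank k with p_(k) <= k * q / m.
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / m:
            k_max = rank
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected


def false_discovery_proportion(rejected, is_null):
    """Fraction of rejections that are true nulls (0 when nothing is rejected)."""
    n_rejected = sum(rejected)
    if n_rejected == 0:
        return 0.0
    n_false = sum(1 for r, null in zip(rejected, is_null) if r and null)
    return n_false / n_rejected


def estimate_fdr(m=500, m1=50, effect=3.0, rho=0.3, q=0.05, n_sim=200, seed=0):
    """Average the false discovery proportion over simulated data sets.

    Assumption (not from the paper): z-statistics are equicorrelated with
    pairwise correlation rho; the first m1 of the m tests have mean `effect`.
    """
    rng = random.Random(seed)
    is_null = [j >= m1 for j in range(m)]
    total = 0.0
    for _ in range(n_sim):
        shared = rng.gauss(0.0, 1.0)  # factor common to all tests in one data set
        p_values = []
        for j in range(m):
            z = math.sqrt(rho) * shared + math.sqrt(1.0 - rho) * rng.gauss(0.0, 1.0)
            if not is_null[j]:
                z += effect
            # Two-sided p-value for a standard normal statistic.
            p_values.append(math.erfc(abs(z) / math.sqrt(2.0)))
        total += false_discovery_proportion(benjamini_hochberg(p_values, q), is_null)
    return total / n_sim
```

Comparing, for example, `estimate_fdr(rho=0.0)` with `estimate_fdr(rho=0.6)` against the nominal level `q` gives a rough sense of how dependence affects the realized FDR under this toy model.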
Received December 5, 2005
Revised April 11, 2006
Accepted April 23, 2006
How accurately can we control the FDR in analyzing microarray data?
Sin-Ho Jung 1,* and Woncheol Jang 2
2 Institute of Statistics and Decision Sciences, Duke University, NC 27705, USA
Sin-Ho Jung, E-mail: jung0005{at}mc.duke.edu
Associate Editor: David Rocke