Bioinformatics Advance Access originally published online on September 28, 2004
Bioinformatics 2005 21(6):781-787; doi:10.1093/bioinformatics/bti053
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
An efficient Monte Carlo approach to assessing statistical significance in genomic studies
Department of Biostatistics, University of North Carolina McGavran-Greenberg Hall, CB #7420, Chapel Hill, NC 27599-7420, USA
Motivation: Multiple hypothesis testing is a common problem in genome research, particularly in microarray experiments and genomewide association studies. Failure to account for the effects of multiple comparisons would result in an abundance of false positive results. The Bonferroni correction and Holm's step-down procedure are overly conservative, whereas the permutation test is time-consuming and is restricted to simple problems.
Results: We developed an efficient Monte Carlo approach to approximating the joint distribution of the test statistics along the genome. We then used the Monte Carlo distribution to evaluate the commonly used criteria for error control, such as familywise error rates and positive false discovery rates. This approach is applicable to any data structures and test statistics. Applications to simulated and real data demonstrate that the proposed approach provides accurate error control, and can be substantially more powerful than the Bonferroni and Holm methods, especially when the test statistics are highly correlated.
Contact: lin{at}bios.unc.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Zhang Poisson approximation for significance in genome-wide ChIP-chip tiling arrays Bioinformatics, December 15, 2008; 24(24): 2825 - 2831. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Kustra, X. Shi, D. J. Murdoch, C. M. T. Greenwood, and J. Rangrej Efficient p-value estimation in massively parallel testing problems Biostat., October 1, 2008; 9(4): 601 - 612. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-C. Yang, H.-Y. Hsieh, and C. S. J. Fann Kernel-Based Association Test Genetics, June 1, 2008; 179(2): 1057 - 1068. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. I. Amos Successful design and conduct of genome-wide association studies Hum. Mol. Genet., October 15, 2007; 16(R2): R220 - R225. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Schork, T. A. Greenwood, and D. L. Braff Statistical Genetics Concepts and Approaches in Schizophrenia and Related Neuropsychiatric Research Schizophr Bull, January 1, 2007; 33(1): 95 - 104. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. L. Bonnycastle, C. J. Willer, K. N. Conneely, A. U. Jackson, C. P. Burrill, R. M. Watanabe, P. S. Chines, N. Narisu, L. J. Scott, S. T. Enloe, et al. Common Variants in Maturity-Onset Diabetes of the Young Genes Contribute to Risk of Type 2 Diabetes in Finns Diabetes, September 1, 2006; 55(9): 2534 - 2540. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-H. Jung and W. Jang How accurately can we control the FDR in analyzing microarray data? Bioinformatics, July 15, 2006; 22(14): 1730 - 1736. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Verducci, V. F. Melfi, S. Lin, Z. Wang, S. Roy, and C. K. Sen Microarray analysis of gene expression: considerations in data mining and statistical treatment Physiol Genomics, May 16, 2006; 25(3): 355 - 363. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Thomas Are We Ready for Genome-wide Association Studies? Cancer Epidemiol. Biomarkers Prev., April 1, 2006; 15(4): 595 - 598. [Full Text] [PDF] |
||||







