Bioinformatics Advance Access published online on September 28, 2004
Bioinformatics, doi:10.1093/bioinformatics/bti053
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina
* To whom correspondence should be addressed. E-mail: lin{at}bios.unc.edu.
Motivation: Multiple hypothesis testing is a common problem in genome research, particularly in microarray experiments and genomewide association studies. Failure to account for the effects of multiple comparisons would result in an abundance of false positive results. The Bonferroni correction and Holm's step-down procedure are overly conservative, while the permutation test is time-consuming and is restricted to simple problems. Results: We develop an efficient Monte Carlo approach to approximating the joint distribution of the test statistics along the genome. We then use the Monte Carlo distribution to evaluate the commonly used criteria for error control, such as family-wise error rates and positive false discovery rates. This approach is applicable to any data structures and test statistics. Applications to simulated and real data demonstrate that the proposed approach provides accurate error control, and can be substantially more powerful than the Bonferroni and Holm methods, especially when the test statistics are highly correlated.
Revised August 30, 2004
Accepted August 31, 2004
Article
An efficient Monte Carlo approach to assessing statistical significance in genomic studies
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Zhang Poisson approximation for significance in genome-wide ChIP-chip tiling arrays Bioinformatics, December 15, 2008; 24(24): 2825 - 2831. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Kustra, X. Shi, D. J. Murdoch, C. M. T. Greenwood, and J. Rangrej Efficient p-value estimation in massively parallel testing problems Biostat., October 1, 2008; 9(4): 601 - 612. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-C. Yang, H.-Y. Hsieh, and C. S. J. Fann Kernel-Based Association Test Genetics, June 1, 2008; 179(2): 1057 - 1068. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. I. Amos Successful design and conduct of genome-wide association studies Hum. Mol. Genet., October 15, 2007; 16(R2): R220 - R225. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Schork, T. A. Greenwood, and D. L. Braff Statistical Genetics Concepts and Approaches in Schizophrenia and Related Neuropsychiatric Research Schizophr Bull, January 1, 2007; 33(1): 95 - 104. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. L. Bonnycastle, C. J. Willer, K. N. Conneely, A. U. Jackson, C. P. Burrill, R. M. Watanabe, P. S. Chines, N. Narisu, L. J. Scott, S. T. Enloe, et al. Common Variants in Maturity-Onset Diabetes of the Young Genes Contribute to Risk of Type 2 Diabetes in Finns Diabetes, September 1, 2006; 55(9): 2534 - 2540. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-H. Jung and W. Jang How accurately can we control the FDR in analyzing microarray data? Bioinformatics, July 15, 2006; 22(14): 1730 - 1736. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Verducci, V. F. Melfi, S. Lin, Z. Wang, S. Roy, and C. K. Sen Microarray analysis of gene expression: considerations in data mining and statistical treatment Physiol Genomics, May 16, 2006; 25(3): 355 - 363. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Thomas Are We Ready for Genome-wide Association Studies? Cancer Epidemiol. Biomarkers Prev., April 1, 2006; 15(4): 595 - 598. [Full Text] [PDF] |
||||







