Bioinformatics Advance Access published online on July 17, 2008
Bioinformatics, doi:10.1093/bioinformatics/btn373
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Considering dependence among genes and markers for false discovery control in eQTL mapping
1 Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, California, U.S.A.
2 Department of Applied Mathematics, University of Colorado, Boulder, Colorado, U.S.A.
3 Department of Epidemiology and Public Health, Yale University, New Haven, Connecticut, U.S.A.
4 Department of Genetics, Yale University, New Haven, Connecticut, U.S.A.
* To whom correspondence should be addressed. Dr. Liang Chen, Hongyu Zhao, E-mail: liang.chen{at}usc.edu, hongyu.zhao{at}yale.edu
| Abstract |
|---|
Motivation: Multiple comparison adjustment is a significant and challenging statistical issue in large-scale biological studies. In previous studies, dependence among genes is largely ignored. However, such dependence may be strong for some genomic-scale studies such as genetical genomics (also called expression Quantitative Trait Loci (eQTL) mapping) in which thousands of genes are treated as quantitative traits and mapped to different genetical markers. Besides the dependence among markers, the dependence among the expression levels of genes can also have a significant impact on data analysis and interpretation.
Results: In this paper, we propose to consider both the mean as well as the variance of false discovery number for multiple comparison adjustment to handle dependence among hypotheses. This is achieved by developing a variance estimator for false discovery number, and using the upper bound of false discovery proportion (uFDP) for false discovery control. More importantly, we introduce a weighted version of uFDP (wuFDP) control to improve the statistical power of eQTL identification. In addition, the wuFDP approach can better control false positives than false discovery rate (FDR) and uFDP approaches when markers are in linkage disequilibrium. The relative performance of uFDP control and wuFDP control is illustrated through simulation studies and real data analysis.
Contacts: liang.chen{at}usc.edu, hongyu.zhao{at}yale.edu
Supplementary information: Supplementary figures, tables, and appendices are available at Bioinformatics online.
Associate Editor: Dr. Joaquin Dopazo
Received on January 15, 2008; revised on July 4, 2008; accepted on July 15, 2008