Skip Navigation


Bioinformatics Advance Access originally published online on August 4, 2009
Bioinformatics 2009 25(21):2802-2808; doi:10.1093/bioinformatics/btp476
This Article
Right arrow Full Text
Right arrow Full Text (Print PDF)
Right arrow Supplementary Data
Right arrow All Versions of this Article:
25/21/2802    most recent
btp476v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Wei, Z.
Right arrow Articles by Hakonarson, H.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wei, Z.
Right arrow Articles by Hakonarson, H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Multiple testing in genome-wide association studies via hidden Markov models

Zhi Wei 1,*, Wenguang Sun 2, Kai Wang 3 and Hakon Hakonarson 3,4

1 Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, 2 Department of Statistics, North Carolina State University, Raleigh, NC 27695, 3 Center for Applied Genomics, The Children's Hospital of Philadelphia and 4 Division of Genetics, Department of Pediatrics, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA

* To whom correspondence should be addressed.


   Abstract

Motivation: Genome-wide association studies (GWAS) interrogate common genetic variation across the entire human genome in an unbiased manner and hold promise in identifying genetic variants with moderate or weak effect sizes. However, conventional testing procedures, which are mostly P-value based, ignore the dependency and therefore suffer from loss of efficiency. The goal of this article is to exploit the dependency information among adjacent single nucleotide polymorphisms (SNPs) to improve the screening efficiency in GWAS.

Results: We propose to model the linear block dependency in the SNP data using hidden Markov models (HMMs). A compound decision–theoretic framework for testing HMM-dependent hypotheses is developed. We propose a powerful data-driven procedure [pooled local index of significance (PLIS)] that controls the false discovery rate (FDR) at the nominal level. PLIS is shown to be optimal in the sense that it has the smallest false negative rate (FNR) among all valid FDR procedures. By re-ranking significance for all SNPs with dependency considered, PLIS gains higher power than conventional P-value based methods. Simulation results demonstrate that PLIS dominates conventional FDR procedures in detecting disease-associated SNPs. Our method is applied to analysis of the SNP data from a GWAS of type 1 diabetes. Compared with the Benjamini–Hochberg (BH) procedure, PLIS yields more accurate results and has better reproducibility of findings.

Conclusion: The genomic rankings based on our procedure are substantially different from the rankings based on the P-values. By integrating information from adjacent locations, the PLIS rankings benefit from the increased signal-to-noise ratio, hence our procedure often has higher statistical power and better reproducibility. It provides a promising direction in large-scale GWAS.

Availability: An R package PLIS has been developed to implement the PLIS procedure. Source codes are available upon request and will be available on CRAN (http://cran.r-project.org/).

Contact: zhiwei{at}njit.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Associate Editor: Martin Bishop


Received on June 6, 2009; revised on July 17, 2009; accepted on July 31, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.