Graph theoretical approach to study eQTL: a case study of Plasmodium falciparum

1National Center for Biotechnology Information, NLM, NIH, 8600 Rockville Pike, Building 38A, Bethesda, MD 20894, 2Northwestern Institute on Complexity, Northwestern University, 600 Foster Street, Evanston, IL 60201 and 3Eck Institute for Global Health, Department of Biological Sciences, University of Notre Dame, 107 Galvin Life Sciences, Notre Dame, IN 46556, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Analysis of expression quantitative trait loci (eQTL) significantly contributes to the determination of gene regulation programs. However, the discovery and analysis of associations of gene expression levels and their underlying sequence polymorphisms continue to pose many challenges. Methods are limited in their ability to illuminate the full structure of the eQTL data. Most rely on an exhaustive, genome scale search that considers all possible locus–gene pairs and tests the linkage between each locus and gene.
Result: To analyze eQTLs in a more comprehensive and efficient way, we developed the Graph based eQTL Decomposition method (GeD) that allows us to model genotype and expression data using an eQTL association graph. Through graph-based heuristics, GeD identifies dense subgraphs in the eQTL association graph. By identifying eQTL association cliques that expose the hidden structure of genotype and expression data, GeD effectively filters out most locus–gene pairs that are unlikely to have significant linkage. We apply GeD on eQTL data from Plasmodium falciparum, the human malaria parasite, and show that GeD reveals the structure of the relationship between all loci and all genes on a whole genome level. Furthermore, GeD allows us to uncover additional eQTLs with lower FDR, providing an important complement to traditional eQTL analysis methods.
Contact: przytyck{at}ncbi.nlm.nih.gov
Present address: NOB, NCI, NIH, 37 Convent Drive 1142E, Bethesda, MD 20892, USA.