Bioinformatics Advance Access published online on June 28, 2007
Bioinformatics, doi:10.1093/bioinformatics/btm327
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Detection of eQTL modules mediated by activity levels of transcription factors
aDepartment of Statistics, University of California at Los Angeles, Los Angeles, California, USA. bDepartment of Biostatistics, Emory University, Atlanta, Georgia, USA. cInstitute of StatisticalScience, Academia Sinica, Taipei, Taiwan, R.O.C.
*To whom correspondence should be addressed. Ker-Chau Li, E-mail: kcli{at}stat.ucla.edu
| Abstract |
|---|
Motivation: Studies of gene expression quantitative trait loci (eQTL) in different organisms have shown the existence of eQTL hot spots (Rockman et al., 2006): each being a small segment of DNA sequence that harbors the eQTL of a large number of genes. Two questions of great interest about eQTL hot spots arise: (1) Which gene within the hot spot is responsible for the linkages, i.e. which gene is the quantitative trait gene (QTG)? (2) How does a QTG affect the expression levels of many genes linked to it? Answers to the first question can be offered by available biological evidence or by statistical methods (Bing et al., 2005; Kulp et al., 2006). The second question is harder to address. One simple situation is that the QTG encodes a transcription factor (TF), which regulates the expression of genes linked to it. However, previous results (Yvert et al., 2003) have shown that TFs are not overrepresented in the eQTL hot spots. In this paper, we consider the scenario that the propagation of genetic perturbation from a QTG to other linked genes is mediated by the TF activity. We develop a procedure to detect the eQTL modules (eQTL hot spots together with linked genes) that are compatible with this scenario.
Results: We first detect 27 eQTL modules from a yeast eQTL data (Brem et al., 2005a,b), and estimate TF activity profiles using the method of Yu et al. (2005). Then likelihood ratio tests (LRTs) are conducted to find 760 relationships supporting the scenario of TF activity mediation: (DNA polymorphism
cis-linked gene
TF activity
downstream linked gene). They are organized into 4 eQTL modules: an amino acid synthesis module featuring a cis-linked gene LEU2 and the mediating TF Leu3; a pheromone response module featuring a cis-linked gene GPA1 and the mediating TF Ste12; an energysource control module featuring two cis-linked genes, GSY2 and HAP1, and the mediating TF Hap1; a mitotic exit module featuring 4 cis-linked genes, AMN1, CSH1, DEM1 and TOS1, and the mediating TF complex Ace2/Swi5. Gene ontology is utilized to reveal interesting functional groups of the downstream genes in each module.
Availability: Our methods are implemented in an R package: eqtl.TF, which includes source codes and relevant data. It can be freely downloaded at http://www.stat.ucla.edu/~sunwei/software.htm
Supplementary Materials: http://www.stat.ucla.edu/~sunwei/yeast_eQTL_TF/supplemetary.pdf
Associate Editor: Dr. Limsoon Wong
Received on December 23, 2006; revised on June 17, 2007; accepted on June 17, 2007
This article has been cited by other articles:
![]() |
W. Sun, F. A. Wright, Z. Tang, S. H. Nordgard, P. V. Loo, T. Yu, V. N. Kristensen, and C. M. Perou Integrated study of copy number states and genotype calls using high-density SNP arrays Nucleic Acids Res., September 1, 2009; 37(16): 5365 - 5377. [Abstract] [Full Text] [PDF] |
||||
