An integrative approach for causal gene identification and gene regulatory pathway inference
Molecular and Computational Biology Program, University of Southern California Los Angeles, USA
*To whom correspondence should be addressed.
Motivation: Gene expression variation can often be linked to certain chromosomal regions and are tightly associated with phenotypic variation such as disease conditions. Inferring the causal genes for the expression variation is of great importance but rather challenging as the linked region generally contains multiple genes. Even when a single candidate gene is proposed, the underlying biological mechanism by which the regulation is enforced remains unknown. Novel approaches are needed to both infer the causal genes and generate hypothesis on the underlying regulatory mechanisms.
Results: We propose a new approach which aims at achieving the above objectives by integrating genotype information, gene expression, protein-protein interaction, protein phosphorylation, and transcription factor (TF)DNA binding information. A network based stochastic algorithm is designed to infer the causal genes and identify the underlying regulatory pathways. We first quantitatively verified our method by a test using data generated by yeast knock-out experiments. Over 40% of inferred causal genes are correct, which is significantly better than 10% by random guess. We then applied our method to a recent genome-wide expression variation study in yeast. We show that our method can correctly identify the causal genes and effectively output experimentally verified pathways. New potential gene regulatory pathways are generated and presented as a global network.
Availability: Source code is available upon request.
Contacts: fsun{at}usc.edu
This article has been cited by other articles:
![]() |
T. M. Przytycka, M. Singh, and D. K. Slonim Toward the dynamic interactome: it's about time Brief Bioinform, January 8, 2010; (2010) bbp057v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Stojmirovic and Y.-K. Yu ITM Probe: analyzing information flow in protein networks Bioinformatics, September 15, 2009; 25(18): 2447 - 2449. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. E. Baranzini, N. W. Galwey, J. Wang, P. Khankhanian, R. Lindberg, D. Pelletier, W. Wu, B. M.J. Uitdehaag, L. Kappos, GeneMSA Consortium, et al. Pathway and network-based analysis of genome-wide association studies in multiple sclerosis Hum. Mol. Genet., June 1, 2009; 18(11): 2078 - 2090. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Iossifov, T. Zheng, M. Baron, T. C. Gilliam, and A. Rzhetsky Genetic-linkage mapping of complex hereditary disorders to a whole-genome molecular-interaction network Genome Res., July 1, 2008; 18(7): 1150 - 1162. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Ideker and R. Sharan Protein networks in disease Genome Res., April 1, 2008; 18(4): 644 - 652. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Liu, A. de la Fuente, and I. Hoeschele Gene Network Inference via Structural Equation Modeling in Genetical Genomics Experiments Genetics, March 1, 2008; 178(3): 1763 - 1776. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Cakmak and G. Ozsoyoglu Mining biological networks for unknown pathways Bioinformatics, October 15, 2007; 23(20): 2775 - 2783. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bao, J. L. Peirce, M. Zhou, H. Li, D. Goldowitz, R. W. Williams, L. Lu, and Y. Cui An integrative genomics strategy for systematic characterization of genetic loci modulating phenotypes Hum. Mol. Genet., June 1, 2007; 16(11): 1381 - 1390. [Abstract] [Full Text] [PDF] |
||||




