Bioinformatics Advance Access originally published online on June 19, 2009
Bioinformatics 2009 25(17):2236-2243; doi:10.1093/bioinformatics/btp376
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Reconstruct modular phenotype-specific gene networks by knowledge-driven matrix factorization
1 Department of Chemical Engineering and Materials Science, 2 Department of Biochemistry and Molecular Biology and 3 Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
* To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Reconstructing gene networks from microarray data has provided mechanistic information on cellular processes. A popular structure learning method, Bayesian network inference, has been used to determine network topology despite its shortcomings, i.e. the high-computational cost when analyzing a large number of genes and the inefficiency in exploiting prior knowledge, such as the co-regulation information of the genes. To address these limitations, we are introducing an alternative method, knowledge-driven matrix factorization (KMF) framework, to reconstruct phenotype-specific modular gene networks.
Results: Considering the reconstruction of gene network as a matrix factorization problem, we first use the gene expression data to estimate a correlation matrix, and then factorize the correlation matrix to recover the gene modules and the interactions between them. Prior knowledge from Gene Ontology is integrated into the matrix factorization. We applied this KMF algorithm to hepatocellular carcinoma (HepG2) cells treated with free fatty acids (FFAs). By comparing the module networks for the different conditions, we identified the specific modules that are involved in conferring the cytotoxic phenotype induced by palmitate. Further analysis of the gene modules of the different conditions suggested individual genes that play important roles in palmitate-induced cytotoxicity. In summary, KMF can efficiently integrate gene expression data with prior knowledge, thereby providing a powerful method of reconstructing phenotype-specific gene networks and valuable insights into the mechanisms that govern the phenotype.
Contact: krischan{at}msu.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Received on March 9, 2009; revised on May 19, 2009; accepted on June 9, 2009