Bioinformatics Advance Access originally published online on May 5, 2007
Bioinformatics 2007 23(12):1537-1544; doi:10.1093/bioinformatics/btm129
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A Markov random field model for network-based analysis of genomic data
1Genomics and Computational Biology Graduate Group and 2Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: A central problem in genomic research is the identification of genes and pathways involved in diseases and other biological processes. The genes identified or the univariate test statistics are often linked to known biological pathways through gene set enrichment analysis in order to identify the pathways involved. However, most of the procedures for identifying differentially expressed (DE) genes do not utilize the known pathway information in the phase of identifying such genes. In this article, we develop a Markov random field (MRF)-based method for identifying genes and subnetworks that are related to diseases. Such a procedure models the dependency of the DE patterns of genes on the networks using a local discrete MRF model.
Results: Simulation studies indicated that the method is quite effective in identifying genes and subnetworks that are related to disease and has higher sensitivity and lower false discovery rates than the commonly used procedures that do not use the pathway structure information. Applications to two breast cancer microarray gene expression datasets identified several subnetworks on several of the KEGG transcriptional pathways that are related to breast cancer recurrence or survival due to breast cancer.
Conclusions: The proposed MRF-based model efficiently utilizes the known pathway structures in identifying the DE genes and the subnetworks that might be related to phenotype. As more biological networks are identified and documented in databases, the proposed method should find more applications in identifying the subnetworks that are related to diseases and other biological processes.
Contact: hongzhe{at}mail.med.upenn.edu or hli{at}cceb.upenn.edu
Associate Editor: Olga Troyanskaya
Received on February 1, 2007; revised on March 26, 2007; accepted on March 27, 2007
This article has been cited by other articles:
![]() |
W. Pan Network-based multiple locus linkage analysis of expression traits Bioinformatics, June 1, 2009; 25(11): 1390 - 1396. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Zhang, H. Li, R. B. Riggins, M. Zhan, J. Xuan, Z. Zhang, E. P. Hoffman, R. Clarke, and Y. Wang Differential dependency network analysis to identify condition-specific topological changes in biological networks Bioinformatics, February 15, 2009; 25(4): 526 - 532. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Noirel, G. Sanguinetti, and P. C. Wright Identifying differentially expressed subnetworks with MMG Bioinformatics, December 1, 2008; 24(23): 2792 - 2793. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Li and H. Li Network-constrained regularization and variable selection for analysis of genomic data Bioinformatics, May 1, 2008; 24(9): 1175 - 1182. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Sanguinetti, J. Noirel, and P. C. Wright MMG: a probabilistic tool to identify submodules of metabolic pathways Bioinformatics, April 15, 2008; 24(8): 1078 - 1084. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Noirel, S. Y. Ow, G. Sanguinetti, A. Jaramillo, and P. C. Wright Automated extraction of meaningful pathways from quantitative proteomics data Brief Funct Genomic Proteomic, March 7, 2008; (2008) eln011v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Wei and W. Pan Incorporating gene networks into statistical tests for genomic data via a spatially correlated mixture model Bioinformatics, February 1, 2008; 24(3): 404 - 411. [Abstract] [Full Text] [PDF] |
||||

