Bioinformatics Advance Access originally published online on October 12, 2004
Bioinformatics 2005 21(6):754-764; doi:10.1093/bioinformatics/bti062
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
An empirical Bayes approach to inferring large-scale gene association networks
Department of Statistics, University of Munich Ludwigstrasse 33, D-80539 Munich, Germany
*To whom correspondence should be addressed.
Motivation: Genetic networks are often described statistically using graphical models (e.g. Bayesian networks). However, inferring the network structure offers a serious challenge in microarray analysis where the sample size is small compared to the number of considered genes. This renders many standard algorithms for graphical models inapplicable, and inferring genetic networks an ill-posed inverse problem.
Methods: We introduce a novel framework for small-sample inference of graphical models from gene expression data. Specifically, we focus on the so-called graphical Gaussian models (GGMs) that are now frequently used to describe gene association networks and to detect conditionally dependent genes. Our new approach is based on (1) improved (regularized) small-sample point estimates of partial correlation, (2) an exact test of edge inclusion with adaptive estimation of the degree of freedom and (3) a heuristic network search based on false discovery rate multiple testing. Steps (2) and (3) correspond to an empirical Bayes estimate of the network topology.
Results: Using computer simulations, we investigate the sensitivity (power) and specificity (true negative rate) of the proposed framework to estimate GGMs from microarray data. This shows that it is possible to recover the true network topology with high accuracy even for small-sample datasets. Subsequently, we analyze gene expression data from a breast cancer tumor study and illustrate our approach by inferring a corresponding large-scale gene association network for 3883 genes.
Availability: The authors have implemented the approach in the R package GeneTS that is freely available from http://www.stat.uni-muenchen.de/~strimmer/genets/, from the R archive (CRAN) and from the Bioconductor website.
Contact: korbinian.strimmer{at}lmu.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J.E. Schmitt, R.K. Lenroot, G.L. Wallace, S. Ordaz, K.N. Taylor, N. Kabani, D. Greenstein, J.P. Lerch, K.S. Kendler, M.C. Neale, et al. Identification of Genetically Mediated Cortical Networks: A Multivariate Study of Pediatric Twins and Siblings Cereb Cortex, August 1, 2008; 18(8): 1737 - 1747. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zampieri, N. Soranzo, and C. Altafini Discerning static and causal interactions in genome-wide reverse engineering problems Bioinformatics, July 1, 2008; 24(13): 1510 - 1515. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Strimmer fdrtool: a versatile R package for estimating local and tail area-based false discovery rates Bioinformatics, June 15, 2008; 24(12): 1461 - 1462. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Chaibub Neto, C. T. Ferrara, A. D. Attie, and B. S. Yandell Inferring Causal Phenotype Networks From Segregating Populations Genetics, June 1, 2008; 179(2): 1089 - 1100. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. P. Keller, Y. Choi, P. Wang, D. Belt Davis, M. E. Rabaglia, A. T. Oler, D. S. Stapleton, C. Argmann, K. L. Schueler, S. Edwards, et al. A gene expression network model of type 2 diabetes links cell cycle regulation in islets with diabetes susceptibility Genome Res., May 1, 2008; 18(5): 706 - 716. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-L. Chuang, C.-H. Jen, C.-M. Chen, and G. S. Shieh A pattern recognition approach to infer time-lagged genetic interactions Bioinformatics, May 1, 2008; 24(9): 1183 - 1190. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Pihur, S. Datta, and S. Datta Reconstruction of genetic association networks from microarray data: a partial least squares approach Bioinformatics, February 15, 2008; 24(4): 561 - 568. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ma, Q. Gong, and H. J. Bohnert An Arabidopsis gene network based on the graphical Gaussian model Genome Res., November 1, 2007; 17(11): 1614 - 1625. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Fujita, J.R. Sato, H.M. Garay-Malpartida, P.A. Morettin, M.C. Sogayar, and C.E. Ferreira Time-varying modeling of gene expression regulatory networks using the wavelet dynamic vector autoregressive method Bioinformatics, July 1, 2007; 23(13): 1623 - 1630. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Soranzo, G. Bianconi, and C. Altafini Comparing association network algorithms for reverse engineering of large-scale gene regulatory networks: synthetic versus real data Bioinformatics, July 1, 2007; 23(13): 1640 - 1647. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Wilkinson Bayesian methods in bioinformatics and computational systems biology Brief Bioinform, April 12, 2007; (2007) bbm007v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Mukhopadhyay and S. Chatterjee Causality and pathway search in microarray time series experiment Bioinformatics, February 15, 2007; 23(4): 442 - 449. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Li and S. Horvath Network neighborhood analysis with the multi-node topological overlap measure Bioinformatics, January 15, 2007; 23(2): 222 - 231. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. R. Lezon, J. R. Banavar, M. Cieplak, A. Maritan, and N. V. Fedoroff From the Cover: Using the principle of entropy maximization to infer genetic interaction networks from gene expression patterns PNAS, December 12, 2006; 103(50): 19033 - 19038. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Werhli, M. Grzegorczyk, and D. Husmeier Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks Bioinformatics, October 15, 2006; 22(20): 2523 - 2531. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Wiuf, M. Brameier, O. Hagberg, and M. P. H. Stumpf A likelihood approach to analysis of network data PNAS, May 16, 2006; 103(20): 7566 - 7570. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li and J. Gui Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks Biostat., April 1, 2006; 7(2): 302 - 317. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Larranaga, B. Calvo, R. Santana, C. Bielza, J. Galdiano, I. Inza, J. A. Lozano, R. Armananzas, G. Santafe, A. Perez, et al. Machine learning in bioinformatics Brief Bioinform, March 1, 2006; 7(1): 86 - 112. [Abstract] [Full Text] [PDF] |
||||






