Bioinformatics Advance Access originally published online on March 25, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(13) © Oxford University Press 2004; all rights reserved.
Bayesian hierarchical error model for analysis of gene expression data
Division of Biostatistics and Epidemiology, Department of Health Evaluation Sciences, University of Virginia School of Medicine, Hospital West Complex, Room 3181, P.O. Box 800717, Charlottesville, VA 22908-0717, USA
Received on June 10, 2003; revised on February 17, 2004; accepted on February 27, 2004
Advance Access Publication March 25, 2004
Motivation: Analysis of genome-wide microarray data requires the estimation of a large number of genetic parameters for individual genes and their interaction expression patterns under multiple biological conditions. The sources of microarray error variability comprises various biological and experimental factors, such as biological and individual replication, sample preparation, hybridization and image processing. Moreover, the same gene often shows quite heterogeneous error variability under different biological and experimental conditions, which must be estimated separately for evaluating the statistical significance of differential expression patterns. Widely used linear modeling approaches are limited because they do not allow simultaneous modeling and inference on the large number of these genetic parameters and heterogeneous error components on different genes, different biological and experimental conditions, and varying intensity ranges in microarray data.
Results: We propose a Bayesian hierarchical error model (HEM) to overcome the above restrictions. HEM accounts for heterogeneous error variability in an oligonucleotide microarray experiment. The error variability is decomposed into two components (experimental and biological errors) when both biological and experimental replicates are available. Our HEM inference is based on Markov chain Monte Carlo to estimate a large number of parameters from a single-likelihood function for all genes. An F-like summary statistic is proposed to identify differentially expressed genes under multiple conditions based on the HEM estimation. The performance of HEM and its F-like statistic was examined with simulated data and two published microarray datasetsprimate brain data and mouse B-cell development data. HEM was also compared with ANOVA using simulated data.
Availability: The software for the HEM is available from the authors upon request.
Contact: hcho{at}virginia.edu; jaeklee{at}virginia.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Murie and R. Nadon A correction for estimating error when using the Local Pooled Error Statistical Test Bioinformatics, August 1, 2008; 24(15): 1735 - 1736. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Park, Y. Kim, S. Bekiranov, and J. K. Lee Error-pooling-based statistical methods for identifying novel temporal replication profiles of human chromosomes observed by DNA tiling arrays Nucleic Acids Res., May 14, 2007; 35(9): e69 - e69. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. Cho, P. Shashkin, C. A. Gleissner, D. Dunson, N. Jain, J. K. Lee, Y. Miller, and K. Ley Induction of dendritic cell-like phenotype in macrophages during foam cell formation Physiol Genomics, April 24, 2007; 29(2): 149 - 160. [Abstract] [Full Text] [PDF] |
||||
![]() |
X.-L. Wu, L. J. Forney, and P. Joyce Comments on 'Bayesian hierarchical error model for analysis of gene expression data' Bioinformatics, October 1, 2006; 22(19): 2446 - 2451. [Full Text] [PDF] |
||||
![]() |
H. Cho and J. K. Lee Response to comments on 'Bayesian Hierarchical Error Model for Analysis of Gene Expression Data' Bioinformatics, October 1, 2006; 22(19): 2452 - 2452. [Full Text] [PDF] |
||||


