Bioinformatics Advance Access originally published online on July 14, 2005
Bioinformatics 2005 21(18):3637-3644; doi:10.1093/bioinformatics/bti583
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A tractable probabilistic model for Affymetrix probe-level analysis across multiple chips
1School of Computer Science, University of Manchester Oxford Road, Manchester M13 9PL, UK
2Department of Computer Science, University of Sheffield Regent Court 211 Portobello Street, Sheffield S1 4DP, UK
*To whom correspondence should be addressed.
Motivation: Affymetrix GeneChip® arrays are currently the most widely used microarray technology. Many summarization methods have been developed to provide gene expression levels from Affymetrix probe-level data. Most of the currently popular methods do not provide a measure of uncertainty for the expression level of each gene. The use of probabilistic models can overcome this limitation. A full hierarchical Bayesian approach requires the use of computationally intensive MCMC methods that are impractical for large datasets. An alternative computationally efficient probabilistic model, mgMOS, uses Gamma distributions to model specific and non-specific binding with a latent variable to capture variations in probe affinity. Although promising, the main limitations of this model are that it does not use information from multiple chips and does not account for specific binding to the mismatch (MM) probes.
Results: We extend mgMOS to model the binding affinity of probe-pairs across multiple chips and to capture the effect of specific binding to MM probes. The new model, multi-mgMOS, provides improved accuracy, as demonstrated on some bench-mark datasets and a real time-course dataset, and is much more computationally efficient than a competing hierarchical Bayesian approach that requires MCMC sampling. We demonstrate how the probabilistic model can be used to estimate credibility intervals for expression levels and their log-ratios between conditions.
Availability: Both mgMOS and the new model multi-mgMOS have been implemented in an R package, which is available at http://www.bioinf.man.ac.uk/resources/puma
Contact: magnus{at}cs.man.ac.uk
Received on May 5, 2005; revised on June 30, 2005; accepted on July 8, 2005
This article has been cited by other articles:
![]() |
W. K. Lim, K. Wang, C. Lefebvre, and A. Califano Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks Bioinformatics, July 1, 2007; 23(13): i282 - i288. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Stienstra, S. Mandard, D. Patsouris, C. Maass, S. Kersten, and M. Muller Peroxisome Proliferator-Activated Receptor {alpha} Protects against Obesity-Induced Hepatic Inflammation Endocrinology, June 1, 2007; 148(6): 2753 - 2763. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Purutcuoglu and E. Wit FGX: a frequentist gene expression index for Affymetrix arrays Biostat., April 1, 2007; 8(2): 433 - 437. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Liu, M. Milo, N. D Lawrence, and M. Rattray Probe-level measurement error improves accuracy in detecting differential gene expression Bioinformatics, September 1, 2006; 22(17): 2107 - 2113. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Sanguinetti, M. Rattray, and N. D. Lawrence A probabilistic dynamical model for quantitative inference of the regulatory mechanism of transcription Bioinformatics, July 15, 2006; 22(14): 1753 - 1759. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Rattray, X. Liu, G. Sanguinetti, M. Milo, and N. D. Lawrence Propagating uncertainty in microarray data analysis Brief Bioinform, March 1, 2006; 7(1): 37 - 47. |
||||
![]() |
G. Sanguinetti, M. Milo, M. Rattray, and N. D. Lawrence Accounting for probe-level noise in principal component analysis of microarray data Bioinformatics, October 1, 2005; 21(19): 3748 - 3754. [Abstract] [Full Text] [PDF] |
||||



