Bioinformatics Advance Access originally published online on August 25, 2007
Bioinformatics 2007 23(20):2700-2707; doi:10.1093/bioinformatics/btm412
A comparison of background correction methods for two-colour microarrays
1Department of Oncology, University of Cambridge, CRUK Cambridge Research Institute, Li Ka Shing Centre, Robinson Way, Cambridge CB2 0RE, UK, 2Bioinformatics Division, 3Immunology Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3050 and 4The Peter MacCallum Cancer Centre, St Andrews Place, East Melbourne, Victoria 3002, Australia
*To whom correspondence should be addressed.
Abstract
Motivation: Microarray data must be background corrected to remove the effects of non-specific binding or spatial heterogeneity across the array, but this practice typically causes other problems, such as negative corrected intensities and high variability of low-intensity log-ratios. This study compares different estimators of background, and various model-based processing methods, in search of the best option for differential expression analyses of small microarray experiments.
Results: Using data where some independent truth in gene expression is known, eight different background correction alternatives are compared, in terms of precision and bias of the resulting gene expression measures, and in terms of their ability to detect differentially expressed genes as judged by two popular algorithms, SAM and limma eBayes. A new background processing method (normexp) is introduced which is based on a convolution model. The model-based correction methods are shown to be markedly superior to the usual practice of subtracting local background estimates. Methods which stabilize the variances of the log-ratios along the intensity range perform the best. The normexp+offset method is found to give the lowest false discovery rate overall, followed by morph and vsn. Like vsn, normexp is applicable to most types of two-colour microarray data.
Availability: The background correction methods compared in this article are available in the R package limma (Smyth, 2005) from http://www.bioconductor.org.
Contact: smyth{at}wehi.edu.au
Supplementary information: Supplementary data are available from http://bioinf.wehi.edu.au/resources/webReferences.html.
Received on April 16, 2007; revised on July 20, 2007; accepted on August 9, 2007
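The convolution idea behind normexp, as described in the Results, can be illustrated with a small sketch. It assumes the usual normal-plus-exponential form of the model: the observed intensity is the sum of an exponentially distributed true signal (mean `alpha`) and normally distributed background (mean `mu`, standard deviation `sigma`), and the corrected intensity is the expected signal given the observation. The function names, the example parameter values and the default offset of 50 are illustrative assumptions, not values taken from this abstract, and parameter estimation is not shown. This is not the limma implementation.

```python
import math


def normexp_signal(x, mu, sigma, alpha):
    """Expected signal given observed intensity x under an assumed
    normal (background) + exponential (signal) convolution model.

    mu, sigma: mean and SD of the normal background (assumed known here).
    alpha: mean of the exponential signal (assumed known here).
    Not numerically hardened: for x very far below mu the normal CDF
    underflows to zero and the division fails.
    """
    # Given x, the signal follows a normal distribution with this mean
    # and SD sigma, truncated to positive values.
    mu_sf = x - mu - sigma ** 2 / alpha
    z = mu_sf / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)  # standard normal density
    cdf = 0.5 * math.erfc(-z / math.sqrt(2.0))               # standard normal CDF
    # Mean of the truncated normal; always positive.
    return mu_sf + sigma * pdf / cdf


def log_ratio(red, green, offset=50.0):
    """log2 ratio after adding a small positive offset to both channels.

    The offset damps the variability of low-intensity log-ratios;
    the default of 50 is an illustrative choice.
    """
    return math.log2((red + offset) / (green + offset))


if __name__ == "__main__":
    mu, sigma, alpha = 100.0, 10.0, 1000.0  # hypothetical parameter values
    for x in (80.0, 120.0, 1000.0):
        print(x, normexp_signal(x, mu, sigma, alpha))
```

Unlike plain subtraction, the corrected value stays positive even when the observed intensity falls below the background mean, so log-ratios are always defined. In limma itself the correction is available through `backgroundCorrect` with `method="normexp"`.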