Bioinformatics Advance Access originally published online on January 12, 2006
Bioinformatics 2006 22(7):789-794; doi:10.1093/bioinformatics/btk046
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comparison of Affymetrix GeneChip expression measures
1Department of Biostatistics, Johns Hopkins University 615 N. Wolfe Street, Baltimore, MD 21205, USA
2Center for Statistical Sciences, Department of Community Health, Brown University 167 Angell Street, BOX G-H, Providence, RI 02912, USA
*To whom correspondence should be addressed.
Motivation: In the Affymetrix GeneChip system, preprocessing occurs before one obtains expression level measurements. Because the number of competing preprocessing methods was large and growing we developed a benchmark to help users identify the best method for their application. A webtool was made available for developers to benchmark their procedures. At the time of writing over 50 methods had been submitted.
Results: We benchmarked 31 probe set algorithms using a U95A dataset of spike in controls. Using this dataset, we found that background correction, one of the main steps in preprocessing, has the largest effect on performance. In particular, background correction appears to improve accuracy but, in general, worsen precision. The benchmark results put this balance in perspective. Furthermore, we have improved some of the original benchmark metrics to provide more detailed information regarding precision and accuracy. A handful of methods stand out as providing the best balance using spike-in data with the older U95A array, although different experiments on more current arrays may benchmark differently.
Availability: The affycomp package, now version 1.5.2, continues to be available as part of the Bioconductor project (http://www.bioconductor.org). The webtool continues to be available at http://affycomp.biostat.jhsph.edu
Contact: rafa{at}jhu.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Received on August 25, 2005; revised on January 5, 2006; accepted on January 5, 2006
This article has been cited by other articles:
![]() |
L. N. Singh and S. Hannenhalli Correlated changes between regulatory cis elements and condition-specific expression in paralogous gene families Nucleic Acids Res., November 19, 2009; (2009) gkp989v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Obayashi and K. Kinoshita Rank of Correlation Coefficient as a Comparable Measure for Biological Significance of Gene Coexpression DNA Res, October 1, 2009; 16(5): 249 - 260. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Cheng, K. Shen, C. Song, J. Luo, and G. C. Tseng Ratio adjustment and calibration scheme for gene-wise normalization to enhance microarray inter-study prediction Bioinformatics, July 1, 2009; 25(13): 1655 - 1661. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Fang, H. Ren, J. Liu, K. M. Cadigan, S. R. Patel, and G. R. Dressler Drosophila ptip is essential for anterior/posterior patterning in development and interacts with the PcG and trxG pathways Development, June 1, 2009; 136(11): 1929 - 1938. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. B. Langdon, G. J. G. Upton, and A. P. Harrison Probes containing runs of guanines provide insights into the biophysics and bioinformatics of Affymetrix GeneChips Brief Bioinform, May 1, 2009; 10(3): 259 - 277. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Xie, X. Wang, and M. Story Statistical methods of background correction for Illumina BeadArray data Bioinformatics, March 15, 2009; 25(6): 751 - 757. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Furusawa, N. Ono, S. Suzuki, T. Agata, H. Shimizu, and T. Yomo Model-based analysis of non-specific binding for background correction of high-density oligonucleotide microarrays Bioinformatics, January 1, 2009; 25(1): 36 - 41. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. McCall and R. A. Irizarry Consolidated strategy for the analysis of microarray spike-in data Nucleic Acids Res., October 1, 2008; 36(17): e108 - e108. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Owzar, W. T. Barry, S.-H. Jung, I. Sohn, and S. L. George Statistical Challenges in Preprocessing in Microarray Experiments in Cancer Clin. Cancer Res., October 1, 2008; 14(19): 5959 - 5966. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W.K. Ho, M. Stefani, C. G. dos Remedios, and M. A. Charleston Differential variability analysis of gene expression and its application to human diseases Bioinformatics, July 1, 2008; 24(13): i390 - i398. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Ono, S. Suzuki, C. Furusawa, T. Agata, A. Kashiwagi, H. Shimizu, and T. Yomo An improved physico-chemical model of hybridization on high-density oligonucleotide microarrays Bioinformatics, May 15, 2008; 24(10): 1278 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Irizarry, C. Ladd-Acosta, B. Carvalho, H. Wu, S. A. Brandenburg, J. A. Jeddeloh, B. Wen, and A. P. Feinberg Comprehensive high-throughput arrays for relative methylation (CHARM) Genome Res., May 1, 2008; 18(5): 780 - 790. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Brux, T.-Y. Liu, M. Krebs, Y.-D. Stierhof, J. U. Lohmann, O. Miersch, C. Wasternack, and K. Schumacher Reduced V-ATPase Activity in the trans-Golgi Network Causes Oxylipin-Dependent Hypocotyl Growth Inhibition in Arabidopsis PLANT CELL, April 1, 2008; 20(4): 1088 - 1100. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Saleh, R. Alvarez-Venegas, M. Yilmaz, O. Le, G. Hou, M. Sadder, A. Al-Abdallat, Y. Xia, G. Lu, I. Ladunga, et al. The Highly Similar Arabidopsis Homologs of Trithorax ATX1 and ATX2 Encode Proteins with Divergent Biochemical Functions PLANT CELL, March 1, 2008; 20(3): 568 - 579. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Cordero, M. Botta, and R. A. Calogero Microarray data analysis and mining approaches Brief Funct Genomic Proteomic, January 22, 2008; (2008) elm034v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Talloen, D.-A. Clevert, S. Hochreiter, D. Amaratunga, L. Bijnens, S. Kass, and H. W.H. Gohlmann I/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data Bioinformatics, November 1, 2007; 23(21): 2897 - 2902. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Harbron, K.-M. Chang, and M. C. South RefPlus: an R package extending the RMA Algorithm Bioinformatics, September 15, 2007; 23(18): 2493 - 2494. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Graze, O. Barmina, D. Tufts, E. Naderi, K. L. Harmon, M. Persianinova, and S. V. Nuzhdin New Candidate Genes for Sex-Comb Divergence Between Drosophila mauritiana and Drosophila simulans Genetics, August 1, 2007; 176(4): 2561 - 2576. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Chen, M. McGee, Q. Liu, and R. H. Scheuermann A distribution free summarization method for Affymetrix GeneChip(R) arrays Bioinformatics, February 1, 2007; 23(3): 321 - 327. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. S. Mehta, S. O. Zakharkin, G. L. Gadbury, and D. B. Allison Epistemological issues in omics and high-dimensional biology: give the people what they want Physiol Genomics, December 13, 2006; 28(1): 24 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-J. Park, J. R. Berggren, M. W. Hulver, J. A Houmard, and E. P. Hoffman GRB14, GPD1, and GDF8 as potential network collaborators in weight loss-induced improvements in insulin action in human skeletal muscle Physiol Genomics, October 11, 2006; 27(2): 114 - 121. [Abstract] [Full Text] [PDF] |
||||










