Bioinformatics 20(3) © Oxford University Press 2004; all rights reserved.
A benchmark for Affymetrix GeneChip expression measures
1 Department of Mathematical Sciences, Johns Hopkins University, 104 Whitehead Hall, 3400 North Charles Street, Baltimore, MD 21218, USA, 2 Department of Biostatistics, Johns Hopkins University, 615 N. Wolfe Street, Baltimore, MD 21205, USA and 3 Department of Statistics, University of California, Berkeley, 367 Evans Hall, Berkeley, CA 94720, USA
Received on May 9, 2003
; revised on July 17, 2003
; accepted on August 3, 2003
Motivation: The defining feature of oligonucleotide expression arrays is the use of several probes to assay each targeted transcript. This is a bonanza for the statistical geneticist, who can create probeset summaries with specific characteristics. There are now several methods available for summarizing probe level data from the popular Affymetrix GeneChips, but it is difficult to identify the best method for a given inquiry.
Results: We have developed a graphical tool to evaluate summaries of Affymetrix probe level data. Plots and summary statistics offer a picture of how an expression measure performs in several important areas. This picture facilitates the comparison of competing expression measures and the selection of methods suitable for a specific investigation. The key is a benchmark data set consisting of a dilution study and a spike-in study. Because the truth is known for these data, we can identify statistical features of the data for which the expected outcome is known in advance. Those features highlighted in our suite of graphs are justified by questions of biological interest and motivated by the presence of appropriate data.
Availability: In conjunction with the release of a graphics toolbox as part of the Bioconductor project (http://www.bioconductor.org), a webtool is available at http://affycomp.biostat.jhsph.edu. Supplemental material is available at http://www.biostat.jhsph.edu/~ririzarr/papers/suppaffycomp.pdf
Contact: rafa{at}jhu.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Whiteford, T. Skelly, C. Curtis, M. E. Ritchie, A. Lohr, A. W. Zaranek, I. Abnizova, and C. Brown Swift: primary data analysis for the Illumina Solexa sequencing platform Bioinformatics, September 1, 2009; 25(17): 2194 - 2199. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ma, J. C. Miller, H. Crandall, E. T. Larsen, D. M. Dunn, R. B. Weiss, M. Subramanian, J. H. Weis, J. F. Zachary, C. Teuscher, et al. Interval-Specific Congenic Lines Reveal Quantitative Trait Loci with Penetrant Lyme Arthritis Phenotypes on Chromosomes 5, 11, and 12 Infect. Immun., August 1, 2009; 77(8): 3302 - 3311. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. B. Langdon, G. J. G. Upton, and A. P. Harrison Probes containing runs of guanines provide insights into the biophysics and bioinformatics of Affymetrix GeneChips Brief Bioinform, May 1, 2009; 10(3): 259 - 277. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. J. G. Upton, O. Sanchez-Graillet, J. Rowsell, J. M. Arteaga-Salas, N. S. Graham, M. A. Stalteri, F. N. Memon, S. T. May, and A. P. Harrison On the causes of outliers in Affymetrix GeneChip data Brief Funct Genomic Proteomic, May 1, 2009; 8(3): 199 - 212. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sarkar, R. Parkin, S. Wyman, A. Bendoraite, C. Sather, J. Delrow, A. K. Godwin, C. Drescher, W. Huber, R. Gentleman, et al. Quality Assessment and Data Analysis for microRNA Expression Arrays Nucleic Acids Res., February 1, 2009; 37(2): e17 - e17. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Furusawa, N. Ono, S. Suzuki, T. Agata, H. Shimizu, and T. Yomo Model-based analysis of non-specific binding for background correction of high-density oligonucleotide microarrays Bioinformatics, January 1, 2009; 25(1): 36 - 41. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. McCall and R. A. Irizarry Consolidated strategy for the analysis of microarray spike-in data Nucleic Acids Res., October 1, 2008; 36(17): e108 - e108. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Owzar, W. T. Barry, S.-H. Jung, I. Sohn, and S. L. George Statistical Challenges in Preprocessing in Microarray Experiments in Cancer Clin. Cancer Res., October 1, 2008; 14(19): 5959 - 5966. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. H. Gumus, B. Du, A. Kacker, J. O. Boyle, J. M. Bocker, P. Mukherjee, K. Subbaramaiah, A. J. Dannenberg, and H. Weinstein Effects of Tobacco Smoke on Gene Expression and Cellular Pathways in a Cellular Model of Oral Leukoplakia Cancer Prevention Research, July 1, 2008; 1(2): 100 - 111. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W.K. Ho, M. Stefani, C. G. dos Remedios, and M. A. Charleston Differential variability analysis of gene expression and its application to human diseases Bioinformatics, July 1, 2008; 24(13): i390 - i398. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Ono, S. Suzuki, C. Furusawa, T. Agata, A. Kashiwagi, H. Shimizu, and T. Yomo An improved physico-chemical model of hybridization on high-density oligonucleotide microarrays Bioinformatics, May 15, 2008; 24(10): 1278 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Stoecklin, S. A. Tenenbaum, T. Mayo, S. V. Chittur, A. D. George, T. E. Baroni, P. J. Blackshear, and P. Anderson Genome-wide Analysis Identifies Interleukin-10 mRNA as Target of Tristetraprolin J. Biol. Chem., April 25, 2008; 283(17): 11689 - 11699. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Xu and X. Cui Robustified MANOVA with applications in detecting differentially expressed genes from oligonucleotide arrays Bioinformatics, April 15, 2008; 24(8): 1056 - 1062. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Pearson, T. J. Robinson, M. J. Munoz, A. R. Kornblihtt, and M. A. Garcia-Blanco Identification of the Cellular Targets of the Transcription Factor TCERG1 Reveals a Prevalent Role in mRNA Processing J. Biol. Chem., March 21, 2008; 283(12): 7949 - 7961. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Arteaga-Salas, H. Zuzan, W. B. Langdon, G. J. G. Upton, and A. P. Harrison An overview of image-processing methods for Affymetrix GeneChips Brief Bioinform, January 1, 2008; 9(1): 25 - 33. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Raghavan, A. M. I. M. De Bondt, W. Talloen, D. Moechars, H. W. H. Gohlmann, and D. Amaratunga The high-level similarity of some disparate gene expression measures Bioinformatics, November 15, 2007; 23(22): 3032 - 3038. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. K. Lim, K. Wang, C. Lefebvre, and A. Califano Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks Bioinformatics, July 1, 2007; 23(13): i282 - i288. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Sadri-Vakili, B. Bouzou, C. L. Benn, M.-O. Kim, P. Chawla, R. P. Overland, K. E. Glajch, E. Xia, Z. Qiu, S. M. Hersch, et al. Histones associated with downregulated genes are hypo-acetylated in Huntington's disease models Hum. Mol. Genet., June 1, 2007; 16(11): 1293 - 1306. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Oshlack, A. E. Chabot, G. K. Smyth, and Y. Gilad Using DNA microarrays to study gene expression in closely related species Bioinformatics, May 15, 2007; 23(10): 1235 - 1242. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Lo and R. Gottardo Flexible empirical Bayes models for differential gene expression Bioinformatics, February 1, 2007; 23(3): 328 - 335. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Chen, M. McGee, Q. Liu, and R. H. Scheuermann A distribution free summarization method for Affymetrix GeneChip(R) arrays Bioinformatics, February 1, 2007; 23(3): 321 - 327. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. S. Mehta, S. O. Zakharkin, G. L. Gadbury, and D. B. Allison Epistemological issues in omics and high-dimensional biology: give the people what they want Physiol Genomics, December 13, 2006; 28(1): 24 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yoon, Y. Yang, J. Choi, and J. Seong Large scale data mining approach for gene-specific standardization of microarray gene expression data Bioinformatics, December 1, 2006; 22(23): 2898 - 2904. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Oberg, D. W. Mahoney, K. V. Ballman, and T. M. Therneau Joint estimation of calibration and expression for high-density oligonucleotide arrays Bioinformatics, October 1, 2006; 22(19): 2381 - 2387. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Goldstein Partition resampling and extrapolation averaging: approximation methods for quantifying gene expression in large numbers of short oligonucleotide arrays Bioinformatics, October 1, 2006; 22(19): 2364 - 2372. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Abdueva, D. Skvortsov, and S. Tavare Non-linear analysis of GeneChip arrays Nucleic Acids Res., September 10, 2006; 34(15): e105 - e105. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Hughes, J. F. Ayroles, M. M. Reedy, J. M. Drnevich, K. C. Rowe, E. A. Ruedi, C. E. Caceres, and K. N. Paige Segregating Variation in the Transcriptome: Cis Regulation and Additivity of Effects Genetics, July 1, 2006; 173(3): 1347 - 1355. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Khare, A. H. Taylor, J. C. Konje, and S. C. Bell {Delta}9-Tetrahydrocannabinol inhibits cytotrophoblast cell proliferation and modulates gene transcription Mol. Hum. Reprod., May 1, 2006; 12(5): 321 - 333. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hochreiter, D.-A. Clevert, and K. Obermayer A new summarization method for affymetrix probe level data Bioinformatics, April 15, 2006; 22(8): 943 - 949. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Irizarry, Z. Wu, and H. A. Jaffee Comparison of Affymetrix GeneChip expression measures Bioinformatics, April 1, 2006; 22(7): 789 - 794. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Rattray, X. Liu, G. Sanguinetti, M. Milo, and N. D. Lawrence Propagating uncertainty in microarray data analysis Brief Bioinform, March 1, 2006; 7(1): 37 - 47. |
||||
![]() |
D. Chowdary, J. Lathrop, J. Skelton, K. Curtin, T. Briggs, Y. Zhang, J. Yu, Y. Wang, and A. Mazumder Prognostic Gene Expression Signatures Can Be Measured in Tissues Collected in RNAlater Preservative J. Mol. Diagn., February 1, 2006; 8(1): 31 - 39. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. de Ridder, F. J. T. Staal, J. J. M. van Dongen, and M. J. T. Reinders Maximum significance clustering of oligonucleotide microarrays Bioinformatics, February 1, 2006; 22(3): 326 - 331. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Jakobsen,, L. R. Poulsen,, A. Schulz,, P. Fleurat-Lessard,, A. Moller,, S. Husted,, M. Schiott,, A. Amtmann,, and M. G. Palmgren, Pollen development and fertilization in Arabidopsis is dependent on the MALE GAMETOGENESIS IMPAIRED ANTHERS gene encoding a Type V P-type ATPase Genes & Dev., November 15, 2005; 19(22): 2757 - 2769. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Lin and W. A. Kibbe Irrational Exuberance in Clinical Proteomics Clin. Cancer Res., November 15, 2005; 11(22): 7963 - 7964. [Full Text] [PDF] |
||||
![]() |
L. Zhou and D. M. Rocke An expression index for Affymetrix GeneChips based on the generalized logarithm Bioinformatics, November 1, 2005; 21(21): 3983 - 3989. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. L.G. Sieben, J. Oosting, A. M. Flanagan, J. Prat, G. M.J.M. Roemen, S. M. Kolkman-Uljee, R. van Eijk, C. J. Cornelisse, G. J. Fleuren, and M. van Engeland Differential Gene Expression in Ovarian Tumors Reveals Dusp 4 and Serpina 5 As Key Regulators for Benign Behavior of Serous Borderline Tumors J. Clin. Oncol., October 10, 2005; 23(29): 7257 - 7264. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Liu, M. Milo, N. D. Lawrence, and M. Rattray A tractable probabilistic model for Affymetrix probe-level analysis across multiple chips Bioinformatics, September 15, 2005; 21(18): 3637 - 3644. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.-M. K. Hein, S. Richardson, H. C. Causton, G. K. Ambler, and P. J. Green BGX: a fully Bayesian integrated approach to the analysis of Affymetrix GeneChip data Biostat., July 1, 2005; 6(3): 349 - 373. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Wu, R. Carta, and L. Zhang Sequence dependence of cross-hybridization on short oligo microarrays Nucleic Acids Res., May 24, 2005; 33(9): e84 - e84. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Maziarz, C. Chung, D. J. Drucker, and A. Emili Integrating Global Proteomic and Genomic Expression Profiles Generated from Islet {alpha} Cells: Opportunities and Challenges to Deriving Reliable Biological Inferences Mol. Cell. Proteomics, April 1, 2005; 4(4): 458 - 474. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. L. Yap, M. P. Wong, X. W. Zhang, D. Hernandez, R. Gras, D. K. Smith, and A. Danchin Conserved transcription factor binding sites of cancer markers derived from primary lung adenocarcinoma microarrays Nucleic Acids Res., January 14, 2005; 33(1): 409 - 421. [Abstract] [Full Text] [PDF] |
||||
















