Bioinformatics Vol. 18 no. 12 2002
Pages 1633-1640
© 2002 Oxford University Press
Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model
á
ik 1,*
1 School of Medicine, University of
California San Diego, La Jolla, CA 92093-0679, USA
2 Centre Hospitalier de lUniversité Laval Research Centre and
Quebec Genome Centre, Québec, Canada G1V 4G2
Received on November 8, 2001
; revised on April 3, 2002 and May 25, 2002
; accepted on June 6, 2002
Motivation: High-density oligonucleotide arrays (GeneChip, Affymetrix, Santa Clara, CA) have become a standard research tool in many areas of biomedical research. They quantitatively monitor the expression of thousands of genes simultaneously by measuring fluorescence from gene-specific targets or probes. The relationship between signal intensities and transcript abundance as well as normalization issues have been the focus of much recent attention (Hill et al., 2001; Chudin et al., 2002; Naef et al., 2002a). It is desirable that a researcher has the best possible analytical tools to make the most of the information that this powerful technology has to offer. At present there are three analytical methods available: the newly released Affymetrix Microarray Suite 5.0 (AMS) software that accompanies the GeneChip product, the method of Li and Wong (LW; Li and Wong, 2001), and the method of Naef et al. (FN; Naef et al., 2001). The AMS method is tailored for analysis of a single microarray, and can therefore be used with any experimental design. The LW method on the other hand depends on a large number of microarrays in an experiment and cannot be used for an isolated microarray, and the FN method is particular to paired microarrays, such as resulting from an experiment in which each treatment sample has a corresponding control sample. Our focus is on analysis of experiments in which there is a series of samples. In this case only the AMS, LW, and the method described in this paper can be used. The present method is model-based, like the LW method, but assumes multiplicative not additive noise, and employs elimination of statistically significant outliers for improved results. Unlike LW and AMS, we do not assume probe-specific background (measured by the so-called mismatch probes). Rather, we assume uniform background, whose level is estimated using both the mismatch and perfect match probe intensities.
Results: We present a new method for GeneChip analysis, based on a statistical model with multiplicative noise. We demonstrated that this method yields results superior to those obtained by the Affymetrix Microarray Suite 5.0 software and to those obtained by the model-based method of Li and Wong (Li and Wong, 2001). The present method eliminates the hard-to-interpret negative expression indices, and the binary presence calls (present or absent) are replaced by the statistical significance (p-value) of gene expression. We have found that thresholding the p-values at the (0.1)16level produces about the same number of present calls as the AMS software. By testing our method on a pair of replicate GeneChips (hybridized with the same cRNA), we found that 95.6% of data points lie within the 1.25fold interval. In other words, our method had a 4.4% type I error rate at the 1.25fold level. The error rate of the LW method was 15%, and that of the AMS method was 29%. There were no points outside the 2fold interval with the present method. Analysis of variance (ANOVA) of another experiment with multiple replicates shows that this reduction of variance is not accompanied by a corresponding reduction of signal. On the contrary, the signal-to-noise ratio (as measured by the distribution of F-statistics) of the present method is on average 3.4-times better than that of AMS, and 1.4-times better than that of Li and Wong.
Contact: sasik{at}corgon.ucsd.edu
Availability: A Fortran 90 source code of this method is available from the corresponding author upon request.
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W. B. Langdon, G. J. G. Upton, and A. P. Harrison Probes containing runs of guanines provide insights into the biophysics and bioinformatics of Affymetrix GeneChips Brief Bioinform, May 1, 2009; 10(3): 259 - 277. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Bayjanov, M. Wels, M. Starrenburg, J. E. T. van Hylckama Vlieg, R. J. Siezen, and D. Molenaar PanCGH: a genotype-calling algorithm for pangenome CGH data Bioinformatics, February 1, 2009; 25(3): 309 - 314. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Leoni, B. Bailey, J. Reifert, H. H. Bendall, R. W. Zeller, J. Corbeil, G. Elliott, and C. C. Niemeyer Bendamustine (Treanda) Displays a Distinct Pattern of Cytotoxicity and Unique Mechanistic Features Compared with Other Alkylating Agents Clin. Cancer Res., January 1, 2008; 14(1): 309 - 317. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhang, J. S. Bailey, D. Coss, B. Lin, R. Tsutsumi, M. A. Lawson, P. L. Mellon, and N. J. G. Webster Activin Modulates the Transcriptional Response of LssT2 Cells to Gonadotropin-Releasing Hormone and Alters Cellular Proliferation Mol. Endocrinol., November 1, 2006; 20(11): 2909 - 2930. [Abstract] [Full Text] [PDF] |
||||
![]() |
C L Wilson, A H Sims, A Howell, C J Miller, and R B Clarke Effects of oestrogen on gene expression in epithelium and stroma of normal human breast tissue. Endocr. Relat. Cancer, June 1, 2006; 13(2): 617 - 628. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. J. G. Upton and J. C. Lloyd Oligonucleotide arrays: information from replication and spatial structure Bioinformatics, November 15, 2005; 21(22): 4162 - 4168. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Glatt, I. P. Everall, W. S. Kremen, J. Corbeil, R. Sasik, N. Khanlou, M. Han, C.-C. Liew, and M. T. Tsuang Comparative gene expression analysis of blood and brain provides concurrent validation of SELENBP1 up-regulation in schizophrenia PNAS, October 25, 2005; 102(43): 15533 - 15538. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-C. Battista, E. Calvo, A. Chorvatova, B. Comte, J. Corbeil, and M. Brochu Intra-uterine growth restriction and the programming of left ventricular remodelling in female rats J. Physiol., May 15, 2005; 565(1): 197 - 205. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ogawa, J. Lozach, K. Jepsen, D. Sawka-Verhelle, V. Perissi, R. Sasik, D. W. Rose, R. S. Johnson, M. G. Rosenfeld, and C. K. Glass A nuclear receptor corepressor transcriptional checkpoint controlling activator protein 1-dependent gene networks required for macrophage activation PNAS, October 5, 2004; 101(40): 14461 - 14466. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-P. Hsieh, T.-M. Chu, R. D. Wolfinger, and G. Gibson Mixed-Model Reanalysis of Primate Data Suggests Tissue and Species Biases in Oligonucleotide-Based Gene Expression Profiles Genetics, October 1, 2003; 165(2): 747 - 757. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Welch, M. Ricote, T. E. Akiyama, F. J. Gonzalez, and C. K. Glass PPAR{gamma} and PPAR{delta} negatively regulate specific subsets of lipopolysaccharide and IFN-{gamma} target genes in macrophages PNAS, May 27, 2003; 100(11): 6712 - 6717. [Abstract] [Full Text] [PDF] |
||||







