Bioinformatics Advance Access published online on December 5, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl609
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Statistical Science, Southern Methodist University, Dallas, TX 75275 USA; Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX 75390 USA
* To whom correspondence should be addressed.
Motivation: Affymetrix GeneChip arrays require summarization in order to combine the probe-level intensities into one value representing the expression level of a gene. However, probe intensity measurements are expected to be affected by different levels of non-specific- and cross-hybridization to non-specific transcripts. Here we present a new summarization technique, the Distribution Free Weighted method (DFW), which uses information about the variability in probe behavior to estimate the extent of non-specific and cross-hybridization for each probe. The contribution of the probe is weighted accordingly during summarization, without making any distributional assumptions for the probe-level data. Results: We compare DFW with several popular summarization methods on spike-in data sets, via both our own calculations and the Affycomp II competition. The results show that DFW outperforms other methods when sensitivity and specificity are considered simultaneously. With the Affycomp spike-in data sets, the area under the Receiver Operating Characteristic (ROC) curve for DFW is nearly 1.0 (a perfect value), indicating that DFW can identify all differentially expressed genes with a few false positives. The approach used is also computationally faster than most other methods in current use. Availability: The R code for DFW is available upon request.
Received July 18, 2006
Revised November 6, 2006
Accepted November 24, 2006
Article
A distribution free summarization method for Affymetrix GeneChip® arrays
Zhongxue Chen 1, Monnie McGee 2 *, Qingzhong Liu 3, and Richard H. Scheuermann 4
2 Department of Statistical Science, Southern Methodist University, Dallas, TX 75275 USA
3 Department of Computer Science, New Mexico Institute of Mining and Technology, Socorro NM 87801 USA
4 Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX 75390 USA
Monnie McGee, E-mail: mmcgee{at}smu.edu
![]()
Abstract
Associate Editor: Martin Bishop
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
K. Motoyama, Y. Nakai, T. Miyashita, Y. Fukui, M. Morita, K. Sanmiya, H. Sakakibara, I. Matsumoto, K. Abe, T. Yakabe, et al. Isolation stress for 30 days alters hepatic gene expression profiles, especially with reference to lipid metabolism in mice Physiol Genomics, April 10, 2009; 37(2): 79 - 87. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Novak, T. Jensen, M. M. Oshiro, G. S. Watts, C. J. Kim, and B. W. Futscher Agglomerative Epigenetic Aberrations Are a Common Event in Human Breast Cancer Cancer Res., October 15, 2008; 68(20): 8616 - 8625. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Schachtner, D. Lutter, P. Knollmuller, A. M. Tome, F. J. Theis, G. Schmitz, M. Stetter, P. G. Vilda, and E. W. Lang Knowledge-based gene expression classification via matrix factorization Bioinformatics, August 1, 2008; 24(15): 1688 - 1697. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W.K. Ho, M. Stefani, C. G. dos Remedios, and M. A. Charleston Differential variability analysis of gene expression and its application to human diseases Bioinformatics, July 1, 2008; 24(13): i390 - i398. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Mashiguchi, E. Urakami, M. Hasegawa, K. Sanmiya, I. Matsumoto, I. Yamaguchi, T. Asami, and Y. Suzuki Defense-Related Signaling by Interaction of Arabinogalactan Proteins and {beta}-Glucosyl Yariv Reagent Inhibits Gibberellin Signaling in Barley Aleurone Cells Plant Cell Physiol., February 1, 2008; 49(2): 178 - 190. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Bjork and K. Kafadar Systematic order-dependent effect in expression values, variance, detection calls and differential expression in Affymetrix GeneChips(R) Bioinformatics, November 1, 2007; 23(21): 2873 - 2880. [Abstract] [Full Text] [PDF] |
||||



