Bioinformatics Vol. 18 no. 11 2002
Pages 1454-1461
© 2002 Oxford University Press
Nonparametric methods for identifying differentially expressed genes in microarray data
1 Department of Genetics,
Stanford University School of Medicine
2 Department of Biochemistry,
Stanford University School of Medicine
3 Howard Hughes Medical Institute,
Stanford, CA, USA
Received on February 27, 2002
; revised on April 21, 2002
; accepted on May 24, 2002
Motivation: Gene expression experiments provide a fast and systematic way to identify disease markers relevant to clinical care. In this study, we address the problem of robust identification of differentially expressed genes from microarray data. Differentially expressed genes, or discriminator genes, are genes with significantly different expression in two user-defined groups of microarray experiments. We compare three model-free approaches: (1) nonparametric t-test, (2) Wilcoxon (or MannWhitney) rank sum test, and (3) a heuristic method based on high Pearson correlation to a perfectly differentiating gene (ideal discriminator method). We systematically assess the performance of each method based on simulated and biological data under varying noise levels and p-value cutoffs.
Results: All methods exhibit very low false positive rates and identify a large fraction of the differentially expressed genes in simulated data sets with noise level similar to that of actual data. Overall, the rank sum test appears most conservative, which may be advantageous when the computationally identified genes need to be tested biologically. However, if a more inclusive list of markers is desired, a higher p-value cutoff or the nonparametric t-test may be appropriate. When applied to data from lung tumor and lymphoma data sets, the methods identify biologically relevant differentially expressed genes that allow clear separation of groups in question. Thus the methods described and evaluated here provide a convenient and robust way to identify differentially expressed genes for further biological and clinical analysis.
Availability: By request from the authors.
Contact: russ.altman{at}stanford.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Z. Han, D. Verma, C. Hilscher, D. P. Dittmer, and S. Swaminathan General and Target-Specific RNA Binding Properties of Epstein-Barr Virus SM Posttranscriptional Regulatory Protein J. Virol., November 15, 2009; 83(22): 11635 - 11644. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Yang, Y. Zhou, R. Jin, and C. Chan Reconstruct modular phenotype-specific gene networks by knowledge-driven matrix factorization Bioinformatics, September 1, 2009; 25(17): 2236 - 2243. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Heller, E. Manduchi, and D. S. Small Matching methods for observational microarray studies Bioinformatics, April 1, 2009; 25(7): 904 - 909. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. O'Hara, W. Vahrson, and D. P. Dittmer Gene alteration and precursor and mature microRNA transcription changes contribute to the miRNA signature of primary effusion lymphoma Blood, February 15, 2008; 111(4): 2347 - 2353. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Saeys, I. Inza, and P. Larranaga A review of feature selection techniques in bioinformatics Bioinformatics, October 1, 2007; 23(19): 2507 - 2517. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhou, C. Cras-Meneur, M. Ohsugi, G. D. Stormo, and M. Alan. Permutt A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments Bioinformatics, August 15, 2007; 23(16): 2073 - 2079. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Wu Cancer outlier differential gene expression detection Biostat., July 1, 2007; 8(3): 566 - 575. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Juric, N. J. Lacayo, M. C. Ramsey, J. Racevskis, P. H. Wiernik, J. M. Rowe, A. H. Goldstone, P. J. O'Dwyer, E. Paietta, and B. I. Sikic Differential Gene Expression Patterns and Interaction Networks in BCR-ABL-Positive and -Negative Adult Acute Lymphoblastic Leukemias J. Clin. Oncol., April 10, 2007; 25(11): 1341 - 1349. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Zhao, E. Y. Chuang, M. Mishra, R. Awwad, K. Bisht, L. Sun, P. Nguyen, J. D. Pennington, T. J. C. Wang, C. M. Bradbury, et al. Distinct Effects of Ionizing Radiation on In vivo Murine Kidney and Brain Normal Tissue Gene Expression. Clin. Cancer Res., June 15, 2006; 12(12): 3823 - 3830. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Steinhoff and M. Vingron Normalization and quantification of differential expression in gene expression microarrays Brief Bioinform, June 1, 2006; 7(2): 166 - 177. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Larranaga, B. Calvo, R. Santana, C. Bielza, J. Galdiano, I. Inza, J. A. Lozano, R. Armananzas, G. Santafe, A. Perez, et al. Machine learning in bioinformatics Brief Bioinform, March 1, 2006; 7(1): 86 - 112. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Y. Kim, J. W. Lee, and I. S. Sohn Comparison of various statistical methods for identifying differential gene expression in replicated microarray data Statistical Methods in Medical Research, February 1, 2006; 15(1): 3 - 20. [Abstract] [PDF] |
||||
![]() |
H. H. Zhang, J. Ahn, X. Lin, and C. Park Gene selection using support vector machines with non-convex penalty Bioinformatics, January 1, 2006; 22(1): 88 - 95. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Liu, J. Li, and L. Wong Use of extreme patient samples for outcome prediction from gene expression data Bioinformatics, August 15, 2005; 21(16): 3377 - 3384. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Fluck, C. Dapp, S. Schmutz, E. Wit, and H. Hoppeler Transcriptional profiling of tissue plasticity: role of shifts in gene expression and technical limitations J Appl Physiol, August 1, 2005; 99(2): 397 - 413. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, F. Zou, and F. A. Wright Practical FDR-based sample size calculations in microarray experiments Bioinformatics, August 1, 2005; 21(15): 3264 - 3272. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Liang, B. Tayo, X. Cai, and A. Kelemen Differential and trajectory methods for time course gene expression data Bioinformatics, July 1, 2005; 21(13): 3009 - 3016. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-H. Pan, C.-J. Lih, and S. N. Cohen Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays PNAS, June 21, 2005; 102(25): 8961 - 8965. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Vinterbo, E.-Y. Kim, and L. Ohno-Machado Small, fuzzy and interpretable gene expression based classifiers Bioinformatics, May 1, 2005; 21(9): 1964 - 1970. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Z. N. Vêncio and T. Koide HTself: Self-Self Based Statistical Test for Low Replication Microarray Studies DNA Res, January 1, 2005; 12(3): 211 - 214. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Shapira, E. Segal, and D. Botstein Disruption of Yeast Forkhead-associated Cell Cycle Transcription by Oxidative Stress Mol. Biol. Cell, December 1, 2004; 15(12): 5659 - 5669. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zhao, A. Chow, J. Powers, G. Fajardo, and D. Bernstein Microarray analysis of gene expression after transverse aortic constriction in mice Physiol Genomics, September 16, 2004; 19(1): 93 - 105. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Modlich, H.-B. Prisack, G. Pitschke, U. Ramp, R. Ackermann, H. Bojar, T. A. Vogeli, and M.-O. Grimm Identifying Superficial, Muscle-Invasive, and Metastasizing Transitional Cell Carcinoma of the Bladder: Use of cDNA Array Analysis of Gene Expression Profiles Clin. Cancer Res., May 15, 2004; 10(10): 3410 - 3421. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Trinklein, J. I. Murray, S. J. Hartman, D. Botstein, and R. M. Myers The Role of Heat Shock Transcription Factor 1 in the Genome-wide Regulation of the Mammalian Heat Shock Response Mol. Biol. Cell, March 1, 2004; 15(3): 1254 - 1261. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Whitfield, D. R. Finlay, J. I. Murray, O. G. Troyanskaya, J.-T. Chi, A. Pergamenschikov, T. H. McCalmont, P. O. Brown, D. Botstein, and M. K. Connolly Systemic and cell type-specific gene expression patterns in scleroderma skin PNAS, October 14, 2003; 100(21): 12319 - 12324. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-T. Chi, H. Y. Chang, G. Haraldsen, F. L. Jahnsen, O. G. Troyanskaya, D. S. Chang, Z. Wang, S. G. Rockson, M. van de Rijn, D. Botstein, et al. Endothelial cell diversity revealed by global expression profiling PNAS, September 16, 2003; 100(19): 10623 - 10628. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Bar-Joseph, G. Gerber, I. Simon, D. K. Gifford, and T. S. Jaakkola Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes PNAS, September 2, 2003; 100(18): 10146 - 10151. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Chen, S. Y. Leung, S. T. Yuen, K.-M. Chu, J. Ji, R. Li, A. S.Y. Chan, S. Law, O. G. Troyanskaya, J. Wong, et al. Variation in Gene Expression Patterns in Human Gastric Cancers Mol. Biol. Cell, August 1, 2003; 14(8): 3208 - 3215. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Bohen, O. G. Troyanskaya, O. Alter, R. Warnke, D. Botstein, P. O. Brown, and R. Levy Variation in gene expression patterns in follicular lymphoma and the response to rituximab PNAS, February 18, 2003; 100(4): 1926 - 1930. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Y. Leung, X. Chen, K. M. Chu, S. T. Yuen, J. Mathy, J. Ji, A. S. Y. Chan, R. Li, S. Law, O. G. Troyanskaya, et al. Phospholipase A2 group IIA expression in gastric adenocarcinoma is associated with prolonged survival and less frequent metastasis PNAS, December 10, 2002; 99(25): 16203 - 16208. [Abstract] [Full Text] [PDF] |
||||












