Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (111)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Troyanskaya, O. G.
Right arrow Articles by Altman, R. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Troyanskaya, O. G.
Right arrow Articles by Altman, R. B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 18 no. 11 2002
Pages 1454-1461
© 2002 Oxford University Press

Nonparametric methods for identifying differentially expressed genes in microarray data

Olga G. Troyanskaya 1, Mitchell E. Garber 1, Patrick O. Brown 2,3, David Botstein 1,* and Russ B. Altman 1,*

1 Department of Genetics, Stanford University School of Medicine
2 Department of Biochemistry, Stanford University School of Medicine
3 Howard Hughes Medical Institute, Stanford, CA, USA

Received on February 27, 2002 ; revised on April 21, 2002 ; accepted on May 24, 2002

Motivation: Gene expression experiments provide a fast and systematic way to identify disease markers relevant to clinical care. In this study, we address the problem of robust identification of differentially expressed genes from microarray data. Differentially expressed genes, or discriminator genes, are genes with significantly different expression in two user-defined groups of microarray experiments. We compare three model-free approaches: (1) nonparametric t-test, (2) Wilcoxon (or Mann–Whitney) rank sum test, and (3) a heuristic method based on high Pearson correlation to a perfectly differentiating gene (‘ideal discriminator method’). We systematically assess the performance of each method based on simulated and biological data under varying noise levels and p-value cutoffs.

Results: All methods exhibit very low false positive rates and identify a large fraction of the differentially expressed genes in simulated data sets with noise level similar to that of actual data. Overall, the rank sum test appears most conservative, which may be advantageous when the computationally identified genes need to be tested biologically. However, if a more inclusive list of markers is desired, a higher p-value cutoff or the nonparametric t-test may be appropriate. When applied to data from lung tumor and lymphoma data sets, the methods identify biologically relevant differentially expressed genes that allow clear separation of groups in question. Thus the methods described and evaluated here provide a convenient and robust way to identify differentially expressed genes for further biological and clinical analysis.

Availability: By request from the authors.

Contact: russ.altman{at}stanford.edu

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Virol.Home page
Z. Han, D. Verma, C. Hilscher, D. P. Dittmer, and S. Swaminathan
General and Target-Specific RNA Binding Properties of Epstein-Barr Virus SM Posttranscriptional Regulatory Protein
J. Virol., November 15, 2009; 83(22): 11635 - 11644.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Yang, Y. Zhou, R. Jin, and C. Chan
Reconstruct modular phenotype-specific gene networks by knowledge-driven matrix factorization
Bioinformatics, September 1, 2009; 25(17): 2236 - 2243.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Heller, E. Manduchi, and D. S. Small
Matching methods for observational microarray studies
Bioinformatics, April 1, 2009; 25(7): 904 - 909.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
A. J. O'Hara, W. Vahrson, and D. P. Dittmer
Gene alteration and precursor and mature microRNA transcription changes contribute to the miRNA signature of primary effusion lymphoma
Blood, February 15, 2008; 111(4): 2347 - 2353.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Saeys, I. Inza, and P. Larranaga
A review of feature selection techniques in bioinformatics
Bioinformatics, October 1, 2007; 23(19): 2507 - 2517.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Zhou, C. Cras-Meneur, M. Ohsugi, G. D. Stormo, and M. Alan. Permutt
A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments
Bioinformatics, August 15, 2007; 23(16): 2073 - 2079.
[Abstract] [Full Text] [PDF]


Home page
BiostatisticsHome page
B. Wu
Cancer outlier differential gene expression detection
Biostat., July 1, 2007; 8(3): 566 - 575.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
D. Juric, N. J. Lacayo, M. C. Ramsey, J. Racevskis, P. H. Wiernik, J. M. Rowe, A. H. Goldstone, P. J. O'Dwyer, E. Paietta, and B. I. Sikic
Differential Gene Expression Patterns and Interaction Networks in BCR-ABL-Positive and -Negative Adult Acute Lymphoblastic Leukemias
J. Clin. Oncol., April 10, 2007; 25(11): 1341 - 1349.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
W. Zhao, E. Y. Chuang, M. Mishra, R. Awwad, K. Bisht, L. Sun, P. Nguyen, J. D. Pennington, T. J. C. Wang, C. M. Bradbury, et al.
Distinct Effects of Ionizing Radiation on In vivo Murine Kidney and Brain Normal Tissue Gene Expression.
Clin. Cancer Res., June 15, 2006; 12(12): 3823 - 3830.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
C. Steinhoff and M. Vingron
Normalization and quantification of differential expression in gene expression microarrays
Brief Bioinform, June 1, 2006; 7(2): 166 - 177.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
P. Larranaga, B. Calvo, R. Santana, C. Bielza, J. Galdiano, I. Inza, J. A. Lozano, R. Armananzas, G. Santafe, A. Perez, et al.
Machine learning in bioinformatics
Brief Bioinform, March 1, 2006; 7(1): 86 - 112.
[Abstract] [Full Text] [PDF]


Home page
Stat Methods Med ResHome page
S. Y. Kim, J. W. Lee, and I. S. Sohn
Comparison of various statistical methods for identifying differential gene expression in replicated microarray data
Statistical Methods in Medical Research, February 1, 2006; 15(1): 3 - 20.
[Abstract] [PDF]


Home page
BioinformaticsHome page
H. H. Zhang, J. Ahn, X. Lin, and C. Park
Gene selection using support vector machines with non-convex penalty
Bioinformatics, January 1, 2006; 22(1): 88 - 95.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Liu, J. Li, and L. Wong
Use of extreme patient samples for outcome prediction from gene expression data
Bioinformatics, August 15, 2005; 21(16): 3377 - 3384.
[Abstract] [Full Text] [PDF]


Home page
J. Appl. Physiol.Home page
M. Fluck, C. Dapp, S. Schmutz, E. Wit, and H. Hoppeler
Transcriptional profiling of tissue plasticity: role of shifts in gene expression and technical limitations
J Appl Physiol, August 1, 2005; 99(2): 397 - 413.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Hu, F. Zou, and F. A. Wright
Practical FDR-based sample size calculations in microarray experiments
Bioinformatics, August 1, 2005; 21(15): 3264 - 3272.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Liang, B. Tayo, X. Cai, and A. Kelemen
Differential and trajectory methods for time course gene expression data
Bioinformatics, July 1, 2005; 21(13): 3009 - 3016.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
K.-H. Pan, C.-J. Lih, and S. N. Cohen
Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays
PNAS, June 21, 2005; 102(25): 8961 - 8965.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. A. Vinterbo, E.-Y. Kim, and L. Ohno-Machado
Small, fuzzy and interpretable gene expression based classifiers
Bioinformatics, May 1, 2005; 21(9): 1964 - 1970.
[Abstract] [Full Text] [PDF]


Home page
DNA ResHome page
R. Z. N. Vêncio and T. Koide
HTself: Self-Self Based Statistical Test for Low Replication Microarray Studies
DNA Res, January 1, 2005; 12(3): 211 - 214.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
M. Shapira, E. Segal, and D. Botstein
Disruption of Yeast Forkhead-associated Cell Cycle Transcription by Oxidative Stress
Mol. Biol. Cell, December 1, 2004; 15(12): 5659 - 5669.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
M. Zhao, A. Chow, J. Powers, G. Fajardo, and D. Bernstein
Microarray analysis of gene expression after transverse aortic constriction in mice
Physiol Genomics, September 16, 2004; 19(1): 93 - 105.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
O. Modlich, H.-B. Prisack, G. Pitschke, U. Ramp, R. Ackermann, H. Bojar, T. A. Vogeli, and M.-O. Grimm
Identifying Superficial, Muscle-Invasive, and Metastasizing Transitional Cell Carcinoma of the Bladder: Use of cDNA Array Analysis of Gene Expression Profiles
Clin. Cancer Res., May 15, 2004; 10(10): 3410 - 3421.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
N. D. Trinklein, J. I. Murray, S. J. Hartman, D. Botstein, and R. M. Myers
The Role of Heat Shock Transcription Factor 1 in the Genome-wide Regulation of the Mammalian Heat Shock Response
Mol. Biol. Cell, March 1, 2004; 15(3): 1254 - 1261.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. L. Whitfield, D. R. Finlay, J. I. Murray, O. G. Troyanskaya, J.-T. Chi, A. Pergamenschikov, T. H. McCalmont, P. O. Brown, D. Botstein, and M. K. Connolly
Systemic and cell type-specific gene expression patterns in scleroderma skin
PNAS, October 14, 2003; 100(21): 12319 - 12324.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J.-T. Chi, H. Y. Chang, G. Haraldsen, F. L. Jahnsen, O. G. Troyanskaya, D. S. Chang, Z. Wang, S. G. Rockson, M. van de Rijn, D. Botstein, et al.
Endothelial cell diversity revealed by global expression profiling
PNAS, September 16, 2003; 100(19): 10623 - 10628.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
Z. Bar-Joseph, G. Gerber, I. Simon, D. K. Gifford, and T. S. Jaakkola
Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes
PNAS, September 2, 2003; 100(18): 10146 - 10151.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
X. Chen, S. Y. Leung, S. T. Yuen, K.-M. Chu, J. Ji, R. Li, A. S.Y. Chan, S. Law, O. G. Troyanskaya, J. Wong, et al.
Variation in Gene Expression Patterns in Human Gastric Cancers
Mol. Biol. Cell, August 1, 2003; 14(8): 3208 - 3215.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. P. Bohen, O. G. Troyanskaya, O. Alter, R. Warnke, D. Botstein, P. O. Brown, and R. Levy
Variation in gene expression patterns in follicular lymphoma and the response to rituximab
PNAS, February 18, 2003; 100(4): 1926 - 1930.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Y. Leung, X. Chen, K. M. Chu, S. T. Yuen, J. Mathy, J. Ji, A. S. Y. Chan, R. Li, S. Law, O. G. Troyanskaya, et al.
Phospholipase A2 group IIA expression in gastric adenocarcinoma is associated with prolonged survival and less frequent metastasis
PNAS, December 10, 2002; 99(25): 16203 - 16208.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.