Bioinformatics Advance Access originally published online on February 15, 2007
Bioinformatics 2007 23(8):980-987; doi:10.1093/bioinformatics/btm051

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Analyzing gene expression data in terms of gene sets: methodological issues

Jelle J. Goeman 1,* and Peter Bühlmann 2

1 Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Postzone S5-P, P.O. Box 9600, 2300 RC Leiden, The Netherlands and 2 Seminar für Statistik, ETH Zurich, CH-8092 Zürich, Switzerland

*To whom correspondence should be addressed.


Abstract

Motivation: Many statistical tests have been proposed in recent years for analyzing gene expression data in terms of gene sets, usually from Gene Ontology. These methods are based on widely different methodological assumptions. Some approaches test differential expression of each gene set against differential expression of the rest of the genes, whereas others test each gene set on its own. Also, some methods are based on a model in which the genes are the sampling units, whereas others treat the subjects as the sampling units. This article aims to clarify the assumptions behind different approaches and to indicate a preferential methodology of gene set testing.
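
To make the two sampling models concrete, the following Python sketch contrasts one test of each kind. It is an illustration only and is not taken from the article: the function names, the choice of a hypergeometric (2x2 table) test as the gene-sampling example, and the sum of squared mean differences as the subject-sampling statistic are all assumptions made here. The first test asks whether a gene set contains more differentially expressed genes than the rest of the genes would suggest, treating genes as the sampling units. The second tests the gene set on its own by permuting the subjects' group labels.

```python
import math
import random


def gene_sampling_pvalue(n_genes, n_de, set_size, n_de_in_set):
    """One-sided hypergeometric p-value; genes are the sampling units.

    Probability of seeing at least n_de_in_set differentially expressed (DE)
    genes in a set of set_size genes drawn at random from n_genes genes,
    of which n_de are DE.
    """
    total = math.comb(n_genes, set_size)
    upper = min(n_de, set_size)
    tail = sum(
        math.comb(n_de, x) * math.comb(n_genes - n_de, set_size - x)
        for x in range(n_de_in_set, upper + 1)
    )
    return tail / total


def set_statistic(expr, labels, set_rows):
    """Sum over the set's genes of the squared difference in group means."""
    n1 = sum(labels)
    n0 = len(labels) - n1
    stat = 0.0
    for g in set_rows:
        mean1 = sum(x for x, lab in zip(expr[g], labels) if lab == 1) / n1
        mean0 = sum(x for x, lab in zip(expr[g], labels) if lab == 0) / n0
        stat += (mean1 - mean0) ** 2
    return stat


def subject_sampling_pvalue(expr, labels, set_rows, n_perm=999, seed=0):
    """Permutation p-value; subjects are the sampling units.

    expr is a list of rows (one per gene), each a list of values per subject.
    labels is a list of 0/1 group labels, one per subject. Only the subject
    labels are shuffled, so correlations between genes are left intact.
    """
    rng = random.Random(seed)
    observed = set_statistic(expr, labels, set_rows)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if set_statistic(expr, shuffled, set_rows) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)
```

In this sketch the gene-sampling test needs only a per-gene DE call and set membership, while the subject-sampling test needs the full expression matrix, which reflects the difference in what each model regards as random.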

Results: We identify some crucial assumptions which are needed by the majority of methods. P-values derived from methods that use a model which takes the genes as the sampling unit are easily misinterpreted, as they are based on a statistical model that does not resemble the biological experiment actually performed. Furthermore, because these models are based on a crucial and unrealistic independence assumption between genes, the P-values derived from such methods can be wildly anti-conservative, as a simulation experiment shows. We also argue that methods that competitively test each gene set against the rest of the genes create an unnecessary rift between single gene testing and gene set testing.
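
The effect of the independence assumption can be illustrated with a small simulation. The sketch below is not the simulation experiment reported in the article; it is a minimal example under assumptions chosen here. Gene-level z-scores are drawn directly under the null hypothesis of no differential expression, a shared component induces an assumed pairwise correlation `rho` between genes in the set, and the gene-sampling test is a simple z-test on the set mean that treats the z-scores as independent standard normal draws. The function names and parameter values are hypothetical.

```python
import math
import random


def normal_two_sided_pvalue(z):
    """Two-sided p-value of a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def simulate_rejection_rate(set_size, rho, n_reps, alpha=0.05, seed=0):
    """Fraction of null data sets in which the gene-sampling z-test rejects.

    Each gene's z-score is marginally N(0, 1); genes in the set share a
    common component so that any two of them have correlation rho.
    The test assumes independence, so it is only valid when rho == 0.
    """
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_reps):
        shared = rng.gauss(0.0, 1.0)
        z_scores = [
            math.sqrt(rho) * shared + math.sqrt(1.0 - rho) * rng.gauss(0.0, 1.0)
            for _ in range(set_size)
        ]
        # Under independence this statistic would be standard normal.
        stat = sum(z_scores) / math.sqrt(set_size)
        if normal_two_sided_pvalue(stat) < alpha:
            rejections += 1
    return rejections / n_reps


if __name__ == "__main__":
    for rho in (0.0, 0.2, 0.5):
        rate = simulate_rejection_rate(set_size=20, rho=rho, n_reps=2000)
        print(f"rho={rho}: rejection rate at alpha=0.05 is {rate:.3f}")
```

With `rho` equal to zero the rejection rate should stay near the nominal level, whereas with positive correlation the variance of the set mean is inflated by a factor of 1 + (set_size - 1) * rho, so the same test rejects far more often than the nominal level allows.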

Contact: j.j.goeman@lumc.nl

Associate Editor: Trey Ideker


Received on September 21, 2006; revised on December 11, 2006; accepted on February 8, 2007



