Bioinformatics Advance Access published online on August 7, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl424
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Theoretical Bioinformatics, German Cancer Reseach Center, 69120 Heidelberg, Germany; Medical Research Center, University Hospital Mannheim, 68167 Mannheim, Germany; Cellular and Molecular Pathology, German Cancer Research Center, 69120 Heidelberg, Germany
* To whom correspondence should be addressed.
Motivation: The wide use of DNA microarrays for the investigation of the cell transcriptome triggered the invention of numerous methods for the processing of microarray data and lead to a growing number of microarray studies that examine the same biological conditions. However, comparisons made on the level of gene lists obtained by different statistical methods or from different data sets hardly converge. We aimed at examining such discrepancies on the level of apparently affected biologically related groups of genes, for example metabolic or signalling pathways. This can be achieved by group testing procedures, e.g. over-representation analysis (ORA), bluefunctional class scoring (FCS), or global tests. Results: Three public prostate cancer data sets obtained with the same microarray platform (HGU95A/HGU95av2) were analyzed. Each data set was subjected to normalization by either variance stabilizing normalization (vsn) or mixed model normalization (MMN). Then, statistical analysis of microarrays (SAM) was applied to the vsn-normalized data and mixed model analsis (MMA) to the data normalized by MMN. For multiple testing adjustment the false discovery rate (FDR) was calculated and the threshold was set to 0.05. Gene lists from the same method applied to different data sets showed overlaps between 42% and 52%, while lists from different methods applied to the same data set had between 63% and 85% of genes in common. A number of six gene lists obtained by the two statistical methods applied to the three data sets was then subjected to group testing by blueFisher's exact test. Group testing by GSEA and global test was applied to the three data sets, as well. Fisher's exact test followed by global test showed more consistent results with respect to the concordance between analyses on gene lists obtained by different methods and different data sets than the GSEA. However, all group testing methods identified pathways that had already been described to be involved in the pathogenesis of prostate cancer. Moreover, pathways recurrently identified in these analyses are more likely to be reliable than those from a single analysis on a single data set. Supplementary Info: Supplementary Figure 1 and Supplementary Tables 1-4 are available from the Journal's website.
Received March 8, 2006
Revised July 22, 2006
Accepted July 28, 2006
Article
Group testing for pathway analysis improves comparability of different microarray data sets
Theodora Manoli 1, Norbert Gretz 2, Hermann-Josef Gröne 3, Marc Kenzelmann 3, Roland Eils 4, and Benedikt Brors 4 *
2 Medical Research Center, University Hospital Mannheim, 68167 Mannheim, Germany
3 Cellular and Molecular Pathology, German Cancer Research Center, 69120 Heidelberg, Germany
4 Theoretical Bioinformatics, German Cancer Reseach Center, 69120 Heidelberg, Germany
Benedikt Brors, E-mail: b.brors{at}dkfz.de
![]()
Abstract
Associate Editor: David Rocke
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Lu, P. Huggins, and Z. Bar-Joseph Cross species analysis of microarray expression data Bioinformatics, June 15, 2009; 25(12): 1476 - 1483. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. S. Leong and D. Kipling Text-based over-representation analysis of microarray gene lists with annotation bias Nucleic Acids Res., June 1, 2009; 37(11): e79 - e79. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. S. Nagaraj Evolving 'omics' technologies for diagnostics of head and neck cancer Brief Funct Genomic Proteomic, March 9, 2009; (2009) elp004v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Neretti, P.-Y. Wang, A. S. Brodsky, H. H. Nyguyen, K. P. White, B. Rogina, and S. L. Helfand Long-lived Indy induces reduced mitochondrial reactive oxygen species production and oxidative damage PNAS, February 17, 2009; 106(7): 2277 - 2282. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Sanchez-Espiridion, A. Sanchez-Aguilera, C. Montalban, C. Martin, R. Martinez, J. Gonzalez-Carrero, C. Poderos, C. Bellas, M. F. Fresno, C. Morante, et al. A TaqMan Low-Density Array to Predict Outcome in Advanced Hodgkin's Lymphoma Using Paraffin-Embedded Samples Clin. Cancer Res., February 15, 2009; 15(4): 1367 - 1375. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. P. Smirnova, A. A. Ptitsyn, K. J. Austin, H. Bielefeldt-Ohmann, H. Van Campen, H. Han, A. L. van Olphen, and T. R. Hansen Persistent fetal infection with bovine viral diarrhea virus differentially affects maternal blood cell signal transduction pathways Physiol Genomics, February 2, 2009; 36(3): 129 - 139. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Chen, L. Wang, J. D. Smith, and B. Zhang Supervised principal component analysis for gene set enrichment of microarray data with continuous or survival outcomes Bioinformatics, November 1, 2008; 24(21): 2474 - 2481. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Zschiedrich, U. Hardeland, A. Krones-Herzig, M. Berriel Diaz, A. Vegiopoulos, J. Muggenburg, D. Sombroek, T. G. Hofmann, R. Zawatzky, X. Yu, et al. Coactivator function of RIP140 for NF{kappa}B/RelA-dependent cytokine gene expression Blood, July 15, 2008; 112(2): 264 - 276. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Nam and S.-Y. Kim Gene-set approach for expression pattern analysis Brief Bioinform, May 1, 2008; 9(3): 189 - 197. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Shriner, T. M. Baye, M. A. Padilla, S. Zhang, L. K. Vaughan, and A. E. Loraine Commonality of functional annotation: a method for prioritization of candidate genes from genome-wide linkage studies Nucleic Acids Res., March 27, 2008; 36(4): e26 - e26. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hummel, R. Meister, and U. Mansmann GlobalANCOVA: exploration and assessment of gene group effects Bioinformatics, January 1, 2008; 24(1): 78 - 85. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xu, Y. Zhao, and R. Simon Gene Set Expression Comparison kit for BRB-ArrayTools Bioinformatics, January 1, 2008; 24(1): 137 - 139. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Goeman and P. Buhlmann Analyzing gene expression data in terms of gene sets: methodological issues Bioinformatics, April 15, 2007; 23(8): 980 - 987. [Abstract] [Full Text] [PDF] |
||||







