Bioinformatics Advance Access published online on November 14, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl572
1 Institute for Molecules and Materials, Analytical Chemistry, Radboud University Nijmegen, Toernooiveld 1, 6525 ED, Nijmegen, The Netherlands
* To whom correspondence should be addressed.
Motivation: ANOVA is frequently used in the analysis of microarray data, e.g. to assess the significance of treatment effects and to select interesting genes based on p-values. However, it does not show what exactly is causing an effect. Our purpose is to improve the interpretation of ANOVA results on large microarray data sets by applying PCA to the individual variance components. Interaction effects can be visualised with biplots, which show genes and variables in one plot and provide insight into the effect of e.g. treatment or time on gene expression. Because ANOVA has removed uninteresting sources of variance, the results are much more interpretable than without ANOVA. Moreover, the combination of ANOVA and PCA allows for a simple way to select genes, based on the interactions of interest.
Results: It is shown that the components of an ANOVA model can be summarised and visualised with PCA, which improves the interpretability of the models. The method is applied to a real time-course gene expression dataset of mesenchymal stem cells, designed to investigate the effect of different treatments on osteogenesis. The biplots generated with the algorithm give specific information about the effects of specific treatments on genes over time. These results are in agreement with the literature. Biological validation with GO annotation of the genes present in the selections shows that biologically relevant groups of genes are selected.
Availability: R code with the implementation of the method for this dataset is available from http://www.cac.science.ru.nl under the heading "Software".
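The approach summarised in the abstract (decompose each gene's expression with ANOVA, then apply PCA to one variance component and select genes from the resulting scores) can be illustrated with a minimal sketch. It is written in Python with NumPy rather than the authors' R code, runs on simulated data, and assumes a balanced design with one value per gene, treatment and time point. The function names, the array layout and the distance-based selection rule are illustrative assumptions, not the published implementation.

```python
import numpy as np

def anova_interaction(x):
    """Per-gene two-way ANOVA decomposition (balanced design assumed).

    x has shape (genes, treatments, time points). The grand mean and the
    treatment and time main effects are removed for every gene, leaving
    the treatment-by-time interaction term.
    """
    grand = x.mean(axis=(1, 2), keepdims=True)
    treat = x.mean(axis=2, keepdims=True) - grand   # treatment main effect
    time = x.mean(axis=1, keepdims=True) - grand    # time main effect
    return x - grand - treat - time

def pca(m, n_components=2):
    """PCA via SVD of the column-centred matrix m (rows are genes)."""
    m = m - m.mean(axis=0, keepdims=True)
    u, s, vt = np.linalg.svd(m, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]   # gene coordinates
    loadings = vt[:n_components].T                    # variable coordinates
    explained = (s ** 2 / np.sum(s ** 2))[:n_components]
    return scores, loadings, explained

# Simulated data (assumption): 200 genes, 3 treatments, 4 time points.
rng = np.random.default_rng(0)
n_genes, n_treat, n_time = 200, 3, 4
x = rng.normal(scale=0.1, size=(n_genes, n_treat, n_time))
# Plant an interaction: the first 10 genes rise over time under treatment 2 only.
x[:10, 2, :] += np.linspace(0.0, 3.0, n_time)

# PCA on the interaction component only; columns are treatment/time combinations.
inter = anova_interaction(x)
flat = inter.reshape(n_genes, n_treat * n_time)
scores, loadings, explained = pca(flat)

# Illustrative selection rule: genes farthest from the origin of the score plot.
dist = np.linalg.norm(scores, axis=1)
selected = np.argsort(dist)[::-1][:10]
print(sorted(selected.tolist()))
```

Plotting `scores` and `loadings` in the same axes would give a biplot of the kind the abstract describes. Genes lying far from the origin in the direction of a given treatment/time loading are those whose expression deviates most for that combination once the main effects have been removed.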
Received September 29, 2006
Accepted November 8, 2006
Article
Interpretation of ANOVA models for microarray data using PCA
J. R. de Haan 1, R. Wehrens 1, S. Bauerschmidt 2, E. Piek 3, R. C. van Schaik 2, and L. M. C. Buydens 1 *
2 NV Organon, Molenstraat 110, 5340 BH, Oss, The Netherlands; Centre for Molecular and Biomolecular Informatics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen, Toernooiveld 1, 6525 ED, Nijmegen, The Netherlands
3 Department of Applied Biology, Radboud University Nijmegen, Toernooiveld 1, 6525 ED, Nijmegen, The Netherlands
L. M. C. Buydens, E-mail: L.Buydens{at}science.ru.nl
Abstract
Associate Editor: Joaquin Dopazo