Bioinformatics Advance Access published online on September 11, 2008
Bioinformatics, doi:10.1093/bioinformatics/btn465
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Gene Set Enrichment Analysis using Linear Models and Diagnostics
aFred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109-1024, U.S.A.
*to whom correspondence should be addressed. Dr. Assaf P. Oron, E-mail: aoron{at}fhcrc.org,rgentlem{at}fhcrc.org
| Abstract |
|---|
Motivation: Gene-set enrichment analysis (GSEA) can be greatly enhanced by linear model (regression) diagnostic techniques. Diagnostics can be used to identify outlying or influential samples, and also to evaluate model fit and explore model expansion.
Results: We demonstrate this methodology on an adult acute lymphoblastic leukemia (ALL) dataset, using GSEA based on chromosome-band mapping of genes. Individual residuals, grouped or aggregated by chromosomal loci, indicate problematic samples and potential data-entry errors, and help identify hyperdiploidy as a factor playing a key role in expression for this dataset. Subsequent analysis pinpoints suspected DNA copy number abnormalities of specific samples and chromosomes (most prevalent are chromosomes X, 21 and 14), and also reveals significant expression differences between the hyperdiploid and diploid groups on other chromosomes (most prominently 19, 22, 3 and 13) - differences which are apparently not associated with copy number.
Availability: Software for the statistical tools demonstrated in this article is available as Bioconductor package GSEAlm.
Contact: assaf.oron{at}gmail.com,rgentlem{at}fhcrc.org
Associate Editor: Prof. John Quackenbush
Received on June 27, 2008; revised on July 29, 2008; accepted on August 26, 2008
This article has been cited by other articles:
![]() |
A. C. Culhane, T. Schwarzl, R. Sultana, K. C. Picard, S. C. Picard, T. H. Lu, K. R. Franklin, S. J. French, G. Papenhausen, M. Correll, et al. GeneSigDB--a curated database of gene expression signatures Nucleic Acids Res., November 24, 2009; (2009) gkp1015v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Caldas, N. Gehlenborg, A. Faisal, A. Brazma, and S. Kaski Probabilistic retrieval and visualization of biologically relevant microarray experiments Bioinformatics, June 15, 2009; 25(12): i145 - i153. [Abstract] [Full Text] [PDF] |
||||

