Bioinformatics Advance Access published online on November 27, 2007
Bioinformatics, doi:10.1093/bioinformatics/btm583
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Identification of Differentially Expressed Gene Categories in Microarray Studies Using Nonparametric Multivariate Analysis
1 Department of Statistics, Iowa State University, Ames, Iowa 50011-1210, USA, 2 Eli Lilly and Company,Lilly Research Laboratories, P.O. Box 708, Greenfield, Indiana 46140 and 3 Department of Animal Science, Iowa State University, Ames, Iowa 50011-3150, USA
*To whom correspondence should be addressed. Prof. Dan Nettleton, E-mail: dnett{at}iastate.edu
| Abstract |
|---|
Motivation: The field of microarray data analysis is shifting emphasis from methods for identifying differentially expressed genes to methods for identifying differentially expressed gene categories. The latter approaches utilize a priori information about genes to group genes into categories and enhance the interpretation of experiments aimed at identifying expression differences across treatments. While almost all of the existing approaches for identifying differentially expressed gene categories are practically useful, they suffer from a variety of drawbacks. Perhaps most notably, many popular tools are based exclusively on gene-specific statistics that cannot detect many types of multivariate expression change.
Results: We have developed a nonparametric multivariate method for identifying gene categories whose multivariate expression distribution differs across two or more conditions. We illustrate our approach and compare its performance to several existing procedures via the analysis of a real data set and a unique data-based simulation study designed to capture the challenges and complexities of practical data analysis. We show that our method has good power for differentiating between differentially expressed and non-differentially expressed gene categories, and we utilize a resampling based strategy for controlling the false discovery rate when testing multiple categories.
Availability: R code (www.r-project.org) for implementing our approach is available from the first author by request.
Contact: dnett{at}iastate.edu
Associate Editor: Prof. John Quackenbush
Received on June 27, 2007; revised on October 11, 2007; accepted on November 19, 2007
This article has been cited by other articles:
![]() |
G. V. Glazko and F. Emmert-Streib Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets Bioinformatics, September 15, 2009; 25(18): 2348 - 2354. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pounds, C. Cheng, X. Cao, K. R. Crews, W. Plunkett, V. Gandhi, J. Rubnitz, R. C. Ribeiro, J. R. Downing, and J. Lamba PROMISE: a tool to identify genomic features with a specific biologically interesting pattern of associations with multiple endpoint variables Bioinformatics, August 15, 2009; 25(16): 2013 - 2019. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Heller, E. Manduchi, G. R. Grant, and W. J. Ewens A flexible two-stage procedure for identifying gene sets that are differentially expressed Bioinformatics, April 15, 2009; 25(8): 1019 - 1025. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ma and M. R. Kosorok Identification of differential gene pathways with principal component analysis Bioinformatics, April 1, 2009; 25(7): 882 - 889. [Abstract] [Full Text] [PDF] |
||||
