Bioinformatics Advance Access originally published online on May 19, 2005
Bioinformatics 2005 21(14):3105-3113; doi:10.1093/bioinformatics/bti496
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Hotelling's T2 multivariate profiling for detecting differential expression in microarrays
1Osteoporosis Research Center, Creighton University 601 N. 30th Street, Suite 6787, Omaha, NE 68131, USA
2The Key Laboratory of Biomedical Information Engineering of Ministry of Education and Institute of Molecular Genetics, School of Life Science and Technology, Xi'an Jiaotong University Xi'an 710049, PR China
3Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University Changsha, Hunan 410081, PR China
*To whom correspondence should be addressed.
Summary: The most widely used statistical methods for finding differentially expressed genes (DEGs) are essentially univariate. In this study, we present a new T2 statistic for analyzing microarray data. We implemented our method using a multiple forward search (MFS) algorithm that is designed for selecting a subset of feature vectors in high-dimensional microarray datasets. The proposed T2 statistic is a corollary to that originally developed for multivariate analyses and possesses two prominent statistical properties. First, our method takes into account multidimensional structure of microarray data. The utilization of the information hidden in gene interactions allows for finding genes whose differential expressions are not marginally detectable in univariate testing methods. Second, the statistic has a close relationship to discriminant analyses for classification of gene expression patterns. Our search algorithm sequentially maximizes gene expression difference/distance between two groups of genes. Including such a set of DEGs into initial feature variables may increase the power of classification rules. We validated our method by using a spike-in HGU95 dataset from Affymetrix. The utility of the new method was demonstrated by application to the analyses of gene expression patterns in human liver cancers and breast cancers. Extensive bioinformatics analyses and cross-validation of DEGs identified in the application datasets showed the significant advantages of our new algorithm.
Availability: The program for the method proposed in this article and supplementary material are available at http://orclinux.creighton.edu/hotelling/index.htm
Contact: deng{at}creighton.edu
Supplementary information: http://orclinux.creighton.edu/hotelling/index.htm
Received on October 13, 2004; revised on April 15, 2005; accepted on April 15, 2005
This article has been cited by other articles:
![]() |
V. Zuber and K. Strimmer Gene ranking and biomarker discovery under correlation Bioinformatics, October 15, 2009; 25(20): 2700 - 2707. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. V. Glazko and F. Emmert-Streib Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets Bioinformatics, September 15, 2009; 25(18): 2348 - 2354. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-A. Tsai and J. J. Chen Multivariate analysis of variance test for gene set analysis Bioinformatics, April 1, 2009; 25(7): 897 - 903. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Ucar, I. Neuhaus, P. Ross-MacDonald, C. Tilford, S. Parthasarathy, N. Siemers, and R.-R. Ji Construction of a reference gene association network from multiple profiling data: application to data analysis Bioinformatics, October 15, 2007; 23(20): 2716 - 2724. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. W. Kong, W. T. Pu, and P. J. Park A multivariate approach for integrating genome-wide expression data and biological knowledge Bioinformatics, October 1, 2006; 22(19): 2373 - 2380. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Verducci, V. F. Melfi, S. Lin, Z. Wang, S. Roy, and C. K. Sen Microarray analysis of gene expression: considerations in data mining and statistical treatment Physiol Genomics, May 16, 2006; 25(3): 355 - 363. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Xiong Non-linear tests for identifying differentially expressed genes or genetic networks Bioinformatics, April 15, 2006; 22(8): 919 - 923. [Abstract] [Full Text] [PDF] |
||||

