Bioinformatics Vol. 18 no. 12 2002
Pages 1600-1608
© 2002 Oxford University Press
Between-group analysis of microarray data
1 Department of Biochemistry, University College Cork,
Cork, Ireland
2 Laboratoire de Biométrie et Biologie Évolutive,
UMR CNRS No. 5558, Université Claude BernardLyon 1, 43, blvd. du 11 Novembre 1918,
69622 Villeurbanne Cedex, France
Received on March 12, 2002
; revised on May 20, 2002
; accepted on May 24, 2002
Motivation: Most supervised classification methods are limited by the requirement for more cases than variables. In microarray data the number of variables (genes) far exceeds the number of cases (arrays), and thus filtering and pre-selection of genes is required. We describe the application of Between Group Analysis (BGA) to the analysis of microarray data. A feature of BGA is that it can be used when the number of variables (genes) exceeds the number of cases (arrays). BGA is based on carrying out an ordination of groups of samples, using a standard method such as Correspondence Analysis (COA), rather than an ordination of the individual microarray samples. As such, it can be viewed as a method of carrying out COA with grouped data.
Results: We illustrate the power of the method using two cancer data sets. In both cases, we can quickly and accurately classify test samples from any number of specified a priori groups and identify the genes which characterize these groups. We obtained very high rates of correct classification, as determined by jack-knife or validation experiments with training and test sets. The results are comparable to those from other methods in terms of accuracy but the power and flexibility of BGA make it an especially attractive method for the analysis of microarray cancer data.
Availability: The methods described are implemented in ADE-4 which runs under MacOS and Windows, and is freely available at http://pbil.univ-lyon1.fr/ADE-4/. All scripts are available on request.
Contact: A.Culhane{at}ucc.ie
Supplementary information: Supplementary figures and tables are available at http://bioinfo.ucc.ie/BGA/.
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. Perot, J. Derre, J.-M. Coindre, F. Tirode, C. Lucchesi, O. Mariani, L. Gibault, L. Guillou, P. Terrier, and A. Aurias Strong Smooth Muscle Differentiation Is Dependent on Myocardin Gene Amplification in Most Human Retroperitoneal Leiomyosarcomas Cancer Res., March 15, 2009; 69(6): 2269 - 2278. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Remigi, A. Faye, A. Kane, M. Deruaz, J. Thioulouse, M. Cissoko, Y. Prin, A. Galiana, B. Dreyfus, and R. Duponnois The Exotic Legume Tree Species Acacia holosericea Alters Microbial Soil Functionalities and the Structure of the Arbuscular Mycorrhizal Community Appl. Envir. Microbiol., March 1, 2008; 74(5): 1485 - 1493. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Thum, P. Galuppo, C. Wolf, J. Fiedler, S. Kneitz, L. W. van Laake, P. A. Doevendans, C. L. Mummery, J. Borlak, A. Haverich, et al. MicroRNAs in the Human Heart: A Clue to Fetal Gene Reprogramming in Heart Failure Circulation, July 17, 2007; 116(3): 258 - 267. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Stafford and M. Brun Three methods for optimization of cross-laboratory and cross-platform microarray expression data Nucleic Acids Res., May 11, 2007; 35(10): e72 - e72. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. B. Jeffery, S. F. Madden, P. A. McGettigan, G. Perriere, A. C. Culhane, and D. G. Higgins Integrating transcription factor binding site information with gene expression datasets Bioinformatics, February 1, 2007; 23(3): 298 - 305. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. W. Kong, W. T. Pu, and P. J. Park A multivariate approach for integrating genome-wide expression data and biological knowledge Bioinformatics, October 1, 2006; 22(19): 2373 - 2380. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Liang, Y. Li, X. Be, S. Howes, and W. Liu Detecting and profiling tissue-selective genes Physiol Genomics, September 14, 2006; 26(2): 158 - 162. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bibikova, E. Chudin, B. Wu, L. Zhou, E. W. Garcia, Y. Liu, S. Shin, T. W. Plaia, J. M. Auerbach, D. E. Arking, et al. Human embryonic stem cells have a unique epigenetic signature Genome Res., September 1, 2006; 16(9): 1075 - 1083. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-Q. Wang and K. Li A New Algorithm Based on Support Vectors and Penalty Strategy for Identifying Key Genes Related with Cancer Transactions of the Institute of Measurement and Control, August 1, 2006; 28(3): 263 - 273. [Abstract] [PDF] |
||||
![]() |
A. C. Culhane, J. Thioulouse, G. Perriere, and D. G. Higgins MADE4: an R package for multivariate analysis of gene expression data Bioinformatics, June 1, 2005; 21(11): 2789 - 2790. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. McArdle, M. McDermott, R. Purcell, D. Grehan, A. O'Meara, F. Breatnach, D. Catchpoole, A. C. Culhane, I. Jeffery, W. M. Gallagher, et al. Oligonucleotide microarray analysis of gene expression in neuroblastoma displaying loss of chromosome 11q Carcinogenesis, September 1, 2004; 25(9): 1599 - 1609. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Kapushesky, P. Kemmeren, A. C. Culhane, S. Durinck, J. Ihmels, C. Korner, M. Kull, A. Torrente, U. Sarkans, J. Vilo, et al. Expression Profiler: next generation--an online platform for analysis of microarray data Nucleic Acids Res., July 1, 2004; 32(suppl_2): W465 - W470. [Abstract] [Full Text] [PDF] |
||||








