Bioinformatics Vol. 18 no. 90001 2002
Pages S111-S119
© 2002 Oxford University Press
Binary tree-structured vector quantization approach to clustering and visualizing microarray data
1 Division of Cancer Informatics, Ontario Cancer Institute, 610 University Avenue,
Toronto, Ontario, M5G 2M9, Canada
2 Samuel Lunenfeld Research Institute, Mount Sinai Hospital, 600 University Avenue,
Toronto, Ontario, M5G 1X5, Canada
3 Department of Computer Science, University of Toronto, 6 King's College Road,
Toronto, Ontario, M5S 3H5, Canada
4 Department of Computing and Information Science, Queen's University,
Kingston, Ontario, K7L 3N6, Canada
5 Division of Cellular and Molecular Biology, Ontario Cancer Institute,
610 University Avenue, Toronto, Ontario, M5G 2M9, Canada
Received on January 24, 2002
; revised on April 1, 2002
; accepted on April 1, 2002
Motivation: With the increasing number of gene expression databases, the need for more powerful analysis and visualization tools is growing. Many techniques have successfully been applied to unravel latent similarities among genes and/or experiments. Most of the current systems for microarray data analysis use statistical methods, hierarchical clustering, self-organizing maps, support vector machines, or k-means clustering to organize genes or experiments into meaningful groups. Without prior explicit bias almost all of these clustering methods applied to gene expression data not only produce different results, but may also produce clusters with little or no biological relevance. Of these methods, agglomerative hierarchical clustering has been the most widely applied, although many limitations have been identified.
Results: Starting with a systematic comparison of the underlying theories behind clustering approaches, we have devised a technique that combines tree-structured vector quantization and partitive k-means clustering (BTSVQ). This hybrid technique has revealed clinically relevant clusters in three large publicly available data sets. In contrast to existing systems, our approach is less sensitive to data preprocessing and data normalization. In addition, the clustering results produced by the technique have strong similarities to those of self-organizing maps (SOMs). We discuss the advantages and the mathematical reasoning behind our approach.
Availability: The BTSVQ system is implemented in Matlab R12 using the SOM toolbox for the visualization and preprocessing of the data http://www.cis.hut.fi/projects/somtoolbox/ BTSVQ is available for non-commercial use http://www.uhnres.utoronto.ca/ta3/BTSVQ
Contact: ij{at}uhnres.utoronto.ca
Keywords: microarray data clustering and visulization; self-organizing maps, partitive k-means clustering; lung cancer.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. K. Lau, P. C. Boutros, M. Pintilie, F. H. Blackhall, C.-Q. Zhu, D. Strumpf, M. R. Johnston, G. Darling, S. Keshavjee, T. K. Waddell, et al. Three-Gene Prognostic Classifier for Early-Stage Non Small-Cell Lung Cancer J. Clin. Oncol., December 10, 2007; 25(35): 5562 - 5569. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Mount and R. Pandey Using bioinformatics and genome analysis for new therapeutic interventions Mol. Cancer Ther., October 1, 2005; 4(10): 1636 - 1643. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Barrios-Rodiles, K. R. Brown, B. Ozdamar, R. Bose, Z. Liu, R. S. Donovan, F. Shinjo, Y. Liu, J. Dembowy, I. W. Taylor, et al. High-Throughput Mapping of a Dynamic Signaling Network in Mammalian Cells Science, March 11, 2005; 307(5715): 1621 - 1625. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.M. Acton, A. Jurisicova, I. Jurisica, and R.F. Casper Alterations in mitochondrial membrane potential during preimplantation stages of mouse and human embryo development Mol. Hum. Reprod., January 1, 2004; 10(1): 23 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Breitkreutz, L. Boucher, B.-J. Breitkreutz, M. Sultan, I. Jurisica, and M. Tyers Phenotypic and Transcriptional Plasticity Directed by a Yeast Mitogen-Activated Protein Kinase Network Genetics, November 1, 2003; 165(3): 997 - 1015. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Evangelou, M. Letarte, I. Jurisica, M. Sultan, K. J. Murphy, B. Rosen, and T. J. Brown Loss of Coordinated Androgen Regulation in Nonmalignant Ovarian Epithelial Cells with BRCA1/2 Mutations and Ovarian Cancer Cells Cancer Res., May 15, 2003; 63(10): 2416 - 2424. [Abstract] [Full Text] [PDF] |
||||





