Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Sultan, M.
Right arrow Articles by Jurisica, I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sultan, M.
Right arrow Articles by Jurisica, I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 18 no. 90001 2002
Pages S111-S119
© 2002 Oxford University Press

Binary tree-structured vector quantization approach to clustering and visualizing microarray data

M. Sultan 1, D.A. Wigle 2, C.A. Cumbaa 1, M. Maziarz 3, J. Glasgow 4, M.S. Tsao 5 and I. Jurisica 1,3

1 Division of Cancer Informatics, Ontario Cancer Institute, 610 University Avenue, Toronto, Ontario, M5G 2M9, Canada
2 Samuel Lunenfeld Research Institute, Mount Sinai Hospital, 600 University Avenue, Toronto, Ontario, M5G 1X5, Canada
3 Department of Computer Science, University of Toronto, 6 King's College Road, Toronto, Ontario, M5S 3H5, Canada
4 Department of Computing and Information Science, Queen's University, Kingston, Ontario, K7L 3N6, Canada
5 Division of Cellular and Molecular Biology, Ontario Cancer Institute, 610 University Avenue, Toronto, Ontario, M5G 2M9, Canada

Received on January 24, 2002 ; revised on April 1, 2002 ; accepted on April 1, 2002

Motivation: With the increasing number of gene expression databases, the need for more powerful analysis and visualization tools is growing. Many techniques have successfully been applied to unravel latent similarities among genes and/or experiments. Most of the current systems for microarray data analysis use statistical methods, hierarchical clustering, self-organizing maps, support vector machines, or k-means clustering to organize genes or experiments into ‘meaningful’ groups. Without prior explicit bias almost all of these clustering methods applied to gene expression data not only produce different results, but may also produce clusters with little or no biological relevance. Of these methods, agglomerative hierarchical clustering has been the most widely applied, although many limitations have been identified.

Results: Starting with a systematic comparison of the underlying theories behind clustering approaches, we have devised a technique that combines tree-structured vector quantization and partitive k-means clustering (BTSVQ). This hybrid technique has revealed clinically relevant clusters in three large publicly available data sets. In contrast to existing systems, our approach is less sensitive to data preprocessing and data normalization. In addition, the clustering results produced by the technique have strong similarities to those of self-organizing maps (SOMs). We discuss the advantages and the mathematical reasoning behind our approach.

Availability: The BTSVQ system is implemented in Matlab R12 using the SOM toolbox for the visualization and preprocessing of the data http://www.cis.hut.fi/projects/somtoolbox/ BTSVQ is available for non-commercial use http://www.uhnres.utoronto.ca/ta3/BTSVQ

Contact: ij{at}uhnres.utoronto.ca

Keywords: microarray data clustering and visulization; self-organizing maps, partitive k-means clustering; lung cancer.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
JCOHome page
S. K. Lau, P. C. Boutros, M. Pintilie, F. H. Blackhall, C.-Q. Zhu, D. Strumpf, M. R. Johnston, G. Darling, S. Keshavjee, T. K. Waddell, et al.
Three-Gene Prognostic Classifier for Early-Stage Non Small-Cell Lung Cancer
J. Clin. Oncol., December 10, 2007; 25(35): 5562 - 5569.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
D. W. Mount and R. Pandey
Using bioinformatics and genome analysis for new therapeutic interventions
Mol. Cancer Ther., October 1, 2005; 4(10): 1636 - 1643.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
M. Barrios-Rodiles, K. R. Brown, B. Ozdamar, R. Bose, Z. Liu, R. S. Donovan, F. Shinjo, Y. Liu, J. Dembowy, I. W. Taylor, et al.
High-Throughput Mapping of a Dynamic Signaling Network in Mammalian Cells
Science, March 11, 2005; 307(5715): 1621 - 1625.
[Abstract] [Full Text] [PDF]


Home page
Mol Hum ReprodHome page
B.M. Acton, A. Jurisicova, I. Jurisica, and R.F. Casper
Alterations in mitochondrial membrane potential during preimplantation stages of mouse and human embryo development
Mol. Hum. Reprod., January 1, 2004; 10(1): 23 - 32.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Breitkreutz, L. Boucher, B.-J. Breitkreutz, M. Sultan, I. Jurisica, and M. Tyers
Phenotypic and Transcriptional Plasticity Directed by a Yeast Mitogen-Activated Protein Kinase Network
Genetics, November 1, 2003; 165(3): 997 - 1015.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
A. Evangelou, M. Letarte, I. Jurisica, M. Sultan, K. J. Murphy, B. Rosen, and T. J. Brown
Loss of Coordinated Androgen Regulation in Nonmalignant Ovarian Epithelial Cells with BRCA1/2 Mutations and Ovarian Cancer Cells
Cancer Res., May 15, 2003; 63(10): 2416 - 2424.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.