Bioinformatics Vol. 18 no. 11 2002
Pages 1462-1469
© 2002 Oxford University Press
Methods for assessing reproducibility of clustering patterns observed in analyses of microarray data

1 National Cancer Institute, Biometric Research Branch, DCTD, NIH, Bethesda, MD 20892-7434
2 The Emmes Corporation, Rockville, MD 20850, USA
Received on June 29, 2001; revised on February 6, 2002; accepted on April 17, 2002
Motivation: Recent technological advances such as cDNA microarray technology have made it possible to simultaneously interrogate thousands of genes in a biological specimen. A cDNA microarray experiment produces a gene expression profile. Interest often lies in discovering novel subgroupings, or clusters, of specimens based on their profiles, for example to identify new tumor taxonomies. Cluster analysis techniques such as hierarchical clustering and self-organizing maps have frequently been used for investigating structure in microarray data. However, clustering algorithms always detect clusters, even on random data, and it is easy to misinterpret the results without some objective measure of the reproducibility of the clusters.
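To illustrate the last point, the following minimal Python sketch (not taken from the paper) runs a naive average-linkage hierarchical clustering on pure Gaussian noise and still returns the requested number of clusters. The sample sizes, the cut at three clusters, and the helper names `euclidean` and `average_linkage` are assumptions chosen for illustration only.

```python
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length profiles."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def average_linkage(points, k):
    """Naive agglomerative clustering with average linkage.

    Starts with one cluster per point and merges the two closest
    clusters until k remain. Returns k lists of point indices.
    """
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Mean pairwise distance between the two clusters.
                d = sum(euclidean(points[a], points[b])
                        for a in clusters[i] for b in clusters[j])
                d /= len(clusters[i]) * len(clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return clusters


rng = random.Random(0)
# Hypothetical data: 20 "specimens" x 50 "genes" of pure noise,
# so there is no real group structure to find.
noise_profiles = [[rng.gauss(0.0, 1.0) for _ in range(50)] for _ in range(20)]

noise_clusters = average_linkage(noise_profiles, k=3)
print("cluster sizes on random data:", [len(c) for c in noise_clusters])
```

The algorithm partitions the noise into three groups simply because it was asked to, which is why an independent measure of reproducibility is needed before interpreting such output.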
Results: We present statistical methods for testing for overall clustering of gene expression profiles, and we define easily interpretable measures of cluster-specific reproducibility that facilitate understanding of the clustering structure. We apply these methods to elucidate structure in cDNA microarray gene expression profiles obtained on melanoma tumors and on prostate specimens.
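The abstract does not define the reproducibility measures. One generic way to gauge cluster-specific reproducibility is to perturb the data with noise, re-cluster, and record what fraction of specimen pairs from each original cluster stay together. The sketch below illustrates that idea under stated assumptions: the function name `pair_retention`, the noise level `sigma`, the number of perturbations, and the simulated data are all hypothetical, and the code should not be read as the authors' implementation or as the BRB ArrayTools interface.

```python
import random
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two equal-length profiles."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def average_linkage(points, k):
    """Naive average-linkage clustering; returns k lists of indices."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = sum(euclidean(points[a], points[b])
                        for a in clusters[i] for b in clusters[j])
                d /= len(clusters[i]) * len(clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


def pair_retention(points, k, sigma, n_perturb, seed=0):
    """Per-cluster fraction of specimen pairs that remain co-clustered
    after Gaussian noise (sd = sigma) is added and the data re-clustered,
    averaged over n_perturb perturbations. Singleton clusters get None.
    """
    rng = random.Random(seed)
    original = average_linkage(points, k)
    totals = [0.0] * len(original)
    for _ in range(n_perturb):
        noisy = [[x + rng.gauss(0.0, sigma) for x in p] for p in points]
        label = {}
        for c_id, members in enumerate(average_linkage(noisy, k)):
            for m in members:
                label[m] = c_id
        for c_id, members in enumerate(original):
            pairs = list(combinations(members, 2))
            if pairs:
                kept = sum(label[a] == label[b] for a, b in pairs)
                totals[c_id] += kept / len(pairs)
    scores = [t / n_perturb if len(m) > 1 else None
              for t, m in zip(totals, original)]
    return original, scores


rng = random.Random(1)
# Hypothetical data: two well-separated groups of 6 specimens, 10 genes each.
group_a = [[rng.gauss(0.0, 0.5) for _ in range(10)] for _ in range(6)]
group_b = [[rng.gauss(5.0, 0.5) for _ in range(10)] for _ in range(6)]
profiles = group_a + group_b

clusters, scores = pair_retention(profiles, k=2, sigma=0.5, n_perturb=20)
print("clusters:", [sorted(c) for c in clusters])
print("pair-retention scores:", scores)
```

Scores near 1 indicate that a cluster survives perturbation largely intact, while low scores flag clusters that may be artifacts of noise. The appropriate noise level depends on the experimental variability of the platform and would need to be estimated from the data.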
Availability: Software to implement these methods is contained in the BRB ArrayTools microarray analysis package, available from http://linus.nci.nih.gov./BRB-ArrayTools.html
Contact: lm5h{at}nih.gov
* To whom correspondence should be addressed.
Present address: Human Genome Sciences Inc., Rockville, MD 20850, USA