Bioinformatics Vol. 18 no. 5 2002
Pages 735-746
© 2002 Oxford University Press
Adaptive quality-based clustering of gene expression profiles
ESAT-SCD (SISTA), K.U.Leuven, Kasteelpark Arenberg 10, 3001 Leuven-Heverlee, Belgium
Received on March 4, 2001
; revised on September 19, 2001
; accepted on December 11, 2001
Motivation: Microarray experiments generate a considerable amount of data, which analyzed properly help us gain a huge amount of biologically relevant information about the global cellular behaviour. Clustering (grouping genes with similar expression profiles) is one of the first steps in data analysis of high-throughput expression measurements. A number of clustering algorithms have proved useful to make sense of such data. These classical algorithms, though useful, suffer from several drawbacks (e.g. they require the predefinition of arbitrary parameters like the number of clusters; they force every gene into a cluster despite a low correlation with other cluster members). In the following we describe a novel adaptive quality-based clustering algorithm that tackles some of these drawbacks.
Results: We propose a heuristic iterative two-step algorithm: First, we find in the high-dimensional representation of the data a sphere where the density of expression profiles is locally maximal (based on a preliminary estimate of the radius of the clusterquality-based approach). In a second step, we derive an optimal radius of the cluster (adaptive approach) so that only the significantly coexpressed genes are included in the cluster. This estimation is achieved by fitting a model to the data using an EM-algorithm. By inferring the radius from the data itself, the biologist is freed from finding an optimal value for this radius by trial-and-error. The computational complexity of this method is approximately linear in the number of gene expression profiles in the data set. Finally, our method is successfully validated using existing data sets.
Availability: http://www.esat.kuleuven.ac.be/~thijs/Work/Clustering.html
Contact: frank.desmet{at}esat.kuleuven.ac.be
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Gillard, V. Devos, M. J.J. Huysman, L. De Veylder, S. D'Hondt, C. Martens, P. Vanormelingen, K. Vannerum, K. Sabbe, V. A. Chepurnov, et al. Physiological and Transcriptomic Evidence for a Close Coupling between Chloroplast Ontogeny and Cell Cycle Progression in the Pennate Diatom Seminavis robusta Plant Physiology, November 1, 2008; 148(3): 1394 - 1411. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Pauwels, K. Morreel, E. De Witte, F. Lammertyn, M. Van Montagu, W. Boerjan, D. Inze, and A. Goossens Mapping methyl jasmonate-mediated transcriptional reprogramming of metabolism and cell cycle progression in cultured Arabidopsis cells PNAS, January 29, 2008; 105(4): 1380 - 1385. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Denolet, K. De Gendt, J. Allemeersch, K. Engelen, K. Marchal, P. Van Hummelen, K. A. L. Tan, R. M. Sharpe, P. T. K. Saunders, J. V. Swinnen, et al. The Effect of a Sertoli Cell-Selective Knockout of the Androgen Receptor on Testicular Gene Expression in Prepubertal Mice Mol. Endocrinol., February 1, 2006; 20(2): 321 - 334. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Grotkjaer, O. Winther, B. Regenberg, J. Nielsen, and L. K. Hansen Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm Bioinformatics, January 1, 2006; 22(1): 58 - 67. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Verlinden, G. Eelen, I. Beullens, M. Van Camp, P. Van Hummelen, K. Engelen, R. Van Hellemont, K. Marchal, B. De Moor, F. Foijer, et al. Characterization of the Condensin Component Cnap1 and Protein Kinase Melk as Novel E2F Target Genes Down-regulated by 1,25-Dihydroxyvitamin D3 J. Biol. Chem., November 11, 2005; 280(45): 37319 - 37330. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. BRANCO-PRICE, R. KAWAGUCHI, R. B. FERREIRA, and J. BAILEY-SERRES Genome-wide Analysis of Transcript Abundance and Translation in Arabidopsis Seedlings Subjected to Oxygen Deprivation Ann. Bot., September 1, 2005; 96(4): 647 - 660. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Handl, J. Knowles, and D. B. Kell Computational cluster validation in post-genomic data analysis Bioinformatics, August 1, 2005; 21(15): 3201 - 3212. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Florquin, Y. Saeys, S. Degroeve, P. Rouze, and Y. Van de Peer Large-scale structural analysis of the core promoter in mammalian and plant genomes Nucleic Acids Res., July 27, 2005; 33(13): 4255 - 4264. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. J. De Keersmaecker, K. Marchal, T. L. A. Verhoeven, K. Engelen, J. Vanderleyden, and C. S. Detweiler Microarray Analysis and Motif Detection Reveal New Targets of the Salmonella enterica Serovar Typhimurium HilA Regulatory Protein, Including hilA Itself J. Bacteriol., July 1, 2005; 187(13): 4381 - 4391. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Garten, S. Kaplan, and Y. Pilpel Extraction of transcription regulatory signals from genome-wide DNA-protein interaction data Nucleic Acids Res., January 31, 2005; 33(2): 605 - 615. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Himanen, M. Vuylsteke, S. Vanneste, S. Vercruysse, E. Boucheron, P. Alard, D. Chriqui, M. Van Montagu, D. Inze, and T. Beeckman Transcript profiling of early lateral root initiation PNAS, April 6, 2004; 101(14): 5146 - 5151. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Coessens, G. Thijs, S. Aerts, K. Marchal, F. De Smet, K. Engelen, P. Glenisson, Y. Moreau, J. Mathys, and B. De Moor INCLUSive: a web portal and service registry for microarray and regulatory sequence analysis Nucleic Acids Res., July 1, 2003; 31(13): 3468 - 3470. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes Plant Physiology, July 1, 2003; 132(3): 1162 - 1176. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ressom, D. Wang, and P. Natarajan Clustering gene expression data using adaptive double self-organizing map Physiol Genomics, June 24, 2003; 14(1): 35 - 46. [Abstract] [Full Text] [PDF] |
||||








