Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (60)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Medvedovic, M.
Right arrow Articles by Sivaganesan, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Medvedovic, M.
Right arrow Articles by Sivaganesan, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 18 no. 9 2002
Pages 1194-1206
© 2002 Oxford University Press

Bayesian infinite mixture model based clustering of gene expression profiles

Mario Medvedovic 1 and Siva Sivaganesan 2,*

1 Center for Genome Information, Department of Environmental Health, University of Cincinnati Medical Center, 3223 Eden Av. ML 56, Cincinnati, OH~45267-0056, USA
2 Mathematical Sciences Department, University of Cincinnati, 4221 Fox Hollow Dr, Cincinnati, OH 45241, USA

Received on November 9, 2001 ; revised on January 22, 2002 and March 4, 2002 ; accepted on March 12, 2002

Motivation: The biologic significance of results obtained through cluster analyses of gene expression data generated in microarray experiments have been demonstrated in many studies. In this article we focus on the development of a clustering procedure based on the concept of Bayesian model-averaging and a precise statistical model of expression data.

Results: We developed a clustering procedure based on the Bayesian infinite mixture model and applied it to clustering gene expression profiles. Clusters of genes with similar expression patterns are identified from the posterior distribution of clusterings defined implicitly by the stochastic data-generation model. The posterior distribution of clusterings is estimated by a Gibbs sampler. We summarized the posterior distribution of clusterings by calculating posterior pairwise probabilities of co-expression and used the complete linkage principle to create clusters. This approach has several advantages over usual clustering procedures. The analysis allows for incorporation of a reasonable probabilistic model for generating data. The method does not require specifying the number of clusters and resulting optimal clustering is obtained by averaging over models with all possible numbers of clusters. Expression profiles that are not similar to any other profile are automatically detected, the method incorporates experimental replicates, and it can be extended to accommodate missing data. This approach represents a qualitative shift in the model-based cluster analysis of expression data because it allows for incorporation of uncertainties involved in the model selection in the final assessment of confidence in similarities of expression profiles. We also demonstrated the importance of incorporating the information on experimental variability into the clustering model.

Availability: The MS WindowsTM based program implementing the Gibbs sampler and supplemental material is available at http://homepages.uc.edu/~medvedm/BioinformaticsSupplement.htm

Contact: medvedm{at}email.uc.edu

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
RNAHome page
H. W. Trask, R. Cowper-Sal-lari, M. A. Sartor, J. Gui, C. V. Heath, J. Renuka, A.-J. Higgins, P. Andrews, M. Korc, J. H. Moore, et al.
Microarray analysis of cytoplasmic versus whole cell RNA reveals a considerable number of missed and false positive mRNAs
RNA, October 1, 2009; 15(10): 1917 - 1928.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F. Achcar, J.-M. Camadro, and D. Mestivier
AutoClass@IJM: a powerful tool for Bayesian classification of heterogeneous data in biology
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W63 - W67.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Rogers, M. Girolami, W. Kolch, K. M. Waters, T. Liu, B. Thrall, and H. S. Wiley
Investigating the correspondence between transcriptomic and proteomic expression profiles using coupled cluster models
Bioinformatics, December 15, 2008; 24(24): 2894 - 2900.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Joshi, Y. Van de Peer, and T. Michoel
Analysis of a Gibbs sampler method for model-based clustering of gene expression data
Bioinformatics, January 15, 2008; 24(2): 176 - 183.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Yuan and K.-C. Li
Context-dependent clustering for dynamic cellular state modeling of microarray gene expression
Bioinformatics, November 15, 2007; 23(22): 3039 - 3047.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Respir. Cell Mol. Bio.Home page
W. D. Hardie, T. R. Korfhagen, M. A. Sartor, A. Prestridge, M. Medvedovic, T. D. Le Cras, M. Ikegami, S. C. Wesselkamper, C. Davidson, M. Dietsch, et al.
Genomic Profile of Matrix and Vasculature Remodeling in TGF-{alpha} Induced Pulmonary Fibrosis
Am. J. Respir. Cell Mol. Biol., September 1, 2007; 37(3): 309 - 321.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Zhu, Y. Li, and H. Li
Multivariate correlation estimator for inferring functional relationships from replicated genome-wide data
Bioinformatics, September 1, 2007; 23(17): 2298 - 2305.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. C. Tseng
Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data
Bioinformatics, September 1, 2007; 23(17): 2247 - 2255.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Physiol. Lung Cell. Mol. Physiol.Home page
E. L. Kramer, G. H. Deutsch, M. A. Sartor, W. D. Hardie, M. Ikegami, T. R. Korfhagen, and T. D. Le Cras
Perinatal increases in TGF-{alpha} disrupt the saccular phase of lung morphogenesis and cause remodeling: microarray analysis
Am J Physiol Lung Cell Mol Physiol, August 1, 2007; 293(2): L314 - L327.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Lu, X. He, and S. Zhong
Cross-species microarray analysis with the OSCAR system suggests an INSR->Pax6->NQO1 neuro-protective pathway in aging and Alzheimer's disease
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W105 - W114.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Thalamuthu, I. Mukhopadhyay, X. Zheng, and G. C. Tseng
Evaluation and comparison of gene clustering methods in microarray analysis
Bioinformatics, October 1, 2006; 22(19): 2405 - 2412.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Z. S. Qin
Clustering microarray gene expression data using weighted Chinese restaurant process
Bioinformatics, August 15, 2006; 22(16): 1988 - 1997.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. K. Ng, G. J. McLachlan, K. Wang, L. Ben-Tovim Jones, and S.-W. Ng
A Mixture model with random-effects components for clustering correlated gene-expression profiles
Bioinformatics, July 15, 2006; 22(14): 1745 - 1752.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Liu, S. Sivaganesan, K. Y. Yeung, J. Guo, R. E. Bumgarner, and M. Medvedovic
Context-specific infinite mixtures for clustering gene expression profiles across diverse microarray dataset
Bioinformatics, July 15, 2006; 22(14): 1737 - 1744.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Huang and W. Pan
Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data
Bioinformatics, May 15, 2006; 22(10): 1259 - 1268.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
W. Pan
Incorporating gene functions as priors in model-based clustering of microarray gene expression data
Bioinformatics, April 1, 2006; 22(7): 795 - 801.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. A. Sartor, A. M. Zorn, J. A. Schwanekamp, D. Halbleib, S. Karyala, M. L. Howell, G. E. Dean, M. Medvedovic, and C. R. Tomlinson
A new method to remove hybridization bias for interspecies comparison of global gene expression profiles uncovers an association between mRNA sequence divergence and differential gene expression in Xenopus
Nucleic Acids Res., January 5, 2006; 34(1): 185 - 200.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.