Bioinformatics Advance Access published online on April 28, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti466
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Oncology, Cancer Genomics Program, Hutchison-MRC Research Centre, University of Cambridge, Hills Road, Cambridge CB2 2XZ, UK
* To whom correspondence should be addressed.
Motivation: Accurate subcategorisation of tumour types through gene expression profiling requires analytical techniques that estimate the number of categories or clusters rigorously and reliably. Parametric mixture modelling provides a natural setting in which to address this problem. Results: We compare a criterion for model selection that is derived from a variational Bayesian framework with a popular alternative based on the Bayesian information criterion (BIC). Using simulated data we show that the variational Bayesian method is more accurate in finding the true number of clusters in situations that are relevant to current and future microarray studies. We also compare the two criteria using publicly available tumour microarray data sets and show that the variational Bayesian method is moresensitive to capturing biologically relevant structure. Availability: We have developed an R-package vabayelMix, available from www.cran.r-project.org, that implements the algorithm as described here.
Received October 11, 2004
Revised April 23, 2005
Accepted April 23, 2005
Article
A variational Bayesian mixture modelling framework for cluster analysis of gene expression data
Andrew E. Teschendorff, E-mail: aet21{at}cam.ac.uk
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Nagamine and Y. Sakakibara Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data Bioinformatics, August 1, 2007; 23(15): 2004 - 2012. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Teschendorff, A. Naderi, N. L. Barbosa-Morais, and C. Caldas PACK: Profile Analysis using Clustering and Kurtosis to find molecular classifiers in cancer Bioinformatics, September 15, 2006; 22(18): 2269 - 2275. [Abstract] [Full Text] [PDF] |
||||
![]() |
D.-S. Huang and C.-H. Zheng Independent component analysis-based penalized discriminant method for tumor classification using gene expression data Bioinformatics, August 1, 2006; 22(15): 1855 - 1862. [Abstract] [Full Text] [PDF] |
||||
