Bioinformatics Vol. 19 no. 4 2003
Pages 474-482
© 2003 Oxford University Press
Clustering of time-course gene expression data using a mixed-effects model with B-splines
1 Rowe Program in Human Genetics,
Department of Medicine, University of California, Davis, CA 95616-8500, USA
2 Department of Statistics, Peking University,
Beijing, Peoples Republic of China
Received on April 20, 2002
; revised on October 8, 2002
; accepted on October 10, 2002
Motivation: Time-course gene expression data are often measured to study dynamic biological systems and gene regulatory networks. To account for time dependency of the gene expression measurements over time and the noisy nature of the microarray data, the mixed-effects model using B-splines was introduced. This paper further explores such mixed-effects model in analyzing the time-course gene expression data and in performing clustering of genes in a mixture model framework.
Results: After fitting the mixture model in the framework of the mixed-effects model using an EM algorithm, we obtained the smooth mean gene expression curve for each cluster. For each gene, we obtained the best linear unbiased smooth estimate of its gene expression trajectory over time, combining data from that gene and other genes in the same cluster. Simulated data indicate that the methods can effectively cluster noisy curves into clusters differing in either the shapes of the curves or the times to the peaks of the curves. We further demonstrate the proposed method by clustering the yeast genes based on their cell cycle gene expression data and the human genes based on the temporal transcriptional response of fibroblasts to serum. Clear periodic patterns and varying times to peaks are observed for different clusters of the cell-cycle regulated genes. Results of the analysis of the human fibroblasts data show seven distinct transcriptional response profiles with biological relevance.
Availability: Matlab programs are available on request from the authors.
Supplementary Information: http://dna/ucdavis.edu/~hli/bioinforsupp.pdf
Contact: hli{at}ucdavis.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
R. Tang and H.-G. Muller Time-synchronized clustering of gene expression trajectories Biostat., May 22, 2008; (2008) kxn011v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sahoo, D. L. Dill, R. Tibshirani, and S. K. Plevritis Extracting binary signals from microarray time-course data Nucleic Acids Res., June 28, 2007; 35(11): 3705 - 3712. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wang, G. Chen, and H. Li Group SCAD regression analysis for microarray time course gene expression data Bioinformatics, June 15, 2007; 23(12): 1486 - 1494. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Cheng, X. Ma, X. Yan, F. Sun, and L. M. Li MARD: a new method to detect differential gene expression in treatment-control time courses Bioinformatics, November 1, 2006; 22(21): 2650 - 2657. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Ng, G. J. McLachlan, K. Wang, L. Ben-Tovim Jones, and S.-W. Ng A Mixture model with random-effects components for clustering correlated gene-expression profiles Bioinformatics, July 15, 2006; 22(14): 1745 - 1752. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Conesa, M. J. Nueda, A. Ferrer, and M. Talon maSigPro: a method to identify significantly differential expression profiles in time-course microarray experiments Bioinformatics, May 1, 2006; 22(9): 1096 - 1102. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Pan Incorporating gene functions as priors in model-based clustering of microarray gene expression data Bioinformatics, April 1, 2006; 22(7): 795 - 801. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Ma, C. I. Castillo-Davis, W. Zhong, and J. S. Liu A data-driven clustering method for time course gene expression data Nucleic Acids Res., March 1, 2006; 34(4): 1261 - 1269. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Leng and H.-G. Muller Classification using functional data analysis for temporal gene expression data Bioinformatics, January 1, 2006; 22(1): 68 - 76. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Heard, C. C. Holmes, D. A. Stephens, D. J. Hand, and G. Dimopoulos Bayesian coclustering of Anopheles gene expression time series: Study of immune defense response to multiple experimental challenges PNAS, November 22, 2005; 102(47): 16939 - 16944. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bhasi, A. Forrest, and M. Ramanathan SPLINDID: a semi-parametric, model-based method for obtaining transcription rates and gene regulation parameters from genomic and proteomic expression profiles Bioinformatics, October 15, 2005; 21(20): 3873 - 3879. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Storey, W. Xiao, J. T. Leek, R. G. Tompkins, and R. W. Davis Significance analysis of time course microarray experiments PNAS, September 6, 2005; 102(36): 12837 - 12842. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Tabibiazar, R. A. Wagner, E. A. Ashley, J. Y. King, R. Ferrara, J. M. Spin, D. A. Sanan, B. Narasimhan, R. Tibshirani, P. S. Tsao, et al. Signature patterns of gene expression in mouse atherosclerosis and their correlation to human coronary disease Physiol Genomics, July 14, 2005; 22(2): 213 - 226. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Eckel-Passow, A. Hoering, T. M. Therneau, and I. Ghobrial Experimental Design and Analysis of Antibody Microarrays: Applying Methods from cDNA Arrays Cancer Res., April 15, 2005; 65(8): 2985 - 2989. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Liu, D. M. Umbach, S. D. Peddada, L. Li, P. W. Crockett, and C. R. Weinberg A random-periods model for expression of cell-cycle genes PNAS, May 11, 2004; 101(19): 7240 - 7245. [Abstract] [Full Text] [PDF] |
||||





