Bioinformatics Vol. 19 no. 4 2003
Pages 467-473
© 2003 Oxford University Press
A multivariate approach applied to microarray data for identification of genes with cell cycle-coupled transcription


Research group for Chemometrics, Organic Chemistry, Department of Chemistry, Umeå University, Sweden
Received on June 9, 2002
; revised on September 13, 2002
; accepted on October 6, 2002
We have analyzed microarray data using a modeling approach based on the multivariate statistical method partial least squares (PLS) regression to identify genes with periodic fluctuations in expression levels coupled to the cell cycle in the budding yeast, Saccharomyces cerevisiae. PLS has major advantages for analyzing microarray data since it can model data sets with large numbers of variables and with few observations.
A response model was derived describing the expression profile over time expected for periodically transcribed genes, and was used to identify budding yeast transcripts with similar profiles. PLS was then used to interpret the importance of the variables (genes) for the model, yielding a ranking list of how well the genes fitted the generated model. Application of an appropriate cutoff value, calculated from randomized data, allows the identification of genes whose expression appears to be synchronized with cell cycling. Our approach also provides information about the stage in the cell cycle where their transcription peaks.
Three synchronized yeast cell microarray data sets were analyzed, both separately and combined. Cell cycle-coupled periodicity was suggested for 455 of the 6,178 transcripts monitored in the combined data set, at a significance level of 0.5%. Among the candidates, 85% of the known periodic transcripts were included. Analysis of the three data sets separately yielded similar ranking lists, showing that the method is robust.
Contact: anders.berglund{at}chem.umu.se
Supplementary material: Available at: http://www.chem.umu.se/dep/orgchem/forskning/chemometrics/bioinformatic.stm
* To whom correspondence should be addressed.
The authors wish it to be known that, in their
opinion, the first two authors should be regarded as joint First
Authors.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Quan, Z.-L. Ji, X. Wang, A. M. Tartakoff, and T. Tao Evolutionary and Transcriptional Analysis of Karyopherin {beta} Superfamily Proteins Mol. Cell. Proteomics, July 1, 2008; 7(7): 1254 - 1269. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Futschik and H. Herzel Are we overestimating the number of cell-cycling genes? The impact of background models on time-series analysis Bioinformatics, April 15, 2008; 24(8): 1063 - 1069. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. P. Gauthier, M. E. Larsen, R. Wernersson, U. de Lichtenberg, L. J. Jensen, S. Brunak, and T. S. Jensen Cyclebase.org a comprehensive multi-organism online database of cell-cycle experiments Nucleic Acids Res., January 11, 2008; 36(suppl_1): D854 - D859. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pierrou, P. Broberg, R. A. O'Donnell, K. Pawlowski, R. Virtala, E. Lindqvist, A. Richter, S. J. Wilson, G. Angco, S. Moller, et al. Expression of Genes Involved in Oxidative Stress Responses in Airway Epithelial Cells of Smokers with Chronic Obstructive Pulmonary Disease Am. J. Respir. Crit. Care Med., March 15, 2007; 175(6): 577 - 586. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.-L. Boulesteix and K. Strimmer Partial least squares: a versatile tool for the analysis of high-dimensional genomic data Brief Bioinform, January 1, 2007; 8(1): 32 - 44. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Qiu, Z. J. Wang, and K. J. R. Liu Polynomial model approach for resynchronization analysis of cell-cycle gene expression data Bioinformatics, April 15, 2006; 22(8): 959 - 966. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. F. Glynn, J. Chen, and A. R. Mushegian Detecting periodic patterns in unevenly spaced gene expression time series using Lomb-Scargle periodograms Bioinformatics, February 1, 2006; 22(3): 310 - 316. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. de Lichtenberg, L. J. Jensen, A. Fausboll, T. S. Jensen, P. Bork, and S. Brunak Comparison of computational methods for the identification of cell cycle-regulated genes Bioinformatics, April 1, 2005; 21(7): 1164 - 1171. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-h. Taguchi and Y. Oono Relational patterns of gene expression via non-metric multidimensional scaling analysis Bioinformatics, March 15, 2005; 21(6): 730 - 740. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Liu, D. M. Umbach, S. D. Peddada, L. Li, P. W. Crockett, and C. R. Weinberg A random-periods model for expression of cell-cycle genes PNAS, May 11, 2004; 101(19): 7240 - 7245. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Lu, W. Zhang, Z. S. Qin, K. E. Kwast, and J. S. Liu Statistical resynchronization and Bayesian detection of periodically expressed genes Nucleic Acids Res., January 22, 2004; 32(2): 447 - 455. [Abstract] [Full Text] [PDF] |
||||





