Bioinformatics Vol. 18 no. 1 2002
Pages 51-60
© 2002 Oxford University Press
Linear modes of gene expression determined by independent component analysis
Theoretische Biophysik, Institut für Biologie, Humboldt-Universität zu Berlin, Invalidenstraße 42, 10115 Berlin, Germanyand Max-Planck-Institut für molekulare Genetik, Ihnestraße 73, 14195 Berlin, Germany
Received on March 22, 2001
; revised on July 20, 2001
; accepted on August 3, 2001
Motivation: The expression of genes is controlled by specific combinations of cellular variables. We applied Independent Component Analysis (ICA) to gene expression data, deriving a linear model based on hidden variables, which we term expression modes. The expression of each gene is a linear function of the expression modes, where, according to the ICA model, the linear influences of different modes show a minimal statistical dependence, and their distributions deviate sharply from the normal distribution.
Results: Studying cell cycle-related gene expression in yeast, we found that the dominant expression modes could be related to distinct biological functions, such as phases of the cell cycle or the mating response. Analysis of human lymphocytes revealed modes that were related to characteristic differences between cell types. With both data sets, the linear influences of the dominant modes showed distributions with large tails, indicating the existence of specifically up- and downregulated target genes. The expression modes and their influences can be used to visualize the samples and genes in low-dimensional spaces. A projection to expression modes helps to highlight particular biological functions, to reduce noise, and to compress the data in a biologically sensible way.
Availability: The FastICA algorithm (Hyvärinen, IEEE Trans. Neural Netw. , 10, 626634, 1999) is freely available at http://www.cis.hut.fi/projects/ica/fastica/. Additional matlab scripts and detailed results can be downloaded from http://www.molgen.mpg.de/research/lehrach/projects/genica/
Contact: wolfram.liebermeister{at}rz.hu-berlin.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Wienkoop, K. Morgenthal, F. Wolschin, M. Scholz, J. Selbig, and W. Weckwerth Integration of Metabolomic and Proteomic Phenotypes: Analysis of Data Covariance Dissects Starch and RFO Metabolism from Low and High Temperature Compensation Response in Arabidopsis Thaliana Mol. Cell. Proteomics, September 1, 2008; 7(9): 1725 - 1736. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li and M. Zhan Unraveling transcriptional regulatory programs by integrative analysis of microarray and transcription factor binding data Bioinformatics, September 1, 2008; 24(17): 1874 - 1880. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Schachtner, D. Lutter, P. Knollmuller, A. M. Tome, F. J. Theis, G. Schmitz, M. Stetter, P. G. Vilda, and E. W. Lang Knowledge-based gene expression classification via matrix factorization Bioinformatics, August 1, 2008; 24(15): 1688 - 1697. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Chang, Z. Ding, Y. S. Hung, and P. C. W. Fung Fast network component analysis (FastNCA) for gene regulatory network reconstruction from microarray data Bioinformatics, June 1, 2008; 24(11): 1349 - 1358. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Mantini, F. Petrucci, P. Del Boccio, D. Pieragostino, M. Di Nicola, A. Lugaresi, G. Federici, P. Sacchetta, C. Di Ilio, and A. Urbani Independent component analysis for the extraction of reliable protein signal profiles from MALDI-TOF mass spectra Bioinformatics, January 1, 2008; 24(1): 63 - 70. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hurtado, J. J. Lozano, E. Castellanos, L. A Lopez-Fernandez, K. Harshman, C. Martinez-A, A. R Ortiz, T. M Thomson, and R. Paciucci Activation of the epidermal growth factor signalling pathway by tissue plasminogen activator in pancreas cancer cells Gut, September 1, 2007; 56(9): 1266 - 1274. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Naderi, A. E. Teschendorff, J. Beigel, M. Cariati, I. O. Ellis, J. D. Brenton, and C. Caldas BEX2 Is Overexpressed in a Subset of Primary Breast Cancers and Mediates Nerve Growth Factor/Nuclear Factor-{kappa}B Inhibition of Apoptosis in Breast Cancer Cell Lines Cancer Res., July 15, 2007; 67(14): 6725 - 6736. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. P. Brynildsen, T.-Y. Wu, S.-S. Jang, and J. C. Liao Biological network mapping and source signal deduction Bioinformatics, July 15, 2007; 23(14): 1783 - 1791. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, Y. Sun, and M. Zhan The discovery of transcriptional modules by a two-stage matrix decomposition approach Bioinformatics, February 15, 2007; 23(4): 473 - 479. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. W. Schreiber and U. Baumann A framework for gene expression analysis Bioinformatics, January 15, 2007; 23(2): 191 - 197. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. P. Brynildsen, L. M. Tran, and J. C. Liao Versatility and Connectivity Efficiency of Bipartite Transcription Networks Biophys. J., October 15, 2006; 91(8): 2749 - 2759. [Abstract] [Full Text] [PDF] |
||||
![]() |
D.-S. Huang and C.-H. Zheng Independent component analysis-based penalized discriminant method for tumor classification using gene expression data Bioinformatics, August 1, 2006; 22(15): 1855 - 1862. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Galbraith, L. M. Tran, and J. C. Liao Transcriptome network component analysis with limited microarray data Bioinformatics, August 1, 2006; 22(15): 1886 - 1894. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Amato, A. Ciaramella, N. Deniskina, C. D. Mondo, D. di Bernardo, C. Donalek, G. Longo, G. Mangano, G. Miele, G. Raiconi, et al. A multi-step approach to time series analysis and gene expression clustering Bioinformatics, March 1, 2006; 22(5): 589 - 596. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Leng and H.-G. Muller Classification using functional data analysis for temporal gene expression data Bioinformatics, January 1, 2006; 22(1): 68 - 76. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Teschendorff, Y. Wang, N. L. Barbosa-Morais, J. D. Brenton, and C. Caldas A variational Bayesian mixture modelling framework for cluster analysis of gene-expression data Bioinformatics, July 1, 2005; 21(13): 3025 - 3033. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. N. Vemuri and A. A. Aristidou Metabolic Engineering in the -omics Era: Elucidating and Modulating Regulatory Networks Microbiol. Mol. Biol. Rev., June 1, 2005; 69(2): 197 - 216. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Tsafrir, I. Tsafrir, L. Ein-Dor, O. Zuk, D.A. Notterman, and E. Domany Sorting points into neighborhoods (SPIN): data analysis and visualization by ordering distance matrices Bioinformatics, May 15, 2005; 21(10): 2301 - 2308. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Goutsias and S. Kim A Nonlinear Discrete Dynamical Model for Transcriptional Regulation: Construction and Properties Biophys. J., April 1, 2004; 86(4): 1922 - 1945. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. C. Kao, Y.-L. Yang, R. Boscolo, C. Sabatti, V. Roychowdhury, and J. C. Liao Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis PNAS, January 13, 2004; 101(2): 641 - 646. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Liao, R. Boscolo, Y.-L. Yang, L. M. Tran, C. Sabatti, and V. P. Roychowdhury Network component analysis: Reconstruction of regulatory signals in biological systems PNAS, December 23, 2003; 100(26): 15522 - 15527. [Abstract] [Full Text] [PDF] |
||||






