Bioinformatics Vol. 17 no. 90001 2001
Pages S243-S252
© 2001 Oxford University Press
Rich probabilistic models for gene expression
1 Computer Science Department, Stanford
University, Stanford, 94305, USA
2 Department of Genome Sciences, Lawrence
Berkeley National Labs, Berkeley, CA, 94702, USA
3 School of Computer Science &
Engineering, Hebrew University, Jerusalem, 94904, Israel
Received on February 6, 2001
; revised on April 2, 2001
; accepted on April 2, 2001
Clustering is commonly used for analyzing gene expression data. Despite their successes, clustering methods suffer from a number of limitations. First, these methods reveal similarities that exist over all of the measurements, while obscuring relationships that exist over only a subset of the data. Second, clustering methods cannot readily incorporate additional types of information, such as clinical data or known attributes of genes. To circumvent these shortcomings, we propose the use of a single coherent probabilistic model, that encompasses much of the rich structure in the genomic expression data, while incorporating additional information such as experiment type, putative binding sites, or functional information. We show how this model can be learned from the data, allowing us to discover patterns in the data and dependencies between the gene expression patterns and additional attributes. The learned model reveals context-specific relationships, that exist only over a subset of the experiments in the dataset. We demonstrate the power of our approach on synthetic data and on two real-world gene expression data sets for yeast. For example, we demonstrate a novel functionality that falls naturally out of our framework: predicting the "cluster" of the array resulting from a gene mutation based only on the genes expression pattern in the context of other mutations.
Contact: eran{at}cs.stanford.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. Gupta, P. Qu, and J. G. Ibrahim A temporal hidden Markov regression model for the analysis of gene regulatory networks Biostat., October 1, 2007; 8(4): 805 - 820. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-H. Pan, C.-J. Lih, and S. N. Cohen Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays PNAS, June 21, 2005; 102(25): 8961 - 8965. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Friedman Inferring Cellular Networks Using Probabilistic Graphical Models Science, February 6, 2004; 303(5659): 799 - 805. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. G. Troyanskaya, K. Dolinski, A. B. Owen, R. B. Altman, and D. Botstein A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae) PNAS, July 8, 2003; 100(14): 8348 - 8353. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Sachs, D. Gifford, T. Jaakkola, P. Sorger, and D. A. Lauffenburger Bayesian Network Approach to Cell Signaling Pathway Modeling Sci. Signal., September 3, 2002; 2002(148): pe38 - pe38. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kaminski and N. Friedman Practical Approaches to Analyzing Results of Microarray Experiments Am. J. Respir. Cell Mol. Biol., August 1, 2002; 27(2): 125 - 132. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-H. Pan, C.-J. Lih, and S. N. Cohen Analysis of DNA microarrays using algorithms that employ rule-based expert knowledge PNAS, February 19, 2002; 99(4): 2118 - 2123. [Abstract] [Full Text] [PDF] |
||||




