Skip Navigation


Bioinformatics Advance Access originally published online on March 25, 2004
This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow All Versions of this Article:
20/13/1993    most recent
bth166v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (53)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Ihmels, J.
Right arrow Articles by Barkai, N.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ihmels, J.
Right arrow Articles by Barkai, N.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics 20(13) © Oxford University Press 2004; all rights reserved.

Defining transcription modules using large-scale gene expression data

Jan Ihmels 1,2, Sven Bergmann 1,2 and Naama Barkai 1,2,*

1 Department of Molecular Genetics and 2 Department of Physics of Complex Systems Weizman Institute of Science, 76100 Rehovot, Israel

Received on September 21, 2003; revised on January 18, 2004; accepted on February 5, 2004
Advance Access Publication March 25, 2004

Motivation: Large-scale gene expression data comprising a variety of cellular conditions hold the promise of a global view on the transcription program. While conventional clustering algorithms have been successfully applied to smaller datasets, the utility of many algorithms for the analysis of large-scale data is limited by their inability to capture combinatorial and condition-specific co-regulation. In addition, there is an increasing need to integrate the rapidly accumulating body of other high-throughput biological data with the expression analysis. In a previous work, we introduced the signature algorithm, which overcomes the problems of conventional clustering and allows for intuitive integration of additional biological data. However, this approach is constrained by the comprehensiveness of relevant external data and its lacking ability to capture hierarchical modularity.

Methods: We present a novel method for the analysis of large-scale expression data, which assigns genes into context-dependent and potentially overlapping regulatory units. We introduce the notion of a transcription module as a self-consistent regulatory unit consisting of a set of co-regulated genes as well as the experimental conditions that induce their co-regulation. Self-consistency is defined by a rigorous mathematical criterion. We propose an efficient algorithm to identify such modules, which is based on the iterative application of the signature algorithm. A threshold parameter that determines the resolution of the modular decomposition is introduced.

Results: The method is applied systematically to over 1000 expression profiles of the yeast Saccharomyces cerevisiae, and the results are presented using two complementary visualization schemes we developed. The average biological coherence, as measured by the conservation of putative cis-regulatory motifs between four related yeast species, is higher for transcription modules than for clusters identified by other methods applied to the same dataset. Our method is related to singular value decomposition (SVD) and to the pairwise average linkage clustering algorithm. It extends SVD by filtering out noise in the expression data and offering variable resolution to reveal hierarchical organization. It furthermore has the advantage over both methods of capturing overlapping modules in the presence of combinatorial regulation.

Supplementary information: http://www.weizmann.ac.il/~barkai/modules

Contact: naama.barkai{at}weizmann.ac.il

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
A. Bhattacharya and R. K. De
Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression profiles
Bioinformatics, June 1, 2008; 24(11): 1359 - 1366.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Dhollander, Q. Sheng, K. Lemmens, B. De Moor, K. Marchal, and Y. Moreau
Query-driven module discovery in microarray data
Bioinformatics, October 1, 2007; 23(19): 2573 - 2580.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
C. Tian, T. Kasuga, M. S. Sachs, and N. L. Glass
Transcriptional Profiling of Cross Pathway Control in Neurospora crassa and Comparative Analysis of the Gcn4 and CPC1 Regulons
Eukaryot. Cell, June 1, 2007; 6(6): 1018 - 1029.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
S. R. Collins, P. Kemmeren, X.-C. Zhao, J. F. Greenblatt, F. Spencer, F. C. P. Holstege, J. S. Weissman, and N. J. Krogan
Toward a Comprehensive Atlas of the Physical Interactome of Saccharomyces cerevisiae
Mol. Cell. Proteomics, March 1, 2007; 6(3): 439 - 450.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Hollunder, M. Friedel, A. Beyer, C. T. Workman, and T. Wilhelm
DASS: efficient discovery and p-value calculation of substructures in unordered data
Bioinformatics, January 1, 2007; 23(1): 77 - 83.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Liu and L. Wang
Computing the maximum similarity bi-clusters of gene expression data
Bioinformatics, January 1, 2007; 23(1): 50 - 56.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
H. Wei, S. Persson, T. Mehta, V. Srinivasasainagendra, L. Chen, G. P. Page, C. Somerville, and A. Loraine
Transcriptional Coordination of the Metabolic Network in Arabidopsis
Plant Physiology, October 1, 2006; 142(2): 762 - 774.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
K. Yu, K. Ganesan, L. D. Miller, and P. Tan
A modular analysis of breast cancer reveals a novel low-grade molecular signature in estrogen receptor-positive tumors.
Clin. Cancer Res., June 1, 2006; 12(11): 3288 - 3296.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Barkow, S. Bleuler, A. Prelic, P. Zimmermann, and E. Zitzler
BicAT: a biclustering analysis toolbox
Bioinformatics, May 15, 2006; 22(10): 1282 - 1283.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Blanchette, A. R. Bataille, X. Chen, C. Poitras, J. Laganiere, C. Lefebvre, G. Deblois, V. Giguere, V. Ferretti, D. Bergeron, et al.
Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression
Genome Res., May 1, 2006; 16(5): 656 - 668.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Prelic, S. Bleuler, P. Zimmermann, A. Wille, P. Buhlmann, W. Gruissem, L. Hennig, L. Thiele, and E. Zitzler
A systematic comparison and evaluation of biclustering methods for gene expression data
Bioinformatics, May 1, 2006; 22(9): 1122 - 1129.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Yu and K.-C. Li
Inference of transcriptional regulatory network by two-stage constrained space factor analysis
Bioinformatics, November 1, 2005; 21(21): 4033 - 4038.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. A. Petti and G. M. Church
A network of transcriptionally coordinated functional modules in Saccharomyces cerevisiae
Genome Res., September 1, 2005; 15(9): 1298 - 1306.
[Abstract] [Full Text] [PDF]


Home page
Genes Dev.Home page
A. Blais and B. D. Dynlacht
Constructing transcriptional regulatory networks
Genes & Dev., July 1, 2005; 19(13): 1499 - 1511.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
I. Borodina, P. Krabben, and J. Nielsen
Genome-scale analysis of Streptomyces coelicolor A3(2) metabolism
Genome Res., June 1, 2005; 15(6): 820 - 829.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
W. Wang, J. M. Cherry, Y. Nochomovitz, E. Jolly, D. Botstein, and H. Li
Inference of combinatorial regulation in yeast transcriptional networks: A case study of sporulation
PNAS, February 8, 2005; 102(6): 1998 - 2003.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.