Skip Navigation

Bioinformatics 2007 23(13):i577-i586; doi:10.1093/bioinformatics/btm227
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Yan, X.
Right arrow Articles by Zhou, X. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Yan, X.
Right arrow Articles by Zhou, X. J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

A graph-based approach to systematically reconstruct human transcriptional regulatory modules

Xifeng Yan 1, Michael R. Mehan 2,{dagger}, Yu Huang 2, Michael S. Waterman 2, Philip S. Yu 1 and Xianghong Jasmine Zhou 2,*

1IBM T. J. Watson Research Center, Hawthorne NY and 2Program in Molecular and Computational Biology, University of Southern California, Los Angeles CA, USA

*To whom correspondence should be addressed.


   Abstract

Motivation: A major challenge in studying gene regulation is to systematically reconstruct transcription regulatory modules, which are defined as sets of genes that are regulated by a common set of transcription factors. A commonly used approach for transcription module reconstruction is to derive coexpression clusters from a microarray dataset. However, such results often contain false positives because genes from many transcription modules may be simultaneously perturbed upon a given type of conditions. In this study, we propose and validate that genes, which form a coexpression cluster in multiple microarray datasets across diverse conditions, are more likely to form a transcription module. However, identifying genes coexpressed in a subset of many microarray datasets is not a trivial computational problem.

Results: We propose a graph-based data-mining approach to efficiently and systematically identify frequent coexpression clusters. Given m microarray datasets, we model each microarray dataset as a coexpression graph, and search for vertex sets which are frequently densely connected across {lceil} {theta} m {rciel} datasets (0 ≤ {theta} ≤ 1). For this novel graph-mining problem, we designed two techniques to narrow down the search space: (1) partition the input graphs into (overlapping) groups sharing common properties; (2) summarize the vertex neighbor information from the partitioned datasets onto the ‘Neighbor Association Summary Graph's for effective mining. We applied our method to 105 human microarray datasets, and identified a large number of potential transcription modules, activated under different subsets of conditions. Validation by ChIP-chip data demonstrated that the likelihood of a coexpression cluster being a transcription module increases significantly with its recurrence. Our method opens a new way to exploit the vast amount of existing microarray data accumulation for gene regulation study. Furthermore, the algorithm is applicable to other biological networks for approximate network module mining.

Availability: http://zhoulab.usc.edu/NeMo/

Contact: xjzhou{at}usc.edu

{dagger}The authors wish it to be known that, in this opinion, the first two authors should be regarded as joint first authors.



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
J. D. Wren
A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide
Bioinformatics, July 1, 2009; 25(13): 1694 - 1701.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. Georgii, S. Dietmann, T. Uno, P. Pagel, and K. Tsuda
Enumeration of condition-dependent dense modules in protein interaction networks
Bioinformatics, April 1, 2009; 25(7): 933 - 940.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Rajaram
A novel meta-analysis method exploiting consistency of high-throughput experiments
Bioinformatics, March 1, 2009; 25(5): 636 - 642.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.