Bioinformatics Vol. 19 no. 15 2003
pages 1917-1926
© 2003 Oxford University Press
Prediction of regulatory networks: genome-wide identification of transcription factor targets from gene expression data
1 Department of Ophthalmology, Johns Hopkins Medical School, Baltimore, MD 21287, USA and 2 Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Received on May 1, 2003
; revised on July 15, 2003
; accepted on July 15, 2003
Motivation: Defining regulatory networks, linking transcription factors (TFs) to their targets, is a central problem in post-genomic biology. One might imagine one could readily determine these networks through inspection of gene expression data. However, the relationship between the expression timecourse of a transcription factor and its target is not obvious (e.g. simple correlation over the timecourse), and current analysis methods, such as hierarchical clustering, have not been very successful in deciphering them.
Results: Here we introduce an approach based on support vector machines (SVMs) to predict the targets of a transcription factor by identifying subtle relationships between their expression profiles. In particular, we used SVMs to predict the regulatory targets for 36 transcription factors in the Saccharomyces cerevisiae genome based on the microarray expression data from many different physiological conditions. We trained and tested our SVM on a data set constructed to include a significant number of both positive and negative examples, directly addressing data imbalance issues. This was non-trivial given that most of the known experimental information is only for positives. Overall, we found that 63% of our TFtarget relationships were confirmed through cross-validation. We further assessed the performance of our regulatory network identifications by comparing them with the results from two recent genome-wide ChIP-chip experiments. Overall, we find the agreement between our results and these experiments is comparable to the agreement (albeit low) between the two experiments. We find that this network has a delocalized structure with respect to chromosomal positioning, with a given transcription factor having targets spread fairly uniformly across the genome.
Availability: The overall network of the relationships is available on the web at http://bioinfo.mbb.yale.edu/expression/echipchip
Contact: Mark.Gerstein{at}yale.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. W. Shin, R. Huggenberger, and M. Detmar Transcriptional profiling of VEGF-A and VEGF-C target genes in lymphatic endothelium reveals endothelial-specific molecule-1 as a novel mediator of lymphangiogenesis Blood, September 15, 2008; 112(6): 2318 - 2326. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Stender, J. Frasor, B. Komm, K. C. N. Chang, W. L. Kraus, and B. S. Katzenellenbogen Estrogen-Regulated Gene Networks in Human Breast Cancer Cells: Involvement of E2F1 in the Regulation of Cell Proliferation Mol. Endocrinol., September 1, 2007; 21(9): 2112 - 2123. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. P. Brynildsen, T.-Y. Wu, S.-S. Jang, and J. C. Liao Biological network mapping and source signal deduction Bioinformatics, July 15, 2007; 23(14): 1783 - 1791. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Goh, S. K. Murphy, S. Muhkerjee, and T. S. Furey Genomic sweeping for hypermethylated genes Bioinformatics, February 1, 2007; 23(3): 281 - 288. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Yu, J. Lin, D. J. Zack, and J. Qian Computational analysis of tissue-specific combinatorial gene regulation: predicting interaction between transcription factors in human tissues Nucleic Acids Res., October 18, 2006; 34(17): 4925 - 4936. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhao, L. Fang, N. Chen, R. C. Johnsen, L. Stein, and D. L. Baillie Distinct Regulatory Elements Mediate Similar Expression Patterns in the Excretory Cell of Caenorhabditis elegans J. Biol. Chem., November 18, 2005; 280(46): 38787 - 38794. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Yu and K.-C. Li Inference of transcriptional regulatory network by two-stage constrained space factor analysis Bioinformatics, November 1, 2005; 21(21): 4033 - 4038. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Yu, W. Sun, S. Yuan, and K.-C. Li Study of coordinative gene expression at the biological process level Bioinformatics, September 15, 2005; 21(18): 3651 - 3657. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Gonzalez, V. Espinosa, A. T. Vasconcelos, E. Perez-Rueda, and J. Collado-Vides TRACTOR_DB: a database of regulatory networks in gamma-proteobacterial genomes Nucleic Acids Res., January 1, 2005; 33(suppl_1): D98 - D102. [Abstract] [Full Text] [PDF] |
||||




