Skip Navigation


Bioinformatics Advance Access originally published online on September 6, 2008
Bioinformatics 2008 24(21):2544-2545; doi:10.1093/bioinformatics/btn467
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
24/21/2544    most recent
btn467v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Bauer, D. C.
Right arrow Articles by Bailey, T. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bauer, D. C.
Right arrow Articles by Bailey, T. L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

STREAM: Static Thermodynamic REgulAtory Model of transcription

Denis C. Bauer * and Timothy L. Bailey *

Institute for Molecular Bioscience, University of Queensland, Brisbane, Qld. 4072, Australia

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 APPROACH AND USAGE
 3 METHODS
 4 FUTURE DEVELOPMENTS
 ACKNOWLEDGEMENTS
 REFERENCES
 

Motivation: Understanding the transcriptional regulation of a gene in detail is a crucial step towards uncovering and ultimately utilizing the regulatory grammar of the genome. Modeling transcriptional regulation using thermodynamic equations has become an increasingly important approach towards this goal.

Here, we present STREAM, the first publicly available framework for modeling, visualizing and predicting the regulation of the transcription rate of a target gene. Given the concentrations of a set of transcription factors (TFs), the TF binding sites (TFBSs) in a regulatory DNA region, and the transcription rate of the target gene, STREAM will optimize its parameters to generate a model that best fits the input data. This trained model can then be used to (a) validate that the given set of TFs is able to regulate the target gene and (b) to predict the transcription rate under different conditions (e.g. different tissues, knockout/additional TFs or mutated/missing TFBSs).

Availability: The platform independent executable of STREAM, as well as a tutorial and the full documentation, are available at http://bioinformatics.org.au/stream/. STREAM requires Java version 5 or higher.

Contact: d.bauer{at}imb.uq.edu.au; t.bailey{at}imb.uq.edu.au


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 APPROACH AND USAGE
 3 METHODS
 4 FUTURE DEVELOPMENTS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Transcription of a gene can be induced by the binding of specific transcription factors (TFs) to so-called cis-regulatory modules (CRMs). The frequency and duration of the binding events are influenced by the concentrations of the TFs, the binding affinities and location of the TF binding sites (TFBSs) in the CRM and the properties of the TFs themselves (e.g. effectiveness, competitive interaction with other TFs). With the availability of an increasing number of detailed measurements of gene concentrations in different situations (e.g. tissues, developmental time points) as well as TF-DNA binding affinities, it has become possible to build mathematical models for transcriptional regulation. Building mathematical models to associate a specific occupation of a specific CRM with an observed transcriptional response promotes a better understanding of the transcriptional regulation and enables in silico hypothesis testing about postulated regulatory TFs or mechanisms.

An increasingly successful approach to mathematically simulating transcriptional regulation is using thermodynamic models that model the interaction of TFs and DNA using kinetic equations. Several such thermodynamic models have been proposed in the last years (Janssens et al., 2006; Segal et al., 2008; Zinzen et al., 2006). These models take the CRM sequence, a set of TFs along with their concentrations and predict the transcriptional response of the target gene as mediated by the CRM and the TFs. A training algorithm is used to optimize the model's internal parameters to minimize the difference between the observed and predicted transcriptional response.


    2 APPROACH AND USAGE
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 APPROACH AND USAGE
 3 METHODS
 4 FUTURE DEVELOPMENTS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Here we present STREAM, a Java-implemented framework to calculate and visualize transcriptional regulation using thermodynamic modeling approaches. STREAM currently uses the thermodynamic model introduced by Reinitz et al. (2003), but the framework is flexible and can be used in conjunction with other models implemented in Java. STREAM offers several optimization methods including gradient descent and simulated annealing for adjusting the internal parameters of the model to best fit the user's input data. To the best of our knowledge, STREAM is to data the only publicly available framework for modeling the regulation of the rate of transcription. STREAM has been tested extensively on the even-skipped gene (eve) in Drosophila melanogaster (Bauer and Bailey, 2008).

STREAM can be executed using a graphical user interface (GUI) as well as via the command line. The GUI is illustrated in Fig. 1. It offers the same functionality (e.g. multistart options of the optimization or automatic cross-validation evaluation) as the command-line tool, but in an intuitive and dialogue-based fashion. Both the command-line and the GUI version can save the current result and settings of the program into a file, which makes saving and modifying previous experiments simple.


Figure 1
View larger version (48K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. Screen shot of the GUI of STREAM. The interface is exemplified on a model trained for the even-skipped gene (eve).

 

    3 METHODS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 APPROACH AND USAGE
 3 METHODS
 4 FUTURE DEVELOPMENTS
 ACKNOWLEDGEMENTS
 REFERENCES
 
In order to generate a model for a particular target gene, the user must identify a set of putative TFs and a putative regulatory region. The suspected role of each TF, x, as an activator xisinA or repressor xisinB must be specified by the user. The program also requires measurements of the concentrations and binding preferences of those TFs, as well as the transcriptional output of the target gene. In the following section, we introduce the required input data, D, which contains the concentration and rate data, C, and a TFBS map, S.

The concentration and rate data, C, comprises a set of independent data points. Each data point is a pair (V, vt), with V=(a1],...,[an]), a vector listing the protein concentrations of the putative regulatory TF proteins (a1,..., an), and vt giving the corresponding observed transcription rate of the target gene.

The concentration and rate data, C, can be measured by various methods. To measure the TF concentrations, in situ antibody staining can be used to measure V. To obtain the corresponding vt, one can proceed indirectly by measuring the mRNA levels of a reporter gene by staining against the mRNA of the reporter (Jaeger et al., 2004).

For visualization purposes, STREAM allows the user to label the data points with up to two ‘features’ (e.g. the ‘condition’ under which the measurements were made). Features may be ‘continuous’ (real numbers as in the example above), or ‘categorical’ such as cell types, tissue types or experimental treatments. STREAM utilizes the values of the features for visualization only, and the user may interactively select either feature as the X-axis for plots of the data.

Besides the concentration data, the input data also contains the TFBS map, S, corresponding to the regulatory region of interest. The TFBS map, S=(s1, s2,...,sn), is a list, where each si represents a TFBS as a triple (a, l, s) giving the name, a, of the binding TF, the position, l, of the binding site and the log-odds score (natural logarithm), s, of the site. The log-odds score is proportional to the binding strength of the TFBS (Stormo, 2000). S can be constructed from experimentally verified binding sites, in silico predicted sites using a prediction algorithm such as FIMO (http://meme.sdsc.edu), or a combination of both. For more detailed information, see Bauer and Bailey (2008).

The objective of the optimization is to determine the set of model parameters, {Theta}, that optimally explains the input data, D. {Theta} depends on the thermodynamic model. Currently, STREAM implements the model introduced by Reinitz et al.(2003). This model uses free parameters {Theta}=({theta}0, R0, W), where {theta}0 is an energy barrier, R0 is the maximal transcription rate, and W contains a tuple, (Kx, Ex), for each TF, x, where Kx is the association constant of the TF to the DNA and Ex is the effectiveness of the TF to activate transcription, if xisinA, or to repress, if xisinB. For more details see Reinitz et al. (2003) and Bauer and Bailey (2008).

Four different optimization methods are implemented: simulated annealing (SA), gradient descent (GD), genetic algorithm (GA) and limited-memory quasi-Newton unconstrained optimization (LBFGS) (for a comparison of the optimization methods see D.C.Bauer and T.L.Bailey, manuscript in preparation). All optimization methods seek to optimize the free parameters of the Reinitz model by minimizing the root mean-squared (RMS) error between the known transcription rate and the rate predicted by the Reinitz model, averaged over all input points, D (Bauer and Bailey, 2008).


    4 FUTURE DEVELOPMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 APPROACH AND USAGE
 3 METHODS
 4 FUTURE DEVELOPMENTS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Besides the Reinitz model, we plan to provide additional models with enhanced functionality to simulate interacting TFs in a more detailed way, e.g., by incorporating TF-TF cooperation. Furthermore, simplifying models by using discreet TFBSs might introduce artifacts, hence, changing the model approach to use continuous binding gradients deems beneficial. Future research on other CRMs will guide the development of new models and our framework can provide the environment to directly compare them.

In addition, we plan to extend the approach to optimize one model to fit more than one CRM. Being able to fit one model to the data of several CRMs, which are suspected to have the same regulatory TFs, increases the confidence in the produced model.

Finally, we plan to improve the functionality of the GUI in manipulating input data, e.g., by an interactive interface to vary the properties of the TFBS map and to directly observe the changes in the predicted transcription rate.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 APPROACH AND USAGE
 3 METHODS
 4 FUTURE DEVELOPMENTS
 ACKNOWLEDGEMENTS
 REFERENCES
 
The authors wish to acknowledge John Reinitz, Stefan Maetschke and Fabian Buske for their input on conceptual and implementation details for STREAM as well as Lachlan Dufton for providing the initial implementation of SA.

Funding: Australian Research Council Centre of Excellence in Bioinformatics (to T.L.B.); National Institutes of Health (RO-1 RR021692-01 to T.L.B.), UQ International Research Tuition Award (D.C.B.).

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Joaquin Dopazo

Received on June 27, 2008; revised on August 11, 2008; accepted on August 28, 2008

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 APPROACH AND USAGE
 3 METHODS
 4 FUTURE DEVELOPMENTS
 ACKNOWLEDGEMENTS
 REFERENCES
 

    Bauer DC, Bailey TL. Studying the functional conservation of cis-regulatory modules and their transcriptional output. BMC Bioinformatics (2008) 9:220.[CrossRef][Medline]

    Jaeger J, et al. Dynamical analysis of regulatory interactions in the gap gene system of Drosophila melanogaster. Genetics (2004) 167:1721–1737.[Abstract/Free Full Text]

    Janssens H, et al. Quantitative and predictive model of transcriptional control of the Drosophila melanogaster even skipped gene. Nat. Genet (2006) 38:1159–1165.[CrossRef][Web of Science][Medline]

    Reinitz J, et al. Transcriptional control in Drosophila. Complexus (2003) 1:54–64.[CrossRef]

    Segal E, et al. Predicting expression patterns from regulatory sequence in Drosophila segmentation. Nature (2008) doi:10.1038/nature06496.

    Stormo GD. DNA binding sites: representation and discovery. Bioinformatics (2000) 16:16–23.[Abstract/Free Full Text]

    Zinzen RP, et al. Computational models for neurogenic gene expression in the Drosophila embryo. Curr. Biol (2006) 16:1358–1365.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
D. C. Bauer and T. L. Bailey
Optimizing static thermodynamic models of transcriptional regulation
Bioinformatics, July 1, 2009; 25(13): 1640 - 1646.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
24/21/2544    most recent
btn467v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Bauer, D. C.
Right arrow Articles by Bailey, T. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bauer, D. C.
Right arrow Articles by Bailey, T. L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?