Bioinformatics Advance Access originally published online on September 27, 2005
Bioinformatics 2005 21(22):4176-4180; doi:10.1093/bioinformatics/bti674
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Functional stoichiometric analysis of metabolic networks
Institute of Pharmacology, University of Bern Friedbühlstrasse 49, CH-3010 Bern, Switzerland
*To whom correspondence should be addressed.
| ABSTRACT |
|---|
|
|
|---|
Motivation: An important tool in Systems Biology is the stoichiometric modeling of metabolic networks, where the stationary states of the network are described by a high-dimensional polyhedral cone, the so-called flux cone. Exhaustive descriptions of the metabolism can be obtained by computing the elementary vectors of this cone but, owing to a combinatorial explosion of the number of elementary vectors, this approach becomes computationally intractable for genome scale networks.
Result: Hence, we propose to instead focus on the conversion cone, a projection of the flux cone, which describes the interaction of the metabolism with its external chemical environment. We present a direct method for calculating the elementary vectors of this cone and, by studying the metabolism of Saccharomyces cerevisiae, we demonstrate that such an analysis is computationally feasible even for genome scale networks.
Contact: robert.urbanczik{at}pki.unibe.ch
| 1 INTRODUCTION |
|---|
|
|
|---|
Thanks to the abundance of genome data, it has become possible to reconstruct chemical reaction networks that comprehensively model the metabolism of microorganisms such as Escherichia coli and Saccharomyces cerevisiae (Reed et al., 2003; Duarte et al., 2004). In the stoichiometric network analysis of such models, the possible flows through the network are constrained by the requirement that in the long term there be no net accumulation or depletion of any chemical compound that cannot cross the cell boundary. Since typically many of the reactions are irreversible, the space of possible flows through the network is further constrained, and in mathematical terms corresponds to a high-dimensional polyhedral cone, which is often called the flux cone.
In Flux Balance Analysis (Ibarra et al., 2002) predictions about the response of the microorganism to changing environmental conditions or even to the deletion of certain genes are obtained by using linear programming to study this cone. But not all relevant questions readily fit into the framework of linear or even convex programming. A case in point would be the enumeration of all minimal media able to sustain the microorganism. Hence a second approach to studying chemical reaction networks has been to obtain an exhaustive description of the flux cone by calculating its elementary vectors (Stelling et al., 2002; Schuster et al., 2002a), also called elementary flux modes. Based on a complete set of these, many conclusions about the metabolism can easily be drawn since each elementary vector represents a unique, stoichiometrically viable and non-redundant pathway through the entire network. However, the price to pay for this exhaustive description by elementary fluxes is the combinatorial explosion of their number (Klamt and Stelling, 2002) and the approach has therefore been limited to networks that are an order of magnitude smaller than, for example, the full genome-scale network of S.cerevisiae.
To achieve genome-scale, we shall adopt a functional perspective and focus on describing the effect of the microorganism on its external chemical environment, for instance, the consumption of energy sources such as glucose and the production of waste such as CO2. In disregarding the internal mechanism, our concept of conversions between external compounds, which can be effected by the metabolism, is similar to the overall reactions studied by Happel and Sellers (1989). But, crucially, we do not assume all reactions to be reversible. Hence the set of possible conversions in general is not a vector space but a polyhedral cone. While this conversion cone is a projection of the flux cone, we shall show how the elementary vectors of the conversion cone can be found, without first having to compute the flux cone. Using this algorithm, elementary conversion analysis becomes computationally tractable for networks far larger than the ones considered in the elementary flux analysis. This is demonstrated by studying a conversion cone associated with the above mentioned metabolic network of S.cerevisiae.
| 2 BASIC CONCEPTS |
|---|
|
|
|---|
In stoichiometric analysis (Clarke, 1980; Heinrich and Schuster, 1996; Price et al., 2004; Stucki, 2004) a metabolic network is modeled by an m by n stoichiometry matrix S, which relates the flows
through the n reactions to changes
in the concentrations c of the m metabolites by
= S
. Typically many of the reactions will be irreversible and this means that the flow
j through such a reaction cannot be negative. Further, one can often assume that the concentration levels of some (external) metabolites, such as CO2, are maintained by the environment or large enough that the changes caused by the reaction system become negligible. For many metabolites, however, this will not be the case and hence in a stationary state of the network
i = 0 most hold for such an internal metabolite. Thus the possible steady states of the metabolic model are required to satisfy
![]() | (1) |
denotes the subset of irreversible reactions and
the subset of internal metabolites. Sometimes one further wants to subclassify some of the external metabolites into inputs (
i
0) and outputs (
i
0). While such a distinction can easily be incorporated into the formal framework given below, for brevity, we shall not explicitly consider this here.
The above notation is a bit unusual since we do not incorporate exchange fluxes into the stoichiometry matrix S. Often the system is augmented by pseudo reactions (Ci
0 for
) for the external metabolites, and the stoichiometry matrix of the augmented system is denoted by N. Then a steady state flux vector J of the augmented system must satisfy N J = 0 and the irreversibility conditions. But, if one orders the columns in N appropriately, any pair (
,
) satisfying Equation (1) corresponds to the steady-state flux vector
, where
denotes the restriction of
to the external metabolites. Since condition III means that
, this correspondence is easily seen to be a bijection, and the two notations are entirely equivalent.
The set of all pairs (
,
) satisfying the three conditions in Equation (1) is a polyhedral cone, and we shall call it the flux cone
. Our notation highlights that the two components of the pair (
,
) provide quite different information on the stationary state of the network. Whereas
tells us how the network is operating in terms of the flows through its reactions,
describes what effect the networks operation has on the environment, in terms of the conversions taking place between the external metabolites. In order to separate the what from the how, we now introduce the conversion cone
as the set of all vectors
, which can be part of a stationary flux (
,
). Formally
![]() | (2) |
or simply an elementary conversion if e lies, for some s
{1,1}m, on an edge of the pointed cone
, where
is the orthant:
![]() | (3) |
each edge of the intersection cone
is represented by exactly one vector in this set.
|
The importance of elementary conversions stems from the following observation. Any vector
of the conversion cone will lie in at least one orthant
and can hence be decomposed into vectors e(l) lying on the edges of
as
![]() | (4) |
, in the matrix (e(1), e(2), ..., e(k)) each row is either semi-positive or semi-negative and no cancellation can occur during the addition in Equation (4). Of course, by definition, the e(l) are elementary. So we have established that any vector in
can be decomposed into elementary conversions without cancellation and this implies that many properties of the conversion cone can be found by simply inspecting the elementary vectors. For instance, a statement such as consumption of glucose must be accompanied by the production of ethanol and CO2 will hold for any stationary state if and only if it is true for all elementary conversions.
In passing we note that one can define elementary vectors of the flux cone
analogously by requiring that they lie on an edge of
, where
is an orthant. While at first glance this definition differs from the one used in Schuster et al. (2002a) and Rockafellar (1970), where elementary vectors are defined by having a maximal set of zeroes, in Supplementary Material A.4 (Lemma 2) we show that the two definitions are in fact completely equivalent for flux cones. Hence, any elementary conversion is the projection of at least one elementary flux.
| 3 CALCULATING ELEMENTARY CONVERSIONS |
|---|
|
|
|---|
We have recently presented the Nullspace algorithm (Urbanczik and Wagner, 2005), a double description method (Fukuda and Prodon, 1995) tailored to the calculation of elementary fluxes. The procedure first obtains a representation, in terms of linear inequalities, of a so-called coordinate cone isomorphic to the flux cone
. Then, based on these inequalities, a generating set of the coordinate cone is calculated, i.e. a set such that the entire coordinate cone can be obtained by taking all linear combinations of the vectors in this set, involving only non-negative scalar factors. In fact, the generating set found is minimal, so no proper subset generates the cone. Then, based on these generating vectors, the algorithm finds a complete set of elementary fluxes. Here, we shall use similar ideas to obtain all elementary conversions, without first having to calculate the elementary fluxes.
Our starting point is the cone
of all vectors satisfying the first two conditions in Equation (1),
![]() | (5) |
, i.e. the vectors in
that also satisfy the condition III in Equation (1), would be quite easy if we had a different representation of
. In particular, assume that we have an inequalities representation of
, that is, a set of vectors H
m such that
![]() | (6) |
![]() | (7) |
first a minimal generating set and then all elementary conversions can be calculated using minor modifications of the Nullspace algorithm described in Supplementary Material A.
We still need to specify how to find the set H representing
by inequalities. For this, we first note that using the definition in Equation (5) a, possibly redundant, generating set of
is easily obtained from the stoichiometry matrix. If there are no reversible reactions, the columns S(j) of S are already a generating set and for reversible reactions we just need to take the reverse direction into account as well. So, a generating set of
is
![]() |
, the vectors in G are an inequalities representation of a cone
, which is called the dual of
,
. Clearly, given G, we can use the Nullspace algorithm to obtain a minimal generating set of
and, surprisingly, this set is just the set H we are looking for. The reason is that, by standard results in Convex Analysis (Rockafellar, 1970; Fukuda, 2004, www.ifor.math.ethz.ch/fukuda/polyfaq/), the vectors of a generating set of the dual cone
are an inequalities representation of the primal cone
. In fact any algorithm computing a minimal generating set from a linear inequalities representation also solves the converse problem of obtaining a minimal linear inequalities representation, given a generating set. In summary, the basic procedure is to obtain H from G by running Nullspace (as detailed in Supplementary Material A), then to augment the set H by the vectors reflecting the additional constraints for the conversion cone (Equation 7) and, finally, to run Nullspace again on the augmented set to get the complete set of elementary conversions. This is illustrated in Figure 2 using a very simple example.
|
| 4 APPLICATIONS |
|---|
|
|
|---|
To compare the conversion cone with the flux cone we first study the stylized model of the central carbon metabolism of E.coli presented in Stelling et al. (2002). A version of this metabolic model consisting of 99 reactions, with 11 external compounds (5 input, 5 output and 1 input/output), was analyzed in Klamt and Stelling (2002) and it was found that an excess of 5 x 105 elementary fluxes are needed for a complete set. In contrast to this, we find that a complete set of elementary conversions has only 344 elements, highlighting the drastic reduction in descriptive complexity achieved by focusing on the conversion cone.
In fact an even simpler description is possible if one considers all external compounds as being inputs as well as outputs and only calculates a minimal generating set of the conversion cone. One finds that this set consists of only 27 vectors, summarizing the basic metabolic capabilities of the model.
We next consider the genome-scale metabolic network of S.cerevisiae published in Duarte et al. (2004). This fully compartmentalized model has 1149 reactions as well as an additional reaction reflecting the metabolic needs of the organism for sustaining growth. To track the flow through this growth reaction we added the formal external metabolite Biomass to its output.
A coupling to the environment was used, where 21 of the metabolites marked as extracellular in the model are external. Interestingly, the conversion cone obtained for this coupling is only 10 dimensional, although it is embedded in the 21 + 1 dimensional space given by the external metabolites. Further, the complete set for this conversion cone consists of 40 969 extremal conversions, 98% of which sustain growth. However, there are only 1313 extremal conversions in this set that do not consume any amino-acid but nevertheless show growth. These are analyzed in more detail in Figure 3.
|
Having obtained an overview of the metabolic capabilities of an organism, it will often be of interest to determine how a specific function is achieved. Luckily, it is conceptually simple to determine just the elementary fluxes that give rise to a single elementary conversion. To analyze an elementary conversion e in this manner, one will define the matrix S' obtained by augmenting the stoichiometry matrix S, with e as an irreversible column vector, as S' = (S, e). One can then compute the elementary fluxes of S', treating all compounds as internal. Among the elementary vectors
of the restricted flux cone thus obtained, a few may be futile cycles (
= 0) but all others give rise to the conversion e, since S
=
e.
To illustrate this, we analyzed the following conversion:
![]() |
| 5 DISCUSSION |
|---|
|
|
|---|
We have shown that the analysis of elementary conversion is possible for genome-scale metabolic systems in situations where it seems highly probable that the enumeration of all elementary fluxes will remain computationally intractable for the foreseeable future. Although it is difficult to make rigorous statements about the complexity of our procedure, in the examples we have considered, the computational requirements are quite benign. For instance, the calculation of the 40 969 elementary conversion for S.cerevisiae takes <3 h of computing time on a standard workstation despite of the fact that we do not rely on machine precision but use infinite precision arithmetic in a mixed Mathematica/Matlab/C implementation of the Nullspace procedure. In part, this relatively fast computation time results from employing transformations (detailed in Supplementary Material B) that simplify the metabolic network but leave the conversion cone invariant.
While the conversion cone is an interesting object in its own right, enabling the determination of minimal media or the study of carbon fates, it is also important that conversion analysis can be linked up with flux analysis. We have already mentioned the possibility of directly determining the elementary fluxes that give rise to a given elementary conversion. Incidentally, this technique can also be used to study the degeneracy of the optimization problems considered in the Flux Balance Analysis, and this seems computationally cheaper than the approach considered in Lee et al. (2000) and Reed and Palsson (2004), where the enumeration of the alternative optimal flux vectors is carried out by solving a sequence of NP-complete problems (mixed-integer linear programming).
A second way of linking conversion analysis with flux analysis is to use our technique to study more general projections of the flux cone. If one is interested only in the flows through a few reactions, one can modify the network by adding a new formal external metabolite as a tag for each reaction one wishes to track. For instance, to track the reaction A
B one could replace it by A
B + AtoBext. Then the flows through the reactions, which have been tagged in this way, can be read of from the stoichiometric factors of the formal metabolites appearing in the elementary conversions of the modified system. In a rudimentary form, we have already used this in our analysis of S.cerevisiae when employing the formal compound Biomass to track the growth reaction.
A third option for linking the two approaches, which we have yet to explore, arises in the analysis of subnetworks. Elementary flux analysis has variously been applied to larger networks by decomposing these into subnetworks and treating any metabolite linking two or more of the subnetworks as external (Schilling and Palsson, 2000; Schuster et al., 2002b; Dandekar et al., 2003). While regarding such linkage metabolites as external, formally makes it possible to consider each subnetwork in isolation, it has remained difficult to relate the elementary fluxes of the subnetworks to those of the entire network. An alternative is to focus on a single subnetwork and also treat the linkage metabolites as external but to use this to calculate the conversion cone of all of the reactions not belonging to the subnetwork. The minimal generating set of this cone enumerates the potential contributions of the rest of the network to the operation of the subnetwork. One can now augment the subnetwork by the conversions in this generating set. Then any elementary flux of the augmented subnetwork represent at least one elementary flux of the entire system. In this manner one arrives at valid descriptions of the operation of the entire network, which are detailed with respect to the reactions in the subnetwork but only sketch the operations of the other reactions.
Large metabolic networks show a wealth of possible modes of operation. While this is probably essential for the survival of the microorganism in changing environmental conditions, it makes it practically impossible to enumerate all the possibilities. Hence, in describing the metabolism it is useful to focus on specific aspects of its operation and abstract away details that are not of interest in a given context. In stoichiometric network analysis a formal equivalent of this is to focus on suitable projections of the flux cone. The results presented above show that in this manner it does indeed become possible to tackle genome-scale networks.
| Acknowledgments |
|---|
The authors thank B. Palsson and S. Schuster for providing us with machine readable forms of the metabolic network of S.cerevisiae and the central carbon metabolism of E.coli, respectively. It is a pleasure to acknowledge the fruitful discussions with J. Stucki. This work was supported by the Swiss National Science Foundation, Grant No. 3100A0-102269.
Conflict of Interest: none declared.
Received on May 18, 2005; revised on August 17, 2005; accepted on September 8, 2005
| REFERENCES |
|---|
|
|
|---|
Clarke, B. (1980) Stability of complex reaction networks. In Prigogine, I. and Rice, S. (Eds.). Advances in Chemical Physics, , New York Wiley Vol. 43, , pp. 1216.
Dandekar, T., et al. (2003) A method for classifying metabolites in topological pathway analyses based on minimization of pathway number. Biosystems, 70, 255270[CrossRef][Web of Science][Medline].
Duarte, N., Herrgard, M., Palsson, B. (2004) Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res., 14, 12981309
Frequently Asked Questions in Polyhedral Computation Fukuda, K. (2004) .
Fukuda, K. and Prodon, A. (1995) Double description method revisited. In Deza, M., Euler, R., Manoussakis, I. (Eds.). Combinatorics and Computer Science, , London, UK 1120, Lecture Notes in Computer Science Springer-Verlag, pp. 91111.
Happel, J. and Sellers, P. (1989) The characterization of complex systems of chemical reactions. Chem. Eng. Commun., 83, 221240.
Heinrich, R. and Schuster, S. The Regulation of Cellular Systems, (1996) , New York Chapman & Hall.
Ibarra, R., et al. (2002) Escherichia coli k-12 undergoes adaptive evolution to achieve in silico predicted optimal growth. Nature, 420, 186189[CrossRef][Medline].
Klamt, S. and Stelling, J. (2002) Combinatorial complexity of pathway analysis in metabolic networks. Mol. Biol. Rep., 29, 233236[CrossRef][Web of Science][Medline].
Lee, S., et al. (2000) Recursive MILP model for finding all the alternate optima in LP models for metabolic networks. Comput. Chem. Eng., 24, 711716[CrossRef].
Price, N., et al. (2004) Genome-scale models of microbial cells: evaluating the consequences of constraints. Nat. Rev. Microbiol., 2, 886897[CrossRef][Web of Science][Medline].
Reed, J. and Palsson, B. (2004) Genome-scale in silico models of E.coli have multiple equivalent phenotypic states: assessment of correlated reaction subsets that comprise network states. Genome Res., 14, 17971805
Reed, J., et al. (2003) An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR). Genome Biol., 4, R54.1R54.12.
Rockafellar, R. Convex Analysis, (1970) , Princeton Princeton University Press.
Schilling, C. and Palsson, B. (2000) Assessment of the metabolic capabilities of Haemophilus influenzae rd through a genome-scale pathway analysis. J. Theor. Biol., 203, 249283[CrossRef][Web of Science][Medline].
Schuster, S., et al. (2002a) Reaction routes in biochemichal reactions systems: algebraic properties, validated calculation procedure and example from nucleotide metabolism. J. Math. Biol., 45, 153181[CrossRef][Web of Science][Medline].
Schuster, S., et al. (2002b) Exploring the pathway structure of metabolism: decomposition into subnetworks and application to Mycoplasma pneumoniae. Bioinformatics, 18, 351261
Stelling, J., et al. (2002) Metabolic network structure determines key aspects of functionality and regulation. Nature, 420, 190193[CrossRef][Medline].
Stucki, J. (2004) Chromokinetics of metabolic pathways. Eur. J. Biochem., 271, 27452754[Web of Science][Medline].
Urbanczik, R. and Wagner, C. (2005) An improved algorithm for stoichiometric network analysis: theory and applications. Bioinformatics, 21, 12031210
This article has been cited by other articles:
![]() |
M. Terzer and J. Stelling Large-scale computation of elementary flux modes with bit pattern trees Bioinformatics, October 1, 2008; 24(19): 2229 - 2235. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Fenner, J. Gao, S. Kramer, L. Ellis, and L. Wackett Data-driven extraction of relative reasoning rules to limit combinatorial explosion in biodegradation pathway prediction Bioinformatics, September 15, 2008; 24(18): 2079 - 2085. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




. Intersecting
shows that e(1) and e(2) are elementary. Since the intersection of
, as well as with
, is empty, the only elementary vectors of 





is described in Supplementary Material A.3.

