## Abstract

### Background

In recent years, constrained optimization – usually referred to as flux balance analysis (FBA) – has become a widely applied method for the computation of stationary fluxes in large-scale metabolic networks. The striking advantage of FBA as compared to kinetic modeling is that it basically requires only knowledge of the stoichiometry of the network. On the other hand, results of FBA are to a large degree hypothetical because the method relies on plausible but hardly provable optimality principles that are thought to govern metabolic flux distributions.

### Results

To augment the reliability of FBA-based flux calculations we propose an additional side constraint which assures thermodynamic realizability, i.e. that the flux directions are consistent with the corresponding changes of Gibb's free energies. The latter depend on metabolite levels for which plausible ranges can be inferred from experimental data. Computationally, our method results in the solution of a mixed integer linear optimization problem with quadratic scoring function. An optimal flux distribution together with a metabolite profile is determined which assures thermodynamic realizability with minimal deviations of metabolite levels from their expected values. We applied our novel approach to two exemplary metabolic networks of different complexity, the metabolic core network of erythrocytes (30 reactions) and the metabolic network iJR904 of *Escherichia coli* (931 reactions). Our calculations show that increasing network complexity entails increasing sensitivity of predicted flux distributions to variations of standard Gibb's free energy changes and metabolite concentration ranges. We demonstrate the usefulness of our method for assessing critical concentrations of external metabolites preventing attainment of a metabolic steady state.

### Conclusion

Our method incorporates the thermodynamic link between flux directions and metabolite concentrations into a practical computational algorithm. The weakness of conventional FBA to rely on intuitive assumptions about the reversibility of biochemical reactions is overcome. This enables the computation of reliable flux distributions even under extreme conditions of the network (e.g. enzyme inhibition, depletion of substrates or accumulation of end products) where metabolite concentrations may be drastically altered.

### Similar content being viewed by others

## Background

Sequencing the whole genomes in conjunction with high-throughput analyses of mRNA, protein and metabolite profiles [1] has paved the way for a fast reconstruction of metabolic networks [2]. For a quantitative assessment of metabolic fluxes, Palsson and co-workers have developed a theoretical approach commonly referred to as flux balance analysis (FBA) [3]. This method relies on the hypothesis that the most likely distribution of stationary fluxes in the network has to be optimal with respect to a feasible optimization criterion linking the fluxes with cellular functions. In most applications of FBA the fluxes have been determined to maximize a specific network output as the production of biomass [4–6]) or the production of ethanol in yeast [7]. Whereas the maximization of biomass production appears to be a reasonable objective of the cellular metabolism of rapidly growing and replicating primitive cells such as bacteria, the flux distribution in complex eukaryotic cells is governed by a larger variety of cellular functions that have to be met simultaneously. Therefore, the principle of flux-minimization was proposed as a more general optimization criterion for FBA [8–10]. The extension of FBA outlined in this paper will be tested by choosing as flux objective both the maximization of biomass and the minimization of internal fluxes at given output fluxes. Flux distributions predicted by FBA are hypothetical because they depend essentially upon the choice of the flux evaluation criterion used. Therefore, to increase the reliability of FBA results, one has to seek for strategies to include additional biochemical knowledge into FBA. One way is to include measured flux rates as further side constraints. However, flux measurements – except for exchange reactions that deliver metabolites into the external space – are still difficult and costly to perform as they require determining labeled isotopomers in a time-dependent manner [11–13]. Another possibility to increase the credibility of flux balance calculations is to include some basic thermodynamics of the reactions and transport processes constituting the network. The thermodynamic consensus rule dictates that a positive net flux through a reaction implies a negative corresponding change of the Gibb's free reaction energy and vice versa. Based on this fundamental criterion one may check whether given flux directions conflict with known Gibb's free energy changes. This allows to identify putative regulatory sites in the network [14, 15] or to decide on the reversibility/irreversibility of reactions [16–20]. However, regarding flux distributions predicted by constrained optimization methods as FBA it is desirable to judge their feasibility not only post-hoc but to include thermodynamic constraints on flux directionalities directly into their calculation [21–23]. In our previous work [8, 9] this was accomplished by weighting negative (backward) fluxes with the thermodynamic equilibrium constant of the reaction. The rationale behind this empirical weighting procedure is to impede reversing the direction of a reaction (such that the change of Gibb's free energy has the opposite sign than under standard conditions) with increasing value of the thermodynamic equilibrium constant. However, this way of mixing the costs for the maintenance of metabolic fluxes with the thermodynamic 'costs' for reversing the direction of a reaction in one and the same objective function is questionable for two reasons. First, the concentrations of metabolites in a cell differ significantly from 1 M so that the actual free energy changes of biochemical reactions may considerably differ from their standard values. Second, increasing or decreasing the concentration of the reactants to an extent that enables reversal of the flux direction might occur in the cell by regulations that does not cause much real 'costs' in terms of the production of more enzyme and of used external resources. One way to overcome this shortcoming of our previous approach [8, 9] is to incorporate fulfillment of the thermodynamic consensus rule as additional side constraint into the calculation of the flux distribution. Such an approach was recently outlined by Henry et al. [23]. These authors studied the range of metabolite concentrations that is still compatible with a thermodynamically feasible flux distribution in a genome-scale network of *E. coli* under conditions of optimal bacterial growth. Here we go one step further to include information on metabolite concentrations directly into the calculation of the flux distribution. Our algorithm considers the optimization of two different objectives: On one hand a functionally optimal and thermodynamically feasible flux distribution is demanded an on the other hand the calculated metabolite concentrations are required to deviate as little as possible from set-point values prescribed on the basis of biochemical knowledge. In the following we outline the method and provide applications to two different metabolic networks: (i) the energy- and redox metabolism of red blood cells for which a detailed kinetic model has been established [24] thus allowing to check the feasibility of our method and (ii) the large-scale genome-based metabolic network of *Escherichia coli* iJR904 [25] which has already been subjected to FBA in several studies [23, 26, 27].

## Results

### Algorithm

#### Thermodynamic constraints

The directionality of the net flux of a chemical reaction and the change of Gibb's free energy are related to each other by the consensus rule

sgn(*v*) = -sgn (ΔG_{r}), (1)

where 'sgn()' is the sign function, ΔG_{r} denotes the change of Gibb's energy of the reaction and *v* is the net flux (rate) through the reaction. Actual changes of Gibb's energy can be calculated from changes of standard Gibb's energy (where each reactant has a concentration of 1 M) according to

where [*M*] is the active concentration (activity) of the metabolite *M*, S and P denote the sets of substrates and products of the reaction, respectively. R is the universal gas constant and *T* is the absolute temperature. The change of the standard Gibb's energy is related to the equilibrium constant *K*_{eq} of the reaction by

It has to be noted that standard Gibb's energy changes depend on temperature, pH value, and ion strength and thus may significantly differ from those determined under specific in vitro conditions. For a metabolic network, (eq. 2) reads in vector notation \frac{\overline{\Delta {\text{G}}_{\text{r}}}}{\text{R}T}=\frac{\overline{\Delta {\text{G}}_{\text{r}}^{\text{0}}}}{\text{R}T}+S\text{C}, where \overline{\Delta {\text{G}}_{\text{r}}} is the column vector of the ΔG_{r} values for all reactions of the network, \overline{\Delta {\text{G}}_{\text{r}}^{\text{0}}} is the column vector of the \Delta {\text{G}}_{\text{r}}^{\text{0}} values, C is the column vector of the natural logarithms of the active metabolite concentrations. (Concentrations are assumed to be strictly positive.) **S** is the stoichiometric matrix of the system, where rows refer to reactions and columns refer to metabolites. A positive or negative matrix element of **S** represents the stoichiometric coefficient with which the metabolite indicated by the column number appears as a product or substrate of the reaction indicated by the row number. Changes of the standard Gibb's energies of reactions can be additively composed of changes of the standard Gibb's energies of the formation of their reactants [28]:

Owing to the first law of thermodynamics the values of the standard Gibb's free energy changes are not independent from each other but have to obey the principle of micro-reversibility dictating the sum of standard free energy values in a closed system to be zero. In several flux balance studies [16, 17, 29–32] this criterion has been referred to as generalization of Kirchhoff's loop law which [see Additional file 3]. The problem is that experimentally determined values for the changes of standard Gibb's energies are not consistent with the principle of micro-reversibility per se because of experimental errors. Therefore, we add correction terms (forming the vector E) to all observed values of standard Gibb's energy changes and determine minimal corrections necessary to assure the principle of micro-reversibility. The corresponding optimization problem reads

where ||E|| is the 2-norm of the vector E, and {\overline{\Delta {\text{G}}_{\text{f}}}}^{\ast} are hypothetic Gibb's free energy changes of formation. {\overline{\Delta {\text{G}}_{\text{r}}^{\text{0}}}}^{\ast}-\text{E} is then used as the modified vector of standard Gibb's energy changes fulfilling the condition of micro-reversibility.

#### Constraints on metabolite concentrations

In case that the metabolite concentrations might assume arbitrary non-negative values it would be always possible to let a chemical reaction proceed in either forward or backward direction. Hence, including information on metabolite concentrations as additional constraints in FBA makes only sense if the concentration of the metabolites can be restricted to a feasible range. If the concentration of a metabolite is known, we use this value as set-point which should be approximated as best as possible by the calculated metabolite concentration. Thus, we add the term

to the objective function where W denotes the set of metabolites *m* for which a set-point (logarithmic) concentration value *s*_{
m
}is available and *c*_{
m
}is the component of C related to metabolite *m*. If the concentration of a metabolite is not exactly known but can be restricted to a narrower concentration range based on metabolite profiling (for *E. coli* such a profile has been summarized by Kümmel et al. [14]) we use this information to define so-called soft concentration bounds denoted by *c*_{low} and *c*_{high}. In case that such physiologically feasible concentration range has not been reported yet we set the lower and upper soft bound close to the minimum and maximum of all known cellular metabolite concentrations. However, it may happen that a non-trivial flux distribution can only be found if the concentration of some metabolites drops off the range defined by the soft bounds. This may be due to the improper choice of soft bounds for some metabolites resulting from large experimental errors in the determination of cellular metabolite concentrations or the (unknown) binding of metabolites to macromolecular structures lowering their effective free concentrations inside the cell. Therefore, concentrations lying outside the range of the soft bounds are allowed in our algorithm but are penalized in the optimization criterion:

Here *c* is the (logarithmic) active concentration of the metabolite. The penalty function for the whole concentration vector is

As in Henry et al. [23] we introduce a second type of bounds, so-called hard bounds, to exclude metabolite concentrations which are impossible from the biochemical point of view. The combined effect of set-point values, soft and hard concentration bounds on the scoring function of the optimization algorithm is shown in figure 1.

Metabolic network models may contain reactions which are simplified in a way that reactants are dropped from the reaction formula. For example, the oxidation of glutathione (GSH) to glutathione disulfide (GSSG) is usually written as an overall reaction 2GSH → GSSG. Actually, this reaction should read 2GSH + R-OOH → GSSG + R-OHH_{2}O where R-OOH represents a large group of not further specified hydroperoxides that can be detoxified by the glutathione system. For these lumped reactions it is impossible to give a realistic \Delta {\text{G}}_{\text{r}}^{\text{0}} value. For other reactions a \Delta {\text{G}}_{\text{r}}^{\text{0}} value is simply not known (e. g. for 37 reactions in *E. coli* [23]). For such reactions the consensus rule (eq. 1) is not applied.

#### Setting up the constrained optimization problem

In FBA, formulation of the optimization problem requires to define the following three elements: (i) a physiologically meaningful scoring function to evaluate flux distributions, (ii) the steady-state conditions for all internal metabolites valid for the time-scale of interest (e. g. the time-scale of growth) and (iii) further constraints taking into account biochemical knowledge as, for example, maximal enzyme capacities limiting the flux rates [10] or thermodynamic constraints on flux directions as those discussed above. The steady-state condition can be formulated as

**S'** V = 0 (9)

where **S'** derives from the full stoichiometric matrix **S** of the network upon deletion of all columns referring to those metabolites which are exchanged with the external environment and thus need not to be balanced. According to the principle of minimal fluxes [8, 9] we set up the scoring function as the sum of the absolute values of all reaction fluxes |V| while assigning fixed values *L*_{
j
}, *j* ∈ *J* to all output fluxes, which are directly linked to cellular functions, the so-called target fluxes, the set of which is denoted by *J*. Adding the weighted terms (eq. 6) and (eq. 8) to the scoring function and including the constraint (eq. 9) the complete optimization problem is written as

\u212d is a vector of ranges defined by the hard concentration bounds. V is the vector of flux rates and *v*_{
j
}is the *j*-th component of V. *λ*_{1}, *λ*_{2} ∈ ℝ^{+} are empirical factors weighting the relative contribution of the various penalty scores relative to the scoring function of fluxes. (For our computations we have chosen *λ*_{1} = 100, *λ*_{2} = 0.01 putting a lower weight to the attainment of set-point concentration values than to the restriciton of the metaboite concentration values to physiologically feasible soft bounds.) *n* is the number of reactions, and for any 1 ≤ *j* ≤ *n*, *d*_{
j
}is a binary variable. *α* is set to a positive number which is larger than any possible flux value and larger than any possible Gibb's energy value, and it can easily be shown that the constraints 0 ≤ *v*_{
j
}+ *αd*_{
j
}≤ *α* and 0 ≤ -\Delta {\text{G}}_{\text{r}}^{j} + *αd*_{
j
}≤ *α* are equivalent to *v*_{
j
}≠ 0 → sgn(*v*_{
j
}) = -sgn (\Delta {\text{G}}_{\text{r}}^{j}). Intentionally, for a zero flux through a reaction the change of Gibb's free energy is not constrained because it might be substantially different from zero if the corresponding enzyme is missing or inhibited. The optimization problem corresponds to a mixed integer (boolean) linear program with quadratic scoring function.

We call a flux distribution obtained by solving the above optimization problem (eq. 10) thermodynamically realizable and refer to it in the following as **TR-fluxmin**, i.e. **t** hermodynamically **r** ealizable **flux**-**min** imized solution. If the maximization of biomass is used as flux objective, the sum of internal fluxes appearing in the objective function is replaced by the negative biomass production rate.

### Testing

#### Application to a metabolic network of human erythrocytes

The method described above was applied to a metabolic network of human red blood cells [24, 33] for which stationary flux distributions have already been calculated in our previous work [8]. The network comprises basically two cardinal metabolic pathways of this cell: glycolysis including the so-called 2,3-bisphosphoglycerate shunt, and the pentose phosphate cycle dividing into an oxidative and a non-oxidative part. The network consists of 27 biochemical reactions, 5 transport processes and 32 metabolites (see figure 2 and the supplementary material for the complete description of the model). The orientation of the arrows in the reaction scheme corresponds to the net direction of the reaction flux at standard concentrations. Standard Gibb's energies have been derived from the equilibrium constants contained in the kinetic model [24, 33]. The functionally essential target reactions that have to be maintained by the network are the following: (i) formation of 2,3-bisphospho-D-glycerate (2,3P_{2}G, reaction #9) required to modulate oxygen affinity of hemoglobin, (ii) ATP-utilization (ATPase, #16), which is mostly spent on the Na^{+}/K^{+}-ATPase to build up the Na^{+}/K^{+}-gradient across the plasma membrane, (iii) oxidation of GSH (GSHox, #21) to prevent oxidative damage of cellular proteins and lipids, (iv) synthesis of PRPP (PRPPS, #26) required for the salvage of adenine nucleotides. The magnitude of these 4 target reactions depends on the specific external conditions of the cell as, for example, osmolarity of the blood or preservation medium, oxidative stress caused by reactive oxygen species, or lowering of the oxygen tension during hypoxia. In our calculations the flux values for these 4 target reactions were chosen as reported for the normal in vivo state of erythrocytes: DPGM = 0.49 mmol/h, ATPase = 2.38 mmol/h, GSHox = 93 *μ* mol/h, PRPPS = 26 *μ* mol/h. With these values for the target fluxes, the comprehensive kinetic model [24, 33] yielded metabolite concentrations as shown in figure 2. These values are in good concordance with experimentally determined concentrations and thus will be referred to in the following as 'observed' concentrations.

Using the same values of standard Gibb's free reaction energy changes as used in the kinetic model and putting the set-point values of the metabolite concentrations to the 'observed' ones, the TR-fluxmin solution of the optimization problem turns out to be identical with the flux-minimized solution determined by our previous approach [8]. A detailed description of the model and the solution mentioned below can be found in the supplements [see Additional file 1].

##### Perturbation analysis

To investigate the impact of errors in the observed metabolite concentrations on the predicted flux distribution the concentration values given in figure 2 were perturbed by multiplying them with a random factor obeying an exponential normal distribution with controlled standard deviation (see caption of figure 3). The hard concentration bounds were chosen as follows: 0.1 *μ* M...100 mM (glucose), 0.1 *μ* M...25 mM (CO_{2} and phosphate), and 0.01 *μ* M...10 mM (28 remaining compounds). Calculation of the flux distribution with randomly altered set-point concentration values was repeated in 1000 trials. Surprisingly, for smaller perturbations the reference solution (= TR-fluxmin for 'true' set-point values) was retained in all trials. But perturbations with a standard deviation of 2 (corresponding to an average factor 7.4 in the change of the concentration values) resulted in a second, slightly different, flux distribution in some trials. For perturbations with standard deviation of 6 (corresponding to an average 400-fold change in the concentration values) this second alternative flux distribution dominated and on top a third alternative flux distributions was obtained in a significant number of trials (see figure 3).

In a second perturbation analysis, the standard Gibb's energy change values were randomly altered in a similar manner. Also here, there was an increasing tendency towards the two alternative flux distributions found before when increasing the magnitude of perturbations (see figure 4). These alternative flux distributions already occurred at a standard deviation of 3 (corresponding to an average deviation of 7.7 kJ/mol of the standard Gibb's energy changes) and their relative share became dominant at a standard deviation value of 6 (corresponding to an average deviation of 15.5 kJ/mol of the standard Gibb's energy changes).

##### Inspection of alternative flux distributions

The three alternative flux distributions obtained in the perturbation studies differ in the uptake flux of glucose (1.50495 mmol/h for the unperturbed network; 1.5015 mmol/h and 1.4972 mmol/h for the two alternative flux distributions) and the fluxes in the non-oxidative pentose phosphate pathway converting ribulose-5-phosphate (Ru5P) into fructose-5-phosphate (Fru6P) and glyceraldehyde 3-phosphate (GraP). The reference solution predicts this pathway to proceed in forward direction thereby forming 20.7 *μ* mol/h ribulose-5-phosphate. In the second solution (glucose uptake 1.5015 mmol/h) this pathway is not used at all whereas in the third flux distribution (glucose uptake 1.4972 mmol/h) it is used in backward direction producing 25.8 *μ* mol/h ribulose-5-phosphate. Interestingly, the latter flux distribution is also obtained for the unperturbed network if the maximization of biomass production used as flux criterion. Notably, all three different flux distributions obtained as solution of the minimization problem (eq. 10) for randomly altered thermodynamic parameters and set-point concentrations are feasible from the kinetic view point, i.e. the kinetic model of the erythrocyte metabolism yields a stable stationary solution.

##### Effect of external concentrations

Our algorithm allows assessing how the predicted flux distributions are affected by changes in the concentration of external metabolites. In vivo, such a situation may occur if some essential fuels for the cellular metabolism are depleted, for example, due to a reduced blood flow through vessels with severe atherosclerotic stenoses, or some end products of the cellular metabolism accumulate because of a reduced excretion capacity of the body. For example, in case of strong physical exercise the concentration of lactate in human blood may rise to values as high as 19.5 mM (in blood plasma) respectively 7.0 mM (in erythrocyte cytoplasm) [34] indicating that the lactate production by the anaerobic skeletal muscle clearly exceeds its rate of re-conversion to glucose in the liver and its utilization rate in the heart muscle. To investigate the consequences of such high blood lactate levels for the metabolism of red cells we calculated thermodynamically realizable flux distributions at gradually increasing concentration of external lactate. For all metabolites except external lactate, the hard bounds were put to ± 25% and the soft bounds to ± 10% deviation from of the 'observed' concentration values. For external lactate concentrations up to a critical value of 12.4 mM our algorithm predicted a thermodynamically realizable flux distribution. For concentrations higher than 12.4 mM no stationary flux distribution solution was found. Increasing gradually the concentration of external lactate up to the critical value of 12.4 mM, the concentrations of pyruvate, NAD^{+} and NADH tended towards the hard bounds to ensure the flux through the lactate dehydrogenase (EC:1.1.1.27) to be directed towards formation of lactate. Our find of a metabolic threshold effect with respect to blood lactate levels corresponds well with clinical observations. At lactate levels higher than 4 mM a reduced deformability of erythrocytes is observed, which may account for the exercise-induced arterial hypoxemia occuring in athletes [35]. Decreasing deformability of erythrocytes is a clear indication for a severely perturbed metabolism of the cell.

#### Application to a metabolic network of *E. coli*

To check the applicability of our algorithm to genome-scale metabolic networks comprising hundreds of reactions and metabolites, we performed the same type of analysis as described above with respect to the metabolic network iJR904 of the bacterium *E. coli* reconstructed by Palsson and co-workers [25]. In this model a minimal medium composed of glucose, ammonium, sulfate, oxygen, phosphate is sufficient for growth according to the biomass creation formula associated with the model. Experimental flux data for *E. coli* has been determined by Emmerling et al. [36] which correspond to 17 internal fluxes of the iJR904 network (using the projection of Segre et al. [37] onto the iJE660a network of *E. coli* [38].) The thermodynamic properties of the iJR904 network, consisting of 659 metabolites and 931 reactions, have been analyzed previously [14, 15, 20, 23, 32]. Since experimentally determined Gibb's free energies are available only for a minor fraction of reactions [20, 39] we use computed values given by Henry et al. [23].

These values were obtained by a slightly modified version of the group contribution method [40, 41]. Physiological concentration ranges were available for 22 internal metabolites (given in Kümmel et al. [14]) and 10 external metabolites (given in Henry et al. [23]). For the other metabolites generic concentration bounds were used based on typical cellular concentration ranges reported in the literature: 20 *μ* M-0.5 mM (soft bounds), 5 *μ* M-2 mM (hard bounds). Further details of the model are given in the supplement [see Additional file 2].

We calculated the flux distribution in this network according to the proposed optimization principle (eq. 10) using as flux objective the maximization of the biomass production. No a priori assumptions were made with respect to the directionality of reactions with two exceptions: The direction of the exchange fluxes was fixed according to the experimental conditions [36] and the direction of 37 internal reactions for which no Gibb's energy value was given in Henry et al. [23] was also fixed [see Additional file 2, archive member 'Ecoli-model.txt', section 'reactions excluded from the TR-property', to see which]. As shown in Fig. 5) (case: 'TR-biomax', data points symbolized by blue triangles) the thermodynamically realizable solution provided a good concordance with observed flux values available for 17 internal reactions. To check the influence of the thermodynamic side constraints on the quality of the flux distribution, we omitted the condition of thermodynamic realizability from the optimization algorithm, again making no a priori assumptions on flux directionalities. In this case ('biomax, fully reversible', data symbolized by red squares in Fig. 5)) the concordance between predicted and observed flux values diminished significantly. This example shows that our algorithm may significantly improve the reliability of flux predictions even if the concentration range of most metabolites is only roughly restrained.

In a third calculation we again omitted the condition of thermodynamic realizability from the optimization algorithm but instead used the heuristic classification of reactions into reversible and irreversible ones as outlined in [25] (case: 'biomass, heuristic irreversibilities', data points symbolized by green diamonds). The obtained flux distribution also yielded a reasonably good concordance between predicted and observed flux values. Notably, this 'classical' variant of FBA gave no better predictions of the observed fluxes than the TR-solution obtained with our algorithm. This qualifies our method as a valuable flux predictor for large-scale networks without the need to apply heuristic rules for the assignment of flux directionalities.

##### Perturbation analysis

Using the same perturbation analysis as outlined above for the erythrocyte network we investigated the impact of alterations in the values of the Gibb's free standard energies on the predicted flux distributions. Such an analysis is of importance as the values of standard Gibb's free energy changes computed by the group contribution method may generally exhibit a large degree of uncertainty [42].

Compared with the findings for the erythrocyte network, much smaller perturbations already resulted in a multitude of alternative flux distributions (see figure 6). Thus, the higher the complexity of the network the more susceptible is the predicted flux distribution is to the choice of the standard Gibb's energies. Closer inspection of the predicted alternative flux distributions showed that the main differences are concentrated in some distinct parts of the networks. We found the largest variability of predicted fluxes for the exchange of CO_{2}, the 3-reaction pathway leading from acetaldehyde and CoA to formation of ATP from ADP via acetyl-phosphate as intermediate, and in the import of *a*-ketoglutarate. The possible fluxes through these reactions appear to be strongly determined by thermodynamic constraints and thus are difficult to predict given low accuracy of thermodynamic data.

## Discussion & Conclusion

Quantitative evaluation of genome-scale metabolic models by means of FBA is becoming more and more appealing because it works without knowledge of the kinetics and regulation of the underlying enzymes and membrane transporters. However, the outcome of FBA is rather hypothetical because it relies on plausible but hardly provable optimality principles that are thought to govern metabolic flux distributions.

Therefore, a challenge for computational systems biology lies in the incorporation of all biochemical knowledge that is obtainable at genome-scale (which is not the case for enzyme kinetics). One important restriction of fluxes in the network arises from thermodynamics. Reactions associated with a decrease of free energy larger than 30 kJ/mol are generally thought to be irreversible. This condition can be used as an additional constraint on feasible flux distributions. Kümmel and co-workers [20] have recently developed an algorithm that – based on thermodynamics, network topology and heuristic rules – automatically assigns reaction directions in metabolic models such that the reaction network is thermodynamically feasible with respect to the production of energy equivalents. However, an a priori distinction between reversible and irreversible reactions may become problematic under extreme conditions, e.g. depletion of substrates or accumulation of intermediates due to inhibition of enzymes, where metabolite concentrations may drastically change thus allowing to reverse reactions that are normally designated to be irreversible.

For example, under hypoxic conditions, the cellular concentration of oxygen may become so low that the respiratory chain – usually thought to carry electrons from hydrogen to oxygen in a strictly irreversible manner – may indeed operate in the reverse direction, i.e. reducing NAD to NADH_{2} [43]. Thus, it is necessary to replace the rigid priori classification of reactions into reversible and irreversible ones by a more flexible constraint that assures the flux directions to be compatible with the change of Gibb's free energies, exhibiting a wide range of values depending on the actual metabolite concentrations. An important step into this direction was recently made by Henry et al. [23] who included the thermodynamic consensus rule as additional side constraint into FBA. In their study, they investigated the range of metabolite concentrations that allow in a genome-scale network of *E. coli* the realization of a specific flux distribution assuring optimal bacterial growth. In contrast to this approach, the algorithm proposed in this work aims at employing reliable information on metabolite concentrations to restrain the solution space of FBA.

Hence, depending on reported ranges of metabolite concentrations, our algorithm may yield different flux distributions. In other words, in our approach we do not ask for metabolite concentrations that are compatible with a given flux distribution but in contrast ask for the flux distribution that is compatible with a given metabolite profile. As demonstrated for two exemplary metabolic networks, even if no measurements of metabolite concentrations are available restriction of concentrations to physiologically feasible ranges alone allows the prediction of reliable flux distributions if no a priori assumptions are made on the reversibility of the reactions. As demonstrated for the erythrocyte network, our approach may provide valuable information about alterations in the external conditions of a cell that may result in a metabolic dysfunction. Of course, FBA cannot assess whether a stable steady state may exist at very high concentrations of external lactate because this is determined by kinetic regulation. Possibly the metabolite concentrations may vary even in a larger interval than imposed in our calculations. This problem can be addressed better by a comprehensive kinetic network model. Nevertheless, our method may provide valuable information on external conditions causing metabolic problems merely for thermodynamic reasons. Concerning the predictive capacity of our method it must be critically noted that – based on a comparison with a relatively small number of measured fluxes – the most reliable flux distribution for the *E. coli* network is still obtained if the directionality of fluxes is a priori defined based on biochemical conventions. This is obviously due to the fact that – owing to the lack of reliable experimental data – the soft bounds for the metabolite concentration used in our method have been too generously chosen. Indeed, enlarging systematically the physiologically feasible concentration ranges one eventually obtains a network without any constraint of flux directionalities. Hence, the usefulness of the proposed method essentially depends upon the availability of reliable information on values of free energy changes and metabolite concentrations. As long as this information is not available, the benefit of our method consists mostly in the generation of alternative flux distributions by varying the values of standard Gibb's free energy changes and/or in the physiologically relevant concentration ranges of metabolites. Applying such perturbation analysis to two networks of different complexity has provided evidence that the larger the network is, the more alternative flux distributions occur, even at relatively modest variation of energy values of about 5 kJ/mol. Inspection of such alternative flux distributions reveals critical reactions for which fluxes are largely undetermined by the FBA approach. In this respect, our method represents a useful complement to the thermodynamic evaluation method recently proposed by Kümmel et al. to identify putative regulatory sites by network-embedded thermodynamic analysis of metabolome data [14].

## References

Fell D: Enzymes, metabolites and fluxes. J Exp Bot. 2005, 56 (410): 267-72. 10.1093/jxb/eri011.

Poolman M, Bonde B, Gevorgyan A, Patel H, Fell D: Challenges to be faced in the reconstruction of metabolic networks from public databases. Syst Biol (Stevenage). 2006, 153 (5): 379-384.

Varma A, Boesch B, Palsson B: Stoichiometric interpretation of

*Escherichia coli*glucose catabolism under various oxygenation rates. Appl Environ Microbiol. 1993, 59 (8): 2465-2473.Varma A, Palsson B: Metabolic capabilities of

*Escherichia coli*. 2. Optimal-growth patterns. J Theor Biol. 1993, 165: 503-522. 10.1006/jtbi.1993.1203.Edwards J, Ibarra R, Palsson B: In silico predictions of

*Escherichia coli*metabolic capabilities are consistent with experimental data. Nat Biotechnol. 2001, 19 (2): 125-130. 10.1038/84379.Dien SV, Lidstrom M: Stoichiometric model for evaluating the metabolic capabilities of the facultative methylotroph

*Methylobacterium extorquens*AM1, with application to reconstruction of C(3) and C(4) metabolism. Biotechnol Bioeng. 2002, 78 (3): 296-312. 10.1002/bit.10200.Jin Y, Jeffries T: Stoichiometric network constraints on xylose metabolism by recombinant

*Saccharomyces cerevisiae*. Metab Eng. 2004, 6 (3): 229-238. 10.1016/j.ymben.2003.11.006.Holzhütter H: The principle of flux minimization and its application to estimate stationary fluxes in metabolic networks. Eur J Biochem. 2004, 271 (14): 2905-2922. 10.1111/j.1432-1033.2004.04213.x.

Holzhütter S, Holzhütter H: Computational design of reduced metabolic networks. Chembiochem. 2004, 5 (10): 1401-1422. 10.1002/cbic.200400128.

Holzhutter H: The generalized flux-minimization method and itsapplication to metabolic networks affected by enzyme deficiencies. Biosystems. 2006, 83: 98-107. 10.1016/j.biosystems.2005.04.008.

Dien SV, Strovas T, Lidstrom M: Quantification of central metabolic fluxes in the facultative methylotroph

*Methylobacterium extorquens*AM1 using^{13}C-label tracing and mass spectrometry. Biotechnol Bioeng. 2003, 84: 45-55. 10.1002/bit.10745.Iwatani S, Dien SV, Shimbo K, Kubota K, Kageyama N, Iwahata D, Miyano H, Hirayama K, Usuda Y, Shimizu K, Matsui K: Determination of metabolic flux changes during fed-batch cultivation from measurements of intracellular amino acids by LC-MS/MS. J Biotechnol. 2007, 128: 93-111. 10.1016/j.jbiotec.2006.09.004.

Sauer U: Metabolic networks in motion:

^{13}C-basedflux analysis. Mol Syst Biol. 2006, 2: [Article number 62]Kummel A, Panke S, Heinemann M: Putative regulatory sites unraveled by network-embedded thermodynamic analysis of metabolome data. Mol Syst Biol. 2006, 2: [Article number 34]

Henry C, Jankowski M, Broadbelt L, Hatzimanikatis V: Genome-scale thermodynamic analysis of

*Escherichia coli*metabolism. Biophys J. 2006, 90 (4): 1453-1461. 10.1529/biophysj.105.071720.Beard D, Babson E, Curtis E, Qian H: Thermodynamic constraints for biochemical networks. J Theor Biol. 2004, 228 (3): 327-333. 10.1016/j.jtbi.2004.01.008.

Qian H, Beard D, Liang S: Stoichiometric network theory for nonequilibrium biochemical systems. Eur J Biochem. 2003, 270 (3): 415-421. 10.1046/j.1432-1033.2003.03357.x.

Yang F, Qian H, Beard D: Ab initio prediction of thermodynamically feasible reaction directions from biochemical network stoichiometry. Metab Eng. 2005, 7 (4): 251-259. 10.1016/j.ymben.2005.03.002.

Yang F, Beard D: Thermodynamically based profiling of drug metabolism and drug-drug metabolic interactions: a case study of acetaminophen and ethanol toxic interaction. Biophys Chem. 2006, 120 (2): 121-134. 10.1016/j.bpc.2005.10.013.

Kummel A, Panke S, Heinemann M: Systematic assignment of thermodynamic constraints in metabolic network models. BMC Bioinformatics. 2006, 7: 512- 10.1186/1471-2105-7-512.

Mavrovouniotis M: Identification of localized and distributed bottlenecks in metabolic pathways. Proc Int Conf Intell Syst Mol Biol. 1993, 1 (5): 275-283.

Mavrovouniotis M: Duality theory for thermodynamic bottlenecksin bioreaction pathways. Chem Eng Sci. 1996, 51 (9): 1495-1507. 10.1016/0009-2509(95)00308-8.

Henry C, Broadbelt L, Hatzimanikatis V: Thermodynamics-based metabolic flux analysis. Biophys J. 2007, 92 (5): 1792-1805. 10.1529/biophysj.106.093138.

Schuster R, Holzhütter H: Evolution and optimal design of metabolic pathways: the possible consequences of large-scale enzyme alterations on the metabolic efficiency of human erythrocytes as studied on the basis of a mathematical model. J Biol Syst. 1995, 3: 207-215. 10.1142/S0218339095000204.

Reed J, Vo T, Schilling C, Palsson B: An expanded genome-scalemodel of

*Escherichia coli*K-12 (iJR904 GSM/GPR). Genome Biology. 2003, 4 (9): R54.1-R54.12. 10.1186/gb-2003-4-9-r54.Reed J, Palsson B: Genome-scale in silico models of

*E. coli*have multiple equivalent phenotypic states: assessment of correlated reaction subsets that comprise network states. Genome Res. 2004, 14 (9): 1797-1805. 10.1101/gr.2546004.Wang Q, Chen X, Yang Y, Zhao X: Genome-scale in silico aided metabolic analysis and flux comparisons of

*Escherichia coli*to improve succinate production. Appl Microbiol Biotechnol. 2006, 73 (4): 887-894. 10.1007/s00253-006-0535-y.Alberty R: Thermodynamics of Biochemical Reactions. 2003, Hoboken, NJ: Wiley & Sons

Price N, Famili I, Beard D, Palsson B: Extreme pathways and Kirchhoff's second law. Biophys J. 2002, 83 (5): 2879-2882.

Beard D, Liang S, Qian H: Energy balance for analysis of complex metabolic networks. Biophys J. 2002, 83: 79-86.

Beard D, Qian H: Thermodynamic-based computational profiling of cellular regulatory control in hepatocyte metabolism. Am J Physiol Endocrinol Metab. 2005, 288 (3): E633-644. 10.1152/ajpendo.00239.2004.

Price N, Thiele I, Palsson B: Candidate States of

*Helicobacter pylori*'s Genome-Scale Metabolic Network upon Application of "Loop Law" Thermodynamic Constraints. Biophys J. 2006, 90 (11): 3919-3928. 10.1529/biophysj.105.072645.Schuster R, Holzhütter H: Use of mathematical models for predicting the metabolic effect of large-scale enzyme activity alterations. Eur J Biochem. 1995, 229 (2): 403-418. 10.1111/j.1432-1033.1995.0403k.x.

Hildebrand A, Lormes W, Emmert J, Liu Y, Lehmann M, Steinacker J: Lactate concentration in plasma and red blood cells during incremental exercise. Int J Sports Med. 2000, 21 (7): 463-468. 10.1055/s-2000-7412.

Varlet-Marie E, Brun J: Reciprocal relationships between blood lactate and hemorheology in athletes: another hemorheologic paradox. Clin Hemorheol Microcirc. 2004, 30 (3–4): 331-337.

Emmerling M, Dauner M, Ponti A, Fiaux J, Hochuli M, Szyperski T, Wuthrich K, Bailey J, Sauer U: Metabolic flux responses to pyruvate kinase knockout in

*Escherichia coli*. J Bacteriol. 2002, 184: 152-164. 10.1128/JB.184.1.152-164.2002.Segre D, Vitkup D, Church G: Analysis of optimality in naturaland perturbed metabolic networks. Proc Natl Acad Sci USA. 2002, 99 (23): 15112-15117. 10.1073/pnas.232349399.

Edwards J, Palsson B: The

*Escherichia coli*MG1655 in silico Metabolic Genotype: Its Definition, Characteristics, and Capabilities. Proc Natl Acad Sci USA. 2000, 97: 5528-5533. 10.1073/pnas.97.10.5528.Goldberg R: Thermodynamics of enzyme-catalyzed reactions: Part 6 – 1999 update. J Phys Chem Ref Data. 1999, 28: 931-965. 10.1063/1.556041.

Mavrovouniotis M: Group Contribution for Estimating Standard Gibbs energies of Formation of Biochemical Compounds in Aqueous Solution. Biotechnol Bioeng. 1990, 36: 1070-1082. 10.1002/bit.260361013.

Mavrovouniotis M: Estimation of standard Gibbs energy changes of biotransformations. J Biol Chem. 1991, 266 (22): 14440-14445.

Maskow T, von Stockar U: How reliable are thermodynamic feasibility statements of biochemical pathways. Biotechnol Bioeng. 2005, 92 (2): 223-230. 10.1002/bit.20572.

Jin Q, Bethke C: Kinetics of electron transfer through the respiratory chain. Biophys J. 2002, 83 (4): 1797-1808.

Klamt S, Stelling J, Ginkel M, Gilles E: FluxAnalyzer: exploring structure, pathways, and flux distributions in metabolic networks on interactive flux maps. Bioinformatics. 2003, 19 (2): 261-269. 10.1093/bioinformatics/19.2.261.

Pickover C: Keys to Infinity. 1995, chap 31: 233-247. New York, NY, U.S.A.: John Wiley & Sons, Inc

Box G, Muller M: A note on the generation of random normal deviates. Ann Math Stat. 1958, 29: 610-611.

## Acknowledgements

The work of AH was funded by the German Federal Government's Program "Systems Biology of Hepatocytes – HepatoSys".

## Author information

### Authors and Affiliations

### Corresponding author

## Additional information

### Authors' contributions

The basic idea of the presented theoretical concept was developed by HGH. AH and SH prepared of the network models. AH developed the algorithms and carried out the computations. All authors jointly contributed to the manuscript.

## Electronic supplementary material

### 12918_2007_23_MOESM1_ESM.zip

Additional file 1: This archive contains three files relating to the erythrocyte network. The file "Ery-model.txt" gives the definition of the network as tab-separated text file which is organized in the sections: 'Metabolites', 'Reactions', 'Reactions excluded from the TR property', 'Equilibrium constants', 'Targetfluxes', 'Fixed concentrations', The file "Ery.sbml" gives the network description in SBML format. The file "Ery-solutions.txt" contains three solutions: the thermodynamically realizable fluxmin solution (TR-fluxmin) together with the associated metabolite concentrations and the two alternative solutions obtained in the perturbation analysis. (ZIP 4 KB)

### 12918_2007_23_MOESM2_ESM.zip

Additional file 2: This archive includes two directories relating to the *E. coli* iJR904 computations: "FullyReversible" and "HeuristicIrreversibilities". Both contain the file "Ecoli-model.txt", a tab-separated text file, as the definition of the network which is organized in the sections: 'Metabolites', 'Reactions', 'Reactions excluded from the TR property', 'Equilibrium constants', 'Targetfluxes', 'Concentration bounds', and 'Setpoint concentrations'. Both directories also contain the file "Ecoli.sbml", the network description in SBML format. The file "Biomax.txt" also residing in both directories contains the solution of the biomass maximization without the constraint of thermodynamic realizability as a tab-separated text file assigning a reaction identifier with a flux value if it is not zero.. Additionally the directory "FullyReversible" contains the file "TR-Biomax.txt" as the solution to the thermodynamically realizable biomass maximization together with the hypothetic metabolite concentrations compatible with the flux directions. (ZIP 143 KB)

### 12918_2007_23_MOESM3_ESM.pdf

Additional file 3: This document shows the proof that thermodynamic realizability implies that there is no net flux in a closed loop – a consequence of the generalization of Kirchhoff's loop law. (PDF 80 KB)

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## About this article

### Cite this article

Hoppe, A., Hoffmann, S. & Holzhütter, HG. Including metabolite concentrations into flux balance analysis: thermodynamic realizability as a constraint on flux distributions in metabolic networks.
*BMC Syst Biol* **1**, 23 (2007). https://doi.org/10.1186/1752-0509-1-23

Received:

Accepted:

Published:

DOI: https://doi.org/10.1186/1752-0509-1-23