Modeling genome-wide enzyme evolution predicts strong epistasis underlying catalytic turnover rates

Heckmann, David; Zielinski, Daniel C.; Palsson, Bernhard O.

doi:10.1038/s41467-018-07649-1

Article
Open access
Published: 10 December 2018

Volume 9, article number 5270, (2018)

David Heckmann¹,
Daniel C. Zielinski¹ &
Bernhard O. Palsson^1,2 (ORCID: orcid.org/0000-0003-2357-6785)

Abstract

Systems biology describes cellular phenotypes as properties that emerge from the complex interactions of individual system components. Little is known about how these interactions have affected the evolution of metabolic enzymes. Here, we combine genome-scale metabolic modeling with population genetics models to simulate the evolution of enzyme turnover numbers (k_cats) from a theoretical ancestor with inefficient enzymes. This systems view of biochemical evolution reveals strong epistatic interactions between metabolic genes that shape evolutionary trajectories and influence the magnitude of evolved k_cats. Diminishing returns epistasis prevents enzymes from developing higher k_cats in all reactions and keeps the organism far from the potential fitness optimum. Multifunctional enzymes cause synergistic epistasis that slows down adaptation. The resulting fitness landscape allows k_cat evolution to be convergent. Predicted k_cat parameters show a significant correlation with experimental data, validating our modeling approach. Our analysis reveals how evolutionary forces shape modern k_cats and the whole of metabolism.

Introduction

The biological systems we observe today are the results of evolutionary trajectories that were shaped by their underlying genotype-to-fitness map, termed the fitness landscape. The components of the system constantly change to increase fitness in the current environment. It is thus tempting to assume that, given the right environment, biological systems can be described as the state that results in the highest fitness possible under all biophysical constraints. Whereas such optimality assumptions have been successfully applied to understand a variety of systems properties like bacterial growth rates^1,2, gene expression patterns^3,4,5, and metabolic fluxes^6,7, they are expected to prove futile when the underlying fitness landscape is rugged and exhibits local optima^8,9, or when natural selection cannot overcome genetic drift to establish potential fitness gains^10,11,12. The topography of the fitness landscape is determined by epistasis¹³, i.e., the extent to which the fitness effect of a mutation depends on the genetic background. Understanding epistasis is thus crucial for understanding evolutionary dynamics and constraints, and systems models can serve as a key tool to understand these interactions^9,14,15.

It was suggested that the catalytic turnover numbers (k_cats) of metabolic enzymes constitute an example of a system state that is distant from a potential optimum, as the efficiency of most enzymes remains far from its theoretical maximum^16,17. Enzyme turnover numbers span over six orders of magnitude and are essential for understanding biological processes on a quantitative level, as they quantitatively describe the proteomic demands of reaction flux, growth, and thus fitness^{2,18,19,20,21,22,23}. In contrast to this high variability and functional importance, experimental data on k_cat is scarce (data in the enzyme kinetics database BRENDA²⁴ accounts for about 10% of the reactions in the E. coli model iJO1366^16,25) and exhibits high noise¹⁶. An improved understanding of the evolutionary and biophysical forces that shape the distribution of kinetic parameters on a systems scale would thus constitute an important step towards quantitative understanding of cellular metabolism. A meta-analysis of databases of k_cats showed two major patterns¹⁶. On the one hand, k_cats in primary metabolism are consistently higher than those in pathways of secondary metabolism, a finding that can be interpreted as the result of differential selection pressure on the respective genes. On the other hand, the underlying biochemical mechanism has a measurable effect on k_cat, suggesting that an interplay between biophysical and evolutionary constraints determines metabolic k_cats. How these factors have acted mechanistically to result in the diverse kinetic turnover numbers we observe today is unknown.

The study of evolution is often limited to retrospective phylogenetic analysis of genome sequences. Nevertheless, when the selective advantage conferred by a metabolic system can be identified, quantitative models can be used to predict fitness correlates and evolution. In the past, systems models of metabolism have been used successfully to describe a variety of evolutionary phenomena like the dynamics of genome reduction²⁶, properties of ancient metabolism²⁷, the global optimum of metabolic adaptation¹, and the trajectories of photosynthesis evolution²⁸. In this study, we aim to understand the evolutionary mechanisms that underlie k_cat evolution and its apparent failure to reach optimality. As k_cats provide a quantitative link between proteome costs and metabolic flux, metabolic models can be used to predict how k_cats affect growth as a proxy for fitness. To this end, we combine genome-scale modeling of metabolism with population genetics models to simulate how modern k_cats evolved from slow ancestors in a network context. We predict that k_cat evolution is convergent and constrained by strong epistasis. In order to validate the model, we compare end points of our evolutionary simulations to experimental turnover rates from in vitro and in vivo sources.

Results

A model for simulating systems-wide k_cat evolution

As k_cats affect fitness by controlling the proteomic cost of enzyme reactions^2,18,19,29, we hypothesize that genome-scale models of cell growth can be used to retrace k_cat evolution in a network context.

The core structure of the metabolic network is conserved across the tree of life^30,31, and thus modern metabolic networks can be expected to contain information about the network context in which enzymes evolved. Because of the quality of its metabolic reconstruction and the relatively high coverage of kinetic data, we choose the metabolic network of E. coli K-12 MG1655 as an ideal candidate to study k_cat evolution.

To predict k_cat-dependent growth as a proxy for fitness, we use the MOMENT algorithm⁴ and a genome-scale reconstruction of E. coli metabolism²⁵. The MOMENT algorithm optimizes growth under a constraint on the total metabolic proteome a cell can sustain. As changes in gene expression can be achieved by the gene regulatory network of the cell or through mutations in a genetic target that is much larger than that for kinetic parameter evolution, we model gene expression as growth optimal.

Modern enzymes exhibit relatively high substrate specificity, but are assumed to have evolved from slow multifunctional ancestors^32,33,34. We aim to model the adaptation of kinetic turnover numbers after specificity had increased, but while turnover numbers were still low. We thus assign ancestral turnover numbers of 10⁻³ s⁻¹, similar to the slowest enzymes observed today¹⁶. Starting from these slow ancestral enzymes, mutations are drawn randomly as multiplicative changes in the k_cat of a random reaction, where the majority are assumed to decrease k_cat (decreasing:increasing = 100:1, see Fig. 1 and Methods). Whether a novel mutation achieves fixation is then calculated for the estimated effective population size of E. coli^35,36 (N_e = 2.5 × 10⁷), and k_cat evolution is simulated with a Markov Chain Monte Carlo approach (MCMC, Fig. 1). The model thus uses a strong-selection-weak-mutation regime³⁷.
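The mutation and fixation steps described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the fixation probability uses Kimura's diffusion approximation for a new mutant in a haploid population, which is an assumption here because the exact expression is given only in the Methods. The function names, the fixed fold change of 2, and the acceptance rule in `mcmc_step` are likewise hypothetical.

```python
import math
import random

N_E = 2.5e7  # effective population size of E. coli quoted in the text


def fixation_probability(s, n_e=N_E):
    """Fixation probability of a new mutant with selection coefficient s.

    Assumed form: (1 - exp(-2s)) / (1 - exp(-2 * n_e * s)).
    """
    if s == 0.0:
        return 1.0 / n_e  # neutral limit
    if s > 0.0:
        return math.expm1(-2.0 * s) / math.expm1(-2.0 * n_e * s)
    # s < 0: multiply numerator and denominator by exp(2 * n_e * s)
    # so that no exponential overflows for strongly deleterious mutations.
    return (math.expm1(-2.0 * s) * math.exp(2.0 * n_e * s)
            / -math.expm1(2.0 * n_e * s))


def propose_mutation(kcats, rng, fold=2.0, p_increase=1.0 / 101.0):
    """Scale the k_cat of one random reaction (100:1 decreasing:increasing).

    The fixed fold change is a placeholder for the (unknown) distribution
    of mutational effects.
    """
    mutant = list(kcats)
    i = rng.randrange(len(mutant))
    mutant[i] *= fold if rng.random() < p_increase else 1.0 / fold
    return mutant


def mcmc_step(kcats, growth_rate, rng):
    """Propose one mutation and keep it if it reaches fixation."""
    mutant = propose_mutation(kcats, rng)
    s = growth_rate(mutant) / growth_rate(kcats) - 1.0
    if rng.random() < fixation_probability(s):
        return mutant
    return kcats
```

Here `growth_rate` stands for any function that maps a k_cat vector to a growth rate (MOMENT in the study). Taking the selection coefficient as the relative change in growth rate is a further simplifying assumption.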

As biological catalysts are limited to natural amino acids to stabilize transition states, it is expected that many reactions will have a distinct biophysical upper limit to the turnover rate that is lower than the theoretical limit set by the diffusion-limited rate of collisions. As certain reaction mechanisms were shown to consistently exhibit high k_cats¹⁶, we use the enzyme commission (EC) number to decide on a candidate set of 569 biophysically unconstrained reactions (see Methods). The remaining 1087 enzymes were considered biophysically constrained and were fixed to the median of in vitro k_cat measurements (13.7 s⁻¹). In the context of evolutionary predictions, the sizes of the constrained and unconstrained sets are more meaningfully compared in terms of reactions that contribute to growth. Flux variability analysis³⁸ for aerobic growth on glucose reveals that 278 growth-relevant reactions (see Methods) are unconstrained, while 183 carry a biophysical constraint; the majority of in silico growth-relevant reactions thus evolves without an upper limit.

Evolutionary trajectories exhibit jumps and convergence

When simulating k_cat evolution with the MCMC algorithm, we can trace the dynamics of adaptation through evolutionary trajectories of growth rates (Fig. 2a). As a starting point, we choose an aerobic glucose environment. Ancestral slow enzymes cause initial growth rates to be low, but fixation of mutations that increase selected k_cats leads to an irregular increase in growth rates that eventually saturates towards a growth rate close to 0.5 h⁻¹. This behavior is reproducible across replicates, and final growth rates are convergent across these independent evolutionary trajectories. The average trajectory shows a sigmoidal shape that can be explained by a simple analytical model (Supplementary Note 1, Supplementary Figs. 16, 17, and 20), where variance in fitness is highest in intermediate states. Even though the majority of growth-contributing reactions, as determined by flux variability analysis³⁸, were not assigned biophysical constraints on the evolution of higher k_cats, growth rates are unable to surpass 0.5 h⁻¹, even when simulations are continued further than shown in Fig. 2 (Supplementary Figs. 2 and 5). This effect is the result of diminishing returns epistasis (DRE) acting between the evolving genes: the same mutation will result in a smaller fitness gain when the genetic background already enables a high growth rate (inset of Fig. 2b). Due to this effect, even large improvements in k_cats of high-flux pathways can only confer a fitness benefit that approaches that of a neutral mutation and thus become subject to drift rather than selection^10,11 (Fig. 2b). We confirm this idea by using a greedy search that iteratively fixes the most beneficial mutations that double k_cat: the maximum achievable fitness gain will reach the neutral barrier^10,11 (where s is smaller than ~1/N_e) without achieving a growth rate >0.5 h⁻¹ (Supplementary Fig. 5).
The underlying mechanism for the observed DRE is the dispersion of biophysical constraints through the shared metabolic proteome (Supplementary Note 1, Supplementary Fig. 4); as genome-wide adaptation progresses, improvements of already high k_cats free up little protein that can be invested in limited reactions. This effect is independent of whether multiplicative or additive mutations are used and is particularly strong because many enzymes contribute to fitness (Supplementary Note 1, Supplementary Figs. 17, 18, 19, and 21). We simulated a maximum growth rate that ignores evolutionary constraints by setting the k_cat of all unconstrained reactions to a value similar to the fastest known enzymes of 1e5 s⁻¹. We find a theoretically achievable growth rate of 1.58 h⁻¹, more than three times the rate of the evolved result. This result indicates the strong effect that DRE has in constraining k_cat evolution: it acts to keep the system far from a theoretical fitness optimum.
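A toy calculation can illustrate how a shared proteome budget produces diminishing returns. The sketch below is not the MOMENT model: it assumes a single linear pathway in which every reaction carries the same flux, all enzymes have equal molecular weight, and the growth rate is proportional to that flux. The reaction counts and the 13.7 s⁻¹ constraint are taken from the text; the function names and the `budget` parameter are hypothetical.

```python
K_CONSTRAINED = 13.7   # s^-1, k_cat assigned to constrained reactions
N_CONSTRAINED = 183    # growth-relevant constrained reactions (from the text)
N_UNCONSTRAINED = 278  # growth-relevant unconstrained reactions (from the text)


def toy_growth_rate(k_free, budget=1.0):
    """Flux through a linear pathway limited by a shared enzyme budget.

    Each reaction needs enzyme in proportion to flux / k_cat, so the enzyme
    demand per unit flux is the sum of 1 / k_cat over all reactions.
    """
    demand = N_CONSTRAINED / K_CONSTRAINED + sum(1.0 / k for k in k_free)
    return budget / demand


def doubling_gain(background_kcat):
    """Relative growth gain from doubling one k_cat in a uniform background."""
    wild_type = [background_kcat] * N_UNCONSTRAINED
    mutant = [2.0 * background_kcat] + wild_type[1:]
    return toy_growth_rate(mutant) / toy_growth_rate(wild_type) - 1.0


if __name__ == "__main__":
    for k in (1e-3, 1e0, 1e3, 1e5):
        print(f"background k_cat = {k:g} s^-1, gain = {doubling_gain(k):.3e}")
```

In this toy setting the same doubling mutation yields an ever smaller relative gain as the background k_cats rise, because the fixed demand of the constrained reactions comes to dominate the budget.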

Although in vitro data and biochemical reaction mechanisms defined our set of biophysically constrained reactions, the true identity of this set is unknown. We thus conducted a sensitivity analysis for the identity and size of this set. The identity of the evolving set affects the final growth rate, but not the qualitative dynamics of adaptation or the occurrence of DRE (Supplementary Fig. 7). The speed of adaptation decreases with the size of the evolving set, as more reactions are required to acquire mutations to reach higher growth rates. An additional source of uncertainty comes from the nature of the distribution of mutational effects, which is unknown. We varied the mean of the distribution of mutational effects, but again found no effect on the qualitative dynamics of adaptation or the occurrence of DRE, but a small quantitative effect on the final growth rate (Supplementary Fig. 9).

Multifunctional enzymes cause evolutionary jump dynamics

In order to understand the irregular increase in growth rate observed in adaptive trajectories (Fig. 2a), we summarized genes for which mutation coincided with unusually high fitness gains. We found a small set of genes that was repeatedly associated with large jumps in fitness (Supplementary Table 1). When removing reactions catalyzed by the products of these genes, fitness jumps are drastically reduced and the speed of adaptation increases (p < 2e−3, Wilcoxon rank-sum test on the number of mutations required to reach half the end point growth rate), showing that they are indeed responsible for the irregular adaptation dynamics. Investigation of the metabolic network model and the gene-protein-reaction context of these genes revealed that all of them encode multifunctional enzymes that catalyze multiple reactions in the same linear pathway. These genes are involved in histidine biosynthesis (hisB), purine biosynthesis (purH), cell wall biosynthesis (glmU), and fatty acid biosynthesis (fabG). The irregular behavior in adaptive trajectories thus has a mechanistic reason that lies in the structure of the underlying network: the protein cost of the linear pathway cannot be mitigated by increasing the k_cat of a single active site, resulting in a fitness landscape that shows synergistic epistasis (Fig. 3). The pathway can then become a bottleneck for the adaptation process, where fixation of a specific neutral mutation in a multifunctional enzyme is required for further fitness gains (Fig. 3b).
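The synergistic epistasis described above can be reproduced in a two-site toy model. The sketch assumes that a bifunctional enzyme at concentration g must satisfy flux ≤ k_cat · g separately for each of its two reactions, so that the slower site alone sets the enzyme demand. This is an illustrative assumption about how the constraint acts, and the function name and parameters are hypothetical.

```python
def pathway_growth_rate(k_site_a, k_site_b, k_other=1.0, budget=1.0):
    """Toy growth rate for a linear pathway with one bifunctional enzyme.

    The bifunctional enzyme catalyzes two consecutive steps; its demand per
    unit flux is 1 / min(k_site_a, k_site_b). A third, monofunctional step
    with turnover number k_other shares the same enzyme budget.
    """
    demand = 1.0 / min(k_site_a, k_site_b) + 1.0 / k_other
    return budget / demand


if __name__ == "__main__":
    base = pathway_growth_rate(1.0, 1.0)
    for label, (ka, kb) in {
        "site A doubled": (2.0, 1.0),
        "site B doubled": (1.0, 2.0),
        "both doubled": (2.0, 2.0),
    }.items():
        gain = pathway_growth_rate(ka, kb) / base - 1.0
        print(f"{label}: relative gain = {gain:.3f}")
```

Under these assumptions either single mutation is neutral when both sites start at the same k_cat, while the double mutant gains fitness, so one neutral mutation has to fix before the second can be selected.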

Most reactions show repeatable evolution

The high level of convergence that is exhibited in the adapted growth rates (Fig. 2a) is reflected in the turnover numbers of the evolved populations: vectors of adapted k_cats show a high correlation across replicates (all Pearson’s R ≥ 0.9, Supplementary Fig. 1). Clustering of the most divergent reactions reveals that the remaining differences in evolved k_cats cannot be exclusively attributed to the stochasticity of the adaptation process: redundant metabolic routes in central carbon metabolism and redox metabolism cause k_cat evolution to be divergent (Supplementary Fig. 1B). Nevertheless, k_cat evolution is highly convergent and repeatable, indicating that similar patterns in turnover numbers across species could be the result of independent evolutionary trajectories.

The evolved k_cats agree with in vivo and in vitro data

How well do our simulated end points of k_cat evolution agree with experimental data on modern k_cats? In order to answer this question, we simulate k_cat evolution in randomly changing model environments to model a more realistic environmental diversity. We randomly chose a set of environmental carbon, nitrogen, and sulfur components, as well as random availability of oxygen (see Methods) and compared prediction performance of this diverse environment simulation with the simulations under constant aerobic glucose conditions.

In vitro measurements of k_cat were previously mined from the BRENDA database and filtered for natural substrates¹⁶. We compared the simulated end points for both constant and diverse environments to this dataset while focusing on reactions without data-driven biophysical constraints to avoid circular conclusions. We found that the predictions agree in magnitude (Fig. 4a, Supplementary Fig. 11A) and show a significant correlation with the in vitro data (Pearson’s R = 0.37, p < 6e−4 for diverse environments; R = 0.25, p < 0.02 for aerobic growth on glucose; see Methods; Fig. 4b, Supplementary Fig. 11B). Simulation of evolution in diverse environments thus results in a better agreement with in vitro data. In addition to in vitro measurements, estimates of in vivo maximal turnover rates (k_app,max) recently became available based on the combination of proteomics data and flux predictions across multiple conditions³⁹. The predicted k_cats from both diverse and constant evolutionary environments agree with these in vivo data in magnitude (Fig. 4c, Supplementary Fig. 11C) and show a highly significant correlation (R = 0.67, p < 5e−29 for diverse environments; R = 0.57, p < 2.4e−19 for aerobic growth on glucose; see Methods). As in the case of in vitro measurements, a model of diverse environments explains in vivo data better than constant environments.
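The comparison reported above amounts to a Pearson correlation between predicted and measured turnover numbers. The following sketch shows one way to compute it with the standard library only. It log-transforms both vectors first, which is an assumption made here because k_cats span several orders of magnitude; the transformation actually used is specified in the Methods. The function names and the example values are hypothetical and are not data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def log10_pearson(predicted, measured):
    """Pearson correlation of log10-transformed turnover numbers."""
    return pearson_r([math.log10(p) for p in predicted],
                     [math.log10(m) for m in measured])


if __name__ == "__main__":
    # Hypothetical k_cat values in s^-1, for illustration only.
    predicted = [0.5, 12.0, 80.0, 300.0, 4.0]
    measured = [1.0, 20.0, 35.0, 500.0, 2.0]
    r = log10_pearson(predicted, measured)
    print(f"R = {r:.2f}, R^2 = {r * r:.2f}")
```

Squaring R gives the share of variance explained; for example, R = 0.67 corresponds to R² ≈ 0.45.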

What factors affect the speed of evolution of a reaction’s k_cat until system-wide DRE prevents further adaptation? We find that the k_cats in the end points of evolution in diverse environments are correlated with enzyme molecular weight (R = 0.28, p < 4.4e−6; see Methods) and with the mean of fluxes of parsimonious FBA⁴⁰ across diverse growth environments (R = 0.62, p < 2.2e−16; see Methods), indicating that these two factors are the major determinants of selection pressure on a given reaction. This finding explains why the enzymes that catalyze high flux reactions in central metabolism are associated with high in vitro k_cats¹⁶. When we repeat our evolutionary simulations in models with random perturbations of reaction stoichiometries and biomass components, agreement with experimental data is abolished (Supplementary Fig. 15). This result confirms the important role of reaction flux as a selection pressure in k_cat evolution.

Finally, the convergent behavior we found for evolution in a static environment (Supplementary Fig. 1) is also present in the end points of evolution in diverse environments (all Pearson’s R > 0.87 across three replicates).

Discussion

The turnover numbers of enzymes in central energy metabolism are significantly higher than those of pathways in amino acid, fatty acid, nucleotide, and secondary metabolism¹⁶, even though phylogenetic evidence suggests that the core of the metabolic network is conserved across the tree of life^30,31 and extensive enzyme optimization should thus have had sufficient time to occur. In order to understand the mechanistic reason for this observation, we developed an in silico model that predicts the dynamics and long-term end point of k_cat evolution, and validated these predictions with experimental data.

It has been suggested that the suboptimal turnover number of many enzymes is the result of an increasing difficulty to achieve k_cat improvements that occurs in all metabolic genes¹⁶. We show that even without such intragenic constraints, a small number of biophysically constrained reactions is sufficient to cause diminishing returns epistasis in otherwise unconstrained reactions (Fig. 2, Supplementary Fig. 7, Supplementary Note 1). As the fitness gain of improvements in k_cats (i.e., their selection coefficient s) decreases, it approaches the neutral boundary^10,11,41 that lies around 1/N_e, and mutations that yield large improvements in k_cat are rendered effectively neutral. Metabolic control theory⁴² has been used in the past to postulate the occurrence of diminishing fitness returns when the activity of a single enzyme changes, e.g., explaining the genetic dominance of metabolic genes⁴³ and the frequency of neutral mutations⁴¹. In our framework, that situation is comparable to assigning a single reaction to the unconstrained set.

Diminishing returns are often implicitly assumed in quantitative models of adaptation, e.g., in the form of Gaussian fitness landscapes¹³, and our results on k_cat evolution give a mechanistic example of how diminishing returns can arise, even when the population is still distant from a global optimum. In terms of experimental data, intergenic diminishing returns epistasis has been found to play a crucial role in a long-term evolutionary experiment⁴⁴ and in adaptation to heterologous pathway optimization⁴⁵. In the latter example, the expression cost of a heterologous pathway was reduced by reducing over-expression, a process conceptually similar to the reduction of protein costs through the increase in kinetic parameters. Whereas the adjustment of expression levels is a mechanism commonly found in experimental evolution, kinetic parameter evolution is a smaller mutational target and thus more difficult to study in such a framework.

Structural genomics studies have found convergent evolution of function to be a common pattern in enzyme evolution⁴⁶. Our model shows that kinetic parameter evolution is likely to similarly exhibit convergent behavior. The evolutionary end points show a high correlation of k_cats across replicates—even though some reactions diverge—(Supplementary Fig. 1), and final growth rates are very similar (Fig. 2). This suggests a smooth single-peaked phenotypic fitness landscape, where the low level of divergence indicates a plateau of comparable fitness that is reached in a repeatable and convergent manner. Pairwise averaging of end point k_cats shows that these intermediate points are also intermediate in fitness (Supplementary Fig. 12), thus confirming the lack of fitness valleys between end points. Remarkably, this high level of convergence is even found when environments differ during the adaptation process (all R > 0.87 between end points, also see Fig. 4). As our analysis of end point k_cats indicates that selection pressure is mostly determined by flux and—to a lesser extent—enzyme molecular weight, convergence might be caused by correlated flux distributions across environments. We calculate the correlation of flux across 10,000 environments chosen by our sampling algorithm (see Methods) and find a median Pearson correlation of 0.7 between flux distributions on log scale, indicating that this similarity in flux underlies the observed high level of convergence.

Even though diminishing returns epistasis arises for the growth rate effect of mutations, epistatic effects of mutations in the same gene are not modeled explicitly. Thus, even though structural models argue against this⁴⁷, intragenic sign epistasis—where the sign of a mutation’s effect depends on the genetic background—could cause a more rugged landscape.

Although the model suggests a remarkably smooth fitness landscape, multifunctional enzymes cause “neutral plateaus” that slow adaptation by requiring a neutral mutation to occur before k_cat improvements can yield fitness gains (Fig. 3): when removing reactions catalyzed by the products of these genes, fitness jumps are drastically reduced, and the speed of adaptation increases (Supplementary Fig. 6). Most of these cases are caused by multifunctional enzymes that possess two distinct active sites and that have likely resulted from gene fusion events, e.g., purH⁴⁸ and hisB⁴⁹. It is thus likely that these gene fusion events occurred after the individual gene products had been selected for higher k_cats. Gene fusions are highly polyphyletic^50,51,52, a finding that supports this idea.

Further genes associated with jump behavior catalyze multiple reactions using the same binding site—e.g., fabG (Supplementary Table 1). Kacser and Beeby³³ discussed the effect of such multifunctional enzymes for a scenario of highly un-specific proto-enzymes, where gene duplication becomes necessary to render increased specificity adaptive. Nevertheless, the mechanism Kacser and Beeby³³ proposed requires assumptions about how mutations affect each catalytic activity, where experimental data indicate that such effects have to be studied on a case-by-case basis³². For the case of multifunctional enzymes that result from gene fusion events, independent mutation effects on both active sites seem a reasonable assumption.

A variety of sources of uncertainty make it difficult to predict experimental k_cat data with the ab initio approach we present. Condition-dependent metabolite levels and enzyme affinities (i.e., the K_m values) will affect enzyme saturation where our model assumes full saturation. Undersaturation is thus expected to influence k_cat evolution by increasing the selection pressure on k_cat. A similar effect is expected for the backward flux in thermodynamically unfavorable reactions; e.g., the simulations predict a k_cat for the thermodynamically unfavorable malate dehydrogenase reaction of 805 s⁻¹ that underestimates in vitro data (931 s⁻¹ ⁵³), whereas in vivo data suggest a much lower effective turnover rate of 7 s⁻¹ ³⁹, probably caused by substantial backward flux³⁹. Whereas computational feasibility will be a challenge, modeling the interaction between k_cat, K_m, metabolite concentrations, and allosteric regulation is a promising topic for future studies that could also shed light on the co-evolution of isozymes that often vary in K_m⁵⁴. As gene duplication is frequently observed in short-term adaptation⁵⁵, we assume that most k_cats evolved before isozymes emerged and model k_cat mutation at the reaction level. Furthermore, our model has to make an assumption about the identity of biophysically constrained reactions. Whereas EC numbers serve as a first approximation for estimating this set, there is still a high level of uncertainty in its true identity. It is in fact possible that a growth-limiting process outside of metabolism causes diminishing returns epistasis, e.g., the expression machinery of the cell. Encouragingly, sensitivity analyses indicate that the qualitative adaptation dynamics and agreement of simulated k_cats with experimental data are robust against the identity of the constrained set (Supplementary Figs. 7 and 8, Supplementary Table 2). 
As studies shed more light on the nature of intragenic fitness landscapes⁵⁶, it will be valuable to model the relative contribution of intergenic and intragenic diminishing returns in more detail. The effect of K_m and allosteric effects mentioned above might affect the shape of the inferred fitness landscape; e.g., k_cat and K_m frequently show trade-offs⁵⁷, a factor that might result in local optima on the fitness landscape. Other sources of uncertainty lie in the choice of selective environments and the shape and parameters of the distribution of mutation effects. Again, sensitivity analyses show that our results are robust against these factors (Supplementary Figs. 9 and 11). As decreases in k_cat are expected to be either fitness-neutral or deleterious, they are associated with very low fixation probabilities. Thus, even though we assume mutations that decrease k_cats to occur a hundred times more frequently than those that increase k_cat, only 1.8% of fixed mutations decrease k_cats in our evolutionary simulations of varying environments. When ancestral k_cat vectors are sampled randomly from the empirical distribution of k_cats, the correlation of end points with experimental data decreases (k_cat in vitro: R = 0.29, p < 0.007; k_app,max: R = 0.5, p < 2e−14; Supplementary Fig. 13, see Methods) as well as the degree of convergence between end points (mean R²= 0.26, Supplementary Fig. 14, see Methods). This effect is due to the slow accumulation of deleterious mutations that is negligible on the timescale tractable for our simulations—reactions that have a high initial k_cat assigned are very unlikely to have substantially decreased it in the end point, even if the reaction is not used in the simulated conditions (Supplementary Fig. 14).

Finally, the strong-selection-weak-mutation regime (SSWM) we use to model adaptation dynamics does not account for the effects of clonal interference, like a decreased rate of adaptation and higher fitness gains of fixed mutations⁵⁸. As the occurrence of diminishing returns is independent of the mutation dynamics, we do not expect clonal interference to have a large effect on end point k_cat distributions, although it could prove to be important in future studies quantifying the timescale of k_cat fixation.

To validate the assumptions of our modeling approach we compared model predictions to in vitro and in vivo datasets. Despite the sources of uncertainty listed above and the high level of noise in the experimental data (see Bar-Even et al.¹⁶ for discussion) we found a significant agreement with in vitro data and in vivo estimates, where the model explained about 45% of the observed variance in in vivo k_cats. In vitro k_cats were shown to correlate with enzyme molecular weight and reaction flux (R = 0.22 and R = 0.45, respectively⁴). Similarly, predicted k_cats in our model for diverse environments are correlated with enzyme molecular weight (R = 0.28, p < 4.4e−6) and with the mean of fluxes of parsimonious FBA⁴⁰ across diverse growth environments (R = 0.62, p < 2.2e−16). This result indicates that enzyme usage and size determine the selection pressure on individual reactions and thus the magnitude of final k_cats, a hypothesis that we confirmed by sensitivity analysis: randomly perturbing network stoichiometry, biomass components, and enzyme molecular weights abolishes the correlation with experimental data (Supplementary Fig. 15). Surprisingly, we found agreement not only by correlation, but also by magnitude (Fig. 4a). This finding is consistent with the realistic growth rates to which the adaptation process converges (Fig. 2). The in vivo data used are based on quantitative proteomics data and flux estimates that assume growth maximization³⁹. The better agreement of our simulations with in vivo data might be due to the latter being less noisy than in vitro estimates, but in vivo data could also be biased to prefer our model-based predictions, as model-derived fluxes were used in combination with proteomics data to derive k_app,max³⁹. Nevertheless, using the limited flux data available from metabolic flux analysis (MFA) instead of model-derived flux, a high correlation with model-derived k_app,max was found (R² = 0.85)³⁹. Sensitivity analyses (Supplementary Figs. 7 and 9) and our minimal model (Supplementary Note 1) show that the magnitude of evolved k_cats can depend on the size of the evolving set, the distribution of mutational effects, and the magnitude of biophysical constraints (Supplementary Fig. 10). We thus provide a consistent set of these parameters, but additional data are required to confirm this parameter set in the future.

In summary, the presented models suggest the following mechanism for k_cat evolution: initially, ancestral inefficient enzymes are under strong selection to increase their k_cat in order to reduce the protein costs of metabolism. This selection pressure increases with the average flux through the respective reaction and—to a lesser extent—with the molecular weight of the catalyzing enzyme. As soon as some growth-relevant reactions do not have mutations available that could increase their k_cat—i.e., the reaction becomes biochemically constrained—diminishing returns epistasis affects all other enzymes in the network, and the extent of these diminishing returns is more pronounced in large networks (Supplementary Note 1). Reactions that carry high flux, e.g., those in primary carbon metabolism, still yield substantial fitness benefits and evolve faster than low-flux reactions. Nevertheless, the extent of diminishing returns increases with each mutation that improves a reaction’s k_cat until selection coefficients become too small to distinguish beneficial from neutral mutations and adaptation comes to a halt. The evolutionary end points exhibit fitness levels that are far lower than theoretically possible states, a property associated with large metabolic networks (Supplementary Note 1).

The prediction of evolutionary outcomes is an ultimate goal in evolutionary biology⁹. The model we present predicts data on k_cat in terms of correlation and magnitude, showing that evolutionary long-term end points of k_cat evolution can be predicted using evolutionary systems models with considerable accuracy, especially given the sources of model uncertainty listed above. The model predicts that diminishing returns epistasis keeps k_cats—and thus fitness—far from the global optimum, indicating the potential of engineering strategies for more efficient enzymes. Whereas we chose E. coli as a model organism to study k_cat evolution, the patterns we find are likely to generalize across the tree of life, where organisms with smaller effective population size than E. coli can be expected to show an even stronger mark of insufficient selection in their catalytic properties.

Optimality assumptions are a promising tool for understanding complex biological systems, but finite population sizes and epistatic interactions can render individual molecules far from theoretical optima—even when the underlying fitness landscape is smooth. Seeing cells through the systems perspective and modeling evolutionary history can be crucial for understanding cell behavior, as is the case for kinetic turnover numbers.

Methods

Growth rate predictions using MOMENT

In the simulation of kinetic parameter evolution, the growth rate that results from a given vector of catalytic turnover rates κ is predicted using the MOMENT algorithm⁴. MOMENT is conceptually similar to flux balance analysis (FBA⁵⁹), in that it maximizes the growth rate µ by maximizing flux into a biomass reaction (v_z) given a set of constraints (v_min and v_max):

$$\max (v_z) \quad \text{s.t.}$$

$$\mathbf{S}\mathbf{v} = 0$$

$$v_{\min,i} \le v_i \le v_{\max,i}.$$

Here, S represents the stoichiometric matrix and v the vector of fluxes. MOMENT extends FBA by introducing enzyme concentrations as model variables (g_i, mmol g_DW⁻¹) and recursively parsing gene-protein-reaction (GPR) rules to obtain upper limit constraints on metabolic fluxes:

$$v_i \le f(\kappa _i,G_i),$$

where G_i represents the set of genes involved in catalyzing reaction i. The respective GPR is parsed by using the maximum of enzyme concentrations to represent AND relations and the sum to model OR relations. Finally, the total weight of the metabolic proteome (C, g_protein g_DW⁻¹) and the respective enzyme molecular weights (MW) are used to constrain enzyme concentrations:

$$\sum_i g_i \, \mathrm{MW}_i \le C.$$

MOMENT was used to simulate growth in iJO1366, a genome-scale model of E. coli K-12 MG1655 metabolism²⁵. Enzyme molecular weights were calculated based on the E. coli K-12 MG1655 protein sequences (NCBI Reference Sequence NC_000913.3), and C was set to 0.32 g_protein g_DW⁻¹ in accordance with the E. coli metabolic protein fraction across diverse growth conditions⁴,⁶⁰. Linear programming problems were constructed using the R⁶¹ packages sybil⁶² and sybilccFBA and solved using IBM CPLEX version 12.7. The growth rate μ (compare Fig. 1) can then be obtained as the flux into the biomass reaction v_z.
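
To give a feel for how the proteome constraint couples k_cats to growth, the following minimal sketch solves a toy problem in closed form. It is not the MOMENT implementation in sybilccFBA: it assumes a linear chain in which every reaction carries the same flux and is catalyzed by one enzyme with v ≤ κ_i g_i, so no linear programming solver is needed. The function name `toy_growth_rate` and the example numbers are hypothetical.

```python
SECONDS_PER_HOUR = 3600.0


def toy_growth_rate(kcats_per_s, mol_weights, proteome_budget):
    """Maximal flux through a linear chain under an enzyme-mass budget.

    kcats_per_s     -- turnover numbers kappa_i in s^-1
    mol_weights     -- enzyme molecular weights MW_i in g mmol^-1
    proteome_budget -- C in g_protein g_DW^-1
    Returns the flux in mmol g_DW^-1 h^-1 (a stand-in for growth rate).
    """
    if len(kcats_per_s) != len(mol_weights):
        raise ValueError("need one molecular weight per k_cat")
    # Enzyme mass needed per unit flux: sum_i MW_i / kappa_i (kappa in h^-1).
    cost_per_flux = sum(
        mw / (k * SECONDS_PER_HOUR) for k, mw in zip(kcats_per_s, mol_weights)
    )
    return proteome_budget / cost_per_flux


if __name__ == "__main__":
    base = toy_growth_rate([10.0, 10.0], [50.0, 50.0], 0.32)
    improved = toy_growth_rate([20.0, 10.0], [50.0, 50.0], 0.32)
    print(base, improved)
```

In this toy setting, doubling one of two equally costly k_cats raises the flux by a third rather than doubling it, because the unimproved enzyme still consumes its share of the budget.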

We classify a reaction as contributing to in silico growth using flux variability analysis³⁸. When either the maximal flux or the absolute minimal flux through a reaction that still optimizes the growth rate μ in FBA is >10⁻⁶ mmol g_DW⁻¹ h⁻¹, we call a reaction “contributing to growth in silico”.
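
The classification rule above can be sketched as a small predicate. It assumes that the flux variability bounds at optimal growth have already been computed elsewhere. Reading "absolute minimal flux" as the absolute value of the lower bound is an interpretation, and the function name is hypothetical.

```python
GROWTH_FLUX_TOL = 1e-6  # mmol g_DW^-1 h^-1, threshold stated in the text


def contributes_to_growth(fva_min, fva_max, tol=GROWTH_FLUX_TOL):
    """Return True if either FVA bound at optimal growth exceeds the tolerance.

    fva_min, fva_max -- minimal and maximal flux through the reaction that
                        still allow the optimal growth rate.
    """
    return fva_max > tol or abs(fva_min) > tol
```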

An MCMC algorithm for simulating k_cat evolution

We assume a genetically homogeneous population of cells with a population size equal to the effective population size estimated for E. coli (N_e = 2.5e7)³⁵. A single iteration of the Markov Chain Monte Carlo (MCMC) algorithm starts as follows: a mutation affecting the k_cat of a single randomly chosen reaction i is simulated by multiplying the original k_cat (= κ_i) by a factor α that is drawn from a lognormal distribution with mean and standard deviation on the log scale of log(3/2) and 0.3, respectively. This distribution determines the jump size in the space of k_cats, but not the ratio of deleterious to advantageous mutations (see below).

$$\kappa _{i,{\rm mut}} = \alpha \kappa _i.$$
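
The mutation step can be sketched with Python's standard library as follows. This is an illustrative sketch, not the authors' R code; the function name `mutate_kcat` is hypothetical, and the handling of reversible reactions described in the next paragraph is omitted.

```python
import math
import random

LOG_MEAN = math.log(3 / 2)  # mean of log(alpha), as stated in the text
LOG_SD = 0.3                # standard deviation of log(alpha)


def mutate_kcat(kcats, rng):
    """Multiply the k_cat of one randomly chosen reaction by a lognormal factor.

    Returns (index, alpha, mutated_kcats); the input list is left unchanged.
    """
    i = rng.randrange(len(kcats))
    alpha = rng.lognormvariate(LOG_MEAN, LOG_SD)
    mutated = list(kcats)
    mutated[i] = alpha * kcats[i]
    return i, alpha, mutated


if __name__ == "__main__":
    rng = random.Random(42)
    print(mutate_kcat([1e-3, 1e-3, 13.7], rng))
```

Because the mean of log(α) is positive, most proposals in this sketch increase the chosen k_cat; the text adjusts the effective ratio of deleterious to advantageous mutations separately.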

As formulated by the Haldane relationship⁶³, the k_cats of the forward and backward directions and the respective K_ms cannot change independently of each other. To account for the Haldane relationship, we implement mutations that affect the forward and backward k_cat of reversible reactions equally. The growth rates of the original strain (µ) and of the strain carrying the mutation affecting κ_i (µ_mut) are then calculated by solving the MOMENT problem detailed above (also see Fig. 1). Assuming that fitness is proportional to growth rate, we can obtain the selection coefficient s and the fixation probability π³⁶:

$$s = 1 - \frac{\mu}{\mu_{\mathrm{mut}}},$$

$$\pi = \begin{cases} \dfrac{1}{N}, & \text{if } s = 0 \\[1ex] \dfrac{1 - e^{-2s}}{1 - e^{-2Ns}}, & \text{otherwise.} \end{cases}$$
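
A minimal sketch of these two formulas in Python is shown below. The use of `math.expm1` and the overflow guard for strongly deleterious mutations are implementation choices made here for numerical robustness, not details taken from the study; the function names are hypothetical, and `mu_mut` is assumed to be positive.

```python
import math


def selection_coefficient(mu, mu_mut):
    """s = 1 - mu / mu_mut, with mu the resident and mu_mut the mutant growth rate."""
    return 1.0 - mu / mu_mut


def fixation_probability(s, n):
    """Fixation probability of a new mutation with selection coefficient s
    in a population of size n, following the formula in the text."""
    if s == 0.0:
        return 1.0 / n
    exponent = -2.0 * n * s
    if exponent > 700.0:
        # exp(exponent) would overflow a double; the probability is
        # indistinguishable from zero for such deleterious mutations.
        return 0.0
    # (1 - e^{-2s}) / (1 - e^{-2Ns}) written with expm1 to avoid
    # cancellation when s is very small.
    return math.expm1(-2.0 * s) / math.expm1(exponent)


if __name__ == "__main__":
    n_e = 2.5e7
    for s in (-1e-3, -1e-9, 0.0, 1e-9, 1e-3, 0.1):
        print(s, fixation_probability(s, n_e))
```

For a population as large as the one assumed here, the probability for beneficial mutations is close to 1 − e^(−2s), and deleterious mutations fix only when |s| is on the order of 1/N or smaller.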

The fixation probability π is then used to decide the fixation of the novel mutation. In case of a successful fixation event, the vector of k_cats, κ, is updated at position i with the newly fixed mutation, or, in case of an unsuccessful fixation event, the previous κ_i remains the most abundant allele. The next iteration of the algorithm starts with introducing a novel change in the k_cat of a random enzyme, and so on. A typical simulation run simulates around 10⁸ mutations that have the chance to become fixed, requiring 10⁸ linear programs to be solved for a single replicate.
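
The iteration described above can be sketched end to end as follows. To stay self-contained, the sketch replaces the MOMENT linear program with a hypothetical closed-form toy fitness function (a linear chain under an enzyme-mass budget, in arbitrary units) and omits environment switching, biophysically constrained reactions, and the heuristic ratio of deleterious to advantageous mutations. All function names are assumptions for illustration.

```python
import math
import random


def toy_growth(kcats, mol_weights, budget):
    """Stand-in for the MOMENT growth rate: flux through a linear chain."""
    return budget / sum(mw / k for k, mw in zip(kcats, mol_weights))


def fixation_probability(s, n):
    """Fixation probability from the formula in the text."""
    if s == 0.0:
        return 1.0 / n
    exponent = -2.0 * n * s
    if exponent > 700.0:  # guard against overflow for deleterious mutations
        return 0.0
    return math.expm1(-2.0 * s) / math.expm1(exponent)


def evolve(kcats, mol_weights, budget, n_pop, n_mutations, rng):
    """Propose one k_cat mutation per iteration and fix it with probability pi."""
    kcats = list(kcats)
    mu = toy_growth(kcats, mol_weights, budget)
    n_fixed = 0
    for _ in range(n_mutations):
        i = rng.randrange(len(kcats))
        alpha = rng.lognormvariate(math.log(3 / 2), 0.3)
        candidate = list(kcats)
        candidate[i] *= alpha
        mu_mut = toy_growth(candidate, mol_weights, budget)
        s = 1.0 - mu / mu_mut
        if rng.random() < fixation_probability(s, n_pop):
            kcats, mu = candidate, mu_mut  # the mutant allele replaces the resident
            n_fixed += 1
    return kcats, mu, n_fixed


if __name__ == "__main__":
    rng = random.Random(1)
    start = [1e-3, 1e-3, 1e-3]
    weights = [50.0, 50.0, 50.0]
    end, mu_end, n_fixed = evolve(start, weights, 0.32, 2.5e7, 500, rng)
    print(mu_end, n_fixed, end)
```

In the study each iteration requires solving a linear program, which is why a run of around 10⁸ proposed mutations is computationally demanding; the closed-form stand-in used here avoids that cost at the price of realism.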

The large population size allowed us to optimize simulation performance by heuristically setting the ratio of deleterious to advantageous mutations: the growth rate for a deleterious mutation was simulated once, but its fixation was sampled multiple times to arrive at a 100:1 ratio between deleterious and advantageous mutations (see Supplementary Table 3 for sensitivity analysis). Certain reaction mechanisms were shown to consistently exhibit low k_cats¹⁶. We use the enzyme commission (EC) number to set the reactions belonging to the three (out of six) top-level codes with the highest median in vitro k_cat, namely oxidoreductases, hydrolases, and isomerases, as biophysically unconstrained. In order to allow an unbiased comparison to experimental data, all reactions for which data were available were also set as unconstrained. The remaining reactions were considered biophysically constrained and were fixed to the median of in vitro k_cat measurements (13.7 s⁻¹). The k_cats of unconstrained reactions were initialized to 10⁻³ s⁻¹. See Supplementary Figures 7 and 8, and Supplementary Table 2 for sensitivity analysis against the identity of the constrained set.
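
The assignment of reactions to the constrained and unconstrained sets can be sketched as below. The mapping of top-level EC digits to class names (1 oxidoreductases, 3 hydrolases, 5 isomerases) is the standard EC scheme; the function name and the boolean flag for available experimental data are hypothetical, and how reactions without an EC number were treated is not specified in the text and is not handled here.

```python
UNCONSTRAINED_EC_CLASSES = {"1", "3", "5"}  # oxidoreductases, hydrolases, isomerases
CONSTRAINED_KCAT = 13.7  # s^-1, median of in vitro k_cat measurements
INITIAL_KCAT = 1e-3      # s^-1, starting value for evolving reactions


def initial_kcat(ec_number, has_experimental_data=False):
    """Return (k_cat in s^-1, is_unconstrained) for one reaction.

    ec_number             -- EC number as a string, e.g. "1.1.1.37"
    has_experimental_data -- True if a measured k_cat exists for the reaction
    """
    top_level = ec_number.strip().split(".")[0]
    if has_experimental_data or top_level in UNCONSTRAINED_EC_CLASSES:
        return INITIAL_KCAT, True
    return CONSTRAINED_KCAT, False


if __name__ == "__main__":
    print(initial_kcat("1.1.1.37"))       # an oxidoreductase
    print(initial_kcat("2.7.1.1"))        # a transferase without data
    print(initial_kcat("2.7.1.1", True))  # a transferase with data
```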

In order to simulate diverse environments, we randomly sampled a new environment every 1000 iterations. Here, oxygen uptake was allowed with probability 1/2, and the environment always contained at least one randomly chosen source each of carbon, nitrogen, sulfur, and phosphate. The number of additional sources was drawn from a binomial distribution of size 2 with success probability 1/2. This process was repeated until a growth-sustaining environment was found, and the following 1000 mutations were simulated in this novel environment.
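
One way to read the sampling procedure is sketched below. Two points are assumptions rather than details from the text: the number of additional sources is drawn separately for each element, and growth feasibility is checked by a caller-supplied function (`is_growth_sustaining`, hypothetical) that would wrap the metabolic model. The loop does not terminate if no sampled environment can sustain growth.

```python
import random


def sample_environment(sources, rng, is_growth_sustaining):
    """Draw random environments until one sustains growth.

    sources -- dict mapping an element name to a list of candidate uptake sources
    rng     -- random.Random instance
    is_growth_sustaining -- callable(env) -> bool, e.g. a growth simulation
    """
    while True:
        env = {"oxygen": rng.random() < 0.5}
        for element, candidates in sources.items():
            # Binomial(2, 1/2) draw as the sum of two fair coin flips.
            n_extra = sum(rng.random() < 0.5 for _ in range(2))
            n_total = min(1 + n_extra, len(candidates))
            env[element] = rng.sample(candidates, n_total)
        if is_growth_sustaining(env):
            return env


if __name__ == "__main__":
    candidate_sources = {
        "carbon": ["glucose", "glycerol", "acetate", "succinate"],
        "nitrogen": ["ammonium", "glutamine", "nitrate"],
        "sulfur": ["sulfate", "cysteine", "thiosulfate"],
        "phosphate": ["phosphate", "glycerol-3-phosphate", "phosphonate"],
    }
    print(sample_environment(candidate_sources, random.Random(7), lambda env: True))
```

The source names in the usage example are placeholders; in the study the candidates would be the exchange reactions available in the genome-scale model.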

Statistics

Pearson’s R was used to test for significant correlation with a two-sided t-test as implemented in the cor.test() function of the R environment⁶¹.

Code availability

R code for the simulations presented in this study is available from the authors upon request.

Data availability

Predicted k_cat end points that are presented in this study are available from the authors upon request.

References

1. Ibarra, R. U., Edwards, J. S. & Palsson, B. O. Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth. Nature 420, 186–189 (2002).
2. O’Brien, E. J., Lerman, J. A., Chang, R. L., Hyduke, D. R. & Palsson, B. O. Genome-scale models of metabolism and gene expression extend and refine growth phenotype prediction. Mol. Syst. Biol. 9, https://doi.org/10.1038/msb.2013.52 (2013).
3. Noor, E. et al. The protein cost of metabolic fluxes: prediction from enzymatic rate laws and cost minimization. PLoS Comput. Biol. 12, https://doi.org/10.1371/journal.pcbi.1005167 (2016).
4. Adadi, R., Volkmer, B., Milo, R., Heinemann, M. & Shlomi, T. Prediction of microbial growth rate versus biomass yield by a metabolic network with kinetic parameters. PLoS Comput. Biol. 8, https://doi.org/10.1371/journal.pcbi.1002575 (2012).
5. Reimers, A. M., Knoop, H., Bockmayr, A. & Steuer, R. Cellular trade-offs and optimal resource allocation during cyanobacterial diurnal growth. Proc. Natl Acad. Sci. USA 114, E6457–E6465 (2017).
6. Schuetz, R., Zamboni, N., Zampieri, M., Heinemann, M. & Sauer, U. Multidimensional optimality of microbial metabolism. Science 336, 601–604 (2012).
7. Chen, X. W., Alonso, A. P., Allen, D. K., Reed, J. L. & Shachar-Hill, Y. Synergy between C-13-metabolic flux analysis and flux balance analysis for understanding metabolic adaption to anaerobiosis in E. coli. Metab. Eng. 13, 38–48 (2011).
8. Poelwijk, F. J., Tănase-Nicola, S., Kiviet, D. J. & Tans, S. J. Reciprocal sign epistasis is a necessary condition for multi-peaked fitness landscapes. J. Theor. Biol. 272, 141–144 (2011).
9. de Visser, J. A. G. M. & Krug, J. Empirical fitness landscapes and the predictability of evolution. Nat. Rev. Genet. 15, 480–490 (2014).
10. Kimura, M. The neutral theory of molecular evolution. (Cambridge University Press, Cambridge, 1983).
11. Li, W. H. Maintenance of genetic-variability under joint effect of mutation, selection and random drift. Genetics 90, 349–382 (1978).
12. Wagner, A. Neutralism and selectionism: a network-based reconciliation. Nat. Rev. Genet. 9, 965–974 (2008).
13. Martin, G., Elena, S. F. & Lenormand, T. Distributions of epistasis in microbes fit predictions from a fitness landscape model. Nat. Genet. 39, 555–560 (2007).
14. Segre, D., DeLuna, A., Church, G. M. & Kishony, R. Modular epistasis in yeast metabolism. Nat. Genet. 37, 77–83 (2005).
15. Heckmann, D. Modelling metabolic evolution on phenotypic fitness landscapes: a case study on C₄ photosynthesis. Biochem. Soc. Trans. 43, 1172–1176 (2015).
16. Bar-Even, A. et al. The moderately efficient enzyme: evolutionary and physicochemical trends shaping enzyme parameters. Biochemistry 50, 4402–4410 (2011).
17. Pettersson, G. Effect of evolution on the kinetic-properties of enzymes. Eur. J. Biochem. 184, 561–566 (1989).
18. Khodayari, A. & Maranas, C. D. A genome-scale Escherichia coli kinetic metabolic model k-ecoli457 satisfying flux data for multiple mutant strains. Nat. Commun. 7, (2016).
19. Ebrahim, A. et al. Multi-omic data integration enables discovery of hidden biological regularities. Nat. Commun. 7, (2016).
20. Radzicka, A. & Wolfenden, R. A proficient enzyme. Science 267, 90–93 (1995).
21. Goelzer, A. et al. Quantitative prediction of genome-wide resource allocation in bacteria. Metab. Eng. 32, 232–243 (2015).
22. Mallmann, J. et al. The role of photorespiration during the evolution of C₄ photosynthesis in the genus Flaveria. eLife. https://doi.org/10.7554/eLife.02478 (2014).
23. Sánchez, B. J. et al. Improving the phenotype predictions of a yeast genome‐scale metabolic model by incorporating enzymatic constraints. Mol. Syst. Biol. 13, (2017).
24. Schomburg, I., Chang, A. & Schomburg, D. BRENDA, enzyme data and metabolic information. Nucleic Acids Res. 30, 47–49 (2002).
25. Orth, J. D. et al. A comprehensive genome-scale reconstruction of Escherichia coli metabolism-2011. Mol. Syst. Biol. 7, https://doi.org/10.1038/msb.2011.65 (2011).
26. Pal, C. et al. Chance and necessity in the evolution of minimal metabolic networks. Nature 440, 667–670 (2006).
27. Goldford, J. E., Hartman, H., Smith, T. F. & Segre, D. Remnants of an ancient metabolism without phosphate. Cell 168, 1126–1134 (2017).
28. Heckmann, D. et al. Predicting C₄ photosynthesis evolution: modular, individually adaptive steps on a Mount Fuji Fitness Landscape. Cell 153, 1579–1588 (2013).
29. Karr, J. R. et al. A whole-cell computational model predicts phenotype from genotype. Cell 150, 389–401 (2012).
30. Peregrin-Alvarez, J. M., Tsoka, S. & Ouzounis, C. A. The phylogenetic extent of metabolic enzymes and pathways. Genome Res. 13, 422–427 (2003).
31. Ouzounis, C. A., Kunin, V., Darzentas, N. & Goldovsky, L. A minimal estimate for the gene content of the last universal common ancestor - exobiology from a terrestrial perspective. Res. Microbiol. 157, 57–68 (2006).
32. Khersonsky, O. & Tawfik, D. S. Enzyme promiscuity: a mechanistic and evolutionary perspective. Annu. Rev. Biochem. 79, 471–505 (2010).
33. Kacser, H. & Beeby, R. Evolution of catalytic proteins or on the origin of enzyme species by means of natural-selection. J. Mol. Evol. 20, 38–51 (1984).
34. Conant, G. C. & Wolfe, K. H. Turning a hobby into a job: How duplicated genes find new functions. Nat. Rev. Genet. 9, 938–950 (2008).
35. Charlesworth, J. & Eyre-Walker, A. The rate of adaptive evolution in enteric bacteria. Mol. Biol. Evol. 23, 1348–1356 (2006).
36. Kimura, M. Diffusion models in population genetics. J. Appl. Probab. 1, 177–232 (1964).
37. Gillespie, J. H. Some properties of finite populations experiencing strong selection and weak mutation. Am. Nat. 121, 691–708 (1983).
38. Mahadevan, R. & Schilling, C. H. The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab. Eng. 5, 264–276 (2003).
39. Davidi, D. et al. Global characterization of in vivo enzyme catalytic rates and their correspondence to in vitro k_cat measurements. Proc. Natl Acad. Sci. USA 113, 3401–3406 (2016).
40. Holzhutter, H. G. The principle of flux minimization and its application to estimate stationary fluxes in metabolic networks. Eur. J. Biochem. 271, 2905–2922 (2004).
41. Hartl, D. L., Dykhuizen, D. E. & Dean, A. M. Limits of adaptation - the evolution of selective neutrality. Genetics 111, 655–674 (1985).
42. Kacser, H. & Burns, J. A. The control of flux. Symp. Soc. Exp. Biol. 27, 65–104 (1973).
43. Kacser, H. & Burns, J. A. The molecular basis of dominance. Genetics 97, 639–666 (1981).
44. Khan, A. I., Dinh, D. M., Schneider, D., Lenski, R. E. & Cooper, T. F. Negative epistasis between beneficial mutations in an evolving bacterial population. Science 332, 1193–1196 (2011).
45. Chou, H.-H., Chiu, H.-C., Delaney, N. F., Segrè, D. & Marx, C. J. Diminishing returns epistasis among beneficial mutations decelerates adaptation. Science 332, 1190–1192 (2011).
46. Galperin, M. Y. & Koonin, E. V. Divergence and convergence in enzyme evolution. J. Biol. Chem. 287, 21–28 (2012).
47. Lobkovsky, A. E., Wolf, Y. I. & Koonin, E. V. Predictability of evolutionary trajectories in fitness landscapes. PLoS Comput. Biol. 7, e1002302 (2011).
48. Zhang, Y., Morar, M. & Ealick, S. E. Structural biology of the purine biosynthetic pathway. Cell Mol. Life Sci. 65, 3699–3724 (2008).
49. Alifano, P. et al. Histidine biosynthetic pathway and genes: structure, regulation, and evolution. Microbiol. Rev. 60, 44–69 (1996).
50. Henry, C. S. et al. Systematic identification and analysis of frequent gene fusion events in metabolic pathways. BMC Genom. 17, 473 (2016).
51. Grieshaber, M. & Bauerle, R. Structure and evolution of a bifunctional enzyme of tryptophan operon. Nat. New. Biol. 236, 232–235 (1972).
52. Yourno, J., Kohno, T. & Roth, J. R. Enzyme evolution - generation of a bifunctional enzyme by fusion of adjacent genes. Nature 228, 820–824 (1970).
53. Nicholls, D. J. et al. The importance of arginine 102 for the substrate-specificity of Escherichia coli malate dehydrogenase. Biochem. Biophys. Res. Commun. 189, 1057–1062 (1992).
54. Markert, C. L., Shaklee, J. B. & Whitt, G. S. Evolution of a gene. Science 189, 102–114 (1975).
55. Romero, D. & Palacios, R. Gene amplification and genomic plasticity in prokaryotes. Annu. Rev. Genet. 31, 91–111 (1997).
56. Tokuriki, N. et al. Diminishing returns and tradeoffs constrain the laboratory optimization of an enzyme. Nat. Commun. 3, https://doi.org/10.1038/ncomms2246 (2012).
57. Savir, Y., Noor, E., Milo, R. & Tlusty, T. Cross-species analysis traces adaptation of Rubisco toward optimality in a low-dimensional landscape. Proc. Natl Acad. Sci. USA 107, 3475–3480 (2010).
58. de Visser, J. A. G. M. & Rozen, D. E. Clonal interference and the periodic selection of new beneficial mutations in Escherichia coli. Genetics 172, 2093–2100 (2006).
59. Orth, J. D., Thiele, I. & Palsson, B. O. What is flux balance analysis? Nat. Biotechnol. 28, 245–248 (2010).
60. Arike, L. et al. Comparison and applications of label-free absolute proteome quantification methods on Escherichia coli. J. Proteom. 75, 5437–5448 (2012).
61. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2017).
62. Gelius-Dietrich, G., Desouki, A. A., Fritzemeier, C. J. & Lercher, M. J. sybil – Efficient constraint-based modelling in R. BMC Syst. Biol. 7, 125 (2013).
63. Haldane, J. B. S. Enzymes. (Longmans, London, 1930).

Acknowledgements

The authors would like to thank Abdelmoneim Amer Desouki for his support in using the sybilccFBA package, and Ron Milo and Laurence Yang for helpful discussion. This research used resources of the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility supported by the Office of Science of the U.S. Department of Energy grant number DE-SC0008701. This work was supported by the Novo Nordisk Foundation grant number NNF10CC1016517.

Author information

Authors and Affiliations

Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093-0412, USA
David Heckmann, Daniel C. Zielinski & Bernhard O. Palsson
The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
Bernhard O. Palsson

Contributions

D.H., D.C.Z., and B.O.P. designed the study. D.H. conducted all modeling, simulation, and data analysis. D.H., D.C.Z., and B.O.P. wrote the paper.

Corresponding author

Correspondence to Bernhard O. Palsson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Heckmann, D., Zielinski, D.C. & Palsson, B.O. Modeling genome-wide enzyme evolution predicts strong epistasis underlying catalytic turnover rates. Nat Commun 9, 5270 (2018). https://doi.org/10.1038/s41467-018-07649-1

Received: 28 November 2017
Accepted: 13 November 2018
Published: 10 December 2018
DOI: https://doi.org/10.1038/s41467-018-07649-1
Springer Nature Limited
