Background

Ribonucleotide reductase (RNR) is a key component of de novo deoxynucleotide (dNTP) metabolism and an important target of cancer therapies [1]. This enzyme, which reduces ribonucleoside diphosphates into corresponding deoxyribonucleoside diphosphates, is exquisitely controlled to properly balance dNTP fluxes in the face of changing scheduled (S phase) and unscheduled (DNA damage/repair) dNTP synthesis demands [2].

Recent data [36] suggest that ribonucleotide reductase (RNR) exists not only as a heterodimer R12R22 of R12 and R22 homodimers [2], but also as a R14R24 tetramer and as a R16R26 hexamer, where hexamer formation is driven by ATP binding to a previously undescribed hexamerization site. Thus, in addition to its four substrate binding possibilities in ADP, GDP, CDP, or UDP, and four dimerization/specificity site binding possibilities in dATP, ATP, dGTP, or dTTP, the current view (Figure 1) is that R1 has two tetramerization/inhibitory site binding possibilities in dATP or ATP, and one hexamerization/activation site binding possibility in ATP, in addition to possibilities of unbound site states. The resulting large number of possible R1 states implies an even larger number of quaternary states, and this leads to a complicated mathematical model of RNR activity [36]. This model, although useful for explaining RNR activity data, is not useful for biochemical network simulations because: a) it is unwieldy (including up to six-fold nested sums), b) it uses different mathematical structures under different substrate-modulator conditions, and c) it requires root solutions of high order polynomials to determine R1 proportions in mono-, di-, tetra- and hexamer quaternary states, and thus RNR activity, as a function of modulator and total R1 concentrations. Simpler mathematical reaction rate models of RNR are needed if deoxynucleotide metabolism [7] is to be represented using Systems Biology Markup Language (SBML) [810], a standard which requires single algebraic expression reaction rate laws in some applications [11, 12]. Based on recent data from Cooperman's group [46], such expressions are provided here for RNR.

Figure 1
figure 1

Quaternary states of R1. Modulators of RNR activity listed in this figure bind R1 to create higher order quaternary states. Tetramers exist in an equilibrium between low activity states (see k4 in Table 1) and inactive states (kcat = 0). Adapted from Scheme 1 in [4,5].

Results

Reaction activities are viewed here as weighted sums of enzyme state specific activities multiplied by probabilities of enzymes being in specific states. For example, a Michaelis-Menten reaction rate law is viewed as

where the probability that the enzyme is in a loaded/reactive state (with activity kcat) is P(EA) and the probability that the enzyme is in an empty/unreactive state (with no activity) is P(E).

The RNR models presented here are based on the following four enzyme state probability assumptions:

1. The probability that a particular R1 subunit is bound to NDP is assumed to be

2. The probability that L ∈ {ATP, dATP, dTTP, dGTP} is also bound to the dimerization/specificity site, conditional on NDP binding, is assumed to be

where, if L = dTTP and NDP = GDP for example,

is the probability that dTTP is bound to the dimerization/specificity site given that GDP is bound to the substrate site; the probability of an empty dimerization/specificity site is thus

3. The probability that the tetramerization site is either empty, occupied by ATP, or occupied by dATP, is assumed to be, respectively,

4. Finally, the probability that the hexamerization site is occupied by ATP is assumed to be

The subscripts a (activation), i (inactivation), and s (specificity) on the binding constants correspond to h (hexamerization), t (tetramerization), and d (dimerization) subscripts on P, respectively. That parameter values differ depending upon which NDP substrate is bound to the active site (see Table 1) is indicated by the conditional probabilities.

Table 1 Parameter estimates of the reductase models. Fits to data are as shown in Figure 2.

Previous work [36] has shown that the dimer and hexamer states are active, that the tetramer state is slightly active for ADP and GDP and is otherwise inactive, that dimer state activity for CDP and UDP exists when ATP or dATP is bound to the dimerization/specificity site, and that an empty dimerization/specificity site still permits the formation of some dimer with CDP reductase activity, see Table 5 of [5]. Thus, based on the enzyme state probabilities given above, for kcat implicitly defined through

we propose the following expressions:

ADP reduction

GDP reduction

CDP reduction

UDP reduction

In these equations, for ADP and GDP, the first factor is the probability that the dimer site is occupied, and for CDP and UDP, the first factor is the expectation of kcat conditional on R1 being in a dimer state (i.e. having an empty tetramerization site). In the ADP and GDP models, the second factor is the conditional expectation of kcat given that the dimerization site is occupied: the first term of this second factor has in its numerator the statement that kcat = k2 if the tetramerization site is empty, or kcat = k4 if it is occupied by either dATP or ATP, and the second term states that kcat = k6 if the hexamerization site is occupied by ATP. For the CDP and UDP models, the first term of the second factor is the probability of an empty tetramerization site (the event that the corresponding first factor was conditioned on), and the second term states that if the ATP concentration is high enough that the hexamerization/second term dominates the tetramerization/first term whilst the first factor approaches k2A, k6 is the overall kcat. This rationale served as our model selection guide. Importantly, the models fitted recent data [36] very well, see Figure 2 and Table 1.

Figure 2
figure 2

Data from [4-6] and corresponding curve fits (Table 1) of the RNR activity models. In these plots, from left to right, for ADP reduction dGTP was 2.1 uM or variable, for GDP reduction dTTP was 100 uM, 300 uM, variable, or 85 uM, and for pyrimidines specificity site binding concentrations were as shown. In all cases NDP and R2 were at saturating levels.

Discussion

In general, when an integrated system is engineered from component subsystems, the behavior of the overall system depends on component input-output specifications more so than the details of component implementations. By analogy, when enzymological data are applied to biochemical network modeling, rather than the elucidation of reaction mechanisms, it can be expected that the reaction surfaces themselves (i.e. the enzyme's input-output characteristics) determine network behavior more so than the details of how such surfaces are represented. Thus, for applications to systems biology, large confidence intervals (CI) in the model parameter estimates of Table 1 (not shown) are not a problem because only goodness-of-fit (Fig. 2) really matters; this claim assumes an operating range within the data range, since similarly fitting models often veer apart when used in extrapolations. If reaction mechanism inferences were instead being sought, the large CI in the model parameter estimates would have been a problem, e.g. the squared terms in the model suggest cooperative binding, but this choice provides only slightly better curve fits compared to linear terms (not shown), so cooperative binding cannot be inferred from this model.

In the RNR model presented here, the proportion of R1 units existing in monomer, dimer, tetramer, or hexamer states, and thus the RNR activity per unit enzyme, depends on site binding occupancies but does not depend on the total R1 concentration. In the more complicated previous model [36], higher total R1 concentrations favor higher order quaternary states. The degree to which this is so is illustrated by plots of predicted GDP reductase activity as a function of ATP concentration at various R1 concentrations (Figure 3). Consistent with the formation of higher order quaternary R1 states, these plots contract to the left as the total R1 concentrations increase from 1 μM to 100 μM. In future work, the model given here will be altered to capture such trends without losing its simplicity; the total R1 concentration will enter such a model not only as a linear modulator of the reaction surface amplitude (i.e. E0 in Eq. 10), but also as a modifier of reaction surface shape parameters, e.g. KaATP will be replaced by a decreasing function of R1.

Figure 3
figure 3

GDP reductase as a function of ATP concentration at various R1 concentrations, predicted using the earlier model [3-6].

Conclusion

We identified a rational polynomial model of RNR activity that has single algebraic expressions for each reductase reaction rate law. The expressions provide reasonably good fits (Fig. 2) to recent data [36]. Compared to previous reaction rate expressions for this data [36], the new expressions are simpler and thus better suited for biochemical network simulations, particularly those constrained to use enzyme reaction rate laws defined as single algebraic expressions [11, 12].

Methods

The parameter estimates shown in Table 1 were obtained through a trial-and-error iterative process of nonlinear least squares curve fitting under various, convergence enabling, parameter fixations (i.e. profile searches). In the end, the curve fits were those of Figure 2 with corresponding parameter estimates in Table 1; large 95% confidence intervals (not shown) allowed rounding of the parameter estimates to somewhat arbitrary choices. Non-linear least squares parameter estimations were performed using the optimization method of Nelder and Mead [13] and the statistical computing environment R [14]. All parameters were estimated as ec to assure positive values. For additional details, R scripts are available with the data as supplementary material [15].