# Systemic properties of metabolic networks lead to an epistasis-based model for heterosis

- 789 Downloads
- 15 Citations

## Abstract

The genetic and molecular approaches to heterosis usually do not rely on any model of the genotype–phenotype relationship. From the generalization of Kacser and Burns’ biochemical model for dominance and epistasis to networks with several variable enzymes, we hypothesized that metabolic heterosis could be observed because the response of the flux towards enzyme activities and/or concentrations follows a multi-dimensional hyperbolic-like relationship. To corroborate this, we used the values of systemic parameters accounting for the kinetic behaviour of four enzymes of the upstream part of glycolysis, and simulated genetic variability by varying in silico enzyme concentrations. Then we “crossed” virtual parents to get 1,000 hybrids, and showed that best-parent heterosis was frequently observed. The decomposition of the flux value into genetic effects, with the help of a novel multilocus epistasis index, revealed that antagonistic additive-by-additive epistasis effects play the major role in this framework of the genotype–phenotype relationship. This result is consistent with various observations in quantitative and evolutionary genetics, and provides a model unifying the genetic effects underlying heterosis.

## Keywords

Enzyme Concentration Dominance Effect Upstream Part Variable Enzyme Phenotype Relationship## Introduction

Most of the present studies on heterosis, or hybrid vigour, focus on QTL detection to untangle the genetic effects underlying the phenomenon (Hua et al. 2003; Meyer et al. 2009), or on the search for non-additive expression of transcripts or proteins in hybrids to identify possible molecular mechanisms accounting for heterosis for macroscopic traits (Paschold et al. 2009). Despite the interest of these descriptive, model-free approaches, they can hardly provide a general framework to comprehend a universal phenomenon which has many evolutionary and agronomical implications.

In this paper, we propose a fundamentally different way to tackle the problem of heterosis, by using a systemic approach based on metabolic network modelling, which provides a biologically realistic genotype–phenotype relationship.

As soon as 1934, Wright proposed a general explanation for the prevalence, in natural populations, of the dominance of wild alleles upon deleterious alleles. He considered the relationship between the activity of one enzyme in a linear metabolic pathway and the steady-state rate of production of the product of the chain, i.e. the flux. Because the product of one enzyme is the substrate for the next, the effect of changing the activity of a particular enzyme depends on the activities of all the others enzymes of the pathway. Even though at this time the biochemical theory of metabolic fluxes was not very developed, he predicted a hyperbolic relationship between enzyme activity and flux. Thus, if the wild-type enzyme activity is at the plateau of the flux curve and the enzyme activity of the heterozygote is intermediate, null or deleterious mutations will be recessive. This idea was brightly confirmed by the theoretical developments of Kacser and Burns (1981) for chains of Michaelian reversible enzymes. They also showed that epistasis is inherent to this non-linear model of genotype–phenotype relationship: an allelic substitution at one locus will change the effect of allelic substitution at all other loci. Actually, the hyperbolic-like relationship between enzyme activity and flux seems to be valid for most of the networks, regardless their complexity (see Fiévet et al. 2006, for a discussion). In addition Fiévet et al. (2006) reconstructed in vitro the first part of glycolysis, and varied in turn the concentration of the successive enzymes, the concentrations of the other enzymes being fixed. In spite of regulation and branching in the system they used, in all cases they observed a quasi-hyperbolic ascending curve.

More or less directly, fluxes affect all macroscopic traits, including agronomically or horticulturally important traits: seed/fruit weight depends on lipid, starch and/or sugar content, fruit ripening on ethanol synthesis, resistance against herbivores is related to glycosinolate profile, flowering date to hormonal balance, flower colour to anthocyanins, etc. Therefore fluxes through metabolic networks can be considered as model quantitative traits, depending on all the genes coding and/or regulating the enzymes of the network. In this framework, the genetically variable enzyme parameters represent the genotype, whereas the flux is the phenotype. As activity and/or concentration of several enzymes may vary together, the genotype–phenotype relationship can be modelled according to a multidimensional hyperbolic surface. Relying on this biologically realistic modelling, we simulated series of crosses between parents differing for the concentrations of enzymes of the upstream part of glycolysis. Best-parent heterosis was frequently observed, and the decomposition of the flux into genetic effects revealed a tight relationship between heterosis and antagonistic (“less-than-additive”) additive-by-additive epistasis.

## Theoretical developments

### The metabolic model

#### Linear pathways

*n*Michaelian enzymes far from saturation:

_{0}and X

_{ n }are respectively the initial substrate and the final product of the pathway (external metabolites), S

_{1},…, S

_{ j }are the successive substrates of the pathway (internal metabolites), and E

_{1},…, E

_{ n }are the enzymes. At the steady state, the flux through the pathway is (Kacser and Burns 1973):

*X*

_{0}and

*X*

_{ n }are respectively the concentrations of X

_{0}and X

_{ n },

*K*

_{1,j }(resp.

*K*

_{1,n }) is the product of the equilibrium constants of the reactions from X

_{0}to S

_{ j }(resp. X

_{ n }),

*K*

_{mj }is the Michaelis–Menten constant of enzyme

*j*and

*V*

_{ j }is the maximum velocity of enzyme

*j*.

*E*

_{ j }, we can alternatively write:

*k*

_{catj }the catalytic constant of enzyme

*j*. In this paper, we will assume that the

*A*

_{ j }’s are not genetically variable, so all the variability in enzyme activity is due to the variability on the concentrations

*E*

_{ j }. However the formal developments would be identical if the

*A*

_{ j }’s were genetically variable.

#### Networks: the general genotype–phenotype relationship

The previous developments do not apply to pathways that contain non-Michaelian and regulated enzymes, and/or to branched pathways, which require specific derivations of flux expressions based on the detailed kinetic equations of the individual reactions. However, we showed recently that a simple modification of the previous modelling allowed reliable predictions of the flux values for regulated and branched systems (Fiévet et al. 2006). The idea was based on data from the literature showing that increasing the concentration of a given enzyme in a system of any complexity, the concentration of other enzymes being fixed, usually results in quasi-hyperbolic response curve of the flux (saturation curve). In addition, an experimental system fully corroborated this view. We reconstructed in vitro the first part of glycolysis, from hexokinase to glycerol 3-phosphate dehydrogenase (therefore with the TPI branching), and included a cycle to regenerate ATP from ADP with creatine kinase. Increasing from 0 the concentrations of phosphoglucose isomerase (PGI), phosphofructokinase (PFK), fructose-1,6-bisphosphate aldolase (FBA) or triosephosphate isomerase (TPI), the concentrations of other enzymes being fixed, resulted in all cases in quasi-hyperbolic ascending saturation curves (Fiévet et al. 2006).

*X*is a constant,

*A*

_{ j }is a parameter accounting for the kinetic behaviour of enzyme

*j*within the system (therefore its expression is more complex than previously defined), and

*d*

_{ j }is a parameter accounting for the “dispensability” of enzyme

*j*(if there is branching in the pathway, removing some enzymes does not drive the flux to 0), and

*E*

_{tot}is the sum of enzyme concentrations \( \left( {E_{\text{tot}} = \sum\nolimits_{j = 1}^{n} {E_{j} } } \right). \) Note that in the Fiévet et al.’s paper, this expression was a little bit simpler, with the parameter

*p*

_{ j }=

*d*

_{ j }

*E*

_{tot}, i.e.

*E*

_{tot}did not appear explicitly. Here we introduced

*E*

_{tot}for more generality, both to take into account the fact that

*E*

_{tot}may vary (in the cited paper

*E*

_{tot}was fixed), and to derive easily the summation property of the control coefficients (not shown).

This is the relation we have used for modelling the genotype–phenotype relationship. The “phenotype” is flux *J* and the “genotype” is the vector of enzyme concentrations *E* _{ j }’s genetically variable. In silico or in vitro, it is thus possible to simulate genetic variability by varying enzyme concentrations *E* _{ j }, and for each genotype to calculate or measure the flux value from Eq. 2.

### Genotype construction and flux computation

*XA*

_{ j }and

*Xd*

_{ j }

*E*

_{tot}by hyperbolic fitting of the titration curves obtained by varying in turn the concentration of each enzyme. The values were

*XA*

_{PGI}= 499.4 s

^{−1},

*XA*

_{PFK}= 115.5 s

^{−1},

*XA*

_{FBA}= 22.5 s

^{−1},

*XA*

_{TPI}= 22,940 s

^{−1},

*Xd*

_{PGI}

*E*

_{tot}= 0,

*Xd*

_{PFK}

*E*

_{tot}= 0,

*Xd*

_{FBA}

*E*

_{tot}= 0,

*Xd*

_{TPI}

*E*

_{tot}= 61.8 μM/s. As in these experiments

*E*

_{tot}was equal to 2.82 μM, we get

*Xd*

_{TPI}= 21.9 s

^{−1}. Thus for any set of

*E*

_{ j }values, that is to say for any virtual genotype, a flux (“phenotype”) value could be computed from the equation:

Thousand pairs of virtual parental genotypes were created. As the total enzyme amount allocated to the system was variable, but should be necessarily limited to remain biologically realistic (Lion et al. 2004), we proceeded in the following way to choose the enzyme concentrations. For each of the four enzymes, 10 concentration values evenly distributed from 0 to \( E^{\phi } \) were defined (excluding of course these two extreme values), with \( E^{\phi } \) the sum of the physiological concentrations of the enzyme estimated in the yeast strain S288C (Fiévet et al. 2004) (PGI: 9.1 mg/l, PFK: 10.4 mg/l, FBA: 60.1 mg/l and TPI: 22.3 mg/l). The proportions of the three remaining enzymes were drawn at random using beta distributions (α = 1, \( {{\upbeta}} = {\frac{{1 - e_{i}^{\phi } }}{{e_{i}^{\phi } }}}, \) with \( e_{i}^{\phi } \) the physiological proportion of enzyme *i*), to cover a large range of variability of enzyme concentrations. Twenty-five independent drawings were performed for each concentration of the target enzyme, resulting in 1,000 parental distributions (4 enzymes × 10 concentrations × 25 drawings). Each of them was randomly associated to another one to get 1,000 pairs of parents. The total enzyme concentration varied from 0.07 to 2.73 μM.

The predicted flux of each parental genotype was computed according to Eq. 3. The flux of the 1,000 hybrids was computed assuming that (i) there is additivity of all enzyme concentrations, or (ii) there is positive or negative non-additivity of concentrations of PFK and/or FBA (FBA is the most abundant enzyme), with the hybrid concentrations remaining within the range of parental concentrations.

- (i)
\( E_{i1*2} = \max \left( {E_{i1} ,E_{i2} } \right) \): “complete positive non-additivity” (the hybrid concentration is equal to the highest parental concentration).

- (ii)
\( E_{i1*2} = 0.8\max \left( {E_{i1} ,E_{i2} } \right) + (1 - 0.8)\min \left( {E_{i1} ,E_{i2} } \right) \): “partial positive non-additivity”.

- (iii)
\( E_{i1*2} = 0.5\max \left( {E_{i1} ,E_{i2} } \right) + 0.5\min \left( {E_{i1} ,E_{i2} } \right) \): additivity (reference case).

- (iv)
\( E_{i1*2} = 0.2\max \left( {E_{i1} ,E_{i2} } \right) + (1 - 0.2)\min \left( {E_{i1} ,E_{i2} } \right) \): “partial negative non-additivity”.

- (v)
\( E_{i1*2} = \min \left( {E_{i1} ,E_{i2} } \right) \): “complete negative non-additivity” (the hybrid concentration is equal to the lowest parental concentration).

*H*=

*J*

_{1*2}− max (

*J*

_{1},

*J*

_{2}). For each pair of parents, the highest parental flux was noted

*J*

_{2}.

### Decomposition of the genotypic values in the multilocus case

*A*and

*B*, the genotypic values

*G*of the nine possible genotypes can be decomposed as a sum of nine genetic parameters, according to the so-called

*F*

_{∞}-metric model (Van Der Veen 1959):

| | | |
---|---|---|---|

| \( \mu - a_{A} - a_{B} + e_{AB} \) | \( \mu - a_{B} + d_{A} - e_{{Bd_{A} }} \) | \( \mu + a_{A} - a_{B} - e_{AB} \) |

| \( \mu - a_{A} + d_{B} - e_{{Ad_{B} }} \) | \( \mu + d_{A} + d_{B} + e_{{d_{A} d_{B} }} \) | \( \mu + a_{A} + d_{B} + e_{{Ad_{B} }} \) |

| \( \mu - a_{A} + a_{B} - e_{AB} \) | \( \mu + a_{B} + d_{A} + e_{{Bd_{A} }} \) | \( \mu + a_{A} + a_{B} + e_{AB} \) |

*μ*is the mean of the four homozygous genotypes,

*a*

_{ A }and

*a*

_{ B }are the additive effects of genes

*A*and

*B,*respectively,

*d*

_{ A }and

*d*

_{ B }are the dominance effects of genes

*A*and

*B,*respectively,

*e*

_{ AB }is the additive-by-additive epistasis effect between

*A*and

*B*, \( e_{{Ad_{B} }} \)and \( e_{{Bd_{A} }} \) are the additive-by-dominance epistasis effects and \( e_{{d_{A} d_{B} }} \)is the dominance-by-dominance epistasis effect. We chose the

*F*

_{∞}-metric model rather than the

*F*

_{2}-metric model (e.g. Melchinger et al. 2007) because it resulted in equations simpler and easier to interpret (see Yang 2004; Zeng et al. 2005, for discussions on these models).

*L*variable loci:

*δ*

_{ i }takes the value −1 for one of the homozygous genotypes, +1 for the other homozygous genotype, and 0 for the heterozygous genotype.

*μ*is the mean of the homozygote genotypic values,

*a*

_{ i }is the additive effect of gene

*i*,

*d*

_{ i }is the dominance of gene

*i*,

*e*

_{ ij… }is the additive-by-additive epistasis of any order, \( e_{{i \ldots d_{j} \ldots }} \) is the additive-by-dominance epistasis of any order, and \( e_{{d_{i} d_{j} \ldots }} \) is the dominance-by-dominance epistasis of any order; the suspension points are for all the possible epistasis terms for the number

*L*of loci considered. This model is completely determined: the number of parameters is equal to the number of genotypes (3

^{ L }), so there is a complete specification of the genotypic values when the parameters are given, and vice versa. Let

**G**the vector of the genotypic values,

**Δ**the 3

^{ L }× 3

^{ L }matrix of the signs of the genetic parameters (determined from the indicator variables) and

**T**the vector of the genetic parameters. We have

### Determining the genetic effects for the flux with four variable enzymes in the system

*A*

_{1}

*A*

_{1}

*B*

_{1}

*B*

_{1}

*C*

_{1}

*C*

_{1}

*D*

_{1}

*D*

_{1}(respectively

*A*

_{2}

*A*

_{2}

*B*

_{2}

*B*

_{2}

*C*

_{2}

*C*

_{2}

*D*

_{2}

*D*

_{2}). With four biallelic loci, there are 81 possible genotypes (3

^{4}). For each of them, the flux can be computed from Eq. 3. For instance, the flux of genotype

*A*

_{1}

*A*

_{2}

*B*

_{1}

*B*

_{1}

*C*

_{1}

*C*

_{1}

*D*

_{1}

*D*

_{2}if there is additivity of enzyme concentrations is

If there is not additivity, the flux is computed using the coefficients of inheritance as exemplified above.

For each pair of parents, the 81 flux values were computed, from which we derived the 81 genetic effects using Eq. 5: the mean *μ*, 4 additive effects (*a* _{ A } *,* *a* _{ B }, *a* _{ C }, *a* _{ D }), 4 dominance effects (*d* _{ A } *,* *d* _{ B }, *d* _{ C }, *d* _{ D }), 11 additive-by-additive epistasis effect (6 *e* _{ AB }-type, 4 *e* _{ ABC }-type, 1 *e* _{ ABCD }), 50 additive-by-dominance epistasis effects (12 \( e_{{Ad_{B} }} \)-type, 12 \( e_{{ABd_{C} }} \)-type, 4 \( e_{{ABDd_{D} }} \)-type, 12 \( e_{{Ad_{B} d_{C} }} \)-type, 6 \( e_{{ABd_{C} d_{D} }} \)-type, 4 \( e_{{Ad_{B} d_{C} d_{D} }} \)-type) and 11 dominance-by-dominance epistasis effect (6 \( e_{{d_{A} d_{B} }} \)-type, 4 \( e_{{d_{A} d_{B} d_{C} }} \)-type, 1 \( e_{{d_{A} d_{B} d_{C} d_{D} }} \)-type). Thus we got 1,000 vectors of genetic effects.

### Expressing heterosis in terms of genetic effects

*J*

_{ 1 }and

*J*

_{2}the fluxes of two parents genetically different, P1 and P2. We assumed that

*J*

_{2}>

*J*

_{1}. There is best-parent heterosis if the flux of the hybrid,

*J*

_{1*2}, is higher than

*J*

_{2}, or if

*H*can be expressed as a function of the genetic parameters previously defined. Consider that the two lines P1 and P2 differ for

*L*genes, and note their genotypes

*A*

_{1}

*A*

_{1}

*B*

_{1}

*B*

_{1}

*C*

_{1}

*C*

_{1}…

*L*

_{1}

*L*

_{1}and

*A*

_{2}

*A*

_{2}

*B*

_{2}

*B*

_{2}

*C*

_{2}

*C*

_{2}…

*L*

_{2}

*L*

_{2}, respectively. The genotypic value

*G*

_{2}is, from Eq. 4:

*e*

_{add}.

*A*

_{1}

*A*

_{2}

*B*

_{1}

*B*

_{2}…

*L*

_{1}

*L*

_{2}, and its genotypic value, noted G

_{1*2}, is:

*G*

_{1*2}depends only on dominance and on dominance-by-dominance epistasis effects of any order, hereafter noted

*e*

_{dom}.

*H*

*=*

*G*

_{1*2}−

*G*

_{2}writes

*H*is positive, or if

*H*for the 1,000 crosses: from Eq. 6 or from the difference

*J*

_{1*2}−

*J*

_{2}. We checked that both values were identical.

### A generalized epistasis index

*G*

_{ hh }−

*G*

_{ ll }(subscripts

*h*and

*l*for high and low, respectively). Define

*G*

_{ hl }and

*G*

_{ lh }are the genotypic values for genotypes with one high and one low allele. If

*I*= 1, there is additivity, i.e. the difference between

*G*

_{ hh }and

*G*

_{ ll }is just accounted for by the sum of the effects of every individual allelic substitution on the flux. The epistasis is synergistic if

*I*> 1 and antagonistic

*I*< 1. This index is identically valid for diploid homozygote genotypes (pure lines), as considered below.

*G*

_{2}−

*G*

_{1}

*,*with

*G*

_{2}≥

*G*

_{1}) may be compared to the sum of the differences generated by individually substituting each allele of P1 for the allele from P2. Thus we defined the index:

*t*and the P1 alleles for all other genes.

*e*

_{odd}and

*e*

_{even}stand for additive × additive epistasis of any order involving an odd and an even number of genes, respectively.

In the particular case where the genotypic value is a flux through a network, the hyperbolic relation between the flux and the enzyme parameters results in a necessarily positive value for *I*, even if some differences \( \left( {J_{t1} \; - \;J_{1} } \right) \) are negative (see Electronic Supplementary Material). So there is synergistic epistasis if *I* > 1, antagonistic epistasis if 0 < *I* < 1, and additivity if *I* = 1.

The *I* values could be computed either from the flux values (Eq. 7) or from the genetic effects (Eq. 8). We checked that they were identical.

## Results

### A geometric view of heterosis

*J*with respect to the variations of enzyme concentrations

*E*

_{ j }follows a multidimensional hyperbolic surface (Kacser and Burns 1981; Fiévet et al. 2006):

*a*

_{ j }and

*b*

_{ j }are systemic parameters accounting for the kinetic behaviour of enzyme

*j*in the network, and

*E*

_{tot}is the total enzyme amount of the network (see Theoretical developments).

*J*

_{1*2}is higher than the mean parental flux (

*J*

_{1}+

*J*

_{2})/2. This can be seen geometrically on the two-dimensional hyperbolic flux response surface obtained for two variable enzymes (Fig. 1, parents P1 and P2). More interesting, best-parent heterosis can be observed, i.e. hybrid flux

*J*

_{1*2}is

*higher*than the best parental flux:

*J*

_{1*2}> max (

*J*

_{ 1 }

*, J*

_{2}). Best-parent heterosis is expected for the flux whenever the parents are complementary for the “high” and “low” alleles of various enzymes (Fig. 1, parents P3 and P4).

If there is not additivity, the hybrid point is no more mid-way on the line relating the parental points, but is on the part of the surface defined by the upper and lower limits of enzyme concentrations.

### In silico heterosis: data from the upstream part of glycolysis

Results of the simulations of 1,000 crosses between parents differing for the distribution of concentrations of four glycolytic enzymes

Inheritance | Heterosis | | | | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|

PFK | FBA | No | Mid-parent | Best-parent | ∑ | ∑ | ∑ | ∑ | Nb | % | Nb | % |

1 | 0.5 | 0 | 488 | 512 | 0.24 | 0.25 | 0.22 | 0.7 | 501 | 97 | 447 | 87 |

0.8 | 0.5 | 0 | 527 | 473 | 0.27 | 0.2 | 0.19 | 0.72 | 469 | 99 | 423 | 89 |

0.5 | 0.5 | 0 | 600 | 400 | 0.35 | 0.12 | 0.13 | 0.75 | 398 | 99 | 371 | 93 |

0.2 | 0.5 | 209 | 516 | 275 | 0.51 | 0.06 | 0.05 | 0.66 | 275 | 100 | 266 | 97 |

0 | 0.5 | 502 | 372 | 126 | 0.57 | 0.26 | 0.1 | 0.31 | 126 | 99 | 126 | 100 |

0.5 | 1 | 0 | 376 | 624 | 0.09 | 0.36 | 0.36 | 0.73 | 578 | 92 | 508 | 81 |

0.5 | 0.8 | 0 | 456 | 544 | 0.16 | 0.26 | 0.28 | 0.77 | 527 | 97 | 467 | 86 |

0.5 | 0.5 | 0 | 600 | 400 | 0.35 | 0.12 | 0.13 | 0.75 | 398 | 99 | 371 | 93 |

0.5 | 0.2 | 378 | 396 | 226 | 0.55 | 0.15 | 0.05 | 0.47 | 225 | 99 | 218 | 96 |

0.5 | 0 | 599 | 305 | 96 | 0.55 | 0.36 | 0.11 | 0.21 | 96 | 99 | 96 | 100 |

1 | 1 | 0 | 253 | 747 | 0.04 | 0.48 | 0.5 | 0.66 | 618 | 83 | 542 | 73 |

1 | 0 | 510 | 324 | 166 | 0.46 | 0.41 | 0.15 | 0.19 | 165 | 99 | 165 | 99 |

0 | 1 | 335 | 410 | 255 | 0.35 | 0.36 | 0.19 | 0.28 | 255 | 100 | 255 | 100 |

0 | 0 | 892 | 108 | 0 | 0.74 | 0.34 | 0.03 | 0.17 | 0 | – | 0 | – |

When both PFK and FBA had a non-additive inheritance, the number of cases of best-parent heterosis ranged from 748 (complete positive non-additivity for both enzymes) to 0 (complete negative non-additivity for both enzymes), but in the latter case, mid-parent heterosis was still observed 108 times (Table 1).

### Translating flux heterosis into genetic effects: the major role of epistasis

*H*=

*J*

_{1*2}− max(

*J*

_{1},

*J*

_{2}) can be expressed as a sum of genetic effects:

*a*), the sum of the dominance effects (∑

*d*), the sum of the dominance-by-dominance epistasis effects (∑

*e*

_{dom}) and the sum of the additive-by-additive epistasis effects (∑

*e*

_{add}) (see Theoretical developments), and analysed their relationship with index

*H*(Fig. 3).

#### Additivity of enzyme concentrations

*H*, and the sum of dominance effects and the sum of dominance-by-dominance epistasis effects were positively correlated to

*H*(Fig. 3). Due to the large number of data, these correlations were all significant (

*p*< 0.001), but there were striking differences between the

*R*

^{2}values (Table 1). While the

*R*

^{2}was very weak for the sum of dominance effects and the sum of dominance-by-dominance effects (

*R*

^{2}= 0.12 and 0.13 respectively), it was moderate for the sum of additive effects (

*R*

^{2}= 0.35), and quite high for the sum of additive-by-additive epistasis effects (

*R*

^{2}= 0.75) (Fig. 3d). With only two exceptions, the 400 positive

*H*values (i.e. best-parent heterosis) corresponded to negative values of the sum of the additive-by-additive epistasis effects. Plotting the

*H*value against the novel, general epistasis index we defined revealed that antagonistic epistasis is the main factor explaining best-parent heterosis, since 93% (371/400) of the positive

*H*values corresponded to an epistasis index between 0 and 1 (Fig. 4).

#### Non-additivity of enzyme concentrations

These results were quite robust with regard to non-additivity of concentration of one enzyme. In no case the sum of dominance effects and the sum of dominance-by-dominance epistasis effects had the highest *R* ^{2} values. The sum of additive-by-additive epistasis effects kept the highest values when there was positive non-additivity, and also when there was partial negative non-additivity of PFK (Table 1). More importantly, the association between best-parent heterosis and additive-by-additive epistasis was consistently very high: 92–100% of the cases of best-parent heterosis corresponded to negative ∑*e* _{add} values, and 81% to 100% corresponded to antagonistic epistasis.

When there was non-additive inheritance for both enzymes, the highest *R* ^{2} was observed for ∑*e* _{add} when there was complete positive non-additivity for both enzymes; otherwise the highest *R* ^{2} was obtained for the sum of additive effects. But again, the association between best-parent heterosis and additive-by-additive epistasis was very strong, with 83% to 100% of the cases corresponding to negative ∑*e* _{add} values, and 73% to 100% to antagonistic epistasis (Table 1).

## Discussion

The classical linear genotype–phenotype relationship of quantitative genetics has been very powerful for plant and animal breeding, but it is biologically questionable. Finding an explicit function to describe this relationship is of course out of reach, given the cellular complexity and the dramatic increase of the number of parameters with the size of the systems. For that reason, various modelling efforts based on conceptual shortcuts have been proposed to simulate complex cellular behaviours from a limited amount of biological data. In this connection, the metabolic control theory (MCT) proved to be quite powerful (Fell 1992, for a review). One of the outcomes of MCT has been to show that if the concentration/activity of an enzyme changes while the parameters of all other enzymes in the pathway are fixed, the flux displays a saturation curve (Kacser and Burns 1973; Heinrich and Rapoport 1974). Even though there are exceptions (see discussion in Fiévet et al. 2006), there are innumerable examples of such a behaviour (e.g. Kacser and Burns 1981; Fell 1997; Niederberger et al. 1992; Cronwright et al. 2002; Koebmann et al. 2005, etc.). Actually, as argued by Fiévet et al. (2006), the hyperbolic-like relationship between enzyme activity and flux could well be valid also for complex networks. These authors reconstructed in vitro a small network with regulation and branching, and observed such a relationship for the four enzymes the concentration of which was modified. More importantly, the hyperbolic-like relation can also be observed at other levels of cell/individual organization, from transcription to integrated phenotype. Rossignol et al. (2003) showed that the deleterious effects of a mitochondrial mutation were accounted for by a saturation curve at various levels of the expression of the mutation: translation, enzyme complex activity, respiratory flux, cell growth and clinical manifestations. Therefore, modelling the genotype–phenotype relying on a hyperbolic-like relationship could be biologically relevant for a large range of macroscopic traits.

To analyse heterosis in this framework, we approximated the flux through a network using a multidimensional hyperbolic modelling in which the kinetic behaviour of each enzyme was described by two systemic parameters. We decomposed the flux into genetic effects, and examined the relationships between these genetic effects and heterosis for glycolytic flux in a series of 1,000 virtual crosses between parents differing for their distribution of enzyme concentrations. We varied the concentrations because theoretical studies (Pettersson 1989) and experimental data suggested that enzyme concentrations are more likely to vary than their catalytic properties (Bulfield et al. 1978; Eanes et al. 1990; Tarun et al. 1998). In any case, introducing kinetic variable systemic parameters into the model is possible, and would not modify the theoretical framework. Concerning inheritance of enzyme concentrations, additivity is supported by classical observations (Kacser and Burns 1981). However, proteomic studies have shown that even though protein concentrations are indeed in majority additive, there are cases of non-additive inheritance (Leonardi et al. 1991; Kollipara et al. 2002; Hoecker et al. 2008). So we performed simulations assuming on the one hand additivity, on the other hand positive and negative non-additive inheritance for PFK and/or FBA, the latter being the most variable enzyme among the parents.

If there is additivity of enzyme concentrations, the convexity of the response of the flux towards enzyme concentrations makes mid-parent heterosis inevitable and, depending on the distribution of the parental enzyme concentrations, may result in best-parent heterosis. If there is partial dominance of the low allele, heterosis is no more inevitable. In all cases, the sum of the additive-by-additive epistasis effects, and in a lesser extent the sum of the additive effects, were negatively correlated to the difference between hybrid flux and the best-parent flux (*H*). Best-parent heterosis (*H* > 0) corresponded in almost all cases to negative values of the sum of the additive-by-additive epistasis effects. By contrast, and unexpectedly, the dominance and dominance-by-dominance epistasis effects did not seem to play a large role. Actually, this apparent paradox can be explained in the following way. As shown geometrically Fig. 1, dominance is sufficient to have mid-parent heterosis (parents P1 and P2), while both dominance *and* additive-by-additive epistasis are required to get best-parent heterosis (parents P3 and P4). It is mainly the level of additive-by-additive epistasis that drives the *H* value (Fig. 3d). The novel and general epistasis index we defined (*I*) does not include any dominance or dominance-by-dominance epistasis effect. When there was best-parent heterosis, index *I* took usually a value lower than unity, indicative of antagonistic (less-than-additive) epistasis. In other words, when the phenotypic difference between two parents is below the sum of the effects of every individual allelic substitution in the lowest genotype, their hybrid usually exhibits a high phenotypic value.

*A/a*and

*B/b*,

*a*and

*b*being recessive, and leading to a flux equal to zero when homozygous. We get the following table of possible phenotypic values (0 and 1 for absence and presence of flux, respectively):

| | | |
---|---|---|---|

| 1 | 1 | 0 |

| 1 | 1 | 0 |

| 0 | 0 | 0 |

The cross between *AAbb* and *aaBB*, which have no flux, will produce the hybrid *AaBb*, with a restored flux. This situation represents the simplest possible case for determining the genetic effects and indices we defined. We have *H* = 1 (best-parent heterosis), ∑*e* _{add} = *e* _{ AB } = −0.25 and *I* = 0 (maximum antagonistic epistasis). In an F2 progeny, this case corresponds to the classical 9:7 segregation, where dominance *and* epistasis occurs (with dominance alone, we would have a 9:6:1 segregation).

Heterosis appears as an emergent property of the system, because the properties of each enzyme separately are not sufficient to account for the phenomenon. With total enzyme amount half-way between its parents, hybrids can display best-parent heterosis, which corresponds to a better exploitation of cell resources than in the parents. This conclusion is consistent with the negative correlation between energy cost of growth and mean individual heterozygosity classically described in marine animals (e.g. Koehn 1991; Bayne and Hawkins 1997; Danzmann et al. 1987).

Antagonistic epistasis, which is supposed to be favoured by natural selection (Desai et al. 2007), is commonly observed in populations, as attested by experiments of accumulation of mutations which show that the decrease of fitness with the number of mutations is “less-than-additive” (e.g. Maisnier-Patin et al. 2005; Silander et al. 2007). Antagonistic epistasis has also been evidenced from the comparison of chromosome substitution strains, in plants and animals, which have revealed that the sum of the individual chromosomal effects often dramatically exceeded the difference between the parental strains (Redden 1991; Shao et al. 2008). Finally, marker-based studies have revealed less-than-additive interactions between QTL (Eshed and Zamir 1996; Ming et al. 2001). All these data are consistent with the frequent occurrence of heterosis observed in all species.

We may speculate that the common occurrence of both antagonistic epistasis and heterosis in natural populations reflects the non-linearity of the genotype–phenotype relationship for the vast majority of the phenotypic traits. Any evolutionary process that may stabilize the favourable epistatic interactions, su£ch as gene duplications or constraint on the recombination rate, should be selected as it reduces the genetic load inherent to heterosis in populations.

## Notes

### Acknowledgments

We thank Professor Olivier C. Martin for his help to demonstrate that the generalized epistasis index is positive.

### Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

## Supplementary material

## References

- Bateson W, Saunders ER, Punnett RC, Hurst CC (1905) Reports to the Evolution Committee of the Royal Society, Report II (Harrison and Sons, London)Google Scholar
- Bayne BL, Hawkins AJ (1997) Protein metabolism, the costs of growth, and genomic heterozygosity: experiments with the Mussel
*Mytilus galloprovincialis*Lmk. Physiol Biochem Zool 70:391–402CrossRefGoogle Scholar - Bulfield G, Moore EA, Kacser H (1978) Genetic variation in activity of the enzymes of glycolysis and gluconeogenesis between inbred strains of mice. Genetics 89:551–561PubMedGoogle Scholar
- Cronwright GR, Rohwer JM, Prior BA (2002) Metabolic control analysis of glycerol synthesis in Saccharomyces cerevisiae. Appl Environ Microbiol 68:4448–4456CrossRefPubMedGoogle Scholar
- Danzmann RG, Ferguson MM, Allendorf FW (1987) Heterozygosity and oxygen-consumption rate as predictors of growth and developmental rate in rainbow trout. Physiol Zool 60:211–220Google Scholar
- Desai MM, Weissman D, Feldman MW (2007) Evolution can favor antagonistic epistasis. Genetics 177:1001–1010CrossRefPubMedGoogle Scholar
- Dooner HK, Robbins TP, Jorgensen RA (1991) Genetic and developmental control of anthocyanin biosynthesis. Annu Rev Gene 25:173–199CrossRefGoogle Scholar
- Eanes W, Katona L, Longtime M (1990) Comparison of in vitro and in vivo activities associated with the g6pd allozyme polymorphism in
*Drosophila melanogaster*. Genetics 125:845–853PubMedGoogle Scholar - Eshed Y, Zamir D (1996) Less-than-additive epistatic interactions of quantitative trait loci in tomato. Genetics 141:1147–1162Google Scholar
- Fell DA (1992) Metabolic control analysis: a survey of its theoretical and experimental background. Biochem J 286:313–330PubMedGoogle Scholar
- Fell DA (1997) Understanding the control of metabolism. Portland Press, LondonGoogle Scholar
- Fiévet J, Dillmann C, Lagniel G, Davanture M, Négroni L, Labarre J, de Vienne D (2004) Assessing factors for reliable quantitative proteomics based on two-dimensional electrophoresis. Proteomics 4:1939–1949CrossRefPubMedGoogle Scholar
- Fiévet JB, Dillmann C, Curien G, de Vienne D (2006) Simplified modelling of metabolic pathways for flux prediction and optimization: lessons from an in vitro reconstruction of the upper part of glycolysis. Biochem J 396:317–326CrossRefPubMedGoogle Scholar
- Hayman BI, Mather K (1955) The description of genetic interactions in continuous variation. Biometrics 11:69–82CrossRefGoogle Scholar
- Heinrich R, Rapoport TA (1974) A linear steady-state treatment of enzymatic chains. General properties, control and effector strength. Eur J Biochem 42:89–95CrossRefPubMedGoogle Scholar
- Hoecker N, Lamkemeyer T, Sarholz B et al (2008) Analysis of nonadditive protein accumulation in young primary roots of a maize (
*Zea mays L*) F(1)-hybrid compared to its parental inbred lines. Proteomics 8:3882–3894CrossRefPubMedGoogle Scholar - Hua J, Xing Y, Wu W, Xu C, Sun X, Yu S, Zhang Q (2003) Single-locus heterotic effects and dominance by dominance interactions can adequately explain the genetic basis of heterosis in an elite rice hybrid. Proc Natl Acad Sci USA 100:2574–2579CrossRefPubMedGoogle Scholar
- Kacser H, Burns JA (1973) The control of flux. Symp Soc Exp Biol 27:65–104PubMedGoogle Scholar
- Kacser H, Burns JA (1981) The molecular basis of dominance. Genetics 97:639–666PubMedGoogle Scholar
- Keightley P (1996) Metabolic models of selection response. J Theor Biol 182:311–316CrossRefPubMedGoogle Scholar
- Koebmann B, Solem C, Jensen PR (2005) Control analysis as a tool to understand the formation of the las operon in
*Lactococcus lactis*. FEBS J 272:2292–2303CrossRefPubMedGoogle Scholar - Koehn RK (1991) The cost of enzyme synthesis in the genetics of energy balance and physiological performance. Biol J Linn Soc 44:231–247CrossRefGoogle Scholar
- Kollipara KP, Saab IN, Wych RD, Lauer MJ, Singletary GW (2002) Expression profiling of reciprocal maize hybrids divergent for cold germination and desiccation tolerance. Plant Physiol 129:974–992CrossRefPubMedGoogle Scholar
- Leonardi A, Damerval C, Gallais A, de Vienne D (1991) Association of protein amount polymorphism (PAP) among Maize lines with performance of their hybrids. Theor Appl Genet 82:552–560CrossRefGoogle Scholar
- Lion S, Gabriel F, Bost B, Fiévet J, Dillmann C, de Vienne D (2004) An extension to the metabolic control theory taking into account correlations between enzyme concentrations. Eur J Biochem 271:4375–4391CrossRefPubMedGoogle Scholar
- Maisnier-Patin S, Roth JR, Fredriksson A, Nyström T, Berg OG, Andersson DI (2005) Genomic buffering mitigates the effects of deleterious mutations in bacteria. Nat Genet 37:1376–1379CrossRefPubMedGoogle Scholar
- Melchinger AE, Utz HF, Piepho HP, Zeng ZB, Schön CC (2007) The role of epistasis in the manifestation of heterosis: a systems-oriented approach. Genetics 177:1815–1825CrossRefPubMedGoogle Scholar
- Meyer RC, Kusterer B, Lisec J, Steinfath M, Becher M, Scharr H, Melchinger AE, Selbig J, Schurr U, Willmitzer L, Altmann T (2009) QTL analysis of early stage heterosis for biomass in Arabidopsis. Theor Appl Genet (this issue)Google Scholar
- Ming R, Liu SC, Moore PH, Irvine JE, Paterson AH (2001) QTL analysis in a complex autopolyploid: genetic control of sugar content in sugarcane. Genome Res 11:2075–2084CrossRefPubMedGoogle Scholar
- Niederberger P, Prasad R, Miozzari G, Kacser H (1992) A strategy for increasing an in vivo flux by genetic manipulations. The tryptophan system of yeast. Biochem J 287:473–479PubMedGoogle Scholar
- Paschold A, Marcon C, Hoecker N, Hochholdinger F (2009) Molecular dissection of heterosis manifestation during early maize root development. Theor Appl Genet (this issue)Google Scholar
- Pettersson G (1989) Effect of evolution on the kinetic properties of enzymes. Eur J Biochem 184:561–566CrossRefPubMedGoogle Scholar
- Phillips PC (2008) Epistasis—the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet 9:855–867CrossRefPubMedGoogle Scholar
- Redden R (1991) The effect of epistasis on chromosome mapping of quantitative characters in wheat. I. Time to spike emergence. Aust J Agric Res 42:1–11CrossRefGoogle Scholar
- Rossignol R, Faustin B, Rocher C, Malgat M, Mazat JP, Letellier T (2003) Mitochondrial threshold effects. Biochem J 370:751–762CrossRefPubMedGoogle Scholar
- Shao H, Burrage LC, Sinasac DS, Hill AE, Ernest SR, O’Brien W, Courtland HW, Jepsen KJ, Kirby A, Kulbokas EJ, Daly MJ, Broman KW, Lander ES, Nadeau JH (2008) Genetic architecture of complex traits: large phenotypic effects and pervasive epistasis. Proc Natl Acad Sci USA 105:19910–19914CrossRefPubMedGoogle Scholar
- Silander OK, Tenaillon O, Chao L (2007) Understanding the evolutionary fate of finite populations: the dynamics of mutational effects. PLoS Biol 5:e94CrossRefPubMedGoogle Scholar
- Sinnott EW, Dunn LC (1939) Principles of genetics. McGraw-Hill Book Company, Inc., New-York and LondonGoogle Scholar
- Tarun A, Lee J, Theologis A (1998) Random mutagenesis of 1-aminocyclopropane-1-carboxylate synthase: a key enzyme in ethylene biosynthesis. Proc Natl Acad Sci USA 95:9796–9801CrossRefPubMedGoogle Scholar
- Van Der Veen JH (1959) Tests of non-allelic interaction and linkage for quantitative characters in generations derived from two diploid pure lines. Genetica 30:201–232CrossRefGoogle Scholar
- Wright S (1934) Molecular and evolutionary theories of dominance. Am Nat 68:24–53CrossRefGoogle Scholar
- Yang RC (2004) Epistasis of quantitative trait loci under different gene action models. Genetics 167:1493–1505CrossRefPubMedGoogle Scholar
- Zeng ZB, Wang T, Zou W (2005) Modeling quantitative trait loci and interpretation of models. Genetics 169:1711–1725CrossRefPubMedGoogle Scholar