Introduction

In homozygotes, additive-by-additive interactions (QTL-by-QTL interaction, gene–gene interaction or epistasis) can play a very important role in controlling the expression of quantitative traits and are typically defined as statistical deviation from the additive genetic effects. However, information about quantitative trait locus (QTL) epistatic interaction can facilitate marker-assisted selection (MAS) for quantitative traits in breeding programs of different plants, for example QTLs with epistatic effects can be used in methods of MAS. If epistasis is important, then MAS schemes should be designed to exploit it. In the presence of epistasis, MAS generally yields more persistent responses than that based exclusively on additive or additive—dominance model. Neglecting epistasis could result in considerable loss in response, that will become more pronounced in later generations. Tests of epistasis are a powerful tool that developmental biologists use to determine the order of developmental triggers in gene regulation pathways, but such tests are most effective when alleles conferring complete loss-of-function of the genes under study are used.

With increasing evidence supporting the claim that epistatic interactions are usually involved in the genetic variation of complex traits (Mao et al. 2006; Tabanao and Bernardo 2007), several complicated mapping models were developed to analyze epistatic effects: expanded composite interval mapping (CIM) to multiple interval mapping (Kao et al. 1999), mixed linear model based CIM (Wang et al. 1999), Bayesian approach (Yang et al. 2007; Yi et al. 2007), and weighted multiple linear regression (Bocianowski 2012c). Jannink and Jansen (2001) suggested mapping QTLs with epistasis between QTLs and backgrounds using one-dimensional genome search.

The importance of epistasis has been suggested in numerous classic quantitative genetics studies (Spickett and Thoday 1966; Falconer 1981; Mather and Jinks 1982; Pooni et al. 1987; da Silva Guimarães et al. 2010). Epistasis as an important genetic basis of complex phenotypes has also been revealed in several recent QTL mapping studies (Li et al. 1997; Yu et al. 1997; Ma et al. 2005; Liu et al. 2011; Rahman et al. 2011; Borràs-Gelonch et al. 2012; Krajewski et al. 2012). However, marker-based analyses for estimation of QTL effects usually assume the absence of epistasis among QTLs (Lander and Botstein 1989; Jansen 1993; Zeng 1994; Bocianowski and Krajewski 2009; Rovaris et al. 2011). This assumption was made largely for simplification of the statistical models. However, acceptance of this assumption may result in biased estimates of the positions and effects of QTLs and lower precision and power for QTL detection.

Currently, epistatic interaction is taken into account in research with regard to its important role in control of quantitative traits (Bocianowski 2008, 2012a, b, c). In mapping studies, epistasis is usually defined as an interaction between QTLs with the assumption that epistatic interaction effects are shown only by loci with significant additive gene action effects. One of the conclusions of a previous simulation study (Bocianowski 2012a) was that the estimate of the total epistatic interaction effect based on the QTLs with significant additive gene action effects was smaller than the effect obtained from traditional quantitative genetics method (based on only phenotypic observations—without marker observations). However, in many mapping cases, when considering inbred lines, genes with small additive effects do not come into the final model and the total epistatic interaction effect is biased. Many loci may not have a significant effect directly on the trait, but they may affect trait expression by interacting with other loci. Hence, the aim of this paper is to estimate epistatic interactions, using inbred lines as an example, between all the loci used in the experiment in comparison to the values when epistatic effects were assessed using only QTLs with significant additive effects.

Materials and methods

Datasets

The first dataset included 120 doubled haploid (DH) lines of barley, derived from the cross between the Australian barley variety Clipper and the Algerian landrace Sahara 3771 (CS) at the Waite Agricultural Research Institute, University of Adelaide, Australia (Karakousis et al. 2003). The lines were analyzed with respect to four phenotypic traits: beta-amylase activity (BA), alpha-amylase activity (AA), beta-glucanase activity (BG), and cyst nematode resistance (CCN). Observations of 183 molecular markers (SSR and RFLP) were used in the research.

The second dataset was the barley Harrington × TR306 (HT) DH population (Tinker et al. 1996), a well-known population from the North American Barley Genome Mapping Project (http://wheat.pw.usda.gov/ggpages/maps/Hordeum). The data matrix consisted of 145 DH lines. The DH lines were analyzed for seven phenotypic traits (weight of grain harvested per unit area, WG; number of days from planting until emergence of 50 % of heads on main tillers, NH; number of days from planting until physiological maturity, NM; plant height, H; lodging transformed by \(\arcsin \sqrt {{x \mathord{\left/ {\vphantom {x {100}}} \right. \kern-0pt} {100}}}\), L; 1,000 kernel weight, KW; and test weight, TW). The map used in the study was composed of 127 molecular markers (mostly RFLP) with the mean distance between the markers being 10.62 cm. The results shown are observations from five locations (in four locations where the observations were made during 2 years): ON92a—Ailsa Craig, Ontario, 1992; ON93a—Ailsa Craig, Ontario, 1993; ON92b—Elora, Ontario, 1992; ON93b—Elora, Ontario, 1993; MB92—Brandon, Manitoba, 1992; MB93—Brandon, Manitoba, 1993; QC93—Ste-Anne-de-Bellevue, Quebec, 1993; SK92a—Outlook, Saskatchewan, 1992; and SK93a—Outlook, Saskatchewan, 1992.

The third population dataset included 150 DH lines of barley, obtained from the Steptoe × Morex (SM) cross, used in the North American Barley Genome Mapping project and tested in sixteen environments (Kleinhofs et al. 1993; Romagosa et al. 1996; http://wheat.pw.usda.gov/ggpages/SxM). The linkage map used consisted of 223 molecular markers, mostly RFLP, with a mean distance between markers of 5.66 cM. The DH lines were analyzed for eight phenotypic traits (alpha amylase, AA; diastatic power, DP; grain protein, GP; grain yield, GY; height, H; heading date, HD; lodging, L; and malt extract, ME) (Hayes et al. 1993). Grain protein, lodging and malt extract were transformed by \(\arcsin \sqrt {{x \mathord{\left/ {\vphantom {x {100}}} \right. \kern-0pt} {100}}}\).

Missing marker data in all the three datasets were estimated by the method of Martinez and Curnow (1994), that is, with the use of non-missing data of flanking markers.

Genetic model

Estimation of additive × additive epistatic interactions (aa) was based on the assumption that the genes responsible for the trait were closely linked to observed molecular marker (Bocianowski 2012b). A two-stage algorithm was employed for the selection of significant pairs of loci (markers) with significant epistasis interaction effects.

In the first step of the selection, a fixed linear model for the simultaneous search for two interacting loci (\(m_{{l_{i} }}\) and \(m_{{l_{j} }}\), i, j = 1, 2,…, p; l 1, l 2,…, l p ∈{1, 2,…, q}, where p denotes the number of selected loci, q is the number of loci) can be expressed as follows:

$$y_{k} \; = \;\mu \; + \;a_{{l_{i} }} m_{{l_{i} k}} \; + \;a_{{l_{j} }} m_{{l_{j} k}} \; + \;aa_{{l_{i} l_{j} }} m_{{l_{i} k}} m_{{l_{j} k}} \; + \;\varepsilon_{k} ,$$
(1)

where y k is the phenotypic value of a quantitative trait measured on the k-th individual (k = 1, 2, …, n), μ is the population mean, \(a_{{l_{i} }}\) and \(a_{{l_{j} }}\) are the additive effects (fixed) of the two loci (\(m_{{l_{i} }}\) and \(m_{{l_{i} }}\)), respectively, \(aa_{{l_{i} l_{j} }}\) is the epistatic interaction effect (fixed) between \(m_{{l_{i} }}\) and \(m_{{l_{j} }}\), \(m_{{l_{ik} }}\) and \(m_{{l_{jk} }}\) are observations of the i-th and j-th loci of the k-th individual, and \(\varepsilon_{k} \sim N\left( {0,\;\sigma^{2} } \right)\) is the random residual effect.

Model (1) can be written as a matrix form of the fixed linear model:

$$y\; = \;X\beta \; + \;\varepsilon ,$$

where y is an n-dimensional vector of phenotypic values, X is known incidence matrix, β = \(\left( {\mu , \, a_{{l_{i} }} , \, a_{{l_{j} }} , \, aa_{{l_{i} l_{j} }} } \right)^{T}\) is a 4-vector of fixed effects, and ε ~ \(N\left( {0, \, \sigma^{2} I} \right)\) is an n-vector of random residuals.

Marker pairs selected in the first step were subjected to backward stepwise selection in the second stage. The final model is as follows:

$$y_{k} \; = \;\mu \; + \;\sum\limits_{t = 1}^{p} {a_{{l_{t} }} m_{{l_{tk} }} } \; + \;\sum\limits_{t = 1}^{p - 1} {\sum\limits_{\begin{subarray}{l} t^{\prime} = t + 1 \\ t^{\prime} \ne t \end{subarray} }^{p} {aa_{{l_{t} l_{{t^{\prime}}} }} m_{{l_{tk} }} m_{{l_{{t^{\prime}k}} }} } \; + \;\varepsilon_{k} ,}$$
(2)

where l t , l t ∈ {1, 2, …, q}. In the second step the critical significance level amounting to 0.001, resulting from the Bonferroni correction (Province 1999), was used. The Bonferroni correction is a method used to counteract the problem of multiple comparisons. It is considered the simplest and most conservative method to control type I errors for multiple tests. If total number of all epistatic pairs is equal to 50, than the global significance level needed to be adjusted to α = 0.05/50 = 0.001.

The total epistatic interaction effect of gene pairs influencing the trait, defined as the sum of values of individual pairs’ effects from model (2), can be found as:

$$\widehat{aa}\; = \;\sum\limits_{t = 1}^{p - 1} {\sum\limits_{\begin{subarray}{l} t^{\prime} = t + 1 \\ t^{\prime} \ne t \end{subarray} }^{p} {\widehat{aa}_{{l_{t} l_{{t^{\prime}}} }} .} }$$
(3)

The coefficients of determination were used to measure how the models (1) and (2), respectively, fitted the data and, in this study, were the amount of the phenotypic variance explained by individual pairs of interactive markers (R 2 i ) and by total pairs of interactive markers (R 2 T ).

Analyses of the data were performed using the statistical package GenStat v. 10.1 (GenStat 2007).

Results

A total of 31 pairs of loci with epistatic effects were detected for CS mapping population (Table 1), 194 pairs for HT population (Table 2) and 503 pairs in SM population (Table 3). The observed epistatic total effects of loci pairs were: negative in 40 cases, while positive in 23 cases, for the HT population (Table 2), negative in 34 cases (37.78 %), while positive in 56 cases for SM population (Table 3). For individual traits the sign of the epistatic effect depended on the environment. The percentage of phenotypic variation explained by an individual pair of loci (R 2 i ) ranged from 6.7 to 23.9 % for the CS population (Table 1), from 6.7 to 11.5 % for the HT population (Table 2) and from 6.5 to 14.9 % for the SM population (Table 3). The total percentage of phenotypic variation explained by all the pairs of epistatic loci together (R 2 T ) ranged from 44.2 to 62.9 % for the CS population (Table 1), from 6.7 to 48.5 % for the HT population (Table 2) and from 6.9 to 93.2 % for the SM population (Table 3). For the CS population the loci pair CDO105-BCD135 influenced two phenotypic traits: AA and BG. Therefore, CDO105-BCD135 might be a pleiotropic pair of loci or very tight linkage. Additionally, in the Tables 1, 2 and 3 are presented differences between epistatic effects assessed using all loci (aa) and epistatic effects assessed using only loci with significant additive effects (aa g ), it should be noted that the aa g values have been presented in previous studies (Bocianowski 2008, 2012b).

Table 1 Number of epistatic interaction pairs (No.), the total epistatic interaction effect (aa), difference between aa and epistatic effects assessed using only QTLs (aa−aa g ), percentage of variation explained by individual pairs (R 2 i ) and total percentage of variation explained by all pairs—R 2 T for 120 DH lines of barley from the Clipper × Sahara 3771 cross
Table 2 Number of epistatic interaction pairs (No.), the total epistatic interaction effect (aa), difference between aa and epistatic effects assessed using only QTLs (aa−aa g ), percentage of variation explained by individual pairs (R 2 i ) and total percentage of variation explained by all pairs—R 2 T for doubled haploid lines from Harrington × TR306 cross
Table 3 Number of epistatic interaction pairs (No.), the total epistatic interaction effect (aa) and difference between aa and epistatic effects assessed using only QTLs (aa−aa g ), percentage of variation explained by individual pairs (R 2 i ) and total percentage of variation explained by all pairs—R 2 T for doubled haploid lines from Steptoe × Morex doubled haploid lines barley population

Discussion

The genetic variation in continuous traits is usually governed by a polygenic network system, composed of many genes with small effect, and sometimes including one or a few genes with large effect. Loci with minor or no individual effect can also be involved in epistatic interaction. The statistical power to detect pairwise epistatic interactions is lower than for the main QTL because the tests of significance must be conducted for two intervals rather than just one, and consequently a higher critical threshold per test must be applied to overcome the problem of multiple tests (Melchinger et al. 2007). This can be translated into small-effect interactions that would remain undetected unless a large number of individuals are considered. Quantitative traits are determined by many genes with small effects. In this paper the epistatic effects were estimated as an interaction between all pairs of loci. In total, 728 pairs of loci were found for the three datasets. The very important and difficult problem is the size of the population for estimating of epistasis effect based on the model with a huge amount of effects. The larger power of detection of pairwise epistatic interactions would be the result of use of larger mapping population. Additionally, for a larger mapping population we can obtain greater precision of estimate, which can be a positive implication in plant breeding.

More epistatic pairs were obtained for all the four cases of the DH lines from the CS population than in the previous paper (Bocianowski 2008), in which epistatic effects were considered as interactions between loci with significant additive effects. After the analysis of the HT DH lines in the four cases (NM in MB92, H in ON93a, L in ON92a and L in ON92b), epistatic effects showed only loci with significant additive effects—differences aa−aa g equal to 0 (compared with results presented by Bocianowski 2012b). In 38 cases, the epistatic effects calculated as locus × locus interactions were non-significant (see Bocianowski 2012b). In this paper, those effects were statistically significant different aa−aa g  = aa (Table 2). In two cases (AA in MTi91 and ME in WA92) for the SM DH lines, the epistatic effects had the same values as were reported by Bocianowski (2012b) (aa−aa g  = 0, Table 3). In most cases an improvement in estimates of epistatic interaction effects was observed in relation to the results obtained with the assumption that epistatic effects are only relevant between loci with significant additive genetic effects. This improvement was expressed as: (1) significant epistatic effects in the cases when they were absent for markers with additive effects (aa−aa g  = aa); (2) estimates of epistatic effects closer to those obtained by the traditional quantitative genetics method based on only phenotypic observations—without marker observations (see Bocianowski 2012a, b) in providing results which are closer to the true (unknown) value.

Interactions between loci with non-significant main-effects were obtained by Ma et al. (2007), Imtiaz et al. (2008) and Peng et al. (2011), while Charmet et al. (1999) considered situations when at least one of the interactive QTLs had an additive effect. Bocianowski (2013) investigated the effect of consideration/non-consideration of the epistatic effects on the additive gene action effects. Comparison of the assessment of the estimated additive gene action effects, based on the model without epistasis, with that resulting from the model taking into consideration the epistasis effect, was made. The results obtained showed the advantages of including the interactions by giving an improvement in accuracy in predicting the genotypic value of the progenies.

The proposed method of estimation of epistasis as an interactions between all pairs of loci is more effective than the method using only QTLs with significant additive effects. Estimation of additive × additive interaction epistatic effects on the basis of model presented by Bocianowski (2012a, b), using only selected genes—with significant main effects, can be biased. The estimator of epistasis effects based on method proposed in this paper is unbiased because the model contains all potential pairs of loci which determine the quantitative trait.

Quantitative trait locus-analysis tools are useful for the analysis of complex traits in DH populations and for identification of favorable alleles in diverse germplasm. Characterization of the determinants of economically important phenotypes showing complex inheritance (by estimation of genetic parameters in an additive and additive-by-additive epistasis interaction model) should lead to more effective use of genetic resources. So, by way of illustration, a positive (negative) epistatic effects of additive-by-additive interactions (aa ij ) suggests that the two epistatic loci with homozygous alleles from the same parent (QiQiQjQj or qiqiqjqj) would increase (decrease) the trait value, otherwise QiQiqjqj or qiqiQjQj could decrease (increase) the phenotype. Thus the “favorable” allele is a relative concept because of epistasis. A “favorable” allele with positive additive effects may become “unfavorable” following transfer into a new variety due to large negative additive-by-additive effects in the new genetic background. Therefore, not only the QTLs with additive effects should be considered in selection programs, but also additive-by-additive epistatic effects among these QTLs and others. In this paper the epistatic effects have different signs in the individual environments. Hence, this opens the potential for research on an issue that could be very important in breeding programs.

The results obtained indicate that many loci with epistatic effects might not have significant direct effects for quantitative trait in their own right but might affect its expression by their epistatic effects with the other loci.