Advanced backcross-QTL analysis in spring barley (H. vulgare ssp. spontaneum) comparing a REML versus a Bayesian model in multi-environmental field trials

Bauer, Andrea Michaela; Hoti, F.; von Korff, M.; Pillen, K.; Léon, J.; Sillanpää, M. J.

doi:10.1007/s00122-009-1021-6

Original Paper
Open access
Published: 11 April 2009

Volume 119, pages 105–123, (2009)
Theoretical and Applied Genetics

Andrea Michaela Bauer¹,
F. Hoti²,³,
M. von Korff⁴,
K. Pillen⁵,
J. Léon¹ &
…
M. J. Sillanpää³

2252 Accesses
25 Citations

Abstract

A common difficulty in mapping quantitative trait loci (QTLs) is that QTL effects may show environment specificity and thus differ across environments. Furthermore, quantitative traits are likely to be influenced by multiple QTLs or genes having different effect sizes. There is currently a need for efficient mapping strategies to account for both multiple QTLs and marker-by-environment interactions. Thus, the objective of our study was to develop a Bayesian multi-locus multi-environmental method of QTL analysis. This strategy is compared to (1) Bayesian multi-locus mapping, where each environment is analysed separately, (2) Restricted Maximum Likelihood (REML) single-locus method using a mixed hierarchical model, and (3) REML forward selection applying a mixed hierarchical model. For this study, we used data on multi-environmental field trials of 301 BC₂DH lines derived from a cross between the spring barley elite cultivar Scarlett and the wild donor ISR42-8 from Israel. The lines were genotyped by 98 SSR markers and measured for the agronomic traits “ears per m²,” “days until heading,” “plant height,” “thousand grain weight,” and “grain yield”. Additionally, a simulation study was performed to verify the QTL results obtained in the spring barley population. In general, the results of Bayesian QTL mapping are in accordance with REML methods. In this study, Bayesian multi-locus multi-environmental analysis is a valuable method that is particularly suitable if lines are cultivated in multi-environmental field trials.

Introduction

Detecting favourable exotic quantitative trait loci (QTLs) and introducing them into elite lines could greatly enhance breeding success. Tanksley and Nelson (1996) proposed an advanced backcross QTL analysis combining QTL discovery and variety development in a single step. Using advanced backcross populations derived from a cross of an elite cultivar with an exotic donor, it is possible to identify superior exotic QTLs, whereas the number of negative alleles from the unadapted material is reduced.

In order to map QTLs, the plant material is genotyped by DNA markers and measured on agronomic traits in multi-environmental field trials. In the following statistical analysis, significant associations between DNA markers and phenotypic traits are determined. As quantitative traits are influenced by multiple genes having effects of different magnitudes, it is of primary interest in QTL mapping to select the appropriate model and to estimate the effects and locations of the QTLs (Broman and Speed 2002; Sillanpää and Corander 2002). A common difficulty in QTL mapping is that QTLs may show environment specificity, i.e., QTL effects may significantly differ across environments (Kang and Gauch 1996).

Several authors have examined multi-environmental data in composite interval mapping (Jansen et al. 1995), where selection of background markers is performed in several steps. Usually, uncorrelated residuals, i.e., no genetic (background) correlation among environments, are assumed in these models. Tinker and Mather (1995) applied composite interval mapping to multi-environmental data using least-squares estimation (Haley and Knott 1992). They included a test for QTL-by-environment interaction and used partial regression coefficients from background markers to control genetic variance due to non-target QTLs. Recently, Yandell et al. (2007) presented a software package called “R/qtlbim” providing Bayesian interval mapping that accounts for gene-by-environment interaction. Verbyla et al. (2003) computed a multiplicative mixed model for QTL-by-environment interaction of the factorial analysis type. The mixed-model method and least-squares estimation were used by Piepho (2000). In this study, the genetic correlation among environments was also taken into account. In order to consider genetic correlations, Jiang et al. (1999) used a multi-trait approach of Jiang and Zeng (1995) and regarded expressions of the same trait in different environments as different traits. Fixed effects were pre-corrected by SAS software prior to the QTL analysis. Also, Boer et al. (2007) proposed a modeling approach of genotype-by-environment interactions accounting for genetic correlations between environments and error structure within environments of F₅ maize testcross progenies. A multi-locus analysis was applied by Crossa et al. (1999). In this study, partial least-squares regression and factorial regression models were used utilizing genetic markers and environmental covariables for studying QTL-by-environment interaction. Korol et al. (1998) presented an approach where the dependence of a putative QTL effect on environmental conditions is expressed as a function of the environmental mean value of the regarded trait. This strategy allows for considering QTL-by-environment interactions across a large number of environments.

Concerning the known literature, a multi-locus QTL mapping approach that simultaneously considers model selection in multi-environmental data has not been fully developed. Since the magnitude of QTL effects can depend on the specific environmental conditions, it is important to account for these effects in the model.

The objective of our research was to compare different approaches of multi-environmental QTL detection considering Bayesian and Restricted Maximum Likelihood (REML) methods: (i) REML single-locus analysis using a mixed hierarchical model, (ii) REML multi-locus analysis by a forward selection approach applying a mixed hierarchical model, (iii) Bayesian multi-locus mapping analyzing one environment at a time, and (iv) Bayesian multi-locus mapping in all environments jointly.

For this purpose, we used field data for an advanced backcross BC₂DH population derived from a cross of the malting barley cultivar Scarlett with the wild barley accession ISR42-8 from Israel (von Korff et al. 2006). In order to verify the results obtained with the real dataset, additionally a simulation study was performed. First, in a REML single-locus analysis, a mixed hierarchical model was computed in the Mixed procedure of the software package SAS 9.1 (SAS Institute 2004). Then, the same statistical model was applied by using a forward selection approach where the most significant marker of the current one-dimensional search round was always taken as a fixed cofactor in the model of the next estimation round. Furthermore, we applied a Bayesian multi-locus approach that was extended to handle multi-environmental data. In this approach, only marker points were considered as putative QTLs. In all cases, it was possible to account for QTL effects in multiple environments. This was compared to a Bayesian model where separate single-environmental analyses were executed for one environment at a time. In all analyses, we assumed the absence of genetic (background) correlation among environments.

Materials and methods

Real dataset of a spring barley population

A population with 301 BC₂DH lines originating from the cross of the German spring barley variety Scarlett with the Israeli wild barley accession ISR42-8 was developed. The BC₂DH population was genotyped by 98 SSR markers. Phenotypic evaluation of the traits “ears per m²” (Ear), “days until heading” (Hea), “plant height” (Hei), “thousand grain weight” (Tgw), and “grain yield” (Yld) was carried out under field conditions in unreplicated experiments at four different locations during the seasons 2003 and 2004. Data on the parental lines were collected but, as we considered the BC₂DH lines for QTL mapping, not included in the analysis. A detailed description is given in von Korff et al. (2004, 2006).

Simulation study

In the computer simulation, the real marker data of 301 BC₂DH lines were used by imposing known simulated genetic effects influencing the quantitative phenotype. For the genetic effects, we simulated markers with a main effect only, markers with an interaction effect only (crossover and non-crossover), and markers with both a main and an interaction effect. The positions and effect sizes of the simulated markers are presented in Table 5. As in the field dataset, a population of 301 DH lines was assumed to be cultivated in six different environments. Normally distributed phenotypic values of a trait with a heritability of 0.59 were simulated. In the simulation, residuals were assumed to be independent (no correlation structure) with a standard deviation of 1.2 [N(0, 1.2)], and the variance was considered to be the same for all environments. Also, no additional environmental effects were generated.
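The simulation setup described above can be sketched in a few lines of Python. This is a minimal illustration only, not the code used in the study: the 0/1 genotype coding, the function name `simulate_phenotypes`, and the array layout are assumptions made for the example, and the effect values used in any call would have to come from Table 5.

```python
import numpy as np

def simulate_phenotypes(genotypes, main_effects, interaction_effects,
                        residual_sd=1.2, seed=0):
    """Simulate phenotypes of lines grown in several environments.

    genotypes: (n_lines, n_markers) array of 0/1 codes (assumed coding,
        1 = exotic allele).
    main_effects: (n_markers,) effects shared by all environments.
    interaction_effects: (n_markers, n_env) environment-specific effects.
    Returns an (n_lines, n_env) array of phenotypic values.
    """
    rng = np.random.default_rng(seed)
    genotypes = np.asarray(genotypes, dtype=float)
    main = genotypes @ np.asarray(main_effects, dtype=float)           # (n_lines,)
    inter = genotypes @ np.asarray(interaction_effects, dtype=float)   # (n_lines, n_env)
    # Independent residuals with the same standard deviation in every environment.
    noise = rng.normal(0.0, residual_sd, size=inter.shape)
    return main[:, None] + inter + noise
```

A marker with a non-zero entry in `main_effects` and zeros in its row of `interaction_effects` corresponds to a pure main effect; opposite signs across environments in a row of `interaction_effects` would give a crossover interaction.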

QTL mapping strategies

In our study, we compared different approaches of multi-locus multi-environmental QTL detection in the real and in the simulated dataset:

(1)
REML single-locus analysis

The single-locus analysis was performed with SAS 9.1 software (SAS Institute 2004) using the REML method of the Mixed procedure. The applied mixed hierarchical model was as follows:

$$ Y_{ijkm} = \mu + M_{i} + L_{j} \left( {M_{i} } \right) + E_{k} + M_{i} * E_{k} + \varepsilon_{{m\left( {ijk} \right)}} $$

where $Y_{ijkm}$ denotes the phenotypic observations, $\mu$ the general mean, $M_{i}$ the fixed effect of the ith marker, $L_{j}(M_{i})$ the random effect of the jth BC₂DH line nested in the ith marker, $E_{k}$ the random effect of the kth environment, $M_{i} * E_{k}$ the random interaction effect of the ith marker with the kth environment, and $\varepsilon_{m(ijk)}$ the residual of $Y_{ijkm}$.

In this analysis, the random factor $L_{j}(M_{i})$ can be interpreted as a genetic background effect. The residuals were assumed to be independently and identically normally distributed. For each marker, an F-statistic for testing the marker effect is computed, with the residual mean square as the error term. The marker-by-environment interactions are tested with a t-statistic.

Missing marker data are handled by omitting each observation with a missing marker value from the dataset. Thus, the amount of phenotypic information is reduced due to missing marker data.

The relative performance of the homozygous exotic genotype (RP[Hsp]) was calculated by $ RP\left[ {Hsp} \right] = \frac{Hsp - Hv}{Hv} * 100 $, where Hsp represents the least square mean of the homozygous exotic genotype and Hv the least square mean of the elite genotype.
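As a worked illustration of the RP[Hsp] formula, the following Python sketch computes the relative performance from two least-squares means. The function name and the numeric inputs are hypothetical and are not taken from the study.

```python
def relative_performance(lsmean_exotic, lsmean_elite):
    """RP[Hsp] in percent: (Hsp - Hv) / Hv * 100."""
    if lsmean_elite == 0:
        raise ValueError("elite least-squares mean must be non-zero")
    return (lsmean_exotic - lsmean_elite) / lsmean_elite * 100.0

# Hypothetical least-squares means: the exotic genotype performs
# below the elite genotype, so RP[Hsp] is negative.
rp = relative_performance(45.0, 50.0)
```

A negative value indicates that the homozygous exotic genotype reduces the trait value relative to the elite genotype, a positive value that it increases it.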

The computing time was about 1 min per trait for both the spring barley population and the simulated dataset on a Pentium IV 2.0 GHz processor.

(2)
REML multi-locus analysis using a forward selection approach

The same mixed hierarchical model as described above was applied here for stepwise variable selection in SAS Proc Mixed. The stepwise variable selection strategy is described in Sillanpää and Corander (2002) and has been applied for example in Kilpikari and Sillanpää (2003). The first round of forward selection procedure corresponds to the single-locus analysis. Next, the marker with the most significant effect (based on the P value of hypothesis test Type III F-statistic) is chosen as a fixed cofactor in the model of the following estimation rounds. Using this extended model, the marker effects are estimated again. This procedure is repeated until no further significant markers can be found. The computing time for this method was about 20 min for one trait of the real and of the simulated dataset.
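The logic of the forward selection loop can be sketched as follows. Note the simplification: this sketch replaces the REML mixed hierarchical model fitted in SAS Proc Mixed with an ordinary least-squares fit and a partial F statistic, so it only illustrates the selection procedure, not the model of the study. The function names and the 0/1 marker coding are assumptions.

```python
import numpy as np

def rss(X, y):
    """Residual sum of squares of an ordinary least-squares fit."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return float(resid @ resid)

def forward_selection(markers, y, f_threshold):
    """Greedy forward selection of markers by partial F statistic.

    markers: (n_obs, n_markers) genotype codes; y: (n_obs,) phenotypes.
    f_threshold: critical F value, e.g. from a permutation test.
    Returns the selected marker column indices in order of selection.
    """
    markers = np.asarray(markers, dtype=float)
    y = np.asarray(y, dtype=float)
    n_obs, n_markers = markers.shape
    selected = []
    base = np.ones((n_obs, 1))  # intercept-only model in the first round
    while len(selected) < n_markers:
        rss_base = rss(base, y)
        best_f, best_i = -np.inf, None
        for i in range(n_markers):
            if i in selected:
                continue
            X = np.column_stack([base, markers[:, i]])
            df_resid = n_obs - X.shape[1]
            rss_full = rss(X, y)
            if df_resid <= 0 or rss_full <= 0:
                continue
            # F test for adding marker i to the current cofactor model.
            f = (rss_base - rss_full) / (rss_full / df_resid)
            if f > best_f:
                best_f, best_i = f, i
        if best_i is None or best_f < f_threshold:
            break  # no further significant marker
        selected.append(best_i)
        # The most significant marker becomes a fixed cofactor.
        base = np.column_stack([base, markers[:, best_i]])
    return selected
```

The first pass through the loop corresponds to the single-locus scan; each later pass re-estimates all remaining marker effects conditional on the cofactors chosen so far.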

(3)
Bayesian multi-locus analysis using multi-environmental data

Additionally, we performed Bayesian multi-locus QTL mapping using multi-environmental data.

The statistical model for phenotypic trait values $Y_{jk}$ was as follows:

$$ Y_{jk} = \mu + \sum\limits_{i = 1}^{n} {M_{ij} } + E_{k} + \sum\limits_{i = 1}^{n} {M_{ijk} } + \varepsilon_{jk} $$

where $\mu$ is the overall sample mean of the phenotypes, $M_{ij}$ is the effect of the ith marker genotype of the jth line, $E_{k}$ is the effect of the kth environment, $M_{ijk}$ is the effect of the ith marker genotype of the jth line in the kth environment (i.e., genotype-by-environment interaction), and n is the number of markers.

Residuals are assumed to be independently and identically normally distributed as $\varepsilon_{jk} \sim N(0, \sigma_{0}^{2})$, where $\sigma_{0}^{2}$ is the residual variance common to all environments.

In the Bayesian setting, we parametrized the statistical model so that for each marker one genotype effect was assigned a value of zero; thus, for each marker we only needed to estimate one main effect $M_{ij}$. Similarly, in each environment for each marker, one environment-specific genotype effect was assigned a value of zero, resulting in one estimable coefficient $M_{ijk}$ in each environment. By denoting the genotype (A or B) of line j at marker i with $X_{ij}$, the effects can be written as $ M_{ij} = \beta_{i} 1\left( X_{ij} = B \right) $ and $ M_{ijk} = \beta_{ik} 1\left( X_{ij} = B \right) $. The parameters $\beta_{i}$ and $\beta_{ik}$ are interpreted as the difference of the main genotype effects and the differences of environment-specific genotype effects, respectively. Note, however, that unlike REML, this model is still oversaturated. The prior densities of the unknown marker effect differences in a K-environment model, $\theta = (\beta_{1}, \ldots, \beta_{n}, \beta_{11}, \ldots, \beta_{nK})$, were specified following Xu (2003) and Hoti and Sillanpää (2006), where each effect $\theta_{r}$, $r = 1, \ldots, (K + 1)n$, in the statistical model is assigned a zero-mean normal distribution with its own variance parameter $\sigma_{r}^{2}$ combined with Jeffreys’ scale-invariant prior $ p\left( \sigma_{r}^{2} \right) \propto \frac{1}{\sigma_{r}^{2}} $. The prior of the overall mean was $p(\mu) \propto 1$, and the priors of the environmental effects were $ p\left( E \mid \sigma_{E}^{2} \right) = \prod\limits_{k} p\left( E_{k} \mid \sigma_{E}^{2} \right) $, where $ p\left( E_{k} \mid \sigma_{E}^{2} \right) $ is a normal distribution with zero mean and a common variance $ \sigma_{E}^{2} $. The variance of the environmental effects and the variance of the residual term were assigned improper uniform priors, $ p\left( \sigma_{E}^{2} \right) \propto 1 $ and $ p\left( \sigma_{0}^{2} \right) \propto 1 $, respectively.

In order to obtain Markov Chain Monte Carlo (MCMC) samples of the joint posterior distribution of marker effects, Gibbs sampling (Geman and Geman 1984) and Metropolis-Hastings algorithms (Hastings 1970) were used. Here, we give the fully conditional posterior distributions of the environmental effects $E_{k}$ and the effect variance $\sigma_{E}^{2}$. The sampling distributions/updating steps of the remaining parameters and handling of missing data are described in Hoti and Sillanpää (2006). The fully conditional posterior distribution of the environmental effect $E_{k}$ is a normal distribution with mean $ \sum\limits_{j = 1}^{N} e_{j,k} \left( N + \frac{\sigma_{0}^{2}}{\sigma_{E}^{2}} \right)^{-1} $ and variance $ \sigma_{0}^{2} \left( N + \frac{\sigma_{0}^{2}}{\sigma_{E}^{2}} \right)^{-1} $, where N is the total number of lines. For the effect variance $ \sigma_{E}^{2} $, the fully conditional posterior distribution is the scaled inverted chi-squared distribution with the degree of freedom parameter K and the scale parameter $ \sum\limits_{k = 1}^{K} \sum\limits_{j = 1}^{N} e_{j,k}^{2} $.
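The Gibbs update of the environmental effects can be illustrated with a short Python sketch. It implements only the normal full conditional of E_k with the mean and variance given above. The interpretation of e_{j,k} as the phenotype minus all model terms other than E_k is an assumption made here (the text defers details to Hoti and Sillanpää 2006), and the function name is hypothetical.

```python
import numpy as np

def sample_environment_effects(resid, sigma2_0, sigma2_e, rng):
    """One Gibbs update of the environmental effects E_k.

    resid: (N, K) array of e_{j,k}; assumed to be the phenotype of line j
        in environment k minus all model terms except E_k.
    sigma2_0: current residual variance; sigma2_e: current variance of
        the environmental effects.
    Returns (draw, mean, var) with one draw and mean per environment.
    """
    resid = np.asarray(resid, dtype=float)
    n_lines = resid.shape[0]
    shrink = 1.0 / (n_lines + sigma2_0 / sigma2_e)
    mean = resid.sum(axis=0) * shrink   # conditional posterior mean of E_k
    var = sigma2_0 * shrink             # conditional posterior variance
    draw = rng.normal(mean, np.sqrt(var))
    return draw, mean, var
```

When the environmental variance is large relative to the residual variance, the ratio in the denominator vanishes and the conditional mean approaches the plain average of the e_{j,k} in that environment; a small environmental variance shrinks the mean towards zero.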

The Bayesian analysis was implemented using Matlab 7 (2007). The missing values were randomly assigned initial values from their empirical distributions. The MCMC algorithm was run for 400,000 rounds in the field dataset and 50,000 rounds in the simulated dataset. In order to reduce autocorrelation, only every 10th round was stored. In all cases, in the field data the first 380,000 MCMC-rounds (simulated data: 20,000 rounds) were considered to be “burn-in” rounds and were thus not considered in the final results. Computing time was about 33 h for one trait of the real dataset and about 20 min for the simulated dataset.
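The run lengths above determine how many MCMC samples enter the final results. The following Python sketch counts them under the assumption that rounds are numbered from 1 and that every 10th round is stored over the whole run; the function name is hypothetical.

```python
def retained_samples(total_rounds, burn_in, thin):
    """Count stored MCMC samples that remain after discarding the burn-in.

    Assumes rounds are numbered from 1 and every `thin`-th round is stored.
    """
    stored = range(thin, total_rounds + 1, thin)
    return sum(1 for r in stored if r > burn_in)

field = retained_samples(400_000, 380_000, 10)      # field dataset
simulated = retained_samples(50_000, 20_000, 10)    # simulated dataset
```

Under these assumptions, 2,000 stored samples remain for the field dataset and 3,000 for the simulated dataset.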

In the QTL analysis, we obtained estimates of marker main ($M_{ij}$) and interaction ($M_{ijk}$) effects. For each MCMC-sample, the sum of main and corresponding environment interaction effect was calculated. Then, to interpret the results at each marker locus, the median of the posterior distribution of the marker effect over all MCMC-rounds was computed. If this median was non-zero in all environments, this marker had a main effect on the specific trait value. Otherwise, if the median was non-zero in some environments only, it was interpreted as a specific kind of marker-by-environment interaction effect.
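The interpretation rule above can be sketched in Python as follows. The function name, the array layout, and the `threshold` parameter (which decides when a median counts as non-zero) are assumptions made for this illustration.

```python
import numpy as np

def classify_marker(main_samples, interaction_samples, threshold=0.0):
    """Classify one marker from its MCMC samples.

    main_samples: (n_samples,) sampled main effects.
    interaction_samples: (n_samples, n_env) sampled interaction effects.
    threshold: absolute value above which a median counts as non-zero
        (assumed parameter).
    Returns (medians per environment, label).
    """
    main_samples = np.asarray(main_samples, dtype=float)
    interaction_samples = np.asarray(interaction_samples, dtype=float)
    # Sum main and interaction effect within each MCMC sample first,
    # then take the posterior median per environment.
    total = main_samples[:, None] + interaction_samples
    medians = np.median(total, axis=0)
    nonzero = np.abs(medians) > threshold
    if nonzero.all():
        label = "main effect"
    elif nonzero.any():
        label = "marker-by-environment interaction"
    else:
        label = "no effect"
    return medians, label
```

Summing within each sample before taking the median matters, because the median of a sum is in general not the sum of the medians.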

(4)
Bayesian multi-locus analysis using single-environmental data

In order to determine whether multi-environmental QTL testing improves the results, we also conducted the same Bayesian multi-locus mapping as described above, but used data of each environment separately. Thus, the statistical model was reduced to:

$$ Y_{j} = \mu + \sum\limits_{i = 1}^{n} {M_{ij} } + \varepsilon_{j} . $$

In this analysis, computing time of the main analysis was about 9 h for one trait of the real dataset and about 15 min for the simulated dataset.

Analogous to the multi-environmental Bayesian QTL analysis, the posterior median of the marker effects over all MCMC-rounds was computed. In both single- and multi-environmental Bayesian analysis, model selection and parameter estimation were based on adaptive shrinkage (Xu 2003; Hoti and Sillanpää 2006). Note that this approach is closely related to the so-called genome-wide selection (Meuwissen et al. 2001). In the genome-wide selection approach, breeding values are predicted based on molecular markers covering the whole genome. This strategy is in contrast to the use of genetic similarities, which are calculated based on the molecular marker data, in the prediction of breeding values (Bauer et al. 2006, 2008).

Convergence of the MCMC chain

The convergence assessment of the Bayesian mapping strategies was performed by plotting the MCMC paths for the markers with estimated non-zero effects as suggested by Kass et al. (1998).

Significance threshold of estimated marker effects

In order to determine whether the detected QTL effects were due to spurious effects, we estimated an experimentwise critical value following Churchill and Doerge (1994). In this estimation, the data are shuffled by computing random permutations of the phenotypic observation vector. The ith observation is assigned to the line whose index is given by the ith element of the permutation. Thus, the association between marker data and observations is destroyed. The shuffled data were analysed by Bayesian single-environmental and REML single-locus analysis. Overall, 50 permutations were calculated. In order to obtain the experimentwise critical value for a trait analysed by Bayesian single-environmental mapping, first the maximum median of the marker effects of every QTL analysis of permuted data is selected. In REML single-locus analysis, the maximum F value (for marker main effects) and the maximum t value (for marker interaction effects) of all permuted QTL analyses are chosen. In each mapping strategy, these values are ordered. The experimentwise critical value then corresponds to the 100(1 − α) percentile, where α equals 0.05. In order to detect QTL effects in the original data and to determine statistical significance, the results of the QTL analysis can be compared to this critical value.
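The permutation procedure can be sketched in Python as follows. As a stand-in for the F, t, or posterior-median statistics of the actual analyses, this sketch uses the largest absolute marker-phenotype correlation; that substitution, the function names, and the use of `numpy.percentile` for the 100(1 − α) percentile are assumptions of the example.

```python
import numpy as np

def max_abs_correlation(markers, y):
    """Largest absolute marker-phenotype correlation (stand-in statistic)."""
    markers = np.asarray(markers, dtype=float)
    y = np.asarray(y, dtype=float)
    m = markers - markers.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((m ** 2).sum(axis=0) * (yc ** 2).sum())
    denom = np.where(denom > 0, denom, np.inf)  # constant columns give 0
    corr = (m * yc[:, None]).sum(axis=0) / denom
    return float(np.max(np.abs(corr)))

def permutation_threshold(markers, y, n_perm=50, alpha=0.05, seed=0):
    """Experimentwise critical value from permuted phenotypes."""
    rng = np.random.default_rng(seed)
    # Shuffling y destroys the association between markers and phenotypes;
    # the maximum statistic over all markers is recorded per permutation.
    maxima = [max_abs_correlation(markers, rng.permutation(y))
              for _ in range(n_perm)]
    return float(np.percentile(maxima, 100 * (1 - alpha)))
```

Taking the maximum over all markers in each permutation is what makes the resulting critical value experimentwise rather than marker-specific.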

The forward selection approach utilizes the significance threshold obtained from REML single-locus analysis. As computing time was demanding for a Bayesian multi-environmental analysis, the calculation of a permutation analysis was not possible. Therefore, following Hoti and Sillanpää (2006), the MCMC-samples of all traits and markers were standardized to a common scale by multiplying each MCMC-sample with $ \hat{\sigma}_{g} / \hat{\sigma}_{p} $, where $ \hat{\sigma}_{g} $ is the empirical standard deviation of each marker and $ \hat{\sigma}_{p} $ corresponds to the empirical standard deviation of the phenotypic data. In the field dataset, a marker was defined to be significant if its standardized effect was greater than +0.17 or smaller than −0.17. In the simulated dataset, a significance threshold of ±0.10 was chosen; thus, all markers having a standardized effect between −0.10 and +0.10 are not considered to be significant.
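The standardization and thresholding step can be illustrated with a small Python sketch. It assumes that the empirical standard deviation of a marker is computed from numeric genotype codes (0/1 here) with the sample formula (`ddof=1`); both choices, and the function names, are assumptions of this example.

```python
import numpy as np

def standardized_effects(effect_samples, marker_codes, phenotypes):
    """Scale MCMC effect samples by sd(marker) / sd(phenotype)."""
    sd_g = np.std(np.asarray(marker_codes, dtype=float), ddof=1)
    sd_p = np.std(np.asarray(phenotypes, dtype=float), ddof=1)
    return np.asarray(effect_samples, dtype=float) * sd_g / sd_p

def is_significant(std_effect, threshold=0.17):
    """True if the standardized effect lies outside +/- threshold."""
    return abs(std_effect) > threshold

# Hypothetical data: marker sd / phenotype sd = 0.5 in this toy example.
scaled = standardized_effects([0.4, 0.1], [0, 1, 0, 1], [1.0, 3.0, 1.0, 3.0])
```

With the field-data threshold of 0.17, the first toy effect (0.2 after scaling) would count as significant and the second (0.05) would not.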

Putative QTLs

Following Pillen et al. (2003), for each QTL mapping strategy linked significant markers that had a distance of ≤20 cM were interpreted as a single putative QTL.
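The grouping rule can be sketched in Python as follows. The sketch assumes that markers are chained: a significant marker joins the current putative QTL if it lies on the same chromosome and at most 20 cM from the previous significant marker. Whether the original rule chains neighbours in this way is an assumption, and the marker names in the example are hypothetical.

```python
def group_into_qtls(significant_markers, max_gap_cm=20.0):
    """Merge linked significant markers into putative QTLs.

    significant_markers: iterable of (chromosome, position_cM, name).
    Returns a list of dicts with the chromosome and its member markers.
    """
    qtls = []
    for chrom, pos, name in sorted(significant_markers):
        last = qtls[-1] if qtls else None
        if (last is not None and last["chromosome"] == chrom
                and pos - last["markers"][-1][0] <= max_gap_cm):
            last["markers"].append((pos, name))   # extend the current QTL
        else:
            qtls.append({"chromosome": chrom, "markers": [(pos, name)]})
    return qtls

# Hypothetical significant markers on two chromosomes.
example = group_into_qtls([
    ("2H", 10.0, "m1"), ("2H", 25.0, "m2"), ("2H", 60.0, "m3"), ("3H", 15.0, "m4"),
])
```

In this toy example the first two markers on 2H are 15 cM apart and form one putative QTL, while the marker at 60 cM and the marker on 3H each form their own.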

Bin marker map

A Bin marker class was assigned to all used SSR markers following Kleinhofs and Graner (2001) and Costa et al. (2001). Additionally, for the markers HVM62, GBM1015, HVM67, HVLTPPB, HVM36, and GBM1052, Bin classes were also available from the high-density consensus map recently published by Marcel et al. (2007). In the following, Bin classes obtained from Marcel et al. (2007) are given in italics.

Genetic variance explained by a marker

The genetic variance explained by a marker (R²) was computed by:

$$ R^{2} = \frac{{\text{SQ}}_{M}}{{\text{SQ}}_{M} + {\text{SQ}}_{L(M)}} \times 100 $$

with ${\text{SQ}}_{M}$ = sum of squares of markers obtained from hypothesis test Type I; ${\text{SQ}}_{L(M)}$ = Type I sum of squares of lines nested in markers.

In order to obtain SQ_M and SQ_L(M) we calculated the following mixed model in SAS Proc Mixed:

$$ Y_{ijkm} = \mu + M_{i} + L_{j} \left( {M_{i} } \right) + E_{k} + \varepsilon_{{m\left( {ijk} \right)}} $$

where all effects were treated as fixed factors.
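As a worked illustration of the R² formula, the following Python sketch computes the percentage of genetic variance explained from the two Type I sums of squares. The function name and the numeric inputs are hypothetical and not taken from the study.

```python
def explained_genetic_variance(ss_marker, ss_lines_within_marker):
    """R^2 in percent: SQ_M / (SQ_M + SQ_L(M)) * 100."""
    total = ss_marker + ss_lines_within_marker
    if total <= 0:
        raise ValueError("sums of squares must add up to a positive value")
    return ss_marker / total * 100.0

# Hypothetical sums of squares: the marker accounts for one fifth
# of the variation among lines.
r2 = explained_genetic_variance(20.0, 80.0)
```

Because the denominator contains only the marker and the lines-within-marker sums of squares, R² here expresses the marker's share of the genetic variation, not of the total phenotypic variation.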

Heritability

The heritability of the traits was obtained by REML variance component estimation using the Varcomp procedure in the SAS software package:

$$ Y_{jkm} = \mu + L_{j} + E_{k} + \varepsilon_{jkm} . $$

Then the heritability follows from $ h^{2} = \frac{V_{g}}{V_{g} + V_{e}} $, where $V_{g}$ is the genetic variance of the BC₂DH lines and $V_{e}$ is the residual variance.
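The heritability formula and its inverse can be illustrated in Python. The second function shows which genetic variance would be needed for a target heritability; applying it to the simulation settings (heritability 0.59, residual standard deviation 1.2) assumes that the simulated heritability follows the same definition and that 1.2 is a standard deviation rather than a variance. Function names are hypothetical.

```python
def heritability(v_g, v_e):
    """h^2 = V_g / (V_g + V_e)."""
    return v_g / (v_g + v_e)

def genetic_variance_for(h2, v_e):
    """Genetic variance that yields heritability h2 at residual variance v_e."""
    if not 0.0 <= h2 < 1.0:
        raise ValueError("h2 must lie in [0, 1)")
    return h2 * v_e / (1.0 - h2)

# Assumed reading of the simulation settings: V_e = 1.2 ** 2.
v_g_sim = genetic_variance_for(0.59, 1.2 ** 2)
```

Under that reading, the genetic variance in the simulation would be roughly 2.07, which reproduces the stated heritability of 0.59 when inserted into the formula.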

Results

Field data of a spring barley population

In general, similar QTLs were detected using REML single-locus analysis, the REML forward selection approach, Bayesian multi-locus multi-environmental method considering all environments jointly in the analysis, and Bayesian multi-locus single-environmental mapping where each environment is analysed separately (Table 1). Depending on the heritability of the trait, some QTLs could be found to have a significant effect in all four mapping strategies. For example, considering “plant height,” a trait with a high heritability h² of 0.76, three of nine QTLs were detected with all analyses. In contrast, regarding “ears per m²,” a trait with a low heritability of 0.21, only one of 11 QTLs could be found to be significant with all approaches. In addition, only marker main effects were detected using a REML mapping method, whereas both marker main and interaction effects could be found by using a Bayesian approach.

Table 1 Detected QTLs of REML single-locus analysis (I), REML forward selection (II), Bayesian single-environmental (III), and Bayesian multi-environmental (IV) mapping for several traits in the spring barley population

In the following, detailed results of the QTL analyses will be described for every trait separately, where traits are grouped according to their heritability (Table 1):

“Days until heading” (h² ≈ 0.77)

Overall, 15 QTLs distributed over all chromosomes were found to be significant for the trait “days until heading.” Two QTLs were significant for four analyses, three QTLs for three analyses, four QTLs for two analyses, and six QTLs were found in one analysis.

“Plant height” (h² ≈ 0.76)

For “plant height,” nine QTLs on the chromosomes 2H, 3H, 4H, 5H, and 7H were detected. Three QTLs each were found in all four approaches, in three analyses, and with only one strategy.

“Grain yield” (h² ≈ 0.70)

Eleven QTLs for the trait “grain yield” were located on the chromosomes 1H, 2H, 3H, 5H, and 7H. Two QTLs were found with all QTL mapping strategies, two QTLs were detected with three analyses, two QTLs with two analyses, and five QTLs were found in only one mapping strategy.

“Thousand grain weight” (h² ≈ 0.54)

For the trait “thousand grain weight” the analyses revealed 11 QTLs on all chromosomes with the exception of 5H. One QTL was detected with all mapping approaches, four QTLs with three analyses, one QTL with two analyses, and five QTLs were found to be significant in only one analysis.

“Ears per m²” (h² ≈ 0.21)

For the trait “ears per m²,” overall, 11 QTLs could be detected on all chromosomes except 7H. One QTL was found to be significant in all four mapping strategies, one QTL in three analyses, two QTLs in two analyses, and seven QTLs were detected in only one approach.

In order to illustrate the QTL mapping strategies, the results of all statistical analyses will be presented in more detail for the trait “grain yield.” Considering REML single-locus analysis, overall, 14 markers on chromosomes 1H, 2H, 3H, and 7H (Table 2) showed an F value greater than the significance threshold (obtained from the permutation test). The P value of the F test ranged between 0.001 and 0.017, and the estimated marker effects of the exotic allele ranged between −11.46 and −2.46. When the REML forward selection approach was performed, only four markers had an F value greater than the significance threshold. These markers showed a P value of the F test ranging from 0.001 to 0.009 and estimated effects from −7.35 to −3.35.

Table 2 Significant SSR markers with their chromosomal positions, estimated effects of the exotic allele, F and P values from REML single-locus analysis and the REML forward selection approach for the trait “grain yield” of the spring barley population

Table 3 Occupancy probabilities P of marker effects being higher than the significance threshold in Bayesian multi-environmental mapping for all traits in the spring barley population

Table 4 Detected QTLs by REML and Bayesian analyses in the spring barley population compared to QTL mapping studies using other barley populations and different molecular markers

In Bayesian single-environmental mapping, overall, 14 markers showed a significant effect resulting in nine QTLs (Fig. 1; Table 1). None of the markers showed a significant effect in all six environments (Fig. 1). In contrast, considering Bayesian multi-environmental analysis, only five markers having a significant effect were mapped, yielding five QTLs (Fig. 2; Table 1). In Bayesian multi-environmental mapping markers flanking a significant QTL on the same chromosome often showed negligible effects (Fig. 2). For example, the marker 11 has estimated (standardized) effects between 0.22 and 0.33, and is hence defined to be significant. The flanking markers with the numbers 12–15 have small (standardized) effects ranging from −0.05 to +0.04.