Comparison of linear and semi-parametric models incorporating genomic, pedigree, and associated loci information for the prediction of resistance to stripe rust in an Austrian winter wheat breeding program

Morales, Laura; Ametz, Christian; Dallinger, Hermann Gregor; Löschenberger, Franziska; Neumayer, Anton; Zimmerl, Simone; Buerstmayr, Hermann

doi:10.1007/s00122-023-04249-6

Original Article
Open access
Published: 24 January 2023

Volume 136, article number 23, (2023)

Theoretical and Applied Genetics

A Publisher Correction to this article was published on 23 March 2023

This article has been updated

Abstract

Key message

We used a historical dataset on stripe rust resistance across 11 years in an Austrian winter wheat breeding program to evaluate genomic and pedigree-based linear and semi-parametric prediction methods.

Abstract

Stripe rust (yellow rust) is an economically important foliar disease of wheat (Triticum aestivum L.) caused by the fungus Puccinia striiformis f. sp. tritici. Resistance to stripe rust is controlled by both qualitative (R-genes) and quantitative (small- to medium-effect quantitative trait loci, QTL) mechanisms. Genomic and pedigree-based prediction methods can accelerate selection for quantitative traits such as stripe rust resistance. Here we tested linear and semi-parametric models incorporating genomic, pedigree, and QTL information for cross-validated, forward, and pairwise prediction of adult plant resistance to stripe rust across 11 years (2008–2018) in an Austrian winter wheat breeding program. Semi-parametric genomic modeling had the greatest predictive ability and genetic variance overall, but differences between models were small. Including QTL as covariates improved predictive ability in some years where highly significant QTL had been detected via genome-wide association analysis. Predictive ability was moderate within years (cross-validated) but poor in cross-year frameworks.

Introduction

The fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust (yellow rust), an economically important foliar disease of wheat (Triticum aestivum L.). Resistance breeding is the most effective strategy for combating stripe rust epidemics (Chen 2020). Resistance to stripe rust in wheat is both qualitatively and quantitatively inherited (Rosewarne et al. 2008; Zegeye et al. 2014; Waqar et al. 2018; Blake et al. 2019; Ye et al. 2019). While most Yr genes confer complete (qualitative) resistance against specific Pst races and favorable Yr alleles can be efficiently deployed via marker assisted selection (MAS), Yr-mediated resistance can be easily overcome by rapidly evolving Pst populations (Poland et al. 2009; Buerstmayr et al. 2014; Hovmøller et al. 2016; Chen 2020; Klymiuk et al. 2020; Tehseen et al. 2020). Quantitative trait loci (QTL) and adult plant resistance (APR) Yr genes provide partial, race non-specific resistance that can be more durable in comparison with race-specific Yr genes (Poland et al. 2009; Chen 2020) but the effects of quantitative resistance mechanisms can be epistatically masked in the presence of race-specific Yr resistance alleles (Poland and Rutkoski 2016; Michel et al. 2022). Selection for and pyramiding of resistance QTL and APR genes via MAS can be an efficient strategy for achieving high levels of quantitative resistance (Ragimekula et al. 2013; Poland and Rutkoski 2016; Chen 2020).

Genomic prediction is a powerful tool for plant breeding, accelerating the breeding cycle and increasing genetic gain for quantitative traits (Heffner et al. 2010; Heslot et al. 2015; Poland and Rutkoski 2016; Crossa et al. 2017). Both linear and semi-parametric modeling have been shown to accurately predict stripe rust resistance (Juliana et al. 2017; Muleta et al. 2017; Tehseen et al. 2021; Shahinnia et al. 2022), but semi-parametric methods can improve prediction accuracy under epistatic interactions (Gianola et al. 2006; Gianola and Van Kaam 2008; Heslot et al. 2012; Juliana et al. 2017). Incorporating known QTL as model covariates can also enhance prediction accuracy for stripe rust resistance (Juliana et al. 2017; Shahinnia et al. 2022) and other quantitative disease resistance traits (Poland and Rutkoski 2016).

Prediction modeling for stripe rust has been previously assessed only under cross-validation and in highly controlled, artificially inoculated experiments with limited numbers of genotypes and environments (Juliana et al. 2017; Muleta et al. 2017; Tehseen et al. 2021; Shahinnia et al. 2022). Stripe rust resistance mechanisms in an active wheat breeding program can be influenced not only by genetic changes in the wheat population as a result of breeders’ decisions, but also by genetic changes in rapidly evolving Pst populations. As such, the evaluation of prediction models for stripe rust resistance should reflect these dynamic and interacting processes.

Here, we tested the predictive ability of linear and semi-parametric models incorporating genomic, pedigree, and QTL information for stripe rust resistance under cross-validation and various cross-population frameworks using a historical dataset on more than 5000 Austrian winter wheat breeding lines evaluated over 11 years, largely under natural Pst infection (Morales et al. 2021). Linear models included genomic and pedigree-based best linear unbiased prediction (GBLUP and PBLUP, respectively) (Meuwissen et al. 2001; Endelman and Jannink 2012) and semi-parametric models included genomic and pedigree-based reproducing kernel Hilbert spaces prediction (GRKHS and PRKHS, respectively) (Gianola et al. 2006; Gianola and Van Kaam 2008; González-Camacho et al. 2012). The QTL used as prediction model covariates in this study had been previously identified via genome-wide association (GWA) in this material (Morales et al. 2021).

Materials and methods

Phenotypic, genotypic, and pedigree data

Here we analyzed a historical stripe rust dataset from the winter wheat breeding program of Saatzucht Donau GmbH & CoKG (Probstdorf, Austria), as described previously by Morales et al. (2021). Briefly, 20,529 genotypes were evaluated for adult stripe rust resistance on a 1 (most resistant) to 9 (most susceptible) scale in 71 trials across 53 locations from 2008 to 2018, where the majority (60/71 trials) of the trials were naturally infected by Pst (Morales et al. 2021). The phenotypic dataset is highly unbalanced, with most genotypes only evaluated in one plot in one trial (Morales et al. 2021). Within-trial spatial variation in stripe rust severity was adjusted using the “SpATS” package (Rodríguez-Álvarez et al. 2018) in R (R Core Team 2020; Morales et al. 2021). Within and across years, a mixed model was fit with the spatially-adjusted stripe rust plot values as the response, genotype as a fixed effect, and trial as a random effect using the “breedR” package (Muñoz and Sanchez 2020) in R (R Core Team 2020), and the genotype best linear unbiased estimates (BLUEs) were then extracted from the model for further analysis (Morales et al. 2021). The within- and across-year genotype BLUEs (Morales et al. 2021) were used for genomic and pedigree-based prediction in this study (Online Resource 1). For prediction models including multiple years in the training set, we fit mixed models with the spatially-adjusted stripe rust plot values from the years in the training set as the response, genotype as a fixed effect, and trial as a random effect using the “breedR” package (Muñoz and Sanchez 2020) in R (R Core Team 2020) and then extracted the BLUEs for further analysis (Online Resource 1).

Pedigree information was available for 41,461 individuals (Online Resource 2). A subset of 5233 lines selected based on good agronomic performance, grain quality, and disease resistance had also been genotyped with 9744 single nucleotide polymorphisms (SNPs) derived from a custom 6K Illumina marker array (Illumina, Inc., San Diego, CA, USA) and DArTseq (Diversity Arrays Technology Pty Ltd, Canberra, Australia) genotyping-by-sequencing (Akbari et al. 2006; Elshire et al. 2011) technology (Morales et al. 2021) (Online Resource 3). SNP genotypes were coded in terms of alternate alleles “a” and “A,” where −1 = aa (homozygous “a” allele), 0 = Aa (heterozygous), and 1 = AA (homozygous “A” allele), and missing SNP data were imputed with the “missForest” package (Stekhoven and Bühlmann 2012) in R (Morales et al. 2021; R Core Team 2020) (Online Resource 3).
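The −1/0/1 coding described above can be illustrated with a short Python sketch. The helper names (`code_snp`, `allele_frequency`) and the example calls are hypothetical and are not taken from the original analysis, which was run in R; missing calls are simply left as `None` here rather than imputed.

```python
# Hypothetical helpers; only the -1/0/1 coding itself comes from the text.
CODES = {"aa": -1, "Aa": 0, "aA": 0, "AA": 1}

def code_snp(calls):
    """Map genotype calls to -1/0/1; unrecognized calls become None (missing)."""
    return [CODES.get(call) for call in calls]

def allele_frequency(coded):
    """Frequency of the "A" allele among non-missing coded genotypes."""
    observed = [g for g in coded if g is not None]
    if not observed:
        raise ValueError("no observed genotypes")
    # A genotype coded g carries (g + 1) copies of "A" out of 2 alleles.
    return sum(g + 1 for g in observed) / (2 * len(observed))

calls = ["AA", "Aa", "aa", "NA", "AA"]   # illustrative calls for one SNP
coded = code_snp(calls)
freq_A = allele_frequency(coded)
```

Under this coding the allele frequency follows directly from the mean genotype, which is the quantity that marker-based relationship matrices typically center on.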

Morales et al. (2021) previously identified 150 SNPs that were significantly associated with stripe rust resistance within 2009, 2010, 2011, 2014, 2015, 2018, and across 2008–2018 in this dataset, representing 56 QTL (Online Resource 4). For years in which no SNPs were significantly detected (2008, 2012, 2013, 2016, 2017), we selected the SNPs with the lowest p-values (Online Resource 4).

Prediction models

All statistical analyses were conducted in R (R Core Team 2020). For genomic and pedigree-based best linear unbiased prediction (GBLUP and PBLUP, respectively) and genomic and pedigree-based reproducing kernel Hilbert spaces prediction (GRKHS and PRKHS, respectively), we used the “breedR” package (Muñoz and Sanchez 2020) to fit the following mixed model (Meuwissen et al. 2001):

$${\varvec{y}}={1}_{n}{\varvec{\mu}}+{\varvec{Z}}{\varvec{u}}+{\varvec{\varepsilon}},$$

where y is the vector of genotype BLUEs for stripe rust resistance, µ is the vector of overall means, Z is the design matrix of random effects, u is the vector of genotype random effects (${\varvec{u}}\sim N(0, {\varvec{K}}{\sigma }_{a}^{2})$), and ε is the vector of residuals (${\varvec{\varepsilon}}\sim N(0,{\varvec{I}}{\sigma }_{\varepsilon }^{2})$). The variance of the genotype term was modeled as Kσ²_a, where K is the realized additive relationship matrix (Endelman and Jannink 2012) and σ²_a is the estimated additive genetic variance (Yu et al. 2006). For each GBLUP model, we calculated K using SNP data from the lines included in the model with the “rrBLUP” package (Endelman and Jannink 2012). For the PBLUP models, we used pedigree data to estimate K for all lines with the “AGHmatrix” package (Amadeu et al. 2016) and then subset K to the genotyped lines included in each model. For the GRKHS and PRKHS models, we used the “BGGE” package (Granato et al. 2018) to model K as the following reproducing Gaussian kernel, computed from the SNP data of the lines included in each GRKHS model or from the pedigree K matrix subset to the genotyped lines in each PRKHS model:

$$\mathrm{K}\left({x}_{i},{x}_{j}\right)=\mathrm{exp}\left(\frac{-{\sum }_{k}{\left({x}_{ik}-{x}_{jk}\right)}^{2}}{q}\right),$$

where the numerator is the squared Euclidean distance between individuals i and j, based on SNPs (GRKHS) (González-Camacho et al. 2012) or on twice the coefficient of ancestry (PRKHS) (Juliana et al. 2017), and the scaling factor q is a percentile of the squared Euclidean distances (González-Camacho et al. 2012).
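As a rough illustration of the Gaussian kernel above, the following Python sketch computes K from a small marker matrix coded −1/0/1. It is not the “BGGE” implementation: the function names are hypothetical, and using the median off-diagonal squared distance as the percentile q is an assumption of this sketch, since the text does not state which percentile was used.

```python
import math
from statistics import median

def squared_distances(markers):
    """Pairwise squared Euclidean distances between rows of a marker matrix."""
    n = len(markers)
    d2 = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            dist = sum((a - b) ** 2 for a, b in zip(markers[i], markers[j]))
            d2[i][j] = d2[j][i] = float(dist)
    return d2

def gaussian_kernel(markers, q=None):
    """Return K with K[i][j] = exp(-d2[i][j] / q).

    If q is omitted, the median off-diagonal squared distance is used;
    that choice of percentile is an assumption of this sketch.
    """
    d2 = squared_distances(markers)
    n = len(d2)
    if q is None:
        q = median(d2[i][j] for i in range(n) for j in range(i + 1, n))
    if q <= 0:
        raise ValueError("q must be positive")
    return [[math.exp(-d2[i][j] / q) for j in range(n)] for i in range(n)]

# Three illustrative lines scored at four SNPs (made-up genotypes).
markers = [[1, 0, -1, 1], [1, 1, -1, 0], [-1, 0, 1, 1]]
K = gaussian_kernel(markers)
```

Each line has a kernel value of 1 with itself, and values decay toward 0 as lines become more distant, so q controls how quickly similarity falls off with marker distance.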

In addition, we incorporated the QTL previously identified via GWA (Morales et al. 2021) in each prediction model (GBLUP-A, PBLUP-A, GRKHS-A, PRKHS-A). For the across-year models and each within-year model, QTL significantly associated with stripe rust across years or in that year (Morales et al. 2021), respectively, were included as fixed covariates. Similarly, for pairwise and forward prediction, the QTL associated in the test year(s) were included as covariates. The following mixed model (Meuwissen et al. 2001) was fit with the “breedR” package (Muñoz and Sanchez 2020):

$${\varvec{y}}={1}_{n}{\varvec{\mu}}+{\varvec{X}}_{i}{\beta }_{i}+\dots +{\varvec{X}}_{j}{\beta }_{j}+{\varvec{Z}}{\varvec{u}}+{\varvec{\varepsilon}},$$

where y is the vector of genotype BLUEs for stripe rust resistance, µ is the vector of overall means, X_i…X_j are the matrices of SNPs i to j, β_i…β_j are the fixed effects of SNPs i to j, Z is the design matrix of random effects, u is the vector of genotype random effects (${\varvec{u}}\sim N(0, {\varvec{K}}{\sigma }_{a}^{2})$), and ε is the vector of residuals (${\varvec{\varepsilon}}\sim N(0,{\varvec{I}}{\sigma }_{\varepsilon }^{2})$). Covariance structures were specified as described previously.

We also conducted ordinary least squares (OLS) regression using a similar approach as described above, with the only difference being that the random genotypic term was not included. The following linear model (Meuwissen et al. 2001), containing fixed effects only, was fit with the “breedR” package (Muñoz and Sanchez 2020):

$${\varvec{y}}={1}_{n}{\varvec{\mu}}+{\varvec{X}}_{i}{\beta }_{i}+\dots +{\varvec{X}}_{j}{\beta }_{j}+{\varvec{\varepsilon}},$$

where y is the vector of genotype BLUEs for stripe rust resistance, µ is the vector of overall means, X_i…X_j are the matrices of SNPs i to j, β_i…β_j are the fixed effects of SNPs i to j, and ε is the vector of residuals (${\varvec{\varepsilon}}\sim N(0,{\varvec{I}}{\sigma }_{\varepsilon }^{2})$).
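The marker-only regression above can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the authors' code: the original fit was done in R, the function names are hypothetical, the data are made up, and the coefficients are obtained by solving the normal equations with a small Gaussian elimination routine.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: covariates are collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x

def ols_fit(snp_rows, y):
    """Return [mu, beta_1, ..., beta_p] minimizing the residual sum of squares."""
    design = [[1.0] + [float(g) for g in row] for row in snp_rows]  # intercept + SNPs
    p = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(design, y)) for i in range(p)]
    return solve(xtx, xty)

def ols_predict(coef, snp_rows):
    """Predict from the intercept and SNP effects for new genotypes."""
    return [coef[0] + sum(b * g for b, g in zip(coef[1:], row)) for row in snp_rows]

# Made-up training data: two SNP covariates coded -1/0/1, noise-free response.
train_snps = [[1, -1], [0, 1], [-1, 0], [1, 1], [-1, -1]]
train_y = [3.0, 5.5, 6.5, 4.0, 6.0]   # equals 5 - 1.5*snp1 + 0.5*snp2
coef = ols_fit(train_snps, train_y)
pred = ols_predict(coef, [[0, 0], [1, 0]])
```

Because no genotype random effect is included, predictions for new lines depend only on their alleles at the covariate SNPs, which is why such a model can only capture the part of resistance explained by those loci.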

Prediction frameworks

We used cross-validation (five-fold, 10 replications) to evaluate each prediction model within and across years. Within each fold of each replication of each model, the response vector y included the genotype BLUEs of the training set and missing values for the test set. K was estimated using SNP data from all genotypes in both the training and test sets. Predictive ability was defined as the Pearson’s correlation between the observed and predicted values of the test set in each fold of each replication. For each model within and across years, we estimated heritability within each replication/fold as the proportion of the total variance explained by the random genotypic term.
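The cross-validation scheme described above (five folds, 10 replications, test-set values masked as missing, Pearson correlation as predictive ability) might be organized as in the following Python sketch. The function names, the random seed, and the toy BLUEs are hypothetical; the model fit itself is left out, so any predictor returning a value per genotype could be plugged in.

```python
import math
import random

def kfold_assignments(ids, k=5, reps=10, seed=1):
    """Yield (rep, fold, test_ids); each id is tested once per replication."""
    rng = random.Random(seed)
    for rep in range(reps):
        shuffled = list(ids)
        rng.shuffle(shuffled)
        for fold in range(k):
            yield rep, fold, set(shuffled[fold::k])

def mask_test(blues, test_ids):
    """Copy the response, replacing test-set values with None (missing)."""
    return {g: (None if g in test_ids else v) for g, v in blues.items()}

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def predictive_ability(blues, predictions, test_ids):
    """Correlation between observed and predicted values of the test set."""
    ordered = sorted(test_ids)
    return pearson([blues[g] for g in ordered], [predictions[g] for g in ordered])

# Toy BLUEs for 23 hypothetical lines.
blues = {f"line{i}": float(i % 7) for i in range(23)}
splits = list(kfold_assignments(blues))
masked = mask_test(blues, splits[0][2])
```

Masking rather than dropping the test genotypes keeps them in the relationship matrix, matching the statement that K was estimated from all genotypes in both the training and test sets.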

Because GRKHS had the best predictive ability and highest heritability overall in the cross-validated analysis and because GBLUP is the most commonly used model for genomic prediction (Zhang et al. 2021), we conducted further cross-year testing on GRKHS and GBLUP. In the forward prediction framework, BLUEs within each year (2009–2018) were used as the test set. For each test set, we constructed progressive training set(s) of BLUEs from the previous year(s), with the first training set only including the year immediately before the test year and the last training set including all years prior to the test year. For example, the training sets for the 2011 test year included BLUEs from 2010, 2009–2010, and 2008–2010. We also evaluated GBLUP and GRKHS between pairs of years. For each pair of years, one year was used as the training set and the other year as the test set, and vice versa. The forward prediction and between-year test sets were selected in two ways: (1) all genotypes in the training and test sets were included (“overlap”) and (2) genotypes that were present in both the training and test sets were excluded from the test set (“no overlap”). For each model, the response vector y included the genotype BLUEs of the training set and missing values for the test set. K was estimated using SNP data from all genotypes in both the training and test sets. Predictive ability was defined as the Pearson’s correlation between the observed and predicted values of the test set and heritability was estimated as the proportion of the total variance explained by the random genotypic term.
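The construction of progressive training sets and of the “overlap” and “no overlap” test sets can be sketched as follows. The function names and the example genotype IDs are hypothetical, and 2008 is taken as the first available year, as stated for this dataset.

```python
def forward_training_sets(test_year, first_year=2008):
    """Progressive training windows, each ending the year before test_year."""
    return [list(range(start, test_year))
            for start in range(test_year - 1, first_year - 1, -1)]

def select_test_set(train_ids, test_ids, overlap=True):
    """Return the test genotypes to evaluate.

    overlap=True keeps every test-year genotype; overlap=False drops those
    that were also observed in the training year(s).
    """
    test_ids = set(test_ids)
    return test_ids if overlap else test_ids - set(train_ids)

windows_2011 = forward_training_sets(2011)

# Made-up genotype IDs for one training/test pair.
train_ids = {"g1", "g2", "g3"}
test_ids = {"g3", "g4", "g5"}
with_overlap = select_test_set(train_ids, test_ids, overlap=True)
without_overlap = select_test_set(train_ids, test_ids, overlap=False)
```

For the 2011 test year this reproduces the three training windows named in the text (2010, 2009–2010, and 2008–2010), and the two test-set variants differ only in whether lines already phenotyped in the training years are scored.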

Results

Comparison of cross-validated stripe rust prediction models within and across years

Cross-validated predictive ability (PA) for stripe rust was moderate, with a grand mean of PA = 0.40 ± 0.24. The number of lines per year ranged from 47 to 1639 (Table 2). Overall, the difference in predictive ability among kinship-based models (all models except OLS) was small, ranging from PA = 0.37 for PBLUP to PA = 0.49 for GRKHS-A, while OLS had the lowest predictive ability (PA = 0.22) (Table 1). GRKHS and GRKHS-A had the greatest heritability (h² = 0.75–0.79), while GBLUP and GBLUP-A had the lowest heritability (h² = 0.34–0.42) (Table 2). In an overall comparison among years, predictive ability was highest across years and within 2014 and 2016 (PA = 0.57–0.58) and lowest within 2008 and 2009 (PA = 0.29–0.30), while heritability was highest within 2009 (h² = 0.84) and lowest within 2011 (h² = 0.26) (Tables 1 and 2). Predictive ability and heritability were weakly positively correlated (r = 0.09, p = 9 × 10⁻¹¹).

Table 1 Predictive ability of cross-validated stripe rust prediction models within and across years (2008–2018)

Table 2 Heritability of cross-validated stripe rust prediction models within and across years (2008–2018)

Predictive ability was highest within 2009 and 2010 with GBLUP (Table 1). GRKHS best predicted 2011, while GRKHS-A had the best predictive ability within 2008, 2012, 2014, 2015, and 2017 (Table 1). Within 2016, GRKHS-A, PRKHS, and PRKHS-A had the highest predictive ability, while PRKHS-A best predicted 2018. In the across-years analysis, predictive ability was best with GRKHS, GRKHS-A, and PRKHS-A (Table 1). All kinship-based models performed equally within 2013 (Table 1).

For the genomic prediction methods, including QTL as covariates did not significantly improve predictive ability. Overall and within/across years, GBLUP and GBLUP-A performed equally, as did GRKHS and GRKHS-A (Table 1). However, pedigree-based models that included QTL covariates had higher predictive ability than their counterparts without QTL covariates in some cases. Overall, PBLUP-A and PRKHS-A had better predictive ability than PBLUP and PRKHS (Table 1). Similarly, PBLUP-A and PRKHS-A had higher predictive ability than PBLUP and PRKHS within 2008, 2014, and 2015, and across years (Table 1). Within 2008, 2014, and 2015, OLS had predictive ability comparable to or higher than PBLUP-A and PRKHS-A (Table 1).

Comparison of between-year and forward prediction models for stripe rust

Both between-year and forward predictive ability were generally poor, with a grand mean of PA = 0.12 ± 0.14 for the between-year framework and PA = 0.14 ± 0.14 for forward prediction (Table 3, Figs. 1 and 2). Overall, the models in which genotypes present in both the training and test sets were excluded from the test set (GBLUP–no overlap; GRKHS–no overlap; PA_between = 0.14; PA_forward = 0.17) had better predictive ability than the models where all genotypes in the training and test sets were included (GBLUP–overlap; GRKHS–overlap; PA_between = 0.09–0.11; PA_forward = 0.10–0.12) (Table 3). Heritability was also higher with the “no overlap” models compared to the “overlap” models and the GRKHS models had greater heritability than their respective GBLUP models (Table 3).

Table 3 Predictive ability and heritability of between-year and forward prediction models for stripe rust

In the between-year framework, there was no consistent relationship between predictive ability and the number of years between the training and test sets (Pearson’s correlation r = 0.002; p = 0.9), although some trends were apparent. For example, the years 2013–2015 predicted each other better than they predicted other years (Fig. 1). In addition, the training years 2008–2012 tended to have better predictive ability for the test years 2016–2017 than for other test years, and vice versa (Fig. 1). The test year 2018 was poorly predicted by all training years (Fig. 1). The phenotypic correlation between pairs of years was generally higher than the corresponding genomic predictive ability (Fig. 1, Table 4). Adjacent pairs of years tended to have higher phenotypic correlations than pairs further apart in time (Table 4). The number of lines shared between pairs of years ranged from 14 to 541 (Table 4).

Table 4 Phenotypic correlations and number of shared genotypes between pairs of years from 2009–2018

In the forward prediction framework, we found no apparent trend with respect to the number of previous years in the training set versus predictive ability (Pearson’s correlation r = 0.1; p = 0.8). However, we did find trends in predictive ability among test years and models. With the GBLUP–no overlap and GRKHS–no overlap models, predictive ability was higher for the test years 2012 (PA = 0.34 ± 0.02), 2015 (PA = 0.26 ± 0.01), and 2017 (PA = 0.29 ± 0.02) than other test years (PA = 0.06–0.19) (Fig. 2). While predictive ability for the test year 2016 was poor with the “no overlap” models (PA = 0.12 ± 0.03), its predictive ability with the “overlap” models was moderate (PA = 0.41 ± 0.07) and comparable to cross-validated predictions (Fig. 2, Table 1). In contrast, prediction for the test year 2013 was very poor with the “overlap” models (PA = −0.14 ± 0.03) compared to the “no overlap” models (PA = 0.13 ± 0.02) (Fig. 2). The test years 2011 (PA = 0.06 ± 0.01) and 2018 (PA = −0.04 ± 0.03) were poorly predicted across all scenarios (Fig. 2).

Discussion

Here, we evaluated linear and semi-parametric methods using genomic, pedigree, and QTL information for prediction of resistance to stripe rust across 11 years in an Austrian winter wheat breeding program. Resistance to stripe rust in an active wheat breeding program is partially influenced by the combination of two dynamic processes: (a) breeders’ decisions about family selection at every generation/year and (b) rapidly changing Pst populations. We found small differences in performance among prediction models, and predictive ability was moderate within years under cross-validation but poor in most cross-year scenarios.

GRKHS modeling yielded the best overall predictive ability in the cross-validated framework, but the difference between GRKHS/GRKHS-A and the other models was small, with non-significant differences from GBLUP modeling (3–4%) and slightly larger differences from the pedigree-based models (4–10%). Previous studies comparing genomic prediction models for stripe rust resistance in wheat found that GRKHS had similar performance to GBLUP (Juliana et al. 2017; Tehseen et al. 2021) and slightly better accuracy than pedigree-based models (Juliana et al. 2017). GRKHS had greater heritability than all other models under both cross-validated and cross-year prediction, with notable differences ranging from 7 to 45% under cross-validation. Previous studies also found that RKHS methods reduce error variance and capture a greater amount of the genetic variance (Gianola et al. 2006; Crossa et al. 2010) and may improve prediction under epistasis (Gianola et al. 2006; Gianola and Van Kaam 2008; Heslot et al. 2012; Juliana et al. 2017).

As expected given the quantitative inheritance of stripe rust resistance in this population (Morales et al. 2021), OLS had poor predictive ability compared to the genomic and pedigree-based kinship models. While genomic prediction is an effective tool for improving quantitative traits (Heffner et al. 2010; Heslot et al. 2015; Poland and Rutkoski 2016; Crossa et al. 2017), approaches that incorporate individual markers, such as OLS and MAS, can be used successfully under less complex genetic architecture and where major QTL are present (Ragimekula et al. 2013; Poland and Rutkoski 2016; Juliana et al. 2017; Chen 2020; Shahinnia et al. 2022).

The QTL used as prediction model covariates here, which had been previously identified in a GWA study in this population, had small effects on stripe rust resistance (Morales et al. 2021). Small-effect QTL are not ideal targets for MAS, but previous studies have found that the inclusion of small- and medium-effect QTL as prediction model covariates can improve predictive ability for stripe rust resistance (Juliana et al. 2017; Shahinnia et al. 2022). The inclusion of QTL covariates in genomic prediction modeling did not significantly increase predictive ability when compared to the respective models without QTL covariates (e.g., GBLUP vs. GBLUP-A) in our dataset. Our results suggest that background quantitative resistance mechanisms were driving the signal for genomic prediction, complementing previous findings of genome-wide selection signatures in this breeding program (Morales et al. 2021). In addition, Morales et al. (2021) found that rapid changes in allele frequencies led to the fixation of QTL detected by GWA in this population. As such, modeling QTL covariates may not be a reliable approach for long-term improvement of genomic prediction for stripe rust resistance in some breeding programs. The utility of QTL in prediction or MAS for resistance to stripe rust largely depends on the plant material. Breeding programs should—and often do—evaluate different strategies for genomic prediction modeling and/or MAS.

In 2011, the Warrior race (genetic group PstS7) emerged across Europe and quickly became the dominant Pst race thereafter (Hovmøller et al. 2016; Global Rust Reference Center 2021). Predictive ability and heritability were very poor for 2011 under both cross-validated and forward prediction. In addition, including genotypes that were present in both the training and test years in between-year genomic prediction modeling reduced predictive ability from 2011 to 2013. These results suggest that some resistance alleles in the population may have become ineffective with the emergence of the Warrior race. However, resistance to stripe rust in this breeding program appears to have been largely driven by quantitative mechanisms, as demonstrated by (a) our previous findings of quantitative inheritance and genome-wide selection (Morales et al. 2021), (b) the lack of improvement in genomic prediction accuracy by incorporating QTL covariates, and (c) the lack of a relationship between proximity in time and predictive ability under cross-year genomic prediction frameworks.

Here, predictive ability for stripe rust resistance was higher under cross-validation than in the cross-year prediction frameworks, similar to previous reports where prediction accuracy for other traits was higher within populations than across populations (Thavamanikumar et al. 2015; Haile et al. 2021; Isidro y Sánchez and Akdemir 2021). Compared to a study on genomic prediction for stripe rust in bread wheat landraces from Afghanistan, we found similar levels of cross-validated predictive ability (Tehseen et al. 2021), while our cross-validated genomic predictive ability results were lower than those reported in advanced lines from the CIMMYT bread wheat program (Juliana et al. 2017) and in a panel of Central European winter wheat (Shahinnia et al. 2022). The higher cross-validated predictive ability in previous studies may have been the result of more highly controlled, replicated, and artificially inoculated experiments (Juliana et al. 2017; Shahinnia et al. 2022).

The data used in this study were distinct from previous experiments in that (a) they derived from an active breeding program, in which more than 5000 genotypes were evaluated, with breeders’ decisions leading to rapid genetic changes in the population over time and (b) the trials were conducted over 11 years at more than 50 locations, largely under natural Pst infection. In addition, all previous studies on genomic prediction modeling for stripe rust resistance have been conducted within populations, while our work has assessed models under both cross-validated (within-year/population) and cross-population (forward and between-year) frameworks (Juliana et al. 2017; Muleta et al. 2017; Tehseen et al. 2021; Shahinnia et al. 2022). Forward and between-year genomic prediction was poor, while phenotypic correlation between pairs of years was moderate. In addition, including genotypes that were observed in both the training and test years in genomic prediction modeling decreased predictive ability in the cross-year frameworks. Spatiotemporal changes in Pst population composition can lead to changes in observed levels of stripe rust resistance, as resistance alleles can break down with genetic changes in the pathogen (Michel et al. 2022). The complex pathosystem between Pst and wheat, especially in an active breeding program, makes genomic prediction for stripe rust challenging in the long term.

Our results suggest that although cross-validated, within-environment prediction can appear promising, genomic prediction across years and germplasm, which would be a more realistic application in a breeding program, may not be sufficient for selection of resistance to stripe rust alone. Screening germplasm for stripe rust resistance in multi-environmental trials is crucial for making informed selection decisions. Although visual phenotypic assessment of stripe rust resistance is less expensive than genotyping, conducting trials across multiple locations/years can be costly (e.g., labor, field space, seed availability) and environmental conditions are not always conducive for Pst infection and stripe rust symptom development, even under artificial inoculation. As such, selective phenotyping and genotyping strategies should be optimized within breeding programs to maximize the efficiency of selection for stripe rust resistance.

Supplementary information

Online Resource 1. Genotype best linear unbiased estimates (BLUEs) for stripe rust resistance within and across years from 2008 to 2018.

Online Resource 2. Pedigree information for 41,461 lines, including genotype IDs of each line and of each parent.

Online Resource 3. Data for 9744 SNPs, including SNP ID, chromosome, physical position (bp), and genotypes of 5233 lines.

Online Resource 4. Information for 64 QTL used as covariates in prediction models, including SNP ID, chromosome, physical position (bp), and year in which the QTL was identified.

Data availability

All phenotypic, genotypic, and pedigree data and results from the analyses presented here are included in the manuscript materials.

Code availability

The scripts used to conduct the analyses presented here are available upon request.

Change history

23 March 2023
A Correction to this paper has been published: https://doi.org/10.1007/s00122-023-04323-z

References

Akbari M, Wenzl P, Caig V et al (2006) Diversity arrays technology (DArT) for high-throughput profiling of the hexaploid wheat genome. Theor Appl Genet 113:1409–1420. https://doi.org/10.1007/s00122-006-0365-4
Akdemir D, Isidro-Sánchez J (2019) Design of training populations for selective phenotyping in genomic prediction. Sci Rep 9:1–15. https://doi.org/10.1038/s41598-018-38081-6
Amadeu RR, Cellon C, Olmstead JW et al (2016) AGHmatrix: R Package to construct relationship matrices for autotetraploid and diploid species: a blueberry example. Plant Genome 9:1–10. https://doi.org/10.3835/plantgenome2016.01.0009
Blake VC, Woodhouse MR, Lazo GR et al (2019) GrainGenes: Centralized small grain resources and digital platform for geneticists and breeders. Database 2019:1–7. https://doi.org/10.1093/database/baz065
Buerstmayr M, Matiasch L, Mascher F et al (2014) Mapping of quantitative adult plant field resistance to leaf rust and stripe rust in two European winter wheat populations reveals co-location of three QTL conferring resistance to both rust pathogens. Theor Appl Genet 127:2011–2028. https://doi.org/10.1007/s00122-014-2357-0
Article PubMed PubMed Central Google Scholar
Chen X (2020) Pathogens which threaten food security: Puccinia striiformis, the wheat stripe rust pathogen. 239–251
Combs E, Bernardo R (2013) Accuracy of genomewide selection for different traits with constant population size, heritability, and number of markers. Plant Genome 6:1–7. https://doi.org/10.3835/plantgenome2012.11.0030
Article CAS Google Scholar
Crossa J, De Los CG, Pérez P et al (2010) Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 186:713–724. https://doi.org/10.1534/genetics.110.118521
Article CAS PubMed PubMed Central Google Scholar
Crossa J, Pérez-Rodríguez P, Cuevas J et al (2017) Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci 22:961–975. https://doi.org/10.1016/j.tplants.2017.08.011
Article CAS PubMed Google Scholar
Elshire RJ, Glaubitz JC, Sun Q et al (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One 6:e19379. https://doi.org/10.1371/journal.pone.0019379
Article CAS PubMed PubMed Central Google Scholar
Endelman JB, Jannink J-L (2012) Shrinkage estimation of the realized relationship matrix. G3 2:1405–1413. https://doi.org/10.1534/g3.112.004259
Article PubMed PubMed Central Google Scholar
Gianola D, Van Kaam JBCHM (2008) Reproducing kernel Hilbert spaces regression methods for genomic assisted prediction of quantitative traits. Genetics 178:2289–2303. https://doi.org/10.1534/genetics.107.084285
Article PubMed PubMed Central Google Scholar
Gianola D, Fernando RL, Stella A (2006) Genomic-assisted prediction of genetic value with semiparametric procedures. Genetics 173:1761–1776. https://doi.org/10.1534/genetics.105.049510
Article CAS PubMed PubMed Central Google Scholar
Global Rust Reference Center (2021) Yellow Rust Tools - maps and charts. https://agro.au.dk/forskning/internationale-platforme/wheatrust/yellow-rust-tools-maps-and-charts/
González-Camacho JM, de los Campos G, Pérez P et al (2012) Genome-enabled prediction of genetic values using radial basis function neural networks. Theor Appl Genet 125:759–771. https://doi.org/10.1007/s00122-012-1868-9
Article PubMed PubMed Central Google Scholar
Granato I, Cuevas J, Luna-Vázquez F et al (2018) BGGE: A new package for genomic-enabled prediction incorporating genotype × environment interaction models. G3 8:3039–3047. https://doi.org/10.1534/g3.118.200435
Article PubMed PubMed Central Google Scholar
Haile TA, Walkowiak S, N’Diaye A et al (2021) Genomic prediction of agronomic traits in wheat using different models and cross-validation designs. Theor Appl Genet 134:381–398. https://doi.org/10.1007/s00122-020-03703-z
Article CAS PubMed Google Scholar
Heffner EL, Lorenz AJ, Jannink JL, Sorrells ME (2010) Plant breeding with genomic selection: gain per unit time and cost. Crop Sci 50:1681–1690. https://doi.org/10.2135/cropsci2009.11.0662
Article Google Scholar
Heslot N, Yang HP, Sorrells ME, Jannink JL (2012) Genomic selection in plant breeding: a comparison of models. Crop Sci 52:146–160. https://doi.org/10.2135/cropsci2011.06.0297
Article Google Scholar
Heslot N, Jannink J-L, Sorrells ME (2015) Perspectives for genomic selection applications and research in plants. Crop Sci 55:1–12
Article Google Scholar
Hovmøller MS, Walter S, Bayles RA et al (2016) Replacement of the European wheat yellow rust population by new races from the centre of diversity in the near-Himalayan region. Plant Pathol 65:402–411. https://doi.org/10.1111/ppa.12433
Article Google Scholar
Isidro y Sánchez J, Akdemir D, (2021) Training set optimization for sparse phenotyping in genomic selection: a conceptual overview. Front Plant Sci 12:1–14. https://doi.org/10.3389/fpls.2021.715910
Article Google Scholar
Jannink JL, Lorenz AJ, Iwata H (2010) Genomic selection in plant breeding: from theory to practice. Briefings Funct Genom Proteom 9:166–177. https://doi.org/10.1093/bfgp/elq001
Article CAS Google Scholar
Juliana P, Singh RP, Singh PK et al (2017) Genomic and pedigree-based prediction for leaf, stem, and stripe rust resistance in wheat. Theor Appl Genet 130:1415–1430. https://doi.org/10.1007/s00122-017-2897-1
Article PubMed PubMed Central Google Scholar
Klymiuk V, Fatiukha A, Raats D et al (2020) Three previously characterized resistances to yellow rust are encoded by a single locus Wtk1. J Exp Bot 71:2561–2572. https://doi.org/10.1093/jxb/eraa020
Article CAS PubMed PubMed Central Google Scholar
Liu X, Wang H, Wang H et al (2018) Factors affecting genomic selection revealed by empirical evidence in maize. Crop J 6:341–352. https://doi.org/10.1016/j.cj.2018.03.005
Article Google Scholar
Meuwissen THE, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819–1829. https://doi.org/10.1093/genetics/157.4.1819
Article CAS PubMed PubMed Central Google Scholar
Michel S, Loeschenberger F, Ametz C, Buerstmayr H (2022) Towards combining qualitative race-specific and quantitative race-nonspecific disease resistance by genomic selection. Theor Appl Genet, in review
Morales L, Michel S, Ametz C et al (2021) Genomic signatures of selection for resistance to stripe rust in Austrian winter wheat. Theor Appl Genet 134:3111–3121. https://doi.org/10.1007/s00122-021-03882-3
Article CAS PubMed PubMed Central Google Scholar
Muleta KT, Bulli P, Zhang Z et al (2017) Unlocking diversity in germplasm collections via genomic selection: a case study based on quantitative adult plant resistance to stripe rust in spring wheat. Plant Genome 10:1–15. https://doi.org/10.3835/plantgenome2016.12.0124
Article Google Scholar
Muñoz F, Sanchez L (2020) breedR: Statistical Methods for Forest Genetic Resources Analysts. http://famuvie.github.io/breedR/
Poland J, Rutkoski J (2016) Advances and challenges in genomic selection for disease resistance. Annu Rev Phytopathol 54:79–98. https://doi.org/10.1146/annurev-phyto-080615-100056
Article CAS PubMed Google Scholar
Poland JA, Balint-Kurti PJ, Wisser RJ et al (2009) Shades of gray: the world of quantitative disease resistance. Trends Plant Sci 14:21–29. https://doi.org/10.1016/j.tplants.2008.10.006
Article CAS PubMed Google Scholar
R Core Team (2020) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/
Ragimekula N, Varadarajula NN, Mallapuram SP et al (2013) Marker-assisted selection in disease resistance breeding. J Plant Breed Genet 1:90–109. https://doi.org/10.1016/b978-0-444-63661-4.00009-8
Article Google Scholar
Rodríguez-Álvarez MX, Boer MP, van Eeuwijk FA, Eilers PHC (2018) Correcting for spatial heterogeneity in plant breeding experiments with P-splines. Spat Stat 23:52–71. https://doi.org/10.1016/j.spasta.2017.10.003
Article Google Scholar
Rosewarne GM, Singh RP, Huerta-Espino J, Rebetzke GJ (2008) Quantitative trait loci for slow-rusting resistance in wheat to leaf rust and stripe rust identified with multi-environment analysis. Theor Appl Genet 116:1027–1034. https://doi.org/10.1007/s00122-008-0736-0
Article CAS PubMed Google Scholar
Shahinnia F, Geyer M, Schuermann F et al (2022) Genome-wide association study and genomic prediction of resistance to stripe rust in current Central and Northern European winter wheat germplasm. Theor Appl Genet 135:3583–3595. https://doi.org/10.1007/s00122-022-04202-z
Article CAS PubMed PubMed Central Google Scholar
Stekhoven DJ, Bühlmann P (2012) Missforest-non-parametric missing value imputation for mixed-type data. Bioinformatics 28:112–118. https://doi.org/10.1093/bioinformatics/btr597
Article CAS PubMed Google Scholar
Tehseen MM, Tonk FA, Tosun M et al (2020) Genome-wide association study of resistance to PstS2 and Warrior races of Puccinia striiformis f. sp. tritici (stripe rust) in bread wheat landraces. Plant Genome. https://doi.org/10.1002/tpg2.20066
Article PubMed Google Scholar
Tehseen MM, Kehel Z, Sansaloni CP et al (2021) Comparison of genomic prediction methods for yellow, stem, and leaf rust resistance in wheat landraces from Afghanistan. Plants 10:558. https://doi.org/10.3390/plants10030558
Article PubMed PubMed Central Google Scholar
Thavamanikumar S, Dolferus R, Thumma BR (2015) Comparison of genomic selection models to predict flowering time and spike grain number in two hexaploid wheat doubled haploid populations. G3 5:1991–1998. https://doi.org/10.1534/g3.115.019745
Article PubMed PubMed Central Google Scholar
Waqar A, Khattak SH, Begum S et al (2018) Stripe rust: A review of the disease, Yr genes and its molecular markers. Sarhad J Agric 34:188–201. https://doi.org/10.17582/journal.sja/2018/34.1.188.201
Article Google Scholar
Ye X, Li J, Cheng Y et al (2019) Genome-wide association study of resistance to stripe rust (Puccinia striiformis f. sp. tritici) in Sichuan wheat. BMC Plant Biol 19:1–15. https://doi.org/10.1186/s12870-019-1764-4
Article Google Scholar
Yu J, Pressoir G, Briggs WH et al (2006) A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 38:203–208. https://doi.org/10.1038/ng1702
Article CAS PubMed Google Scholar
Zegeye H, Rasheed A, Makdis F et al (2014) Genome-wide association mapping for seedling and adult plant resistance to stripe rust in synthetic hexaploid wheat. PLoS One 9:e105593. https://doi.org/10.1371/journal.pone.0105593
Article CAS PubMed PubMed Central Google Scholar
Zhang J, Liu F, Reif JC, Jiang Y (2021) On the use of GBLUP and its extension for GWAS with additive and epistatic effects. G3 11:1–12. https://doi.org/10.1093/g3journal/jkab122
Article Google Scholar

Download references

Funding

Open access funding provided by University of Natural Resources and Life Sciences Vienna (BOKU). This work was partially funded by the Austrian Federal Ministry of Agriculture, Regions and Tourism (Grant number DaFNE-101402) within the ERA-NET Cofund on Sustainable Crop Production.

Author information

Authors and Affiliations

Institute of Biotechnology in Plant Production, Department of Agrobiotechnology, University of Natural Resources and Life Sciences Vienna, Tulln, Austria
Laura Morales, Hermann Gregor Dallinger, Simone Zimmerl & Hermann Buerstmayr
Saatzucht Donau GmbH and CoKG, Probstdorf, Austria
Christian Ametz, Hermann Gregor Dallinger, Franziska Löschenberger & Anton Neumayer

Contributions

Conceptualization was contributed by LM; data curation was contributed by LM, SM, CA; formal analysis was contributed by LM; funding acquisition was contributed by HB; investigation was contributed by FL, AN; methodology was contributed by LM; project administration was contributed by LM, SM, FL, AN, HB; resources were contributed by FL, AN, CA, HB; software was contributed by LM, SM, HGD, SZ, CA; supervision was contributed by SM, FL, HB; validation was contributed by LM; visualization was contributed by LM; writing (original draft) was contributed by LM; writing (review and editing) was contributed by LM, SM, HB, HGD.

Corresponding author

Correspondence to Laura Morales.

Ethics declarations

Conflicts of interest

CA, HGD, FL, and AN were employed by the company Saatzucht Donau GmbH & CoKG. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Communicated by Philomin Juliana.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (TXT 848 KB)

Supplementary file2 (TXT 1006 KB)

Supplementary file3 (TXT 116197 KB)

Supplementary file4 (TXT 2 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Morales, L., Ametz, C., Dallinger, H.G. et al. Comparison of linear and semi-parametric models incorporating genomic, pedigree, and associated loci information for the prediction of resistance to stripe rust in an Austrian winter wheat breeding program. Theor Appl Genet 136, 23 (2023). https://doi.org/10.1007/s00122-023-04249-6

Received: 26 April 2022
Accepted: 11 November 2022
Published: 24 January 2023
DOI: https://doi.org/10.1007/s00122-023-04249-6
