Optimization of training sets for genomic prediction of early-stage single crosses in maize

Kadam, Dnyaneshwar C.; Rodriguez, Oscar R.; Lorenz, Aaron J.

doi:10.1007/s00122-020-03722-w

Optimization of training sets for genomic prediction of early-stage single crosses in maize

Original Article
Published: 04 January 2021

Volume 134, pages 687–699, (2021)
Cite this article

Theoretical and Applied Genetics Aims and scope Submit manuscript

Dnyaneshwar C. Kadam¹,
Oscar R. Rodriguez² &
Aaron J. Lorenz ORCID: orcid.org/0000-0002-4361-1683¹

993 Accesses
13 Citations
2 Altmetric
Explore all metrics

Abstract

Key message

Training population optimization algorithms are useful for efficiently training genomic prediction models for single-cross performance, especially if the population is extended beyond only realized crosses to all possible single crosses.

Abstract

Genomic prediction of single-cross performance could allow effective evaluation of all possible single crosses between all inbreds developed in a hybrid breeding program. The objectives of the present study were to investigate the effect of different levels of relatedness on genomic predictive ability of single crosses, evaluate the usefulness of deterministic formula to forecast prediction accuracy in advance, and determine the potential for TRS optimization based on prediction error variance (PEVmean) and coefficient of determination (CDmean) criteria. We used 481 single crosses made by crossing 89 random recombinant inbred lines (RILs) belonging to the Iowa stiff stalk synthetic group with 103 random RILs belonging to the non-stiff stalk synthetic heterotic group. As expected, predictive ability was enhanced by ensuring close relationships between TRSs and target sets, even when TRS sizes were smaller. We found that designing a TRS based on PEVmean or CDmean criteria is useful for increasing the efficiency of genomic prediction of maize single crosses. We went further and extended the sampling space from that of all observed single crosses to all possible single crosses, providing a much larger genetic space within which to design a training population. Using all possible single crosses increased the advantage of the PEVmean and CDmean methods based on expected prediction accuracy. This finding suggests that it may be worthwhile using an optimization algorithm to select a training population from all possible single crosses to maximize efficiency in training accurate models for hybrid genomic prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Forecasting the accuracy of genomic prediction with different selection targets in the training and prediction set as well as truncation selection

Article 01 August 2015

The effects of training population design on genomic prediction accuracy in wheat

Article Open access 19 March 2019

Portability of genomic predictions trained on sparse factorial designs across two maize silage breeding cycles

Article Open access 07 March 2024

Abbreviations

BSSS:: Iowa stiff stalk synthetic
CD:: Coefficient of determination
DHL:: Doubled haploid line
NSSS:: Non-stiff stalk synthetic
PEV:: Prediction error variance
RIL:: Recombinant inbred lines
TRS:: Training set
TS:: Test set

References

Akdemir D (2017) STPGA: selection of training populations with a genetic algorithm. BioRxiv. https://doi.org/10.1101/111989
Article Google Scholar
Akdemir D, Isidro-Sánchez J (2019) Design of training populations for selective phenotyping in genomic prediction. Sci Rep 9:1446. https://doi.org/10.1038/s41598-018-38081-6
Article CAS PubMed PubMed Central Google Scholar
Akdemir D, Isidro-Sanchez J, Jannink J-L (2015) Optimization of genomic selection training populations with a genetic algorithm. Genet Sel Evol 47:38
Article PubMed PubMed Central Google Scholar
Albrecht T, Wimmer V, Auinger H-J et al (2011) Genome-based prediction of testcross values in maize. Theor Appl Genet 123:339
Article PubMed Google Scholar
Albrecht T, Auinger H-J, Wimmer V et al (2014) Genome-based prediction of maize hybrid performance across genetic groups, testers, locations, and years. Theor Appl Genet 127:1375–1386
Article PubMed Google Scholar
Araus JL, Cairns JE (2014) Field high-throughput phenotyping: the new crop breeding frontier. Trends Plant Sci 19:52–61
Article CAS PubMed Google Scholar
Bernardo R (1994) Prediction of maize single-cross performance using RFLPs and information from related hybrids. Crop Sci 34:20–25. https://doi.org/10.2135/cropsci1994.0011183X003400010003x
Article Google Scholar
Bernardo R (2002) Breeding for quantitative traits in plants. Stemma Press, Woodbury
Google Scholar
Bernardo R (2020) Breeding for quantitative traits in plants, 3rd edn. Stemma Press, Woodbury
Google Scholar
Butler DG, Cullis BR, Gilmour AR, Gogel BJ (2009) Mixed models for S language environments: ASReml-R reference manual. Training and development series, no. QE02001 QLD Department of Primary Industries and Fisheries, Brisbane
Clark SA, Hickey JM, Daetwyler HD, van der Werf JH (2012) The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes. Genet Sel Evol 44:4
Article PubMed PubMed Central Google Scholar
Daetwyler HD, Villanueva B, Woolliams JA (2008) Accuracy of predicting the genetic risk of disease using a genome-wide approach. PLoS ONE 3(10):e3395. https://doi.org/10.1371/journal.pone.0003395
Article CAS PubMed PubMed Central Google Scholar
Daetwyler HD, Pong-Wong R, Villanueva B, Woolliams JA (2010) The impact of genetic architecture on genome-wide evaluation methods. Genetics 185:1021–1031
Article CAS PubMed PubMed Central Google Scholar
Dekkers JM (2007) Marker-assisted selection for commercial crossbred performance. J Anim Sci 85:2104–2114
Article CAS PubMed Google Scholar
Elshire RJ, Glaubitz JC, Sun Q et al (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6:e19379
Article CAS PubMed PubMed Central Google Scholar
Endelman JB (2011) Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4:250–255
Article Google Scholar
Fristche-Neto R, Akdemir D, Jannink J-L (2018) Accuracy of genomic selection to predict maize single-crosses obtained through different mating designs. Theor Appl Genet 131:1153–1162. https://doi.org/10.1007/s00122-018-3068-8
Article CAS Google Scholar
Glaubitz JC, Casstevens TM, Lu F et al (2014) TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS ONE 9:e90346
Article PubMed PubMed Central Google Scholar
Guo T, Yu X, Li X et al (2019) Optimal designs for genomic selection in hybrid crops. Mol Plant 12:390–401. https://doi.org/10.1016/j.molp.2018.12.022
Article CAS PubMed Google Scholar
Isidro J, Jannink J-L, Akdemir D et al (2015) Training set optimization under population structure in genomic selection. Theor Appl Genet 128:145–158
Article PubMed Google Scholar
Jacobson A, Lian L, Zhong S, Bernardo R (2014) General combining ability model for genome-wide selection in a biparental cross. Crop Sci 54:895–905
Article Google Scholar
Jannink J-L, Lorenz AJ, Iwata H (2010) Genomic selection in plant breeding: from theory to practice. Brief Funct Genom 9:166–177. https://doi.org/10.1093/bfgp/elq001
Article CAS Google Scholar
Kadam DC, Lorenz AJ (2019) Evaluation of nonparametric models for genomic prediction of early-stage single crosses in maize. Crop Sci 59:1411–1423. https://doi.org/10.2135/cropsci2017.11.0668
Article CAS Google Scholar
Kadam DC, Potts SM, Bohn MO et al (2016) Genomic prediction of single crosses in the early stages of a maize hybrid breeding pipeline. G3 Genes Genomes Genet 6:3443–3453
Google Scholar
Laloë D (1993) Precision and information in linear models of genetic evaluation. Genet Sel Evol 25:557–576
Article PubMed Central Google Scholar
Lin Z, Hayes BJ, Daetwyler HD (2014) Genomic selection in crops, trees and forages: a review. Crop Past Sci 65:1177–1191
Article Google Scholar
Lorenz A, Nice L (2017) Training population design and resource allocation for genomic selection in plant breeding. In: Varshney RK, Roorkiwal M, Sorrells ME (eds) Genomic selection for crop improvement. Springer, Berlin, pp 7–22
Chapter Google Scholar
Lorenz AJ, Chao S, Asoro FG et al (2011) Genomic selection in plant breeding: knowledge and prospects. In: Sparks DL (ed) Advances in agronomy. Elsevier, Berlin, pp 77–123
Google Scholar
Massman JM, Gordillo A, Lorenzana RE, Bernardo R (2013) Genomewide predictions from maize single-cross data. Theor Appl Genet 126:13–22
Article PubMed Google Scholar
Meuwissen THE, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819–1829
CAS PubMed PubMed Central Google Scholar
Reif JC, Hallauer AR, Melchinger AE (2005) Heterosis and heterotic patterns in maize. Maydica 50:215–223
Google Scholar
Reif JC, Gumpert F-M, Fischer S, Melchinger AE (2007) Impact of interpopulation divergence on additive and dominance variance in hybrid populations. Genetics 176:1931–1934. https://doi.org/10.1534/genetics.107.074146
Article CAS PubMed PubMed Central Google Scholar
Rincent R, Laloë D, Nicolas S et al (2012) Maximizing the reliability of genomic selection by optimizing the calibration set of reference individuals: comparison of methods in two diverse groups of maize inbreds (Zea mays L.). Genetics 192:715–728
Article CAS PubMed PubMed Central Google Scholar
Sarinelli JM, Murphy JP, Tyagi P, Holland JB, Johnson JW, Mergoum M, Mason RE, Baber A, Harrison S, Sutton R, Griffey CA, Brown-Guedira G (2019) Training population selection and use of fixed effects to optimize genomic predictions in a historical USA wheat panel. Theor Appl Genet 132:1247–1261
Article CAS PubMed PubMed Central Google Scholar
Schulz-Streeck T, Ogutu JO, Karaman Z et al (2012) Genomic selection using multiple populations. Crop Sci 52:2453–2461
Article Google Scholar
Seye AI, Bauland C, Charcosset A, Moreau L (2020) Revisiting hybrid breeding designs using genomic predictions: simulations highlight the superiority of incomplete factorials between segregating families over topcross designs. Theor Appl Genet 133:1995–2010
Article CAS PubMed Google Scholar
Technow F, Riedelsheimer C, Schrag TA, Melchinger AE (2012) Genomic prediction of hybrid performance in maize with models incorporating dominance and population specific marker effects. Theor Appl Genet 125:1181–1194
Article PubMed Google Scholar
Technow F, Schrag TA, Schipprack W et al (2014) Genome properties and prospects of genomic prediction of hybrid performance in a breeding program of maize. Genetics 197:1343–1355
Article PubMed PubMed Central Google Scholar
VanRaden PM (2008) Efficient methods to compute genomic predictions. J Dairy Sci 91:4414–4423
Article CAS PubMed Google Scholar
Xu S, Zhu D, Zhang Q (2014) Prediction hybrid performance in rice using genomic best linear unbiased prediction. Proc Natl Acad Sci USA 111:12456–12461
Article CAS PubMed PubMed Central Google Scholar
Zhao Y, Gowda M, Liu W et al (2012) Accuracy of genomic selection in European maize elite breeding populations. Theor Appl Genet 124:769–776
Article PubMed Google Scholar
Zhao Y, Li Z, Liu G et al (2015) Genome-based establishment of a high-yielding heterotic pattern for hybrid wheat breeding. Proc Natl Acad Sci USA 112:15624–15629. https://doi.org/10.1073/pnas.1514547112
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was partially supported by grants from the Nebraska Corn Board and National Research Initiative or Agriculture and Food Research Initiative Competitive Grants Program (Grant No. 2012-67013-19460) from the USDA National Institute of Food and Agriculture. D.C. Kadam was supported by an Indian Council of Agricultural Research International Fellowship. We are indebted to Ronal Haarman of Corteva (formerly Dupont Pioneer) for growing trials and collecting data at the York, NE location.

Author information

Authors and Affiliations

Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
Dnyaneshwar C. Kadam & Aaron J. Lorenz
Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE, 68583, USA
Oscar R. Rodriguez

Authors

Dnyaneshwar C. Kadam
View author publications
You can also search for this author in PubMed Google Scholar
Oscar R. Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Aaron J. Lorenz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

DCK contributed to the study design, collected the data, analyzed the data, and wrote the manuscript. ORR contributed data. AJL led design of the study, supervised data analysis, and wrote the manuscript.

Corresponding author

Correspondence to Aaron J. Lorenz.

Ethics declarations

Conflict of interest

The authors declare they have no conflict of interest.

Additional information

Communicated by Matthias Frisch.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 14 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kadam, D.C., Rodriguez, O.R. & Lorenz, A.J. Optimization of training sets for genomic prediction of early-stage single crosses in maize. Theor Appl Genet 134, 687–699 (2021). https://doi.org/10.1007/s00122-020-03722-w

Download citation

Received: 20 January 2020
Accepted: 03 November 2020
Published: 04 January 2021
Issue Date: February 2021
DOI: https://doi.org/10.1007/s00122-020-03722-w

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Optimization of training sets for genomic prediction of early-stage single crosses in maize