Abstract
Genomic selection (GS) can potentially accelerate genetic improvement of soybean [Glycine max L. (Merrill)] by reducing the time to complete breeding cycles. The objectives of this study were to (1) explore the accuracy of GS in soybean, (2) evaluate the contribution of intrapopulational structure to the accuracy of GS, and (3) compare the efficiencies of phenotypic selection and GS in soybean. For this, phenotypic and genotypic data were collected from 324 soybean genotypes (243 recombinant inbred lines and 81 cultivars) and GS was performed for five yield related traits. BayesB methodology with a 10-fold cross-validation was used to compute accuracies. The GS accuracies were evaluated for grain yield, plant height, insertion of first pod, days to maturity, and 1000-grain weight at eight locations. We found that GS can reduce the time required to complete a selection cycle in soybean, which can lead to increased production of this commercially important crop. Furthermore, genotypic accuracy was similar regardless of population structure correction.


References
Abell CE, Dekkers JC, Rothschild MF, Mabry JW, Stalder KJ (2014) Total cost estimation for implementing genome-enabled selection in a multi-level swine production system. Genet Sel Evol 46:32. https://doi.org/10.1186/1297-9686-46-32
Alexandratos N, Bruinsma J (2012) World agriculture towards 2030/2050, the 2012 revision. ESA Working Paper 12–03. Food and Agriculture Organization of the United Nations (FAO)., Rome http://www.fao.org/docrep/016/ap106e/ap106e.pdf
Aljanabi SM, Martinez I (1997) Universal and rapid salt-extraction of high quality genomic DNA for PCR-based techniques. Nucleic Acids Res 25:4692–4693
Bassi FM, Bentley AR, Charmet G, Ortiz R, Crossa J (2016) Breeding schemes for the implementation of genomic selection in wheat (Triticum spp.). Plant Sci 242:23–36. https://doi.org/10.1016/j.plantsci.2015.08.021
Bates D, Maechler M, Bolker B, Walker B (2015) Fitting linear mixed-effects models using lme4. J Stat Softw 67:1–48. https://doi.org/10.18637/jss.v067.i01
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635. https://doi.org/10.1093/bioinformatics/btm308
Conab (2017) Companhia Nacional de Abastecimento. Séries históricas. Available at: https://www.conab.gov.br/info-agro/safras/serie-historica-das-safras?start=20. Accessed 1 Feb 2018
Core Team R (2017) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna URL https://www.R-project.org/
de Felipe M, Gerde JA, Rotundo JL (2016) Soybean genetic gain in maturity groups III to V in Argentina from 1980 to 2015. Crop Sci 56:3066–3077. https://doi.org/10.2135/cropsci2016.04.0214
de los Campos G, Hickey JM, Pong-Wong R, Daetwyler HD, Calus MPL (2013a) Whole-genome regression and prediction methods applied to plant and animal breeding. Genetics 193:327–345. https://doi.org/10.1534/genetics.112.143313
de los Campos G, Vazquez AI, Fernando R, Klimentidis YC, Sorensen D (2013b) Prediction of complex human traits using the genomic best linear unbiased predictor. PLoS Genet 9:e1003608. https://doi.org/10.1371/journal.pgen.1003608
de los Campos G, Naya H, Gianola D, Crossa J, Legarra A, Manfredi E, Weigel K, Cotes JM (2009) Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics 182:375–385. https://doi.org/10.1534/genetics.109.101501
Desta ZA, Ortiz R (2014) Genomic selection: genome-wide prediction in plant improvement. Trends Plant Sci 19:592–601. https://doi.org/10.1016/j.tplants.2014.05.006
Falconer DS, Mackay TF (1996). C. 1996. Introduction to quantitative genetics, 4
Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33:1–22
Gianola D (2013) Priors in whole-genome regression: the Bayesian alphabet returns. Genetics 194:573–596. https://doi.org/10.1534/genetics.113.151753
Gianola D, van Kaam JBCHM (2008) Reproducing kernel Hilbert spaces regression methods for genomic assisted prediction of quantitative traits. Genetics 178:2289–2303. https://doi.org/10.1534/genetics.107.084285
Gorjanc G, Battagin M, Dumasy JF, Antolin R, Gaynor RC, Hickey JM (2017) Prospects for cost-effective genomic selection via accurate within-family imputation. Crop Sci 57:216–228. https://doi.org/10.2135/cropsci2016.06.0526
Goslee SC, Urban DL (2007) The ecodist package for dissimilarity-based analysis of ecological data. J Stat Softw 22:1–19. https://doi.org/10.18637/jss.v022.i07
Guo Z, Tucker DM, Basten CJ, Gandhi H, Ersoz E, Guo B, Xu Z, Wang D, Gay G (2014) The impact of population structure on genomic prediction in stratified populations. Appl Genet 127(3):749–762. https://doi.org/10.1007/s00122-013-2255-x
Habier D, Fernando RL, Dekkers JCM (2007) The impact of genetic relationship information on genome-assisted breeding values. Genetics 177:2389–2397. https://doi.org/10.1534/genetics.107.081190
Habier D, Fernando RL, Kizilkaya K, Garrick DJ (2011) Extension of the Bayesian alphabet for genomic selection. BMC Bioinformatics 12:186. https://doi.org/10.1186/1471-2105-12-186
Hatfield JL, Walthall CL (2015) Meeting global food needs: realizing the potential via genetics × environment × management interactions. Agron J 107:1215–1226. https://doi.org/10.2134/agronj15.0076
Heffner EL, Jannink J-L, Iwata H, Souza E, Sorrells ME (2011a) Genomic selection accuracy for grain quality traits in biparental wheat populations. Crop Sci 51:2597–2606. https://doi.org/10.2135/cropsci2011.05.0253
Heffner EL, Jannink J, Sorrells ME (2011b) Genomic selection accuracy using multifamily prediction models in a wheat breeding program. Plant Genome 4:65–75. https://doi.org/10.3835/plantgenome2010.12.0029
Heffner EL, Lorenz AJ, Jannink J, Sorrells ME (2010) Plant breeding with genomic selection: gain per unit time and cost. Crop Sci 50:1681–1690. https://doi.org/10.2135/cropsci2009.11.0662
Hickey JM, Dreisigacker S, Crossa J, Hearne S, Babu R, Prasanna BM, Grondona M, Zambelli A, Windhausen VS, Mathews K, Gorjanc G (2014) Evaluation of genomic selection training population designs and genotyping strategies in plant breeding programs using simulation. Crop Sci 54:1476–1488. https://doi.org/10.2135/cropsci2013.03.0195
Huang M, Cabrera A, Hoffstetter A, Griffey C, Van Sanford D, Costa J, Sneller C (2016) Genomic selection for wheat traits and trait stability. Theor Appl Genet 129:1697–1710. https://doi.org/10.1007/s00122-016-2733-z
Isidro J, Jannink JL, Akdemir D, Poland J, Heslot N, Sorrells ME (2015) Training set optimization under population structure in genomic selection. Theor Appl Genet 128(1):145–158. https://doi.org/10.1007/s00122-014-2418-4
Karmakar PG, Bhatnager PS (1996) Genetic improvement of soybean varieties released in India from 1969 to 1993. Euphytica 90:95–103. https://doi.org/10.1007/BF000251655
Koester RP, Skoneczka JA, Cary TR, Diers BW, Ainsworth EA (2014) Historical gains in soybean (Glycine max Merr.) seed yield are driven by linear increases in light interception, energy conversion, and partitioning efficiencies. J Exp Bot 65:3311–3321. https://doi.org/10.1093/jxb/eru187
Lehermeier C, Schön CC, de los Campos G (2015) Assessment of genetic heterogeneity in structured plant populations using multivariate whole-genome regression models. Genetics. https://doi.org/10.1534/genetics.115.177394
Li Z, Sillanpaa MJ (2012) Overview of LASSO-related penalized regression methods for quantitative trait mapping and genomic selection. Theor Appl Genet 125:419–435. https://doi.org/10.1007/s00122-012-1892-9
Ma Y, Reif JC, Jiang Y, Wen Z, Wang D, Liu Z, Guo Y, Wei S, Wang S, Yang C, Wang H, Yang C, Lu W, Xu R, Zhou R, Wang R, Sun Z, Chen H, Zhang W, Wu J, Hu G, Liu C, Luan X, Fu Y, Guo T, Han T, Zhang M, Sun B, Zhang L, Chen W, Wu C, Sun S, Yuan B, Zhou X, Han D, Yan H, Li W, Qiu L (2016) Potential of marker selection to increase prediction accuracy of genomic selection in soybean (Glycine max L.). Mol Breed 36:113. https://doi.org/10.1007/s11032-016-0504-9
Matei G, Benin G, Woyann LG, Dalló SC, Milioli AS, Zdziarski AD (2017) Agronomic performance of modern soybean cultivars in multi-environment trials. Pesq Agrop Brasileira 52(7):500–511. https://doi.org/10.1590/s0100-204x2017000700004
Meuwissen T (2007) Genomic selection: marker assisted selection on a genome wide scale. J Anim Breed Genet 124:321–322. https://doi.org/10.1111/j.1439-0388.2007.00708.x
Meuwissen THE, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819–1829
Morrison MJ, Voldeng HD, Cober ER (1999) Physiological changes from 58 years of genetic improvement of short-season soybean cultivars in Canada. Agron J 91:685–689. https://doi.org/10.2134/agronj1999.914685x
Oliveira EJ, de Resende MDV, da Silva Santos V, Ferreira CF, Oliveira GAF, da Silva MS, de Oliveira LA, Aguilar-Vildoso CI (2012) Genome-wide selection in cassava. Euphytica 187:263–276. https://doi.org/10.1007/s10681-012-0722-0
Pérez P, de los Campos G (2014) Genome-wide regression prediction with the BGLR statistical package. Genetics 198:483–495. https://doi.org/10.1534/genetics.114.164442
Pérez-Rodríguez P, Daniel Gianola D, González-Camacho JM, Crossa J, Manès Y, Dreisigacker S (2012) Comparison between linear and nonparametric regression models for genome-enabled prediction in wheat. G3 (Bethesda) 2:1595–1605. https://doi.org/10.1534/g3.112.003665
Ray DK, Mueller ND, West PC, Foley JA (2013) Yield trends are insufficient to double global food production by 2050. PLoS One 8:E66428. https://doi.org/10.1371/journal.pone.0066428
Rutkoski JE, Heffner EL, Sorrells ME (2011) Genomic selection for durable stem rust resistance in wheat. Euphytica 179:161–173. https://doi.org/10.1007/s10681-010-0301-1
Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, Gill N, Joshi T, Libault M, Sethuraman A, Zhang XC, Shinozaki K, Nguyen HT, Wing RA, Cregan P, Specht J, Grimwood J, Rokhsar D, Stacey G, Shoemaker RC, Jackson SA (2010) Genome sequence of the palaeopolyploid soybean. Nature 463(7278):178–183. https://doi.org/10.1038/nature08670
Spindel J, Begum H, Akdemir D, Virk P, Collard B, Redoña E, Atlin G, Jannink J, McCouch SR (2015) Genomic selection and association mapping in rice (Oryza sativa): effect of trait genetic architecture, training population composition, marker number and statistical model on accuracy of rice genomic selection in elite, tropical rice breeding lines. PLoS Genet 11:e1004982. https://doi.org/10.1371/journal.pgen.1004982
Thavamanikumar S, Dolferus R, Thumma BR (2015) Comparison of genomic selection models to predict flowering time and spike grain number in two hexaploid wheat doubled haploid populations. G3: Genes|Genomes|Genetics 5:1991–1998. https://doi.org/10.1534/g3.115.019745
Visscher PM, Hill WG, Wray NR (2008) Heritability in the genomics era—concepts and misconceptions. Nat Rev Genet 9:255–266. https://doi.org/10.1038/nrg2322
Wickham H (2009) ggplot2: elegant graphics for data analysis. Springer-Verlag, New York
Xavier A, Muir WM, Rainey KM (2016) Assessing predictive properties of genome-wide selection in soybeans. G3: Genes|Genomes|Genetics 6:2611–2616. https://doi.org/10.1534/g3.116.032268
Zhang X, Pérez-Rodríguez P, Burgueño J, Olsen M, Buckler E, Atlin G, Crossa J (2017) Rapid cycling genomic selection in a multi-parental tropical maize population. G3: Genes|Genomes|Genetics 7:2315–2326. https://doi.org/10.1534/g3.117.043141
Acknowledgments
The authors extend their gratitude to Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Capes), for granting the masters and doctoral scholarships. GM, LGW, and IBO were funded by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) PSDE scholarships: 99999.000044/2015-06, 88881.135500/2016-01, and 88881.131685/2016-01, respectively.
Author information
Authors and Affiliations
Contributions
GM, GB, ASGC, and TF conceived and designed the study. GM developed the recombinant inbred lines (RIL). GM, LGW, ASM, ADZ, RZ, and GB designed and carried out the field trials and generated the phenotypic data. GM, RZ, ADZ, and TF designed and carried out the greenhouse experiment for the genotypic data. GM, IBO, and ASGC carried out the statistical analyses. GM, LGW, and ASM wrote the manuscript. All of authors revised and improved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Key Message
Higher efficiency was achieved by genomic selection when compared to phenotypic selection. The correction of population structure did not affect the accuracy of genomic selection models in this soybean population.
Electronic supplementary material
ESM 1
(DOCX 524 kb)
Rights and permissions
About this article
Cite this article
Matei, G., Woyann, L.G., Milioli, A.S. et al. Genomic selection in soybean: accuracy and time gain in relation to phenotypic selection. Mol Breeding 38, 117 (2018). https://doi.org/10.1007/s11032-018-0872-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11032-018-0872-4