, 215:54 | Cite as

Genome analysis to identify SNPs associated with oil content and fatty acid components in soybean

  • R. H. G. PriolliEmail author
  • C. R. L. Carvalho
  • M. M. Bajay
  • J. B. Pinheiro
  • N. A. Vello


The nutritional value, flavor and stability of soybean oil are determined by its five dominant fatty acids: saturated palmitic and stearic, monounsaturated oleic, and polyunsaturated linoleic and linolenic acids. Identifying molecular markers or quantitative trait loci associated with these components has the potential to facilitate the development of improved varieties and thus improve soybean oil content and quality. In this study, we used the BARCSoySNP6K BeadChip array to conduct a genome analysis of diverse soybean accessions evaluated for 2 years under Brazilian field conditions. The results demonstrated high broad-sense heritability, suggesting that the soybean genotype panel could be useful for oil trait breeding programs. Moreover, the range of oil trait variation among the plant introductions (PIs) was superior to that among the Brazilian cultivars in this study, indicating that a PI population could be used to find genes controlling these traits. The genome analysis showed that the genetic structure of the soybean germplasm comprised two main genetic groups, and it revealed linkage disequilibrium decay of approximately 300 kb. A total of 19 single-nucleotide polymorphism (SNP) loci on ten different chromosomes significantly associated with palmitic acid, oleic acid and total oil contents were discovered. Analysis of the SNP annotations revealed enzymes associated with several oil-related physiological metabolisms. Loci and specific alleles in our soybean panel that contributed to lower palmitic acid contents and higher oleic acid and total oil contents were identified. Overall, this genome analysis confirmed previous findings and identified SNP markers that may be useful to rapidly improve oil traits in soybean.


Glycine max Soybean germplasm Fatty acids Oil quality Molecular markers Soybean breeding 



This work was supported by the Sao Paulo Research Foundation (FAPESP, project FAPESP/BIOEN 2016/01823-9). M.M. Bajay thanks Coordination for the Improvement of Higher Education Personnel (CAPES); J.B. P and N.A. Vello thank the National Council for Scientific and Technological Development (CNPq) for scholarships.

Supplementary material

10681_2019_2378_MOESM1_ESM.docx (50 kb)
Supplementary material 1 (DOCX 49 kb)
10681_2019_2378_MOESM2_ESM.docx (18 kb)
Supplementary material 2 (DOCX 18 kb)
10681_2019_2378_MOESM3_ESM.docx (17 kb)
Supplementary material 3 (DOCX 17 kb)
10681_2019_2378_MOESM4_ESM.docx (29 kb)
Supplementary material 4 (DOCX 28 kb)
10681_2019_2378_MOESM5_ESM.docx (17 kb)
Supplementary material 5 (DOCX 16 kb)
10681_2019_2378_MOESM6_ESM.jpg (1.4 mb)
Supplemental Figure 1 Pairwise LD values (r2) plotted against the genetic distance of each chromosome estimated among 5,220 SNP loci and 96 soybean accessions (JPEG 1446 kb)
10681_2019_2378_MOESM7_ESM.jpg (86 kb)
Supplemental Figure 2 Quantile-quantile plot using the MLM model for the 96 soybean accessions. Observed and expected -log P values (a) oil content; (b) palmitic acid (c) oleic acid (JPEG 86 kb)


  1. Aransiola EF, Ojumu TV, Oyekola OO, Madzimbamuto TF, Ikhu-Omoregbe DIO (2014) A review of current technology for biodiesel production: state of the art. Biomass Bioenergy 61:276–297CrossRefGoogle Scholar
  2. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635CrossRefGoogle Scholar
  3. Cao YC, Li SG, Wang ZL, Chang FG, Kong JJ, Gai JY, Zhao TJ (2017) Identification of major quantitative trait loci for seed oil content in soybeans by combining linkage and genome-wide association mapping. Front Plant Sci 8:1222CrossRefGoogle Scholar
  4. Clemente TE, Cahoon EB (2009) Soybean oil: genetic approaches for modification of functionality and total content. Plant Physiol 151:1030–1040CrossRefGoogle Scholar
  5. Contreras-Soto RI, de Oliveira MB, Costenaro-da-Silva D, Scapim CA, Schuster I (2017) Population structure, genetic relatedness and linkage disequilibrium blocks in cultivars of tropical soybean (Glycine max). Euphytica 213:173CrossRefGoogle Scholar
  6. Deshmukh R, Sonah H, Patil G, Chen W, Prince S, Mutava R, Vuong T, Valliyodan B, Nguyen HT (2014) Integrating omic approaches for abiotic stress tolerance in soybean. Front Plant Sci 5:244CrossRefGoogle Scholar
  7. Diers BW, Shoemaker RC (1992) Restriction-fragment-length-polymorphism analysis of soybean fatty-acid content. J Am Oil Chem Soc 69:1242–1244CrossRefGoogle Scholar
  8. Earl DA, vonHoldt BM (2012) STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour 4:359–361CrossRefGoogle Scholar
  9. Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14:2611–2620CrossRefGoogle Scholar
  10. Fehr WR, Welke GA, Hammond EG, Duvick DN, Cianzio SR (1991) Inheritance of elevated palmitic acid content in soybean seed oil. Crop Sci 31:1522–1524CrossRefGoogle Scholar
  11. Feng S, Wang SC, Chen CC, Lan L (2011) GWAPOWER: a statistical power calculation software for genome-wide association studies with quantitative traits. BMC Genet 12:12CrossRefGoogle Scholar
  12. Ginestet C (2011) GGPLOT2: elegant graphics for data analysis. J R Stat Soc Ser A Stat Soc 174:245–246CrossRefGoogle Scholar
  13. Gizlice Z, Carter TE, Burton JW (1994) Genetic base for North-American public soybean cultivars released between 1947 and 1988. Crop Sci 34:1143–1151CrossRefGoogle Scholar
  14. Graef G, LaVallee BJ, Tenopir P, Tat M, Schweiger B, Kinney AJ, Van Gerpen JH, Clemente TE (2009) A high-oleic-acid and low-palmitic-acid soybean: agronomic performance and evaluation as a feedstock for biodiesel. Plant Biotechnol J 7:411–421CrossRefGoogle Scholar
  15. Hiromoto DM, Vello NA (1986) The genetic base of Brazilian soybean (Glycine-max (L) Merrill) cultivars. Braz J Genet 9:295–306Google Scholar
  16. Hubisz MJ, Falush D, Stephens M, Pritchard JK (2009) Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour 9:1322–1332CrossRefGoogle Scholar
  17. Hwang EY, Song QJ, Jia GF, Specht JE, Hyten DL, Costa J, Cregan PB (2014) A genome-wide association study of seed protein and oil content in soybean. BMC Genom 15:1CrossRefGoogle Scholar
  18. Hyten DL, Pantalone VR, Saxton AM, Schmidt ME, Sams CE (2004) Molecular mapping and identification of soybean fatty acid modifier quantitative trait loci. J Am Oil Chem Soc 81:1115–1118CrossRefGoogle Scholar
  19. Hyten DL, Choi IY, Song QJ, Specht JE, Carter TE, Shoemaker RC, Hwang EY, Matukumalli LK, Cregan PB (2010) A high density integrated genetic linkage map of soybean and the development of a 1536 universal soy linkage panel for quantitative trait locus mapping. Crop Sci 50:960–968CrossRefGoogle Scholar
  20. Jakobsson M, Rosenberg NA (2007) CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23:1801–1806CrossRefGoogle Scholar
  21. Keenan K, McGinnity P, Cross TF, Crozier WW, Prodöh PA (2013) diveRsity: an R package for the estimation and exploration of population genetics parameters and their associated errors. Methods Ecol Evol 4:782–788CrossRefGoogle Scholar
  22. Koboldt DC, Steinberg KM, Larson DE, Wilson RK, Mardis ER (2013) The next-generation sequencing revolution and its impact on genomics. Cell 155:27–38CrossRefGoogle Scholar
  23. Leamy LJ, Zhang HY, Li CB, Chen CY, Song BH (2017) A genome-wide association study of seed composition traits in wild soybean (Glycine soja). BMC Genom 18:18CrossRefGoogle Scholar
  24. Lee JD, Bilycu KD, Shannon JG (2015a) Genetics and breeding for modified fatty acid profile in soybean seed oil. J Crop Sci Biotech 10:201–210Google Scholar
  25. Lee YG, Jeong N, Kim JH, Lee K, Kim KH, Pirani A, Ha BK, Kang ST, Park BS, Moon JK, Kim N, Jeong SC (2015b) Development, validation and genetic analysis of a large soybean SNP genotyping array. Plant J 81:625–636CrossRefGoogle Scholar
  26. Li YH, Guan RX, Liu ZX, Ma YS, Wang LX, Li LH, Lin FY, Luan WJ, Chen PY, Yan Z, Guan Y, Zhu L, Ning XC, Smulders MJM, Li W, Piao RH, Cui YH, Yu ZM, Guan M, Chang RZ, Hou AF, Shi AN, Zhang B, Zhu SL, Qiu LJ (2008) Genetic structure and diversity of cultivated soybean (Glycine max (L.) Merr.) landraces in China. Theor Appl Genet 117:857–871CrossRefGoogle Scholar
  27. Li HW, Zhao TJ, Wang YF, Yu DY, Chen SY, Zhou RB, Gai JY (2011) Genetic structure composed of additive QTL, epistatic QTL pairs and collective unmapped minor QTL conferring oil content and fatty acid components of soybeans. Euphytica 182:117–132CrossRefGoogle Scholar
  28. Li YH, Reif JC, Ma YS, Hong HL, Liu ZX, Chang RZ, Qiu LJ (2015) Targeted association mapping demonstrating the complex molecular genetics of fatty acid formation in soybean. BMC Genom 16:841CrossRefGoogle Scholar
  29. Mackay TFC, Stone EA, Ayroles JF (2009) The genetics of quantitative traits: challenges and prospects. Nat Rev Genet 10:565–577CrossRefGoogle Scholar
  30. Mensink R, Katan M (1990) Effect of dietary trans fatty acids on high-density and low-density lipoprotein cholesterol levels in healthy subjects. N Engl J Med 323:439–445CrossRefGoogle Scholar
  31. Monteros MJ, Burton JW, Boerma HR (2008) Molecular mapping and confirmation of QTLs associated with oleic acid content in N00-3350 soybean. Crop Sci 48:2223–2234CrossRefGoogle Scholar
  32. Nyquist WE (1991) Estimation of heritability and prediction of selection response in plant-populations. CRC Crit Rev Plant Sci 10:235–322CrossRefGoogle Scholar
  33. Panthee DR, Pantalone VR, Saxton AM (2006) Modifier QTL for fatty acid composition in soybean oil. Euphytica 152:67–73CrossRefGoogle Scholar
  34. Paradis E, Claude J, Strimmer K (2004) APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20:289–290CrossRefGoogle Scholar
  35. Priolli RHG, Campos JB, Stabellini NS, Pinheiro JB, Vello NA (2015) Association mapping of oil content and fatty acid components in soybean. Euphytica 203:83–96CrossRefGoogle Scholar
  36. Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155:945–959PubMedPubMedCentralGoogle Scholar
  37. R Development Core Team (2015) R: a language and environment for statistical computing [Internet]. R Foundation for Statistical Computing, ViennaGoogle Scholar
  38. Smallwood CJ, Gillman JD, Saxton AM, Bhandari HS, Wadl PA, Fallen BD, Hyten DL, Song Q, Pantalone VR (2017) Identifying and exploring significant genomic regions associated with soybean yield, seed fatty acids, protein and oil. J Crop Sci Biotech 20(4):243–253CrossRefGoogle Scholar
  39. Snedecor GW, Cochran WG (1989) Statistical methods, 8th edn. Iowa State University Press, AmesGoogle Scholar
  40. Sneller CH (1994) Pedigree analysis of elite soybean lines. Crop Sci 34:1515–1522CrossRefGoogle Scholar
  41. Sneller CH (2003) Impact of transgenic genotypes and subdivision on diversity within elite North American soybean germplasm. Crop Sci 43:409–414CrossRefGoogle Scholar
  42. Song QJ, Hyten DL, Jia GF, Quigley CV, Fickus EW, Nelson RL, Cregan PB (2013) Development and evaluation of SoySNP50K, a high-density genotyping array for soybean. PLoS ONE 8:e54985CrossRefGoogle Scholar
  43. Song Q, Jia G, Quigley C, Fickus E, Hyten D, Nelson R, Cregan P (2014) Soybean BARCSoySNP6K Beadchip—a tool for soybean genetics research. In: Plant animal genome XXII, Jan 10–15, 2014, San Diego. Abstract No. P306. Accessed 28 June 2018
  44. Spencer MM, Pantalone VR, Meyer EJ, Landau-Ellis D, Hyten DL (2003) Mapping the Fas locus controlling stearic acid content in soybean. Theor Appl Genet 106:615–619CrossRefGoogle Scholar
  45. Ude GN, Kenworthy WJ, Costa JM, Cregan PB, Alvernaz J (2003) Genetic diversity of soybean cultivars from China, Japan, North America, and North American ancestral lines determined by amplified fragment length polymorphism. Crop Sci 43:1858–1867CrossRefGoogle Scholar
  46. Vaughn JN, Nelson RL, Song QJ, Cregan PB, Li ZL (2014) The genetic architecture of seed composition in soybean is refined by genome-wide association scans across multiple populations. G3 (Bethesda) 4:2283–2294CrossRefGoogle Scholar
  47. Vello NA, Fehr WR, Bahrenfus JB (1984) Genetic-variability and agronomic performance of soybean populations developed from plant introductions. Crop Sci 24:511–514CrossRefGoogle Scholar
  48. Vuong TD, Sonah H, Meinhardt CG, Deshmukh R, Kadam S, Nelson RL, Shannon JG, Nguyen HT (2015) Genetic architecture of cyst nematode resistance revealed by genome-wide association study in soybean. BMC Genom 16:593CrossRefGoogle Scholar
  49. Wang J, Chu SS, Zhang HR, Zhu Y, Cheng H, Yu DY (2016) Development and application of a novel genome-wide SNP array reveals domestication history in soybean. Sci Rep 6:20728CrossRefGoogle Scholar
  50. Wilson R, Burton JW, Pantalone VR, Dewey RE (2002) New gene combinations governing saturated and unsaturated FA composition in soybean. In: Kuo TM, Gardner HW (eds) Lipid biotechnology. Marcel Dekker Inc, New York, pp 95–113Google Scholar
  51. Wysmierski PT, Vello NA (2013) The genetic base of Brazilian soybean cultivars: evolution over time and breeding implications. Genet Mol Biol 36:547–555CrossRefGoogle Scholar
  52. Zhang ZW, Ersoz E, Lai CQ, Todhunter RJ, Tiwari HK, Gore MA, Bradbury PJ, Yu JM, Arnett DK, Ordovas JM, Buckler ES (2010) Mixed linear model approach adapted for genome-wide association studies. Nat Genet 42:355-U118Google Scholar
  53. Zhang HY, Song QJ, Griffin JD, Song BH (2017) Genetic architecture of wild soybean (Glycine soja) response to soybean cyst nematode (Heterodera glycines). Mol Genet Genom 292:1257–1265CrossRefGoogle Scholar
  54. Zimmermann FJP (2014) Estatística aplicada à pesquisa agrícola, 2nd edn. Embrapa, BrasíliaGoogle Scholar

Copyright information

© Springer Nature B.V. 2019

Authors and Affiliations

  1. 1.FIFOUNISANTASantosBrazil
  2. 2.NEPAUNICAMPCampinasBrazil
  3. 3.IACCPDRGVCampinasBrazil
  4. 4.CERESUDESCLagunaBrazil
  5. 5.Department of Genetics, ESALQUSPPiracicabaBrazil

Personalised recommendations