Background

Spinach (Spinacia oleracea L., 2n = 2× = 12) is an economically important vegetable crop worldwide with an estimated annual value of $11.8 billion. The United States (US) is the second largest producer of spinach after China with over 550,000 tons harvested, valued at over $300 million annually since 2009 [1, 2]. In addition to its economic importance, spinach is one of the rising vegetable crops in the US in terms of per capita consumption and is considered a healthy vegetable for humans as it is a source of vitamins and mineral nutrients, as well as several health-promoting phytochemicals [3, 4].

Minerals originate in the earth and cannot be made by living organisms [5]. Mineral elements are present in different forms in nature and some of these elements are essential for the body to perform different functions [6]. Most of them mediate vital biochemical reactions by acting as a cofactor or catalyst for many enzymes. They also act as centers of building stabilizing structures such as enzymes and proteins. The five major minerals in the human body are calcium (Ca), phosphorus (P), potassium (K), sodium (Na), and magnesium (Mg) [6, 7]. All of the remaining elements in a human body are called “trace elements”. The trace elements that have a specific biochemical function in the human body are iron (Fe), cobalt (Co), copper (Cu), zinc (Zn), manganese (Mg), molybdenum (Mo), iodine (I), and selenium (Se) [8]. Spinach is a dietary source of Ca, Cu, Fe, K, Mg, Mn, P, Zn, folate, vitamins and dietary fiber [9]. Therefore, breeding new spinach varieties with high nutritional components including the mineral elements is one of the main goals in spinach breeding programs worldwide.

Molecular plant breeding has been the foundation for twenty-first-century crop improvement [10]. Marker-assisted selection (MAS) has been successfully used to incorporate specific genes/alleles in plant breeding [11,12,13]. Single nucleotide polymorphism (SNP) with its high abundance, cost efficiency, and high-throughput scoring, has become a powerful tool in genome mapping, association studies, diversity analysis, and tagging of important genes in plant genomics [14,15,16,17]. Therefore, identification of SNP markers for mineral elements will be useful in spinach MAS breeding programs.

Genotyping by sequencing (GBS) is one of the next-generation sequencing platforms that utilizes a simple highly-multiplexed system for constructing reduced representation libraries which reduces sample handling, requires fewer PCR and purification steps, no size fractionation and uses inexpensive barcoding [18, 19]. As a cost-effective tool for MAS, GBS has been used to facilitate genome-wide association studies (GWAS), genetic linkage analysis, molecular marker discovery, and studies of genomic diversity or selection [18, 20, 21]. As the GBS method has no requirement for a priori knowledge of the species genomes, it has been shown to be robust across a range of species and SNP discovery and genotyping are completed together [22, 23]. The spinach genome assembly (PacBio Assembly) (980 Mbp) has been reported on January 14, 2014 [24, 25], but it has not been publically available yet. The spinach genome Spinach-1.0.1 is available to the public at http://www.ncbi.nlm.nih.gov/Traces/wgs/?val=AYZV02 and also at “The Beta vulgaris Resource” website with the page at http://bvseq.molgen.mpg.de/Genome/Download/Spinach/, representing approximately half of the spinach genome [26, 27]. We used the AYZV02 as the reference of spinach genome sequences for short reads assembly and SNP discovery in each spinach sample in this study.

To date, several association studies for different phenotypic traits of spinach have been reported, such as oxalate concentration [28], leafminer (Liriomyza spp.) resistance [29], Verticillium wilt resistance [30], Stemphylium leaf spot resistance [31], and leaf traits [32]. However, no genetic studies have been conducted to evaluate the genetic diversity of mineral elements and no research has been reported regarding mineral elements using association mapping in spinach natural populations to date. Therefore, the objectives of this study were to perform genetic diversity analysis and GWAS for spinach mineral elements in the USDA spinach germplasm collection, and to identify SNP markers associated with the 14 mineral elements: B, Ca, Co, Cu, Fe, K, Mg, Mn, Mo, Na, Ni, P, S, and Zn. The results will provide information on how to use spinach germplasm accessions with high mineral concentrations and new molecular markers for spinach breeding programs.

Results

Phenotypic variation in the mineral concentrations in spinach USDA-GRIN germplasm

The 14 mineral elements in spinach were analyzed for their mean (average) concentration, range, standard deviation (stdev), and coefficient of variation (CV) for each mineral element (Table 1). Potassium had the highest concentration with greater than 79,500 ppm; Mg the second with 8697 ppm; P, Ca, and S were greater than 4200 ppm; Na and Zn greater than 100 ppm; Mn, Fe, and B greater than 10 ppm; and Cu, Mo, Ni, and Co were less than 7 ppm, with Ni and Co with less than 1 ppm.

Table 1 The average, range, standard deviation, and coefficient of variation of the 14 mineral compounds in spinach

All elements showed broad concentration ranges across the germplasm accessions. K, P, Fe, Mg, B, Ca, S, and Cu had about two to five times difference (Maximum/Minimum); Na, Co, Zn, and Mn showed approximately eight to fourteen times difference; while Ni (18 times) and Mo (42 times) exhibited the largest differences (Table 1). The standard deviation and stdev error showed similar trends with their averages. K had the highest average, and it also had the highest standard deviation and stdev error. The coefficient of variation (CV), also known as relative standard deviation (RSD), is a standardized measure of dispersion of a probability distribution or frequency distribution, where CV = mean/Stdev × 100. All 14 mineral elements had a large CV value of greater than 14%. Among them, Co, Mn, Mo, Ni, and Zn had greater than 50% CV value, indicated that the dispersion in the variable was greater (Table 1), showing there were large variations in these mineral elements across the spinach germplasm accessions. Analyzed from the distributions of mineral concentration values (Fig. 1), Co, Mn, Ni, and Zn showed a skewed distribution toward the lower range, and others had near-normal distributions.

Fig. 1
figure 1

Distributions of the 14 mineral element concentrations in 292 USDA spinach germplasm accessions. The x-axis signifies the mineral concentrations and y-axis the number of spinach accessions, where the B, Co, Cu, Fe, Mn, Mo, Na, Ni, and Zn used μg/g dry weight (DW) and the Ca, K, Mg, P, and S used mg/g DW

The correlation among the 14 mineral elements was analyzed by JMP Genomics 7. The correlation coefficients among Co, Mn, S, and Zn were greater than 0.7, indicating strong associations (correlations) among the four mineral elements. In addition, the pairwise correlation coefficients of Fe and S, Fe and Co, Fe and Zn, Cu and Mo, K and P were greater than 0.4, indicating significant associations between each pair (Table 2). The two dimension plot of Biplot can be used to visualize the analysis of two–way data. As shown in Fig. 2, the lines that extend from the center and connect to each mineral element trait are considered as index vectors. The angles of the index vectors indicate that correlations existed between the indices (each mineral element trait). The cosine of these angles indicates the genetic correlation. An angle that is less than 90° indicates a positive correlation, and an angle that is greater than 90° indicates a negative correlation. If the angle is close to 0° or 180°, the two indices were highly correlated. The smaller angle between Co, Mn, S, and Zn, than other mineral elements, indicates their higher correlation than the other elements (Fig. 2).

Table 2 Correlation coefficient among fourteen mineral components in spinach
Fig. 2
figure 2

The two dimension plot of Biplot for 14 mineral element concentrations in 292 spinach germplasm accessions

Genetic diversity analysis of spinach germplasm

The population structure of the 292 spinach accessions was initially inferred using STRUCTURE 2.3.4 [33] and the peak of delta K was observed at K = 4, indicating the presence of the four main populations (clusters, Q1-Q4) in the 292 spinach accessions (Fig. 3a). The classification of accessions into populations based on the model-based structure was shown in Fig. 3 and Additional file 1: Table S1. In total, 247 accessions (84.6%) were assigned to one of the four populations (Q1, Q2, Q3, and Q4). Population 1, 2, 3, and 4 (Q1, Q2, Q3, and Q4) consisted of 33 (11.3%), 26 (8.9%), 109 (37.3%), and 79 (27.0%) accessions, respectively. The remaining 45 accessions (15.4%) were categorized as having admixed ancestry, including two, three, and four population admixed among Q1, Q2, Q3, and Q4 (Additional file 1: Table S1).

Fig. 3
figure 3

Model-based populations in association panels consisting of 292 USDA GRIN spinach germplasm accessions: a Delta K values for different numbers of populations (K) assumed in the analysis completed with the STRUCTURE software. b Classification of 292 spinach accessions into four populations using STRUCTURE Version 2.3.4, where the numbers on the y-axis show the subgroup membership, and the x-axis shows the different accession. The distribution of accessions into different populations is indicated by the color coding and shape (Cluster 1, Q1, is red round shape; Cluster 2, Q2, is the green triangle; Cluster 3, Q3, is the blue triangle; Cluster 4, and Q4, is yellow diamond). c Maximum Likelihood (ML) tree of the 292 spinach accessions drawn by MEGA 6. The color code for each population is consistent with (b) and (c), and the empty black square represents accessions aligned with the admixture cluster or population in 292 USDA GRIN spinach germplasm accessions

The genetic diversity was analyzed using the Maximum Likelihood (ML) method by MEGA 6 [34]. Several phylogenetic trees were drawn based on interpretation of results. We defined Q1, Q2, Q3, and Q4 as the clusters and used the same colors as the population structure Q1 (red), Q2 (green), and Q3 (blue), and Q4 (yellow) from the STRUCTURE 2.3.4 (Fig. 3b) to draw the subtrees of the phylogenetic trees in MEGA 6. Two phylogenetic trees were included: (1) without taxon names assigned in order to compare the populations from STRUCTURE (Fig. 3c), and (2) the ring phylogenetic tree (Additional file 2: Figure S1). The phylogenetic trees from MEGA 6 (Fig. 3c and Additional file 2: Figure S1), were well consistent with the structure populations (Q1-Q4) from STRUCTURE 2.3.4 (Fig. 3a and b), indicating that there were four well-differentiated genetic populations and admixture in the spinach panel.

Association mapping and SNP marker identification

Six methods were used for association analysis of the 14 mineral elements among the 292 spinach accessions using 2401 SNPs. The six methods included: (1) SMR_Qgene: single marker regression using the QGene 4.3.10 [35], (2) SMR_tassel: single marker regression without structure and without kinship using TASSEL 5, (3) GLM_tassel: general linear model using TASSEL 5, (4) MLM_tassel: mixed linear model methods using TASSEL 5, (5) cMLM_gapit: compressed mixed linear model methods using GAPIT, and (6) EcMLM_gapit: enriched compressed mixed linear model methods using GAPIT. The selection standardization in this research is based on LOD values: SMR_QGene, SMR_tassel, and GLM_tassel > = 2.5, and one of MLM (either MLM_tassel, CMLM_gapit, or EcMLM_gapit) > = 2.5.

Based on the criteria above, a total of 45 SNPs were identified to be strongly associated with the 13 mineral elements except for K (Table 3). Among the 45 SNP markers, four were associated with B; one with Ca; seven with Co; two with Cu; six with Fe; four with Mg; five with Mn; one with Mo; five with Na; one with Ni; one with P; seven with S; and five with Zn. In addition, two SNP markers, AYZV02123399_305 and AYZV02147304_372, were detected to be associated with K having a LOD value > = 2.0 at SMR_QGene and SMR_tassel, LOD > = 2.5 at GLM_tassel, and one of MLM models (either MLM_tassel, CMLM_gapit, or EcMLM_gapit). Of these identified markers, four SNPs showed pleiotropic effects: AYZV02017731_35 was associated with both Co and S; AYZV02057049_393 with both Fe and S; AYZV02073631_255 with both Co and S; and AYZV02225779_181 with both Fe and S, indicating each of the four markers can be used to select two mineral components in MAS spinach breeding. Three SNP markers, AYZV02017731_40, AYZV02094133_57, and AYZV02281036_185 were identified to be associated with concentrations of four mineral components, Co, Mn, S, and Zn based on LOD values: SMR_Qgene (Table 4). T-test result validated the three SNP markers were significantly associated with the four mineral elements (Table 5), suggesting that it may be possible to select high contents of the four elements, Co, Mn, S, and Zn at the same time through MAS using these markers in breeding.

Table 3 The information of the significant SNP markers associated with 14 mineral compounds among the 292 spinach accessions using six statistical models, SMR_Qgene, SMR_tassel, GLM_tassel, MLM_tassel, CMLM_gapit, and EcMLM_gapit
Table 4 The information of three significant SNP markers associated with four mineral compounds among the 292 spinach accessions using six statistical models, SMR_Qgene, SMR_tassel, GLM_tassel, MLM_tassel, CMLM_gapit, and EcMLM_gapit. Co, Mn, S, and Zn based on T-tesing at P = 0.05
Table 5 Three SNP markers significantly associated with two to four of the four mineral compound, Co, Mn, S, and Zn based on T-tesing at P = 0.05

R-square (Rsq) of the detected markers using the six different methods were listed in Table 3 with a large range from 1.84% of the AYZV02144992_13 marker, associated with Mg in MLM model, to 15.3% of the AYZV02136507_242, associated with Mn in SMR from Tassel. According to the LOD value, the larger Rsq value of a marker, the stronger association is the marker which makes more contribution to the trait. In all detected SNP markers, AYZV02136507_242 had the greatest Rsq value with 10.6% Rsq in SMR model from QGene, 15.3% Rsq in SMR, 13.8% Rsq in GLM, and 12.9% Rsq in GLM from Tassel for Mn element, indicating that the AYZV02136507_242 is strongly associated with Mn from this study.

Evaluation and genetic diversity analysis of the top three spinach germplasm accessions for each mineral

We identified the top 3 accessions with the highest concentrations in each of the 14 mineral elements. First, we ordered the 292 spinach accessions based on their mineral concentrations by each individually from the highest to lowest values and gave the order number from 1 to 292 plus the mineral name for each mineral element. For B, as an example, we ordered the B concentration from the highest to lowest and gave each accession with an order ID from B1 to B292 such as NSL6557 had the highest B value with 52.36 ppm and was given B1, and NSL6083 the lowest B value with 17.07 ppm given B292 (Additional file 1: Table S1). By combining the 14 mineral elements, each accession was given a mineral concentration order ID including the 14 mineral names and their order numbers. For example, the accession NSL6557 was given the designation B1Ca63Co125Cu41Fe133K284Mg34Mn67Mo36Na161Ni41P19S72Zn100, which means a B rank of 1, Ca rank of 63, Co rank of 125, Cu rank of 41, Fe rank of 133, etc. for this accession (Additional file 1: Table S2). After ranking all 14 mineral elements, 31 spinach accessions had been chosen because they had at least one mineral element (out of 14) ranked in the top three among the 292 spinach accessions (Additional file 1: Table S2), indicating that the 31 accessions were good mineral element resources for spinach breeding to improve mineral concentrations. Eight out of these 31 spinach accessions had more than two mineral elements ranked in the top 3 highest (PI604777, PI604786, PI175595, PI604782, PI360895, PI339547, PI176372, and PI169671). Three out of the eight accessions had three mineral elements ranked in the top 3 highest. PI604777 was ranked as No. 1 in S, No. 2 in Fe, and No. 3 in Co; PI604786 as No. 2 in Zn, No. 2 in Mn, and No. 3 in Cu; and PI175595 was No. 1 in Ni, No. 3 in P, and No. 3 in K (Additional file 1: Table S2).

The 31 spinach accessions were collected from 10 countries plus one unknown location: ten from Turkey, two from Afghanistan, India, Iran, Japan, Macedonia, and Netherlands, respectively, and one from Mongolia, Belgium, China, Ethiopia, Hungary, Italy, Nepal, and the US, respectively (Additional file 1: Table S2), The genetic diversity analysis was performed for the 31 spinach accessions and the phylogenetic tree was drawn using MEGA 6 (Fig. 4). Based on genetic distances among the 31 genotypes, there were three clusters: Cluster I, consisted of 24 accessions; Cluster II had only two accessions; and Cluster III, included five accessions. The cluster I can be further divided into five sub-clusters (groups): I-1 with seven accessions, I-2, six accessions, I-3, three accessions, I-4, five accessions, and I-5, only two accessions. In addition, PI604788 is an outlier (Fig. 4). The genetic diversity and phylogenetic analysis will provide the information for breeders to choose these spinach accessions as parents in breeding programs.

Fig. 4
figure 4

The ring phylogenetic tree created by the Maximum Likelihood (ML) method from MEGA 6 in 31 spinach germplasm accessions that had at least one mineral element ranked in the top three highest concentration among the 292 spinach accessions for 14 mineral elements

Discussion

Application of marker-assisted selection in the genetic improvement of spinach breeding

In this study, we conducted a comprehensive GWAS to identify genetic loci with SNP markers associated with the 14 mineral elements (B, Ca, Co, Cu, Fe, K, Mg, Mn, Mo, Na, Ni, P, S, and Zn) in 292 spinach accessions of the USDA collection. A total of 45 SNP markers were identified to be associated with the 13 mineral elements except K, based on six different association mapping models. Similar mineral element association mapping research has been reported in other crops, such as pea [36] and rice [37,38,39,40] using RIL or diversity populations. Ma et al. (2017) [36] conducted a comprehensive QTL mapping study and identified genetic loci associated with mineral element concentrations (B, Ca, Fe, K, Mg, Mn, Mo, P, S and Zn) in pea seeds using a RIL population.

Co-localization of SNPs for different element concentrations has been discovered in this research. AYZV02017731_35 and AYZV02073631_257 were related to Co and S, AYZV02057049_393 was related to both Fe and S, and AYZV02225779_181 was related to Fe and S (Table 3). Co-localization of QTLs for different element concentrations in seeds has previously been reported in rice [37, 38, 41]. Genetically, the phenomenon of co-localization may be caused by pleiotropy of a single gene product being involved in the transport and/or physiological processing of multiple elements. Another possibility is the presence of clustered genes that are tightly associated together and responsible for the accumulation of different elements [42].

Accordingly, the high positive correlations among Co, S, Mn, and Zn ranged from 0.70 to 0.81 in this study (Table 2). The positive correlations suggest that high Co, S, Mn and Zn concentrations could be possible in individual spinach accessions. For the four mineral elements (Co, S, Mn and Zn), eleven highly significantly associated SNP markers were identified in this study (Table 3).

These markers related to mineral elements have low LOD and small R-squared values because the mineral elements are controlled by multiple genes with minor effects. How to apply these SNP markers in spinach germplasm evaluation and breeding is a challenge for spinach breeders. Next step of research is going to validate these SNP markers using KASP SNP genotyping: 1) To validate the stability of these SNP markers across multiple locations and years; 2) To conduct an additional association mapping study on another natural population panel in order to identify potential overlaps with what we are reporting; 3) To develop bi-parental populations to validate and identify SNP markers linked to mineral elements and try to find SNP markers with major effect; 4) To use genomic selection approach to select the minor effect alleles to improve element compositions in spinach cultivars.

This comprehensive spinach mineral nutrient study provides a foundation of SNP markers to improve mineral content in spinach cultivar development. The future release of a spinach reference genome sequence will enable a complete analysis of these trait loci.

Mineral elements related to nutrition and human healthy food in spinach

The mineral elements are essential and indispensable for growth and health, having a direct or indirect effect on the metabolism and physiological processes of humans and plants as well. Deficiencies or insufficient intake of minerals may lead to several dysfunctions and diseases in humans [43]. There is a growing interest in the mineral and phytochemical composition of foods and diets, and especially in leafy vegetables. Recent studies with Mexican, Central American, and African green leafy vegetables, including Cnidoscolus aconitifolius, Crotalaria longirostrata, Solanum scabrum, Gynandropsis gyandra, and several leafy Amaranthus species, have highlighted the contributions that these vegetables can provide to one’s daily intake of essential nutrients and health-beneficial compounds [44, 45].

Mineral elements in spinach have been reported by other studies employing a large number of techniques [46,47,48,49,50,51]. On a moisture-free basis, the highest levels of K, Ca, Na, P, Mg, S, Zn, Cu, Co, and B were found in spinach compared to parsley, dill, and mint [52]. In this research, 14 mineral elements were detected in 292 spinach accessions, including macro-elements (K, Mg, P, Ca, and S) and micro-elements (Na, Zn, Mn, B, Fe, Cu, Mo, Co, and Ni). These findings provide a new range of spinach mineral values for dieticians and nutrition scientists to consider when calculating nutrient intakes of spinach-containing diets, or when developing healthful menu recommendations.

Na, K, Mg, and Ca are the four main and essential electrolytes for humans, Na is responsible for controlling the total amount of water in the body. It is also important for regulating blood volume and maintaining muscle and nerve function. Mg is the most abundant intracellular divalent cation. It is an essential cofactor for a multitude of enzymatic reactions that are important for the generation of energy from ATP and for physiologic processes, including neuromuscular function and maintenance of cardiovascular tone [53]. Ca is the major component of bone and assists in tooth development [54]. K is an important component of cell and body fluids that helps to control heart rate and blood pressure (http://www.nutrition-and-you.com/spinach.html). Among the 292 spinach accessions from this research, the highest levels of Na, K, Mg, and Ca were found to have 1484.01 ppm, 108,935.36 ppm, 14,293.27 ppm, and 10,210.76 ppm in PI209644 from Iraq, PI531456 from Hungary, PI204732 from Turkey, and PI205231 from Turkey, respectively (Additional file 1: Table S1). In addition, the top three spinach germplasm accessions in each mineral element were also listed and provide information on how to use these high mineral accessions in breeding programs. The PI accessions containing beneficial SNPs associated with mineral elements will be highly valuable for the spinach breeders to use for the development of cultivars with high mineral element concentrations through MAS and GS (genomic selection). The significant genetic variations among genotypes as evidenced by the 2 to 42 times difference in mineral concentration (Table 1) suggest that the genetic improvement of mineral traits is feasible in spinach. The co-localization of SNP markers and the positive correlations in concentrations for many mineral elements make it possible to pyramid high concentrations of multiple elements into a single cultivar in a spinach breeding program.

Conclusions

A total of 14 mineral elements: boron (B), calcium (Ca), cobalt (Co), copper (Cu), iron (Fe), potassium (K), magnesium (Mg), manganese (Mn), molybdenum (Mo), sodium (Na), nickel (Ni), phosphorus (P), sulfur (S), and zinc (Zn) were evaluated in 292 spinach accessions originally collected from 29 countries. The 45 SNP markers strongly associated with the concentrations of the 13 mineral elements: B, Ca, Co, Cu, Fe, Mg, Mn, Mo, Na, Ni, P, S, and Zn using six statics methods, including single marker regression using Q-gene (SMR), single marker regression using Tassel (SMR), general linear model using Tassel (GLM), mixed linear model using Tassel (MLM), compressed mixed linear model using Gapit (cMLM), and enriched compressed mixed linear model using Gapit (EcMLM). Three SNP markers, AYZV02017731_40, AYZV02094133_57, and AYZV02281036_185 were identified to be associated with concentrations of four mineral components, Co, Mn, S, and Zn. The markers could be used in breeding programs to improve the nutritional quality of spinach through marker-assisted selection (MAS). Thirty-one spinach accessions with high concentrations of one to several mineral elements can be used as potential parents for spinach breeding programs.

Methods

Plant materials

A total of 292 accessions of spinach (S. oleracea) USDA-GRIN (US Department of Agriculture, Germplasm Resources Information Network) germplasm originally collected from 29 countries plus 18 unknown locations were used for genetic diversity and association analysis of mineral elements in this study (Additional file 1: Table S1). All seeds were kindly provided by David Brenner at USDA-ARS (Agricultural Research Service) and Iowa State University at Ames, IA, US.

Leaf mineral concentration evaluation

Concentrations of 14 mineral components: B, Ca, Co, Cu, Fe, K, Mg, Mn, Mo, Na, Ni, P, S, and Zn were evaluated in 292 USDA spinach germplasm accessions (Additional file 1: Table S1). The phenotypic data of the nine among the 14 elements in spinach germplasm accessions, including Ca, Cu, Fe, Mg, Mn, Mo, Ni, P, and Zn, have been published partially in the USDA GRIN website at http://www.ars-grin.gov/cgi-bin/npgs/html/eval.pl?492376.

Accessions were grown in 1-l black plastic pots filled with a 2:1 (vol: vol) mixture of commercially available soil (Metro-Mix 360; Scotts-Sierra Horticultural Products Co., Marysville, Ohio, USA) and vermiculite (Strong-Lite Medium Vermiculite; Sun Gro Horticulture Co, Seneca, Illinois, USA). There were six plants of each accession in a pot, with pots randomly distributed within a growth chamber (model PGW36; Controlled Environments Ltd., Winnipeg, Manitoba, Canada). Plants were grown on a 12 h. photoperiod of 300 μmol m−2 s−1 photosynthetically active radiation (incandescent and fluorescent lamps) with a 20 ± 0.5 °C / 15 ± 0.5 °C day/night temperature regime. Relative humidity was maintained at 50% ± 5%. Pots were initially irrigated with deionized water, and after emergence, plants were subirrigated daily with a nutrient solution containing the concentrations of mineral salts: 1.2 mM KNO3, 0.8 mM Ca(NO3)2, 0.8 mM NH4NO3, 0.2 mM MgSO4, 0.3 mM KH2PO4, 25 μM CaCl2, 25 μM H3BO3, 2 μm MnSO4, 2 μM ZnSO4, 0.5 μM CuSO4, 0.5 μM H2MoO4, 0.1 μM NiSO4, and 10 μM Fe(Ш)-N, N′-ethylenebis[2-(2-hydroxyphenyl)-glycine] (Sprint 138; Becker-Underwood, Inc., Ames, Iowa, USA).

Plants were harvested at 4–5 weeks after planting when they had 5–6 fully expanded leaves. Harvested material included both mature and immature leaves (leaf blades and petioles) from the six plants of each accession. Soil contamination of the samples was minimized by cutting plants 0.5 cm above the soil surface. Leaves were dried in paper bags at 65–70 °C for a minimum of 48 h, and after cooling the pooled leaves from the 6 plants were ground to a uniform powder using a coffee grinder with stainless steel blades (model IDS 55; Mr. Coffee, Boca Raton, Florida, USA). Two 0.25 g (dry weight) subsamples of each accession were wet digested in borosilicate glass tubes using ultra-pure nitric and perchloric acids, as previously described [55]. Digestages were resuspended in ultra-pure nitric acid and analyzed for concentrations of B, Ca, Co, Cu, Fe, K, Mg, Mn, Mo, Na, Ni, P, S, and Zn using inductively coupled plasma optical emission spectrometry (CIROS ICP Model FCE12; Spectro, Kleve, Germany). Tomato leaf standards (SRM 1573A; National Institute of Standards and Technology, Gaithersburg, Maryland, USA) were digested and analyzed as quality control along with each run of 50 spinach samples to verify the reliability of the procedures and analytical measurements. Results are reported on a dry weight basis as the average of the two subsamples in ppm (parts per million; equivalent to micrograms per gram).

DNA extraction, GBS, and SNP discovery

Genomic DNA was extracted from freeze-dried fresh leaves of spinach plants using the CTAB (hexadecyltrimethyl ammonium bromide) method [56]. DNA sequencing was done by next generation sequencing technologies using GBS [18, 20] and GBS was conducted by HiSeq 2000 in Beijing Genome Institute (BGI). SOAP family software (http://soap.genomics.org.cn/) was used for sequence assembly, mapping and SNP discovery of GBS data. The GBS data averaged 3.26 M short-read and 283.74 Mbp data-points for each spinach sample. The short reads of the GBS data were aligned to spinach genome reference AYZV02 (http://www.ncbi.nlm.nih.gov/Traces/wgs/?val=AYZV02) by using SOAPaligner/soap2 (http://soap.genomics.org.cn/), while SOAPsnp v. 1.05 was used for SNP calling [57, 58]. Approximately a half million SNPs were discovered from the original GBS data from BGI among the 292 spinach germplasm accessions. The spinach accessions and SNPs were filtered before conducting genetic diversity and association analyses. If an accession had greater than 35% missing SNP data, the accession was removed from the panel. The SNP data were filtered by setting the parameters of minor allele frequency (MLF) > 2%, missing data <20%, and heterozygous genotype <10%. After filtering, 2402 SNPs among 292 spinach accessions were used for genetic diversity and association analysis.

Phenotypic data analysis

Phenotypic data of the 14 mineral elements in spinach were analyzed using Microsoft Excel 2016 for the average, range, standard deviation, and coefficient of variation (CV) and the distributions of the 14 mineral elements were drawn using QGene [35]. The correlation coefficients of the 14 mineral elements were calculated using JMP Genomics 7 software (SAS Institute, Cary, NC, USA). In JMP Genomics 7, the dendrogram construction was done using multivariate methods to do hierarchical clustering. After being clustered, the multivariate principal component analysis (PCA) was used to create biplot on covariance.

Genetic diversity and population structure analysis

The model-based program STRUCTURE 2 [33] was used to infer population structure. In order to identify the number of populations (K) capturing the major population structure in the tested spinach association panel, the burn-in period was set at 50,000 with the Markov Chain Monte Carlo iterations and the run length was set at 10,000 in an admixture model and correlated allele frequencies independent for each run [59]. Ten runs were performed for each simulated value of K, ranging from 1 to 10. The delta K was calculated using the formula provided by Evanno et al. (2005) [60]. The optimal K was determined with Structure Harvester [61]. After the optimal K was determined, a Q-matrix was generated; this was used in Tassel 5 for association analysis of mineral elements. Each spinach accession was also assigned to a cluster (Q) based on a probability for that accession in a cluster, using a cut-off probability of 0.50. Based on the optimal K, a Bar plot with ‘Sort by Q’ was obtained to visualize the population structure of the spinach association panel.

Genetic diversity was also assessed and the phylogeny trees were drawn using MEGA 6 [34] based on the Maximum Likelihood tree method with the following parameters [17]. Test of Phylogeny: Bootstrap Method, No. of Bootstrap Replications: 500, Model/Method: General Time Reversible model, Rates among Sites: Gamma distributed with Invariant sites (G + I), Number of Discrete Gamma Categories: 5, Gaps/Missing Data Treatment: Use all sites, ML Heuristic Method: Subtree-Pruning-Regrafting-Extensive (SPR level 5), Initial Tree for ML: Make initial tree automatically (Neighbor Joining), and Branch Swap Filter: Moderate. During the drawing of the phylogeny trees, the population structure and the cluster information were imported to MEGA 6 for combined analysis of genetic diversity. For sub-tree of each Q (cluster), the shape of ‘Node/Subtree Marker’ and the ‘Branch Line’ was drawn with the same color as in the figure of the Bar plot of the population clusters from the STRUCTURE analysis.

Association analysis

Association analysis was conducted with the single marker regression (SMR) without structure and kinship, the general linear model (GLM), and the mixed linear model (MLM) methods as described in TASSEL 5 [62] (http://www.maizegenetics.net/tassel) and the analysis was also performed with compressed mixed linear model (cMLM) [63] and enriched compressed mixed linear model (EcMLM) [64] implemented in the GAPIT R package [65]. The QGene 4.3.10 was also used to conduct SMR for all SNPs [35], although QGene was developed for QTL mapping, it can also be used in association analysis through SMR. The effect of SNP markers were also conducted by T-test using JMP Genomics and Microsoft Excel 2016.