Background

Majority of the global population consume rice daily, particularly in Asiatic countries. But, rice grain is poor source of micronutrients such as iron (Fe) and zinc (Zn). The practice of consuming polished rice as a staple food in India aggravates malnutrition. Substantial amounts of iron and zinc are removed during milling. The polished rice contains around 2 mg kg− 1 Fe while the recommended dietary intake is 10–15 mg kg− 1. Similarly, polished rice contains around 12 mg kg− 1 of Zn, whereas the recommended intake for humans is 12–15 mg kg− 1 [1]. Iron is an important constituent of haemoglobin in red blood cells and is essential for the proper functioning of several enzymes in the body. Zinc is required for the metabolic activity of 300 enzymes, and is essential for enzymes involved in cell division, protein synthesis and growth [2]. To date, proper attention has not been given for improvement of these micronutrients in rice grain. Literature survey on existence of grain Fe and Zn content diversity reported in rice is high in natural rice germplasms [3,4,5,6,7,8,9,10,11,12,13]. Increasing the iron and zinc content in rice grain through breeding is cheaper and an easier option to reduce malnutrition in the developing countries. Increasing Fe and Zn content in rice is possible by utilizing elite germplasms possessing enormous genetic potential for grain Fe-Zn content in rice breeding program [5, 6, 13,14,15].

Popular rice varieties usually contain lesser micronutrients in grains compared to the traditional cultivars and landraces [16, 17]. QTLs responsible for further enhancing these nutrients in the high yielding rice varieties need to be identified and utilized. Through classical breeding approach, a very limited success could be achieved in this context. Earlier reports on low heritability of micronutritients concentration, negative relationship with high grain yield and genotype and environment interaction were the main limitations of genetic enhancement for these nutrients [18,19,20,21]. However, the development of CR Dhan 311, DRR Dhan 45 and Chhattishgarh Zinc Rice-1 are the recent examples of high yielding varieties in India with high grain Zn content. But, these achievements were through non-targeted classical breeding approach. Success in breeding for micronutrients enrichment is limited due to involvement of many quantitative trait loci (QTLs) with small effects and interacts highly with the environmental factors. Precision breeding is a better approach for enriching the elite rice varieties with desired micronutrients. The availability of information on robust markers for different QTLs and potential donors are pre-requisite for the success of precision breeding. Several QTLs for grain Fe and Zn related traits have been reported in rice from different genetic backgrounds of intraspecific and interspecific crosses [6, 9, 13,14,15, 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36] are hardly being in use in modern molecular breeding for Fe-Zn enhancement.

Mapping reports of Fe-Zn content in rice indicated the use of various bi-parental mapping populations for detecting the genes controlling these two micronutrients. Gene mapping using bi-parental mapping populations is more time consuming, costlier and lesser resolution than association mapping [17, 37,38,39,40]. These limitations can be overcome by linkage disequilibrium (LD) mapping or association mapping. Association mapping is based on linkage disequilibrium or variations existing in natural or developed populations [17, 37, 39, 40]. The main purpose of association mapping here is to estimate the correlations between molecular markers with the grain Fe and Zn content in a panel containing representative rice population that possess considerable amount of variability for these two traits. Only few research reports on association mapping for grain Fe-Zn content are available [41, 42]. But, report on QTL mapping through association mapping in rice are available for grain yield and agronomic traits [29, 41, 43, 44]; seedling stage cold tolerance [40, 45]; cold tolerance at germination and booting stages [46]; high temperature stress tolerance [39]; grain quality traits [47]; salinity tolerance [48]; drought tolerance traits [49, 50]; early seedling vigour [51] and grain protein content [17]. However, the information on association mapping for grain Fe and Zn content in rice is scarce. In this study, a panel population containing 102 genotypes shortlisted from 485 germplasm lines were analyzed using 100 SSR and gene specific molecular markers for genetic diversity, population structure and association mapping for grain Fe and Zn content.

Results

Phenotyping of the panel population for grain Fe and Zn content in milled rice

The panel population was prepared by shortlisting genotypes from different phenotypic groups based on screening results of 485 germplasm lines for Fe-Zn content. Almost equal proportions of genotypes were pooled from each Fe-Zn phenotypic group along with the check varieties for constitution of the panel containing 102 germplasm lines. The mean iron content in of the panel population varied from 1.07 to 5.38 mg kg− 1 in the milled rice phenotyping results of wet seasons, 2016 and 2017. A higher Fe-content of > 4 mg kg− 1 in milled rice was observed in 13 genotypes viz., Chittimuthylu, IET25450, IET 24775, IET 24760, IET 24316, BPT5204, IET 23832, Kalanamak, IET25441 and IET25465, IET 23829 and IET 24779 (Additional file 3: Table S1). The check varieties for micronutrients viz., Chittimuthylu and Kalanamak showed mean grain Fe content of 4.8 and 4.48 mg kg− 1, respectively. Genotypes containing ≥25 mg kg− 1 zinc and /or ≥ 10 mg kg− 1 iron in brown rice along with at par or higher grain yield compare to the yield checks are considered desirable biofortified rice lines. While comparing in milled rice, genotype containing ≥4 mg kg− 1 Fe and ≥ 20 mg kg− 1 Zn and producing higher yield than yield check variety is considered as desirable genotypes. It is observed that the genotypes enlisted in the Table containing > 4 mg kg− 1 Fe in milled rice were found to be with > 12 mg kg− 1 in brown rice. Similarly, genotypes present in the panel with > 20 mg kg− 1 Zn content in milled rice showed > 25 mg kg− 1 in brown rice. From the results, nine biofortified rice genotypes, two micronutrients check cultivars and landrace Sadakadam showed more than 4 mg kg− 1 Fe in milled rice and categorized under high Fe containing genotypes in our study (Table 1). Forty two genotypes comprised of the moderate group (3–4 mg kg− 1 Fe in milled rice) while 48 showed low values (< 3 mg kg− 1 Fe). Mean zinc content in milled rice varied from 7.43–27.97 mg kg− 1 showing ≥20 mg kg− 1 in fourteen genotypes (Additional file 3: Table S1). These fourteen genotypes were categorized under high grain Zn containing rice. The two check varieties for micronutrients, Chittimuthylu and Kalanamak had mean value of 22.3 mg kg− 1 and 25.91 mg kg− 1 zinc in grain, respectively. In the studied panel, 56 genotypes showed moderate level of 15–20 mg kg− 1 grain Zn, while 32 lines exhibited low level (< 15 mg kg− 1 grain Zn). Evaluation of 102 genotypes for grain yield and Fe-Zn content over 2 years revealed two biofortified lines namely IET24779 and IET25465 showing at par yield with the yield check variety IR64 (Table 1). The genotypes containing high Fe or high Zn in milled rice and at par grain yield with yield check variety may be selected as cultivar or as donor parent for the breeding programs. The desirable genotypes with high Fe content and good grain yield were IET23829, IET 23832, BPT5204, IET24316, IET24760, IET24775, IET24779, IET25441 and IET25465. The genotypes with high Zn content and high grain yield were IET23824, IET24391, IET25446, IET25461, IET25465, IET25457, IET25477, Mumihunger, Sneha, Lalat and Chittimuthyalu. Genotypes IET24779, IET25465 and Chittimuthyalu produced at par yield with standard check variety IR64 along with high grain Fe-Zn content. The frequency of genotypes having low, moderate and high grain Fe and Zn content is depicted in spider graph and histogram (Fig. 1).

Table 1 Mean estimate values of days to 50% flowering, grain Fe- Zn content, panicles/m2 and grain yield of 102 genotypes including biofortified lines and check varieties studied during wet season, 2016 & 2017
Fig. 1
figure 1

Fe and Zn content of 102 genotypes and their frequency distribution in the panel population. a Spider graph showing the Fe and Zn content of the genotypes. b Frequency of high, moderate and low Fe-Zn genotypes in the panel population

Relatedness among genotypes for grain yield and Fe-Zn content through genotype-by-trait biplot analysis

The scatter diagram was plotted in the first two principal components axis to generate genotype-by-trait biplot graph for grain Fe, grain Zn concentration and grain yield of the 102 genotypes present in the panel (Fig. 2). The first and second principal components had 87.82 and 8.08% of the total variability with eigen value of 10.18 and 0.937, respectively (Additional file 1: Figure S1). Among the 3 traits from the principal component analysis, the grain yield contributed maximum diversity, followed by grain Fe content and grain Zn content in the panel population (Fig. 2). From the scattering pattern of genotypes in the 4 quadrants revealed the placement of the genotypes for high grain yield and Fe content in opposite direction. Higher Zn containing genotypes are in between grain yield and Fe content. However, there are some genotypes which are located nearer to origin possessing higher estimates of grain yield and grain Fe-Zn content. These genotypes have been encircled in the figure (Fig. 2). The top right (Ist quadrant) and bottom right (2nd quadrant) possessed majority of the genotypes with better in Fe and Zn content in kernel along with higher grain yield. The 3rd (bottom left) and 4th quadrant (top left) accommodates genotypes most of which were poor in grain Fe and Zn content with low grain yield (Fig. 2). Many genotypes with moderate in Fe and Zn content are found in 2nd quadrant. The desirable genotypes containing moderate to high grain Fe- Zn and grain yield are located both side of the X-axis and are encircled in the scatter diagram (Fig. 2).

Fig. 2
figure 2

Genotype-by-trait biplot graph showing 102 genotypes in two main principal components for three traits. Fe: grain iron content; Zn: grain zinc content; yld: grain yield (kg/ha); PN: panicle number; DFF: days to 50% flowering. The dot numbers in the figure represent the serial number of the genotypes enlisted in Table 1

Genetic diversity in the panel population using 100 molecular markers

The panel containing 102 germplasm lines exhibiting wide genetic variation for grain Fe and Zn content were genotyped using 100 molecular markers including 25 gene specific and 75 SSR markers. The loci used for genetic diversity and the calculated parameters are depicted in the table (Table 2). Two hundred forty four amplicons were obtained in toto with 2.44 average alleles per locus. The number of alleles ranged from 1 to 6 per marker with RM340 showing the highest number of alleles in the panel genotypes. The average major allele frequency of Fe and Zn linked polymorphic markers was observed to be 0.675 ranging within the bracket of 0.4023 (RM152) and 1.0000 (RM309, RM452, RM137, RM1789 and RM441) (Table 2). The PIC value ranged from 0.000 (RM6209, sZIP8, RM3409, RM309, RM452, RM137, RM1789, RM556, RM23, RM34, RM441 and GRMM9–2) to 0.6713 (RM407) with mean PIC of 0.3553. The observed average heterozygosity (Ho) was 0.0496 with a range of 0.00–0.6714. Amongst the markers used, only 36 showed heterogygocity value to be more than zero, whereas 64 exhibited zero values. The average gene diversity (He) varied from 0.0000 (RM309, RM452, RM137, RM1789, RM44, RM590 and RM258) to 0.6711 (RM3392) with an average value of 0.35.

Table 2 Details of 100 SSR and direct marker loci used for genotyping a panel containing 102 rice genotypes and their genetic diversity parameters

Population structure

A Bayesian clustering approach was followed for estimation of genetic structure by taking probable sub-populations (K) and higher delta K-value using the STRUCTURE 2.3.6 software. The genotypes in the panel exhibiting variation for grain Fe and Zn content were evaluated for genetic structure following. It categorized the genotypes into two sub-populations (Fig. 3a, b) with a high ∆K peak value of 495.2 at K = 2 among the assumed K (Fig. 3a). But, the classification was not robust with respect to Fe-Zn content in the genotypes. However, at K = 3 with ∆K value of 117.9 categorized the population into three distinct sub-populations with perfect grouping. At K = 3, the sub-population 1 (SP1), sub-population 2 (SP2) and sub-population 3 (SP3) included 18, 44 and 40 genotypes, respectively whereas eleven genotypes were admixture type having major genetic constituent from three sub-groups (Fig. 3c; Table 3). The SP1 and SP2 included majority of moderate and high grain Fe-Zn containing genotypes accommodating 100% of total high Fe (> 4 mg kg− 1) genotypes, 97.3% of total moderate Fe (3–4 mg kg− 1) genotypes, 90% of total high Zn (> 20 mg kg− 1) genotypes and 69.2% of total moderate Zn (15–20 mg kg− 1) genotypes. Maximum allele frequency divergence between the two sub-populations (net nucleotide distance) 1–2, 1–3 and 2–3 were 0.1239, 0.1813 and 0.2480, respectively. The average distance (expected heterozygosity) between individuals in cluster 1, cluster 2 and cluster 3 were 0.3140, 0.3051 and 0.3115, respectively. The three sub-populations showed fixation index values (FST) of 0.287 for SP1, 0.385 for SP2 and 0.4853 for SP3. A lower value of alpha (alpha = 0.0388) was found for the studied panel population. The distribution pattern of alpha-value in the panel population showed a leptokurtic symmetry while distribution of FST values in the sub-populations were almost in symmetric shape with same to the left and right from the centre (Additional file 2: Figure S2). Further at K = 7 with ∆K value of 3.946, the software categorized the entire population into 7 distinct sub-populations classifying the population into more subtle classes according to grain Fe-Zn content.

Fig. 3
figure 3

a Graph of delta K value, an ad-hoc statistic related to the rate of change in the log probability of data between successive K values; b Population structure of the 102-panel population placed based on membership probability fractions of individual genotypes at K = 2 and c Population structure of the 102-panel population placed based on membership probability fractions of individual genotypes at K = 3. The genotypes with the probability of ≥80% membership fractions were assigned to corresponding subgroups with others categorized as admixture. The numbers in the figure represent the serial number of the genotypes enlisted in Table 1

Table 3 The inferred ancestry value and population structure in the panel population containing 102 genotypes with their Fe-Zn classification category

Analysis of molecular variance (AMOVA) and LD decay plot

The analysis of molecular variance (AMOVA) showed genetic variations between and within the sub-populations at K = 2, K = 3 and K = 7 (Table 4). The genetic variations between and within the two sub-populations (K = 2) was 41% among the populations, 50% among individuals and 9% variation within individuals in the panel population. At 3 sub-populations, 38% of the variation among populations, 52% among individuals and 10% variation within individuals in the panel population. But, the analysis at K = 7 revealed 39% of the variation among populations, 51% among individuals and 10% variation within individuals resulted from the analysis. Wright’s F statistic was used to calculate the deviation from Hardy-Weinberg’s prediction. The FIS and FIT value for all the 100 loci were 0.844 and 0.904, whereas FST was 0.385 among populations at three sub-populations analysis. The values of FST, FIS and FIT for 7 sub-population level were 0.386, 0.831 and 0.897, respectively. The FST values at both K = 3 and K = 7 could discriminate the sub-populations from each other revealing differences among themselves. A moderate linearized FST value of 0.385 at K = 3 and 0.386 at K = 7 were computed for the sub-populations. The FST values of each sub-population and their distribution pattern showed a clear differentiation among the sub-populations from each other (Additional file 2: Figure S2).

Table 4 Analysis of molecular variance (AMOVA) of the sub-populations of panel population at K = 2 and K = 3 for Fe and Zn content in milled rice of 102 genotypes

The linkage disequilibrium decay rate is crucial for maintenance of disequilibrium in a population. This provides scope for mapping of complex traits through marker–trait association. Linkage disequilibrium plot was generated for a population by plotting Syntenic r2 values against the physical distance in million base pair. A sharp decline in LD decay was obtained for the linked markers at 1–2 M base pair and subsequently showed a very slow and gradual decay (Fig. 4).

Fig. 4
figure 4

Linkage disequilibrium (LD) decay (r2) curve plotted against the physical distance (base pairs, bp) between pairs of loci on chromosomes in rice. The decay started in million bp estimated by taking 95th percentile of the distribution of r2 for all unlinked loci

Grouping of panel population by principal coordinates and cluster analyses

Principal coordinate analysis (PCoA) is depicted in two dimension using 100 markers for determining the genetic relatedness among the genotypes (Fig. 5). The first two components accounted for 36.45 and 6.16% of total inertia. The panel population containing 102 genotypes used in the study were distributed in four quadrants showing three major groups (Fig. 5). The 1st, 2nd, 3rd and 4th quadrant consisted of 24, 46, 24 and 8 number of genotypes, respectively. The genotypes belonging to three different sub-populations are grouped in different quadrants. The 2nd quadrant genotypes are divided into two groups of which one group is closer to axis 1 and another is to axis 2. These 2nd group genotypes closer to axis 2 are admix type depicted in red colour (Fig. 5a).

Fig. 5
figure 5

Principal coordinate analysis (PCoA) of 102 genotypes in the panel population for grain Fe and Zn content using 100 molecular markers. The dot numbers in the figure represent the serial number of the genotypes enlisted in Table 1. The numbers are coloured on the basis of (a) sub-populations obtained from structure analysis (SP1-green; SP2-blue; SP3-Pink; admix type-red) (b) grain Fe and Zn content (common high grain Fe and Zn content – red; common moderate grain Fe and Zn content – green)

The majority of the genotypes containing moderate to high Fe and Zn content were placed in the 1st (top right) and 2nd (bottom right) quadrants of the PCoA. All the genotypes that are moderate to high for both zinc and iron namely, IET23834, Chittimuthyalu, IET24780, IET24771, IET24766, IET24783, IET24772, IET25443, IET25444, IET25447, IET25449, IET25450, IET25452, IET25453, IET25463, IET25465, IET25469, IET25471, IET25474, IET25479 and Safed Luchai-2 are placed in the red encircled area except five genotypes Kalanamak, IET24557, IET24787, IET25454 and IET25473 located in 1st quadrant (top right) and CR Dhan 907 in the 3rd quadrant (Fig. 5b). The genotypes namely Chittimuthyalu, IET25450, IET25450 and IET25465 containing high zinc and iron are placed in the red encircled area including Kalojeera, Kalonunia, Swarnakranti and IET25464. The green encircled area consisted of mostly the moderate grain Fe and/or Zn containing genotypes, whereas the blue encircled area covering the 3rd and 4th quadrant included mostly the germplasm lines low to Fe and/or Zn content (Fig. 5b).

The cluster analysis based on genotyping of panel population with the 100 markers was performed to determine their collective discrimination ability. The tree constructed with unweighted-neighbour joining method could differentiate the genotypes into 3 different clusters (Fig. 6). The cluster I, II and III consisted of 32, 51 and 19 genotypes, respectively. The three clusters clearly separated the three sub-populations obtained by STRUCTURE analysis. The sub-populations SP1, SP2 and SP3 were grouped separately under cluster III (green), II (blue) and I (pink), respectively (Fig. 6a). The admix genotypes, Kalonunia, Kalanamak, Kalojeera, Swarnakranti, IET25464 formed a distinct sub-cluster under cluster II remaining together with SP2. Significant correlation was observed between the iron and zinc content of rice grain and clustering pattern (Fig. 6b, c). The cluster depicted in red color indicate high, green for moderate and blue for low iron/zinc content in the tree. When iron content was compared, cluster I consisted 30 low iron and 2 (CR Dhan 907 and Nuadhusura) moderate in iron containing genotypes, Cluster III included majority of the high and moderate Fe containing genotypes. Of the 51 genotypes in the cluster II, more than 80% are low to moderate for grain Fe content. Hence, cluster II and III can be classified as moderate to high Fe containing group. Similarly, when Zn content of rice grain was compared with clustering pattern, cluster II and III included majority of the genotypes with high Zn (Fig. 6c). In cluster III, only three genotypes out of 19 genotypes were low for Zn content, where rest were moderate to high Zn. Six genotypes with high and 28 genotypes with moderate Zn content were included in the cluster II of the tree (Fig. 6c).

Fig. 6
figure 6

Unrooted tree using unweighted-neighbour joining method depicting clustering pattern of 102 germplasm lines with respect to 100 molecular markers coloured on the basis of (a) sub-populations obtained from structure analysis (SP1-green; SP2-blue; SP3-Pink; admix type-red), b iron content (High Fe content-red; Moderate Fe content-green; Low Fe content-blue) and c Zinc content (High Zn content-red; Moderate Zn content-green; Low Zn content-blue) in milled rice

Association of marker alleles with grain Fe and Zn content in rice

Association of grain Fe, Zn content, panicles/m2 and grain yield with the molecular markers was estimated using TASSEL 5 software. The association parameters were based on Generalized Linear Model (GLM) and Mixed Linear Model (MLM/ K + Q model). The marker-trait comparisons were subjected to filtration at less than 5% error i.e. 95% confidence (p value < 0.05). The GLM approach computed average r2 value of 0.1133 with upper limit of 0.6095 and lower limit of 0.0386, whereas MLM showed r2 values ranging from 0.0392 to 0.146 with an average of 0.0622 (Additional file 5: Table S3; Additional file 6: Table S4). With r2 > 0.10 and p ≤ 0.05 filter, 42 and 3 markers showed association with grain Fe and Zn content respectively by using GLM model (Additional file 5: Table S3). However, taking MLM model, only one marker GRMm9–1 showed association with grain Zn content (Table 5). But when r2 > 0.05 was taken into account, 11 (RM243, RM122, RM234, RM80, RM315, RM339, RM1132, GRMM9–1, OsZIP8A, OsYSL2A, OsZIP6A) and 3 (RM80, RM300, RM1132) markers were associated with grain Fe and Zn content, respectively (Additional file 5: Table S3; Additional file 6: Table S4). Considering both the GLM and MLM, ten and seven markers showed significant association with grain Fe and Zn content, respectively at p < 0.05 (Table 5). Four primers namely RM80, RM300, RM1132 and GRMM9–1 and six primers viz. RM243, RM234, RM80, RM339, OsZIP8B, OsZIP6A were associated with grain Zn and Fe content, respectively with r2 > 0.05 and p ≤ 0.05 filter in both the models (Table 5). The Q-Q plot also confirmed the association of the markers with grain Fe and Zn content in rice (Fig. 7).

Table 5 Association of marker alleles with Fe and Zn content in milled rice detected both in GLM and MLM analyses in a shortlisted panel population of 102 genotypes
Fig. 7
figure 7

Quantile–Quantile (Q-Q) plot and distribution of marker-trait association from Generalized Linear Model analysis for grain iron-zinc content, panicles/m2 and grain yield at (a) p < 0.01 and (b) at p < 0.05

Agronomy traits like panicles/m2 and grain yield also showed significant association with 44 and 25 primers, respectively using GLM at P < 0.05 (Additional file 5: Table S3). The numbers reduced to 13 for each trait when analyzed with MLM model (Additional file 6: Table S4). However, considering GLM and MLM, twelve and eight markers showed significant association with panicles/m2 and grain yield, respectively at p < 0.05. Of these, nine markers (RM248, RM17, RM440, RM85, RM556, RM23, RM340, OsNARMP1A, OsFER1) with GLM and four markers (RM17, RM421, OsNARMP1A, OsFER1) with GLM as well as MLM showed r2 > 0.10 for panicles/m2. Similarly, for yield trait, RM17, RM517, RM339 and RM1132 were associated with r2 > 0.10 using GLM and that with both the models only RM517 showed r2 > 0.10 (Table 5). The Q-Q plot also confirmed the association of the markers with grain panicles/m2 and grain yield in rice (Fig. 7).

Common markers were observed to be associated with different traits. Sixteen and four common markers were observed for Fe and Zn content using GLM and MLM model respectively (Additional file 5: Table S3 and Additional file 6: Table S4). This number reduced to three (RM80, RM339, RM1132) when both the models were considered simultaneously. Considering GLM model, 12 markers viz., RM243, RM488, RM248, RM17, RM3392, RM440, RM201, RM421, RM585, RM34, RM339 and RM1132 showed significant association with all the traits viz., grain Fe and Zn content, panicles/m2 and yield. RM1132 and RM339 were commonly associated with grain Fe, Zn and yield considering both GLM and MLM models (Table 5). OsFER1 was associated with all the traits except grain yield.

Quantitative real time– PCR for validation of associated markers/genes with grain Fe and Zn content

The qRT-PCR analysis was performed for validating the identified markers in the marker-trait association analysis. Two significantly associated markers OsZIP8B and OsZIP6A for the genes OsZIP8 and OsZIP6 respectively were selected for validation study. The results of qRT-PCR for the genes were normalized with housekeeping gene b-tubulin. The expression of the genes was studied in root as well as shoot of two genotypes Kalanmak (high grain Fe and Zn) and Swarna (low grain Fe and Zn) under Fe-Zn normal and deficient condition. Both the genes were up-regulated under Fe-Zn deficient condition as compared to the control situation (Fig. 8). The fold change of expression of both the genes ranged 1.69–6.22 in deficient condition. Higher expression was obtained in root. Swarna showed more upreglation in roots as compared to Kalanamak.

Fig. 8
figure 8

Fold change of expression of genes OsZIP6 and OsZIP8 under Fe-Zn deficient treatment as compared to that of normal condition in roots and shoots of two contrasting genotypes Kalanamak and Swarna

Discussion

Modern high yielding varieties are poor in essential micronutrients like grain Fe and Zn content in rice [16, 17]. On an average, polished rice has 2 mg kg− 1 Fe, while the recommended dietary intake for humans is 10–15 mg kg− 1. Previous germplasm study for grain Fe-Zn content reported variability range of 6.3–24.4 mg kg− 1 for Fe and 13.5–28.4 mg kg− 1 for Zn in rice grain strongly suggests existence of genetic variability of these two micronutrients in rice germplasms [6, 8, 9, 25, 52, 53]. Evaluation of the panel population containing 102 genotypes showed a maximum value of 4.48 mg kg− 1 Fe and 26.96 mg kg− 1 Zn content in the milled rice (Table 1). Further, biplot analysis revealed a wide variability in Fe and Zn content in the milled rice of the panel population (Fig. 2). It is observed that among the genotypes with > 4 mg kg− 1 Fe in milled rice showed > 12 mg kg− 1 in brown rice (Table 1). Similarly, the genotypes containing > 20 mg kg− 1 Zn in milled rice showed > 25 mg kg− 1 in brown rice. Therefore, mapping of QTLs responsible for high Fe and Zn content in the materials seems to be logical. Soil characteristics at the growing locations and genotype x environment has a great role in determining the grain micronutrient content in rice [17, 27]. Therefore, 2 years pooled data for phenotyping of grain Fe and Zn content is more reliable than single year. On the basis of 2 years mean data, the panel population was grouped into three classes based on their grain Fe and Zn content. Therefore, the population structure of the panel population for these traits is very important. Prichard’s structure analysis at K = 3 also grouped the population into three distinct sub-populations.

Further, it was observed that a comparable grain Fe and Zn content were obtained in some varieties and biofortified lines as in the positive check varieties (Table 1). A positive correlation of Fe and Zn content in rice grain in some genotypes had been reported in previous publications [54,55,56]. Mean performance table generated here also revealed that few genotypes with high Zn content possessed high Fe content also (Table 1). Besides, evaluation results showed that some genotypes were with at par yield with the yield check varieties and contain moderate to high Fe-Zn content in the grains. Therefore, improvement of Fe and Zn content in rice grain along with high grain yield is possible. Relationship of high grain yield with Zn content is reported in earlier publication [57]. A similar observation for grain protein content has been reported recently in rice [17]. From panel’s genotype-trait biplot analysis, it is clear that Ist and 2nd quadrant accommodated genotypes with better in Fe and Zn content along with better grain yield (Fig. 2). Previous mapping results also reported linkage of primer RM490 for Zn content associated with yield component QTLs like panicles/plant and 1000-grain weight [58]. Current association mapping revealed association of RM260 with Zn content in rice grain. Also, qDTY12.1 an-yield QTL under drought was reported to be linked to primer RM260 [36]. The same is associated with Zn content in rice grain (Table 5). Therefore, improvement of Fe-Zn content along with high grain yield is challenging but possible.

The principal component and coordinate analyses distributed the panel’ population according to their grain Fe-Zn content along with grain yield and allocated distinct spots in the four quadrants (Figs. 2 and 5). Distribution of genotypes into different quadrants indicated the presence of genetic variation for Fe and Zn content in the panel population. The neighbour joining tree also grouped the germplasms into different classes on the basis of genotyping results of 100 molecular markers (Fig. 6). The figure showed various groups and sub-groups in the panel population based on genotyping using 100 markers for these traits clearly categorized the panel population into different groups indicating involvement of different genes/QTLs for these classes (Fig. 6). Presence of these different groups in the population increased the continuation of linkage disequilibrium and favoured for getting the marker-trait association for Fe-Zn content in rice grain. Similar results of trait mapping through marker-phenotype association were reported earlier for various rice phenotypic traits [17, 39, 40, 43, 45, 47]. A moderate level of genetic diversity was observed in the studied population for grain Fe and Zn content. The moderate genetic diversity detected in this study for Fe and Zn content is similar to the earlier results on genetic diversity obtained by other workers for these traits [53, 59,60,61]. However, rich genetic diversity values for these traits were also reported earlier in rice [6, 7, 9, 16, 62,63,64]. In our structure analysis, we observed a lower value of alpha (α = 0.0388) indicating that these two traits had a common primary ancestor. Subsequently, it evolved with each sub-population with few admix genotypes. The values obtained from inferred ancestry indicated small effects of individual QTLs responsible for trait groups and subgroups in the population. These small effects QTLs need to be put together into a favourable high yielding background through molecular breeding approaches. Similar observations are available from earlier association studies also support this suggestions for gene(s) / QTLs stacking in popular varieties [17, 39, 40, 47].

The FST values estimated for the pair of sub-populations showed a clear variation in the values. The FST value for SP1 and SP2 was found to be moderate (0.247) value. The variation in FST values and their pattern of distribution revealed a clear cut differentiation among the sub-populations from each other with respect to Fe and Zn content in the grains. Thus, the population is categorized into clear genetic groups based on the inter and intra-sub FST values. It is therefore expected that the populations and individuals showing higher FST values may produce better progenies containing Fe and Zn content in recombination breeding. Therefore, attempt to pyramid these QTLs responsible for higher Fe and Zn may be put from different populations lead to higher Fe and Zn content in the progenies. Similar suggestions were also provided by earlier workers for increasing high and low temperature stress tolerance and grain yield in rice [17, 39, 40, 65]. A sharp decline and then very slow and gradual LD decay value was noticed in the panel population (Fig. 4). This may be due to the self pollination nature of rice crop. The LD decay is usually high in open pollinated crops.

TASSEL analysis detected marker and grain Fe-Zn content association in the tested panel population. Higher F and lower p value with high r2 were detected for 10 and 7 markers associated with Fe and Zn content in kernel, respectively. These were the reliable markers to be used in Fe and Zn enhancement breeding program as detected through both GLM and MLM analyses. Thus, suggesting a strong association of marker with grain Fe-Zn content in the panel population. This is further evident from Q-Q plot exhibiting expression of many QTLs for grain Fe and Zn content (Fig. 7). This marker-trait association provides clue for presence of various QTLs in the germplasms containing more grain Fe and Zn. It may be due to the involvement of different donor lines carrying different QTLs for these two traits. The accumulation of useful QTLs for these traits in few landraces may be possible by natural introgression over a long time period ultimately exhibiting a higher quantity of the two micronutrients in those germplasm lines. Similar predictions for accumulation of useful QTLs in stress tolerant wild and landraces genotypes were also earlier reported in rice [17, 39, 40].

All the significantly associated markers for grain Fe content namely RM243, RM80, RM339, OsZIP6A and OsZIP8B detected through both GLM and MLM models with r2 > 0.05 and high F value indicated a very strong association. Probably, these QTLs contribute to variability for Fe content in rice grain. Some of these markers were reported to be linked to Fe enhancing QTLs in rice. Similarly, four markers namely RM80, RM300, RM1132 and gRMm9–1 showed significant association with Zn content detected through both the models at r2 > 0.05. These primers are located in different chromosomes which may correspond to different QTLs (Table 5).

Ten markers namely RM243, RM122, RM234, RM7, RM168, RM80, RM339, RM1132, OsZIP6A and OsZIP8B showed significant association with grain iron content detected with both GLM and MLM models. RM243, RM168, RM122, RM234 and RM80 are earlier reported for qFe1.1, qFe3.1, qFe5.1, qFe7.1 and qFe8.1 QTLs, respectively [9]. The marker RM339 is physically located closer to RM80 (qFe8.1), it will be useful in marker-assisted transfer of qFe8.1. Hence, these six markers and five QTLs were validated using the panel population and these reliably can be deployed in MAS breeding for improvement of Fe content in rice grain. OsZIP8 and OsZIP6 are Zinc-iron transporters belonging to ZIP family of micronutrient transporters. Particularly, OsZIP8 is important for Zn transport to seed [66]. Both these transporters being associated with grain Fe and Zn content in our study imply that both of these genes are also responsible for Fe transport to seed. These two genes were also reported earlier and currently detected by both the models [8, 28, 66]. Therefore, these two markers can reliably be used for Fe enhancement. Two markers RM7 and RM1132 were reported earlier for qZn3.1 and qZn7, respectively for grain Zn content [9, 67]. However, these two primers were now strongly associated for grain Fe content detected by both the models. QTLs controlling Fe content located in these locations of the two chromosomes were not reported earlier. Hence, these two locations can be designated as QTLs qFe3.3 and qFe7.3 and are novel QTLs. Two primers, RM340 and RM1132 reported for qZn6 and qZn7 are now detected to be strongly associated with grain Zn content [67]. Hence, these two QTLs are validated and reliably may be deployed for Zn enhancement in rice grain. Two markers namely RM339 and gRMm9–1 (LOC_Os09g27330: oxidoreductase/ transition metal binding protein) were reported earlier and currently detected by both the models [8, 15]. Therefore, these two markers can also reliably be used for Zn enhancement. Two primers RM80 and RM260 were reported earlier for qFe8.1 and qFe12.2 for grain Fe content [9]. However, these two primers under this study were strongly associated for grain Zn content detected by both the models. QTLs controlling Zn content present in these regions of the chromosomes were not reported in previous publications. Hence, these two positions on chromosome 8 and 12 are designated as QTLs qZn8.3 and qZn12.3 and considered to be novel QTLs controlling grain Zn content in rice. RM300, located on chromosome 2, is detected to be strongly associated with Zn content. No earlier reports could be found, hence, designated as qZn2.2 that controls grain Zn content. From mapping study, it is reported that RM1132 is associated with grain Zn content with qZn7 controlling the trait [67]. However, we detected a strong association for grain Fe content in the same location on the chromosome 7. As these locations on chromosome 7, 8 and 12 are commonly responsible for Zn and Fe content, is can be said that these Fe and Zn enhancing QTLs may be co-localized.

Several markers strongly associated with grain Fe and/or Zn content were observed with single model (GLM or MLM). The different markers used for zinc/iron transporters belonging to ZIP family, yellow stripe gene family (YSL), ferritin, metal tolerance protein (MTP) family, Natural Resistance Associated Macrophage protein 1 (NRAMP) family were associated with Fe content indicating common transport mechanism for Fe and Zn. Ferritins are iron storage proteins having two functional units for rapid uptake of Fe2+ and nucleation of Fe3+ [68, 69]. Association of OsFER1 with Zn content and panicle number may imply the involvement of this gene with Zn storage along with Fe thereby increasing panicle number of plants as both are essential micronutrients for plant growth. As plant ferritins lack iron response elements [69]. There may be chance of carrying Zn along with Fe. But this needs further confirmation with detailed experimentation.

Markers associated with Fe and Zn were analysed to see their involvement in enhancement of yield and its component traits. Twelve markers viz., RM243, RM488, RM248, RM17, RM3392, RM440, RM201, RM421, RM585, RM34, RM339, RM1132 showed significant association with grain Fe and Zn content as well as panicles/m2 and yield considering GLM model only (Additional file 5: Table S3). This indicated that the QTLs controlling these four traits may be co-localized and hence higher chance of co-inheritance. Similarly two markers RM1132 and RM339 were commonly associated with grain Fe, Zn and yield detected considering both GLM and MLM (Table 5). The QTLs controlling grain weight, qTBGW8 [70] and plant architecture, OsSPL14 [71] are located only 0.46 Mb and 7.33 Mb apart, respectively, from RM339 located on chromosome 8. Also, RM1132 is located only 5.1 Mb away from qPN7 reported to control panicle number of rice plant [70]. These observations provide clues for common inheritance of these QTLs. Hence, enhancement of both the traits viz., grain Fe and Zn content and yield parameters can easily be achieved simultaneously. Also, in a recent publication, it is established that grain Fe-Zn can easily be improved with few yield component traits [36]. Similar was the case of grain protein content heritability, negative relationship with grain yield and influence of genotype and environment interaction [18,19,20,21]. However, the recent results published on protein content mapping in rice indicated about the possibility of improving grain protein content along with high grain yield [17].

The qRT-PCR analysis was performed for validating the identified markers in the marker-trait association analysis. The expression of the two studied genes OsZIP8 (Fe-Zn transporter) and OsZIP6 (Zn transporter) under Fe-Zn normal condition and their upregulation in deficient condition confirmed the accuracy of the association study. Hence, it can be concluded that other QTLs significantly associated will be useful. Both OsZIP8 and OsZIP6 were up-regulated under Fe-Zn deficient condition as compared to the control situation (Fig. 8). Earlier studies also reported higher expression of these genes under deficient condition [72, 73]. Higher expression of both the genes in roots of Swarna (low grain Fe and Zn) under deficient condition indicates more uptake of the nutrient in the roots but it may not be getting properly channelized to the grain as evident from lower upregulation in shoots of Swarna, It might be utilising the nutrient more for its survival rather storing in the grains. But in case of Kalanamak (high grain Fe and Zn), both the genes showed almost equal up-regulation in roots and shoots thereby translocating Zn and Fe to the grain.

Conclusion

Polished rice consumed in India is very low in iron and zinc micronutrient content. Phenotyping results categorized genotypes based on grain Fe and Zn content each into 3 sub-classes. Evaluation results indicated that 3 genotypes namely IET24779, IET25465 and Chittimuthyalu produced at par yield with standard check variety IR64 along with high grain Fe-Zn content. Therefore, it is possible to improve grain Fe and Zn content in rice along with high grain yield. However, the studied panel showed a moderate level of genetic diversity for Fe and Zn content based on 100 molecular markers. Linkage disequilibrium for Fe and Zn content was observed in the studied population showing deviation from Hardy-Weinberg’s expectation based on F statistic values. The analysis detected 38% of the variation among populations, 52%among individuals and 10% within individuals in the studied panel population. The panel population was categorized into three sub-populations by the STRUCTURE analysis. The analysis also revealed a common primary ancestor for each sub-population with few admix individuals. Association mapping detected 10 QTLs for grain Fe and 7 for grain Zn content to be strongly associated through both Generalized Linear Model (GLM) and Mixed Linear Model (MLM). Novel QTLs namely qFe3.3 and qFe7.3 for grain Fe and qZn8.3 and qZn12.3 for grain Zn content were detected. Four QTLs controlling grain Fe and Zn were detected to be co-localized on the same chromosome. Besides, some Fe-Zn controlling QTLs were detected to be co-localised with the yield and its component QTLs. Incorporation of these QTLs may enhance grain yield too. The strongly associated markers with QTLs controlling high grain Fe-Zn namely qFe1.1, qFe3.1, qFe5.1, qFe7.1, qFe8.1, OsZIP8B (Zn-Fe regulated transporter), OsZIP6A (LOC_Os05g0164800), qFe3.3 and qFe7.3 for Fe and qZn6, qZn7, qZn2.2, gRMm9–1 (LOC_Os09g27330), qZn8.3 and qZn12.3 will be useful QTLs in stacking for developing nutrient dense rice.

Methods

Plant materials and field experiment

A total of 485 germplasm lines were screened for Fe and Zn content of milled rice during 2013 to 2015 [74,75,76]. The landraces and cultivars were obtained from ICAR- National Rice Research Institute (NRRI), gene bank while the Fe-Zn biofortified lines were from national coordinated trials. A panel population containing 102 shortlisted genotypes representing the three distinct phenotypic classes for grain Fe-Zn content and three check varieties were included in the panel for association study of the micronutrient contents in rice (Additional file 3: Table S1). The cultivars, Kalanamak and Chittimuthyalu for high Fe-Zn content while IR64 for grain yield were used as checks. The genotypes were planted in randomized block design with three replications during wet season, 2016 and 2017 at ICAR-NRRI, Cuttack following a standard procedure for a good crop. Phenotypic performances of the genotypes for days to 50% flowering, panicles/m2, iron and zinc content in milled rice and grain yield were recorded during both the years. Data analyses of the traits were performed using the software CROPSTAT v7.0.

Phenotyping for grain iron and zinc content in rice

Each harvested paddy sample was processed to white rice using Krishi international H-810 non-ferrous Huller and Krishi international K-710 polisher. Before dehusking, the grains were washed with 0.1 N HCl followed by rinsing with double distilled water to make the sample free from contamination. Iron and zinc content (mg/Kg) were determined at ICAR-Indian Institute of Rice Research, Hyderabad using energy dispersive x-ray fluorescence spectrophotometer (ED-XRF) X-supreme 8000 from 5 g polished rice samples. The published protocol of ED-XRF was used to estimate both the micro-nutrients [11]. These genotypes were grouped into three different phenotypic classes based on milled rice grain iron content for our study as high (> 4 mg kg− 1), medium (3–4 mg kg− 1) and low (< 3 mg kg− 1) and for zinc content as high (> 20 mg kg− 1), medium (15–20 mg kg− 1) and low (< 15 mg kg− 1). Principal component analysis (PCA) and scatter plots were generated as per standard procedure following previous publications [39, 40, 77].

DNA isolation, PCR amplification and visualization of markers for Fe and Zn content in polished rice

The seedlings of the 102 genotypes were grown and the leaf samples were collected from 20 days old seedling for genomic DNA isolation. The DNA was isolated following standard protocol [78]. One hundred SSR and gene specific markers across the genome were used for the study (Additional file 4: Table S2). The gene specific markers were selected based on reported literature on Fe-Zn content in rice. PCR was performed in 20 μl reaction volume and amplified by following our earlier report [17]. Agarose gel (3.5%) in TBE buffer (pH 8.0) was used for electrophoresis. Ethidium Bromide stain was used for visualisation of the amplicons. 50 bp DNA ladder was used to determine the size of the amplicon.

Analyses for determination of population structure, genetic diversity parameters and association of markers

A matrix was constructed on the basis of presence and/or absence of alleles for each genotype-primer combination. PowerMarker Ver3.25 software was used for estimating allele number, polymorphic information content (PIC), allele frequency, gene diversity and heterozygosis [79]. NEI coefficient for dissimilarity index was calculated and unweighted neighbor joining un-rooted tree was constructed [80] with bootstrap value of 1000 by using DARwin5 program [81]. A Bayesian clustering approach was followed for estimation of genetic structure by taking probable sub-populations (K) and higher delta K-value using the STRUCTURE 2.3.6 software [82]. A 150,000 burn-in period with 150,000 MCMC replications and K value run of 10 times was used for estimating population structure. The highest value of delta K estimated from Evanno table was taken as number of probable sub-populations present in the model and used in next step to detect the sub-population values in the panel population [17, 39, 40, 83, 84]. The deviation from Hardy-Weinberg expectation within and between the population structures (FIT, FIS, FST) was estimated through Analysis of molecular variance (AMOVA) using GenAlEx 6.5 software [85]. Marker-trait association for Fe and Zn content of polished rice was performed using TASSEL 5 program [86].

Linkage disequilibrium plot was constructed following earlier reports [17]. In order to enhance accuracy of marker- trait association, false discovery rate (FDR) and adjusted p values (q values) were calculated using the methods described in previous studies [40].

Gene expression analysis by quantitative real time-PCR

Two genotypes Kalanmak (high grain Fe and Zn) and Swarna (low grain Fe and Zn) were used for studying expression level of two genes OsZIP6 and OsZIP8 under Fe-Zn normal and deficient condition. Eleven day old seedlings were exposed to Fe-Zn normal and deficient condition under hydroponics culture in Yoshida growth medium was used for control, whereas both Fe and Zn were omitted from Yoshida medium for deficiency condition [72, 73, 87]. Expression analysis in response to Fe-Zn deficiency was performed for OsZIP6 and OsZIP8 genes using real-time primers [72]. Prior to real-time expression analysis, the total mRNA isolation, yield and purity evaluation of RNA were performed following the methods of earlier reports [88]. The increase/decrease of expression (fold change) of the OsZIP6 and OsZIP8 genes under Fe-Zn deficiency condition as compared to controlled condition was calculated by using 2-ΔΔ Ct method [89], where normalization was performed with rice β-tubulin gene [88]. Three technical replicates each from three independent biological replicates were taken used for the study following earlier reports [88].