Introduction

Rice constitutes a significant portion of the daily diet particularly in developing countries of Asia, supplying 50–80% of the daily caloric intake, proteins, minerals and vitamins etc1,2. The annual global rice production of 700 million tons was produced on an area of 158 million hectares in 20172. However, rice production must be increased by additional 8–10 million tons each year and there is a strong impetus to improve the nutritional value with multiple essential vitamins and minerals to meet food and nutritional demands of a growing population2. Thus, there is an urgent need to breed for rice varieties with improved yield and nutritional value for sustaining global food and nutritional security3,4.

Micronutrients such as Fe and Zn are highly essential for the normal growth and development of both plants and animals5. The recommended human intake of Fe and Zn is 15 mg per day6. However, it is rarely met by rice-based diets consumed in Asian populations7. Fe and Zn deficiencies significantly impact human health causing anemia, stunting, decreased immunity, poor cognitive development8. These have been recognized as global health concerns and need to be addressed on a priority basis9. Dietary diversification, food fortification, supplementation and biofortification have been advocated and are being employed to address malnutrition. Among these, biofortification of staple crops is in the forefront because of its affordability and easy access to the needy and targeted vulnerable populations, and the scale and impact that can be achieved by this intervention will be huge10.

Modern high-yielding rice varieties are poor source of nutrients in their polished form11. However, the huge genetic variability in grain mineral nutrients available in the rice germplasm can be exploited to develop biofortified high yielding rice varieties12. Even though genetic complexity and G x E were reported to be a hindrance for improving mineral content with high yield, much progress has been achieved in elucidating the genetic basis and G x E effect on grain micronutrient accumulation, yield and yield related traits in rice13,14.

Identification of new donors for agronomic, yield and mineral nutrient traits, dissecting their genetic basis and using them to diversify the breeding population is a continuous process in developing high-yielding and nutritious rice varieties15,16,17. Several QTLs and genes have been identified and successfully used in rice improvement. Marker-Assisted Breeding (MAB) and Genomic Selection (GS) are powerful approaches to improve multiple complex traits18,19. The mainstreaming of mineral nutrients (i.e. including nutritive value as a standard selection criterion within breeding programs), especially grain Zn in staple crops has been well recognized and the application of genomics can enhance the rate of genetic gain10,20.

Doubled haploid lines (DH) are important genetic resources for mapping QTLs because of their rapid development with less deleterious alleles, minimal background noise and these fixed lines can be readily evaluated across years and locations21. DH populations together with Single Nucleotide Polymorphisms (SNPs) are an ideal combination for QTL mapping. They are cost-effective to assay using automated platforms, while allele calling, data analysis, and the database is straightforward due to their bi-allelic nature22. In previous studies several DH populations were genetically characterized for yield, yield components and grain mineral elements23,24. These DH populations utilized japonica type rice cultivars as donors in which grain Zn ranged from 10.37 to 18.00 ppm in brown rice form18,24,25. On the other hand, other studies used Backcross Introgression Lines (BILs) derived from O. sativa ‘Nipponbare’ and O. meridionalis W162726, O. sativa cv. Swarna x O. nivara27, and Oryza sativa x O. rufipogon28.

Genomics based breeding approaches including MAS are becoming more common and to be efficient, research breeding programs need robust and stable QTLs across environments and genetic backgrounds29. Genotype by environment interactions (GEI) are the expression of QTLs and GEI QTLs are important as they significantly influence the total phenotypic variance and additive effect of the main effect QTL30. Moreover, haplotype-based allele mining was used to detect allelic variation in genes controlling agronomic traits31. On the other hand, gene pyramiding is important in improving the efficiency of biofortification breeding with SNP chip technology, allowing the development of lines with simultaneously introgressed small-effect QTLs for the precise development of genetic stocks32.

We characterized a DH population for yield, yield related traits and grain mineral elements, identified QTLs and candidate genes for yield and yield components, grain Fe and Zn; analyzed epistatic interactions, QTL x environment interactions, QTL pyramiding effects and haplotype analyses of major QTLs for YLD and Zn.

Results

Phenotypic analysis in DH population

The DH population showed wide variability for all traits measured in all three seasons (Supplementary Fig. 1). For agronomic traits, DF ranged from 71.7 to 106.2 days with a mean of 86.9 days. PH ranged from 70.1 to 127.7 cm with a mean of 95.5 cm while YLD ranged from 2669.9 to 7409.1 kg ha−1 with a mean of 5446.1 kg ha−1. GL ranged from 6.4 to 10.3 mm with a mean of 9.02 mm and GW ranged from 1.8 to 2.3 mm with a mean of 1.97 mm. In grain Fe and Zn concentrations, Fe ranged from 2.8 to 5.8 ppm with mean of 4.1 ppm while Zn ranged from 8.7 to 19.7 ppm with a mean of 12.6 ppm (Table 1). The parent IR05F102 exhibited higher values for agronomic traits NT, NP, and YLD in all three seasons while IR69428 exhibited higher values for DF, PH, TGW, GW, Fe and Zn across three seasons and BLUEs. Coefficient of variation for agronomic traits ranged from 2.88% to 19.20%. Low CVs (<10%) were recorded for DF, PH, TGW, GL, and GW. Genotypic effects were highly significant and heritability values were high (0.52–0.98) for all agronomic traits and grain Zn concentration while lower heritabilities were observed for Fe, viz. 0.22 in S3 and 0.49 in S1 (Table 1). Pearson’s correlation co-efficient between BLUEs and the three season’s data showed significant positive correlations ranging between 0.52 and 0.97. Of the 45 possible correlations; 27 were significant in S1, with 12 positively correlated and 15 negatively correlated; while 26 were significant in S2, with 11 positively correlated and 15 negatively correlated; and 20 were significant in S3, of which 11 were positively correlated and nine were negatively correlated. A total of 29 correlations were significant using BLUEs, 13 of them were positive and 16 were negative (Fig. 1, Supplementary Table 1). Consistent strong positive correlations (p < 0.001) were observed between NT & NP, PH & YLD, PH & GL, YLD & GL, TGW & GW, Zn & GW, and Zn & Fe. Meanwhile, strong negative correlations were observed between DF & TGW, GW & NT, GW & NP, GW & YLD. The first two principal components (PC) accounted for 55.94% of the total variation (Fig. 2). PC1 contributed 30.34% of the total variation while PC2 contributed 25.60%. For PC1, GL and YLD showed the highest positive loadings (0.46 and 0.44) among the agronomic traits analyzed, and Fe and Zn exhibited negative loadings (−0.43 and −0.46). This indicates that GL, YLD, Fe and Zn contributed most of the variation in PC1 (Supplementary Table 2). In PC2, TGW and GW contributed most of the variation with loadings of 0.37 and 0.40, respectively.

Table 1 Phenotypic variation for agronomic traits, Fe and Zn concentration in DH population.
Figure 1
figure 1

Correlations among agronomic traits and grain Fe and Zn concentrations in DH population. *Indicates significance at p ≤ 0.05. **Indicates significance at p ≤ 0.01. ***Indicates significance at p ≤ 0.001. †DF: days to flowering (days); PH: plant height (cm); NT: number of tillers; NP: number of panicles; YLD: yield (kgha−1); TGW: thousand grain weight (g); GL: grain length (mm); GW: grain width (mm); Fe: Iron (ppm); and Zn: Zinc (ppm).

Figure 2
figure 2

Principal component analysis in DH population using the two PC with highest proportion of variance. Numbers inside the box correspond to DH lines.

Identification of QTLs

A total of 23 QTLs for agronomic traits and grain Fe and Zn concentrations were identified in the DH population based on BLUEs (Table 2, Fig. 3). They were distributed on chromosomes 1, 2, 3, 5, 6, 7, 9, 10, and 12. The phenotypic variance explained (PVE) by the QTLs ranged from 7.18% to 22.03%. Two QTLs for DF (qDF1.1 and qDF3.1) were identified on chromosomes 1 and 3. These QTLs explained 14.82% and 12.63% PV. QTL qDF3.1 was contributed by the high-yielding parent IR05F102. One QTL each was identified on chromosome 3 for PH, NT and NP, each explained more than 12% PV. Four QTLs for YLD (qYLD1.1, qYLD3.1, qYLD7.1 and qDF12.1) were identified on chromosomes 1, 3, 7, and 12 contributed by IR05F102. Three of them had PVE values of more than 10%. qYLD3.1 had the highest PVE (22.03%). One QTL for TGW was identified on chromosome 6 (qTGW6.1) contributed by IR05F102 with a PVE of 15.79 %. Similarly, four QTLs were identified for GL, one each on chromosomes 1, 6, 7 and 10. The PVE of these QTLs varied from 7.35% to 16.45%. qGL1.1, qGL6.1, and qGL7.1 had a PVE of more than 10% each. qGL6.1 had the highest PVE (16.45%). While, three QTLs for GW (qGW2.1, qGW5.1, and qGW9.1) were identified on chromosomes 2, 5, and 9 all derived from IR69428 with PVE values of 11.71%, 10.31%, and 14.76%, respectively. Two QTLs for Fe (qFe9.1 and qFe12.1) were identified on chromosomes 9 and 12 derived from IR69428 with PVE values of 11.79% and 13.34%, respectively. But for Zn, four QTLs were identified on chromosomes 1, 5, 9, and 12 contributed by IR69428. Three (qZn5.1, qZn9.1, qZn12.1) of them had PVE values of more than 10%. qZn12.1 had the highest PVE (15.26%).

Table 2 QTLs for agronomic traits and grain element concentrations in DH population.
Figure 3
figure 3

QTLs identified for agronomic traits and grain micronutrient concentrations in DH population. †Blue font: QTLs for agronomic traits; red font: QTLs for grain element concentrations.

QTL co-locations

Three QTL co-locations consisting of QTLs for two traits each were observed on chromosomes 3 and 9, two for agronomic traits and one for grain mineral elements (Fig. 3). Interestingly, these traits that share common QTLs were positively correlated. Agronomic trait QTLs: qDF3.1 and qPH3.1 were co-located on chromosome 3 at 33.97–33.98 Mb within SNP marker interval 3500757–3501392 with allelic contribution from P1. QTLs qNT3.1 and qNP3.1 were co-located on chromosome 3 at 34.72–35.23 Mb within SNP marker interval id3017089 - 3536043 and with allelic contribution from P1. On the other hand, grain mineral element QTLs, qFe9.1 and qZn9.1 were co-located on chromosome 9 at 19.79–20.13 Mb within SNP marker interval id9809545 - 9819278 Mb with allelic contribution from P2.

QTL x environment interaction analysis

To understand the stability and interaction of the identified QTLs with environment, QEI analysis was conducted. Estimated effects of 10 consistent QEI QTLs can be found in Table 3. One consistent QEI QTL was detected for DF on chromosome 1; two for YLD on chromosomes 3 and 7; two for GL on chromosomes 1 and 10; three for GW on chromosomes 3, 5 and 9; and two for Zn on chromosomes 9 and 12 (Fig. 4). All identified QEI QTLs were stable as indicated by the larger LODA (1.55–9.67) than LODAE (0.12–3.83) values and larger PVEA (1.93–13.91%) than PVEAE (0.05–3.21%) values (Table 3).

Table 4 Candidate genes for agronomic traits and grain Fe and Zn concentrations identified in the DH population.
Figure 4
figure 4

QTL by Environment Mapping: Chromosome locations of QTLs identified in the DH population.

Epistatic interaction analysis

One epistatic interaction for PH was identified in the DH population (Supplementary Table 4; Fig. S2). The major di-genic epistatic interaction was identified between loci on chromosomes 3 with SNP marker interval id3495083 - 3500757 Mb and on chromosome 7 with SNP marker interval id7862147 - 7892971 Mb with a PVE value of 28.40%.

Candidate genes underlying QTLs for agronomic traits, Fe and Zn

Seven agronomic trait QTLs and five Fe and Zn QTLs were harbored in or near candidate genes (Table 4). For agronomic traits, two candidate genes (OsLFL1 and OsHD6) were identified for DF located on chromosomes 1 and 3. OsLFL1 with locus id Os01g0713600 was identified as a gene for flower development while OsHD6 with locus id Os03g0762000 was for days to heading. One candidate gene, OsLTS1 was identified for PH on chromosome 3 with locus id Os03g0837300. Four candidate genes were identified for YLD such as, OsDST, OsIPT4, OsMED5_3, and OsNAC139 on chromosomes 3, 7, and 12, respectively. Two candidate genes for GW, OsSRS3 and OsGS5 were identified on chromosome 5. Among QTLs for grain mineral element concentrations, for Fe, OsLysM-RLK10 and OsSWEET13 were candidate genes identified on chromosome 9 and 12, while candidate genes OsGATA8, OsSar1b, OsZIP6, and Os09g0511500 on chromosomes 1, 5, and 9 were identified for Zn (Supplementary Figs. S3 and S4). There were significant sequence differences in the candidate genic regions of the major QTLs (Supplementary Table S5).

Table 3 Estimated effects of QTL by Environment Interaction QTLs of agronomic traits and grain element concentrations identified in the DH population.

QTL classes for YLD and Zn

To understand the effects of combining the identified QTLs for YLD and Zn in this study, QTL classes were calculated. The mean YLD of DH lines without QTLs was 4430.9 kg ha−1. On the other hand, the mean Zn of the DH lines without QTLs was 11.18 ppm. The genotype classes for YLD and Zn showed a wide range of 4129.4–6373.0 kg ha−1 and 11.1–17.7 ppm, respectively. An increase in YLD and Zn was observed with an increase in the number of QTLs (Tables 5 and S3). The IR05F102 allele at all four loci increased YLD. DH lines with three to four QTLs (genotype class 15 and 16) showed the highest YLD, 6373.0 kg ha−1 and 6285.7 kg ha−1, respectively, among all YLD genotype classes. The IR69428 alleles at all four loci increased Zn. DH lines with four QTLs (qZn1.1 + qZn5.1 + qZn9.1 + qZn12.1) showed the highest Zn, 17.7 ppm, among all Zn genotype classes.

Table 5 Means and effects of 12 QTL classes for Zn across three seasons.

Identification of DH lines with good agronomic traits and high grain Zn

The top ten high Zn lines were identified using BLUEs of the 148 DH lines (Table 6). IR 91153-AC 113-1 was early for DF and had the highest YLD (6697.23 kg ha−1) among the top lines. All four YLD QTLs (qYLD1.1, qYLD3.1, qYLD7.1, qYLD12.1) and one Zn QTL (qZn12.1) were detected in this line. IR 91153-AC726-1 had the highest value for Zn and Fe of 16.21 and 4.82 ppm, respectively. Two Zn QTLs were identified in this line (qZn5.1, qZn12.1) and one YLD QTL (qYLD1.1). These DH lines can be used as donors in breeding programs or can be directly tested in multi-location trials to further evaluate their performance and potential to be released as high Zn rice varieties.

Table 6 Agronomic traits and Fe and Zn concentrations of the parents and top ten high-Zn rice DH lines.

Discussion

Compared to other cereals, rice is a poor source of essential micronutrients to fulfill daily human nutritional requirements33. Zn biofortification is considered to be a major solution that appears to be the most sustainable and cost-effective approach for addressing micronutrient malnutrition or hidden hunger34. Knowledge of genetic variation in agronomic traits, grain mineral elements concentration such as Fe and Zn and genes underlying allelic variation is vital for rice biofortification breeding to fast track the development of rice varieties that can make a positive contribution to human health. In this study, a DH population of rice was used to map QTLs for agronomic traits, Fe and Zn concentrations with SNP markers and BLUEs. Epistatic interactions, QTL by environment interactions, QTL pyramiding effects and haplotype analysis of major QTLs for YLD and Zn were also examined.

The DH population showed wide variation in all traits studied which specifies the traits complex and polygenic nature. BLUEs for grain Fe showed relatively little variation while grain Zn ranged from 8.7 to 19.7 ppm. The variation observed in the DH population for agronomic traits and grain Zn is important and a prerequisite for QTL mapping35. The variation indicates that a proportion of the phenotypic variance can be attributed to genotypic variance and that the agronomic traits and grain Zn can be exploited through selective breeding36. A wide range of variation also displayed the role of genotype as well as the effects of environment on the expression of the studied traits37. Several prior studies have also reported significant genetic variation specifically for grain Zn concentration38,39,40 which supports that selection for high levels of Zn coupled with advanced molecular marker technologies is a feasible approach.

High H2 values (>0.50) for all agronomic traits and grain Zn concentration were observed in the DH population which is consistent with previous studies that also work with rice agronomic traits and grain micronutrient concentrations41,42,43. However, the heritability was much lower for grain Fe concentration. Reliability of early generation selection of highly heritable agronomic traits (DF, PH, TGW, GW, and GL) is highly feasible, which may result in significant response to selection44. The low heritability observed for Fe indicates significant influence of environment and early selection for Fe is not feasible.

The first two PCs explained 55.94% of the total variation observed in the DH population. In general, principal components with vector coefficients more than 0.3 irrespective of their direction of influence are considered important45. Component loadings for each principal component revealed agronomic traits, such as YLD, TGW, GL and GW were among the phenotypic traits contributing positive projections while NT, NP, Fe and Zn contributing negative projections in two principal components which explained 30.34% and 25.60% of variation respectively. Through PCA we identified a number of traits which are responsible for the observed genotypic variation in the DH population, thereby identifying the traits with greatest impact on the phenotype of DH lines, which is informative for the selection of lines in breeding programs.

Analysis of relationships among quantitative traits is important for assessing the feasibility of joint selection of two or more traits and the effect of selection for secondary traits in genetic gain for the primary trait46. A positive genetic correlation between two desirable traits makes the job of the plant breeder easy in improving both traits simultaneously47. Clear insight on genetic correlations of agronomic traits and grain Fe and Zn concentrations can help plant breeder devise a suitable breeding strategy to enhance micronutrient density in rice. In the present study, DF was negatively correlated with TGW in BLUEs and across seasons, while PH was positively correlated with YLD. The trait NT was positively correlated with NP and YLD, but negatively correlated with GW. Similarly, GL was negatively correlated with Fe, while Fe and Zn were positively correlated. Previous studies also reported a positive correlation between grain Fe and Zn concentrations in rice15,16,39,40 which infers common mechanisms for uptake, translocation and loading of Fe and Zn, and suggests that grain Zn can also be used for grain Fe selection. Negative correlations between YLD and Zn have also been observed39,40, though it was not consistent, and these highlight the need for biofortification breeding programs to give significant weight to YLD while selecting for high grain Zn genotypes.

Considering the agronomic traits such as DF, PH, and YLD, Fe and Zn, the present study identified a DH line for which YLD was 6697.23 kg ha−1 with a Zn content of 15.11 ppm (IR 91153-AC 113-1). The high Zn lines identified in this study could be used in biofortification breeding programs to improve the grain Zn levels in rice resulting in improved rice varieties that could directly impact human nutrition, especially in populations that heavily rely on rice-based diets.

In this study, seventeen QTLs were mapped for agronomic traits and six QTLs were identified for Fe and Zn. It was notable that all the QTLs for grain Fe and Zn concentrations as well as QTLs for GW were derived from IR69428. Meanwhile, IR05F102 contributed all the QTLs for agronomic traits PH, NP, NT, YLD, TGW, and GL. These QTLs are good candidates for further studies such as gene fine-mapping and cloning. Our QTL results corroborated with recent studies on QTL mapping for agronomic traits and grain Fe and Zn concentrations in rice using diverse populations such as RILs26, BILs28, BC2F3 derived lines19, DH lines15, and MAGIC lines40 that revealed multiple loci located on all rice chromosomes. Consequently, these QTLs were detected in different environments and genetic backgrounds. The above results clearly showed the complexity of agronomic traits and grain Zn concentration14,15,42. Since the present study used BLUE values which excluded environmental effects and magnify genetic effects, the QTLs identified in this study may well prove more useful in rice biofortification breeding programs than the QTLs identified using single environment phenotypic values.

Haplotypes for two stable and major effect QTLs for grain Zn were further examined (Fig. 5). For qZn5.1, two haplotypes, CA and AG were observed in the DH population indicating that there is a tight linkage between the two loci. The CA haplotype is associated with high Zn. Similarly, the two haplotypes in qZn12.1 were GC and AA. The latter was associated with high Zn. The stability of the markers in these loci has clearly shown its potential for use in biofortification breeding programs although further validation is still necessary.

Figure 5
figure 5

The haplotypes of the major effect QTLs for grain Zn, qZn5.1 (A) and qZn12.1 (B), and their effect on Zn phenotype in different environments. Blue and orange color indicates different haplotypes within the QTLs, n indicates the number of DH lines with different haplotypes.

Co-localized QTLs are important for simultaneous improvement of traits of interest such as grain Fe and Zn concentrations in rice from plant breeder point of view48. QTLs controlling correlated traits were usually mapped in the same or close chromosome regions49. Some previous studies have reported the phenomenon of QTL co-locations for rice micronutrient concentrations15,19,23. In this study, several co-located QTLs linked with correlated traits were identified on chromosomes 3 and 9. For instance in agronomic traits, qDF3.1 and qPH3.1 were co-localized on chromosome 3 in the same SNP marker interval 3500757–3501392. For grain Fe and Zn, qFe9.1 and qZn9.1 were co-localized in the same SNP marker interval 9809545–9819278. These co-located QTLs may be the result of pleiotropy or multiple tight linkage of genes controlling the traits50. These regions might be useful for development of high grain Zn rice with highly productive agronomic traits. The co-location of QTLs for Fe and Zn will be beneficial for their simultaneous improvement with Marker Assisted Breeding15,19,23.

Identification of environment-specific and stable QTLs having consistent genetic effects across a wide range of environments is of great importance in rice biofortification breeding37,40. Several QTLs for agronomic traits and grain Fe and Zn concentrations have been identified, but the positions, effect sizes and genetic effect directions of these QTLs were mostly genotype and environment specific51. Studies on QEI contribute to the effective use of marker-assisted selection (MAS) in biofortification breeding, better understanding of the genetic architecture of grain Zn and agronomic traits, and the interactions of the genotype and environment52. The QTLs that were only detected in one environment mostly showed poor stability. All the consistent QTLs identified in this study: one for DF; two for YLD, GL, and Zn; and three for GW based on QEI analysis confirms stability as indicated by higher LODA and PVEA values as compared to LODAE and PVEAE values. The environment plays only a minor role in the additive effects of these QTLs. For instance, the additive effects of Zn QTL qZn12.1 were 0.38 in S1, whereas in S2 the additive effects were only 0.11.

Expression of the phenotype is a result of polygenes interacting to control complex traits53. Understanding epistatic interactions can be useful to target genes through marker-assisted selection strategies for the improvement of complex traits such as YLD and grain Zn. In this study, we detected one epistatic interaction on chromosomes 3 and 7 for PH. The total variation explained by epistatic loci for PH were significant suggesting that epistasis in the form of additive by additive interactions, plays a very important role in controlling PH.

Several QTLs harbor genes for agronomic traits and grain Fe and Zn concentration. In this study, genes for flowering development on the chromosome 1 and days to heading on chromosome 3 were co-located with QTLs identified for DF54,55. OsDST, OsIPT4, OsMED5_3, and OsNAC139 genes for grain number, 1000-seed weight, and grain width56 co-located with QTLs identified for YLD on chromosomes 3, 7, and 12. The candidate genes for grain Fe and Zn concentrations are involved in uptake, translocation, and metal homeostasis within rice plants. A network analysis of genes Sar1a and SWEET13 were conducted to understand their interaction with other traits and genes through the Knetminer program (Supplementary Figs. S3 and S4). Sar1a was positioned within 12.71–13.79 Mb on chromosome 1 for Zn while SWEET13 was positioned within 16.98–17.57 Mb on chromosome 12 for Fe. Six QTLs and 35 traits were linked to Sar1a, while 12 QTLs and 42 traits were linked to SWEET13. A QTL for Zn on chromosome 5 was linked to candidate gene OsZIP6, which translocates Fe and Zn and may be responsible for high concentrations of these micronutrients in the grains57,58 These candidate genes which were tightly linked with QTLs identified are worthy of further investigation.

QTL pyramiding is successfully utilized in rice breeding to develop lines resistance/tolerance to biotic and abiotic stresses59,60. To evaluate the pyramiding effect of each QTL in the present study, we compared the means of YLD and Zn between genotype classes in all combinations. Although, the arithmetic mean of YLD of all the genotype classes were higher than that of class 1 (no QTLs), no significant differences were detected. However, the means of class 15 (6373.00 kg ha−1) and class 16 (6285.74 kg ha−1) were significantly higher than that of class 4 (4129.43 kg ha−1). On the other hand, the mean Zn concentration of classes 3 to 11 ranged from 11.84 to 15.73 ppm, which was higher than that of class 0, though there were no significant differences detected among these groups. A significant difference was detected between class 12 (17.70 ppm) and class 1 (11.18 ppm). Hence, it will be necessary to pyramid favorable alleles of Zn and YLD QTLs in popular variety background to accumulate a large number of desirable alleles.

Conclusion

The DH population showed wide variation for agronomic traits, grain Fe and Zn. High heritability was obtained for DF, PH, and GL and moderate for TGW and Zn. Significant positive correlations between PH & YLD, YLD & GL, TGW & GW, and Fe & Zn were observed. Most of the QTL alleles associated with improved grain Zn and Fe were contributed by the donor parent IR69428. Six QTLs contributed more than 15% of the phenotypic variance; qNT3.1, qNP3.1, qYLD3.1, qTGW6.1, qGL6.1, and qZn12.1. Based on QEI analysis, all the consistent QTLs were relatively stable. The epistatic QTL analysis detected one significant di-genic interaction for PH with 28.40% PVE. QTLs for grain Fe and Zn concentrations were associated with the candidate genes invovlved in metal homeostasis. A positive correlation between YLD and Zn levels was found in several combinations of three to four QTLs. The high Zn lines identified in this study could be used in biofortification breeding program to improve the grain Zn levels in rice resulting in improved rice varieties that could directly impact human nutrition.

Methods

Plant materials

A doubled-haploid (DH) population, derived from IR05F102 (P1) x IR69428 (P2) cross composed of 148 lines, was used in the study. The mapping population was evaluated under irrigated conditions during the dry seasons of 2016 and 2017 (S1, S2) and wet season of 2017 (S3) in Robert Zeigler Experimental Station (ZES) at International Rice Research Institute (IRRI), Los Baños (LB), Laguna. The experiment was laid out in Randomized Complete Block Design (RCBD) with two to three replications per trial. Seedlings were transplanted at 21 days with a spacing of 20 × 20 cm, and the plant population was 2 rows of 10 hills in each plot. Standard agronomic practices and plant protection measures were applied to ensure good crop growth and complete grain development.

Phenotyping

The population was evaluated for eight agronomic traits and two grain mineral elements. The agronomic traits were measured following the standard evaluation system61, including days to 50% flowering (DF), plant height (PH), number of tillers (NT), number of panicles (NP), grain yield (YLD), thousand grain weight (TGW), grain length (GL), and grain width (GW). For grain Fe and Zn analysis, 50 g paddy samples were dehulled using a Satake dehuller and milled for one minute using a K-710 mini-lab rice polisher. Milled rice samples weighing at least 3 g representing each plot were analyzed using X-ray Fluorescence Spectrometry (XRF) (Oxford)62. Measurements were done twice per sample and was expressed in parts per million (ppm). The average reading per plot was considered for further statistical analysis.

Statistical analysis

A summary of basic statistical parameters was generated using STAR v.2.0.1. Analysis of Variance (ANOVA) was performed on single-season data and combined season data using one-stage multi-environment analysis implemented in PBTools v1.4. Best linear unbiased estimates (BLUEs) were generated by setting genotype effects as fixed and season effects as random. The BLUEs were used to perform QTL analysis. Boxplots and Pearson,s correlation coefficients between pairs of traits were estimated using R Core team63.

The model used for ANOVA was:

$${{Y}}_{{ijko}}={\mu }+{{\alpha }}_{{i}}+{{r}}_{{j}}+{{l}}_{{kj}}+{{b}}_{{ojk}}+{{\varepsilon }}_{{ijko}}$$

where μ is the overall mean, αi is the effect of the ith genotype; rj is the effect of the jth season, lkj the effect of the kth replicate within the jth season, bojk was the effect of the oth block at the jth season of the kth replicate and εijk the error. The genotypes were considered fixed while replicates and blocks within replicates were random.

Broad-sense heritability (H2) for each trait in each season was calculated as:

$${{H}}^{{2}}=\frac{{{\sigma }}_{{g}}^{{2}}}{{{\sigma }}_{{p}}^{{2}}}\,and\,{{\sigma }}_{{p}}^{{2}}={{\sigma }}_{{g}}^{{2}}+\frac{{{\sigma }}_{{e}}^{{2}}}{{r}}$$

where \({{\sigma }}_{{p}}^{{2}}\) is the phenotypic variance, \({{\sigma }}_{{g}}^{{2}}\) is the genotypic variance, \({{\sigma }}_{{e}}^{{2}}\) is the error variance, and r is the number of replications.

Genotyping

Plant genomic DNAs were extracted from leaf tissue using the cetyl trimethylammonium bromide (CTAB) method64. Quality check of DNA samples was carried out in 1% agarose gel. We submitted ~50 ng of DNA from the complete set of the doubled-haploid mapping population, along with the parents for genotyping with 7 K SNP array technology at the Genotyping Services Laboratory (GSL) at IRRI. Scanned image calls and automatic allele calling were loaded in the Illumina Genome Studio data analysis version V2011.1.

Linkage mapping and QTL analysis

The linkage map of the DH population was constructed using 379 high quality SNP markers using IciMapping v4.165. The distribution of SNP markers varied across chromosomes. The number of SNPs per chromosome ranged from 15 SNPs (chromosomes 8 and 9) to 57 SNPs (chromosome 3). The total length of the linkage map was 1,629.6 cM while the average interval length was 4.3 cM (Supplementary Fig. S5). The linkage map was created using Kosambi function66. For analysis of QTLs, the BLUEs of each line in the DH population were used. The permutation method was used to obtain an empirical threshold for claiming QTLs based on 1000 runs of randomly shuffling the trait values at 95% confidence level using the BIP function, whereas epistatic interactions were identified by setting the logarithm of odds (LOD) threshold value at 6.0 utilizing inclusive composite interval mapping (ICIM) model in QTL IciMapping ver.4.1. The MET function in ICIMapping was used to study QTL x Environment Interaction. QTLs were visualized using MapChart v.2.367.

Candidate gene analysis

The physical position of each QTL was determined by the position of the flanking SNP markers and genes physically located within or near QTLs for agronomic and grain Fe and Zn traits were considered candidate genes. Annotated genes with functions related to agronomic traits, metal transport and homeostasis were compiled and the physical positions of annotated genes were determined using the RAP DB Genome Browser68 (http://rapdb.dna.affrc.go.jp/viewer/gbrowse/ irgsp1). Annotation and functions attributed to different candidate genes were downloaded from Oryzabase69 (https://shigen.nig.ac.jp/rice/oryzabase/gene). A network of the top hit candidate genes and QTLs was created using K-netminer program70 (http://knetminer.rothamsted.ac.uk/Oryza_sativa/).