Abstract
Sesame is an important and specific oilseed crop with high quality. Here, we systematically investigated the relationships between plant seed yield (PSY) and nine related traits in 369 sesame core accessions under five environments. The PSY and related traits had general heritability ranging from 14% for PSY to 72% for height to the first capsule (FCH) and final flowering stage (FFS). Correlation analysis suggested that seven yield-related traits, including capsule number per plant (CNP, r = 0.67), capsule stem length (CSL, 0.56), thousand-seed weight (TSW, 0.26), plant height (PH, 0.24), FFS (0.17), initial flowering stage (IFS, − 0.16) and FCH (− 0.14) were the key factors associated with seed yield in sesame (P < 0.01). For the uniculm and branching groups, CNP, CSL, and TSW were the common key factors. Path analysis showed similar direct effects of these key factors on PSY. Association mapping identified 13 simple sequence repeat (SSR) markers significantly associated with the nine seed yield-related traits (P < 0.001). Among these, three SSR markers were repeatedly detected in two environments. Marker Hs635 was associated with the seed number per capsule (SNC). Hs345 was associated with the CNP. Hs618 was associated with the IFS in uniculm accessions. These results provide insights into the traits that influence PSY and their genetics in sesame.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Sesame (Sesamum indicum L.) is an ancient and worldwide oilseed crop with high oil quality (Bedigian et al. 1985; Bedigian and Harlan 1986; Ashri 1998; Zhang et al. 2019); it is widely grown in tropical and subtropical regions due to its high tolerance to high temperatures and barren soil. Sesame seeds are abundant in unsaturated fatty acids (oil content, ~ 55%), proteins (~ 20%), and antioxidant lignans (~ 0.5%), such as sesamin (Anilakumar et al. 2010; Zhang et al. 2019). Therefore, sesame seeds are widely used in food and medicines (Kanu et al. 2010; Zhang et al. 2019). Nevertheless, sesame is a crop with a low seed yield and relatively low economic efficiency (~ 0.2) (Zhang et al. 2019). Therefore, it is necessary to better understand the genetic and molecular bases underlying seed yield and its component traits to efficiently improve sesame seed yield.
Seed yield is one of the most important and complex quantitative traits of crops. Research has shown that seed yield is controlled by multiple loci in sesame (Zhang et al. 2019). In addition to population density, a dozen agronomic traits contribute to capsule and seed formation, such as capsule number per plant (CNP), seed number per capsule (SNC), seed size or weight, plant type (uniculm or branching), plant growth and development rhythm (e.g. budding stage), initial flowering stage (IFS), and final flowering stage (FFS). The ability of sesame to adapt to environments, such as resistance to diseases and tolerance to waterlogging and other abiotic stresses, also affects the final seed yield per plant (PSY) (Biabani and Pakniyat 2008; Muhamman et al. 2010; Akbar et al. 2011; Tabatabaei et al. 2011; Daniya et al. 2013; Jatothu et al. 2013; Zhang et al. 2019).
Previous research indicated that 1000-seed weight (TSW), capsule stem length (CSL), CNP, and plant height (PH) are the key seed yield component traits in sesame because of their direct effects on seed yield in a field unit (Solanki and Gupta 2000; Yol et al. 2010). Goudappagoudra et al. (2011) found that CNP was positively correlated with seed yield (r = 0.732), followed by seed number per plant (SNP), branch number per plant (BNP), PH, and TSW. Akbar et al. (2011) showed that the CNP, PH, capsule length (CL), and TSW significantly and positively contributed to seed yield. The BNP, seed weight per plant (SWP), and height to the first capsule (FCH) indirectly contributed to the final seed yield, and thus, were regarded as seed yield-related traits (Sarwar et al. 2005; Daniya et al. 2013). Miao et al. (2020) created a dwarf mutant ‘dw607’ (dwf1) with a significantly shorter internode length (decreased from 8.0 to 4.2 cm) by EMS-induced mutagenesis. Interestingly, the dwarf cultivar, Yuzhi Dw607, selected from the ‘dw607’ mutant, had a capsule node number of 62 and a seed yield of 3450 kg per hectare in Xinjiang province, China (Zhang et al. 2021). The decreased PH and reduced internode length resulted in higher seed yield and tolerance to lodging (Miao et al. 2020), suggesting that PH, internode length, and lodging tolerance are also seed yield-related traits that contribute to the final sesame seed harvest.
Genetic and molecular research on seed yield and its related traits provided information necessary for understanding the relationships between seed yield and its related traits, as well as its genetic and molecular bases. The genetic map of sesame provides a good foundation and platform for sesame molecular breeding (Wei et al. 2009; Zhang et al. 2013). Recently, with the aid of the newly developed sesame reference genome, resequencing data intensive quantitative trait loci (QTL) mapping and genome-wide association study (GWAS) research have been conducted on seed yield-related traits in sesame (Zhang et al. 2019). Several candidate genes for these traits have been identified (Wu et al. 2014; Wei et al. 2015, 2019; Zhou et al. 2018; Zhang et al. 2019; Miao et al. 2020; Mei et al. 2021). Wu et al. (2014) constructed a high-density genetic map using a recombinant inbred line (RIL) population consisting of 224 RILs based on restriction-site associated DNA sequencing (RAD-seq) data and identified 13 QTLs for seven seed yield-related traits. Of the three QTLs with an R2 value > 10.0%, Qtgw-11 (R2 = 7.7–12.3%) was responsible for TSW, Qgn-6 (R2 = 8.0–18.3%) for SNC, and Qcl-12 (R2 = 52.2–75.6%) for CL (Wu et al. 2014). Wei et al. (2015) performed a genome-wide association study (GWAS) for 56 agronomic traits, including 14 yield-related traits, with a panel of 705 sesame accessions and identified 15 SNP/InDel markers (P < 10−6) associated with seed yield traits. Zhou et al. (2018) used GWAS on 39 seed yield-related traits with the same sesame accession panel across three environments, from which 6 environment-consistent QTLs and 76 QTLs, each associated with 2–5 seed yield-related traits and 48 candidate genes, were identified. Miao et al. (2020) cloned the sesame dwarf gene Sidwf1 using an RIL population derived from the dw607 (dwf1) mutant and the wild-type accession 15N41 (wt type) with the aid of a new method designated as cross-population association mapping. Similar to the green revolution genes Rht-B1 and Rht-D1, in wheat (Peng et al. 1999), SiDWF1 encodes a gibberellin receptor GID1B-like protein that controls internode length and PH in sesame. An SNP mutation in Sidwf1 resulted in a decrease in internode length and PH, thereby enhancing the seed yield of the mutant (Miao et al. 2020).
However, the basic characteristics of an ideal sesame cultivar with high seed yield remain unclear. Comprehensive knowledge of the relationship between seed yield and yield-related traits can help facilitate the breeding of high seed-yielding sesame cultivars. Thus, we systematically surveyed the phenotypic characteristics that could potentially contribute to seed yield and examined their relationships with plant seed yield (PSY) using a natural population consisting of 369 sesame core accessions from five environments. We conducted a GWAS on seed yield and its component traits with 369 sesame core accessions in five environments and screened the candidate genes for seed yield-related traits. The findings of this study provide the genetic and molecular bases and knowledge necessary for breeding high-yielding sesame plants.
Materials and methods
Plant materials
A panel of 369 sesame core accessions collected from 16 countries (Table S1) was used in this study. 318 of the accessions were collected from 19 provinces in China and 51 from other 15 countries. These accessions are available at the Sesame Germplasm Reservoir of the Henan Sesame Research Center, Henan Academy of Agricultural Sciences, China (HSRC, HAAS, China).
Phenotypic data collection of seed yield and yield-related traits
Each of the 369 accessions was self-pollinated for six or more generations before phenotyping. The 369 accessions were grown in five environments during the 2011 and 2012 seasons in Henan Province, including Pingyu (114.62°E, 32.97°N) (2011 and 2012), Xinyang (114.08°E, 32.13°N) (2012) and Yuanyang (113.96°E, 35.05°N) (2011 and 2012). A randomized block design with three replicates was applied for the field experiments (Edmondson 2005; Verdooren 2020). Each entry plot contained two rows, 5 m long and with a 40 cm distance between the rows. Field management was performed using routine treatments in the sesame field trials and breeding.
Plant seed yield (PSY) and its nine related traits, including PH, FCH, tip length (TL), CSL, CNP, SNC, TSW, initial flowering stage (IFS), and FFS were examined as described by Zhang and Feng (2006) and Langham (2017a, b, 2018). Accordingly, plants with branches number ≤ 2 were classified as “uniculm type”, and those with branches number ≥ 3 as “branched type”. PH was measured from the ground to the stem tip for uniculm accession plants and to the main stem tip for branching of the accession plants. The CSL was the stem length of a uniculm accession or the main stem length of a branching accession. TSW was measured using the SC-G machine (WSeen Detection Technology Company, Hangzhou, China).
Phenotypic data statistics and heritability analysis
Phenotypic data for seed yield and related traits were analyzed using SAS 8.1 software (https://www.sas.com/zhcn/home.html). The effects of genotype, environment, and genotype-environment interaction (G × E) were modeled using PROC GLM, y = g + e + ge, where g is the genotype effect, e is the environment effect, and ge is the effects of the genotype-environment interaction (G × E) (SAS User’s Guide Version 8 1999; Piepho 2018; Cai et al. 2014). Broad-sense heritability (H2) was calculated by: \({{H}^{2}=\delta }_{\mathrm{g}}^{2}/({\delta }_{\mathrm{g}}^{2}+\frac{{\delta }_{\mathrm{g}e}^{2}}{n}+\frac{{\delta }_{e}^{2}}{nr})\), where \({\delta }_{\mathrm{g}}^{2}\) is the genetic variance; \({\delta }_{\mathrm{ge}}^{2}\) is the variance of G × E interaction; \({\delta }_{\mathrm{e}}^{2}\) is the environment variance; n is the number of environments; r is the number of replicates within environment. \({\delta }_{\mathrm{g}}^{2}\), \({\delta }_{\mathrm{ge}}^{2}\) and \({\delta }_{\mathrm{e}}^{2}\) were calculated using analysis of variance (ANOVA) by considering the environment as a random effect. Distributions of the yield and yield-related traits were tested for normal analysis using the Shapiro–Wilk test with a significance level of P < 0.05 (SAS User’s Guide Version 8 1999). Multiple comparisons were conducted based on the LSD method at the 0.05 and 0.01 levels, which were marked by * and **, respectively.
Correlation relationship analysis and path analysis of seed yield and yield-related traits
Correlation analysis was performed using Pearson's coefficients in the ‘ggcorrplot’ package of R software (https://www.r-project.org/). Path coefficient analysis was carried out as described by Dewey and Lu (1959). Path analysis procedures were carried out using the SPSS 20.0 (https://www.ibm.com/cn-zh/analytics/spss-statistics-software).
Single sequence repeat genotyping, population structure and principal component analysis
One hundred and twelve polymorphic single sequence repeat (SSRs) were chosen from the SSR marker dataset based on our previous research results (Yue et al. 2012; Zhang et al. 2012). Genomic DNA was extracted and SSR genotyping was performed as described by Li et al. (2014). Polymorphisms in SSR markers were analyzed using polymerase chain reaction (PCR), as described by Zhang et al. (2012). The polymorphic alleles of the 112 SSR markers among the 369 core accessions and their genotypic data determined by Li et al. (2014), and the population structure results were used in this study (Fig. S6).
The population structure (Q = 2) calculated by Li et al. (2014) was used in the present study. Principal component analysis (PCA) analysis in the 369 germplasm accessions was estimated using ‘Poppr’ in R software of 112 SSR markers. The 369 resources were grouped and labeled according to plant type and population structure.
Association mapping of seed yield and yield-related traits
TASSEL software (version 3.0) with a mixed linear model (MLM) (Q + K) (Laurentin and Karlovsky 2006; Bradbury et al. 2007) was used for association. Population structure (Q) and kinship (K) of the accessions were calculated as previously described (Li et al. 2014). The global significance level was set at (− log10 (P value) ≥ 3.0) (Hou et al. 2018). Manhattan and Q–Q plots were generated using the ‘qqman’ package in R software (Turner 2014). Phenotypic data were analyzed according to the genotypes of the screened candidate SSR markers. We used the reference genome of the sesame cultivar Yuzhi 11 to locate genes close to the associated markers.
Results
Phenotypic variation of seed yield and yield-related traits in sesame
We phenotyped 369 sesame core accessions for PSY and 9 yield-related traits, including PH, FCH, TL, CSL, CNP, TSW, SNC, IFS and FFS, in 2011 and 2012 in five environments to determine the characteristics of PSY and its relationships with seed yield-related traits (Table 1; Fig. S1). Each of the traits varied substantially and exhibited high diversity among the 369 accessions. PSY varied between 1.79 and 8.63 g, with an average of 5.32 g. CNP ranged from 28.8 to 94.02 with an average of 57.15. The ShapiroWilk normality test indicated that PSY, TL, FCH, CSL, CNP, and TSW were normally distributed (P = 0.86), whereas the other four traits did not fit a normal distribution (P < 0.01) (Fig. S1; Table 1).
ANOVA showed that genotype (G) and environment (E) had significant effects on most traits (P < 0.01) (Table 1), suggesting their sensitivity to the environment. PH, FCH, and SNC showed medium-to-high broad-sense heritabilities (H2) of 62%, 72%, and 72%, respectively. PSY, TSW, and TL had lower H2 values, ranging from 14 to 23%, suggesting that they were not stably inherited under the influence of the environment.
Relationships of seed yield and yield-related traits in accessions with different plant types
For the 369 accessions, we correlated PSY with its related traits (Fig. S2). The results showed that PSY was positively and significantly correlated with PH, CSL, CNP, FFS, and TSW (r = 0.173–0.669, P < 0.01), and the strongest correlation was present between PSY and CNP (0.669). In contrast, PSY was significantly negative correlation with IFS (− 0.160) and FCH (− 0.141). No significant correlations were found between PSY and the two remaining traits, TL and SNC.
To check whether plant type (i.e., uniculm or branching type) affects the growth habit and final seed yield in sesame, we further classified the 369 core accessions into two groups according to plant type and carried out a correlation analysis between PSY and its related traits in each group (Fig. 1). For the 210 uniculm accessions, PSY was positively and significantly correlated with CNP, CSL, PH, FFS, and TSW, with correlation coefficients of 0.754, 0.594, 0.272, 0.214, and 0.206, respectively (P < 0.01). For the 159 branching accessions, PSY was shown to significantly correlate with CNP, CSL, TSW, and IFS with correlation coefficients of 0.581, 0.549, 0.335, and − 0.205, respectively (P < 0.01) (Fig. 1). These results confirmed that CNP, CSL, and TSW were key traits affecting seed yield in both uniculm and branching accessions.
Path coefficient analysis showed that CNP, CSL, TSW, SNC, IFS, and FFS were directly related to PSY (Table 2). CNP had a positive and considerably strong direct effect on PSY (0.703). The indirect effects of CNP were positive through CSL and FFS and were very small. SNC was observed to have a positive and very high direct effect on PSY (0.305) but also a negative indirect effect through CNP (− 0.260). CSL had a positive direct effect on PSY and an indirect effect via CNP showed a strong positive effect (0.325). The IFS showed a negative direct effect (− 0.198) on PSY. The indirect effects of the IFS were positive for CNP, SNC, and FFS. If branching type groups are considered separately in the uniculm group, CNP had the highest positive direct effect (0.755) and followed by CSL (0.281), SNC (0.254), TSW (0.202), and FFS (0.109) (Table S2). PH had a direct negative effect on PSY. The indirect effects of PH were positive for CNP, CSL and FFS. In the branching group, five traits (CNP, TSW, SNC, IFS, and PH) showed high direct effects on PSY. CNP also had the greatest effect on PSY (P = 0.726), whereas IFS had the highest negative direct effect (− 0.394) (Table S3). PH has a direct positive effect on PSY. The indirect effect of PH was negative through the SNC and IFS.
The traits with highly significant positive association and direct positive effects on PSY were CNP, CSL, TSW, PH and FFS. Therefore CNP had the highest directly influence, whereas CSL had the highest indirect influence.
Phenotypic variation of seed yield and yield-related traits in population structure
Li et al. (2014) grouped 369 sesame core accessions into two subgroups: one (Q1) consisting of 243 accessions and the other (Q2) consisting of 126 accessions. Differences in the traits between the two subgroups were estimated. Phenotypic values of PH, FCH, TL, CNP, IFS, and FFS in the Q1 subgroup were higher (P < 0.01) and the value of TSW was lower (P < 0.01) than those in the Q2 subgroup. For example, the PH of group Q1 was 144.45 cm taller than that of group Q2 (130.68 cm), and the final flower stage of group Q1 was 77.56 d longer than that of group Q2 (73.68 d). The CSL and SNC phenotypes were not significantly different between the two subgroups (Fig. S3).
PCA was performed to explore the relationships among the 369 accessions based on 112 SSR markers (Fig. 2). The results showed that no significant correlation between geographic origin and the subgroups. We then grouped the 369 accessions obtained by PCA according to different plant types and found that the 369 accessions were grouped into two groups, in which uniculm accessions were grouped together (blue) and branching accessions were grouped together (orange) (Fig. 2, 3). To determine, whether plant type affected on the phenotypic variations in PSY and its related traits, we compared the grouped uniculm and branching accessions (Fig. 4). FCH, TL, and CSL were higher in the uniculm accessions than in the branching accessions (P < 0.01), whereas FFS in branching subgroup accessions was higher than in the uniculm accessions.
Association mapping and marker effect of seed yield and yield-related traits
We performed association mapping for PSY and its related traits using 369 core accessions and 112 SSR markers. The adjusted significance threshold (− log10 (P)) was set at 3.0. As a result, a total of 27 genomic loci were identified to associated with PSY and its related traits (P < 0.001) (Table S4). Among them, two SSR loci (Hs4209 and Hs345) were significantly associated with PSY, but with low phenotypic variance (R2 = 6.54 and 5.97%, respectively). For PH, Hs4089 and Hs1775 were detected for PH and explained of 12.50% and 8.16% of the phenotypic variance, respectively (Table S4). For FCH, three SSR markers were detected, with phenotypic variance explained by 4.61%, 6.94%, and 7.43%, respectively. For TL, six SSR markers were detected, with phenotypic variance ranging from 4.96 to 6.68%. For CSL, only one locus (Hs4089) was detected with a phenotypic variance of 9.44%. With CNP, six associated loci were detected, with phenotypic variance explained between 4.79 and 11.73%, of which Hs345 explained the largest portion of the CNP (R2 = 11.73%) in the 2011Pingyu (Fig. S4). For SNC, six associated loci were detected, and the explained phenotypic variance ranged from 3.90 to 8.66%. Hs635 was detected in two of the five environments (Fig. S5). For TSW, Hs395 and Hs4325 were detected with phenotypic variances of 3.85% and 9.20%, respectively. For IFS, five markers were found and the explained phenotypic variance ranged from 5.43 to 8.81%. For FFS, only one marker (Hs270) was detected with a phenotypic variance of 6.33%. Interestingly, we found that six SSR loci were simultaneously associated with several traits. For example, Hs4082 was associated with CNP, TL, and SNC; Hs233 was associated with PH and CSL; and Hs270 was associated with IFS and FFS.
To reveal whether plant type affected the association mapping results, we conducted association mapping of PSY and its related traits using the uniculm and the branching accessions separately (Table S5 and S6). 20 unique SSR loci were associated with PH (2), FCH (2), CNP (2), TL (1), CSL (2), SNC (4), TSW (1), IFS (2), and FFS (2) (Table S5) in the uniculm accessions, with 6.96–19.08% phenotypic variance. Hs4089 explained the largest portion of the phenotypic variation in PH. Two markers (Hs4089 and Hs464) were associated with PH and CSL, Hs485 was associated with IFS and TL, and Hs618 was associated with IFS in both environments.
When the branching accessions were analyzed by GWAS, 14 unique SSR loci were associated with FCH (4), CNP (2), CSL (1), SNC (3), TL (2), and TSW (2) (Table S6), explaining the 9.22–24.91% phenotypic variance. The Hs345 and Hs1385 loci explained the largest proportion of the variation in CNP (R2 = 24.91%) and FCH (R2 = 20.34%), respectively, whereas the Hs425 locus had the least effect on the variation in FCH (R2 = 9.22%). Hs376 is associated with FCH and SNC.
Two SSR loci were detected in both plant types. Three SSR loci were commonly detected between the uniculm and 369 accessions. In both groups, Hs4089 was associated with PH and CSL, whereas Hs4082 and Hs376 were associated with SNC and FCH, respectively. Four SSR loci were detected between the branching accessions and the 369 accessions. Hs345 was associated with CNP; Hs233 and Hs376 with FCH; and Hs250 with SNC.
To explore the association between the above markers, we analyzed the allelic effects of the six markers associated with yield and yield-related traits in the five environments (Table 3). Hs4082 had the largest effect on the variation of SNC. The Hs4082-1:1 (55.05 ± 9.31), Hs4082-1:2 (52.08 ± 6.52), and Hs4082-2:2 (50.84 ± 7.41) showed the different variation effects on SNC trait in 369 accessions (Fig. 5d). Genotypes carrying the Hs345-1:1 (58.61 ± 11.11), Hs345-2:2 (54.13 ± 11.27), and Hs345-1:2 (61.32 ± 11.33) showed different variation effects on CNP trait in 369 accessions (Fig. 5b). Furthermore, specific alleles had a negative or positive effect on yield and yield-related traits.
Comparative genome analysis
Finally, we aligned 13 non-redundant SSR markers associated with PSY and other traits related to the sesame var. Yuzhi 11 reference genome (version 2.0) (Zhang et al. 2016; Zhao et al. 2018) (Fig. 3). The 13 markers were located on 8 sesame chromosomes (Table 4); additionally, four of those markers were located in gene regions and nine were in intergenic regions. The Hs1703 locus is close to SiACS8, which is associated with CNP (Wei et al. 2015; Zhou et al. 2018).
Discussion
Key component traits to seed yield
Seed yield is the result of interactions between numerous traits and genotypes through environmental interaction (Pathirana 1995; Haruna et al. 2012; Agrawal et al. 2017). In the present study, PH, FCH, CSL, CNP, TSW, IFS, and FFS were likely the key traits for seed yield in the sesame population. The results of the correlation analysis in this study suggest that any increase in PH, CNP, CSL, TSW, and FFS or a decrease in FCH and IFS will lead to an improvement in PSY. In previous studies, PH, CSL, CNP, and TSW were regarded as the key component of seed yield (Liu et al. 1980; Sumathi et al. 2007; Gnanasekaran et al. 2008; Banerjee and Kole 2009; Yol et al. 2010; Sumathi and Muralidharan 2010; Gangadhara et al. 2012; Ibrahim and Khidir 2012). IFS showed a negative and significant correlation with PSY, which is comparable to the study by Akbar et al. (2011). Ibrahim and Khidir (2012) found a significant positive correlation between IFS and PSY in 220 F5 families derived from ten sesame. This may be due to differences in germplasm resources and populations. FFS showed a positive and significant correlation with PSY, suggesting that improvements in the plant growth cycle will bring about an increase in PSY.
Path coefficient analysis provides a more realistic picture of the relationship, as it considers the direct and indirect effects of the variables by partitioning the correlation coefficients. The results showed that CNP, CSL, TSW, SNC, IFS, and FFS were directly associated with PSY (Table 2). CNP had the highest positive direct effect on PSY, followed by SNC, whereas CSL and FFS had a small direct effect but a higher indirect effect through CNP on PSY. In previous research, CNP, TSW, and SNC have been shown to have strong and direct effects on PSY (Shim et al. 2001; Azeez and Morakinyo 2011; Navaneetha et al. 2019). Therefore, CNP, CSL, TSW, and FFS were the main traits affecting sesame yield, and these traits can be considered as selection criteria for improving sesame yield.
Effects of plant type on seed yield and component traits
Plant type is one of the most important traits in sesame, because it determines plant architecture and affects seed yield and cultivation practices (Kobayashi 1986; Van Zanten 2001; Mei et al. 2017). Brar and Ahuja (1979) reported that the branching habit (plant type) is monogenically controlled and that the monostem (unbranched) is recessive. In China, most cultivars are of the uniculm type, which allows for higher plant density under high input conditions and results in a high seed yield (Van Zanten 2001). In Africa and Latin America, most cultivars are branched, have soft stems and reduced plant height, and are thus easier to cut (Beech and Imrie 2001). Moreover, branching cultivars are highly adapted to input conditions and space. The present study is the first to investigate the interactions of plant type with seed yield and related traits using the uniculm and branching accession groups separately (Fig. 1; Fig. S3). CNP, CSL, and TSW were shown to be the three common key component traits of seed yield for both uniculm and branching accessions, and therefore, are the most important component traits for sesame seed yield. PH and FFS were also the key traits for uniculm accessions, whereas IFS was also the key component trait for branching accessions. Path analysis of the uniculm group showed similar results (Table 2; Table S2 and S3). Associations between the ten traits and plant type may be caused not only by genetic differentiation, but also by the adaptation of plant type to distinct environments. Thus, plant type should be considered to improve seed yield and related traits during sesame breeding (Daniya et al. 2013).
For ideal sesame cultivars with high yield and adaption to mechanization, some sesame breeders and scientists suggest that the plant height of a sesame cultivar should be 120–130 cm, with 82 cm of reproductive stem and 28 node pairs of capsules (Zhang et al. 2021). The above results suggest that it would be efficient to breed ideal sesame cultivars with uniculm plant types and high seed yields by increasing CSL, CNP, TSW, and/or FFS. To breed high-yield cultivars with branching plant types, it may be efficient to increase CSL, CNP, and/ or TSW, as well as decrease IFS. Interestingly, Miao et al. (2020) reported a high-seed-yielding dwarf sesame cultivar, Yuzhi Dw607, with shorter internode length and shorter PH. Compared to the wild type, the plant height of this cultivar was decreased to 1.1–1.6 m, while its CNP and seed yield was highly increased under sufficient nutrition and watering (Zhang et al. 2019; Miao et al. 2020). These results provide advanced ideas for sesame breeding with desirable plant types and high seed-yield potential.
Key SSR markers for seed yield and yield-related component traits
The genetic bases of yield-related traits have been thoroughly dissected in many crops by QTL mapping (Xing and Zhang 2010; Li et al. 2011). Several QTLs with major effects have been identified using the map-based cloning of sesame (Miao et al. 2020; Wei et al. 2019). In this study, the genetic bases of ten yield-related traits were analyzed using association mapping based on phenotypic data collected over three years. Thirteen markers were repeatedly detected (P < 0.001; Table 3; Table S5 and S6), suggesting that the QTLs associated with these markers were insensitive to the growing environment (Zoric´et al. 2012). Nine markers were identified to colocalize with the QTLs identified in previous research (Table 4; Li et al. 2014; Wei et al. 2015; Zhou et al. 2018). However, owing to a lack of common markers between the genetic maps, the other marker-trait associations could not align with previous QTLs. Loci that were detected in more than two environments or were consistent with QTLs identified in previous research would be very useful for marker-assisted selection of yield-related traits. Fine mapping of such chromosomal regions would help identify candidate genes responsible for the natural variation of these yield-related traits.
One SSR marker, Hs635, linked to SNC, was located on Chromosome SiChr.3. The SNC with 1:1 genotype (50.63 ± 6.77) was much lower than the 2:2 genotype (56.27 ± 10) (Fig. 5c). Hs233, which is linked to four yield-related traits including FCH and IFS, is located on Chromosome SiChr.10. The accessions with Hs233-3:3 had low FCH (49.54 ± 11.31 cm). Hs1703 was associated with CNP, and the alleles of 1:1 (61.15 ± 10.63), 1:2 (57.92 ± 11.97), and 2:2 (56.53 ± 11.63) indicated the phenotype variation of CNP (Fig. 5a). The Hs1703 locus on chromosome SiChr.11 is close to the gene SiACS8, which is involved in CNP (Wei et al. 2015; Zhou et al. 2018). The results provide an effective way to enhance trait performance of sesame ccultivars for yield-related traits, which depict us the perspective of application in molecular breeding.
Conclusion
We systematically explored the association between yield and nine yield-related traits in 369 sesame accessions worldwide using 112 SSR markers. A significant positive correlation between CNP, CSL, TSW, and PSY was observed. Among the different plant types, CNP, CSL, and TSW were the key components. Path analysis showed a similar direct effect. Thirteen SSR markers were significantly associated with nine seed yield-related traits using MLM method. The three SSR markers were repeatedly detected in both environments. These results would provide an efficient platform for insight into the traits that influence PSY and MAS breeding in sesame. In the future, we plan to develop or screen additional SSR markers or genes to support the breeding of high-yield molecules in sesame cultivars.
Abbreviations
- AOV:
-
Analysis of variance
- BNP:
-
Branch number per plant
- CNP:
-
Capsule number per plant
- CSL:
-
Capsule stem length
- CV:
-
Coefficient value
- FCH:
-
Height to the first capsule
- FFS:
-
Final flowering stage
- G:
-
Genotype
- GLM:
-
General liner model
- GWAS:
-
Genome-wide association study
- IFS:
-
Initial flowering stage
- InDel:
-
Insertion and deletion
- LD:
-
Linkage disequilibrium
- MIM:
-
Multiple interval mapping
- PCA:
-
Principal component analysis
- PCR:
-
Polymerase chain reaction
- PH:
-
Plant height
- PSY:
-
Plant seed yield
- QTL:
-
Quantitative trait loci
- RAD-seq:
-
Restriction-site associated DNA sequencing
- RIL:
-
Recombinant inbred line population
- SNC:
-
Seed number per capsule
- SNP:
-
Seed number per plant
- SNPs:
-
Single nucleotide polymorphisms
- SSR:
-
Simple sequence repeat
- SWP:
-
Seed weight per plant
- TL:
-
Tip length
- TSW:
-
Thousand seed weight
References
Agrawal MM, Singh S, Wawge MN, Macwana S, Sasidharan N (2017) Correlation and path analysis for seed yield and yield attributing traits in sesame germplasm (Sesamum indicum L.). Int J Chem Stud 5:1099–1102
Akbar F, Rabbani MA, Shinwari ZK, Khan SJ (2011) Genetic divergence in sesame (Sesamum indicum L.) landraces based on qualitative and quantitative traits. Pak J Bot 43:2737–2744
Anilakumar KR, Pal A, Khanum F, Bawa AS (2010) Nutritional, medicinal and industrial uses of sesame (Sesamum indicum L.) seeds an overview. Agric Conspec Sci 75:159–168
Ashri A (1998) Sesame breeding. In: Jules J (ed) Plant breeding reviews 16. Wiley, New York, pp 179–228. https://doi.org/10.1002/9780470650110.ch5
Azeez MA, Morakinyo JA (2011) Path analysis of the relationships between single plant seed yield and some morphological traits in sesame (Genera sesamum and Ceratotheca). Int J Plant Breed Genet 5:358–368. https://doi.org/10.3923/ijpbg.2011.358.368
Banerjee PP, Kole PC (2009) Combining ability analysis for seed yield and some of its component characters in sesame (Sesamum indicum L.). Int J Plant Breed Genet 3:11–21. https://doi.org/10.3923/ijpbg.2009.11.21
Bedigian D, Harlan JR (1986) Evidence for cultivation of sesame in the ancient world. Econ Bot 40:137–154
Bedigian D, Seigler DS, Harlan JR (1985) Sesamin, sesamolin and the origin of sesame. Biochem Syst Ecol 13:133–139. https://doi.org/10.1016/0305-1978(85)90071-7
Beech DF, Imrie BC (2001) Breeding for mechanised sesame production in Australia. In: Van ZL (ed) Sesame improvements by induced mutations. Joint FAO/IAEA division of nuclear techniques in food and agriculture, pp 63–70
Biabani AR, Pakniyat H (2008) Evaluation of seed yield-related characters in sesame (Sesamum indicum L.) using factor and path analysis. Pak J Biol Sci 11:1157–1160. https://doi.org/10.3923/pjbs.2008.1157.1160
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES (2007) Tassel: software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635. https://doi.org/10.1093/bioinformatics/btm308
Brar GS, Ahuja KL (1979) Sesame: its culture, genetics, breeding and biochemistry. In: Malik CP (ed) Annual review of plant science. Kalyani publishers, New Delhi, pp 245–313
Cai D, Xiao Y, Yang W, Ye W, Wang B, Youna M, Wu J, Liu K (2014) Association mapping of six yield-related traits in rapeseed (Brassica napus L.). Theor Appl Genet 127:85–96. https://doi.org/10.1007/s00122-013-2203-9
Daniya E, Dadari SA, Ndahi WB, Kuchinda NC, Babaji BA (2013) Correlation and path analysis between seed yield and some weed and quantitative components in two sesame (Sesamum indicum L.) varieties as influenced by seed rate and nitrogen fertilizer. J Biol Agri Healthc 3:12–16
Dewey DR, Lu KH (1959) A correlation and path analysis of components of crested wheat grass seed production. Agron J 51:515–518. https://doi.org/10.2134/agronj1959.00021962005100090002x
Edmondson R (2005) Past developments and future opportunities in the design and analysis of crop experiments. J Agr Sci Cambrid 143:27–33. https://doi.org/10.1017/S0021859604004472
Gangadhara K, Prakash JC, Badiger B, Shadakshari TV, Yathish KR, Rajesh AM (2012) Genetic divergence, genetic advance and heritability in sesame (Sesame indicum L.). A Q J Life Sci 9:457–462
Gnanasekaran M, Jebaraj S, Muthuramu S (2008) Correlation and path co-efficient analysis in sesame (Sesamum indicum L.). Plant Arch 8:167–169
Goudappagoudra R, Lokesha R, Ranganatha ARG (2011) Trait association and path coefficient analysis for yield and yield attributing traits in sesame (Sesamum indicum L.). Eletron J Plant Breed 2:448–452
Haruna IM, Aliyu L, Olufajo OO, Odion EC (2012) Contributions of some growth characters to seed yield of sesame (Sesamum indicum L.). ISABB J Food Agric Sci 2:9–14
Hou S, Zhu G, Yuan L, Li W, Fu J, Niu E, Li L, Zhang D, Guo W (2018) Genome-wide association studies reveal genetic variation and candidate genes of drought stress related traits in cotton (Gossypium hirsutum L.). Front Plant Sci 9:1276. https://doi.org/10.3389/fpls.2018.01276
Ibrahim SE, Khidir MO (2012) Genotypic correlation and path coefficient analysis of yield and some yield components in sesame (Sesamum indicum L.). Int J Agrisci 2:664–670
Jatothu JL, Dangi KS, Kumar SS (2013) Evaluation of sesame crosses for heterosis of yield and yield attributing traits. J Trop Agr 51:84–91
Kanu PJ, Bahsoon JZ, Kanu JB, Kandeh JB (2010) Nutraceutical importance of sesame seed and oil: a review of the contribution of their lignans. Sierra Leone J Biomed Res 2:4–16. https://doi.org/10.4314/sljbr.v2i1.56583
Kobayashi T (1986) Early maturing, short internode varieties of sesame. In: Martinez JF (ed). Sesame and safflower Newsletter 2. Institute of sustainable agriculture, Cordoba, Spain, pp 33–35
Langham DR (2017a) VII. Capsule descriptors of sesame (Sesamum indicum L.). ResearchGate, p 263
Langham DR (2017b) VIII Capsule zone descriptors of sesame (Sesamum indicum L.). ResearchGate, p 276. https://doi.org/10.13140/RG.2.2.14789.55520
Langham DR (2018) IV Sesame root and stem descriptors (Sesamum indicum L.). ResearchGate, p 51. https://doi.org/10.13140/RG.2.2.14891.95522
Laurentin HE, Karlovsky P (2006) Genetic relationship and diversity in a sesame (Sesamum indicum L.) germplasm collection using amplified fragment length polymorphism (AFLP). BMC Genet 7:10. https://doi.org/10.1186/1471-2156-7-10
Li Y, Fan C, Xing Y, Jiang Y, Luo L, Sun L, Shao D, Xu C, Li X, Xiao J, He Y, Zhang Q (2011) Natural variation in GS5 plays an important role in regulating grain size and yield in rice. Nat Genet 43:1266–1269. https://doi.org/10.1038/ng.977
Li C, Miao H, Wei L, Zhang T, Han X, Zhang H (2014) Association mapping of seed oil and protein content in Sesamum indicum L. using SSR markers. PLoS ONE 9:e105757. https://doi.org/10.1371/journal.pone.0105757
Liu J, Ding F, Tu L (1980) Association analysis of seed yield component characters in sesame (Sesamum indicum L.). Chin J Oil Crop Sci 2:55–60
Mei H, Liu Y, Du Z, Wu K, Cui C, Jiang X, Zhang H, Zheng Y (2017) High-density genetic map construction and gene mapping of basal branching habit and flowers per leaf axil in sesame. Front Plant Sci 8:636. https://doi.org/10.1371/journal.pone.0105757
Mei H, Liu Y, Cui C, Hu C, Xie F, Zheng L, Du Z, Wu K, Jiang X, Zheng Y, Ma Q (2021) QTL mapping of yield-related traits in sesame. Mol Breed 41:43. https://doi.org/10.1007/s11032-021-01236-x
Miao H, Li C, Duan Y, Wei L, Ju M, Zhang H (2020) Identification of a Sidwf1 gene controlling short internode length trait in the sesame dwarf mutant dw607. Theor Appl Genet 133:73–86. https://doi.org/10.1007/s00122-019-03441-x
Muhamman MA, Mohammed SG, Lado A, Betel MD (2010) Interrelationship and path coefficient analysis of some growth and yield characteristics in sesame (Sesamum indicum L.). J Agr Sci Camb 2:99–105. https://doi.org/10.5539/jas.v2n4p100
Navaneetha JS, Murugan E, Parameswari C (2019) Correlation and path analysis for seed yield and its components in sesame (Sesamum indicum L.). Electronic J Plant Breeding 10:1262–1268
Pathirana R (1995) Comparison of selection procedures in breeding for seed yield in segregating sesame populations. Euphytica 82:73–78. https://doi.org/10.1007/BF00028711
Peng J, Richards DE, Hartley NM, Murphy NM, Devos KM, Flintham JE, Beales J, Fish LJ, Worland AJ, Pelica F, Sudhakar D, Christou P, Snape JW, Gale MD, Harberd NP (1999) ‘Green revolution’ genes encode mutant gibberellin response modulators. Nature 400:256–261
Piepho HP (2018) Allowing for the structure of a designed experiment when estimating and testing trait correlations. J Agr Sci Camb 156:59–70. https://doi.org/10.1017/S0021859618000059
Sarwar G, Haq MA, Saleem MM, Fernández-Martínez J, Velasco L, Griffee P (2005) Genetic parameters and correlation study in diverse types of sesame germplasm. In: Martínez JF (ed) Sesame and safflower newsletter 20. Institude of Sustainable Agriculture, Cordoba, Spain, pp 29–33
Sas. (1999) SAS/STAT® User’s Guide. Version 8, SAS Institute Inc., Cary, 3809
Shim KB, Kang CW, Lee SW, Kim DH, Lee BH, Fernández MJ (2001) Heritabilities, genetic correlations and path coefficients of some sgronomic traits in different cultural environments in sesame. In: Martínez JF (ed) Sesame and safflower newsletter 16. Institute of Sustainable Agriculture, Cordoba, Spain, pp 16–22
Solanki ZS, Gupta D (2000) Heterosis in sesame. Indian J Genet 60:403–405
Sumathi P, Muralidharan V (2010) Inheritance of branching and important biometrical traits in sesame (Sesamum indicum L.). Indian J Genet Pl Br 70:97–101
Sumathi P, Muralidharan V, Manivannan N (2007) Trait association and path coefficient analysis for yield and yield attributing traits in sesame (Sesamum indicum L.). Madras Agric J 94:174–178
Tabatabaei I, Pazouki L, Bihamata MR, Mansoori S, Javaran MJ, Niinemets U (2011) Genetic variation among Iranian sesame (Sesamum indicum L.) accessions vis-a-vis exotic genotypes on the basis of morpho-physiological traits and RAPD markers. Aust J Crop Sci 5:396–1407
Turner SD (2014) qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. Biorxiv 005165. https://doi.org/10.1101/005165
Verdooren R (2020) History of the statistical design of agricultural experiments. J Agr Biol Envir St 25:457–486. https://doi.org/10.1007/s13253-020-00394-3
Wei L, Zhang H, Zheng Y, Miao H, Zhang T, Guo W (2009) A genetic linkage map construction for sesame (Sesamum indicum L.). Genes Genom 31:199–208
Wei X, Liu K, Zhang Y et al (2015) Genetic discovery for oil production and quality in sesame. Nat Commun 6:8609. https://doi.org/10.1038/ncomms9609
Wei L, Li C, Duan Y, Qu W, Wang H, Miao H, Zhang H (2019) A SNP mutation of SiCRC regulates seed number per capsule and capsule length of cs1 mutant in sesame. Int J Mol Sci 20:4056. https://doi.org/10.3390/ijms20164056
Wu K, Liu H, Yang M, Tao Y, Ma H, Wu W, Zou Y, Zhao Y (2014) High-density genetic map construction and QTLs analysis of grain yield-related traits in sesame (Sesamum indicum L.) based on RAD-seq technology. BMC Plant Biol 14:274. https://doi.org/10.1186/s12870-014-0274-7
Xing Y, Zhang Q (2010) Genetic and molecular basis of rice yield. Annu Rev Plant Biol 61:421–442
Yol E, Karaman E, Furat S, Uzun B (2010) Assessment of selection criteria in sesame by using correlation coefficients, path and factor analyses. Aust J Crop Sci 4:598–602
Yue W, Wei L, Zhang T, Li C, Miao H, Zhang H (2012) Genetic diversity and population structure of germplasm resources in sesame (Sesamum indicum L.) by SSR markers. Acta Agron Sin 38:2286–2296. https://doi.org/10.3724/SP.J.1006.2012.02286
Van Zanten L (2001) Sesame improvement by induced mutations: results of the co-ordinated research project and recommendation for future studies. In: Sesame improvement by induced mutations. Final reports of an FAO/IAEA co-ordinated research project, Vienna
Zhang H, Wei L, Miao H, Zhang T, Wang C (2012) Development and validation of genic-SSR markers in sesame by RNA-seq. BMC Genom 13:316–326. https://doi.org/10.1186/1471-2164-13-316
Zhang H, Miao H, Wei L, Li C, Zhao R, Wang C (2013) Genetic analysis and QTL mapping of seed coat color in sesame (Sesamum indicum L.). PLoS ONE 8:e63898. https://doi.org/10.1371/journal.pone.0063898
Zhang H, Miao H, Li C, Wei L, Duan Y, Ma Q, Kong J, Xu F, Chang S (2016) Ultradense SNP genetic map construction and identification of SiDt gene controlling the determinate growth habit in Sesamum indicum L. Sci Rep-UK 6:31556. https://doi.org/10.1038/srep31556
Zhang H, Miao H, Ju M (2019) Potential for adaption to climate change through genomic breeding in sesame. In: Kole C (ed) Genomic designing of climate-smart oilseed crops. Springer, Switzerland, pp 371–440
Zhang H, Langham DR, Zhao Y, Khalafalla A, Miao H (2021) Traditional breeding in sesame. In: Miao H, Zhang H, Kole C (eds) The sesame genome. Springer, Switzerland, pp 145–158
Zhang X, Feng X (2006) Descriptors and data standard for sesame (Sesamum indicum L.). China Agriculture Press
Zhao R, Miao H, Song W, Chen C, Zhang H (2018) Identification of sesame (Sesamum indicum L.) chromosomes using the BAC-FISH system. Plant Biol 20:85–92. https://doi.org/10.1111/plb.12647
Zhou R, Dossa K, Li D, Yu J, You J, Wei X, Zhang X (2018) Genome-wide association studies of 39 seed yield-related traits in sesame (Sesamum indicum L.). Int J Mol Sci 19:2794. https://doi.org/10.3390/ijms19092794
Zorić M, Dodig D, Kobiljski B, Quarrie S, Barnes J (2012) Population structure in a wheat core collection and genomic loci associated with yield under contrasting environments. Genetica 140:259–275. https://doi.org/10.1007/s10709-012-9677-2
Acknowledgements
We gratefully acknowledge Prof. Hongbin Zhang from Texas A&M University, USA for his revision and suggestion on the manuscript.
Funding
This work was financially supported by the earmarked fund for China Agriculture Research System of MOF and MARA (CARS-14), the Key Project of Science and Technology of Henan province (201300110600), Zhongyuan Science and Technology Innovation Leading Talent Plan (214200510020), Henan Province Specific Professor Position Program (SPPP2020), Zhongyuan Scientist Workshop Construction (ZSWC2019, 214400510026), Innovation Scientists and Technicians Troop Construction Project of Henan Province (ISTTCPHP2016), and Key Research and Development Project of Henan Province (221111520400).
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by CW, CL, YD, MJ, JN, LW, HW, and HC. The first draft of the manuscript was written by CW, HM, HZ, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare there are no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, C., Niu, J., Miao, H. et al. Genetic variation, correlation, and association mapping of seed yield and its component traits in sesame. Genet Resour Crop Evol 71, 603–619 (2024). https://doi.org/10.1007/s10722-023-01644-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10722-023-01644-2