Introduction

Wheat (Triticum aestivum L.) is an essential cereal crop globally [1]. While there have been significant increases in wheat yield and total production in recent decades, the current level of wheat grain production is insufficient to meet the future demands of a growing global population [2]. Moreover, this challenge is exacerbated by the gradual reduction of arable land [3, 4]. Thus, improving wheat grain yield is vital for global food and nutrition security [5]. Consequently, increasing grain yield has become the primary goal in wheat breeding.

The yield of wheat is primarily determined by three components: grain number per spike (GNS), grain weight, and spike number [6, 7]. Previous studies have shown that GNS significantly impacts wheat grain yield more than grain weight [8,9,10]. In wheat breeding programs, high yield is primarily achieved by increasing GNS rather than grain weight [11,12,13,14,15]. Exploring the genetic variation of GNS holds excellent potential for future yield improvement [16]. GNS is a complex quantitative trait controlled by multiple genes [17,18,19]. Quantitative trait locus (QTL) mapping effectively analyzes such traits [17, 20, 21]. With the development of molecular markers, numerous QTL analyses of GNS have been conducted, leading to the identification of several QTLs [22,23,24]. These reports have identified QTLs distributed across the entire wheat genome.

Saturated genetic linkage maps play a crucial role in QTL mapping by providing measurements of marker effects and offering useful DNA markers for marker-assisted selection (MAS) in breeding practices [25, 26]. However, due to limited molecular markers, the use of unsaturated genetic linkage maps in the past has led to the presence of linked markers that are further from the target QTL genetic distance [27,28,29]. This condition has restricted the understanding of the genetic architecture of GNS and the application of markers in wheat breeding [30]. With the development of next-generation sequencing (NGS) technologies, reference genomes for Chinese Spring wheat and other varieties have been published, providing a way to overcome these limitations [31]. NGS-based SNP genotyping platforms have recently been developed, leading to significant progress in QTL analysis for wheat GNS [32, 33]. High-density SNP genotyping arrays, such as the 9 K, 55 K, 660 K, and 820 K SNP arrays, have been developed for wheat and widely used for QTL analysis [34, 35]. For instance, the 90 K wheat SNP chip was employed to study the genetic control of yield-related traits in 66 elite wheat varieties derived from Xiaoyan 6, resulting in the identification of 803 significant marker-trait associations that explained up to 35.0% of the phenotypic variation.

To address the limited genetic diversity in common wheat, it is crucial to identify new genetic loci controlling yield-related traits to broaden genetic variation and accelerate wheat breeding improvements [36, 37]. Wild relatives, such as the Agropyron genus, are necessary gene resources for improving common wheat [38, 39]. Intergeneric hybrids between common wheat and Agropyron cristatum (A. cristatum) have been generated successfully to transfer beneficial allele genes from the Agropyron genus to wheat [40, 41]. Several novel wheat-A cristatum resources with desirable agronomic traits have been produced [42, 43]. Furthermore, studies have been conducted on the genetic mechanisms of A. cristatum in the background of common wheat, focusing on chromosomal segments or genes [44, 45]. For example, a previous study has studied Pubing2978, a wheat-A. Cristate translocation line with high GNS using techniques like genomic in situ hybridization (GISH), fluorescence in situ hybridization (FISH), and molecular markers. The A. cristatum 6P chromosomal segment is critical in increasing GNS [45]. Another study has identified an enhancer grain weight locus on the 7P chromosome of A. cristatum and mapped it to 7PS1-2 using 158 STS markers. Further studies on the mechanism of this locus on wheat grain weight revealed that two translocation lines with 7P chromosomal segments (7PT-A18 and 7PT-B4) could simultaneously increase grain weight, length, and width [41].

Previous studies have mainly focused on common wheat when investigating the genetic factors influencing GNS and have identified several QTLs. However, there is still limited knowledge about the molecular mechanisms controlling GNS in wheat germplasm derived from crossing common wheat with A. cristatum, such as the newly bred wheat germplasm, Pubing3228 [46]. To address this gap, we developed a population of RILs derived from crossing Pubing3228 with Jing4839, which exhibits significant variations in GNS. Based on the recently developed wheat 55 K gene genotyping array, this study aimed to identify novel genetic regions and favorable alleles associated with GNS.

Therefore, the integration of advanced genomic technologies and innovative breeding approaches is essential for fulfilling the increasing demand for wheat. By integrating environmental and climatic considerations into breeding efforts, we can deepen our understanding of their influence on GNS and other critical yield determinants. Analyzing the relationships among GNS, grain weight, and spike number across varied environmental settings offers key insights for harmonizing yield component enhancement. Exploring the genetic diversity of common wheat and its wild relatives, especially untapped genetic resources, opens novel pathways for breeding initiatives. The advent of high-throughput phenotyping and sophisticated bioinformatics tools accelerates identifying and applying novel QTLs and genetic markers. Additionally, elucidating the role of A. cristatum-related genes in the wheat genome may pave new avenues for enhancing the resilience of wheat to stress and adaptability to various agricultural conditions.

Materials and methods

QTL analysis method for novel wheat germplasm Pubing3228 RIL population

For the QTL analysis, this study utilized a RIL population consisting of 210 families derived from the cross between Pubing3228 and Jing4839. Pubing3228, a genetically stable derivative breed, was selected from the offspring of the wheat-A. cristatum chromosome addition line 4844-12 (2n = 44). This hybrid germplasm, developed by Professor Li Lihui at the Chinese Academy of Agricultural Sciences over several decades, possesses elite wheat germplasm traits, including long spikes, a large number of spikelets per spike, and a high GNS [46, 47]. Conversely, Jing4839 is characterized by a higher grain weight but a lower GNS. Significant differences in several yield traits, especially in GNS, were observed between the two parents. The F9 RIL population was generated through successive selfing of the F1 generation derived from crossing Pubing3228 and Jing4839, employing the single seed descent method.

Collection of phenotypic data for interregional field trials and novel wheat germplasm Pubing3228 progeny

Field experiments were carried out at three locations, namely Pingdingshan, Yangling, and Xianyang, using 210 RILs derived from the cross between Pubing3228 and Jing4839, along with their respective parents. The experiments were conducted over three crop years (2018, 2019, 2020) in Pingdingshan, two crop years (2019 and 2020) in Yangling, and one crop year (2020) in Xianyang. The field layout was organized in a randomized complete block design, with each design replicated three times. Each replication consisted of three rows of crops, each row being 2 m in length and spaced 30 cm apart. Field management was conducted in accordance with local standards. Ten plants from each replication were selected for harvesting and manual threshing upon reaching physiological maturity. The GNS was counted directly, and the data focused on recording the average values related to the central spike of every ten plants.

Field management practices were governed by local standards. When physiological maturity was reached, ten individuals were randomly selected from each replication for harvesting and subsequent manual threshing. The spikes obtained from this process were directly counted to ascertain the GNS, with the data pertaining to the central spike of each plant being systematically recorded.

DNA extraction and SNP genotyping of hybrid offspring in wheat

Genomic DNA was extracted from the RIL population and its progenitors employing the sodium dodecyl sulfate (SDS) method [48]. Agarose gel electrophoresis was conducted to verify DNA quality using a 0.8% gel. The concentration of DNA was quantified utilizing a microplate reader to ensure compliance with the criteria for further analyses. Subsequently, the DNA samples underwent genotyping at Beijing Capital Bio Company, utilizing the Illumina Infinium iSelect 55 K SNP array. During data processing, specific markers were selectively excluded to enhance the genotyping data’s accuracy and dependability. Notably, markers with a minimum allele frequency (MAF) under 5%, those exhibiting more than 10% missing data, and markers with heterozygosity rates above 20% were removed. Following this filtration process, 3334 high-quality single nucleotide polymorphism (SNP) markers were retained for further QTL analysis. These markers were employed to explore the presence and effect of genetic variants on the targeted traits.

Construction and localization of QTL linkage map in hybrid offspring of wheat

In this investigation, outliers within the phenotypic data were initially removed before calculating the best linear unbiased estimates (BLUEs) based on the average values for each environment, which then facilitated subsequent QTL mapping analysis. The QTL mapping analysis employed the QTL IciMapping software V4.2 (http://www.isbreeding.net) to construct a genetic linkage map using polymorphic SNP markers between Pubing3228 and Jing4839. The initial step involved the application of the BIN function to the SNP markers, facilitating the identification of segregating distortions, missing data, and superfluous markers. Following this preliminary screening, the residual SNP markers were employed to establish the framework of the genetic linkage map via the MAP function. The Kosambi function and maximum likelihood estimation were applied to ascertain the order and distance of the markers.

Subsequently, the generated genetic linkage map, in conjunction with the IciMapping software V4.2, was applied to execute QTL mapping. The composite interval mapping (ICIM) model was selected to identify QTLs associated with Grain Number per Spike (GNS). A QTL was deemed significant if it exhibited a Log of Odds (LOD) score exceeding 2.5. The QTL analysis was performed individually for each environment, and QTLs identified in two or more environments were regarded as stable. The nomenclature for the wheat QTL adhered to the following pattern: QGns.wa-1D, where “Q” signifies QTL, succeeded by the trait indicator; “wa” denotes the laboratory, and “1D” indicates the chromosome.

Identification of candidate genes associated with GNS in wheat

The selection of candidate genes for GNS was not limited to sequences associated with grain traits. Instead, genes located within the physical intervals of QTL and expressed explicitly in grain tissues were identified as potential candidates. Information regarding these candidate genes was sourced from the JBrowse website (https://urgi.versailles.inra.fr/jbrowseiwgsc). Expression profiles of the candidate genes across various tissues were obtained from the expVIP website (http://wheat-expression.com). By comparing the expression patterns of these preliminary candidate genes, those associated with GNS were determined. This approach facilitated precise genomic localization of genes potentially influencing GNS and further validated their functional relevance through expression profile analysis.

Results

Environmental response of wheat varieties Pubing3228 and Jing4839 to GNS and their contributions to the analysis of QTL

This study focused on comparing the performance of GNS in two wheat varieties, Pubing3228 and Jing4839, under identical environmental conditions. The results revealed significant differences in GNS between Pubing3228 and Jing4839, with Pubing3228 consistently exhibiting higher GNS than Jing4839 across all tested environments (Table 1; Fig. 1A). To further investigate the genetic control of GNS, a RIL population derived from the hybridization of these two parental varieties was analyzed for GNS variability across three environments from 2019 to 2021 (Table 1, Supplementary Material 1). The continuous variation observed in GNS in the RIL population suggests that it is a quantitatively inherited trait with typical characteristics, making it suitable for QTL analysis (Fig. 1B-G).

Table 1 Comparative analysis of GNS in Pubing3228/Jing4839 RIL population across different environments
Fig. 1
figure 1

Distribution of GNS in the Pubing3228 and Jing4839 RIL population across different years and locations. Note (A) Comparative bar chart of GNS from 2019 to 2021, where E1, E2, and E3 represent the distinct environments of Pingdingshan, Yangling, and Xianyang, respectively. Blue bars indicate the Pubing3228 variety, while red bars represent the Jing4839 variety. (B-G) Histograms and fitted curves showing the distribution of GNS in the RIL population under the environments of Pingdingshan in 2019 (E1, 2019), Pingdingshan in 2020 (E1, 2020), Yangling in 2020 (E2, 2020), Xianyang in 2020 (E3, 2020), Pingdingshan in 2021 (E1, 2021), and Yangling in 2021 (E2, 2021)

Notably, the highest GNS, approximately 140 grains, was observed in the Pingdingshan (E1) environment in 2021, while the lowest GNS, about 19.67 grains, was recorded in the Yangling (E2) environment, indicating significant environmental effects on GNS (Table 1). Moreover, the average GNS of the RIL population ranged from 47.81 to 59.69 grains across different environments, implying an interaction between genetic factors and environmental conditions in influencing GNS (Table 1). The near-normal distribution of GNS data, with skewness and kurtosis absolute values mostly less than 1, further supported the suitability of this trait for quantitative genetic analysis (Table 1). Significant differences in GNS were observed between Pubing3228 and Jing4839, as well as variations in the RIL population under different environments.

Analysis of the genetic structure of hexaploid wheat through high-density linkage mapping

In this study, we constructed a high-density genetic linkage map using the genetic diversity analysis of two wheat varieties, Pubing3228 and Jing4839. Our approach involved using a 55 K iSelect SNP array, through which we selected 3,334 polymorphic markers out of a total of 52,478 SNP markers. These markers, which exhibited differences between the two varieties, were used to build the map (Table 2). To ensure the map’s accuracy and reliability, we excluded markers missing more than 10% of individuals or exhibiting co-segregation at the exact location.

Table 2 High-density genetic linkage maps of Pubing3228/Jing4839 RIL population

The constructed linkage map accurately represented each chromosome of hexaploid wheat, resulting in 21 linkage groups. Notably, 16 chromosomes were selected for this study (Supplementary Material 2). The total map length was 9,494.36 cM, with an average interval of 2.90 cM between adjacent markers. Notably, the number of markers and marker intervals varied across specific chromosomes. For example, there were 76 markers on chromosome 4D, while chromosome 7D had 262 markers. The most minor average interval, 0.89 cM, was observed on chromosome 1 A, whereas the most significant average interval, 9.03 cM, was found on chromosome 4D.

Regarding the genome breakdown, the linkage map for the A genome comprised 1,166 markers, accounting for 34.97% of the total markers. It had a total length of 2,077.44 cm, with an average interval of 1.76 cm. The B genome consisted of 1,017 markers, representing 30.50% of the total markers, and had a total length of 1,871.35 cm, with an average interval of 1.86 cm. On the other hand, the D genome contained 1,151 markers, accounting for 34.53% of the total markers. It had a total length of 5,545.57 cm, with an average interval of 5.08 cm. Interestingly, while the number of markers in the A genome was similar to that in the B and D genomes, the marker coverage in the B genome was lower, suggesting potential differences in genetic diversity.

Analysis of QTLs for spikelet number revised to grain number per spike in wheat across multiple environments

GNS is a crucial trait that determines wheat yield. Understanding the genetic basis of GNS is vital for breeding. Thus, we conducted QTL mapping of GNS in two populations, Pubing3228 and Jing4839, under different environmental conditions to identify the key genetic factors controlling GNS.

Between 2019 and 2021, across three diverse environments, a total of 40 QTLs related to GNS were identified, distributed across 16 chromosomes, encompassing all chromosomes except for 1B, 2A, 5B, 6A, and 7A (Table 3). Chromosomes 5D and 7D harbored the highest number of QTLs, with six identified on each, whereas chromosomes 2B, 3B, 4D, 6B, and 6D each had only one QTL identified. The log of odds (LOD) scores of these QTLs ranged from 3.01 to 33.21, accounting for 4.25–17.17% of the phenotypic variation. Among all detected QTLs, QGns.wa-1 A.e1.2 exhibited the highest percentage of phenotypic variation explained (PVE), reaching up to 17.17%, located between AX-94,465,571 and AX-109,859,174 on chromosome 1 A. Following this were QGns.wa-7Da.e1 and QGns.wa-1D.e1.1, contributing 15.12% and 14.83% of PVE, respectively. Four QTLs were consistently detected across different environments, indicating stability, including QGns.wa-1A.e1.2, QGns.wa-1D.e1.1, QGns.wa-5A.e1, and QGns.wa-7Da.e1, all showing high PVE. QGns.wa-1A.e1.2 on chromosome 1A was detected in environments 2021E1, 2020E2, and 2020E3, contributing 8.2–17.17% to phenotypic variation. QGns.wa-1D on chromosome 1D was observed in environments 2019E1, 2020E1, and 2020E3, contributing 9.89–14.83% to phenotypic variation. QGns.wa-5A on chromosome 5A was identified in environments 2020E1, 2020E2, and 2020E3, explaining 9.095–11.13% of the phenotypic variation. QGns.wa-7Da.e1 on chromosome 7D accounted for 4.92–15.12% of the phenotypic variation. Notably, positive additive effects indicated an increase in the effect from Pubing3228, while negative additive effects suggested an increase from Jing4839. Except for two QTLs, all others exhibited positive additive effects, indicating that all favorable alleles originated from Pubing3228, which carries genetic traits promoting high GNS.

Table 3 Significant QTLs identified for GNS in the Pubing3228/Jing4839 RIL populations across different environments

Exploration of the molecular mechanisms of GNS in wheat

To clarify the molecular mechanism behind GNS in wheat, this study analyzes stable QTLs that impact GNS. The study constructs an expression heatmap of relevant genes using the Wheat Expression Browser public database (http://www.wheatexpression.com). Out of the 1265 genes analyzed, emphasis is placed on genes expressed explicitly in wheat grains, considering them as candidate genes that may influence GNS. This approach identifies 11 potential candidate genes (Table 4) related to GNS. These genes are primarily located on chromosomes 4A, 5D, and 7D.

Table 4 Identified candidate genes for GNS in wheat from the Pubing3228/Jing4839 RIL population

Among the candidate genes, TraesCS4A01G601800 is the only one on chromosome 4A. It is potentially involved in the function of 4-hydroxybenzoate acetyltransferase in wheat. On chromosome 5D, three candidate genes are identified: TraesCS5D01G044000, TraesCS5D01G051000, and TraesCS5D01G033700. These genes may encode ribosomal protein L28 [49], aging-related protein, and YABBY transcription factor. On chromosome 7D, seven candidate genes with diverse functions are found. They include TraesCS7D01G056100, potentially encoding S-adenosylmethionine decarboxylase precursor, TraesCS7D01G060100 for cysteine proteinase, TraesCS7D01G064300 for starch synthase, TraesCS7D01G109200 for GDSL esterase/lipase, TraesCS7D01G148000 for an aging-related protein, TraesCS7D01G117600 for an ethylene-responsive transcription factor, and TraesCS7D01G117100 for histone H3 [50, 51].

Discussion

GNS, a key determinant of wheat yield, has been extensively studied [52]. Previous research has attributed the enhancement of wheat yield primarily to the increase in GNS [53], making it a crucial selection target in wheat breeding practices. Despite significant progress in GNS over the past decades in China, which has substantially contributed to yield improvements in wheat breeding, recent years have not seen notable advancements in wheat yield enhancement. This stagnation may be attributed to slow progress in identifying and utilizing new genetic resources related to GNS [54, 55]. The germplasm Pubing3228, derived from distant hybridization between common wheat and Agropyron, represents a novel wheat germplasm with numerous desirable agronomic traits, especially those associated with GNS. Therefore, it is imperative to further explore the genetic mechanisms underlying the high grain number in Pubing3228 [56, 57].

QTL comparative study of genetic mechanisms for GNS in Pubing3228 wheat germplasm

We conducted a study using an F2:3 population derived from a cross between Pubing3228 and Jing4839 to investigate the underlying genetic mechanism of GNS in Pubing3228. Our investigation revealed 12 QTLs spread across multiple chromosomes (Supplementary Material 3), marking a notable increase to 40 identified QTLs, with many not previously detected. This variance could stem from using an RIL population and a higher-density genetic linkage map constructed with 3334 SNP markers, unlike the earlier study’s temporary F2:3 population and 179 SSR markers.

We employed the BLAST method to determine the physical positions of the QTL identified in the previous study. It is important to note that the QTLs identified in our study were found within or near the mapping intervals of the previous QTL (Supplementary Material 3). For example, a previously identified QTL on chromosome 5A has been refined in our study from a 20 Mb interval, highlighting potential consistency between the two studies and narrowing the mapping intervals for several QTLs, such as those on chromosomes 7A and 7B. This detailed comparison underscores the consistency of our findings with previous research while offering more precise mapping intervals for identified QTLs.

Comparative analysis and new insights of QTLs for GNS in Pubing3228 wheat germplasm

In this study, we delved into the genetic underpinnings of GNS to enhance wheat yield and facilitate MAS in breeding. Through QTL analysis, an established method for uncovering genetic resources, we reviewed and identified 170 GNS QTLs across all 16 chromosomes (Supplementary Material 4) [58, 59], aligning the physical positions of 153 previously reported QTLs for comparison (Supplementary Material 4). Comparing our results with previous studies revealed some significant differences. However, our study reveals that the genomic locations of several QTLs, which account for approximately 21.33% of the total variation, are consistent with previously identified QTLs (Supplementary Material 5). Our findings showed significant differences and consistencies with prior research, notably confirming the locations of several QTLs that contribute to about 21.33% of the total variation. For instance, we identified QTLs on chromosome 1 A at the same position (498–499 Mb) as previously reported and discovered three GNS QTLs within a broad genetic region on chromosome 7B, previously noted in another study [60, 61]. Our research also matched previously identified GNS QTLs on chromosomes 5 A and 6B [62].

In this study, we identified several GNS QTLs that were consistent with previous research, indicating the reliability of our results. We mapped three major QTLs (QGns.wa-1A, QGns.wa-7D, and QGns.wa-1D) to specific regions on chromosomes 1A, 7D, and 1D, respectively. These regions were between AX-94,465,571 and AX-109,859,174, AX-110,826,147 and AX-108,748,734, AX-109,994,213 and AX-89,491,149. Among the QTLs identified, these QTLs exhibit the highest PVE in all three environments and demonstrate stability. However, the other two major GNS QTLs have yet to be documented in previous studies [63, 64].

An exciting aspect of our study is that we detected more GNS QTLs than in previous studies. This condition may be attributed to the utilization of new germplasm Pubing3228, which is derived from a cross between common wheat and A. cristatum. Moreover, to enhance the detection of GNS QTLs, we constructed a high-density linkage map using many gene-based SNP markers. It is important to note that our study did not find GNS QTLs on chromosomes 5B or 6A. However, previous studies have identified these QTLs controlling GNS [60, 65].

Essential chromosomes and region analysis of QTL for GNS in Pubing3228 wheat germplasm

This study observed an uneven distribution of QTLs for GNS across different chromosomes. Most GNS QTLs were concentrated on a small subset of chromosomes, namely 1A, 1D, 3A, 3D, 5A, 5D, and 7D. For instance, chromosome 1D harbored five GNS QTLs within the range of 126 cM to 129 cM, while chromosome 5A contained five QTLs within the 326 cM to 338 cM. Although only four GNS QTLs were identified on chromosome 3A, they were concentrated in a relatively short range of 203 cM to 258 cM. Other noteworthy regions included the interval of 809 cM to 835 cM on chromosome 5D and 165 cM to 168 cM on chromosome 1A, each containing four GNS QTLs. On the other hand, GNS QTLs on chromosomes 3D and 7D were scattered throughout the chromosomes. These significant chromosomal regions should be prioritized for future research. Similar regions and chromosomes associated with GNS have been identified in previous studies [66,67,68]. Although various studies have explored yield-related traits in wheat, limited research has specifically investigated GNS [69,70,71]. A previous study identified and validated major regions on chromosomes 5A and 2A for high GNS using an RIL population derived from the hybridization between synthetic hexaploid wheat ‘W7984’ and the spring wheat variety ‘Opata M85’. Notably, two GNS QTLs were identified on chromosome 6D. Pubing3228 originated from the crossbreeding of common wheat and A. cristatum [70]. Prior research demonstrated that increased floret and grain numbers in the wheat-A. cristatum chromosome addition line 4844-12 (2n = 44) was controlled by a pair of A. cristatum chromosomes (6P) substituting the wheat chromosome 6D. Pubing3228 was developed through several generations of selection from the progeny of 4844-12 [72]. A prior study has also highlighted the importance of the A. cristatum 6P chromosome segment in increasing GNS in wheat-A. cristatum translocation line Pubing2978. Hence, further analysis is required to determine if these two QTLs on chromosome 6D have origins in A. cristatum [45].

Prediction and analysis of candidate genes associated with GNS in Pubing3228 wheat germplasm

Although numerous QTLs associated with GNS have been identified in wheat, only some candidate genes for GNS have been reported, leaving the genetic mechanism of GNS in wheat largely unknown. This study pinpointed seven stable QTLs for GNS across various environments and proceeded with an analysis to predict 11 potential candidate genes based on functional annotation and expression levels.

Three candidate genes were particularly noteworthy due to their significant association with the most impactful SNP AX-108,748,734, on chromosome 7D, contributing to the highest average PVE (12.62%). The first, TraesCS7D01G148000, is hypothesized to encode a protein linked to aging processes in wheat, crucial for grain yield as they potentially facilitate nutrient reallocation to developing seeds [73]. The second, TraesCS7D01G117600, is believed to encode an ethylene-responsive transcription factor (ERF), part of the AP2/ERF family, instrumental in stress response, reproduction, defense, and hormone secretion regulation [74], with implications for flower development control [75]. The third, TraesCS7D01G117100, might be associated with histone H3, playing a role in the vernalization-induced transition from vegetative to reproductive growth through histone methylation [50, 51], suggesting its potential impact on GNS due to the established relationship between GNS and growth phases [76].

Four other candidate genes were also identified on chromosome 7D. Two genes (TraesCS7D01G056100, TraesCS7D01G060100) correspond to SNP AX-108,759,691, with TraesCS7D01G056100 speculated to encode S-adenosylmethionine decarboxylase precursor, potentially playing a role in polyamine biosynthesis. Research has suggested that polyamines, including putrescine, spermidine, and cadaverine, are involved in cell division, embryogenesis, flower development, and fruit development [77, 78]. The candidate gene TraesCS7D01G060100 may encode a cysteine protease, a significant class of proteases in plants involved in various processes such as post-translational modification, development, aging, programmed cell death, and antibiotic response [79, 80]. The candidate genes TraesCS7D01G064300 and TraesCS7D01G109200 may encode starch synthase and GDSL esterase/lipase, respectively. Starch synthase is a critical enzyme in starch synthesis in wheat endosperm, directly affecting final yield [81, 82], while GDSL esterase/lipase plays a regulatory role in plant development and morphogenesis [83, 84].

On chromosome 5D, three candidate genes have been pinpointed: TraesCS5D01G044000, TraesCS5D01G051000, and TraesCS5D01G033700. TraesCS5D01G044000 and TraesCS5D01G051000 are postulated to encode for the 60 S ribosomal protein l28 and a protein linked to aging processes, respectively. The 60 S ribosomal protein l28 plays a crucial role in translation, potentially influencing mitochondrial translation and various plant cellular activities [85]. Research indicates that the 60 S ribosomal proteins are integral to maintaining ribosomal complex stability and enhancing protein biosynthesis, suggesting their possible impact on GNS. Additionally, TraesCS5D01G033700 is implicated in the regulation of pistil and stamen development within wheat flowers. Various studies have documented the significant expression of TaYABBY genes throughout cereal development phases, underscoring their importance in plant growth and reproduction [86, 87]. Furthermore, a significant SNP, AX-108,908,317, on chromosome 4A has been associated with a gene encoding 4-hydroxybenzoic acid acetyltransferase. Previous research has demonstrated that this enzyme is crucial for ubiquinone biosynthesis in rice [88]. Ubiquinone is an essential lipophilic electron carrier required for the mitochondrial respiratory chain in eukaryotic cells [89, 90]. In Arabidopsis, studies have shown that mutants lacking ubiquinone experience developmental arrest during early embryogenesis stages [91,92,93,94].

Scientific and production significance

This study offers significant scientific value in providing new avenues and tools for enhancing the genetic potential of wheat. Firstly, by analyzing the hybridization of Pubing3228 and Jing4839 and conducting subsequent QTL analysis, this study not only elucidates the genetic basis of GNS but also identifies valuable molecular markers that can be employed in wheat breeding programs. Developing high-yielding wheat varieties can be expedited by utilizing these markers in MAS. The identification and analysis of GNS-related genes in this study contribute to our understanding of the underlying mechanisms involved in wheat yield formation, thereby offering crucial insights for future efforts in crop improvement. Moreover, although this study primarily focuses on plant genetics, its methodology and findings extend beyond this area and can inspire a diverse range of biological and agricultural research. For instance, comprehending the genetic control mechanisms behind complex traits like GNS can facilitate similar investigations in other crops or biological traits.

Limitations and future prospects

Despite the advancements achieved in our research, certain limitations require attention. While candidate genes for GNS have been pinpointed, their exact roles and mechanisms in GNS regulation need experimental validation for a comprehensive understanding. Additionally, the impact of environmental variables on GNS was not exhaustively analyzed. Future studies should evaluate the stability and performance of QTLs across various environmental settings. This investigation predominantly centered on the Pubing3228 wheat germplasm, potentially restricting the universality of the identified QTLs and genes. Expanding research to include a broader spectrum of wheat germplasms will broaden the relevance and application of these findings. Employing precise gene-editing methods, such as CRISPR/Cas9, to alter candidate genes can validate their functions and investigate their suitability for molecular breeding strategies. In conclusion, our study contributes valuable perspectives on genetic advancement and yield optimization in wheat, yet further exploration is essential to validate these insights fully.

Conclusion

This study identified 40 QTLs associated with GNS in the wheat germplasm Pubing3228 (Fig. 2), thereby enriching our understanding of wheat’s genetic diversity and illuminating the intricate genetics of the GNS trait. These QTLs, distributed across 16 chromosomes, underscore the multi-gene influence on GNS rather than its being governed by a single gene. Five QTLs demonstrated substantial phenotypic variability across different environments, accounting for over 10% of the variation and highlighting their importance in wheat yield development. The majority (approximately 94.67%) of beneficial GNS QTLs were derived from Pubing3228, underscoring its genetic resource value. Furthermore, the study proposes 11 candidate genes potentially influencing GNS, involving processes like aging and ethylene response, offering crucial insights into the genetic regulatory mechanisms of GNS.

Fig. 2
figure 2

Graphical summary of the genetic basis of high GNS in the wheat-A. cristatum hybrid germplasm Pubing3228