Introduction

As an important agronomic trait of rice, rice grain type is a principal index to measure the appearance quality of rice; moreover, it is also the key factor to affect rice yield. The main attributes reflecting grain type traits include grain length (GL), grain width (GW), grain thickness (GT) and grain aspect ratio (Ertao et al. 2008). Grain size is mostly measured by thousand-grain weight, which is a quantitative trait and is controlled by different genetic factors, such as the embryos, endosperm, and maternal plants. The grain type of rice is mainly determined by the glume, and the growth of the glume is regulated by cell numbers and size. The growth of glumes occurs mainly through cell division to increase the number of glume cells in the early stages of development. With the continuous growth of glumes, cell division is gradually be replaced by cell expansion (Li et al. 2018). Genetically, glumes are derived from diploid maternal plants, and their shape and size are controlled by the maternal genotype, while the endosperm, which accounts for the largest volume of seeds, develops from the triploid fertilized polar nucleus, and the embryo develops from a fertilized egg. Therefore, grain shape and size may be affected by both the maternal plant genotype and the zygotic genotype. In addition, cytoplasmic inheritance, such as the genetic material in chloroplasts or mitochondria, will also affect the development of grain.

From the perspective of heritability, the heritability of grain length and grain width is moderately high, and these traits are relatively stable under different environments, the heritability of grain thickness is low, with it being easily affected by the environment (Shi et al. 1999). With the development and application of molecular marker technology along with rice functional genomics and resequencing methods, using different genetic populations, more than 500 QTLs associated with rice grain size have been detected, and 93 rice grain size-related genes have been cloned (Wei et al. 2018). The number of clones distributed on 12 pairs of chromosomes differs, with the highest number on chromosome 3, and fewer on chromosomes 1, 9, and 12 (Huang et al. 2013). The QTLs that have been determined to control grain length and grain width mainly exhibit additive effects, but they also have a dominant effect. Grain thickness is mainly controlled by additive QTLs, which also influence maternal effects. The aspect ratio is a composite trait composed of grain length and grain width, and it is also mainly controlled by additive or dominant QTLs (Zuo and Li, 2014).

Rice grain development involves complex regulation, from the beginning of cell proliferation or elongation to the end of grain filling. This process entails plant hormones, the ubiquitin–proteasome pathway, the mitogen-activated protein kinase (MAPK) signaling pathway, the G protein signaling pathway, and epigenetic modification (Kesavan et al. 2013; Liu et al. 2015c, 2018; Zhang et al. 2016; Hu et al. 2018; Xu et al. 2018; Li et al. 2012). The genes involved in MAPK signaling cascade, G-protein signaling and the ubiquitin–proteasome pathway influence rice size by controlling cell proliferation, while pathways involving plant hormones and epigenetic modification simultaneously affect cell proliferation and expansion, and ultimately control grain size (Sun et al. 2018). An increasing number of studies have shown evidence for an interactive regulatory relationship between different grain size regulatory pathways (Li and Li, 2016).

GW2 was the first gene controlling rice grain width to be cloned and it encodes a RING-type protein with E3 ubiquitin ligase activity. The gw2 mutant is functionally disabled, unable to transfer ubiquitin to the target protein. The substrate could not be specifically recognized and degraded, and subsequently activating the division of the glume cells and increasing the width of the glume (Song et al. 2007). WTG1 encodes an otubain-like protease, which is homologous to human OTUB1 and has deubiquitinase activity. Overexpression of WTG1 causes a more slender grain through cell expansion mechanisms (Huang et al. 2017). In addition, GW5, as a nucleus localization gene, that acts by ubiquitin proteasome pathway (Shomura et al. 2008; Weng et al. 2008). BG1 is an auxin-responsive gene and stimulates glume cells elongation and division, resulting in regulates the organ volume of plants (Linchuan et al. 2015). Hu (Hu et al. 2015) cloned the rice grain length gene GS2, encoding a growth regulator OsGRF4, which plays a key role in cell division during grain development. Dwarf1 (D1) encodes a G protein α subunit Gα. Mutations in the D1 gene result in a small round grain phenotype in rice grains, and the sensitivity to brassinolide (BR) is reduced, indicating that D1-mediated G protein and BR signal transduction pathways through an important mechanism ( Wang et al. 2006; Miura et al. 2009). A negative regulator of grain size is small grain 1 (SMG1), which encodes the rice mitogen-activated protein kinase kinase OsMKK4 that promotes cell division (Penggen et al. 2014). It has been reported that Dwarf and Small Grains1 (DSG1) encodes a mitogen-activated protein kinase (OsMAPK6) homologous to Arabidopsis AtMAPK6, which has phosphorylation activity. Down regulation of DSG1 produces smaller grains and lower thousand-grain weight (Liu et al. 2015c). Promoter region hypomethylation of the related to abscisic acid insensitive3/viviparous1 6 (RAV6) gene is responsible for increasing leaf inclination and smaller grains (Zhang et al. 2015). Furthermore, GW6a has histone acetyltransferase activity (OsglHAT1), and causes an increase in the number of glume cells and results in increasing grain weight and yield. (Song et al. 2015). In rice, grain size is controlled by multiple genes, and exhibits interaction effect. For example, Gao found that GS3 and qGL3 have an additive effect on the regulation of rice grain length (Gao et al. 2015). GW2 and GW5 positively regulate the expression of GS3. At the same time, GW5 inhibits the transcription of GW2, thereby reducing its expression (Yan et al. 2011). GS3 can mask the influence of GW5 on grain length, and GW5 can mask the influence of GS3 on grain width (Yan et al. 2011).

Although many genes that control rice grain size have been cloned, and these genes also involve multiple regulatory pathways, there is still insufficient information concerning the complex mechanism of grain size regulatory pathways and the correlation between different regulatory pathways. In this study, a recombinant inbred line (RIL) population was constructed by crossing 9311 and Changhui 121. Through resequencing the population, a high-density genetic map was constructed, and QTL mapping was performed for grain traits based on data under four environments. This study lays a foundation for comprehensively describing the regulatory mechanism of rice grain development and establishing an ideal grain size regulatory network.

Materials and methods

Plant materials

The RIL population of F14 generation was developed from the cross of 9311 (an indica cultivar and founder parents, pedigree: Yangdao4 × Yan3021) and Changhui 121 (a fragrant rice restorer line, pedigree: Yuexiangzhan × Xiangxian402), using single seed descendant method. The plants were cultivated at the experimental field of Jiangxi Agricultural University, Nanchang province (28°68′ N, 115°86′ E; long-day conditions) in 2018 and 2019 (Nanchang 2018 and Nanchang 2019), and planted at Hainan province (18°31′ N, 109°71′ E; short-day conditions) in 2018 and 2019 (Hainan 2018 and Hainan 019).

Grain size related traits measurement

Thirty fully filled seeds were randomly selected to measure the grain length and width with a Vernier caliper at the mature stage. More than 200 fully filled were weighed with an electronic micrometer balance, with the measurements used to calculate the thousand-weight of the seeds. The average of three measurements was calculated.

Total DNA extraction and QTL analysis

The extraction of rice genomic DNA was performed according to the CTAB method (Paterson et al. 1993). The qualified genomic DNA samples were sequenced using the Hiseq Xten system with PE150 sequencing strategy. Sequences obtained were compared using BWA software. SNP and InDel variations were produced by the GATK3.3.0 process after comparison. The high-quality SNP loci of the population were classified, and LepMap3.0 was used to divide the linkage groups and the consequent construct a genetic map. QTL IciMapping software was used to analyse the QTLs related to the grain size traits with the inclusive composite interval mapping (ICIM) model, and the LOD threshold was 3.0 based on a 1000-permutation test.

Gene knockout vector construction

Referring to the information published on the CRISPR direct website (http://www.crisr.dbcls.jp), target sequences with low off-target probability and high specificity at the candidate gene locus were identified. The methodology was followed to obtain the primer Oligo sequence. Knock-out mutants were generated using the CRISPR/Cas9 system (Beijing Viewsolid Biotech Co., Ltd.), and the specific operation was carried out according to the kit instructions. The recombinant vectors were introduced into rice calli via Agrobacterium-mediated transformation. The knockout was performed in Zhonghua 11, which is a japonica rice variety. The primers used are listed in Table S1.

Results

Phenotypic performance of grain size traits

The grain characteristics of the parents and the RILs population were studied under four types of environments (Nanchang in 2018, Hainan in 2019, Nanchang in 2019 and Nanchang in 2020) (Table 1). There were certain differences in the grain traits of 9311 and Changhui 121, including grain length, grain width and thousand-grain weight (Fig. 1a, b). The grain size data in the RIL population showed continuous and obvious separation under every environment, with a normal distribution, indicating that the grain size traits are regulated by multiple genes and were suitable for QTL analysis (Fig. 1c). In terms of grain length, there was no significant difference in the distribution of multipoint survey data for many years, and the frequency distribution trend was consistent. In terms of grain width, in 2019 and 2020, Hainan had an increase in grain width compared with that of Nanchang, with the normal curve and peak value shifted to the right. As it increased, the thousand-grain weight also changed consistently, demonstrating that the grain width and thousand-grain weight were significantly affected by the environment.

Table 1 Phenotypic values of grain size in recombinant inbred line populations
Fig. 1
figure 1

Grain size of two parents and frequency of the three grain traits in the RIL population. a Grain phenotypes of two rice parents 9311 and Changhui121, Bar, 1 cm. b Grain length, grain width and thousand-grain weight of 9311 and Changhui121. c The frequency of the three grain traits in the RIL population. Student’s t-test was used to generate P values; ** P < 0.01

Construction of genetic map

Based on the re-sequencing of parents and RIL populations, a total of 6,658,585 initial SNPs and 1,371,642 initial InDel were produced. The statistical results of the distribution on each chromosome are as follows (Fig. 2). The genome was basically covered, and the overall distribution was even (Table S2), suggesting that the overall randomness of sequencing with a better quality. After filtering the initial population SNPs obtained, a high density genetic map was constructed using 45,607 SNP markers. A group of SNPs that were mapped to the same location constituted a bin, and the genetic map included 1,910 bin markers and distributed across 12 chromosomes. The total genetic distance was 1951.68 cM, and the average genetic distance between adjacent bin markers was 1.02 cM (Fig. 3).

Fig. 2
figure 2

Distribution of SNPs and InDels on 12 chromosomes. a The number of InDels within 0.1 Mb window size. b The number of SNP within 0.1 Mb window size

Fig. 3
figure 3

High density genetic map

In general, the bin markers were well distributed in the genome. The largest difference occurred in linkage group 02 (LG02), covering 244.56 cM, with 208 bin markers. The average distance between adjacent markers was 1.18 cM. The smallest LG was LG7, covering 120.72 cM, with 133 bin markers, and the average distance between adjacent marks was 0.91 cM (Table 2).

Table 2 Information of genetic linkage group

QTL mapping for grain size

Combining the genetic map and the grain size data of the RIL population investigated for four different environments, QTL mapping was performed. Using QTL IciMapping software with the inclusive composite interval mapping (ICIM) method, 14 QTLs related to grain size with large LOD were detected, and distributed on 6 chromosomes. Among them, there were 4 QTLs affecting grain length, which were distributed on chromosomes 1, 3 and 12, with a total contribution rate of 79.35%. qGL12 had the largest contribution rate of 34%. Six QTLs distributed on 4 chromosomes were identified as affecting grain width, and the total contribution rate was 60.35%. There were 4 QTLs affecting the thousand-grain weight, and distributed on chromosomes 1, 2, 11 and 12 with a total contribution rate of 43.3% (Table 3).

Table 3 Stable QTLs related to GL, GW and TGW traits

Among the QTLs associated with grain size, qGW2-1 and qTGW2 were mapped to the same interval of chromosome 2 and to coincide with the cloned grain width gene GW2. Therefore, the effect of qGW2-1 and qTGW2 may be caused by the major gene GW2. On the other hand, qGL12 and qTGW12a are located in the interval of 22,435,836–22,453,339 on chromosome 12, indicating that there are two main QTLs affecting both grain size and thousand-grain weight in this region, suggesting that these QTLs had pluripotency. qTGW12a is not at the same locus as the QTLs related to grain size previously identified on chromosome 12 (NAL3, OsPPKL3 and OsSUT2); hence, qTGW12a is the novel major QTL for regulating grain weight identified in this study.

Candidate gene analysis of qTGW12a

According to the MSU Rice Genome Annotation Project Database and Resource (http://rice.plantbiology.msu.edu/cgi-bin/gbrowse/rice/), there is a predicted gene in the region of 22,435,836–22,453,339 on chromosome 12 for qTGW12a: LOC_Os12g36660, which is 1498 bp in length, contains 2 exons and one intron, and encodes the multidrug and toxic compound extrusion (MATE) protein. Sequencing data showed that the LOC_Os12g36660 (Os12g0552600) gene of Changhui 121 has 6 nonsynonymous mutations in the coding region, resulting in changes in encoded amino acids, as well as mutations (G → A) at the splicing site, causing an intron retention compared with 9311. The MATE proteins of rice, corn and Arabidopsis were analysed and found that that to be closely related to Os03g0839200 in rice and Zm00001d012883 (BIGE1) in maize (Fig. 4). The BIGE1 transporter is involved in regulating the size of seed organs. Therefore, LOC_Os12g36660 was likely to be a candidate gene for qTGW12a and named TGW12a.

Fig. 4
figure 4

Phylogenetic tree of candidate gene and homologous proteins. The phylogenetic tree was constructed by MEGA 6.0 using the neighbor-joining method with 1000 replications

Functional verification of TGW12a

To determine whether TGW12a is the gene that causes changes in grain weight, we conducted genetic modification verification. The LOC_Os12g36660 gene was knocked out using the CRISPR/Cas9 system, and Zhonghua11 was transformed by the Agrobacterium-mediated method to obtain transgenic positive plants. Two independent mutations were constructed in Zhonghua11 (CR-1and CR-2), including a 19-bp deletion and a 1-bp deletion in the target region (Fig. 5a). The phenotypic investigation showed that the grain size of the progeny of LOC_Os12g36660ZH11 knock-out plants was significantly narrower than that of the wild type, and the allele of TGW12a was responsible for the lower thousand-grain weight (Fig. 5b). It is further proven that the change in grain size was caused by editing LOC_Os12g36660. The mutation of TGW12a altered the grain size, but potentially due to the genetic compensation effect of the TGW12a mutations (resulting in altered expression of its homologs), their effect on the production of grain length was not as drastic as a knockout mutation would be expected to be. Consequently, knockout mutations of TGW12a eventually lead to a reduction in grain weight.

Fig. 5
figure 5

Gene editing and analysis of TGW12a. a Types of mutations and grain phenotypes. Bar, 1 cm. b Grain width, grain length and thousand grain weight of transgenic plants with mutated LOC_Os12g36660-CR. Student’s t-test was used to generate P values; ** P < 0.01

At the same time, the appearance quality of the transgenic rice lines was also affected, and obvious increased was observed in area of chalky endosperm. Hence, it is speculated that this gene may be involved in different regulatory pathways.

Discussion

Yield has always been one of the main goals of rice breeding, but it is a complex trait that is affected by multiple factors. Grain size and panicle traits are two key traits determining grain yield in rice. The genetic basis of these traits is complex and controlled by multiple genes (Xia et al. 2018; Shi et al., Shi et al. 2018; Ruan et al. 2020). With the rapid development of next-generation sequencing technology, through whole-genome resequencing, a bin-marker mapping based on chromosome fragments covering the genome has been perfomed and precise QTL positioning has been applied in many species (Liu et al. 2015a; Jiang et al. 2017; Luo et al. 2020). A large number of SNPs and InDel markers were detected by resequencing with the advantages of high genome coverage and high density, which can improve the accuracy of QTL mapping, and obtain relatively small chromosome reference intervals. For example, QTLs for plant height of foxtail millet using ultrahigh density genetic mapping were identified (He et al. 2021). Furthermore, QTLs for fiber quality traits under salt stress were obtained using a high-density genetic map (Guo et al. 2021).

In the present work, 14 QTLs related to grain size were identified. Comparing results of previous studies, it was determined that qGW2-1 and qTGW2 are the same site, and this interval contains the known major gene GW2 (Song et al. 2007). Several studies have previously reported that GW2 encodes a cyclic E3 ubiquitin ligase; the protein is localized to the cytoplasm and negatively regulates cell division in rice. Furthermore, the qGW3 interval contains the cloned gene Bg1, Bg1 is the original auxin response gene, and is involved in regulating the transport of auxin. It is a positive regulator of the auxin response and transport and affects cell division and cell elongation. Moreover, Bg1 is a plant-specific regulatory factor that controls organ size (Liu et al. 2015b).

The candidate gene TGW12a encodes the MATE transporter. The MATE family members are cation secondary transporters, first obtained in the identification of multidrug efflux proteins from Vibrio parahaemolyticus and E. coli and mainly involved in the transmembrane transport of small molecule compounds (Debeaujon et al. 2001; Diener et al. 2001; Rogers and Guerinot, 2002; Marinova et al. 2007). MATE is ubiquitous in bacteria and eukaryotes. As an increasing number of members of the MATE family of proteins have been identified, the understanding of this family of proteins has gradually improved; however, the function of most MATE proteins is still unclear. In plants, the processes that MATE family proteins participate in mainly include: transport of secondary metabolites, such as anthocyanins; detoxification of toxic compounds or heavy metals; regulation of plant disease resistance; iron translocation and aluminium detoxification (Gomez et al. 2011; Sun et al. 2011; Fujii et al. 2012), TT12 gene is involved in anthocyanin transport(Marinova et al. 2007), ALF5 and ADP1 genes are involved in plant lateral root development (Diener et al. 2001; Li et al. 2014), FDR3 is involved in iron transport and so on (Rogers and Guerinot, 2002). The Arabidopsis ZRZ gene is involved in transmitting leaf-borne signals and determines the rate of organ activation (Burko et al. 2011).

Bige1 (Zm00001d012883), a MATE transporter gene encoded in the maize (Zea mays), is involved in the regulation of maize organogenesis and organ size. Grain sizes developed homozygous mutant Bige1 maize plants were also smaller than those of wild type plants. Bige1 is located in the trans-Golgi apparatus, indicating that it may be involved in the secretion of signal molecules (Suzuki et al. 2015). The specific down regulation of the OsMATE2 gene results in significantly smaller grain size along with early flowering and maturation in rice. Recently, Qin et al. cloned a gene, DG1, that regulates rice grain filling, encoding a MATE transporter. It has been illustrated that leaf-derived ABA controls rice seed development in a temperature-dependent manner and is regulated by DG1 (Qin et al. 2021). Therefore, it is speculated that TGW12a also has a similar function to BIGE1 and OsMATE2 and it participates in the regulatory pathway of cell development.

The CRISPR/Cas9 targeted mutagenesis performs site-specific editing of the target gene and generates different types of allelic mutations. This method has been applied in transgenic verification experiments for multiple QTLs associated with rice grain size. For example, Wang knocked out the annotated gene adjacent to GL7 and found that this gene can negatively regulate GL7 (Wang et al. 2015). After the candidate genes TGW3 and GS9, which control the grain length, were knocked out, the grains of the transgenic plants were changed (Ying et al. 2018; Zhao et al. 2018). In the present study, with the knockout verification test, the grain width and weight of the mutant were significantly decreased compared to those of the wild type. This result provides strong evidence that LOC_Os12g36660 is a candidate gene for qTGW12a. Notably, the previously mapped QTL, qTGW12a, was involved in regulating the function of grain length, while the grain width of the transgenic lines also changed. The difference in expression specificity may reflect the sub-functionalization of the partially redundant MATE gene of the Zhonghua11 homologue gene. Furthermore, we hypothesized that the motif of TGW12a may be regulated by multiple transcription factors.

Comparison of genomic DNA of TGW12a from 9311 and Changhui 121 showed that there were 6 SNPs and 1 splice-site mutations in the coding region of the gene. The relationship between the mutations in this region and the grain phenotype has been not determined. All SNPs may be considered necessary for phenotypic variation, which requires further verification by genetic complementarity experiments.

Conclusion

Our results indicated the potential of the TGW12a gene in regulating grain size, and further research is needed to elucidate the molecular mechanism of this gene in rice. Simultaneously, the newly identified QTLs regulating grain size may provide a potential new approach to exploring rice grain synthesis. These results provide a basis for cloning a QTL that has an important contribution to rice grain size variation.