Introduction

The establishment of the in vitro system through embryogenic callus of higher plants is a bottleneck step for production and genetic breeding. The solution of the embryogenic callus induction depends on the cell totipotent, and some of the plant cells may become totipotent when they treated with proper conditions (Fehér 2019). With the deepening research, early in twentieth century somatic embryogenesis was achieved in Arabidopsis thaliana (Mordhorst et al. 1998), Oryza sativa L. (Karthikeyan et al. 2009), Zea mays L. (Armstrong and Green 1985; Vasil and Vasil 1986), Hordeum vulgare L. (Pasternak et al. 1999), Gossypium hirsutum (Zheng et al. 2014), as well as in two Miscanthus species including Miscanthus sinensis and Miscanthus × giganteus (Gawel et al. 1990; Holme and Petersen 1996). There are probably several reasons affect the Miscanthus × giganteus regeneration including explant types, developmental stages, and culture media (Holme and Petersen 1996). From previous study, somatic embryogenesis had been accomplished in a number of plants with different hormone concentrations or culture conditions; however, the molecular mechanism has not been deciphered clearly. To elucidate the molecular mechanism, the additional research of somatic embryogenesis should be carried out.

It was reported that some genes and transcriptional factors played key roles in somatic embryogenesis formation (Méndez-Hernández et al. 2019; Namasivayam 2007). Among those genes, the somatic embryogenesis receptor kinase (SERK) genes must be mentioned first, which was recognized to be the most important one, involving in a series of developmental processes such as induction and differentiation of callus and cell totipotency in plant (Méndez-Hernández et al. 2019; Pilarska et al. 2016). SERK had been first considered to be an important marker to identify embryogenic cells in carrot (Schmidt et al. 1997). In Arabidopsis thaliana, the loss of SERK was demonstrated to result in abnormal embryo development (Li et al. 2019). The homolog genes of SERK, associated with somatic embryogenesis, have also been detected in some other species, such as in rice (Singla et al. 2009), maize (Baudino et al. 2001), wheat (Singh and Khurana 2012), Medicago truncatula (Nolan et al. 2009), and Citrus unshiu Marc. (Shimada et al. 2005). Three critical cDNA fragments including ZmARF-GEP, ZmKHCP, and ZmTypA involved in the embryogenic callus induction were isolated by cDNA amplified fragment length polymorphism analysis in maize (Sun et al. 2012). All of the three genes showed a higher expression pattern in embryogenic callus than in non-embryogenic callus. In addition, the Arabidopsis leafy cotyledon (LEC) genes, AtLEC1 and AtLEC2, promoted the somatic embryogenesis when they transferred into tobacco (Guo et al. 2013) and also considered to be a key regulators of somatic embryogenesis in cassava (Brand et al. 2019). Previous study also found that carrot homolog of Arabidopsis LEC1 only expressed in somatic embryos (Yamamoto et al. 2005).

Miscanthus lutarioriparius, originated in China and a genus of C4 plant, has great potential as a biomass energy crop (Zhao et al. 2013). High efficiency regeneration from embryogenic callus system is a key bottleneck step for its transgenic engineering. Because of the wild distribution of M. lutarioriparius, it is hard for finding the suitable samples for tissue culture. Molecular marker-assistant breeding has focused on the use of random markers for quantitative trait locus (QTL) analysis of traits, such as single-nucleotide polymorphism (SNP), simple sequence repeats (SSR), and amplified fragment length polymorphism (AFLP) markers (Amini et al. 2016; Feng et al. 2016; Lu et al. 2015).

In this study, a total of 37 M. lutarioriparius samples were used for induction of callus. To investigate the mechanism of somatic embryogenic callus formation in M. lutarioriparius, five candidate genes were used to assess the differential expression between embryogenic and non-embryogenic calli which were differentiated by the morphology of the paraffin section with optical microscope. Three full-length candidate genes and two partial sequence genes were cloned based on the maize and sorghum genome sequence database. In addition, one of the genes showed a significant correlation between genotype and induction of embryogenic callus in M. lutarioriparius. This study may lay the foundation of the molecular mechanism on the embryogenic callus induction of M. lutarioriparius and, perhaps, provide some gist for further study on genetic manipulation.

Materials and methods

Preparation of biological materials

Thirty seven samples of M. lutarioriparius were grown in Ezhou, Hubei Province of China, and used for this study. The information of samples is accessioned in Table S1 and the distribution is shown in Fig S1. Their immature inflorescence tissues were harvested in the middle of September and were then surface-sterilized by washing with 75% ethyl alcohol for 30 s, 0.1% (w/v) mercuric chloride (HgCl2) for 1.5 min, and rinsed thoroughly (at least three times) with sterile distilled water. Immature inflorescence tissues were placed onto the induction medium (4 mg/L 2, 4-D) according to the methods used by Zhao et al. (2013). The induced calli were transferred to the same fresh medium after 21 days and subcultured every 3 weeks. The characteristic of the calli induced from 37 samples were recorded after two rounds of subculturing. The induction frequencies were calculated by SPSS (Statistical Program for Social Sciences) 14.0. The samples with higher frequencies of embryogenic callus induction, i.e., I065 and I093, were used for RNA extraction, gene cloning, and expression study.

RNA extraction and cDNA synthesis

Plant materials were stored at − 80 °C and total RNA was extracted from embryogenic calli using RNAprep pure Plant Kit (TIANGEN BIOTECH., LTD, Beijing, China). DNase treatment was conducted prior to column purification of RNA according to the manufacturers instructions (TIANGEN BIOTECH). cDNAs were synthesized using M-MLV Reverse Transcriptase (Promega Corporation, USA). PCR was carried out with KOD-Plus-Neo DNA Polymerase (TOYOBO CO., LTD, Osaka, Japan) with cDNAs as templates. PCR primers were designed from sequences data in the NCBI of Zeamays and Sorghum, which are shown in Table 1. The reaction mixture of cDNA synthesis contained 5 μl of 2 mM dNTPs, 5 μl of 10 × PCR Buffer for KOD Plus-Neo (TOYOBO, Osaka, Japan), 3 μl of 25 mM MgSO4, 15 pmol of a primer pair, 2 μl of cDNAs template, 2 μl of dimethylsulphoxide (DMSO), and 1 μl of KOD-Plus-Neo and the last supplementary 29 μl of ddH2O with a total volume of 50 μl. PCR amplifications were set to 36 cycles of amplification: pre denaturation at 94 °C for 2 min, and then denaturation at 98 °C with 10 s, annealing for 2–5 min at 68 °C (the criterion: 1000 bp about 1 min) and extension at 16 °C for 1 min as the last step. T4 ligase was used for PCR products cloning in pGEM-T vectors (Promega), following transformation to E. coli DH5α competent cells (TransGen, Beijing, China). Ten positive clones of E. coli in each species were selected to sequence. The plasmids of each gene were isolated using a plasmid extract kit (Tiangen) and stored at − 20 °C for standby application in other experiments.

Table 1 Names and sequences of primers used for PCR amplification of five genes

Sequence alignment and phylogenetic analysis

We used online BLAST server (https://blast.ncbi.nlm.nih.gov/Blast.cgi) to detect the identity of the candidate genes. ORF Finder (https://www.ncbi.nlm.nih.gov/projects/gorf/) was selected for open-reading frames (ORFs) prediction. We performed sequence alignments including DNA and protein by ClustalX 2.1 Program (Larkin et al. 2007). The phylogenetic analysis of candidate genes conducted by MEGA 5.1 (Tamura et al. 2011). To identify the conserved domains and functional sites, InterPro (https://www.ebi.ac.uk/interpro/) was selected as the tool. ARF-GEPs, KHCPs, and TypAs from other species, including both monocots and dicots (Table S2, S3, and S5), were chosen for phylogenetic analyses. SERKs from other species, including monocots, dicots, mosses, and ferns, were chosen for phylogenetic analyses (Table S4). A basic neighbor-joining tree was built by MEGA 5.1.

Expression profiles of five genes at different developmental stages on two types of callus

The embryogenic callus of accession I093 and non-embryogenic callus of A061 were used for analysis of expression level after 3 weeks of callus induction. The relative expression pattern files of MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, and MlTypA in M. lutarioriparius were calculated by qRT-PCR using the comparative CT (2−△△CT) method (Schmittgen 2008). The total RNA isolation was obtained from two callus types at different developmental stages (the 1st, 2nd, 3rd, and 4th weeks) with their morphology observed under optical microscope (Fig. 1). The qRT-PCR primers for amplification of five genes (MlARF-GEP forward: 5′- AGCAGTATCTATGCCTGTCACTTTTA-3′, reverse: 5′- CCTGGTCTGAATCTTGCCACTAA -3′; MlKHCP forward: 5′- GGAAGCAGAAGCAGATAGCAT -3′, reverse: 5′- GCAGCCTTCACCTCAAGTTC -3′; MlSERK1 forward: 5′- TCGGCCCGTGGATTCTG -3′, reverse: 5′- CTCATTTGGCGTGCGTTC -3′ MlSERK2 forward: 5′- GCAGTCGTGGGTGATTTCG -3′, reverse: 5′- AGGACTTGCCAGTGGATAGGTAC -3′; MlTypA forward: 5′- CATACCAGAGCCACGCATTG -3′, reverse: 5′- GTCACCAGCACTAACAGATTCA -3′) were designed from a conservative sites. 18SrRNA1 was chosen as a reference gene according to Huang et al. (Huang et al. 2014). The primers are: forward, 5′- CTACCACATCCAAGGAAG-3′; reverse, 5′- CAATTACCAGACACTAACG -3′. The expected size of amplified fragments of MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, MlTypA, and 18SrRNA1 are 109, 154, 106, 121, 238, and 105 bp, respectively. The specificity of the primers and reference primers were confirmed by qRT-PCR amplification efficiency, using geometric dilution method (100, 10–1, 10–2, 10–3, and 10–4) of embryogenic callus as template. CT value of different concentration cDNA was used for standard curve. The amplification efficiency is ((−k) √10 − 1) × 100% (k was the slope of standard curve). qRT–PCR was carried out in a total volume of 20 μl, and the reaction mixture contained 10 μl of TUNDERBIRD SYBR qPCR Mix (TOYOBO), 6 pmol of primer pair, and 2 μl of cDNA and 6.8 μl of ddH2O according to the manufacturer’s recommendation. ABI StepOneTM Real-Time PCR System (Applied Biosystems, Foster City, California, USA) was used for qRT-PCR amplification, setting with 95 °C for 1 min, 95 °C for 15 s with 40 cycles, and 60 °C for 30 s. Three biological repeats along with three technical repeats were used in this study. StepOne Software v2.3 (Applied Biosystems) used to analyze the experimental data.

Fig. 1
figure 1

Two types of callus and their paraffin section, optical microscope (10 × 100). a Embryogenic callus, b embryogenic callus paraffin section, c non-embryogenic callus, and d non-embryogenic callus paraffin section

PCR amplification and polymorphism analysis of SSR markers

The DNeasy Plant Mini Kit (TIANGEN BIOTECH., LTD, Beijing, China) was used to the total genomic DNA extraction from dried leaf tissue in M. lutarioriparius. According to the genome information of Sorghum in the NCBI, DNA sequences of five genes were amplified. The SSR loci of five genes were detected using SSR Hunter 1.3. PCR amplification-specificity of the designed SSR primers (Table 2) and the presence of a polymorphism were tested on 37 individuals from M. lutarioriparius. PCR amplifications were performed in a total volume of 10 ul, containing 0.2 μl Taq polymerase (0.2 U/ml), 1 μl genomic DNA (50 ng/ml), 1.25 μl 10 × PCR buffer, 1 μl MgCl2 (20 mol m−3), 0.4 μl dNTP (10 mol m−3), and 0.2 μl (25 mol m−3) of each reverse primer and supplementary with 5.75 μl ddH2O. The PCR reaction was set to 95 °C for 5 min, 95 °C for 30 s with 35 cycles, 58 °C for 45 s, and 72 °C for 30 s, following a final elongation step for 10 min at 72 °C. Then, we performed the PCR products for electrophoresis on 6% denaturing polyacrylamide sequencing gels. DNA stripes were silver stained and recorded.

Table 2 Characteristics of SSR loci developed from five genes in Miscanthus lutarioriparius

Data analysis of the SSR marker was performed using SPSS 14.0. The number of alleles per locus and observed results of induction characterization were estimated via Chi-square test.

Results

Cloning of MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, and MlTypA

By degenerate primerr amplification designed on sequences of Sorghum and maize, we cloned the full-length ORF of cDNA sequences of three genes and named them MlARF-GEP, MlSERK1, and MlSERK2 (GenBank accession numbers KU640196, KU640198, and KU640199) based on the obtained cDNA. However, the other two genes were only got the partial ORF of cDNA sequences and named MlKHCP and MlTypA (GenBank accession numbers KU640197 and KU640200) based on the obtained cDNA. All the genes of the PCR amplification results are shown in Fig. 2. The complete ORF sequence of MlARF-GEP is 5385 bp long and encoding a protein of 1794 amino acids. The cDNA sequence of MlSERK1 is 1956 bp long and contains 1872 bp ORF encoding a protein of 624 amino acids. The partial 5′-UTR and 3′-UTR are 9 and 75 bp long, respectively. The cDNA sequence of MlSERK2 is 2119 bp long and contains 1878 bp ORF encoding a protein of 626 amino acids. The partial 5′-UTR and 3′-UTR are 130 and 111 bp long, respectively. The partial ORF sequence of MlKHCP is 1172 bp long and encoding a protein of 387 amino acids. The partial ORF sequence of MlTypA is 1962 bp long and encoding a protein of 654 amino acids.

Fig. 2
figure 2

Five gene electrophoresis results of PCR products

Sequence analysis of MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, and MlTypA

The protein sequence and length of the MlARF-GEP was quite conserved, compared to the other monocots (Brachypodium distachyon, Oryza brachyantha, Oryza sativa Japonica Group, Oryza sativa, Setaria italic, Sorghum bicolor, and Zea mays) and dicots (Arabidopsis thaliana, Brassica napus, Malus domestica, and Ricinus communis) by multiple alignment analysis (Fig S2). The highest identity was 99% (SbARF-GEP) as compared to MlARF-GEP. The neighbor-joining tree built by MEGA 5.1 showed that MlARF-GEP was separated into monocots and dicots groups, respectively (Fig. 3a). It means that MlARF-GEP evolved independently after differentiation of monocot and dicot plants. In addition, MlARF-GEP exhibited a closest genetic relationship with SbARF-GEP (Fig. 3a). Multiple alignment analysis of MlKHCP was conserved with both in the monocots and dicots (Fig S3), and the neighbor-joining tree showed a similar characteristic with MlARF-GEP; MlKHCP exhibited a closest genetic relationship with SbKHCP (Fig. 3b). Furthermore, MlSERK1 and 2 were also verified to be conserved in the sequence and length of protein as compared to those monocots (Oryza sativa Indica Group, Sorghum bicolor, Triticum aestivum, and Zea mays), dicots (Arabidopsis thaliana, Citrus sinensis, Glycine max, Gossypium hirsutum, Nicotiana benthamiana, Solanum lycopersicum, and Vitis vinifera), mosses (Physcomitrella patens), and ferns (Selaginella moellendorffii) (Fig. S4). The neighbor-joining tree built by MEGA 5.1 showed that MlSERK1 and 2 was separated into four key groups including monocots, dicots, mosses, and ferns, respectively (Fig. 3c). MlSERK1 exhibited a closest genetic relationship with SbSERK2, and MlSERK2 exhibited a closest genetic relationship with ZmSERK2 and SbSERK3 (Fig. 3c). Multiple alignment analysis of MlTypA was conserved with both in the monocots and dicots (Fig. S5), and the neighbor-joining tree showed a similar characteristic with MlARF-GEP, MlTypA exhibited a closest genetic relationship with SbTypA (Fig. 3d).

Fig. 3
figure 3

Phylogenetic tree of five genes. a MlARF-GEP ORF and other ARF-GEPs form dicots and monocots. b MlKHCP ORF and other KHCPs form dicots and monocots. c MlSERK1 and MlSERK2 ORF and other SERKs form dicots and monocots. d MlTypA ORF and other TypAs form dicots and monocots. The different color represents the clustering in a branch

Expression patterns of five genes in two types of embryogenic calli and non-embryogenic calli

The primers amplification ability of MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, MlTypA, and 18sRNA1 was examined specifically (Fig. S6). The six primer amplification efficiencies were 107.18%, 92.64%, 94.84%, 98.03%, 100.33, and 106.00%, and their standard curves are shown in Fig. S6. The R2 values of the six standard curves were all between 0.99 and 1.0 (Fig. S7). Five genes were expressed in all developmental stage in two types of calli. As we expected, the similarities of the five genes expression pattern were obtained in four callus developmental stages. The highest expression levels of MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, and MlTypA were observed in the third week after induction in embryogenic callus (Fig. 4). Furthermore, the lowest expression levels were found in the fourth week. In addition, the relative expression levels in embryogenic calli were detected to exceed than in non-embryogenic calli among four developmental stages, the expression of MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, and MlTypA in embryogenic calli exhibited approximately several hundred folds higher than in non-embryogenic calli. Interestingly, the expression of five genes plummeted in the fourth week.

Fig. 4
figure 4

Expression patterns of a MlARF-GEF, b MlKHCP, c MlSERK1, d MlSERK2, and e MlTypA genes in different developmental stages in two types of callus. NEC, Non-embryogenic callus; EC, Embryogenic callus. Error bars indicate standard deviation of three biological replicates. “*”P < 0.05,“**”P < 0.01

Polymorphism analysis of SSR markers in five genes

The induction frequencies and descriptive of calli from 37 individuals of M. lutarioriparius are shown in Table S1. The embryogenic calli were induced in only nine individuals. According to the five genes in Sorghum genome database, there were no SSR loci detected by SSR Hunter 1.3 analysis in MlARF-GEP and MlTypA genes. Although five SSR loci in MlKHCP and one SSR loci in MlSERK2 were detected, no polymorphism was observed. Interestingly, two SSR loci, MlSERK1-3 (Fig S8) and MlSERK1-4, amplified three alleles in 37 M. lutarioriparius individuals. The variation from MlSERK1-3 showed significance with the induction characteristic, but there is no significance from MlSERK1-4 (Table 3). The individuals which could be induced embryogenic calli were all observed in part of the heterozygotes and all homozygotes were induced non-embryogenic calli by the MlSERK1-3 amplification.

Table 3 Results of statistical analysis of SSR from 37 samples

Discussion

Immature inflorescence tissues are frequently used as to the explants in Miscanthus species tissue culture (Holme and Petersen 1996; Zhang et al. 2012); however, the genotype of the materials, the cultural condition of the calli, and the concentration of the plant growth regulator in the medium were simultaneously affected the growth and development of the embryogenic calli (Glowacka et al. 2010). The same results were also exhibited in rice (Niroula et al. 2005) and turfgrass (Salehi and Khosh-Khui 2005), and inflorescence length was one of the reasons of embryogenic calli induction in orchard grass (Çeliktaş et al. 2014). In this study, a total of 37 individuals of M. lutarioriparius were used for callus induction; we examined very different induction frequencies (Table S1). The induction results were affected by the different genotypes. Some genes, especially SERKs, maybe play a very important role in embryogenesis. Our study evaluated two SERK genes, MlSERK1and MlSERK2, they showed a similar expression pattern in different developmental stage of embryogenic calli or not. Except the above results, MlSERK1 showed significance between genotype and callus characteristic. The genotypes which could be induced embryogenic calli were all heterozygotes of MlSERK1, this result maybe implies heterozygotes of MlSERK1 which were more easily used for induction as compared to homozygotes. Our study provided a recommendation that the MlSERK1 protein may play an important role in the formation process of embryogenic calli and following studies should be carried to identify the accurate roles which it makes. The relative expression had an extreme elevation in the third week of MlSERK1 and MlSERK2, it may be caused by the hormone accumulation in medium. The relative expression plummeted in the fourth week maybe because of weakening of the hormone. Many reports suggested that the Miscanthus species subcultured in tissue culture every 3 weeks (Holme and Petersen 1996; Zhao et al. 2013), and thus, the expression pattern of MlSERK1 and 2 demonstrated the above method from molecular level.

ZmARF-GEF is reported to belong to the ADP ribosylation factor (ARF) family; it plays function relays on the alternation between GDP and GTP (Morinaga et al. 1996). In maize, ZmARF-GEF was deduced to associate with the auxin flowing and polarization, which further influenced the formation of embryogenic calli (Sun et al. 2012). MlARF-GEP was closely related to ZmARF-GEF. In addition, according to the expression pattern in two types of callus of MlARF-GEP, it implied that MlARF-GEP in M. lutarioriparius and ZmARF-GEF in Zea mays played a similar role.

KHCP may be a kinesin heavy chain like protein, that can move along cytoskeletal fibers by hydrolyzing ATP to generate energy (Vallee and Shpetner 1990), and they play a significant role in chromosome segregation, assisting the spindle to play function during mitosis and meiosis (Sawin and Endow 1993). The MlKHCP homolog protein ZmKHCP was demonstrated to account for hydrolyzing ATP to generate force to separate the chromosome during cell division in embryogenic calli (Sun et al. 2012).

Homologous sequence alignment analysis showed that the MlTypA was highly similar with TypA (Tyrosine phosphorylation protein A) proteins from the other species. Phylogenetic analysis showed that MlTypA is closed to SbTypA and ZmTypA. TypA is a new protein of the ribosome-binding GTPase superfamily, which involved in various regulatory pathways (Margus et al. 2007). AtTypA genes are involved in pollen tube growth in Arabidopsis (Lalanne et al. 2004). A TypA cDNA fragment is isolated from embryogenic calli in maize; they suggest that ZmTypA may perform and regulate similar roles with other species (Sun et al. 2012).

Our study suggested that MlARF-GEP, MlKHCP, MlSERK1, MlSERK2, and MlTypA genes participated in the process of embryogenic calli formation. In conclusion, the qRT-PCR technique provided valuable information on distinguishing of the gene expression levels between embryogenic calli and non-embryogenic calli. Five candidate genes involved in the formation of embryogenic calli were cloned. The identification of these candidate genes helped to understand the underlying mechanisms related to somatic embryogenesis. However, the clear functions of these candidate genes remain unclear. An effective plant regeneration system has been established, the transformation system is developing in our lab. Further work, the exact role of the five genes should be explored by RNA interference or overexpression technology in M. lutarioriparius.