Identification of genes involved in the mulberry ethylene biosynthesis and signaling pathways
In total, 145 genes were predicted to belong to the ethylene biosynthesis and signaling pathways. Of these, 29 genes were found to be involved in ethylene biosynthesis and signal transduction (Table 1). The AP2/ERF family contains 116 genes (Supplemental Table 1). The MnACS family contains 7 genes, of which MnACS10 and 12 are transaminases. Four of the remaining five MnACS genes each had four exons. MnACS5 only had three exons (Fig. 1). Sequence alignment indicated that each MnACS protein contained seven conserved boxes also found in ACS proteins from other plant species. The conserved glutamate (E) residue in box 1, which determines substrate specificity, is present in all the members of the ACS gene family. Except for MnACS5, the MnACS proteins all shared a Ser residue in the C-terminal, which produced a shorter C-terminus than other ACS isozymes (Supplemental Figure 1). ACO is a member of the Fe II-dependent family of oxidases. They require ascorbate as a substrate and Fe (II) as a cofactor to exert enzymatic activity. Four MnACOs were identified in the mulberry genome. MnACO1 and MnACO4 have three exons, and MnACO2/3 has four exons (Fig. 1). The motifs for binding the cofactor (H-X-D-X-H) and the cosubstrate (R-X-S) are also conserved in MnACOs. In the C-terminus, E-R-E is essential to enzyme activity, particularly R, which may be involved in the mechanism of CO2 activation (Supplementary Figure 2).
Table 1 Genes involved in ethylene biosynthesis and signaling pathways in the M. notabilis genome
Ethylene signal transduction is important to ethylene synthesis. Many genes take part in this process. Ethylene receptors (ETRs) form a family of membrane-localized receptors. There are four MnETRs in the mulberry genome. The N-terminal of MnETRs includes three highly conserved transmembrane regions. This region has been demonstrated to be essential to the ethylene binding activity of MnETRs. The C-terminal region of all MnETRs except MnERS1, which has a truncated C-terminal, contains a receiver domain. In addition, the N-terminal regions of MnETR2 and MnEIN4 are much longer than those of other receptors. SMART prediction revealed that they contain four transmembrane regions (Supplementary Figure 3). EIN3 and EIN3-like proteins are essential to the ethylene signal pathway. They are positive regulators, and they bind to the promoter regions of the downstream genes to regulate their expression. Five EIN3s were found here in the mulberry genome. They all have a conserved amino-terminal acidic domain (AD), pro-rich region (PR), and five small basic domains (BDI-V). The Gln-rich and Asn-rich regions, found in mung bean plants, are only conserved in MnEIN3 and MnEIL1 (Supplementary Figure 4). MnEIN3, MnEIL1, MnEIL4, and MnEIL5 were all located on the same scaffold in the mulberry genome (Table 1).
Expression of mulberry ethylene biosynthesis and signal pathway genes
Genes whose expression was detected by qPCR are listed in Fig. 2. Results showed that the genes in these four families have diverse expression profiles across different tissues. MnACS2, MnACO2, MnACO3, MnETR2, MnEIN3, and MnEIL1 have higher levels of expression. On the contrary, the expression of MnACS3, MnACS4, MnACO4, MnERS1, MnEIN4, MnEIL3, and MnEIL4 was relatively repressed. In addition, MnACS5 exhibited tissue-specific expression in female flowers. Among ACO genes, MnACO1 and MnACO2 exhibited tissue-biased expression in fruit, although the expression levels of MnACO1 were only about one tenth of those of MnACO2. MnEIN3 and MnEIL1 showed more expression than the rest of the EIN3 family in roots and fruits.
Phylogenetic and structural analysis of the AP2/ERF transcription factor family
In the present study, 116 AP2/ERF transcription factors were identified in the mulberry genome. According to the number of AP2 domains and their structural features, these 116 proteins can be divided into five subfamilies. There are 58 genes in the ERF subfamily, 33 genes in the DREB subfamily, 21 genes in the AP2 subfamily, 3 genes in the RAV subfamily, and 1 gene in the Soloist subfamily. A phylogenetic tree was constructed using the amino acid sequences of AP2/ERF transcription factors of Morus and Arabidopsis. This tree was separated into 15 groups. Groups I–IV represent the DREB subfamily, and V–X represent the ERF subfamily (Supplementary Figure 5). All groups except for the V group have lower numbers of genes in mulberry tree than those in Arabidopsis. There are 11 genes in the V group of mulberry and 5 in the V group of Arabidopsis.
The pattern of exon/intron splicing usually provides information useful to understanding of the emergence and evolution of a gene family. Structural analyses of genes indicated that all genes in the AP2 subfamily had different numbers of introns ranging from zero to twelve. However, 28 of the 33 genes in the DREB subfamily, 44 of the 58 genes in the ERF subfamily, and 2 of the 3 genes in the RAV subfamily had no introns. Results showed that most genes in the AP2 family shared similar patterns of exon/intron splicing (Supplementary Figure 6).
Genes in the AP2/ERF family containing the EAR motif
The ERF-associated amphiphilic repression (EAR) motif (DLNxxP or LXLXL) has been reported to be a repression domain in repressor-type ERF proteins. These proteins negatively regulate genes involved in developmental, hormonal, and stress signaling pathways. There are three types of EAR motifs in mulberry plants, DLNXXP, LXLXL, and LDLNLXPP. These three motifs have been found in ERF, DREB, AP2, and RAV subfamily (Fig. 3). All of MnERF proteins bearing EAR motif belong to MnERF-B1 group except MnERF-B4-3 and MnERF-B6-8. Only four genes in MnDREB subfamily contain EAR motif, and the DLNXXP motif has been identified in MnDREB-A5-4 and MnDREB-A5-5. Sequence analyses revealed the 19 genes that bear LXLXL motif found in mulberry proteins.
Expression of MnERF genes
To investigate the levels of expression of AP2/ERF genes in different tissues, RPKM data of these genes were analyzed in roots, barks, leaves, flowers, and buds. Heat maps were constructed based on the RPKM data. Sixteen MnERF genes, including five genes in MnERF-B1, four in MnERF-B3, three in MnERF-B2, two in MnERF-B4, and one each in MnERF-B5 and MnERF-B6, had relatively high levels of expression in five tissues (Fig. 4a). This was also true of 8 of the 33 genes in the DREB subfamily, 6 of the 21 genes in the AP2 subfamily, and 1 of the 3 genes in the RAV subfamily. In addition, several genes were also found to be expressed in a tissue-biased manner. For example, MnERF-B3-21 was expressed solely in male flowers (Fig. 4a). MnDREB-A4-7 was more abundantly expressed in leaves than in other tissues (Fig. 4b). MnAP2-5 had high levels of expression in male flower (Fig. 4c). However, the expression of nine genes (MnERF-B6-1/8, MnDREB-A3-1, MnDREB-A4-4, MnDREB-A6-1/2, MnAP2-13/18/21) was not detectable in any tissue.
Response of MaERF-B2-1 and MaERF-B2-2 to flooding stress
Genes in MnERF-B subfamily were expressed at relatively higher levels in different tissues. All members of this subfamily have a conserved N-terminal motif MCGGAV/II, which is considered to be involved in the regulation of low-oxygen response in plants (Fig. 5). Analyses of the promoter regions of MnERF-B2-1, MnERF-B2-2, and MnERF-B2-3 showed that they contain consensus sequences of transcription factor binding sites. As shown in Table 2, four catalogues of putative regulatory elements (stress response, hormone response, light response, and others) were detected in the flanking regions of three genes. GARE, CE3, ABRE, TCA, and TGACG-motifs are involved in the signaling of mulberry hormones. Typical heat shock element (HSE) and low temperature stress response element (LTR) were also observed. It is noteworthy that ARE and GC motifs, which are known to be responsive to the hypoxemia and anaerobic conditions, were detected in the promoters of MnERF-B2-1 and MnERF-B2-3. In this context, two of these three genes from Husang-32, MaERF-B2-1 and MaERF-B2-2, were cloned, sequenced, and their expression patterns were further investigated. Mulberry cultivar Husang-32 is widely planted in the Three Gorges Reservoir area. As shown in Fig. 6a, MaERF-B2-2 has higher expression than MaERF-B2-1 or MaERF-B2-3. To understand the changes in gene expression that takes place after flooding, the expression of the MaERF-B2-1 and MaERF-B2-2 was detected after treatment lasting 1 h, 3 h, 5 h, 7 h, 1 day, 3 days, 5 days, and 7 days (Fig. 6c). Results showed that MaERF-B2-1 was up-regulated in roots and leaves after 1 day of flooding. The change took place faster in leaves than that in roots. MaERFB2-2 showed fast up-regulation in roots after 1 h of flooding. MaERF-B2-1 showed a greater fold change than MaERF-B2-2. These results indicated that MaERF-B2-1 and MaERFB2-2 may play important roles in Husang-32’s response to flooding.
Table 2
Cis-elements in MnERF-B2-1, MnERF-B2-2, and MnERF-B2-3 promoters