Introduction

Glutathione transferases, also known as glutathione S-transferases (GSTs), are ubiquitous in animals, plants, and microorganisms. They are a group of superfamily enzymes encoded by multiple genes, performing various functions, including promoting metabolism, and elimination of harmful xenobiotics or oxidative products through glutathione binding to these substances (Nutricati et al. 2006). For example, GSTs facilitate the covalent binding of reduced glutathione (GSH) with hydrophobic and electrophilic substrates, forming conjugates that are sequestered in vacuoles or transferred to the apoplast, thereby detoxifying both endogenous and exogenous harmful substances, such as herbicides, and aiding in tolerance to abiotic stresses (Öztetik 2008; Cicero et al. 2015). This detoxification process involves three stages: transformation, conjugation, and compartmentalization (Nianiou-Obeidat et al. 2017). Soluble GST is mainly distributed in the cytoplasm, with some present in chloroplasts and microsomes, and a minor amount in the nucleus and extracellular bodies (Dixon et al. 2002). In addition, a typical GST has two binding sites: the N-terminal GSH binding site (G-site; GST-N) and the adjacent electron substrate binding site (H-site; GST-C), primarily formed by the C-terminal. The GST-N is well- conserved, likely due to its role in binding GSH, while the GST-C is variable, possibly because of its ability to combine with multiple substances (Edwards and Dixon 2005; Sylvestre-Gonon et al. 2019).

Based on the homology of plant proteins and gene structure characteristics, the GST family is divided into eight subfamilies: F (Phi), U (Tau), T (Theta), Z (Zeta), L (Lambda), DHAR, EF1Bγ, and TCHQD (Jain et al. 2010). The F and U subfamilies, unique to plants, have the largest number of members and the greatest abundance compared to other subfamilies. They have a broader substrate spectrum, allowing them to bind to various harmful xenobiotics, such as pesticides and herbicides, to protect plants from toxicity (Munyampundu et al. 2016). Theta-type GSTs possess glutathione peroxidase (GPOX) activity, reducing H2O2 produced by plants under oxidative stress. Zeta-type GSTs with maleoylacetone isomerase activity are involved in the degradation of tyrosine in organisms, indicating common metabolism of compounds in living cells (Board et al. 1997). DHAR-type and Lambda-type GSTs are grouped as mercaptotransferases, mainly functioning as peroxidases. DHAR type members catalyze ascorbate synthesis under stress conditions such as excess light and drought, enhancing plant antioxidant capacity (Lallement et al. 2014). The TCHQD protein may have a serine residue at its active site (Mohsenzadeh et al. 2011). However, the nature of the catalytic residue in the EF1Bγ subfamily remains unclear. It has been reported that members of the EF1Bγ subfamily play a role in the oxidative stress response pathway (Olarewaju et al. 2004).

GST affects growth and development by participating in plant physiology and metabolism, increases resistance to adversity, and is involved in cellular signaling and other pathways. In flax (Linum usitatissimum L.), GST is important for detoxification of reactive oxygen species (ROS) and cell wall modification (Dmitriev et al 2016). In walnut species (Juglans regia L.), JrGSTTau1 plays a positive role in osmotic tolerance and is modulated by upstream regulators (Yang et al. 2019). The tobacco GST gene expression was significantly up-regulated by various stresses, and transgenic tobacco plants expressing Tau-class SbGST genes exhibited enhanced tolerance to abiotic stress (Jha et al. 2011). MaGSTs play a pivotal role in both development and abiotic stress responses in banana (Wang et al. 2013). GST has been reported as a key component in the metabolism of anthocyanins, flavonols, proanthocyanidin, cinnamic acid (Wei et al. 2019; Liu et al. 2019a, b), methyl jasmonate (Zhang et al. 2022a, b), salicylic acid (Gullner et al. 2018), auxin (Wu et al. 2023), and ethylene (Zhang et al. 2022a, b). Previous studies showed that GST-mediated transport is key in anthocyanin accumulation in various plant species, including sweet cherry (Qi et al. 2022), maize (Marrs 1996), petunia (Mueller et al. 2000), pear (Li et al. 2022), Arabidopsis (Yamazaki et al. 2008; Sun et al. 2012), grape (Conn et al. 2008), litchi (Hu et al. 2016), apple (Jiang et al. 2019), kiwifruit (Liu et al. 2019a, b), and peach (Zhao et al. 2020).

Betula platyphylla Suk., a forest species with significant economic importance, is widespread in northern China (Lyu et al. 2020). It is a vital timber resource for the construction and furniture industries and also plays an essential role in ecosystem restoration as a pioneering species in forests recovering from fires. The species prefers sunlight, demonstrates cold resistance, and thrives in acidic and moist soils, highlighting its adaptability and ecological value (Jing et al. 2020; Geng et al. 2022).

Understanding the function of the GST gene family is instrumental in the study of molecular mechanisms of development and stress tolerance of plants. However, the genome-wide GST gene family has not been comprehensively evaluated in birch. In this study, 71 BpGST genes were identified, and their basic biological information analyzed, which included gene structure, evolutionary relationship, conserved motif, gene collinearity and promoter cis-acting elements. Combined with the analysis of the expression pattern of Phi BpGSTFs in different organs and under salt, mannitol, and ABA abiotic stresses, this study provides reference and the basis for exploring the function of GST family genes.

Materials and methods

Identification and sequence analysis of the GST gene birch family

Protein sequences of the Arabidopsis thaliana GST gene family were obtained (http://www.arabidopsis.org/). A local blast database was constructed using white birch proteome sequences (Chen et al. 2021), with A. thaliana GST protein sequence as the seed for Blastp search with the E value cutoff of 1e − 10. The hidden Markov model of GST was downloaded from the Pfam database (http://pfam.xfam.org/) for PF02798, PF00043, PF13410, and PF13417. Searches were conducted using HMMER 3.0 software (http://hmmer.janelia.org) based on E value ≤ 1e − 5. Repetitive sequences from both methods were removed. Using the CDD (https://www.ncbi.nlm.nih.gov/cdd) for domain validation, sequences without GST were deleted, resulting in the final identification of candidate genes.

Analysis of physical and chemical properties

The ExPASy online software ProtParam (https://web.expasy.org/protparam/) was used to predict the physical and chemical properties of the protein primary structure, including the isoelectric point (pI), amino acid number, and relative molecular mass (https://wolfpsort.hgc.jp/).

Phylogenetic tree construction

The protein sequences BpGSTs and AtGSTs were analyzed using MEGA 7 software with ClustalX multiple sequence alignment. An evolutionary tree was constructed with the NJ method (execution parameters: Poisson correction, pairwise deletion, and bootstrap 1000 duplications). The birch GST gene family members were renamed according to the evolutionary tree subfamily.

Gene structure analysis

Gene structure maps were generated using the GFF file annotated by B. platyphylla genome in TBtools software (Chen et al. 2020).

Conservative motif analysis

Motifs with high similarity in the family genes were detected by MEME, setting the width of conserved sites to ≥ 5 and ≤ 50, and the maximum number of conserved sequences to 20. The conservative motif map of the gene family was drawn by TBtools.

Chromosomal localization analysis

Using genome annotation information, the gene location information and structure were visualized using TBtools software.

Analysis of cis-acting elements of promoter

Based on genome annotation information, 2000 bp upstream of the gene was analyzed using PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html) for promoter cis-acting element analysis. The results were visualized by TBtools.

Plant materials

Birch in vitro seedlings were cultured in a woody plant medium containing 20 g L−1 sucrose and 7.5–8 g L−1 agar. The climate chamber was set at 24 ± 2 °C light for 16 h using a 46 µmol m−2 s−1 cold white, fluorescent lamp, 8 h of darkness, and a relative humidity of 65% to 75%. Four-week-old seedlings, approximately 2.5 cm high, were transferred to a medium containing 150 mM NaCl, 200 mM mannitol and 100 µM ABA for abiotic stress. Samples were collected on 1, 2, 4, and 7 days after treatment, and the untreated samples are to be named ‘day 0’, serving as the control group. Each group was replicated three times, frozen in liquid nitrogen and stored at − 80 °C.

RNA extraction and cDNA synthesis

Total RNA was extracted from the samples by the CTAB method (Gambino et al. 2008) and RNA integrity was confirmed by electrophoresis on a 1% agarose gel. cDNA was synthesized with the ReverTra Ace® qPCR RT Master Mix with gDNA Remover kit (TOYOBO, Shanghai) according to the manufacturer’s instructions.

Real-time fluorescence quantitative PCR analysis

The synthetic cDNA was diluted tenfold as a template for relative quantitative PCR using a Roche LightCycler 480 II fluorescence quantitative PCR instrument. Reaction conditions included a 95 °C reaction for 30 s; 95 °C for 10 s, 60 °C for 10 s, 72 °C for 30 s for 45 cycles, followed by 95 °C for 10 s, 60 °C for 1 min, and 97 °C for 15 s to confirm PCR product specificity with melting curves. All samples were replicated three times and three technical replicates. Birch 18S rRNA (18S) was used as an internal control. Primer design was based on the online program Primer-Blast (Primer designing tool, https://www.ncbi.nlm.nih.gov/tools/primer-blast/) with all analyzed gene primers listed in Table S2.

For RT-qPCR, TOYOBO THUNDERBIRD SYBR qPCR Mix Without Rox (TOYOBO, Shanghai) was used, totaling 20 µL of 10 µL 2 × SYBR qMix, forward and reverse primers 0.6 µL (10 µM), 2 µL cDNA, and 4.8 µL ddH2O. The relative expression level of the target gene was determined by the 2−ΔΔCt method (Yaish et al. 2010).

Results

Identification and analysis of birch GST gene family

Based on BLASTp and HMM search, 71 GST family members were identified in the birch genome after removing the redundant transcripts. Based on protein sequence phylogenetic subfamily classification and chromosome location information, they were named BpGSTU1-BpGSTU41, BpGSTF1-BpGSTF13, BpDHAR1-BPDHAR3, BpGSTL1-BpGSTL3, BpGSTT1, BpGSTZ1-BpGSTZ5, and BpTCHQD1-BpTCHQD5 (Table S1).

The 71 GST gene family members encoded proteins varied in size from 101 to 875 amino acids, with molecular weights ranging from 94.86 to 116.38 kDa. The BpDHAR3 had the highest molecular weight and the highest number of amino acids (875 amino acid residues). Conversely, the shortest BpGSTU16 protein contained only 101 amino acid residues. The theoretical isoelectric point (pI) was between 4.43 and 9.82. Fifty-two BpGST members had a pI < 7, indicating these proteins may possess acidic properties; the remaining nineteen BpGST members had a pI > 7, suggesting these proteins could have alkaline characteristics. The instability index ranged between 21.78 and 67.27, with fourty-six BpGST proteins with an instability index < 40. In contrast, the other twenty-five BpGST proteins had a high instability index, indicating that they could be unstable proteins. The aliphatic index of these proteins ranged between 73.87 and 112.86. The GRAVY (Grand Average of Hydropathy) index, which estimates a protein’s hydrophobic or hydrophilic character, indicated that most BpGST proteins, except for BpGSTU23, BpGSTU25, BpGSTF11, and BpGSTZ2, were hydrophilic with negative values. Subcellular prediction results indicated that most BpGSTs were localized in the cytoplasm, followed by the nucleus, mitochondria, peroxisome, and the cytoskeleton (Table S1).

Phylogenetic analysis of GST gene family

The evolutionary classification of 53 AtGSTs and 71 BpGSTs based on their amino acid sequences showed that the 71 BpGST protein members in B. platyphylla could be divided into seven subfamilies, including Tau, Zeta, Lambda, Theta, DHAR, TCHQD, and Phi subfamily. Similar to the classification in Arabidopsis, more than half of BpGST members belonged to the U and F subfamilies. Among the 71 birch family members, the U subfamily had the most members with 41 (Fig. 1); F subfamily followed with 13 members. The subfamilies, DHAR and L, consisted of 3 members each. Subfamily T only had one member, BpGSTZ1, and subfamily Z and TCHQD both had 5 respectively (Fig. 1). Notably, the Phi, Theta, and Theta subfamilies were clustered on the same evolutionary clade, indicating a closer relationship among these subfamilies.

Fig. 1
figure 1

Phylogenetic tree of the relationships among 71 BpGSTs in birch and 53 AtGSTs in Arabidopsis. The same color blocks represent the genes belonging to the same subfamilies

Analysis of the structure and motif of GST family genes in birch

For the 71 GST gene structures in birch (Fig. 2), the number of introns did not exceed 12. Twenty-six BpGSTUs, including BpGSTU2, BpGSTU3, BpGSTU7, and others, contained a single intron. Additionally, BpGSTU16, BpGSTF8 and BpGSTF11 lacked introns entirely. The U subfamily members generally shared a consistent gene structure, with most containing 1–3 introns and 2–4 exons. An exception within this subfamily was BpGTU6, which displayed a unique structure of four introns and five exons. Within the F subfamily, BpGSTF3, BpGSTF6, BpGSTF9, BpGSTF12, and BpGSTF13 were distinguished by possessing either 5' or 3' untranslated regions (UTRs), a feature absent in the other eight members. The gene structures within the L, Z, TCHQD, and T subfamilies exhibited considerable variation, highlighting the diversity and complexity of the GST gene structures in birch. Among these, members of the L subfamily were notable for having more than 8 introns and exons, indicating a significant degree of conservation. BpGSTT1, the unique member of the T subfamily, featured 6 introns and 7 exons. Additionally, BpDHAR3 of the DHAR subfamily possessed the longest CDS with 2628 bp, while BpGSTU14 was remarkable for its long introns within the entire GST family. Notably, there existed 12 introns and 13 exons in BpGSTF10, further emphasizing the intricate genetic framework present within these subfamilies.

Fig. 2
figure 2

Gene structure of the BpGST gene family. Exons and introns are indicated by yellow rectangles, and black lines, respectively. UTRs are represented in green boxes

Twenty conserved motifs in the BpGST gene family were identified using the MEME online tool (Fig. 3). The distributions of various motifs in GST proteins are shown in Fig S1. Motif 3 was present in 61 GST proteins, indicating that it can be used as a marker to recognize BpGSTs. Additionally, motif 7 was found in 21 GST proteins. Within the T subfamily, with the exception of BpGSTU16 and BpGSTU39, all members harbored the conserved motif 1, suggesting that this motif may play a significant role in the subfamily. Moreover, the T subfamily was noted for exhibiting the greatest diversity of motif types, highlighting its complex genetic architecture. Interestingly, each member of the L subfamily contained motifs 3, 7, and 15, suggesting these shared motifs might provide insights into the evolutionary divergence of the L subfamily from other gene groups. The arrangement of conserved motifs in BpGSTU5-BpGSTU15, BpGSTU17-BpGSTU30, and BpGSTU33-BpGSTU38 showed remarkable similarity, particularly with motifs 6, 3, and 1 being in comparable positions. This consistency in motif arrangement suggests that these genes may perform similar biological functions, attributable to their shared motifs.

Fig. 3
figure 3

Conservative motif analyses of the BpGST gene family. Different motifs are represented by different coloured boxes

Chromosomal localization and gene collinearity analysis of BpGSTs genes in birch

To understand the distribution on chromosomes, the chromosomal localization of BpGSTs genes was studied based on birch genomic information. The results suggest that 66 BpGSTs genes were localized on 14 chromosomes (Fig. 4). Chromosome 5 harbored the largest number of BpGSTs genes (15), followed by chromosome 6 with 10 BpGSTs genes, while chromosomes 2, 7, and 10 had only one BpGST gene each. Together, these findings indicated that the BpGSTs distribution was not relevant for chromosome size.

Fig. 4
figure 4

Chromosome distribution of BpGST family genes. The yellow cylinder represents 14 chromosomes in birch

According to tandem replication criteria (Qiao et al. 2015), it usually results in gene clusters but segmental duplication might cause family members to become separated. Notably, multiple genes formed dense gene clusters such as BpGSTF2 and BpGSTF3 as well as BpGSTU9 and BpGSTU10 on chromosome 5, and BpGSTU16, BpGSTU17 and BpGSTU18 on chromosome 6. In total, seventeen pairs of tandem duplicated genes were detected in the BpGST gene family, most of which were in the Tau, Phi, Zeta, and TCHQD subfamilies (Fig. 4). Furthermore, five pairs of segmental duplicated genes were found and connected by red curves (Fig. 5). For example, BpGSTF2 and BpGSTF5 on chromosome 5 were highly collinear with genes BpGSTF8 and BpGST13 on chromosomes 8 and 13, respectively. BpGSTU5 on chromosome 4 was collinear with BpGSTU30 on chromosome 10. BpGSTL2 on chromosome 6 and BpGSTL3 on chromosome 14 were collinear, as were BpGSTF26 on chromosome 8 and BpGSTF29 on chromosome 9. Additionally, interspecific collinear relationships between birch GST family members and model plants Arabidopsis and monocotyledon Oryza sativa were analyzed. It was found that the BpGST gene in birch was more closely related to Arabidopsis thaliana than to monocotyledon rice (Fig. 6).

Fig. 5
figure 5

Intraspecific collinearity analysis of BpGST genes in B. platyphylla. The grey lines represent all colinear blocks in the birch genome, and red lines represent duplicated BpGST gene pairs

Fig. 6
figure 6

Collinearity analysis of GST family chromosomes among birch, Arabidopsis and O. sativa (rice)

Analysis of 2 kb cis-acting elements upstream of BpGST family genes

The upstream 2 kb base sequences of 71 BpGST family members were extracted and cis-acting elements predicted using PlantCARE online software. In addition to core promoter elements such as TATA-box and CAAT-box, the majority of regulatory elements are involved in light response, as well as abscisic acid acting elements (ABRE), stress response elements (STRE), anaerobic inducible response elements (ARE), and MYB binding sites. MYB binding sites also participate in various processes such as light response, drought induction, and flavonoid biosynthesis (Fig. 7). The presence of diverse regulatory elements related to abiotic stress response suggests that BpGSTs may play a role in plant stress resistance.

Fig. 7
figure 7

Distribution of 2 kb cis-acting elements upstream of GST gene family promoter in birc

Gene expression patterns of BpGSTFs in different tissues

To clarify the role of Phi BpGST genes in birch growth and development, the expression patterns of 13 BpGSTF genes in different birch tissues were analyzed. This study revealed that BpGSTFs exhibit varied expression patterns in different tissues (Fig. 8). Most BpGSTFs genes were expressed in all 4 tissues examined, with 9 BpGSTF genes (BpGSTF1, BpGSTF2, BpGSTF3, BpGSTF5, BpGSTF6, BpGSTF10, BpGSTF11, BpGSTF12 and BpGSTF13) showing high relative expression in leaves. 9 genes (BpGSTF9, BpGSTF11, BpGSTF12 and BpGSTF13) displayed specific expression patterns in one or some tissues; for example, all genes have low expression in stems, while BpGSTF11 exhibited similar expression in leaves and petioles, and BpGSTF5 and BpGSTF11 had similar expression patterns. BpGSTF1, BpGSTF2, and BpGSTF3 shared similar high expression patterns in leaves. BpGSTF4’s expression in the petiole was about 70 times higher than in roots and leaves, suggesting a possible role in petiole development. The expression patterns of BpGSTF7 and BpGSTF8 were similar, with higher expression in roots and petioles. BpGSTF10 showed similar expression in stems and petioles, approximately four times higher in leaves. The expression of BpGSTF6 varied significantly in roots, stems, leaves, and petioles, indicating diverse functions of this gene in birch.

Fig. 8
figure 8

Analysis of BpGSTFs gene expression patterns in four tissues (root, stem, leaf and petiole). The level in the root is expressed as 1. Error bars represent the standard deviation of the mean using three technical replicates. Data were analyzed using one-way variance (ANOVA). Using the LSD method for multiple comparisons, different letters represent significant differences (P < 0.05)

Expression pattern analysis of BpGSTF genes in response to salt and osmotic stress

Considering the importance of F subfamily genes to abiotic stresses, the expression patterns of BpGSTFs under salt and osmotic stress were analyzed. The results reveal that each gene had significant differences in response and the time of response varied as well. The expression patterns of BpGSTF gene family members under NaCl stress indicated that BpGSTF1, BpGSTF5, and BpGSTF11 showed a trend of down-regulation (Fig. 9). The expression of BpGSTF2, BpGSTF3, and BpGSTF9 genes peaked on the second day after salt treatment, followed by a gradual decrease. BpGSTF6, BpGSTF10, and BpGSTF13 exhibited similar expression patterns, with peak expression on the first day. Notably, BpGSTF13 was up-regulated most significantly on the second day after salt treatment, about 15 times higher than controls, suggesting that the gene was strong inducted by NaCl. The expression of BpGSTF4, BpGSTF7, and BpGSTF8 initially increased, then dropped to their lowest value on the fourth day, which was 7.3, 1.7, and 7.8 times lower, respectively, than their highest values. On the seventh day of salt treatment, the expression of these three genes slightly increased. BpGSTF1 displayed a unique expression pattern, being significantly induced after one day of salt stress, reaching its highest expression, then decreasing the following day and increasing again on the fourth day, but with little variation from the highest value, followed by a subsequent decrease.

Fig. 9
figure 9

Analysis of BpGSTF gene family expression pattern of birch after 150 mM NaCl treatment (0, 1, 2, 4 and 7 d represent the treatment time, and the expression of each gene at day 0 was set as 1; the error line represents the standard deviation of the mean of 3 technical replicates)

To examine the expression of BpGSTFs under osmotic stress, stem segments with terminal buds of tissue culture seedlings were cut and cultured in rooting medium containing 200 mM mannitol WPM for 0, 1, 2, 4, and 7 d. Total RNAs were extracted for quantitative analysis (Fig. 10). The results show that, after mannitol treatment, the expression of BpGSTF1, BpGSTF5, BpGSTF6, BpGSTF7, BpGSTF8, and BpGSTF10 initially decreased and then increased. The expression patterns of BpGSTF5, BpGSTF6, BpGSTF7, and BpGSTF8 were similar, with relative expression levels two days after treatment being significantly low, only 2.3%, 7.1%, 5.0%, and 3.2% of the control (day zero), respectively, and then gradually decreased. The expression patterns of other family members BpGSTF2, BpGSTF3, BpGSTF4, BpGSTF9, BpGSTF11, and BpGSTF13 initially increased and then decreased. BpGSTF4, BpGSTF9, and BpGSTF11 were significantly induced and reached their highest value on the first day, increasing by 1.2, 7.5, and 1.6 times, respectively. They showed a down-regulation trend on the second day, up-regulated again on the fourth day, and then nearly dropped to zero on the seventh day. The expression of BpGSTF13 peaked 2 days after the treatment, about 20 times that of the control, and then decreased.

Fig. 10
figure 10

Analysis of expression patterns of BpGSTF gene family after treatment with 200 mM mannitol (0, 1, 2, 4 and 7 d represent the time of osmotic stress, and the expression of each gene at day zero is 1; the error line represents the standard deviation of the mean of three technical replicates)

Expression pattern analysis of BpGSTF genes in response to ABA

The RT-qPCR results demonstrate that the expression of BpGSTFs were induced by ABA. Post-ABA treatment, the expression patterns were primarily categorized into three types (Fig. 11). Firstly, the expression of BpGSTF1, BpGSTF10, BpGSTF6, BpGSTF7, and BpGSTF9 were significantly up-regulated in the initial 2 days of ABA treatment, decreased on the 4th day, and then showed a temporary up-regulation in the last time period. Secondly, the expression levels of BpGSTF5, BpGSTF8, and BpGSTF13 peaked on the 1st day post-treatment, registering 5.6, 7.1, and 4.2 times their levels on day 0, respectively, followed by a brief increase and then a decrease on the 4th day. It is hypothesized that these genes may share similar functions in response to ABA treatment. Thirdly, BpGSTF3 and BpGSTF12 exhibited an initial up-regulation followed by a decline; in contrast, BpGSTF11 consistently trended downward. The expression patterns of BpGSTF2 and BpGSTF4 were distinct. For BpGSTF2, it increased after 1 day of stress, decreased after 2 days, then rose again between 2 and 4 days, before gradually declining. BpGSTF4 showed a continuous downward trend in the initial 2 days of ABA treatment, was significantly induced on the fourth day, and then declined.

Fig. 11
figure 11

Analysis of BpGSTF gene family expression patterns of birch after 100 μM ABA treatment (0, 1, 2, 4 and 7 d are treatment times, and the expression of each gene on day zero is 1; the error line represents the standard deviation of the mean of 3 technical replicates)

Discussion

The GST gene family is extensively found across various plant species, playing crucial roles in numerous growth and development processes and exerting regulatory functions (Shehu et al. 2018). To date, significant numbers of GST gene family members have been identified in a variety of plants. For example, there are 55, 49, 90, 23, 42, 79, 61, 54, and 97 GST members in Arabidopsis thaliana (Sappl et al. 2009), Gossypium arboreu (Dong et al. 2016), Solanum lycopersicum (Islam et al. 2017), Malus domestica (Jiang et al. 2019), Zea mays L. (McGonigle et al. 2000), O. sativa L. (Jain et al. 2010), Citrus sinensis (Licciardello et al. 2014), Prunus persica (Zhao et al. 2017) and Actinidia Lindl (Liu et al. 2019a, b). However, there have been no related reports on the identification and functional study of GST gene family members in B. platyphylla. Therefore, we identified and characterized the GST genes in birch. This research broadens our insight into the diversity of the GST gene family and also contributes significantly to clarifying their role in growth and stress tolerance in birch.

Based on BLAST and HMM search, 71 members of the GST gene family were identified in the B. platyphylla genome (Table S1). Physicochemical analysis revealed that 25 were classified as unstable proteins. Apart from BpGSTU23, BpGSTU25, BpGSTF11, and BpGSTZ2, which exhibited hydrophobic properties, the remaining were hydrophilic. This distinction could influence their functionality within the cell. Phylogenetic analysis divided these BpGST genes into U, F, DHAR, L, T, Z, and TCHQD subfamilies. Notably, compared to Arabidopsis, Triticeae (Hao et al. 2021) and Ficus carica L. (Liu et al. 2023), the EF1Bγ subfamily was absent in B. platyphylla, a difference possibly resulting from gene loss over the course of evolution. The loss of the EF1Bγ subfamily may be related to gene functional substitution and adaptation to environmental pressures during evolution (Jia et al. 2023), which should be further studied. In addition, the L and U families clustered together, while Z, T, and F families, and DHAR and TCHQD families formed separate branches.

Exon–intron structure is crucial in gene evolution (Xu et al. 2012). In birch, BpGST genes within the same subfamily exhibited similar exon–intron structures, particularly in the T and L subfamilies. A previous study highlighted that, compact genetic structures with fewer introns, enable timely stress responses (Jeffares et al. 2008), suggesting that T and DHAR members with fewer introns might rapidly respond to stress. In addition, most BpGST members of the same subfamily had conserved motifs. These results reveal that BpGST members in same subfamily are evolutionarily conserved across distinct phylogenetic groups, indicating they might perform similar biological functions.

Collinearity analysis involving birch, Arabidopsis, and rice revealed 22 relationships between the GST family and Arabidopsis, and three with rice, suggesting higher homology of the GST gene family with the Arabidopsis GST gene family. Chromosomal analysis indicated that the widespread distribution of BpGST in birch contributes to the diversity and complexity of this gene family, potentially key to their roles in catalysis and detoxification (Abdul Kayum et al. 2018).

Cis elements are known to control or regulate gene expression, thereby influencing the plant’s response to stress and developmental changes (Narusaka et al 2003). In this study’s analysis of upstream cis-acting elements, 43 different cis-acting elements were detected across the 71 members of the GST gene family. These elements can be categorized into groups associated with light response, hormone response, development, abiotic stress response, and other functions. This diversity of cis elements in the BpGST gene promoters suggests that they might work synergistically to endow BpGSTs with the potential to respond to various stimuli. Significantly, multiple GST gene family members contained MYB binding sites, indicating MYB is likely one of the major factors involved in the regulation of BpGSTs transcriptional expression.

Analysis of gene expression patterns can provide crucial insights into their physiological functions. The expression analysis of the BpGSTFs gene family in birch revealed that 13 exhibited specific expression in different tissues under normal growth conditions, suggesting a significant regulatory role in growth and development. The expression patterns of 13 genes in the Phi BpGSTs gene family in response to high salt, drought, and ABA treatment showed that they were influenced by stress treatment and ABA. However, the expression patterns varied, indicating that each member might have distinct functions in response to stress. In other species, GST gene family members have also shown significant responses to stress stimuli (Simarani et al. 2016). For instance, overexpression of GsGST from soybeans enhanced drought and salt tolerance in transgenic tobacco (Ji et al. 2010). Expression of GmGSTL1 from transgenic soybeans could mitigate salt stress symptoms (Chan and Lam 2014). Sharma et al. (2014) reported that OsGSTU4 in rice is induced by ABA and participates in ABA-dependent processes, providing enhanced tolerance in transgenic plants. In Brassica oleracea, Vijayakumar et al. (2016) identified 65 glutathione transferases (BoGST), with most being highly expressed after 1 and 6 h in cold-sensitive and cold-tolerant lines, respectively, identifying three BoGST (BoGSTU10/19/24) genes as candidates for stress resistance. Wang et al. (2019) reported that 14 of 330 TaGST genes in wheat could respond to different abiotic stresses and hormones, especially salt stress and abscisic acid. In rice, 79 GST genes were identified, with many regulated under various abiotic stresses (20), arsenate stress (32), and biotic stress (48) (Jain et al. 2010).

Conclusion

In summary, the BpGST gene family in birch were wide investigated at the genome level. 71 BpGSTs were identified, and were categorized into seven subfamilies which, in the same branch of the evolutionary tree, had similar exon/intron structures and motif constitutions. These BpGSTs were unevenly located on 14 chromosomes. In addition, eighteen pairs of tandem duplication and five pairs of segmental duplicated genes and were found, suggesting that gene duplication is crucial in the evolution and expansion of the birch GST gene family. Tissue-specific expression of BpGSTFs indicated that these genes may have divergent functions in growth and development. The analysis of cis-acting elements and gene expression profiles under multiple stresses and ABA treatment showed that BpGSTFs play a critical regulatory role in resistance to abiotic stress, and the time of stress response was different as well. These results provide a basis for studying functional characterization of BpGSTs genes in birch and other woody species.