Introduction

Traditional Chinese medicine (TCM) formulae are being increasingly used in China and other Asian countries for their broad effectiveness and low toxicity. Several TCM formulae exhibited novel clinical effects, thus attracting the interest of many scientists. For example, Qingfei Paidu decoction and Lianhua Qingwen capsule have been included in the 6th–8th edition of the official guidelines as the primarily recommended formulae for treatment of COVID-19 in China due to their obvious benefit to COVID-19 patients [1]. As is well-known, TCMs are featured as having “multi-component,” “multi-target,” and “synergistic therapeutic values,” thus indicating their working mechanisms are closely and directly attributed to complex ingredients. Therefore, a comprehensively qualitative and quantitative analysis of the chemical substances of TCM formulae constitutes the substantial basis for the further elucidation of its bioactive components and pharmacological mechanism.

Sijunzi decoction (SJZD), as a classic recipe in TCM formulae used to strengthen the spleen and replenish Qi, consists of Ginseng Radix et Rhizoma, Atractylodes Macrocephalae Rhizoma, Poria, and Glycyrrhizae Radix et Rhizoma combined in a 3:3:3:2 mass ratio. Pharmacological studies have revealed that SJZD displays extensive bioactivities such as gastrointestinal-regulating [2,3,4], antioxidant [5, 6], immunoregulatory [7, 8], and anti-tumor activities [9, 10]. Meanwhile, both polysaccharides and non-polysaccharides (NPSs), which mainly include flavonoids, saponins, and terpenoids, are considered two major constituents responsible for its pharmacological or therapeutic effects. Until now, the majority of these studies have focused on the isolation and pharmacological activity of its polysaccharides. For instance, in our previous report, the protective effects of three polysaccharide fractions (S-1, S-2, and S-3) obtained from SJZD were investigated on reserpine-induced Wistar rat models. The results revealed that fraction S-3 could ameliorate the symptom of spleen deficiency [11]. Then, fraction S-3 was further purified to acquire a homogeneous polysaccharide S-3-1 with a molecular mass of 13.5 × 104 Da, which exerted an immunomodulatory role by regulating gut microbes [7, 8]. However, the NPSs in SJZD have not yet been comprehensively interpreted. To the best of our knowledge, there are only a few phytochemical reports on SJZD, regarding flavonoids, saponins, and terpenoids, which used conventional chromatographic techniques, such as liquid chromatography (LC) and liquid chromatography-mass spectrometry (LC-MS) [12,13,14,15]. Restricted by the limited resolving power of one-dimensional (1D) separation, the qualitative characterization of NPSs in SJZD is not adequate as it omits some components, especially the minor and the isomer ones. Besides, identifying minor constituents promotes the research of pharmacodynamic substances of SJZD and provides valuable information on lead compounds for drug development [16].

Nowadays, ultra-performance liquid chromatography coupled with quadrupole time-of-flight tandem mass spectrometry (UHPLC-QTOF-MS/MS) has been increasingly employed for rapid analysis of chemical profile or metabolites in TCMs [17]. Yet, one-dimensional liquid chromatography (1DLC) is unsatisfactory in the separation power of complex samples especially in TCMs. Compared with 1DLC, two-dimensional liquid chromatography (2DLC) largely improved its peak capacity, thus obtaining higher resolution and separation capability, which have been considered as a better solution for the comprehensive characterization of TCMs [16, 18, 19]. Furthermore, an enormous and specific natural product library used for MS1 matching could contribute to the rapid and precise assignment of the profiled compounds to a large extent [19]. Since most of the ingredients in TCMs are derived from secondary metabolites in plants, the same types of metabolites possess basic biosynthetic pathways, and their structures are typically correlated, which could be characterized with MS information by the same or similar fragmentation mode [20,21,22]. Accordingly, metabolite prediction based on structural correlations of the same types of compounds, known as MS-network, could enhance efficiency in the characterization of chemical components in TCM formulas [21]. Besides, the ultra-high-performance liquid chromatography coupled with triple-quadrupole tandem mass spectrometry (UHPLC-TQ-MS) provides a high sensitivity due to the multiple reaction monitoring (MRM) detection mode, which has also been demonstrated to be a powerful tool for the analysis of natural products [23].

In this study, an in-house chemical library was initially built on the basis of reported phytochemicals of the SJZD and its four herbs, which mainly contained saponins, flavonoids, and terpenoids. The structural correlations of these compounds were then analyzed, which led to the construction of three MS-networks of flavonoids, terpenoids, and saponins based on their basic skeletons. Meanwhile, an efficient offline reversed-phase × reversed-phase two-dimensional liquid chromatography quadrupole time-of-flight mass spectrometry (RPLC × RPLC-QTOF-MS) was developed to characterize the NPS components in SJZD. Furthermore, the major representative constituents were chosen for quantification with a rapid, sensitive, and validated method by UHPLC-QTRAP®-MS/MS. The aim of this study was to further uncover the bioactive material basis and lay a foundation for subsequent pharmacokinetics and action mechanism study.

Materials and methods

Materials, reagents, and chemicals

Liquiritin apioside, liquiritin, neoliquiritin, isoliquiritin apioside, isoliquiritin, ononin, naringin, liquiritigenin, poricoic acid A, poricoic acid B, poricoic acid C, glycyrrhetinic acid, dehydropachymic acid, pachymic acid, ginsenoside Re, ginsenoside Rg1, ginsenoside Rf, isoliquiritigenin, ginsenoside Rb1, ginsenoside Rg2, ginsenoside Rg3, ginsenoside Rg5, 20(S)-ginsenoside Rh1, notoginsenoside R1, ginsenoside Ro, ginsenoside Rc, formononetin, ginsenoside Rb2, glycyrrhizic acid, ginsenoside Rd, ginsenoside Rk1, and atractylenolide III were purchased from Chengdu DeSiTe Biological Technology Co., Ltd. (Chengdu, China) or Sichuan Weikeqi Biological Technology Co., Ltd. (Chengdu, China). HPLC-grade acetonitrile was acquired from J&K Scientific (China). LC-MS-grade acetonitrile was acquired from Merck (Darmstadt, Germany). Deionized water was prepared from distilled water using a Milli-Q water purification system (Millipore, Bedford, MA, USA). All other reagents and chemicals were of analytical grade.

Sample and reference standard preparation for qualitative and quantitative analyses

SJZD sample was prepared according to our previous reports [8, 11]. In brief, four individual dried herbs were mixed and boiled twice in distilled water. The solution was centrifuged, and then concentrated to 1 g/mL (calculated as crude drug), resulting in the SJZD sample. Then, 8 mL of ethanol was added to 2 mL of SJZD sample with subsequent stirring, after which it was kept standing for 24 h to remove polysaccharides. The supernatant was centrifuged and then dried under vacuum at 60 °C. The residue was dissolved in 10 mL methanol-water (50:50, v/v) and centrifuged at 14,000 rpm for 10 min to obtain sample A. In total, 50 μL of sample A was transferred into a 50-mL volumetric flask with methanol-water (50:50, v/v) to obtain sample B1. Similarly, 1 mL sample B1 was diluted ten times to acquire sample B2 in a 10-mL volumetric flask.

Each standard was precisely weighed following standard procedures and dissolved in methanol-water (50:50–80:80, v/v). Standard solutions of liquiritin apioside, liquiritin, isoliquiritin apioside, isoliquiritin, ononin, liquiritigenin, ginsenoside Re, ginsenoside Rg1, ginsenoside Rf, isoliquiritigenin, ginsenoside Rb1, ginsenoside Rg2, ginsenoside Rc, formononetin, ginsenoside Rb2, glycyrrhizic acid, ginsenoside Rd, ginsenoside Rg3, and ginsenoside Rk1 were prepared individually in the concentration ranging from 0.184 to 2.130 mg/mL and stored at 4 °C until use. A mixed working solution was prepared by diverting specific amounts of the 19 reference standard solutions into a 5-mL volumetric flask. A set of standard solutions were prepared by appropriate dilution of the mixed stock solution with 50% methanol to obtain the calibration curve. All solutions were filtered through a 0.22-μm membrane before analysis.

Offline 2DLC-QTOF-MS/MS system

The offline 2D separation was performed by using an Agilent 1260/1290 system, consisting of two binary pumps (1D: G7161B; 2D: G4220A), two diode array detectors (1D: G4212A; 2D: G4212B), a degasser (G1322A), a thermostatic column compartment (G1316C), and an autosample injector (G4226A) (Agilent Technologies, Inc., Santa Clara, CA, USA). The first-dimensional (1D) separation was carried out on a 1260 system, equipped with an Agilent Eclipse XDB-C18 (9.4 mm × 250 mm, 5 μm). The mobile phase consisted of water (A) and methanol (B). The flow rate was set at 3 mL/min. The gradient profile was as follows: 0–7 min, 10–18% B; 7–26 min, 18–45% B; 26–40 min, 45–95% B; 40–45 min, 95–100% B; 45–50 min, 100–10% B. The column temperature was 25 °C; the injection volume was 100 μL (sample A), and the detection wavelength was set at 210 and 254 nm. The fraction was collected based on peak-profile and eluent from 0 to 4.89 min was labeled as Fr.1; from 4.90 to 14.06 was labeled as Fr.2; from 14.07 to 16.97 was labeled as Fr.3; from 16.98 to 26.00 was labeled as Fr.4; and from 26.01 to 46.00 was labeled as Fr.5. The replicate injection was repeated 10 times as in Fig. S1 (see Supplementary Information (ESM)), and the five fractions were concentrated in a vacuum and then evaporated to dryness by stead nitrogen flow at ambient temperature. Each residue was dissolved in 1 mL 50% methanol and then centrifuged at 14,000 rpm for 10 min to obtain supernatant for 2D analysis, respectively. A ZORBAX Eclipse Plus C18 (2.1 mm × 100 mm, 1.8 μm) was used as 2D column and eluted by a binary mobile phase consisting of water containing 0.1% (A) and acetonitrile (B) with a gradient as follows: 0–3 min, 5–9% B; 3–20 min, 9–23% B; 20–34 min, 23–32% B; 34–47 min, 32–37% B; 47–54 min, 37–48% B; 54–59 min,48–59% B; 59–60 min, 59–77% B; 60–62 min, 77–95% B; 62–64 min, 95–5% B; flow rate, 0.4 mL/min; column temperature, 40 °C; injection volume, 2 μL.

An Agilent 6545 QTOF spectrometer equipped with a Dual AJS ESI source was operated in both negative and positive ion modes. The instrument was tuned and calibrated as recommended by the manufacturer. The optimized parameters were as follows: gas temperature, 320 °C; gas flow, 8 L/min; nebulizer gas pressure, 35 psi; sheath gas temperature, 350 °C; sheath gas flow, 11 L/min; acquisition rates, 2 spectra/s for MS and 8 spectra/s for MS/MS; capillary potential, 3500 V; fragmentor potential, 165 V; collision energies, 5.00, 15.00, 30.00, and 55.00 eV. The mass ranges of MS and MS/MS were from m/z 100 to 1500 and m/z 100 to 1500, severally.

Data processing

Data acquisition and processing were performed by the MassHunter software (B.08.00) in “preferred auto MS/MS” mode. The initial chemical library of four crude drugs of SJZD was built based on multiple available databases (e.g., Web of Science, SciFinder, CNKI, and TCMSP). The correlations in structures of each type of certain known compounds were summarized by the different substituent groups derived in several skeletons and were employed to predict potential metabolites by the partial transformation of skeleton structures such as hydrogenation (H2), hydroxylation (O), methylation (CH2), acetylation (C2H2O), and glycosylation (C6H10O5, C6H8O6, C5H8O4, and C6H10O4), which led to the construction of MS-network used for qualitative analysis. The MS-network analysis used for SJZD characterization is illustrated in Figs. 1 and 2, which produced more theoretical molecular formulas therewith [21]. The known and newly generated formulas were then incorporated into MassHunter Qualitative Analysis software (B.08.00). Preliminary annotation of the MS data was performed by searching the incorporated in-house chemical library in a mode called “Find by Formula.” Multiple experimental molecular weights were matched with the library data at once by an iterate searching algorithm, and the error range of retention time and mass was set at 0.4 min and 10 ppm. The structures of these matched molecular formulas were further validated using their tandem MS information. Some databases such as HMDB (https://hmdb.ca/) [24] and MassBank (http://www.massbank.jp/Index) [25] were also involved in the data processing.

Fig. 1
figure 1

A schematic diagram exhibiting the integrated strategy used for the comprehensive profiling and quantification of non-polysaccharide composition from SJZD

Fig. 2
figure 2

MS-networks of flavonoids (a), terpenoids (b), and saponins (c) for SJZD. (a) The formulas in A1 were produced by adding O or CH2 to the two basic formulas (C15H10O2 and C15H12O2). A2 = A1+Ac (C2H2O); A1+Glc (C6H10O5); A1+GluA (C6H8O6); A1+Rha (C6H10O4); A1+C5H8O4 (Apio, Xyl, and Ara). Through the same strategy as for A2, A3 to An were built. (b) The MS-network of terpenoids was built based on B1 (several basic skeletons with their molecular formulas) with correlations (H2, CH2, O, and C2H2O). (c) The MS-network of saponins was built based on C1 (C30H52O3, C30H52O4, C30H48O3, C30H52O5, and C30H46O4) with different substituent groups (C6H10O5, C6H8O6, C6H10O4, C5H8O4, and C2H2O)

UHPLC-QTRAP®-MS/MS analysis conditions

An ACQUITY UPLC system (Waters, Milford, MA, USA) coupled to a hybrid triple-quadrupole linear ion trap mass spectrometer (5500 QTRAP®) with a turbo ion spray source from AB SCIEX (Foster City, CA, USA) was employed in this study. Equipment control and data analysis were performed using Analyst software ver. 1.5.1 (AB Sciex). UHPLC was equipped with a binary solvent delivery system and an autosample injector. Chromatographic separation was carried out at 45 °C, using a ZORBAX Eclipse Plus C18 (2.1 mm × 100 mm, 1.8 μm) with mobile phases A (0.1% formic acid in water) and B (acetonitrile). The flow rate was set at 0.4 mL/min; the gradient was operated as follows: 0–18 min, 10–64% B; the injection volume was 3 μL. Two cycles of weak-polarity (100% acetonitrile) and high-polarity (10% acetonitrile) solvent washing of the injecting system were carried out between injections.

The main parameters of QTRAP®-MS/MS performed in negative ionization mode were set as follows: ion spray voltage, −4500 V; probe temperature, 650 °C; curtain gas, 45 psi; nebulizer gas (GS1), 40 psi; auxiliary gas (GS2), 45 psi; collision-activated dissociation, medium. Nitrogen was employed as a nebulizer and collision gas. The MRM was employed in the acquisition mode, which selected ions at the first (Q1) and third quadrupole (Q3). For the appropriate fragmentation patterns of m/z (Q1) and m/z (Q3) for analytes in MRM mode, each single standard solution was further optimized and listed in Table S1 with selected quantifier and qualifier ions (see ESM). Due to high content in SJZD and high response in MS of liquiritin and liquiritin apioside, the four compounds liquiritin, isoliguiritin, liquiritin apioside, and isoliquiritin apioside were analyzed in sample B2, while the remaining were analyzed in sample B1 to reduce the carryover effect in the continuous injections.

UPLC-MS/MS method validation

According to the European Medicines Agency (EMEA) guidelines [26], the developed quantitative method was evaluated in terms of linearity, precision, repeatability, stability, and accuracy. A series of mixed standard solutions were prepared through precise stepwise dilution. Several standard solutions of the 19 analytes were analyzed, and then the calibration curve for each analyte was established using linear regression stimulation of the peak area (y) versus concentration (x) to determine the correlation coefficient (R2). The precision of the method was evaluated by analyzing the 19 standard compounds. The same mixed standard solution was tested for six replicates within 1 day to get the intra-day precisions, and examined in duplicates for consecutive 3 days to obtain inter-day precisions. For the repeatability test, six SJZD samples were extracted and processed to obtain 6 sample B1 and 6 sample B2 solutions, respectively, in line with the methods mentioned above. The obtained sample B1 and sample B2 solutions were analyzed in parallel, and RSD calculated the variations to evaluate the repeatability of this method. The sample B1 and B2 solutions were stored in a sample manager at 8 °C and analyzed at 0, 6, 12, and 24 h, respectively, to investigate the stability. The spike recovery rate is employed to further evaluate the accuracy of this method. The SJZD sample was spiked with mixed standard with low, medium, and high concentrations (75%/100%/125%), followed by extraction (ethanol precipitation), processing, and quantification in accordance with the methods mentioned above in three parallel operations. The results from the determination of recovery were expressed as the percentage of the analytes recovered by the assay.

Results

The construction and optimization of offline 2DLC-QTOF-MS/MS

The constituents of reported SJZD mainly included flavonoids, saponins, and terpenoids, which could be well separated by RPLC. Compared to 1DLC, 2DLC has been proven to have greater advantages in separating complex components in TCM. Among them, RPLC × RPLC is widely used in the separation of saponins, flavonoids, and phenolic compounds, which suggests good resolution capabilities [27,28,29,30,31]. Hence, an offline RP × RP 2DLC-MS system was developed to analyze the NPSs of SJZD with optimized LC and MS conditions.

The first-dimensional separation

In an offline 2DLC system, 1DLC performs the preliminary separation of natural products. Based on the distribution of chromatographic peaks, the elution compounds could be collected in different fractions, resulting in better separation in 2DLC and enrichment of some minor or trace constituents. In order to realize the quick enrichment of fractions for 2DLC detection, Eclipse XDB-C18 with high loading capacity at once was selected. Since different elution solvents of 1D and 2D could improve the resolution and the use of acetonitrile instead of methanol could lead to the remarkable retention time shift, methanol/water was used as the 1D mobile phase, and acetonitrile/acidified water (0.1% formic acid) was used as the 2D mobile phase therewith [28, 29]. Peak-dependent fraction collection was adopted to reduce the complexity of the 2DLC-MS detection, thus resulting in the five fractions as shown in Fig. S1 (see ESM).

Optimization of the second-dimensional RPLC and MS conditions

The selection of the column for 2D separation was based on the column efficiency and resolution of major components. Four columns, namely ACQUITY BEH C18 (2.1 × 100 mm, 1.7 μm), ZORBAX Eclipse Plus C18 (2.1 mm × 100 mm, 1.8 μm), Agilent Poroshell 120 EC-C18 (3.0 mm × 150 mm, 2.7 μm), and ZORBAX SB-C8 (3.0 mm × 100 mm, 1.8 μm), were evaluated (see ESM Fig. S2). As a result, ZORBAX Eclipse Plus C18 showed better performance than others. UHPLC conditions were also optimized, including mobile phase, flow rate, and column temperature, to obtain a good separation. Finally, acetonitrile/acidified water (0.1% formic acid), which showed good peak shapes, more amounts of peaks, and better intensity, was selected as the mobile phase. The flow rate and temperature were 0.4 mL/min and 40 °C, respectively. For the MS conditions, both the positive and negative modes were analyzed. Still, the intensity of NPSs in negative mode is higher than that in positive mode. Besides, more compounds can be detected in negative mode according to the base peak intensity chromatograms (BPCs), as shown in Fig. S3 (see ESM). Therefore, the negative detection was chosen for identification. Other MS parameters have also been analyzed and optimized, such as capillary voltage, atomizing gas pressure, and drying gas flow rate. The results indicated that this study’s newly developed qualitative method was appropriate for detecting its chemical constituents.

Characterization of the NPSs in SJZD

In the current study, sufficient abundance ions and structural information were obtained through negative mode. In total, four hundred forty-nine compounds were identified or tentatively characterized, including 100 flavonoids, 38 terpenoids, 276 saponins, and 35 other compounds. Corresponding chromatographic and mass spectral data of these compounds are shown in Tables S25 (see ESM2). Thirty-two compounds were further confirmed by comparison with reference standards, and 6 potentially novel ones were deduced from known structures by MS-network interpretation. The molecular formulas of all compounds matched their quasi-molecular ions such as [M-H], [M+FA-H] within the mass error of 10 ppm. Due to the high-resolution capability of 2DLC-MS, more compounds, including isomers, could be characterized. Also, when a matched formula could be assigned to several isomers, the structure reported from the four individual herbs of SJZD before had a higher possibility than the others. Eventually, the structures of isomers were deduced by MS-network, fragment ions, and some online databases, such as ChemSpider (www.chemspider.com) [32], HMDB (https://hmdb.ca/) [24], and MassBank (http://www.massbank.jp) [25].

Flavonoids

The basic framework of flavonoids is “C6-C3-C6.” The different substituent groups are often composed of hydroxyl, methoxy, acetyl, and glycosyl. Therefore, the MS-network of flavonoids was built based on two basic flavonoids—flavone (C15H10O2) and flavanone (C15H12O2)—which is shown in Fig. 2a. Herein, more formulas could be matched in MS software. Once one formula was matched, it would be traced in MS-network and initial chemical library firstly, and further validated by its MS/MS information, which led to the identification of 100 flavonoids, including 2 potential novel ones. These structures could be classed as flavonoid aglycones and flavonoid glycosides.

Flavonoid aglycones

Retro Diels-Alder (RDA) fragmentation mechanism was common for flavonoids in MS. The neutral losses of 18.0106 Da (H2O), 15.0235 Da (CH3), 14.0157 Da (CH2), 27.9949 Da (CO), and 43.9898 Da (CO2) were also the characteristic fragmentation behaviors for flavone aglycones. For example, peak A78 showed a parent [M-H] ion at m/z 299.0564, indicating a formula of C16H12O6. Diagnostic ions at m/z 284.0306 [M-H-CH3], m/z 165.0188 [C8H5O4], and m/z 151.0029 [C7H3O4] were suggestive of a flavonoid skeleton with a hydroxyl and one methoxy in ring A, as shown in Figs. 3 and 4a. B-ring bearing two hydroxyls was then deduced via the molecular formula and the reference [33], suggesting the identification of 7-O-methylluteolin for A78. The daughter ions at m/z 271.0606, m/z 256.0369, and m/z 240.0423 were indicative of losses of CO, C2H3O, and CH3O2, respectively, which validated the speculation. A50 acquired a formula of C15H12O4 as the mother ion at m/z 255.0662 [M-H], which yielded two characteristic daughter ions of flavanones at m/z 135.0085 [C7H3O3] and m/z 119.0497 [C8H8O]. Consequently, it was identified as liquiritigenin by comparing elution time and MS fragment ions with reference standard. A75 revealed diagnostic ions at m/z 267.0665 [M-H] and m/z 252.0435 [M-H-CH3], which was finally confirmed by formononetin standard. A81 were presumed as licoisoflavanone in view of the [M-H] ion at m/z 353.1017 together with its daughter ion at m/z 227.0719 [M-H-C6H6O3] and m/z 125.0240 [C6H6O3-H], which conformed to the fragment ion reported in existing reference [34]. The loss of C4H8 was observed in A85 by the mother ion at m/z 353.1028 [M-H] and its daughter ion at m/z 297.0423 [M-H-C4H8], resulting in the characterization of glycyrrhisoflavone [34]. A87 was assigned a formula of C21H20O5 according to the quasi-molecular ion at m/z 351.1238 [M-H]. Diagnostic ions at m/z 336.0998 and m/z 281.0452 were deduced to be the losses of CH3 and C5H10, respectively, which implied the presence of the methoxy and isopentene group. It was tentatively identified as gancaonin A, which had been reported in reference [35]. A88 displayed a mother ion at m/z 337.1443 [M-H] and typical fragment ions at m/z 305.1170 [M-H-CH3OH], m/z 243.1024 [M-H-C6H6O], m/z 201.0918 [M-H-C8H8O2], m/z 187.0762 [M-H-C8H8O2-CH2], and m/z 120.0215 [M-H-C14H17O2], which were consistent with the product ions of licochalcone C [36].

Fig. 3
figure 3

Potential chemical structures of partial NPSs in SJZD. The flavonoids (A), terpenoids (B), saponins (C), and other components (D) were classified. The superscript * means that the compound was identified by a standard substance

Fig. 4
figure 4

Potential fragment pathways of A78 (a), A22 (b), A6 (c), B4 (d), B37 (e), and D30 (f)

Flavonoid glycosides

These compounds can be classified as flavonoid O-glycoside and flavonoid C-glycoside. The elimination of saccharide groups regularly occurs in flavonoid O-glycoside, and the characteristic neutral losses of sugars [Glucose (Glc, 162.05 Da), Apiose (Apio, 132.04 Da), rhamnose (Rha, 146.06 Da), xylose/arabinose (Xyl/Ara, 132.04 Da), glucuronic acid (GluA, 176.03 Da)] could be observed in MS/MS spectrum. The process of qualitative analysis for some peaks was described so as to provide examples. Peak A21 showed a parent ion at m/z 417.1189 [M-H] and a daughter ion at m/z 255.0657 [M-H-C6H10O5], indicating the presence of a glucosyl. Featured ions at m/z 135.0090 and m/z 119.0503 were produced due to the RDA fragmentation. It was further identified as liquiritin by comparing it with the standard. A62 generated [M-H] ion 42 Da greater than that of A21, suggesting that it might be acetyl derivative of liquiritin. The daughter ions at m/z 297.0754 [M-C6H10O5] and m/z 255.0654 [M-C6H10O5-C2H2O] supported the assumption, implying that A62 might be 7-O-acetyl liquiritin. Similarly, the fragment ion at m/z 255.0665 of A29 was also observed, suggesting it as naringin which was further verified by the reference compound. The quasi-molecular ion at m/z 445.1142 [M-H] and its fragment ion at m/z 283.0604 [M-H-C6H10O5] of A22 were suggestive of a glucosyl calycosin. The RDA fragment ions at m/z 163.0396 [C9H7O3], m/z 135.0077 [C7H3O3], and m/z 133.0291 [C8H5O2] led to the identification of calycosin-7-glucoside for A22 (Fig. 4b). A1 with adduct [M+FA-H] ion at m/z 623.1636 and [M-H] ion at m/z 577.1541, yielded diagnostic ions at m/z 415.1030 [M-H-C6H10O5], 253.0505 [M-H-2C6H10O5], suggesting successive losses of glucosyl. Compared with the fragment ions reported in reference, it was tentatively characterized as daidzein 7,4′-diglucoside [37]. Parent ion at m/z 593.1509 [M-H] of A6 was indicative of the C27H30O15 formula. The neutral losses of C4H8O4 (120 Da) and C3H6O3 (90 Da) were observed by the diagnostic ions at [(m/z 593.1504, m/z 473.1078, m/z 353.0665); (m/z 593.1504, m/z 503.1186); (m/z 473.1078, m/z 383.0762)], suggesting that A6 is a C-diglycoside flavonoid [38]. It was presumed as vicenin-2 due to the characteristic ions at m/z 353.0665 [M-H-C8H16O8] and m/z 297.0762 [M-H-C10H16O10], as shown in Fig. 4c. Peak A25 provided fragment ions at m/z 635.1621 [M-H], m/z 591.1724 [M-H-CO2], m/z 549.1606 [M-H-CO2-acetyl], m/z 417.1185 [M-H-acetyl-GluA], and m/z 255.0666 [M-H-GluA-Glc-acetyl]. The neutral losses of saccharide groups also occurred in peak A52 for the daughter ions at m/z 433.1464 [M-H-Apio] and m/z 271.0976 [M-H-Apio-Glc]. The aglycone might be vestitol due to the fragment ions at m/z 271.0976, m/z 255.0650, and m/z 135.0444 [35]. The structures of A25 and A52, which are depicted in Fig. S4A–B (see ESM), were inferred according to the structural correlations of flavonoids from the MS-network and may serve as novel structures not yet found in the SciFinder database.

Terpenoids

The main types of terpenoids reported in SJZD include triterpene carboxylic acids and sesquiterpenes, which mainly originate from Atractylodes Macrocephalae Rhizoma and Poria. The substituent groups often contain hydroxyl, methyl, methoxyl, and acetoxyl. Accordingly, the MS-network of terpenoids derived with H2, CH2, O, and C2H2O was established from the several basic skeleton structures such as poricoic acid C (C31H46O4), dehydrotumulosic acid (C31H48O4), dehydrotrametenoic acid (C30H46O3), and atractylenolide II (C15H20O2) [15, 39]. In addition, the terpenoids tend to eliminate small neutral fragments like H2O, the methyl (CH2, CH3·, and CH4), CO2, C2H2O, and C2H4O2 under collision-induced decomposition conditions, which could also contribute to the quick assignment of these components. A total of 38 terpenoids were unambiguously or tentatively identified. Peak B4 was unequivocally identified as atractylenolide III via comparison with the reference standard. Its mother ion at m/z 247.1338 [M-H] and daughter ions at m/z 203.1448 [M-H-CO2], m/z 187.1098 [M-H-C2H4O2] were indicative of the possible fragmentation pathways, as depicted in Fig. 4d. Peak B37 revealed parent ion at m/z 525.3578 [M-H] with a molecular formula C33H50O5. The diagnostic ions at m/z 465.3346 [M-H-CH3COOH], m/z 421.3512 [M-H-CH3COOH-CO2], and m/z 389.2883 [M-H-CH3COOH-CO2-2CH4] were observed, which indicated the existence of an acetoxyl and a carboxyl group. B38 exhibited similar fragmentation pathways as previously reported triterpenoid [39], and was finally identified as dehydropachymic acid by the corresponding standard substance.

Saponins

Saponins occupy a large proportion in SJZD, which mainly came from Ginseng Radix and Glycyrrhizae Radix. These characterized saponins, based on the difference in sapogenin, could be classified into several subclasses. For example, the ginsenosides mainly contained the following: protopanaxadiol (PPD) type, protopanaxatriol (PPT) type, oleanolic acid (OA) type, and octillol (OT) type [19, 40], of which the primary fragmentation pattern was the successive cleavage at the glycosidic linkages. The typical product ions could be assigned to the sapogenins (m/z 475.38/391.29 for PPT, 459.38/375.29 for PPD, 455.35 for OA, and 491.37/415.32 for OT) and glycosyl linkages (m/z 351.06/193.04 for GluA-GluA; m/z 323.09 for Glc-Glc; m/z 497.1159/321.0806 for GluA-GluA-Rha, and so on), which contribute to the quick identification of unknown compounds. Therefore, the MS-network (Fig. 2c) for saponins was built based on several basic sapogenins-PPD (C30H52O3); PPT (C30H52O4); OA (C30H48O3); OT (C30H52O5); glycyrrhetinic acid (C30H46O4). Peaks of C6, C8, C14, C20, C23, and C79 were suggestive of the same formula C48H82O19 based on the quasi-molecular ions at 961.54 [M-H] or 1007.54 [M+FA-H], which all yielded the characteristic ions corresponding to PPT type at m/z 475.38. The successive loss of Glc was deduced by the daughter ions at m/z 799.48 [M-H-C6H10O5], m/z 637.4277 [M-H-2C6H10O5], and m/z 475.38 [M-H-3C6H10O5], which indicated that these compounds might be notoginsenoside M and its isomers [20]. Peak C19 was also assigned as PPT-type ginsenoside as the typical ion at m/z 475.38. The consecutive elimination of sugar residues including Xyl, Glc, and Glc was observed by the parent ion at m/z 931.53 [M-H] and its daughter ions at m/z 799.48 [M-H-C5H8O4], m/z 637.43 [M-H-C5H8O4-C6H10O5], and m/z 475.38 [M-H-C5H8O4-2C6H10O5], suggesting C19 is notoginsenoside R1 as confirmed by the reference standard. The quasi-molecular ion at m/z 1031.5436 [M-H] and three diagnostic ions at m/z 987.55 [M-H-44.01], m/z 945.54 [M-H-86.00], and m/z 475.38 [C30H51O4] indicated that C44 is a PPT ginsenoside with a malonyl group. Also, the sequent losses of Rha, Glc, and Glc after loss of malonyl group (C3H2O3) were inferred based on the fragment ions at m/z 945.54 [M-H-C3H2O3], m/z 799.48 [M-H-C3H2O3-C6H10O4], m/z 637.43 [M-H-C3H2O3-C6H10O4-C6H10O5], and m/z 475.38 [M-H-C3H2O3-C6H10O4-2C6H10O5], which led to chemical structure determination for C44 depicted in Fig. 5D. C162, C173, C178, and C195 were described as PPD-type ginsenosides on the account of the featured ion at m/z 459.38 [C30H51O3]. Their quasi-molecular ion at m/z 1149.61 [M-H] and the fragment ion at m/z 1107.60 [M-H-C2H2O] were suggestive of an acetyl group. The successive elimination of Glc was deduced by fragment ions at m/z 945.52 [M-H-C2H2O-C6H10O5], m/z 783.49 [M-H-C2H2O-2C6H10O5], m/z 621.44 [M-H-C2H2O-3C6H10O5], and m/z 459.38 [M-H-CH2CO-4C6H10O5]. It was deduced that peak C170 was an OA-type ginsenosides according to the characteristic ion at m/z 455.35. The quasi-molecular ion at m/z 793.44 [M-H] and its fragment ions at m/z 631.38 [M-H-C6H10O5] and m/z 455.35 [M-H-C6H10O5-C6H8O6] were indicative of a Glc and a GluA connecting to OA in C170. Similarly, C2 was tentatively characterized as OT-6-Glc(2,1)Rha-Glc by the [M-H] ions at m/z 961.54 and fragment ions at m/z 799.48 [M-H-C6H10O5], m/z 653.42 [M-H-C6H10O5-C6H10O4], and m/z 491.37 [M-H-C6H10O5-C6H10O4-C6H10O5] with corresponding losses of Glc, Glc-Rha, and Glc-Rha-Glc respectively [20]. Sapogenins of licorice saponins are mainly pentacyclic triterpenes. For example, glycyrrhizic acid, identified through comparison with the standard, gave base-peak ion at m/z 821.40, which was assigned a formula of C42H62O16. As depicted in Fig. 5E, the possible fragmentation pathways of glycyrrhizic acid were deduced via the typical ions at m/z 759.3916 [M-H-CH2O3], m/z 645.3618 [M-H-C6H8O6], m/z 469.3315 [M-H-2C6H8O6], and m/z 351.0568 [C12H15O12]. The potential structures of C73, C94, C154, and C210 were also deduced based on the characteristic ions and main fragmentation pathways as other saponins in MS-network, which might also serve as novel agents (see ESM Fig. S4C–F).

Fig. 5
figure 5

Tandem mass spectra of A78 (A), A6 (B), B4 (C), C44 (D), C172 (E), and D30 (F)

Other components

Despite the efficiency and convenience of MS-network in rapid qualitative analysis, this approach is not always well-suited for identifying all types of components in TCM formulae. Thus, 35 other components, including coumarins, phenols, phenyl derivatives, and similar, were characterized via their MS1 and featured fragment ions as well as corresponding losses. Taking D30 as a case, it gave a precursor ion at m/z 367.12 [M-H], which was assigned a formula of C21H20O6. The diagnostic ions at m/z 352.10 [M-H-CH3], m/z 339.12 [M-H-CO], m/z 337.07 [M-H-C2H6], m/z 324.10 [M-H-CO-CH3], and m/z 309.04 [M-H-C4H10] were ascribed to different fragments, as depicted in Fig. 4f and 5F, which were consistent with those of previously reported glycycoumarin [41]. Comparing the exact molecular weights and MS or MS/MS spectra with those in references and online databases led to the identification of other compounds, as shown in Table S5 (see ESM2).

Quantitative analysis of chemical constituents in SJZD by UHPLC-QTRAP®-MS/MS

To further understand the contents of potential bioactive constituents, a UHPLC-QTRAP®-MS/MS was developed for method validation and quantitative analysis of 19 compounds.

Optimization of UHPLC-MS/MS conditions

To obtain good chromatographic behaviors and appropriate ionization efficiency, different mobile phase systems (methanol/water, acetonitrile-water, methanol-acid aqueous solution, and acetonitrile-acid aqueous solution) were compared, revealing that acetonitrile 0.1% aqueous formic acid was the best choice, which showed a better response.

The MRM mode was used for quantification. In order to select the appropriate fragmentation mode of m/z (Q1) and m/z (Q3), these reference standards were directly tested in the mass spectrum, and the scanning mass spectra of the product ions were recorded. In addition, the MRM conditions were also optimized for the analytes as shown in Table S1 (see ESM).

Method validation

Linearity equations were obtained by plotting corresponding peak areas (y) versus concentrations (x). Acceptable linear correlations of these 19 analytes at these conditions were confirmed by the correlation coefficients (R2, 0.9971–0.9999). As shown in Table 1, the intra-day and inter-day precision of this method calculated as relative standard deviation (RSD) were within the range of 0.49–3.01%. The repeatability range was 1.06–3.51% and the RSDs of stability test of the samples stored at 8 °C were ranging from 1.11 to 3.50% within 24-h analysis. The mean average recovery rates for the 19 analytes ranged from 93.05 to 104.34%, with RSD values of 0.69 to 5.56%. Collectively, the optimized phase and MS methods exhibited good precision, reproducibility, stability, and recovery.

Table 1 Method validation using 19 targeted analytes

Contents of the selected compounds in SJZD

Since the main ingredients of NPSs in SJZD are saponins and flavonoids, the newly developed UHPLC-QTOF-MS/MS was employed in the SJZD, and the components were sorted according to the peak response intensity and peak area in the mass spectrum therewith. After combination with the in vivo and in vitro activity reports of each single herb mainly invoving anti-inflammatory, neuroprotective, and immunomodulatory effects, which related to the traditonal efficiency of SJZD [42,43,44,45,46,47,48,49,50], 19 components were screened out for quantification. The established analytical approach in UHPLC-QTRAP®-MS/MS within 20 min was applied for the simultaneous quantification of these compounds. Representative chromatograms for reference standards and SJZD samples are shown in Fig. 6 and Fig. S5 (see ESM). As shown in Table 2, the quantitative results showed that liquiritin apioside and glycyrrhizic acid had the highest content of all the 19 ingredients, and each content of liquiritin, ginsenoside Re, ginsenoside Rb1, ginsenoside Rb2, and ginsenoside Rd was greater than 0.5 mg/g. Considering their high content and multiple activity reports, these components might have the potential to be developed as quality markers for SJZD. In addition, the content of these selected constituents in licorice and ginseng in SJZD was not lower than 4.551 mg/g and 3.761 mg/g, respectively.

Fig. 6
figure 6

Representative MRM chromatograms of 19 analytes from standard solutions on UHPLC-QTRAP®-MS/MS system (1, liquiritin apioside; 2, liquiritin; 3, isoliquiritin apioside; 4, isoliquiritin; 5, ononin; 6, liquiritigenin; 7, ginsenoside Re; 8, ginsenoside Rg1; 9, ginsenoside Rf; 10, isoliquiritigenin; 11, ginsenoside Rb1; 12, ginsenoside Rg2; 13, ginsenoside Rc; 14, formononetin; 15, ginsenoside Rb2; 16, glycyrrhizic acid; 17, ginsenoside Rd; 18, ginsenoside Rg3; 19, ginsenoside Rk1)

Table 2 The contents (mg/g) of the 19 constituents in SJZD

Discussion

UHPLC-QTOF-MS/MS has been widely used in the qualitative analysis of TCM, but it still has many limitations. The first limitation is related to the peak capacity of UHPLC separation, which is proportional to the average resolution [51, 52]. The second one is that many trace components could not be identified due to the differences in the ingredients’ content and ionization efficiency in the prescription. As Giddings proposed, peak capacity has become the most important separation ability indicator in multi-dimensional separation [51, 52]. Compared to 1DLC-MS, 2DLC-MS can increase the elution system’s peak capacity and the ionization efficiency of trace components greatly, thus is considered a powerful tool for the separation and identification of complex components of TCM [18]. Due to the poor separation of hydrophilic compounds, such as flavonoids and saponins on the HILIC column, RPLC × RPLC was chosen for the separation of NPSs in SJZD. Considering that the 2D column, which is directly connected to the MS, has an important role in the final separation and analysis, in the present study, we compared four columns used for 2D separation based on column efficiency and peak response intensity. As calculated with Neue’s method [53], the theoretical peak capacity of Waters BEH C18, ZORBAX Eclipse Plus C18, Agilent Poroshell 120 EC-C18, and ZORBAX SB-C8 was 107, 126, 133, and 113, respectively. Both Waters BEH C18 and ZORBAX Eclipse Plus C18 were found to exhibit better column efficiency, with the complete separation of liquiritin and liquiritin apioside (see ESM Fig. S2). ZORBAX Eclipse Plus C18 showed higher peak response intensity than the other three, and was finally selected as the 2D column. The qualitative results indicated that the developed offline RPLC × RPLC-MS/MS exhibited excellent performance at separation and ionization of the secondary metabolites in SJZD.

Along with the larger flux of quasi-molecular ions brought about by 2DLC-MS, optimal characterization has become another critical issue. It is necessary to develop an effective and rapid method for comprehensive structural analysis of TCM components to assign as many as detected ions. MS-network analysis pattern utilizing the structural correlation characteristics of secondary metabolites in herbs provides an effective solution for the rapid and accurate identification of components and even its metabolites in vivo [21]. In this work, the MS-networks of SJZD were established based on the molecular formulas of the basic skeletons and structural correlations in flavonoids, terpenoids, and saponins. Additionally, we also considered the issue of isomers in the process of MS-network establishment. For example, in MS-network of flavonoids, the molecular formulas in A1 (see Fig. 2a), derived by two core skeletons, actually contain different isomers. The molecular structure of the basic skeleton was determined by a quasi-molecular ion screening through MS-network firstly and then the probable structure or its isomers were finally determined by the analysis of corresponding losses in the MS/MS in detail, which made the qualitative analysis easier and faster.

In this study, 449 ingredients were identified and tentatively characterized, including 6 potential new compounds. Compared with previous reports, this study reported the largest number of ingredients described in SJZD. The possible sources of these compounds were assigned based on phytochemical reports of four herbs and analysis of the structures shown in Tables S25 (see ESM2) and Fig. 7a, b. The components of different fractions were further analyzed. As depicted in Fig. 7c, most components were collected in Fr.4 and Fr.5, with more distribution of saponins. Additionally, the terpenoids were mainly collected in Fr.5, probably due to the low polarity. A summary of the elution order and quasi-molecular ions of these four classes of compounds characterized in SJZD was also illustrated. As shown in Fig. 7d, the four types of components were clearly distinguished in the distribution of the m/z dimension, among which the average m/z of other types of components in SJZD was the lowest. The m/z values of the majority of flavonoids and terpenoids were below 600, while those of saponin ingredients were above 600. In addition, the retention behavior of the flavonoids and the triterpenoids was mainly located in both ends of the elution order, while the saponins were distributed throughout the whole elution process, which might provide essential information for studies on other TCMs.

Fig. 7
figure 7

A summary of possible sources (a), classification (b), components of Fr.1–5 (c), and the retention time and quasi-molecular ions (d) of these compounds identified from SJZD. Abbreviation notes: G represented Ginseng Radix et Rhizoma; AM represented Atractylodes Macrocephalae Rhizoma; P represented Poria; Gly represented Glycyrrhizae Radix et Rhizoma

In this quantitative study, 19 representative compounds were finally selected based on the content of ingredients in the original decoction as well as considering related activity reports. Considering that some compounds responded better in MS after adding formic acid into the mobile phase, and the formic acid peaks of partial compounds were also selected as quantitative ions, the acetonitrile-acid aqueous solution mobile phase was ultimately chosen as the mobile phase. Compared with previous quantification research on SJZD, more ingredients were quantified [14, 54]. To the best of our knowledge, this is the first application of TQ-MS/MS for the determination of the content of NPSs in SJZD.

Conclusion

In the present study, we established and optimized a MS-network-based offline 2DLC-QTOF-MS/MS for rapid identification of secondary metabolites that was successfully applied for comprehensive characterization of the non-polysaccharides in SJZD. As the major secondary metabolites of SJZD are saponins, flavonoids, and terpenoids, the RPLC × RPLC was developed for the separation and characterization with the optimal LC and MS conditions. In addition, the MS-networks of flavonoids, terpenoids, and saponins in SJZD were established based on basic skeletons of different types of compounds and their structural correlations, which remarkably expanded the in-house chemical library and accelerated qualitative analysis. As a result, a total of 449 components were definitively or tentatively identified, of which 6 were deduced as potentially novel ones. These components were further analyzed and summarized in terms of structure types, distribution in m/z dimension and different fractions, and elution order, thus providing essential information for more advanced analysis of flavonoids, terpenoids, and saponins in SJZD and other TCMs. Meanwhile, based on peak response intensity and activity reports of in vivo and in vitro, 19 representative ingredients with characterized structures were selected and then quantified by a developed method in UHPLC-QTRAP®-MS/MS. Our results showed that liquiritin apioside, glycyrrhizic acid, liquiritin, ginsenoside Re, ginsenoside Rb1, ginsenoside Rb2, and ginsenoside Rd, each with content greater than 0.5 mg/g, may have the potential to be developed as quality markers of SJZD. These results suggested that the integrated strategy we developed is an effective approach for the comprehensive profiling of TCM recipes as well as the discovery of potential new agents.