Introduction

The Coronavirus Disease 2019 (COVID-19) is caused by Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), which broke out in December 2019 and spread rapidly around the world [1]. SARS-CoV-2 belongs to the Coronaviridae family in the order Nidovirales and the subfamily of Orthocoronavirinae, including 4 structural proteins: spike (S) protein, membrane (M) protein, nucleocapsid (N) protein and envelope (E) protein [2]. The virus may enter epithelial cells through SARS-CoV-2 S protein binding to Angiotensin Converting Enzyme 2 (ACE2) receptor. The combination of virus and receptor leads to the fusion of the viral envelope and cell membrane with entry of the viral nucleocapsid into the cell, and the viral RNA hijack the host cell ribosome to translate the virus code, resulting in the production of viral proteins and viral replication [3]. The common clinical symptoms of COVID-19 are cough, fever, fatigue or myalgia, and possible complications include septicemia, respiratory distress syndrome, or multiple organ failure [4]. According to COVID-19 Dashboard by the Center for Systems Science and Engineering (CSSE) of Johns Hopkins University (JHU), as of January 4, 2022, a total of 292,567,808 people worldwide was infected with the SARS-CoV-2, of whom 5,449,575 died. The COVID-19 epidemic has seriously affected people's health, economic development, and social progress all over the world [5]. Although some COVID-19 vaccines have been approved, they are not available in all countries infected with SARS-CoV-2 [6]. With the persevering efforts of researchers, several options have been envisaged to control the emerging infections of COVID-19, such as monoclonal antibodies, peptides, oligonucleotide-based therapies, interferon therapies and small molecule drugs. However, these new interventions may take months to years to develop [7]. Therefore, the fast and effective drugs to combat the COVID-19 epidemic are urgently needed.

Traditional Chinese Medicine (TCM) has been in clinical practice for more than two thousand years and has a unique superiority in the prevention and treatment of infectious diseases. TCM has shown certain effects in the treatment of Severe Acute Respiratory Syndrome (SARS) and Middle East Respiratory Syndrome (MERS) [8]. And now, TCM also plays an important role in COVID-19. In the trial 8th edition of the “Guideline on Diagnosis and Treatment of COVID-19” issued by the National Health Commission of China on August 18, 2020, some TCM prescriptions were recorded as recommended prescription for COVID-19. Xuanbai Chengqi Decoction (XBCQD) adapted from the “Identification of Warm Disease” written by Wu Ju-tong in the Qing Dynasty, which is a classic clinical prescription for the treatment of lung diseases [9, 10]. The formula consists of 4 Chinese medicines including Radix et Rhizoma Rhei (Dahuang), Pericarpium Trichosanthis (Gualoupi), Semen Armeniacae Amarum (Kuxingren) and Gypsum Fibrosum (Shigao). Radix et Rhizoma Rhei has the effect of discharging damp-heat of large intestine, which corresponds to the diarrhea syndrome in some patients [11]. Pericarpium Trichosanthis can reduce phlegm and dissipate blood stagnation [12]. Semen Armeniacae Amarum is effective in treating of cough and asthma accompanied by stuffiness in the chest and profuse expectoration. Gypsum Fibrosum is mainly used for febrile diseases due to exogenous affection with high fever, dire thirst, asthma, and cough caused by heat in the lung [13]. Modern pharmacological researches have demonstrated that XBCQD can significantly alleviate the pulmonary inflammatory responses in patients with acute respiratory distress syndrome, improve static/dynamic lung compliance and reduce the incidence of complications [9]. It was found that XBCQD has a protective effect on cecal ligation and puncture-induced septic lung injury, and could reduce the level of TNF-α, IL-1β and IL-10 in bronchoalveolar lavage fluids [14]. Furthermore, XBCQD has shown significant efficacy in the treatment of COVID-19 and has been widely used in clinical practice [15, 16]. However, due to the complex components of TCM prescription, it is difficult to elucidate the chemical composition and potential mechanism of XBCQD against COVID-19 through traditional pharmacological methods.

Network pharmacology is a new biological analysis technology in recent years, which is considered to have the potential to discover drugs by combining with systems biology, bioinformatics, and polypharmacology [17]. This technology is consistent with the theory of TCM and can reveal the therapeutic mechanism of drugs by establishing a complex network of drugs, targets, pathways, and diseases. Therefore, many researchers have used network pharmacology to explore the material basis of Chinese medicine and predict the mechanism of TCM prescription [8]. Molecular docking is a virtual screening technology used to explore the behavior of molecule at the binding site of a target protein by simulating the geometric structure and interaction between molecule and protein [18]. Molecular dynamic (MD) simulation is an efficient computer simulation method for investigating biological phenomena and molecular mechanisms of interactions between biological macromolecules and ligands [19].

The purpose of this research is to systematically explore the chemical composition and therapeutic mechanism of XBCQD on COVID-19. After extensive data mining, we have obtained effective compounds and related targets. Then, integrate network pharmacology, molecular docking, and MD methods to analyze the interactions between the compounds of XBCQD, the targets and COVID-19. The flowchart of the research is shown in Fig. 1.

Fig. 1
figure 1

The workflow of exploring the therapeutic mechanism of XBCQD on COVID-19

Materials and methods

Collection of XBCQD meridian tropisms, compounds, and targets

The TCM meridian tropism information of the 4 Chinese medicines in XBCQD was obtained from the 2020 edition of the Chinese Pharmacopoeia. Except for Gypsum Fibrosum, all the compounds in XBCQD were derived from the Traditional Chinese Medicine Systems Pharmacology Database and Analysis Platform (TCMSP) (http://tcmspw.com/tcmsp.php) and Bioinformatics Analysis Tool for Molecular Mechanism of Traditional Chinese Medicine (BATMAN-TCM) (http://bionet.ncpsb.org.cn/batman-tcm/), and were screened by the metrics of oral bioavailability (OB) ≥ 30% and drug-likeness (DL) ≥ 0.18 [20,21,22,23]. The compounds of Gypsum Fibrosum were collected from the Traditional Chinese Medicines Integrated Database (TCMID) (http://www.megabionet.org/tcmid/) [24]. In order to make the results more credible, we supplemented the bioactive compounds that did not meet the screening criteria through literature mining. Subsequently, TCMSP, the Encyclopedia of Traditional Chinese Medicine (ETCM) (http://www.tcmip.cn/ETCM/) and SwissTargetPrediction (http://swisstargetprediction.ch/) were employed to identify potential targets of the compounds, except calcium sulfate dihydrate, which used PharmMapper (http://lilab-ecust.cn/pharmmapper/index.html) and TargetNet (http://targetnet.scbdd.com/home/index/) databases [25,26,27,28]. The screening criteria for protein targets of XBCQD compounds were set as follows: In ETCM database, the targets were selected with the prediction confidence index more than 0.8. In PharmMapper database, we set Zscore greater than 0 to filter targets. In SwissTargetPrediction and TargetNet databases, the targets with probability more than 0 were selected as potential targets of the compounds [29].

Screening of COVID-19 related targets and putative targets of XBCQD

COVID-19 related targets were acquired from GeneCards (https://www.genecards.org/) and NCBI (https://www.ncbi.nlm.nih.gov/), with “COVID-19”, “novel coronavirus” and “severe acute respiratory syndrome coronavirus 2” as keywords [30]. To ensure the comprehensiveness and accuracy of target screening, we set search parameters in the database as follows: In the GeneCards database, the targets with relevance scores > 0 were selected. In the NCBI database, we selected species as “Homo sapiens” to screen the targets. All the above targets were normalized into official gene symbols via UniProt database (http://www.uniprot.org/) [31]. Subsequently, the VennDiagram program package in the R 3.6.3 software was used to map disease targets and compound targets, and define their intersection targets as putative targets of XBCQD in the treatment of COVID-19.

Protein–protein interaction (PPI) network and molecular complex detection (MCODE) analysis

The PPI network was constructed through STRING database (https://string-db.org/) to explore interaction between the putative targets of XBCQD in COVID-19 treatment, with the species limited to “Homo sapiens” [32]. We selected a confidence score threshold greater than 0.9 in STRING platform to construct the PPI network to meet the accuracy and identify crucial targets. The visualization and topology analysis of PPI network were achieved by Cytoscape 3.7.2. Then, MCODE analysis was carried out to extract densely connected modules in the PPI network. The MCODE algorithm is based on vertex weighting by local neighborhood density and outward traversal from a locally dense seed protein to separate the dense regions by certain connectivity parameters, which can fine-tune the clusters of interest and monitoring the connectivity of the cluster. To further explore the functional connections of the targets in the clusters, biological process analysis was predicted by R 3.6.3 software.

Construction of the Chinese medicines—effective compounds—hub targets (C–E–H) network of XBCQD and enrichment analysis

In the PPI network, the degree value is an important indicator to evaluate the role of the target in the network. Based on the above PPI data, the crucial target that was greater than or equal to the median degree was defined as hub target, which was inputted to the Cytoscape 3.7.2 to construct the C–E–H network of XBCQD. The compounds linking the hub targets were considered to be effective compounds in XBCQD. Next, the topology features of the C–E–H network were analyzed by NetworkAnalyzer plugin of Cytoscape 3.7.2. Meanwhile, we defined the effective compounds with degree greater than the median degree as the core compounds for further analysis. In order to explore the mechanism of XBCQD in the treatment of COVID-19, R 3.6.3 software was used to conduct Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Afterwards, the bar plot and chord plot were drawn with the ggplot2 and GOplot packages in R 3.6.3 software.

Drug-likeness analysis and ADMET evaluation

The structures of the core compounds were screened to predict whether these compounds obey Lipinski’s rule of five. At the same time, the rules of Pfizer and Golden triangle were also used to verify drug-like properties [33]. The physicochemical and pharmacokinetic properties of core compounds were evaluated using ADMETlab 2.0 (https://admetmesh.scbdd.com/), which systematically assesses the absorption, distribution, metabolism, excretion, and toxicity (ADMET) of compounds based on a comprehensively collected database [34, 35].

Molecular docking analysis

Molecular docking is a key technology to verify the interaction between small molecule ligands and protein receptors. The 3D structures of the selected core compounds of XBCQD were searched through PubChem database (https://pubchem.ncbi.nlm.nih.gov/) [36]. The sdf files were then converted to mol2 files using Sybyl-X 2.1.1 software and minimized under Tripos force field with Gasteiger-Huckel charge. The crystal structures of hub targets were downloaded from RCSB Protein Data Bank (https://www.rcsb.org/) and were imported into Surflex-Dock module of Sybyl-X 2.1.1 [37]. The proteins were prepared by removing ligands and crystallographic water molecules, adding hydrogens, and filling in missing side chains. Co-crystallized ligands or inhibitors were used to define the active binding pocket and generated the protomols. Residues with a radius of 5.0 Å around the ligand were chosen as the active sites [38]. The other docking parameters were kept at default. Total score of Surflex-Dock was calculated to estimate the ligand-receptor interactions, with binding affinity expressed in -log10 (Kd) units [39]. In general, Total Score > 4.5 suggests a certain binding activity, Total Score > 5.0 suggests a good binding activity, and Total Score > 7.0 suggests a strong binding activity [40].

Molecular dynamic (MD) simulation

The MD simulation was carried out to verify the structural and conformational stability of the ligand-receptor complexes in molecular docking. The protein macromolecule was subjected to MD using a GROMOS96 54a7 force field in GROMACS 2019.6, and the initial topology of the selected core compound was generated using ATB online server (http://atb.uq.edu.au/) [41, 42]. The system was solvated using simple point charges (SPC) water model in a dodecahedron box, and Na + and Cl ions were added to maintain the overall neutrality of the system. The complex system energy was minimized using 50,000 steps steepest descent algorithm. After that, the system was equilibrated for 100 ps using NVT canonical ensemble followed by NPT isothermal–isobaric ensemble, respectively. The particle-mesh Ewald (PME) method was used for long-range electrostatic interactions [43]. Finally, we performed 100 ns of MD simulation on representative complexes. Molecular dynamics trajectories were analyzed using gmx_rms, gmx_rmsf, gmx_gyrate and gmx_hbond tools in the GROMACS suite, with analysis parameters including root mean square deviation (RMSD), root mean square fluctuation (RMSF), radius of gyration (Rg) and hydrogen bonds interactions (H-bonds). MM-PBSA calculations of the complexes were carried out using g_mmpbsa tool, which calculated the binding free energies of the complexes and the individual energy contributions of the residues [44]. Xmgrace (Grace 5.1.21) was used to plot the graphs [45].

Results

Meridian tropism information of Chinese medicines in XBCQD

According to the medicinal properties and meridian tropisms theory of TCM, the 4 Chinese medicines of XBCQD were classified. The Chinese medicines—meridian tropisms network based on meridian tropism information was built by Cytoscape 3.7.2 (Fig. 2). The two nodes with the highest degree of connection in the network were the lung and stomach, followed by the large intestine, spleen, liver, heart, kidney, gallbladder, small intestine and urinary bladder. Three Chinese medicines in XBCQD were associated with the Lung meridian, namely Semen Armeniacae Amarum, Pericarpium Trichosanthis and Gypsum Fibrosum. Among the flavor nodes, the two with the greatest degree of connection were sweet and bitter. The results reflected the treatment concept of holistic therapy of TCM. Meanwhile, it also suggested XBCQD may have specifically role in the treatment of lung and stomach diseases.

Fig. 2
figure 2

The Chinese medicines—meridian tropisms network of XBCQD. The network shows the links of the Chinese medicines in XBCQD and their corresponding information about flavors(sour, bitter, sweet, pungent and salty) and their meridian tropisms(Liver, Heart, Spleen, Lung, Kidney, Gallbladder, Small Intestine, Stomach, Large Intestine, and Urinary Bladder). The triangle, diamond and circular represent the Chinese medicines, flavors and meridians, respectively

Collection of compounds and targets in XBCQD

A total of 61 compounds of XBCQD were selected from TCMSP, BATMAN-TCM, TCMID databases and literature mining after screening and deduplication. Among them, 27 compounds were obtained from Radix et Rhizoma Rhei, and 15 of them were collected from literature mining, involving emodin, chrysophanol, physcion, sennoside A, sennoside B, sennoside C, sennoside D and sennidin A, etc. [46,47,48,49]. There were 18 compounds in Semen Armeniacae Amarum and 1 compound was collected by literature [50]. Pericarpium Trichosanthis contained 17 compounds, 6 of which came from literature mining, including 4-hydroxy-2-methoxybenzoic acid, luteolin-7-O-glucoside, palmitoleic acid, glycerol monopalmitate, oxofangchirine and adenosine [51, 52]. Only 1 compound was included in Gypsum Fibrosum, namely calcium sulfate dihydrate. Next, a total of 638 potential targets corresponding to the above compounds were acquired from TCMSP, ETCM, SwissTargetPrediction, PharmMapper and TargetNet databases after screening and removing the duplicates, among which the numbers of targets of Radix et Rhizoma Rhei, Pericarpium Trichosanthis, Semen Armeniacae Amarum and Gypsum Fibrosum were 214, 339, 265 and 209, respectively. The information of compounds in XBCQD was shown in Table 1.

Table 1 The information of the compounds in XBCQD

Collection of COVID-19 related targets and identification of putative targets

Searched for targets related to COVID-19 from GeneCards and NCBI genes. After removing repetitive targets, a total of 845 targets were collected as disease targets. Further, the 638 targets corresponding to the compounds of XBCQD were mapped to 845 COVID-19 related targets. Finally, 79 putative targets for XBCQD to treat COVID-19 were identified (Fig. 3a, Table 2). As shown in Fig. 3b and Table S1, the Upset plot reflected the distribution of 79 putative targets in 4 Chinese medicines. The number of putative targets in Radix et Rhizoma Rhei, Pericarpium Trichosanthis, Gypsum Fibrosum and Semen Armeniacae Amarum was 42, 41, 30 and 24, respectively. Furthermore, COVID-19 and 4 Chinese medicines of XBCQD had some common targets. For instance, there were 7 common targets in Radix et Rhizoma Rhei, Pericarpium Trichosanthis, Gypsum Fibrosum, Semen Armeniacae Amarum and COVID-19, including NOS2, AR, JAK1, PTGS1, SIGMAR1, DNMT1 and PTGS2. The distribution of common targets indicated the synergistic effect of 4 Chinese medicines of XBCQD in the treatment of COVID-19.

Fig. 3
figure 3

Intersection analysis between COVID-19 disease targets and compounds targets of XBCQD; a Venn diagram for intersection analysis of compound targets and disease targets; b Upset plot of targets among 4 Chinese medicines of XBCQD and COVID-19. The horizontal bar graph at the bottom left shows the total number of targets for each part. Circles and vertical lines on the x-axis show the correlation between the parts being compared. The vertical bar graph at the top shows the number of targets in the comparisons

Table 2 79 putative targets of XBCQD for the treatment of COVID-19

Construction of protein–protein interaction (PPI) network and analysis of clusters

The PPI analysis was performed on 79 putative targets through STRING database. After deleting the disconnected nodes and setting the minimum confidence score of the network to 0.9, we exported the result as a tsv file and imported it into the Cytoscape 3.7.2 software for target relationship mining and topology analysis. As shown in Fig. 4a, there were 67 crucial nodes and 195 edges, with a median degree value of 5. The size of the nodes are proportional to the degree. The larger the node, the greater the degree value. Finally, 34 hub targets were identified in the PPI network, including RELA, MAPK3, TP53, NFκB1, PIK3CA, IL6, JAK2, TNF, IL2, JAK1, IL1B, CASP8, CSF2, LCK, PRKACA, BCL2, CREB1, SIRT1, PIK3CB, PRKCE, PRKCB, PPARG, CASP3, GSK3B, AR, PRKAR2B, F2, PTGS2, MCL1, BCL2A1, NOS2, CCL2, IL1A and PRKCA. These hub targets played a pivotal role in the treatment of COVID-19.

Fig. 4
figure 4

Correlation analysis of crucial targets of intersection targets; a PPI network of crucial targets for XBCQD against COVID-19; b MCODE analysis of PPI network; c Biological process analysis of 4 submodules

To find the functional connections of the clusters in the PPI network, the MCODE module of Cytoscape 3.7.2 software was employed to analyze the 67 crucial targets of the PPI network. 4 submodules were identified, covering 26 out of the 67 targets (Fig. 4b). Module 1 (score: 5.273) was composed of 12 nodes and 29 edges, and the seed gene was PRKCB. Module 2 (score: 4.000) was composed of 8 nodes and 14 edges, and the seed gene was PIK3CA. Module 3 (score: 3.000) consisted of 3 nodes and 3 edges, and the seed gene was VCP. Module 4 (score: 3.000) consisted of 3 nodes and 3 edges, and the seed gene was TLR7. Then GO enrichment analysis was performed to investigate the biological process of 4 submodules (Fig. 4c). The results showed that these 4 modules had anti-inflammatory, anti-viral, anti-tumor, and immune regulation effects.

Construction of the Chinese medicines—effective compounds—hub targets (C–E–H) network

In this section, we used Cytoscape 3.7.2 software to construct a C–E–H network to obtain the effective compounds, and employed the NetworkAnalyzer plugin to analyze its topology properties. In Fig. 5, there were 87 nodes, including 4 Chinese medicines, 49 effective compounds and 34 hub targets. The size of the nodes were proportional to the degree values. The larger the node, the higher the degree value. The complex network showed that one compound in XBCQD could interact with multiple targets, and one target could be regulated by multiple components. For example, epicatechin in Radix et Rhizoma Rhei targeted on 10 hub targets, including PTGS2, TNF, IL6, IL1A, PPKACA, CASP3, JAK1, CCL2, CREB1 and IL2. Emodin could also regulate 9 hub targets, such as PTGS2, TNF, TP53, PPKACA, PPARG, IL1B, CSF2, PPKCE and CASP3. There also have other compounds and related targets, which reflect the positive characteristics of multi-component and multi-target intervention of XBCQD in the treatment of COVID-19. Since calcium sulfate dihydrate (CaSO4·2H2O) is an inorganic salt, we focused on other effective compounds for analysis. Finally, the C–E–H network revealed that 18 effective compounds had higher degree values than the median degree value of 3, indicating that these core compounds had a significant therapeutic effect on COVID-19, such as epicatechin, emodin, aloe-emodin, adenosine, glabridin, licochalcone B, glycyrol, eupatin, diosmetin, luteolin-7-O-glucoside, (-)-catechin, hydroxygenkwanin, stigmasterol, spinasterol, estrone, gondoic acid, 11,14-eicosadienoic acid and beta-sitosterol [53, 54].

Fig. 5
figure 5

The C–E–H network of XBCQD against COVID-19. The red, green, orange, and purple hexagons represented Chinese medicines as Semen Armeniacae Amarum, Pericarpium Trichosanthis, Radix et Rhizoma Rhei and Gypsum Fibrosum. The red, green, orange, and purple rhombuses represented effective compounds of the above Chinese medicines. Yellow rhombuses represented the common compounds of Semen Armeniacae Amarum and Pericarpium Trichosanthis. Cyan circles represented hub targets. The size of the node represented degree value. The line between two nodes represented the interaction

GO function and KEGG pathway enrichment analysis

In order to more intuitively explain the therapeutic mechanism of XBCQD in the treatment of COVID-19, we used R 3.6.3 software to perform GO annotation analysis and KEGG pathway analysis on 34 hub targets. After filtering with p value ≤ 0.05, a total of 1639 GO terms were included, of which 1538 for biological processes (BP), 83 for molecular functions (MF), and 18 for cellular components (CC). The enrichment results were saved and sorted according to the p value of each entry into the selected top biological processes and pathways. As shown in Fig. 6a, the enrichment results of CC and MF revealed that the target genes were mainly expressed in the membrane rafts, membrane microdomains and other cell compartments, and were related to cytokine receptor binding, ubiquitin protein ligase binding and growth factor receptor binding. In terms of BP, apoptosis obviously played a confrontational role in the treatment of COVID-19, in which extrinsic apoptotic, regulation of apoptotic and negative regulation of apoptotic process were significantly enriched. In addition, signal transduction and cellular response were also associated with COVID-19.

Fig. 6
figure 6

GO and KEGG enrichment analysis. a Bar plot showed the top 10 GO enrichment terms of BP (orange), CC (blue) and MF (green), respectively. The pie plot showed the proportion of each part of BP, CC and MF to the whole; b The chord plot showed the top 9 KEGG pathway terms and corresponding targets. The right half of the chord plot referred to the enrichment pathway, and the left half referred to the genes targeted by the pathway. log2Deg was equal to log2Degree, which reflected the size of the degree value

KEGG pathway enrichment analysis provided further information about gene functions and their interactions. The results revealed that 180 KEGG pathway terms were obtained through the enrichment of 34 hub targets, with p value ≤ 0.05. As shown in Fig. 6b, the hub targets mainly participated in pathways associated with viral infection and inflammatory injury, including AGE-RAGE signaling pathway, Hepatitis B, TNF signaling pathway, IL-17 signaling pathway and so on. Among these signaling pathways, there were 14 targets (RELA, MAPK3, NFκB1, PIK3CA, IL6, TNF, IL1B, CASP8, CSF2, CREB1, PIK3CB, CASP3, PTGS2 and CCL2) participating in the TNF signaling pathway, 12 targets (RELA, MAPK3, NFκB1, IL6, TNF, IL1B, CASP8, CSF2, CASP3, GSK3B, PTGS2 and CCL2) participating in IL-17 signaling pathway, 16 targets (RELA, MAPK3, NFκB1, PIK3CA, IL6, JAK2, TNF, IL1B, BCL2, PIK3CB, PRKCE, PRKCB, CASP3, CCL2, IL1A and PRKCA) participating in the AGE-RAGE signaling pathway, 16 targets (RELA, MAPK3, TP53, NFκB1, PIK3CA, IL6, JAK2, TNF, JAK1, CASP8, BCL2, CREB1, PIK3CB, PRKCB, CASP3 and PRKCA) participating in the Hepatitis B, respectively. IL-17 signaling pathway is an important inflammatory pathway. IL-17 ligand binds and activates the corresponding receptor, and then the complex recruits ACT1 from the cytoplasm through the SEFIR domain. ACT1 initiates TNF receptor-associated factor 6 (TRAF6) to activate NF-κB signaling pathway, thereby improving the expression levels of pro-inflammatory factors [55]. According to current research, COVID-19 patients usually have an elevated rise in the cytokine levels, which were associated with significantly worse health conditions in those infected [56]. Therefore, we speculated that the activation of IL-17 signaling pathway was closely related to the development of COVID-19. Moreover, we found that several hub targets could simultaneously act on multiple pathways, which indicated the importance of specific targets in the whole disease bioinformatics network, and changed in these targets may affect the entire network to produce a chain reaction. For example, RELA, NFκB1 and IL6 were significantly enriched in all 9 pathways, as well as PIK3CA appeared in 8 pathways.

Drug-likeness analysis and ADMET evaluation of the selected compounds

Drug-likeness assessment is a key factor in screening compounds with drug-like properties. The molecular properties include molecular weight, Van der Waals volume, number of hydrogen bond acceptors (nHA), number of hydrogen bond donors (nHD), topological polar surface area (TPSA), number of rotatable bonds (nRot), number of rings (nRing), Log of the aqueous solubility (LogS), Log of the octanol/water partition coefficient (LogP), and logP at physiological pH 7.4 (LogD). According to the Lipinski’s rule of five, Pfizer rule and Golden triangle rule, 10 of the 18 core compounds satisfied all the filtration criteria, such as epicatechin, emodin, aloe-emodin, adenosine, licochalcone B, glycyrol, eupatin, diosmetin, (-)-catechin, and hydroxygenkwaninas shown in Table S2.

The ADMET evaluation of the pharmacokinetic properties of these selected 10 core compounds were described in Table S3. Licochalcone B and eupatin have high Caco-2 permeability and high oral bioavailability indicated by F30% than the others. Intestinal absorption analysis (HIA) revealed good absorption of all compounds except aloe-emodin and adenosine. Epicatechin, emodin, aloe-emodin, licochalcone B and (-)-catechin were non-substrates or non-inhibitors of P-glycoprotein, other compounds were substrates or inhibitors of P-glycoprotein. With the exception of adenosine, almost all selected compounds could penetrate the blood brain barrier (BBB). Epicatechin, adenosine and (-)-catechin were all non-inhibitors of cytochrome P450s (CYP). Emodin, aloe-emodin, licochalcone B, glycyrol, eupatin, diosmetin, and hydroxygenkwanin showed inhibitory activities on CYP1A2 and CYP2C9 enzymes. Besides, aloe-emodin, adenosine and eupatin showed lower clearance (CL) and longer half-life (T1/2) than other compounds. All 10 compounds were not hERG blockers. Adenosine and glycyrol showed some human hepatotoxicity (H-HT). Aloe-emodin, adenosine, and licochalcone B were predicted to have a little respiratory toxicity. Only licochalcone B showed possible carcinogenicity toxicity. However, almost all the selected core compounds showed lower toxicity than the reference drugs remdesivir and telaprevir.

Molecular docking analysis

Molecular docking was performed by Sybyl-X 2.1.1 software to evaluate the binding affinity between ligand and receptor. We selected the targets with degree greater than 9.5 (average degree) from 34 hub targets as protein receptors. Due to lack of applicable ligands of a few targets in the PDB, the target of MAPK3 (PDB ID: 2ZOQ), TP53 (PDB ID: 5AOK), PIK3CA (PDB ID: 6GVF), JAK2 (PDB ID: 3FUP), TNF (PDB ID: 6X81), IL2 (PDB ID: 1NBP), JAK1 (PDB ID: 4K6Z), IL1B (PDB ID: 5R87), CASP8 (PDB ID:3KJQ) and LCK (PDB ID: 2ZM1) were selected to construct proteins receptor. The co-crystallized ligands or inhibitors used as the reference ligands were first redocked to the protein's binding site, respectively, to set a threshold binding affinity value for comparing the results with other compounds. Then, we selected 10 core compounds that passed all the screening criteria for drug-likeness assessment, namely epicatechin, emodin, aloe-emodin, adenosine, licochalcone B, glycyrol, eupatin, diosmetin, (-)-catechin, and hydroxygenkwaninas for molecular docking validation. Moreover, we also selected these core compounds to dock with SARS-CoV-2 3CL protease (PDB ID: 6WTT) and ACE2(PDB ID: 1R4L), which are effective targets for the treatment of COVID-19 infection [57,58,59,60].

As shown in Fig. 7, almost all the core compounds exhibited effective affinity for their hub targets contrasted to the reference ligands. Figure 8 shows the 3D diagrams of binding mode between representative core compounds and related hub targets. Compared with the reference ligands, the core compounds made the expected interaction with residues of acitive binding sites and exhibited high degree of geometric complementarity. For the binding mode between hydroxygenkwanin and 3CL protease (Fig. 8a), the residues of Met49, Tyr54, Gly143, Glu166, Asp187 and Gln189 formed hydrogen bond interactions with hydroxygenkwanin, indicating that hydroxygenkwanin bound to the active site of the 3CL successfully. Eupatin formed seven hydrogen bonds with Ala348, Thr371, His374, His378, Glu402, Tyr515 and Arg518 to enhance its stability with ACE2 (Fig. 8b). The residues of Met125, Lys131 and Asp184 in MAPK3 formed hydrogen bonds with licochalcone B (Fig. 8c). Figure 8d shows the docking diagram of glycyrol and TP53. Two hydrogen bonds were formed between glycyrol and Val147 and Asp228, which enhances the stability of the conformation of glycyrol and TP53. As shown in Fig. 8e, epicatechin displayed a good binding ability with PIK3CA. Lys802, Tyr836 and Val851 formed hydrogen bond interactions with epicatechin to maintain conformational stability. The results shown in Fig. 8f indicated that aloe-emodin bound to JAK2 by generating three hydrogen bond with the surrounding residues, including Leu932, Gly935 and Gly993. According to our results, the compounds in XBCQD could act on the targets of COVID-19, which indicated that XBCQD has a certain positive effect in the treatment of COVID-19.

Fig. 7
figure 7

The docking scores between core compounds and hub targets. Total scores are expressed in -log10(Kd) units

Fig. 8
figure 8

The binding mode and conformation plots between representative core compounds and amino acid residues of hub targets. Magenta represents co-crystallized ligand or inhibitor as reference ligand. Yellow and green represent core compounds of XBCQD. Orange dashed line represents hydrogen bond. a 3CL and hydroxygenkwanin; b ACE2 and eupatin; c MAPK3 and licochalcone B; d TP53 and glycyrol; e PIK3CA and epicatechin; f JAK2 and aloe-emodin

Molecular dynamic (MD) simulation results

To analyze the conformational dynamics of the above representative core compounds and targets, 100 ns MD simulations were performed for each complex and compared with the corresponding reference ligand. The RMSDs of the backbone atoms of all complex structures relative to the initial structures were compared to investigate their equilibrium and stability levels throughout the simulations, as shown in Fig. 9. The 3CL-Hydroxygenkwanin complex was stable at an average of about 0.27 nm till 90 ns, then abruptly rose to about 0.4 nm, and finally stabilized to 100 ns. The ACE2-Eupatin complex showed a minor upward trend from 0.4 to 0.6 nm at 15 ns, followed by a stable equilibrium throughout the simulation. The MAPK3-Licochalcone B complex stabilized at about 0.5 nm during 100 ns of MD simulation. TP53-Glycyrol complex maintained an average RMSD of 0.2 nm untill 70 ns, then showed a gradual rise to 0.4 nm at 80 ns, and continued to the end. PIK3CA-Epicatechin complex rose from 0.5 to 0.7 nm around 20 ns, then decreased rapidly and remained stable at 0.5 nm till 100 ns, with a small drop to about 0.4 nm observed at 90 ns. The JAK2-Aloe emodin complex underwent fluctuations untill the end of the 100 ns period, with the mean RMSD remaining at 0.2 nm.

Fig. 9
figure 9

RMSD trajectories values of complexes in the 100 ns period of MD simulations

The RMSF of each residue of the complex was calculated to evaluate the flexibility of residues that contributed to the structural fluctuations. The results showed that all representative compounds exhibited similar trends compared to the corresponding reference ligands, that was, they caused a certain degree of fluctuations in the same region of the proteins. The RMSF plots showed that the 3CL-Hydroxygenkwanin and MAPK3-Licochalcone B complexes had largest fluctuations at the protein terminal residues (Fig. 10a, c). Only a few residues of ACE2-Eupatin, TP53-Glycyrol, PIK3CA-Epicatechin and JAK2-Aloe emodin complexes showed higher fluctuations, indicating the atomic flexibility of loops (Fig. 10b, d–f). Overall, all complexes showed equilibrium fluctuations during the simulation.

Fig. 10
figure 10

RMSF trajectories values of complexes in the 100 ns period of MD simulations

To further understand the structural stability of the complexes, we determined the compactness of the protein structure by computing the Rg (Fig. 11). The Rg plots showed that the structural dynamics of 3CL-Hydroxygenkwanin, MAPK3-Licochalcone B, TP53-Glycyrol and JAK2-Aloe emodin complexes remained fairly stable throughout the simulation time, with the average Rg values of 1.57, 1.76, 1.67 and 1.58 nm, respectively. A slight deviation in the Rg plot of ACE2-Eupatin and PIK3CA-Epicatechin can be seen during the simulation,and the steady equilibrium were finally observed at 100 ns, with mean Rg values of 1.95 and 1.49 nm, respectively.

Fig. 11
figure 11

Rg trajectories values of complexes in the 100 ns period of MD simulations

H-bonds contribute significantly to stabilizing protein–ligand complexes. To understand the binding affinity of ligands to proteins, H-bonds interaction analysis was performed on MD trajectories to calculate the total number of H-bonds formed between protein–ligand complexes (Fig. 12). The 3CL-Hydroxygenkwanin, ACE2-Eupatin, MAPK3-Licochalcone B, TP53-Glycyrol, PIK3CA-Epicatechin and JAK2-Aloe emodin complexes showed the number of H-bonds to be between 0–5, 2–10, 0–9, 0–5, 0–8 and 0–6, respectively.The results suggested that the number of H-bonds formed by all protein–ligand complexes was stable throughout the simulations. The amino acid residues in the active site were also persistent, helping to stabilize the entire complex structure.

Fig. 12
figure 12

The number of hydrogen bonds for complex systems in the 100 ns period of MD simulations

The binding free energies of the complexes were calculated using the MM-PBSA method. As shown in Table 3, the free binding energy of TP53-glycyrrhizin complex was the largest at − 138.106 kJ/mol, followed by PIK3CA-Epicatechin, 3CL-Hydroxygenkwanin, ACE2-Eupatin, MAPK3-Licochalcone B, and JAK2-Aloe emodin complex, with free binding energy of − 75.794, − 74.06, − 71.54, − 68.734 and − 66.252 kJ/mol, respectively. To get a deeper understanding of the contribution of the binding pocket residues to the complex, the binding free energy decomposition per residue was employed (Fig. 13). The free energy decomposition analysis plots showed that Leu27, Cys145 and Met165 were energetically favorable for the binding stability of 3CL-Hydroxygenkwanin complex, with the binding energy values of − 4.259, − 6.819 and − 3.485 kJ/mol, respectively. In contrast, Asp269, Trp271, His378, Glu402, Phe512 and Tyr515 were the major residues that interacted with ACE2-Eupatin and contributed to the total binding energy, with negative energy values of − 2.675, − 5.872, − 8.187, − 7.114, − 4.088 and − 6.448 kJ/mol, respectively. However, Arg273, Glu375 and Arg514 showed positive energy values of 17.886, 24.693 and 8.233 kJ/mol, indicating no favorable binding to eupatin. All other complexes also exhibited a number of favorable and unfavorable residue energy contributions as described above.

Table 3 Calculated binding free energy values with MM-PBSA method
Fig. 13
figure 13

Per residue MM-PBSA energy contribution of complex systems in the 100 ns period of MD simulations

Discussion

In this study, we systematically analyzed the chemical composition and therapeutic mechanism of XBCQD in the treatment of COVID-19 through network pharmacology, molecular docking and MD simulation. XBCQD is a classical prescription in TCM, which is composed of 4 Chinese medicines. Since Gypsum Fibrosum is a kind of sulfate mineral (CaSO4·2H2O), many studies have excluded it in the experiments [61]. Considering Gypsum Fibrosum has a strong heat-clearing ability, and is used frequently in prescriptions for the treatment of COVID-19, Gypsum Fibrosum should also be included in the range of the study [62].

Based on the theory of TCM, effective compounds were identified from the ingredients of XBCQD using network pharmacology methods, and intricate networks were constructed for XBCQD to reveal the complex interactions, properties, and meridian tropisms among 4 kinds of Chinese medicines. Analyzing the meridian tropism network of XBCQD from a systemic perspective, 3 of the 4 Chinese medicines have a specific affinity to the lung, and the lung was the largest organ system degree node. These indicated that XBCQD mainly act on the lung system. According to research, SARS-CoV-2 mainly infects the host cells by targeting the highly expressed ACE2 receptor in the lung, causing lung tissue damage and respiratory tract insufficiency [63]. Our research provided meridian tropisms theory of TCM support for XBCQD in the treatment of COVID-19.

The 67 targets in the PPI network were scrutinized and 34 targets were extracted as hub targets, which usually have significant influence and play a more critical role in the network. RELA is a subunit of the transcription factor NF-κB and the hub target with the highest degree value, which indicated that NF-κB signaling pathway may be a therapeutic approach for XBCQD in the treatment of COVID-19. The nuclear factor NF-κB regulates inflammation, immunity, and cell survival, and is considered to be a typical proinflammatory signaling pathway [64]. Inhibiting the activation of NF-κB can restore systemic hypotension, diminish intravascular coagulation, reduce tissue neutrophil influx, and prevent multiple organ injury [65]. Also, NF-κB induces the expression of inflammation and carcinogenesis proteins, such as IL2, TNF, and CXCL1 can cause inflammatory response and promote the survival and proliferation of lung tumor cells [66]. MAPK is involved in many cell physiological and pathological processes, such as cell growth, cell proliferation, cell differentiation and cell survival [67]. The phosphorylation of MAPK3/1 also enhances the co-transactivation of RELA. In addition, COVID-19 is closely related to cytokine storm [68]. Studies have shown that IL-6 is more relevant to the evaluation of COVID-19 patients and ICU patients, and their interleukin levels (IL-1β, IL-2, IL-4, IL-6, IL-7, IL-8, IL-9, IL-10, IL-13, IL-17), interferon gamma, fibroblast growth factor and TNF are high [69]. Moreover, the serum levels of IL6, IL8, IL10 and TNF in severe patients are higher than those in mild patients, indicating that the severity of COVID-19 is associated with cytokine storm [70]. Therefore, the treatment based on the control of inflammatory factors may be beneficial to infected patients.

In order to further clarify the mechanism of XBCQD in the treatment of COVID-19, GO and KEGG enrichment was performed. The results of GO enrichment showed that the hub targets were mainly involved in the apoptotic, cytokine regulation and lipopolysaccharide response. KEGG enrichment analysis showed that the main pathway involves in viral infection and inflammatory response related signaling pathways. IL-17 is a family of multifunctional cytokines, which plays an important role in the development of inflammatory autoimmune diseases. It can upregulate proinflammatory factors via the activation of NF-κB and MAPK [71]. In fact, the development of COVID-19 is closely associated with inflammation, apoptosis, and immune regulation. SARS-CoV-2 acts on the innate immune system, releasing chemokines and cytokines, which eventually lead to dyspnea and respiratory failure [72]. In the development of COVID-19, systemic inflammatory response and immune system disorders will not only affect cardiovascular system, but also damage lung or other parts of the body [73]. Recent studies have demonstrated that SARS-CoV-2 coding protein open reading frame 8 (ORF8) acted as a contributing factor to cytokine storm during COVID-19 infection. ORF8 could activate IL-17 signaling pathway and promote the expression of proinflammatory factors [74]. Therefore, targeting IL-17 can be used immunologically as a strategy to prevent acute respiratory distress syndrome in COVID-19.

Our current results showed that the effective compounds in XBCQD could regulate the inflammatory process and reduce lung tissue damage to exert its therapeutic effects on COVID-19. Furthermore, 10 core compounds were confirmed to have good drug-likeness and ADMET properties, which provided a basis on the drug-like capacity of a theoretical pharmaceutical. As a flavonoid, epicatechin has a variety of pharmacological activities, including antioxidant, anti-inflammatory and antitumor [75]. Studies have shown that epicatechin can reduce excessive inflammation and inhibits the activation of pro-inflammatory pathways by inhibiting the expression level of TNF-α and IL-6, and reduce acute lung injury [76]. Aloe-emodin is a natural anthraquinone in many Chinese herbal medicines, and has antioxidant and antibacterial activities [77]. Previous studies have shown that aloe-emodin can induce caspase-dependent apoptosis and autophagy. It also reduced the activity of non-small cell lung in a dose-dependent manner [78]. Moreover, aloe-emodin exerted anti-inflammatory effects by reducing the production of pro-inflammatory factors such as IL-6 and IL 1β [79]. Diosmetin exhibits good anti-inflammatory effect. It has been reported that diosmetin inhibits IL-4 and LPS signaling pathways, and is effective in inhibiting Th2-associated mast cells and in suppressing the expression of inflammatory cytokines [80]. Licochalcone B can induce cell cycle arrest, apoptosis, and defective autophagy, which inhibits growth and induces apoptosis of human non-small-cell lung cancer cells by dual targeting of EGFR and MET [81]. Glycyrol is an important anti-inflammatory and anti-virus coumarin derivative. Glycyrol can decrease the mRNA of the IL-1β and IL-6 in a dose-dependent manner, and inhibits I-κBα phosphorylation to prevent NF-κB activation in RAW264.7 macrophages [82]. Eupatin has significantly anti-inflammatory effect. It can reduce the LPS-induced protein expression and phosphorylation of p65 and inducible nitric oxide synthase as well as downstream products IL 6 [83]. The stable molecular docking model exhibited effective combination between core compounds and hub targets, further verifying the internal relationship between XBCQD and COVID-19. MD simulations further analyzed the stability of the receptor-ligand complexes. The RMSD analysis showed that the six simulated systems basically reached the equilibrium after 100 ns of MD simulations. The residual flexibility was evaluated by the RMSF, which suggested that the regions with large fluctuations occurred mainly in protein terminal residues and loop regions. The average values of Rg for six complexes indicated compact receptor structures, suggesting that the binding of the core compounds to the targets had little effect on the protein structure. The hydrogen bond interactions explained the high stability and good binding affinity between core compounds and targets. Moreover, we computed the per-residue decomposition of free energy using the MM-PBSA method. The identification of key residues is of great help to further design inhibitors of SARS-CoV-2 targets.

The current results gave strong theoretical support for XBCQD in the treatment of COVID-19. Nevertheless, it should be noted that there were some limitations due to the lack of verification analyses in our research. The core compounds, hub targets and related pathways predicted by the computational tools require further in vitro and in vivo studies to validate the molecular mechanism of XBCQD for the treatment of COVID-19.

Conclusions

In this study, the chemical composition and therapeutic mechanism of XBCQD in the treatment of COVID-19 were investigated by combining network pharmacology, molecular docking, and MD simulations. According to our analysis, XBCQD could treat COVID-19 by modulating and targeting a variety of cytokines, and exerted potential therapeutic effects through anti-inflammatory and antiviral. The core compounds of XBCQD exhibited high potential for binding to the active site of hub targets, and the ligand-receptor complexes gradually reached stability with the extension of molecular simulation time. This research comprehensively explained the multi-component, multi-target, and multi-pathway intervention mechanism of XBCQD in the treatment of COVID-19, which was expected to provide evidence and new insights for further pharmacological research.