Introduction

Coronaviruses are a family of enveloped, single-stranded RNA viruses. This genus includes a wide range of important human and animal pathogens, such as the porcine epidemic diarrhea virus (PEDV), Middle East respiratory syndrome coronavirus (MERS-CoV) and severe acute respiratory syndrome coronavirus 1 and 2 (SARS-CoV-1 and -2). Vaccines are now available to reduce the spread of SARS-CoV-2 and to limit the pandemic outbreak of coronavirus disease 2019 (COVID-19). Vaccines are also used to prevent and control PEDV infections (Won et al. 2020; Li et al. 2020a) but more efficient and novel approaches are still needed to combat porcine epidemic diarrhea. This highly contagious intestinal disease has led to substantial economic losses in many countries and is a great threat to the swine industry worldwide (Jung et al. 2020; Russell et al. 2020). Similarly, efficient anti-COVID-19 vaccines are available to prevent the disease and limit viral dissemination, but there is no treatment to eradicate the virus and its mutants. Mass vaccination campaigns are taking places in many countries, but the crusade against COVID-19 pandemic, which is currently impacting every country in the world, remains a priority and a challenge (Madkaikar et al. 2021).

In recent years, different small molecules active against PEDV have been identified, including several natural products such as the isoflavonoid puerarin (Wu et al. 2020, 2021), the polyphenol epigallocatechin-3-gallate (EGCG) (Huan et al. 2021), the alkaloid tomatidine (hereafter designated TD) (Wang et al. 2020a) and the flavonoids quercetin and quercetin 7-rhamnoside (Choi et al. 2009; Li et al. 2020b), to cite only a few examples. One of the most frequently targeted protein of PEDV is the coronavirus 3C-like protease (3CLpro), which has a conserved structure and plays a key role in the viral replication process (St John et al. 2016; Ye et al. 2016). Small molecules selectively targeting 3CLpro are actively searched (Shi et al. 2018; Ye et al. 2020). Both quercetin and TD were found to inhibit 3CLpro and to markedly reduce PEDV attachment and entry into cells (Huan et al. 2021; Wang et al. 2020a). TD has been found to block of 3CL protease activity in PEDV-infected cells and the compound has revealed antiviral activities in vitro against other viruses, notably the transmissible gastroenteritis virus (TGEV), porcine reproductive and respiratory syndrome virus (PRRSV), encephalo-myocarditis virus (EMCV) and Seneca virus A (SVA) in vitro. This steroidal alkaloid was discovered in the frame of a drug screening campaign using a specific library of 911 natural products tested for their anti-PEDV activity in cells. Four compounds have emerged from the primary cell-based screening: TD, EGCG, the triterpene saponin buddlejasaponin IVb (BJ-IVb) and the antifungal cyclic-lipopetide pneumocandin B0 (PNB0) (Fig. 1). These four natural products were found to markedly inhibit infection of Vero cells with PEDV in a dose-dependent manner, while showing little or no direct cytotoxic effects. Both TD and PNB0 potently inhibited the viral infection (IC50 = 3.45 and 3.48 mM respectively) whereas BJ-IVb and EGCG were slightly less active (IC50 = 8.14 and 8.76 mM respectively) (Wang et al. 2020a). Based on its favorable selectivity, Wang and coworkers selected TD to characterize further its anti-PEDV activity and in particular its capacity to block viral replication in cells. Moreover, TD was shown to bind to a purified recombinant form of PEDV-3CLpro (Kd = 2.78 μM) and a molecular modeling analysis suggested that the compound can bind to the active site of the enzyme. This elegant study concluded that TD inhibits PEDV replication mainly by targeting 3CL protease (Wang et al. 2020a).

Fig. 1
figure 1

a Structures of tomatidine (TD, CID: 65576), epigallocatechin-3-gallate (EGCG, CID: 65064) and buddlejasaponin IVb (BJ-IVb, CID: 91758420). b Structures of the three echinocandin derivatives pneumocandin B0 (PNB0, CID: 72475), micafungin (477468) et capsofungin (CID: 2826718). c Superimposed ribbon structures of the two viral proteases: Mpro of SARS-CoV-2 with its co-crystalized ligand (PDB code: 5R80) and 3CLpro of PEDV (PDB code: 4XFQ), in green and cyan, respectively

This study prompted us to investigate further the potential binding of all four compounds TD, PNB0, BJ-IVb and EGCG to the PEDV 3C-like protease, taking advantage of the crystallographic structure of the protein (Ye et al. 2016). The interactions of the four compounds with protease 3CLpro were analyzed and compared. In parallel, we investigated the binding of these molecules to the main protease (Mpro) of SARS-CoV-2 because these two coronavirus main proteases present a highly conserved structure and catalytic mechanism (Fig. 1c). We reasoned that the four inhibitors of PEDV-3CLpro might be able to bind to SARS-CoV-2 Mpro, as observed for example, with the dipeptidyl inhibitor GC376 targeting both proteases (Wang et al. 2020b). The primary objective of our computational study was to define and compare the mode of binding of the four compounds to both 3CLpro (PEDV) and Mpro (SARS-CoV-2). Our in silico analysis indicates that PNB0 has the capacity to form much more stable complexes than TD with the two proteases. This work raises the hypothesis that the antifungal drug pneumocandin B0 (PNB0) can interfere with the activity of the viral proteases, suggesting therefore that other member of the echinocandins family of anti-fungal drugs might also be able to interact with these proteins (Fig. 2). The hypothesis was tested in silico with the related drugs capsofungin and micafungin, structurally close to PNB0 (Fig. 1b).

Fig. 2
figure 2

Schematic diagram of the study design. Screening of a natural product library has identified four inhibitors of PEDV replication in cells targeting protease 3CLpro, including pneumocandin B0 (Wang et al. 2020a). We took advantage of the structural analogy between 3CLpro of PEDV and Mpro of SARS-CoV-2 to search for potential viral protease inhibitors, through the analysis of echinocandin-type antifungal agents

Methods

In silico molecular docking procedure

The tridimensional structures of the SARS-CoV-2 main protease in complex with a small molecule inhibitor (Z18197050) and the PEDV 3C-like protease were retrieved from the Protein Data Bank (www.rcsb.org) under the PDB codes 5R80 and 4XFQ, respectively (Douangamath et al. 2020; Ye et al. 2016). Docking experiments were performed with the GOLD software (GOLD 5.3 release, Cambridge Crystallographic Data Centre, Cambridge, UK). Before starting the docking procedure, the structure of the ligands has been optimized using a classical Monte Carlo conformational searching procedure as described in the BOSS software (Jorgensen et al. 2004).

With the 5R80 structure, based on shape complementarity criteria, the binding site for the ligands has been defined around amino acid residue Met165 (binding site for the ligand Z18197050). Given the important structural similarities between the two proteases, the position of the binding site is comparable in the 4XFQ structure and is centered on amino acid residue Glu187. Shape complementarity and geometry considerations are in favor of a docking grid centered in the volume defined by this amino acid. Within the binding site, side chains of specific amino acids have been considered as fully flexible. The flexible amino acids are (i) His41, Met49, Tyr54, Cys145, His164, Met165, Glu166, Phe181, Val186 and Asp187 for structure 5R80 and (ii) His41, Thr47, Tyr53, Cys144, Gln163, Leu164, Glu165, Glu185, Asp186 and Glu187 for structure 4XFQ. The ligand is always defined as flexible during the docking procedure. Up to 100 poses that are energetically reasonable were kept while searching for the correct binding mode of the ligand. The decision to keep a trial pose is based on ranked poses, using the PLP fitness scoring function [which is the default in GOLD version 5.3 used here (Jones et al. 1997)]. The same procedure was used to establish molecular models for all compounds listed in Tables 1 and 2. The empirical potential energy of interaction ΔE for the ranked complexes is evaluated using the simple expression ΔE(interaction) = E(complex)—(E(protein) + E(ligand)). For that purpose, the spectroscopic empirical potential energy function SPASIBA and the corresponding parameters were used (Vergoten et al. 2003; Lagant et al. 2004). Free energies of hydration (ΔG) were estimated using the MM/GBSA model in Monte Carlo simulations within the BOSS software (Jorgensen et al. 2004). The stability of the receptor-ligand complex is evaluated through the empirical potential energy of interaction (Vergoten et al. 2003; Lagant et al. 2004). The molecular mechanics/generalized born surface area (MM/GBSA) procedure was used to evaluate free energies of hydration (Jorgensen et al. 2004) [within the Boss program (Jorgensen and Tirado-Rives 2005)], in relation with aqueous solubility (Zafar and Reynisson 2016). Molecular graphics and analysis were performed using Discovery Studio Visualizer, Biovia 2020 (Dassault Systèmes BIOVIA Discovery Studio Visualizer 2020, San Diego, Dassault Systèmes, 2020).

Table 1 Calculated potential energy of interaction (ΔE, kcal/mol) and free energy of hydration (ΔG, kcal/mol) for the interaction of the two viral proteases with the indicated natural products
Table 2 Calculated potential energy of interaction (ΔE, kcal/mol) and free energy of hydration (ΔG, kcal/mol) for the interaction of the different compounds with Mpro of SARS-CoV-2

The two most widely used methods to investigate protein–ligand stability and affinity are molecular dynamics (MD) and Monte Carlo (MC) simulations. Both methods use an empirical force field to control the total energy (MC, energy minimization) and forces (MD, Newton equations of motion). To use MD simulations confidently, a force field parameterized for dynamical properties is required. The development of a reliable and accurate force field for the conformational analysis is a concern. It requires accuracy of the force field over the whole potential surface, rather than in the region of the global minimum (Homans 1990). The most used academic force fields (CHARMM, AMBER, GROMOS) do not exhibit the required vibrational spectroscopic quality. Determination of normal modes on a minimized protein structure using the above force fields can result in imaginary wavenumbers corresponding to maxima in the potential energy (transition states, mainly due to inadequate barriers to internal rotation). The spectroscopic SPASIBA force field has been specifically developed to provide refined empirical molecular mechanics force field parameters, as described in other studies (Vergoten et al. 2003; Meziane-Tani et al. 2006). For this reason, we preferred to use MC simulations rather than MD which requires a substantial increase in computer time to achieve the same level of convergence (Jorgensen and Tirado-Rives, 1996).

Results

The structures of the studied compounds are shown in Fig. 1. Tomatidine is a steroidal alkaloid found in unripe (green) tomatoes, generally derived from its glycoside a-tomatine (Friedman 2013). Epigallocatechin-3-gallate (EGCG) is a well-known green tea catechin with marked anti-inflammatory and antiviral properties (Chourasia et al. 2021). Buddlejasaponin IVb (BJ-IVb) is a triterpene saponin isolated from Clinopodium chinense (Benth.) O. Kuntze (Zeng et al. 2016) and from the aerial part of Pleurospermum kamtschaticum Auct. (Jung et al. 2005). Pneumocandin B0 (PNB0) is an antifungal lipohexapeptide of the echinocandin family, inhibitor of 1,3-β-glucan synthase of fungal cell (Li et al. 2018). Capsofungin and micafungin, two structural analogues of PNB0, are approved drugs used to treat diverse fungal infections (Hashemian et al. 2020; Taormina et al. 2021). Pneumocandin B0, capsofungin and micafungin belong to the echinocandin family of antifungal agents, which are largely used to treat aspergillosis, candidiasis and other fungal diseases (Mroczyńska and Brillowska-Dąbrowska 2020; Hüttel 2021).

To investigate the interaction of these compounds with the 3CLpro of PEDV, we used the crystal structure of the protease (PDB: 4XFQ) and performed a molecular docking analysis to identify the preferred drug binding site. The interaction site is centered on residue Glu187 (which superimposes well with the known Mpro binding site of SARS-CoV-2, vide infra). Molecular models were constructed with each ligand and we calculated the empirical energy of interaction (ΔE) and free energy of hydration (ΔG), as reported in Table 1. Surprisingly, we found large variations in ΔE values between compounds. Both PNB0 and EGCG appear to form very stable complexes with 3CLpro, much more favorable than the complex formed with BJ-IVb. The ΔE value calculated with TD is only 60% of the value calculated with PNB0. Representative binding models are shown in Fig. 3 for TD and PNB0. At first sight, TD seems to fit well into the drug binding cavity but the molecular contacts between the drug and the protein surface are weaker and fewer than with PNB0. The TD-3CLpro complex is stabilized by different van der Waals contacts and (p)alkyl interactions but there is a single hydrogen bond between the protein and the phenolic OH group of TD (Fig. 3f). On the opposite, multiple H-bonds can be identified between PNB0 and 3CLpro (Fig. 3c). The compound strongly interacts with the protein, mainly via its hydrophilic cyclic peptide moiety whereas the appended lipophilic dimethyl-myristoyl chain essentially points toward the outside of the protein. Similar models were obtained with compounds EGCG and BJ-IVb. The polyphenol EGCG can form equally stable complexes with Mpro compared to PNB0, whereas the triterpene saponin BJ-IVb appears to be much less adapted to interact with the protease. To sum up this first part, the four ligands rank in the order PNB0, EGCG > BJ-IVb > TD for their capacity to form complexes with 3CLpro.

Fig. 3
figure 3

Molecular models of pneumocandin B0 and tomatidine bound to 3CLpro of PEDV. Panels a and d show the ligand bound to the active site cavity of the protein (green). Panels b and e show a detailed view of the compound in the binding site, with the H-bond donor and acceptor groups available within the cavity. Panels c and f refer to the binding map contacts for each ligand, with the indicated color code

The prominent capacity of PNB0 to bind to 3CLpro of PEDV, as determined experimentally (Wang et al. 2020a) and via our computational analysis, prompted us to include two structural analogues, the approved drugs capsofungin and micafungin. We found little difference between PNB0 and capsofungin for binding to 3CLpro, in contrast to micafungin which presents a higher binding capacity (Table 1). Capsofungin bears the same dimethyl-myristoyl chain (R5 in Fig. 1b) as PNB0 but has an additional amino side chain (R3) apparently little contributing to the protein interaction (but reinforcing the water solubility of the compound). The unfused tricyclic side chain (R5) of micafungin lies into a groove adjacent to the binding site, allowing additional contacts with 3CLpro. The model in Fig. 4a shows the position of the phenyl-oxazole-phenyl unit of micafungin, interacting with the edge of the 3CLpro protein. Compared to PNB0, there are additional van der Waals and p-alkyl contacts with micafungin side chain, accounting for its superior capacity to bind to 3CLpro compared to PNB0.

Fig. 4
figure 4

Models of micafungin bound to the two proteases (a, c). Close-up view of the ligand bound to the active site, with the hydrophobic/hydrophilic zones colored in brown and blue, respectively. Note the divergent orientation of the unfused tri-aromatic side chain of micafungin between the two models. The side chain interacts with the protein edge in the case of 3CLpro whereas is essentially free in the case of Mpro

Next, considering the large structural similarity between the main proteases of PEDV and SARS-CoV-2 (Fig. 1c), we performed similar docking analyses with the protein Mpro, based on its known crystallographic structure (PDB: 5R80). In this case, there is a small molecule fragment in the crystal structure occupying a pocket around residues His164/Met165/Asp187 (Douangamath et al. 2020). This ligand was removed prior to evaluating the capacity of each natural product to bind to this site. The calculated ΔE and ΔG values (Table 1) indicate that the lipopeptide PNB0 has the highest capacity to form stable complexes with the protease compared to the three other ligands tested. The four compounds rank in the order PNB0 > EGCG > BJ-IVb > TD for their capacity to form complexes with Mpro of SARS-CoV-2. The difference of ΔE values between TD and PNB0 is huge. The tomato alkaloid TD is clearly not well adapted to interact with the viral protease. In sharp contrast, PNB0 emerges has a potential high-affinity ligand, at least from an in silico perspective. Representative models of PNB0-Mpro complexes are shown in Fig. 5 and similar models were obtained with the other compounds. PNB0 seems to be extremely well adapted to bind to the active site of the SARS-CoV-2 main protease. Multiple interaction sites between the natural product and the protein can be identified, including several H-bonds. The hydrophilic peptide ring of PNB0 fits perfectly into the protein cavity. In particular, the (3R)-hydroxy-glutamine portion of PNB0 interacts with residues Thr24 and Asn142 of Mpro and the (2S,3R)-threonine residue of PNB0 interacts with Gln189 of the protein (Fig. 5c). Compared to 3CLpro, in this case the lipophilic side chain of PNB0 is a little bit more implicated in the interaction with Mpro but the essential of the complex stability is provided by the cyclic peptide moiety.

Fig. 5
figure 5

Molecular models of pneumocandin B0 bound to Mpro of SARS-CoV-2. a The ligand bound to the protein active site. b A close-up view of the compound in the binding site, with the hydrophobic/hydrophilic zones colored in brown and blue, respectively. c Binding map contact, with the indicated color code

Both the empirical energy of interaction (ΔE) and free energy of hydration (ΔG) are extremely high for the PNB0-Mpro complex. In a recent study, we have computed the interaction of more than 30 references compounds, mainly antiviral drugs, with the Mpro protein using the same molecular docking procedure (Vergoten and Bailly 2021). The best ligands were the two anti-hepatitis C virus drugs glecaprevir and pibrentasvir. It is remarkable to see that the ΔE and ΔG values calculated with PNB0 are on the same level as those measured with these two potent antiviral drugs, well superior to the values calculated with all the other antiviral agents (Table 2). Several studies have reported that glecaprevir and pibrentasvir are high-affinity inhibitors of SARS-CoV-2 main protease (Shamsi et al. 2020; Bahadur Gurung et al. 2020; Chtita et al. 2021; Bhat et al. 2021). Our computational analysis suggests that the antifungal compound pneumocandin B0 is equally well suited to interact with this key viral protease.

The remarkable capacity of PNB0 to bind to the Mpro protein prompted us to extend our analysis to the structurally related compounds capsofungin and micafungin. As in the case of 3CLpro, we found little difference between PNB0 and capsofungin for binding to Mpro. They can form equally stable complexes with the SARS-Cov-2 protease (Table 1). In contrast, the unfused tricyclic side chain of micafungin now appears as a negative element for binding to Mpro, whereas it was a positive element for binding to 3CLpro (Fig. 4). In the case of Mpro, the bulky aromatic side chain essentially lies outside of the protein cavity and does not contribute to the drug binding process. In contrast, as mentioned above, this aromatic side chain was found to reinforce drug binding to 3CLpro.

Discussion

Drugs active against coronaviruses are actively searched. Natural products are largely considered as a source of bioactive molecules and several compounds have shown promise for the inhibition of coronavirus in humans (Mani et al. 2020). In this context, our study was built on the recent discovery by Wang and workers of the capacity of the four natural products to inhibit the 3CLpro of PEDV (Wang et al. 2020a). They identified tomatidine, epigallocatechin-3-gallate, buddlejasaponin IVb, and pneumocandin B0 as 3CLpro inhibitors and focused their study on tomatidine which was found to inhibit PEDV replication in cells. They demonstrated that TD could bind to the PEDV protease, likely representing a major target for steroidal alkaloid. Here, we have extended the study of these natural products to the binding to 3CLpro of PEDV and the structurally similar main protease (Mpro) of SARS-Cov-2. Our study failed to confirm that 3CLpro can represent a major target for TD. This alkaloid displays activities against several viruses, including the porcine epidemic diarrhea virus (PEDV) (Wang et al. 2020a, b), dengue virus (Diosa-Toro et al. 2019) and chikungunya virus (Troost et al. 2020). TD is likely a multitargeted agent.

The main point of our work is the identification of echinocandin-type drugs as potential ligands of main proteases of both PEDV and SARS-Cov-2. We show that pneumocandin B0 displays a very high capacity of interaction with Mpro of SARS-Cov-2, comparable to that measured with the reference antiviral drugs glecaprevir and pibrentasvir. Pneumocandin B0 is a biosynthetic precursor of capsofungin (Chen et al. 2015) which appears to be equally able to bind to Mpro according to our calculations. A repurposing of this drug for COVID-19 has been proposed recently (Khadka et al. 2020). Both pneumocandin B0 and capsofungin emerge as good potential binder to Mpro of SARS-Cov-2, less efficient against 3CLpro of PEDV. The calculated empirical energies of interaction (ΔE) are about 15% lower with 3CLpro compared to Mpro (Table 1). Interestingly, we found that the related drug micafungin is extremely well suited to bind to the PEDV protease, surpassing its two analogues when we compare the ΔE values. Micafungin is an antifungal drug of great value, largely used to treat neonatal candidiasis (Taormina et al. 2021) and to prevent invasive fungal infections in cancer patients (Park et al. 2019; Marena et al. 2021). Recently, this member of the echinocandin family has also been shown to inhibit Dengue virus infection (Chen et al. 2021) and has revealed activities against chikungunya virus (Ho et al. 2018) and enterovirus 71 (EV71) which is the main pathogen of the hand, foot, and mouth disease (Kim et al. 2016). EV71 bears a 3C protease essential to promote the replication of the virus (Wen et al. 2020). This specific protease could be a primary target for micafungin. The drug is used in COVID-19 patients to combat opportunistic candidiasis (Salehi et al. 2020; Sari et al. 2021). Very recently, micafungin was identified as an inhibitor of 3CL protease of SARS-Cov-2 in vitro and its capacity to bind to the monomeric form of 3CLpro was characterized using molecular dynamics simulations (Mody et al. 2021). Our own molecular docking approach supports these observations and provides additional information, via the comparison of the echinocandin binding to the two proteases Mpro of SARS-Cov-2 and 3CLpro of PEDV, and preliminary structure-binding information.

At the experimental level, pneumocandin B0 has been found to bind to PEDV-3CLpro (Kd = 2.78 μM) (Wang et al. 2020a; b) whereas micafungin was found to inhibit Mpro of SARS-Cov-2 (IC50 = 47.6 mM) (Mody et al. 2021). Our in silico analysis offers novel perspective with these compounds, anticipating a good capacity of pneumocandin B0 to inhibit Mpro of SARS-Cov-2 and a major effect of micafungin on PEDV-3CLpro. In broader terms, the different studies provide a solid rational to investigate further the antiviral proteases properties of the various echinocandins. This clinically important class of non-ribosomal antifungal lipopeptides should be further considered to target coronavirus proteases and to combat various viral infections.

Many potential inhibitors of the main protease of SARS-Cov-2 have been identified through computational methods and/or virtual screening approaches (Li et al. 2020c; Zhai et al. 2021). Our study adds two members of the echinocandin class of antifungal drugs as potential inhibitors of Mpro. An experimental validation of this hypothesis is now ongoing.