Introduction

In 2019, Wuhan, China, was the host of a new type of coronavirus that caused SARS-like symptoms in human beings. In order to better know this new virus, phylogenetic analysis was conducted considering the complete viral genome (29,903 nucleotides). The results revealed that the new virus has 89.1% nucleotide similarity to genus Betacoronavirus -subgenus Sarbecovirus- that had previously caused an epidemic infectious disease known as SARS [1]. The new virus is called COVID-19 which contains major structural proteins (Spike (S) protein, envelope (E) protein, membrane (M) protein, and nucleocapsid (N) protein.). The schematic representation of a coronavirus is shown in Fig. 1. Among them, S protein can promote fusion of the viral and cellular membranes. So, it facilitates the entry of coronavirus into the host cells [1].

Fig. 1
figure 1

Schematic representation of a coronavirus. COVID-19 contains at least four structural proteins: Spike (S) protein, envelope (E) protein, membrane (M) protein, and nucleocapsid (N) protein

S protein with 1288 amino acid consists of S1 and S2 subunits that mediates the fusion between the viral and host cell membranes. These subunits include the receptor-binding domain (RBD) and fusion peptide (FP), heptad repeat 1 (HR1), heptad repeat 2 (HR2), transmembrane domain (TM) and cytoplasmic domain fusion (CP). S1 and S2 have different roles in this entry; S1 plays a vital role in binding cellular receptor and S2 can facilitate viral fusion and entry [2]. A Conserved Receptor Binding Domain (RBD) connects to angiotensin-converting enzyme 2 (ACE2) that is known to interact with 14 aa in the S1 of SARS-CoV. Studies also show that ACE2 is a receptor of COVID-19. Moreover, sequence variations of the Spike protein cleavage site are important in terms of cellular tropism and pathogenesis. The cleavage site in the S-protein sequence is involved in the pathogenesis of virus. The sequence of S-protein in COVID-19 has 12 extra nucleotides upstream to the single Arg¯ cleavage site 1 which results in PRRAR↓SV sequence that is solvent-exposed and resembles a cleavage site [3]. The reason for the well-organized spread of the virus among the population is this new cleavage site. Previous coronaviruses lacked this system. As mentioned earlier, the S2 subunit in COVID-19 is responsible for the facilitation of viral fusion and entry into the host cell. There exists a kind of interaction regarding cellular membranes between heptad repeats 1 (HR1) and heptad repeat 2 (HR2) which leads to fusion. The changes in the interaction pattern between HR1 and HR2 domains in COVID-19 S2 (in terms of fusion with cell membranes) are caused by mutations inside the HR1 core region [4]; 8 residues out of 21 indicate mutation (~38% difference). It is currently claimed that ACE2 could be the host receptor for COVID-19 as well. Perhaps, the susceptibility, symptoms, and outcome of COVID-19 infection are resulted by the expression level and pattern of ACE2 in tissues. According to the recent research, single-cell analysis of RNA-sequencing (RNA-seq) shows a higher expression of ACE2 in Asian males [5]. COVID-19 engages ACE2 as a receptor for entry into the host cell and employs the cellular transmembrane protease serine 2 (TMPRSS2) for S protein priming. TMPRSS2 activates the viral S protein which facilitates virus-cell membrane fusions [6]. Residues near lysine 31, and tyrosine 41, 82–84 and 353–357 in human ACE2 play an important role in the binding of S-protein coronavirus. The adipose tissue might be in a weak position regarding COVID-19 because ACE2 expression in adipose tissue was higher than that in lung tissue. Considering five different types of cancers, the researchers realized that the expression of ACE2 remarkably increases in tumor tissues, compared to other neighboring tissues. Hence, it is suggested that during COVID-19, obese individuals and patients who suffer from these five types of cancer should be prioritized [7]. In terms of gender and age, it should be mentioned that the expression of ACE2 is positively associated with age [8, 9]. Eight hundred kDa polypeptide is produced during transcription of the COVID-19 genome. This polypeptide is cleaved to produce various proteins. Meanwhile, papain-like protease (PLpro) and 3-chymotrypsin-like protease (3CLpro) facilitate proteolytic processing. Papain-like proteinase (PLpro) is responsible for the cleavages at the N-terminus of replicase polyprotein. 3CLpro plays an important role in coronaviruses and is strongly preserved in this family of viruses. Proteins necessary for viral replication are nonstructural and are produced by 3CLpro which activate the polyprotein at 11 distinct sites. This process would prepare the ground for virus replication; hence, to find anti-viral activities in compounds, researchers pay careful attention to 3CLpro [10]. Some options are suggested as a treatment for COVID-19, among which drug repositioning (repurposing) is considered as the only answer to the epidemic of sudden infectious diseases. Drug repurposing is an effective strategy to treat new diseases caused by infectious agents that spread rapidly. In this paper, the drug repurposing strategy is applied using virtual screening to identify therapeutic options for COVID-19.

Methods and materials

Homology modeling was used to generate five proteins of COVID-19)3CLpro, PLpro, cleavage site, HR1 and RBD in Spike protein (and TMPRSS2. The homology models of these proteins were done in the SWISS-MODEL workspace [11, 12] (https://swissmodel.expasy.org/docs/references). Protein Data Bank (https://www.rcsb.org) was used to have ACE2 coordinates [13] (https://www.rcsb.org/structure/1r42). First, the water molecules were extracted from protein and hydrogen atoms were added to optimized hydrogen bonds. 3Drefine server and the package of GROMACS 5.1.4 were applied to minimize energy in the optimized model of the protein [14]. The section of small-molecules of the Drug Bank database [15] (https://www.drugbank.ca/about) was used to obtain all FDA-approved drugs. The Non-unique structures that were removed during the process are as follows: the compounds containing rare atoms (selenium, platinum, gold, silicon) and organometallic compounds. Finally, the screening database had 2471 compounds. Pyrx tool was used for energy minimizing and preparing ligands in PDBQT format, and AutoDock Vina was applied for virtual screening. It was used AutoDock Vina for drug discovery to screen libraries of compounds against targets [16, 17]. The box was centered on a specific area. This position using PockDrug server and it was determined by using data in uniprot about the main amino acids in the bond. According to previous evidence, the cleavage site in the S-protein and the conserved Receptor Binding Domain (RBD) is important role in the pathogenesis of virus. So at the first, these two targets were selected for virtual screening with all FDA-approved drugs. The drugs that had the desired binding energy at this stage and were reported to have antiviral properties were then selected for the next step. Therefore, virtual screening was performed for the selected drugs with ACE2, 3CLpro and PLpro and HR1 and TMPRSS2 in the next step. 2-D representations of protein/ligand complexes are shown using LIGPLOT program [18] (https://www.ebi.ac.uk/thornton-srv/software/LIGPLOT/).

Results and discussion

FDA-approved drugs were screened against cleavage site and RBD in Spike protein via virtual screening in this present study. It was observed that 128 FDA-approved drugs with the most suitable free-binding energy (out of 2471 drugs) were attached to the cleavage site and RBD in Spike protein. Of these 128 drugs, 18 drugs have either been used currently as antiviral or have been reported to possess antiviral effects. Virtual screening was then performed for the 18 selected drugs with ACE2 (major COVID_19 receptor), 3CLpro and PLpro (main enzymes in viral replication) and HR1 (main domain in virus fusion with the host cell membrane) and TMPRSS2 (main enzyme in activating the viral spike glycoproteins). According to the results, 7 drugs with all the target proteins achieved the most favorable free binding energy (Table 1). The protein complexes with these seven drugs are shown in Figs. 2, 3, 4, 5, 6, 7, and 8. Hydrophobic and hydrogen bonding interactions of docked molecules were visualized using LigPlot program (Table 2). As shown in Table 2, glycyrrhizic acid and hesperidin showed highest number of bonds with ACE2 and RBD. As shown in Table 1, paritaprevir and simeprevir have favorable binding energy to most target proteins and may be considered as therapeutic options for the coronavirus. These drugs were originally used in the treatment of chronic hepatitis C and can effectively bind to target proteins [10, 19, 20]. There are other drugs with good binding energy to target protein; one of them is ledipasvir. According to an in silico study, ledipasvir is an antiviral drug with the minimum side effects that can be applied in the treatment of COVID19 [21]. In the present study, medicinal compounds of plant origin were also attached with favorable binding energy to the target proteins. Glycyrrhizic acid which has plant origins is obtained from the roots of the licorice plant. Triterpene glycoside and glycyrrhetinic acid are used pharmacologically. The latter is applied as a hepatoprotective option to treat chronic hepatitis in Japan and China (https://pubchem.ncbi.nlm.nih.gov/compound/Glycyrrhizic-acid). In addition, glycyrrhizin, being non-toxic [22], is also used widely as a SARS treatment option. According to a study in 2003, glycyrrhizin has a preventive role in SARS [23]. Furthermore, based on studies, it plays an effective role in the hepatitis C virus (HCV) because it can influence the release phase associated with the infection of cells [24]. Using glycyrrhizin to treat hepatitis C is not a new method and it has been applied as a treatment for 20 years in Japan. Fortunately, the studies have shown its preventive role in the progression of cirrhosis and hepatocarcinoma and only few side effects [25]. The positive effects of glycyrrhizin have been proven on many viral diseases, such as herpes simplex type 1 (HSV-1), varicella-zoster virus (VZV), hepatitis A (HAV) and B Virus (HBV), human immunodeficiency virus (HIV), severe acute respiratory syndrome (SARS) and coronavirus, Epstein–Barr virus (EBV), human cytomegalovirus and influenza virus [24]. Glycyrrhizic acid’s preventive impact on ACE2 has been reported in some studies [26]. Aside from glycyrrhizic acid, hesperidin and tannic acid are also considered in medical circles; they have plant origins as well and can attach to proteins with good binding energy. Hesperidin is a flavonoid extracted from citrus fruit. It has multiple biological and antimicrobial activities against human viruses. Based on a study conducted in 2009, glucosyl hesperidin has an inhibiting role against the replication of influenza A virus [27]. In a study aimed at analyzing therapeutic targets for COVID-19 by computational methods, it was shown that hesperidin disrupts in the interaction of ACE2 with RBD [1]. As mentioned earlier, the 3CLpro can be an effective target for treatment of coronavirus. A study on investigation of the effect of active compounds on the inhibition of 3CLpro showed that hesperidin was the most potent inhibitor of 3CLpro among the compounds which were tested. Based on docking score, it was reported that hesperidin and glycyrrhizin are favorable in terms of binding to ACE2 and 3CLpro [28]. TMC-310911 is another FDA-approved drug that has good binding with target proteins. TMC-310911 is a new investigational protease inhibitor that is being investigated for use in HIV-1 infections. TMC-310911 is currently being investigated as a potential treatment for COVID-19 (https://www.drugbank.ca/).

Table 1 Docking results of 18 FDA-approved drugs with ACE2 (major COVID_19 receptor), 3CLpro and PLpro (main enzymes in viral replication) and HR1(main domain in virus fusion with the host cell membrane) and TMPRSS2 (main enzyme in activation the viral spike glycoproteins) by using PyRx (PyRx 0.8) software
Fig. 2
figure 2

Complexes of drugs with 3CLpro

Fig. 3
figure 3

Complexes of drugs with ACE2

Fig. 4
figure 4

Complexes of drugs with ACE2

Fig. 5
figure 5

Complexes of drugs with HR1

Fig. 6
figure 6

Complexes of drugs with PLpro

Fig. 7
figure 7

Complexes of drugs with RBD

Fig. 8
figure 8

Complexes of drugs with cleavage site in Spike protein

Table 2 Details of docking results using LIGPLOT program

Conclusion

Drug repurposing is an effective strategy to treat new diseases caused by infectious agents that spread rapidly. In this paper, the drug repurposing strategy was applied using virtual screening to identify therapeutic options for COVID-19. FDA-approved drugs were screened against cleavage site and RBD in Spike protein via virtual screening. It was observed that 128 FDA-approved drugs (out of 2471 drugs) with the most suitable free-binding energy were attached to the cleavage site and RBD in Spike protein. Out of these 128 drugs, 18 drugs were antiviral. Virtual screening was then performed for the 18 selected drugs with ACE2 (major COVID_19 receptor), 3CLpro and PLpro (main enzymes in viral replication), HR1 (main domain in virus fusion with the host cell membrane) and TMPRSS2 (main enzyme in activating the viral spike glycoproteins). According to the data of this study as well as the related literature, paritaprevir, simeprevir, ledipasvir, glycyrrhizic acid, TMC-310911, and hesperidin can be suitable candidates for further investigation on the treatment of COVID-19 infection. It seems that investigations of RNA polymerase inhibitors may be effective for better selection of drugs in the treatment of COVID-19.