1 Introduction

A new human coronavirus (n-CoV), lately in December 2019 emerged from Wuhan city, China, that resulted in mysterious coronavirus outbreak globally. World Health Organization (WHO) has declared coronavirus disease-2019 (COVID-19) or severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) as a highly transmissible disease that leads from mild to critical pulmonary infections around the globe. The case studies of SARS-CoV-2 infections have pointed towards its connection with seafood market, but still no evidence has been found to support them. The nCoV-19 infections mainly include pulmonary stress, fever, poor renal functioning, sputum production, anosmia, ageusia and multiple misfunctioning (Ji et al., 2020).

The rapid spread of nCoV-19 infections globally gained the attention and created apprehension worldwide. In the past, the two beta coronavirus outbreaks (i.e. severe acute respiratory syndrome (SARS), and Middle East respiratory syndrome (MERS)) were occurred, which were less severe and contagious with 35% fatality rate (Yu et al., 2020). New novel coronavirus has been recognized as a new member of beta genus of Coronaviridae family (Gao et al., 2020). Four major genera of coronaviruses (i.e. alpha, beta, gamma and delta) have been discovered (Saif et al., 2019). Coronaviruses are positive single-stranded enveloped 3-kb RNA viruses. All corona viruses possess specific gene in their open reading frame (ORF) region that encodes for viral replicative, spike and capsid proteins (Shereen et al., 2020).

Coronaviruses (CoVs) employ two open reading frames ORF1a and ORFlab and the polyproteins further split and code structural and non-structural viral proteins. Structural protein has been characterized as spike (S), membrane (M), nucleocapsid (N) and envelope (E) protein, while non-structural proteins include a range from nsp1 to nsp16 (Khan et al., 2020). Along with nCoV-spike protein, the RNA-dependent RNA polymerase is the main viral protease, which is directly involved in human cell infections. Due to this complex formation, the RNA-dependent RNA polymerase (nsp-12) of nCoV-19 adopts right-hand shape and looks like finger, palm and thumb domains.

RNA-dependent RNA polymerase (nsp-12) makes a complex with two cofactors nsp7 and nsp8 to play a key role in RNA synthesis, transcription and replication process of the SARS-CoV-2. Thus, nsp12 is considered as the main target for different antiviral agents to hinder the replication cycle of nCoV-19 directly (Gao et al., 2020). Ongoing COVID-19 outbreak related to highly pathogenic form of nCoV-19 strain has highlighted the necessities of urgent need of drug development. The emerging nCoV-19 infections demand nucleotide analogue-based antiviral therapies directly targeted the external components of the replication machinery of nCoV-19 RNA virus (Gao et al., 2020). Therefore, the current study was planned to target RNA-dependent RNA polymerase by different phytochemicals to inhibit viral transcription and replication.

2 Materials and methods

The methodological framework followed in this study is shown in Fig. 1. Swiss Model server was used to build 3D structure of RNA-dependent RNA polymerase (nsp12) (GenBank accession number: YP_009725307.1) of SARS-CoV-2. 6M71_A from Protein Data Bank (PDB) was used as a template. The predicted 3D models were evaluated via verify 3D (Lüthy et al., 1992), ERRAT (Colovos & Yeates, 1993) and Swiss Model Ramachandran plot (Waterhouse et al., 2018). The superimposition of predicted 3D model and template was done through UCSF chimera work bench (Pettersen et al., 2004).

Fig. 1
figure 1

The framework for research methodology followed in this study

2.1 Ligand database preparation and receptor refinement

From the extensive survey, 200 ligands were listed specifically from medicinal plants. Chemical structures of all the ligands were retrieved from PubChem database (http://www.ncbi.nlm.nih.gov/pccompound) in SDF format and after energy minimization saved into MOE database. The predicted 3D model of nsp12 was used as a receptor protein. Optimization of receptor was done by the removal of water molecules, addition of hydrogen atoms, energy minimization and 3D protonation for the perfect and accurate docking. The minimized structure was then docked against selected ligands.

2.2 Molecular docking

Molecular docking was performed at the active site in chain A of the nsp-12 of SARS-CoV-2. Site finder tool of Molecular Operating Environment (MOE) was used to find the binding residues with parameters; re-scoring 1: London dG, Retain: 10, refinement: force field; re-scoring 1: London dG, retain: 10 to calculate the interactions of ligands with the binding residues of nsp-12 protein of SARS-CoV-2. The prepared ligands were then docked against RNA-dependent RNA polymerase (nsp-12) of SARS-CoV-2 using MOE software. Most appropriate interactions and bindings between the receptor protein of nCoV-19 and ligands were selected on the basis of their best S-scores, root-mean-squared deviation (RMSD) and energy validation rankings.

2.3 Drug scanning

Druglikeness was executed following the Lipinski’s rule of five (http://www.scfbio-iitd.res.in/software/drugdesign/lipinski.jsp). Lipinski’s rule of five reveals the drug-like or non-drug-like molecules and prognosticates high probability of success and failure of drug candidates in preclinical stage complying with five rules. This rule determines the druglikeness on the basis of molecular mass (< 500), hydrogen bond donors (≤ 5) and acceptors (≤ 10), log P value (≤ 5) and molecular refractive index (40–130) (Lipinski, 2004).

2.4 ADMET profiling

Furthermore, ADMET-based chemical scanning of drugs was done using the online comprehensive tool admetSAR (Cheng et al., 2012). Evaluation of ADMET properties (absorption, distribution, metabolism, excretion and toxicity) of potential drug molecules has become important part of in silico drug development and discovery. ADMET-based drug classification based on substructure recognition pattern of molecules helps to predict the likeness of drug from perspectives of medical chemistry.

3 Results

SARS-CoV-2 or nCoV-19 has been listed as a new species among beta corona viruses and declared as a causative agent of recently on going pandemic worldwide (Mahrosh & Mustafa, 2021). RNA-dependent RNA polymerase is a key protein of viral genome that mediates the replication and RNA synthesis machinery of nCoV-19. Therefore, the RNA-dependent RNA polymerase (nsp-12) was selected in this study to explore the structural assessment and binding sites of viral protein in order to target them. The purpose of this study was to construct a receptor to be targeted by selected ligand molecules from medicinal plants. All 200 ligands from different classes of medicinal plants were docked against the selected receptor protein of SARS-CoV-2 (i.e. nsp-12).

3.1 Homology modelling

Protein sequence of nsp-12 of nCoV-19 was used to develop 3D structure via Swiss Model (Waterhouse et al., 2018). Swiss Model is a fully automated protein homology modelling server, which provides structural assessments, comparative modelling and prediction of templates to understand the properties and functioning of proteins. The modelled 3D structure of nsp-12 is shown in Fig. 2a and its superimposition with selected template in Fig. 2b.

Fig. 2
figure 2

3D structure prediction of nsp-12 and its superimposition. a Predicted 3D structure of RdRp (nsp-12), b Superimposition of nsp-12 with selected template (the structure of nsp-12 is shown in blue and that of template in red)

The superimposition of predicted model and template 6M71_A exhibited RMSD value of 0.172 and Q-score: 0.954. Predictions of RMSD and Q-mean were done via UCSF Chimera. Low RMSD value indicates the accuracy and validity of the predicted model (Bukhari et al., 2018). Quality and structural assessment of predicted model was verified using different servers including Verify 3D, ERRAT and Swiss Model Ramachandran plot. These servers demonstrated the quality of the predicted model by checking the refinement, structure and resolution of the model. Verify 3D uses the 3D profile of protein to assess and analyse the compatibility of the model. The evaluation of predicted model of nsp-12 is given in Fig. 3.

Fig. 3
figure 3

Evaluation of predicted model of nsp-12. Evaluation via; a Verify 3D, b ERRAT, c Ramachandran plot

According to Verify 3D, 89.20% of amino acids of nsp-12 exhibited an average 3D-1D score of ≥ 0.2, which indicated the accuracy and validity of the model (Fig. 3a). The ERRAT predicted the overall quality factor that showed that 95% of amino acid residues fall below the threshold level that centred the fitness of the predicted model (Fig. 3b). Swiss Model Ramachandran plot is dedicated to the stereo-chemical validation and reliability of the predicted model. Ramachandran plot analyses the main torsion angles Phi, Psi (φψ) of amino acid residues in a polypeptide chain present in the protein. According to Ramachandran plot, about 96.05% residues fall in the favoured regions, 0.56% in Ramachandran outliers with 1.03 MolProbity score which confirmed the accuracy of the predicted model (Fig. 3c).

3.2 Molecular docking

A ready-to-dock library of 200 selected phytochemicals was prepared and then docked with nsp-12 receptor protein using docking algorithm of MOE software. MOE confirms correct conformation of ligand for getting a minimum energy structure. Top ten conformations of phytochemicals were determined through docking, which occupied maximum binding pocket, strong hydrogen bonding and with best S-score and RMSD value. MOE provided multiple conformations of each phytochemical, and on the basis of minimum S-score, the top 10 selected phytochemicals were found to be verbenalin, epigallocatechin, swertisin, nobiletin, pinoresinol, caftaric acid, hesperetin, islandicin, neochlorogenic acid and sesamin. Details of each ligand with their S-scores, RMSD values and interacting residues are given in Table 1.

Table 1 Interaction details of top ten bioactive phytochemicals in the proposed site of nsp-12 of SARS-CoV-2

3.3 Interaction analysis

Among 200 selected ligands, top 10 were selected on the basis of minimum S-score and binding interactions with SARS-CoV-2 receptor molecule. The interactions of best ten ligands with nsp-12 of nCoV-19 are shown in Fig. 4. First phytochemical, verbenalin formed five interactions with different residues of receptor molecule (i.e. Arg349, Ser664, Asn628, Pro677 and Glu350) (Fig. 4a). Verbenalin is a terpene glycoside and has been found in Verbena officinalis which is a medical herb and used for various clinical purposes such as influenza, malaria, headaches, fever and dermatitis treatment.

Fig. 4
figure 4

Interactions of top ten ligands with nsp-2 of COVID-19 as target protein. a Verbenalin, b Epigallocatechin, c Swertisin, d Nobiletin, e Pinoresinol, f Caftaric acid, g Hesperetin, h Islandicin, i Neochlorogenic acid, j Sesamin

The second phytochemical, epigallocatechin showed interactions with Ser664, Phe326 and Glu665 residues of the receptor protein (Fig. 4b). Being an antioxidant in nature, it provides protection against free radicals in the body. Epigallocatechin is a major constituent of green tea, black tea and almonds and plays a role as an antioxidant. Basically it is flavan-3,3′,4′,5,5′,7-hexol and potent catechin and exhibits 2R, 3R-configurations. Third ligand was found to be swertisin that has been derived from apigenin and is abundantly found in chamomile plants. Swertisin showed binding interactions with Arg349, Ser664, Phe326, Asn628 and Thr324 amino acid residues of the receptor protein (Fig. 4c). Swertisin plays various roles such as an anti-inflammatory agent, an antioxidant and adenosine A1 receptor antagonist. Fourth ligand was nobiletin that interacted with Arg349 and Tyr346 residues of nsp-12 receptor protein (Fig. 4d). Nobiletin is a methoxy flavone, an antineoplastic agent with methoxy groups at 5, 6, 7, 8, 3′ and 4′ positions. Citrus fruits specifically the peel of Citrus nobilis and Citrus aurantium are natural sources of this flavanone.

The fifth phytochemical, pinoresinol exhibited two binding interactions with the receptor (i.e. Ser664 and Tyr346) (Fig. 4e). Pinoresinol is a lignin and enantiomer of ( +)-1S, 4S, 3aR pinoresinol that has been extracted from Forsythia suspensa and Styrax platanifolius. Pinoresinol is a phytoestrogen and hypoglycemic agent. The sixth phytochemical, caftaric acid is the principle esterified phenolic acid and found in grapes, raisins and berries. Caftaric acid showed interactions with Arg349, Lys676 and Leu460 residues of nsp-12 (Fig. 4f). Hesperetin was the seventh phytochemical that formed two interactions with the receptor protein (i.e. Arg349 and Glu350) (Fig. 4g). Hesperetin with three hydroxy groups is a trihydroxy flavanone and has anti-inflammatory, antioxidant, anti-allergic, vasoprotective and hypolipidemic properties. Citrus fruits are the natural sources of hesperetin and therefore also used in lowering the cholesterol level.

The eighth phytochemical, islandicin exhibited two binding interactions with Arg349 and Tyr346 residues of the nsp-12 (Fig. 4h). Islandicin is a trihydroxyanthraquinone and belongs to the organic compound class anthraquinones. It has been extracted from the roots and bark of Ventilago madraspatana plant. Neochlorogenic acid or trans-5-caffeoylquinic acid was the ninth phytochemical, which is a cinnamate ester. Neochlorogenic acid formed interactions with Arg349, Tyr346 and Phe396 residues of the nsp-12 (Fig. 4i). This compound is produced during the metabolic pathways in plants. Neochlorogenic acid is abundantly found in Prunus persica (peaches) but also present in Vaccinium corymbosum (Highbush blueberry), Levisticum officinale (Lovage) and Arctium (Burdock root). The tenth phytochemical, sesamin is a lignin and has been extracted from Cinnamomum camphora (Camphor tree). Sesamin showed interactions with two amino acid residues of the receptor protein (i.e. Ser664, Thr462) (Fig. 4j). It exhibits various properties such as neuroprotective and antineoplastic agent and also exerts cytotoxic effects. The binding patterns of all these ligands with nsp-12 of nCoV-19 are shown in Fig. 5. Results demonstrated the potential of these ligands as they can block the target site (i.e. viral protein). Therefore, these ten molecules could be used as potential drugs against COVID-19 in future with fewer or no side effects.

Fig. 5
figure 5

Binding patterns of top ten ligands with nsp-2 of COVID-19 as target protein. a Verbenalin, b Epigallocatechin, c Swertisin, d Nobiletin, e Pinoresinol, f Caftaric acid, g Hesperetin, h Islandicin, i Neochlorogenic acid, j Sesamin

3.4 Drug scan

The Lipinski’s rule illustrates five parameters to distinguish between drug-like and non-drug-like molecules. According to these parameters, molecular mass of the potential drug candidate should be less than 500 Dalton, the molecule should have high lipophilicity (log P ≤ 5), hydrogen bond donors should be ≤ 5, hydrogen bond acceptors should be (≤ 10) and molar refractivity should be significant for drug pharmacokinetics in the human body (i.e. between 40 and 130). Only the drug that accomplished all these parameters could be proved as a good drug candidate. All the ligands selected in this study followed the Lipinski’s rule and were revealed as potential drug candidates with fewer or no adverse effects (Table 2).

Table 2 Evaluation of selected phytochemicals through Lipinski’s rule of five

3.5 ADMET profiling

After drug scan, ADMET-based properties of drugs molecules were checked. The selected top ten candidates overall fulfilled the criteria of being good drug candidates as the results illustrated that all compounds are non-toxic except islandicin and swertisin, but all were found to be non-carcinogens. Verbenalin, epigallocatechin, swertisin and hesperetin showed little violations, but overall results could be accepted, and these lead compounds are appropriate to be used as drug candidates against receptor molecule (Table 3).

Table 3 ADMET profiling of top ten selected phytochemicals

4 Discussion

The natural flora has been used as medical purposes in various cultures and human inhabitant for decades. A vast variety of phytochemicals with biologically active constituents has been reported from medicinal plants that have been used for maintaining good health and to cure various diseases (Mustafa et al., 2016). The extracts from medicinal plants are comprised of vitamins, minerals and organic acids with fewer side effects (Mustafa et al., 2017).

Computational biology skills help scientist to evaluate the possibilities of binding mode of different ligands before the experimental work in the laboratory. Molecular docking is an elaborative method to discover the interactions of different macromolecules as drugs against the receptor moiety. Molecular docking is an economic and modern drug discovery method that provides valuable information about the active site targeting of particular macromolecule (Mustafa et al., 2020). In the current study, computational docking of phytochemicals was performed to explore the RdRp antagonists. Among 200 ligands used in this study, top ten phytochemicals with best docking scores were found to be verbenalin, epigallocatechin, swertisin, nobiletin, pinoresinol, caftaric acid, hesperetin, islandicin, neochlorogenic acid and sesamin. Arg349 was found to be the common amino acid residue through binding modes and interaction patterns of docking studies of these ten ligands with nsp-12 of the nCoV-19. All the ligands fully accomplished the Lipinski’s rule of five and showed no desecration.

Within span of few months, the nCov-19 has caused the worst pandemic COVID-19 or SARS-CoV-2, which is an extremely pathogenic and has transmitted with high exponential rate (Gao et al., 2020). Currently, no drugs or clinically approved therapeutics are reported for the treatment of SARS-CoV-2 infections. The urgent necessity at this time is to discover some novel drug candidate(s) to overcome the viral infection caused by nCoV-19. Novel coronavirus belongs to the Coronaviridae family and causes acute lung disease accompanied by pulmonary stress and failure and eventually leads to death. The genome of the novel corona virus codes four major structural proteins and 16 non-structural proteins (Khan et al., 2020). In this study, one of the non-structural proteins (i.e. nsp-12 that is an RNA-dependent RNA polymerase) was selected. The mystery of viral eradication still needs more studies to explore the full mechanism and machinery of the viral genome in order to target it directly. RNA-dependent RNA polymerase plays a vital role in the replication, transcriptions and RNA processing machinery of the virus. The RdRp (nsp-12) of SARS-CoV-2 accompanied two more proteins (i.e. nsp-7 and nsp8) as cofactors to make RdRp complex with three domains (Kirchdoerfer et al., 2019). Virus upon interaction with its host cells invade into the cell and multiplies by relying on the RNA synthesis machinery. RdRp encodes RdRp proteins that possess crucial ability to mediate replication and synthesis of RNA viruses. Thus, RdRp of nCoV-19 has potential to be targeted by drugs and nucleotide analogues (Chakrabortya et al., 2020).

The aim of this study was to explore three-dimensional structure of nsp-12 through homology modelling and computational docking to check the binding/ inhibiting potential of various plant-based ligands against viral protein. Total 200 ligands from different medicinal plants were docked against RNA-dependent RNA polymerase (nsp-12) of SARS-CoV-2 to explore their antiviral activities. Antiviral effects of various medicinal plant extracts have been the subject matter in various cases and studies. These phytochemicals exert several effects by blocking the viral receptors, interrupting the enzymatic functioning and inhibiting the biosynthetic machinery of the virus. Moreover, many antiviral agents have been reported with maximum antiviral activity (Adem et al., 2020).

Verbenalin (a terpene) is a potent inhibitor of transcriptional factors. It has been used for traditional treatments of inflammation, fever and infections (Akbar, 2020). Epigallocatechin is a flavan and an antioxidant and is an abundant phytochemical found in green tea. Epigallocatechin prevents liver cell injury, reduces the oxidative stress and mitigates the induction of pro-inflammatory cells (Baliga et al., 2019). Swertisin is a C-glycoside flavone and used as an antagonist of adenosine A1 receptor, antioxidant, anti-inflammatory agent and hypoglycemic agent (Srivastava et al., 2018). Nobiletin (5,6,7,8,3′,4′-hexamethoxyflavone) is a flavonoid and found in citrus fruits with multifunctional pharmaceutical properties. Nobiletin has been reported for its various beneficial effects such as antioxidant and anticancerous activity, anti-inflammation, immunomodulator and anti-osteoclast genesis agent (Huang et al., 2016).

Pinoresinol is a lignin that is a plant secondary metabolite extensively used as anticancer and antiviral agent (Schroeder et al., 2006). Caftaric acid and its enantiomers have been reported previously with anti-allergic, anti-inflammatory and immunostimulant agents (Bardal et al., 2011). Hesperetin also known as trihydroxy flavanone or citrus flavanone is found in lime, lemon and citrus fruits. Hesperetin is found in peels of citrus fruits in great quantity. Hesperetin exhibits antiviral, antioxidant and antineoplastic activity and also used for the lowering of blood cholesterol level (Yumol & Ward, 2018). Islandicin is a trihydroxyanthraquinone recorded with enzymatic inhibitory property (Wiart, 2013). Neochlorogenic acid is the most common polyphenolic compound commonly found in dry fruits. It is mainly reported as an antioxidant and scavenges reactive oxygen species (Ma et al., 2020). Sesamin is a lignan and has been isolated from Fagara plants and Cinnamomum camphor. Sesamin exhibits antitumor, antioxidant and cytotoxic activities (Sato & Matsui, 2012).

All the ten ligands selected in this study showed strong interactions with nsp-12 of nCoV-19. No single drug has been found effective against SARS-CoV-2 till date. The RdRp being important viral replication protein could be a suitable target for antiviral agents. Based on the current findings and binding scores, these ligands could be potential inhibitors of nCoV-19 or SARS-CoV-2.

5 Conclusion

The current COVID-19 pandemic influenced the world severely. This work seeks to find out potential antagonists of SARS-CoV-2. In the current study, plant-derived phytochemicals, which have been proved effective against many viral and bacterial proteins in literature, were used. Out of 200 phytochemicals that were screened, ten were selected based on the hydrophobic interactions, binding amino acid residues and S-scores. Verbenalin, epigallocatechin, swertisin, nobiletin, pinoresinol, caftaric acid, hesperetin, islandicin, neochlorogenic acid and sesamin were found to be the best ones. All ten best selected ligands exhibited strong binding interactions with receptor molecule, and Arg349 was centred in the common side chain as a directly interacting residue. In the light of current results, it is concluded that these natural phytochemicals could be good antagonists of the viral protein to stop its proliferation. The likeness of these drugs as good antagonists of receptor viral protein has satisfied that these ligands could lead as potential drug candidates against SARS-CoV-2.