In the preliminary docking study, CDocker energy as well as the CDocker interaction energy were calculated and considered for the screening of the compounds. The CDocker energy of Discovery Studio (DS) v 2018 provides comparatively accurate information regarding the binding affinity of the compounds in the active site of the target proteins [17]. On the other hand, CDocker interaction energy provides the different non-bonded interactions within the binding site of the target site of the protein [29]. In this study, since the screening compounds were the flavonoid compounds from citrus species, the flavonoid like co-crystal ligand 3WL of Mpro was used as control for the entire study. Molecular docking revealed that 5 compounds out of the 44 selected Citrus species flavonoid compounds showed better CDocker energy and CDocker interaction energy than the co-crystal ligand 3WL (Table 1).
Table 1 Preliminary simulation-based docking results of the top five flavonoid compounds Considering the dynamic nature of the physiological conditions, the best 5 compounds were allowed to redock with the target protein in a flexible mode. During the flexible docking, the residues of the binding site of the target protein were kept flexible. The flexible docking analysis of the best 5 compounds also showed better CDocker energy as well as CDocker interaction energy (Table 2). The binding energy of the compounds to the target protein Mpro was calculated to understand the spontaneity of formation of drug-target/ligand–receptor complex suggesting the stable drug-target complex. The calculated binding energy of the compounds showed lower binding energy in comparison to the control co-crystal inhibitor 3WL shown in Table 2.
Table 2 Flexible docking binding energy of the best five flavonoid compounds as compared to co-crystal inhibitor In the present study, the top 5 compounds formed higher number of H-bonds with the target protein than the co-crystal inhibitor 3WL. This may suggest that the test compounds have a higher tolerability against target protein putative mutations than 3WL [30]. The interaction and the number of H-bonds formed for the 5 screened compounds with Mpro are shown in Fig. 1.
From Fig. 1, it was observed that the compounds Taxifolin, Eriodictyol, Luteolin and Quercetin formed 4 or more H-bonds with the active site of SARS-CoV-2 Mpro, whereas compound Isoscutellarein and the co-crystal inhibitor 3WL formed 2 H-bonds with the target protein. Analysing the residues involved in interactions between the compounds and Mpro, showed that all the compounds interacted with the two important catalytic residues His41 and Cys145 of the active sites of Mpro. These two residues are present in the catalytic domain of SARS-CoV-2 Mpro and actively participate in the catalytic activities of Mpro. Hence, binding of the flavonoid compounds to these residues may reduce the catalytic activities of Mpro which eventually will lead to the reduction of viral replication.
The 5 compounds showing the best results in terms of CDocker energy, CDocker interaction energy, calculated binding energy and number of H-bonds were then subjected for the assessment of drug likeness and assessment of different toxicity parameters. It was observed that 3 compounds, namely Taxifolin, Eriodictyol and Luteolin did not show any toxicity against the toxicity parameters used in the study. On the other hand, the compounds Isoscutellarein and Quercetin showed the presence of mutagenic properties. Quercetin also showed the presence of tumorigenic property. The results of toxicity prediction and the drug likeness property analysis are shown in Table 3. Among all the screened compounds, Taxifolin possessed the highest drug likeliness property followed by Isoscutellarein and Luteolin. Based on the training dataset used by the ORISIS Data Warrior, the compounds with higher or positive drug likeliness values are considered as good drug candidates. Since the compounds Isoscutellarein and Quercetin showed the presence of toxic effects, we did not consider these compounds for further analysis. The non-toxic compounds, namely Taxifolin, Eriodictyol and Luteolin were further subjected to molecular dynamics simulation studies.
Table 3 Toxicity and drug likeness analysis Molecular dynamics simulation study was performed to understand how the ligands bind to the receptor by mimicking in vitro and in vivo experiments [4]. Thereby, the RMSD, RMSF and ROG of the Mpro-ligand complexes were calculated over the simulation period of 30 ns and compared with the control (Mpro-3WL complex) to observe the stability of the complexes. To calculate the RMSD, RMSF and ROG of the complexes, the whole protein–ligand complexes were used. After completion of simulation, the RMSD plots for all the compounds were analyzed where it was observed that Mpro-3WL and Mpro-CF5 (Eriodictyol) had almost similar deviations with control complex (Mpro-3WL) within the simulation period. However, Mpro-CF3 (Taxifolin) and Mpro-CF8 (Luteolin) had comparatively higher deviation than Mpro-3WL; where Mpro-CF8 took more time (almost 10 ns) to reach plateau state (Fig. 2A). The fluctuations of the individual residues within the simulation period were plotted where the RMSF of the residues for the Mpro-3WL and Mpro-CF5 had almost similar pattern with minimum deviations from each other. For the complex Mpro-CF3 and Mpro-CF8, there were significant deviations which may indicate that the presence of ligands influenced the stability of the enzyme Mpro and changed its dynamic behaviour. Notably, the replicate analysis of the protein–ligand complexes showed similar RMSD deviations indicating a convergence of results. Comparing the RMSD of the complexes, Mpro-CF8 showed fluctuations all over the regions. Mpro-CF3 showed considerable fluctuations within residue 248 to 256 (Fig. 2B). These findings were also supported by calculated radius of gyration (ROG) for the Mpro-ligand complexes (Fig. 2C).
In this study, we further analysed the interaction between the compounds with Mpro and checked the formation of H-bonds after 30 ns MD simulation. The interaction pattern of the compounds after 30 ns MD simulation is shown in Fig. 3. After 30 ns MD simulation, the co-crystal ligand 3WL formed 4 H-bonds, whereas compounds Taxifolin
and Eriodictyol formed 7 and 4 H-bonds respectively with the target protein. Luteolin formed 3 H-bonds with the residues of the active site of SARS-CoV-2 Mpro after MD simulation for 30 ns. Although all compounds formed H-bonds with the catalytic residues i.e. His41 and Cys145 as observed during molecular docking, but after 30 ns simulation only Taxifolin interacted with these residues forming H-bonds. The co-crystal ligand 3WL formed weak interaction with Cys145 after 30 ns of MD simulation.
The structural conformations of the protein–ligand complexes before and after MD simulation were also observed by superpositiong the complexes and are depicted in Fig. 4.
In the MD simulation analysis, we analysed the different H-bonds formed within the active site of the target proteins with the flavonoid compounds during the process of simulation upto 30 ns. The number of H-bonds formed and their distances within the simulation period for each conformation were generated (Fig. 5). In Mpro-3WL complex total 5 hydrogen bonds were found, where 1 H-bond with Glu166 residue showed almost consistent average distance with low deviation. However, the other 4 H-bonds with Thr190, Asn142 and Glu166 initially had very large average distance but after approximately 18 ns it was reduced and stabilized. In Mpro-CF3 complex 7 H-bonds were observed where the H-bond with Leu50 residue showed substantial deviations around 10–20 ns and subsequently stabilized by the end of the simulation. In Mpro-CF5 complex 4 H-bonds were observed among which 2 H-bonds with Asp48 residue remained stable throughout the simulation. The H-bond with Ser46 residue stabilized within 4 ns of simulation, while the H-bond with Glu166 residue remained highly unstable with considerable fluctuation of distances throughout the simulation. In Mpro-CF8 complex 3 H-bonds were formed and the distance of bond with Asn119 varied throughout the simulation. The bond with Asp48 showed deviation around 18–26 ns and then reverted to its previous state. The distance for the H-bond with Ser46 gradually decreased to less than 5 Å. The 2D interaction generated for the complexes before and after simulation showed that the number of H-bonds formed increased for Mpro-3WL and Mpro-CF3. For Mpro-CF5, the number of H-bonds remained same but the interacting residues had changed. But for Mpro-CF8 the number of H-bonds decreased after simulation.
During MD simulation analysis, the binding free energies (ΔG) of the protein–ligand complexes were calculated upto 30 ns using MM-PBSA based method. From the result, the average ΔG of the Mpro-3WL complex was found to be − 51.1666 kcal/mol. The average ΔG of the Mpro-CF3, Mpro-CF5 and Mpro-CF8 were found to be − 60.3367 kcal/mol, − 68.3025 kcal/mol and − 55.7587 kcal/mol, respectively. It has been observed from the MM-PBSA analysis that the complexes formed between the flavonoids and the target, possessed lower ΔG than the complex of receptor-co crystal ligand. This indicates the formation of stable complexes with spontaneous interaction by the test ligands in the active binding pocket. The binding free energies (ΔG) of protein–ligand complexes during the MD simulation period are shown in Fig. 6.
The predicted activity (IC50) of the compounds was determined with the help of 3D-QSAR analysis. As the IC50 value of 3WL has not yet been reported, the IC50 value of 3WL was predicted by generating 3D-QSAR model from the inhibitor deposited in PostEra website [26]. To calculate the energy potential in 3D-QSAR method, 3 dimensional structures of a set of compounds were used. The calculated potential energies were then used as descriptors to build the 3D-QSAR model to corelate the 3D-structures and their biological activities. The generated QSAR model gives the information on correlation between the molecular field and the biological activities of the compounds [31]. In this study, the predicted activity i.e. IC50 of the compounds as well as control were determined by using the following linear equation.
$$ Activity\, \left( {predicted} \right) = \mathop \sum \limits_{i = 1}^{NEP} CEP\left( i \right)VEP \left( i \right) + \mathop \sum \limits_{i = 1}^{NVDW} CVDW\left( i \right)VVDW\left( i \right) $$
where NEP: the number of descriptors of electrostatic potential (EP); CEP(i): model coefficient for electrostatic potential descriptor i; VEP: value of electrostatic potential on a grid point; NVDW: number of descriptors of van der Waals (VDW) interaction: CVDW(i): model coefficient for VDW descriptor i; VVDW: van der Waals interaction energy on a grid point.
The linear plot of the training set and the test set are depicted in Fig. 7. The determined R2 value for training set was found to be 0.912 and for test set was found to be 0.846 during validation. From the 3D-QSAR analysis, the predicted IC50 value of 3WL was observed to be 5.98 μM, whereas the compound Taxifolin was observed to be 9.63 μM followed by Luteolin (14.47 μM) and Eriodictyol (16.08 μM). As the actual IC50 value of 3WL has not been reported yet, the predicted IC50 value will not give the actual idea of its minimum inhibitory concentration. Thus, the complexes were considered for further SeeSAR analysis to assess the role of individual atoms towards the binding affinity.
To further assess the binding affinity of 3WL and Taxifolin with Mpro before and after 30 ns MD simulation, HYDE (Hydrogen bonds and Dehydration) analysis was performed using SeeSAR of BiosolveIT [28]. HYDE analysis consistently designates hydrogen bonding between ligand and receptor, hydrophobic effect as well as desolvation. HYDE also helps in predicting the particular region of the complex which undergoes favourable and unfavourable binding ligand receptor. The HYDE scoring determined the Gibb’s free energy by calculating the difference between bonded and unbonded states of the complex [32]. The specific atoms which were favourable for good binding affinity (dark green sphere) and their individual HYDE values for the best compound Taxifolin and the co-crystal inhibitor 3WL in both before and after MD simulation, are shown in Fig. 8. Identification of the role of atoms present in the ligands is crucial in predicting overall binding affinity or interactions with the target sites of the protein. From Fig. 8a, it was observed that in case of 3WL (5,6,7‐trihydroxy‐2‐phenyl‐4H‐chromen‐4‐one), before MD simulation, the phenyl ring at 2nd position had major contributions towards the overall HYDE score (kcal/mol). But mainly the oxygen atom of the 6-hydroxyl group, carbon atom of the carbonyl group of 4 position and oxygen atom of the 1 position of the bicyclic ring system (with red coronas) had negative impact on the overall binding affinity of 3WL. Similarly, in case of Taxifolin ((2R,3S)‐2‐(3,4‐dihydroxyphenyl)‐3,5,7‐trihydroxy‐3,4‐dihydro‐2H‐1‐benzopyran‐4‐one), before MD simulation, the oxygen atom at the 1 position and the oxygen atom of the hydroxyl group at 3 position had negative impact towards the binding affinity of the compound. Moreover, the carbon atom at 7 position and the oxygen atom of the hydroxyl group at 7 position also demonstrated negative impact on the binding affinity. On the other hand, the atoms of the 3,4 dihydroxyphenyl group at 2 position of the bicyclic ring showed positive contributions towards the overall binding affinity of the molecule. In the bicyclic ring, the oxygen atom of the ketone group and carbon atom at the 6 and 8 position had positive effect towards the binding affinity of the molecule (Fig. 8b).
HYDE analysis was performed for the complexes after MD simulation also and represented in (Fig. 8c, d). From Fig. 8c, it was observed that in case of 3WL (5,6,7‐trihydroxy‐2‐phenyl‐4H‐chromen‐4‐one), the phenyl ring at second position had major contributions towards the overall HYDE score (kcal/mol). Further, the carbon atoms at 2 and 3 positions, and oxygen atoms of the hydroxyl groups at 5, 6 and 7 position had also showed positive contributions towards the overall binding affinity. However, the oxygen atom of the carbonyl group at 4 position (with red corona) was seen to have negative impact on the overall binding affinity of 3WL. Similarly, in case of Taxifolin ((2R,3S)‐2‐(3,4‐dihydroxyphenyl)‐3,5,7‐trihydroxy‐3,4‐dihydro‐2H‐1‐benzopyran‐4‐one), after MD simulation, the oxygen atom at the 1 position and the oxygen atom of the hydroxyl group at 3 and 5 position had negative impact towards the binding affinity of the compound. The carbon atom at 6 position of the 3,4-dihydroxyphenyl cyclic ring system present at 2 position, also had a major negative impact on the binding affinity. On the other hand, the carbon atoms of the 3,4-dihydroxyphenyl group at 2 and 5 position, and oxygen atoms of the hydroxyl groups at 3 and 4 positions showed positive contributions towards the overall binding affinity of the molecule. From the bicyclic ring, the oxygen atom of the ketone group and carbon atom at the 3, 4a and 6 position, and oxygen atom of the hydroxyl group at 7 position had positive effect towards the binding affinity of the molecule (Fig. 8d).
The ranges of binding affinity of Taxifolin and co-crystal inhibitor 3WL were also calculated before and after MD simulation and shown in Table 4. From the results, it was found that before MD simulation, Taxifolin had less binding affinity towards the target protein Mpro due to the major negative impact governed by the orientation of oxygen atom at 1 position and oxygen atom of the hydrpxyl group at 3 position. However, in case of complexes after MD simulation, Taxifolin showed better binding affinity towards the target protein Mpro than the co-crystal ligand 3WL. This suggests that during the course of reaction, Taxifolin possesses better binding affinity towards Mpro of SARS-CoV-2.
Table 4 Predicted binding affinity ranges of 3WL and Taxifolin with Mpro before and after MD simulation Taxifolin is a widely distributed natural flavonoid and waste material of forest industry offers an economically viable source for its extraction. Taxifolin has earlier been reported for its antiviral effects against coxsackievirus B and antiradical activities [33, 34]. We believe that no study has been undertaken concerning taxifolin’s potential inhibitory activities against respiratory viruses. It is worth mentioning that our study corroborates a recently published report of potential activity of taxifolin against the main protease of SARS-CoV-2 [35].