INTRODUCTION

The threat of the ongoing spread of the SARS-CoV-2 (COVID-19) coronavirus, which is accompanied by its mutations, requires us to develop new procedures and tests to detect, study, and prevent the spread of the virus. One of the most effective approaches currently used in viral research involves the use of fluorescent probes and labels such as cyanine dyes [14] (in particular, oxamonomethinecyanines [5]). The main advantages of cyanine dyes are their high extinction coefficients, absorption and emission in the spectral range from UV to IR, overlapping the optical transparency window of biological samples, as well as their unique photophysical and photochemical properties, depending on the molecular environment [6]. The noncovalent interaction of cyanine dyes with biomolecules (nucleic acids, proteins) leads to a change in the photophysical and photochemical properties of the dyes and often proceeds with a multiple increase in fluorescence, which creates preconditions for the use of cyanines as probes in natural and artificial systems containing biomacromolecules [712]. The variety of structures and properties of different types of cyanine dyes (which contain polymethine chains of different lengths, as well as different terminal heterocycles and substituents) requires molecular modeling in silico for the preliminary screening of the compounds [1315]. We have previously studied the interaction between various polymethine (cyanine) dyes (45 compounds in total) and the spike protein (S) SARS-CoV-2 in silico (by molecular docking) [16]. In this study, using the modeling method in silico (molecular docking), the study of the interaction of cyanine dyes with the molecular components of the SARS-CoV-2 coronavirus was continued: the interaction of various types of dyes (42 compounds) with different types of the enzyme protein of the SARS-CoV-2 virus, NSP15 endoribonuclease, was simulated: as the original (wild) type and mutant types.

EXPERIMENTAL

Molecular docking was performed using the DockThor site [1719]. In the experiment, we used the target protein structures proposed on the DockThor website; as a reference, we studied the wild type of protein-enzyme NSP15 (PDB-code: 6wxc), corresponding to the reference sequence of the virus’ genome (isolated in Wuhan, Hubei province, China, December 2019) [20]. Docking was also performed with four mutant varieties of the NSP15 protein: S294A, S294T, Y343C, and Y343H. The structure of NSP15 S294A corresponds to a mutation of a key amino acid region with a neutral functional effect related to the replacement of Ser294 by Ala294 (replacement of a hydrophilic residue by a hydrophobic one at the ligand binding site) [21]. The S294T variety has an amino acid substitution of Ser293 for Thr293 (without changing the functional effect); Y343C was obtained by replacing the amino acid residue Tyr343 with Cys 343, which is directly related to the ligand binding site; and the structure of Y343H corresponds to a mutation in the protein binding site with the replacement of Tyr343 by His343 [20].

Nonprotein components (water molecules, ions, cofactors, and ligands) were removed (DockThor), hydrogen atoms were added, and the hydrogen bond network was optimized taking into account pH 6.2 [22]. For mutant forms of NSP15, the wild type (with the PDB code: 6wxc) was used as the template. The proposed structures were used in this study; no additional correction of the protonation of amino acid residues was performed.

The experiments were carried out in the blind docking mode; the size of the approximating grating is 22 Å, the center is at X = 63.94, Y = –72.47, Z = 26.37, and the sampling step is 0.25 Å [1720]. When analyzing the results, up to the five best configurations were taken into account.

Symmetric squarylium dyes were investigated: mono-, tri-, penta- and heptamethine cyanines, both cationic and anionic (having negatively charged sulfogroups), with various heterocycles (benzimidazolyl, benzothiazolyl, benzoxazolyl) and substituents in heterocycles. Trimethinecyanines (carbocyanines) also had various substituents in the heteroatoms and meso-position of the polymethine chain. To create PDB structures of ligand dyes and optimize their geometry (force field MMFF94), we used the Avogadro molecular editor [23]. For 3D visualization and analysis of the docking results, the UCSF Chimera program was used [24]. The original isomeric form of the meso-substituted carbocyanines corresponded to the cis-configuration as the most stable one for dyes of this type in polar solvents [25].

The stability of possible dye-protein complexes was judged by the sign and value of the total energy of the system Etot obtained as a result of docking [17, 20]: it was assumed that the formation of stable complexes is more probable at sufficiently low (i.e., large in absolute value) negative energies.

RESULTS AND DISCUSSION

Molecular Docking of Cationic Carbocyanine and Neutral Squarylium Dyes with NSP15

The interaction of cationic meso-substituted carbocyanine (D1–D7) and neutral squarylium dyes (SD1–SD5) with various variants of NSP15 was studied. The structural formulas of the dyes are shown in Fig. 1. Carbocyanine dyes differ from each other as substituents in the meso-position of the polymethine chain, and by terminal hydrocycles. Squarylium dyes differ from each other by substituents R1 and R2.

Fig. 1.
figure 1

Structural formulas of cyanine dyes, as well as neutral and anionic squarylium dyes.

The investigated cationic and neutral dyes are characterized by positive values of the total energy Etot when interacting with all types of NSP15. In particular, when docking D1 and D7 with wild-type NSP15, we obtained Etot = 16.7 ± 0.7 and 55.4 ± 0.33 kcal mol–1, respectively, which practically coincides with the result obtained for the mutant variant NSP15 Y343H. The smallest values of Etot when docking with the wild type NSP15 were obtained for oxacarbocyanine D5 (Etot = 6.95 ± 0.17 kcal mol–1) and for thiacarbocyanine D3 (Etot = 10.6 ± 0.93 kcal mol–1). Note that for cationic carbocyanine dyes, the van der Waals energies (EvdW) turned out to be negative: for dyes with different heterocycles, they are close to each other and range from –19.6 ± 3.32 kcal mol–1 (dye D4) to –27.8 ± 0.67 kcal mol–1 (dye D3). Similar results (Etot > 0) were obtained earlier for cationic dyes during docking with the SARS-CoV-2 spike protein [16].

Positive Etot values were obtained by docking uncharged squarylium dyes with NSP15. In particular, for dyes SD2 and SD3 with substituents R1 = OH, values Etot = 59.4 ± 1.15 and 15.2 ± 0.79 kcal mol–1 (with the wild type NSP15), respectively. Docking with NSP15 unsubstituted SD1 gives a significantly more favorable (but also positive) energy value (Etot = 7.7 ± 0.56 kcal mol–1). Similar data were obtained during the docking of neutral squarylium dyes with spike protein [16]. Positive Etot values may indicate insufficient stability of complexes of cationic polymethine dyes and neutral squarilium dyes with NSP15.

Regardless of the use of mutant NSP15 subtypes, electrostatic energy, Eel, and van der Waals energy, EvdW, turned out to be negative; for the majority of dyes, their average values were –13.7 ± 1.01 and –13.4 ± 3.7 kcal mol–1, respectively. Bromosubstituted dyes SD4 and SD5 when docked with NSP15 (wild type) have EvdW = –29.1 ± 0.62 and –25.6 ± 0.26 kcal mol–1 and Eel = –0.31 ± 0.51 and –1.28 ± 0.39 kcal mol–1, respectively.

Molecular Docking of Anionic Dyes with NSP15

Anionic Squarylium Dyes. Together with uncharged (neutral) squarylium dyes (SD1–SD5), molecular docking of anionic squarylium dyes SD1.1–SD1.5 with NSP15 was carried out (the structures of the dyes are shown in Fig. 1). As a result of docking, negative values of the total energy (Etot <0) were obtained for most of the anionic squarylium dyes studied. In particular, upon the interaction of SD1.4 with the wild and mutant (Y343H) types of NSP15, Etot = –8.78 ± 0.55 and –9.46 ± 1.14 kcal mol–1. At the same time, in the case of SD1.1, positive values Etot = 20.5 ± 0.42 and 20.6 ± 0.89 kcal mol–1 were obtained, respectively. This is possibly due to steric hindrances to the complex formation created by C(CH3)3 [16].

Strongly negative values Etot were obtained by docking squarylium dyes SD1.2 and SD1.5; at the same time, the most negative values of Etot were obtained when docking with S294T for the dye SD1.2 (Etot = –72.1 ± 1.32 kcal mol–1). The energies of electrostatic interaction and van der Waals for the anionic squarylium dyes SD1.1–SD1.5 turned out to be negative. The lowest values Eel (–57.2 ± 5.1 and –65.1 ± 3.4 kcal mol–1, respectively) were obtained for dyes SD1.2 and SD1.4, which have sulfogroups in the substituents at N atoms. On average, for the studied anionic squarylium dyes, EvdW = –6.2 ± 3.0 kcal mol–1, whereas Eel = –55.7 ± 8.5 kcal mol–1, i.e.; Eel is higher in absolute value than EvdW by a factor of 4.2 to 20, which indicates a significant contribution of the forces of electrostatic interaction to the stabilization of the dye–protein complexes.

Squarylium dyes in complexes with NSP15 often take on a twisted (close to perplanar) configuration, for example, SD1.1. with the wild type NSP15 (Fig. 2a). However, the SD1.7 dye with all varieties of NSP15 is characterized by practically flat structures close to trance-isomers, in which the terminal heterocycles of the dye are only slightly (by ~20°) removed from the plane.

Fig. 2.
figure 2

Docking results with NSP15 dyes: (a) SD1.1 (wild type NSP15) and (b) SD1.5 (Y343H). The initial configurations of the dyes corresponded to cis-isomers.

Docking places ligand molecules in the central cavity of the trimer near one of the protein subunits. Using the UCSF Chimera program [22], we analyzed the surface binding and searched for hydrogen bonds between the ligand and protein atoms, as well as for interatomic conflicts and contacts (based on the van der Waals radii of the molecules). The binding analysis showed that in a complex with NSP15, the squarylium dye molecules SD1.1 are located at a distance of ~4 Å from GLY165, and the TYR89 subunit C, the sulfogroup of the dye, is located at a distance of 4 Å from ARG91. In the cases of SD1.5 and mutant forms S294A and Y343H, docking gave almost identical positions of the ligands (interaction with subunit A). In the case of SD1.5 and Y343H (the structure of the dye in the complex is shown in Fig. 2b), the dye forms hydrogen bonds (2 Å) with GLN203 (with the participation of the terminal sulfogroup) and with ASP269 (due to the O atom of the squarylium ring). The search for interatomic contacts indicates the close (3 Å) contact of ARG92 with the sulfogroup of the dye; and the contact (4 Å) of GLU268 with the squarylium ring is also noted.

Anionic Cyanine Dyes. Molecular docking was also performed with a series of 26 anionic cyanines with different polymethine chain lengths: five anionic monomethinecyanine dyes (see Fig. 1, structures 1.1–1.5), nine meso-substituted trimethine cyanine dyes (carbocyanines, structures 2.1–2.9), seven pentamethine cyanine dyes (structures 3.1–3.7), and five heptamethine cyanine dyes (structures 4.1–4.5). The meso-substituted carbocyanines corresponded the cis-configurations.

Docking showed that ~90% of the selected compounds were characterized by negative values of the total energy (Etot < –19 kcal mol–1; see Table 1). Significant negative values Etot were obtained for anionic monomethinecyanine dyes: for example, when docking dye 1.3, which is a benzoxazolyl derivative, with wild-type endoribonuclease, Etot = –83.8 ± 1.8 kcal mol–1. Note that, for its analogs 1.1 and 1.5, we obtained ~30–44 kcal mol–1 Etot value. It is possible that here, as in the case of the squarylium dyes, the steric factor plays a role, lowering Etot (in absolute value) for the more bulky dyes 1.1 and 1.5 compared to 1.3.

Table 1.   Results of molecular docking of anionic cyanine dyes with NSP15 (wild type): total interaction energy of van der Waals energy and electrostatic interaction (kcal mol–1)

The docking of carbocyanines (Fig. 1, structures 2.1–2.9) with NSP15 (wild type) also showed that these dyes can form energetically stable complexes. In particular, when docking oxacarbocyanines 2.3, 2.4, and 2.6, the values Etot in the range –31.7 to –63.9 kcal mol–1 were obtained.

Thiacarbocyanines 2.1 and 2.2 show comparable values of Etot (see Table 1). Positive values Etot were obtained for dyes 2.5 and 2.7 (Table 1), which have bulky substituents in heterocycles.

Negative Etot values were obtained for most pentamethinecyanine (dicarbocyanine) dyes (in particular, for thiadicarbocyanine 3.2 and its oxa analog 3.4) with NSP15, which may indicate the stability of the intermolecular complexes of these dyes. For dyes 3.1 and 3.3, positive values were obtained for Etot (2.7 ± 1.57 and 4.1 ± 2.01 kcal mol–1, respectively). The values of the energies of electrostatic interaction and van der Waals for most pentamethine cyanine dyes (with the exception of 3.7) turned out to be negative; the smallest values of Eel (Table 1) were obtained for dyes 3.2 and 3.3 with –OCH3 substituents. 1).

The study of the interaction of anionic heptamethinecyanine dyes (compounds 4.1–4.5) with NSP15 showed that noncovalent interaction is energetically possible for some of these dyes, leading to the formation of stable complexes (Etot < 0). For dyes 4.2 (oxacyanin) and 4.4 (indoderivative), Etot (18.3 ± 1.01 and 8.9 ± 2.8 kcal mol–1, respectively) is positive.

The values Eel and EvdW for most heptamethinecyanine dyes turned out to be negative, with the exception of dye 4.5 (for it, EvdW = 5.1 ± 3.31 kcal mol–1). For pentamethine cyanine (dicarbocyanine) dyes 3.1–3.6, the value Eel turned out to be 3.4–24 times higher in absolute value than EvdW; for heptamethine cyanine dyes 4.1–4.4, Eel was higher in absolute value than EvdW  by factors of 5.2 to 9.4. This indicates a significant contribution of the forces of electrostatic interaction to the stabilization of the dye–protein complexes. Indeed, molecular docking of indocarbocyanines with four sulfogroups (compounds 2.9, 3.7, and 4.5) gives significantly more negative values of the total energy (Etot ~–77.8 to –96.0 kcal mol–1) due to the large contribution of the electrostatic interaction to the stabilization of complexes (Eel ~–87.6 to –90.5 kcal mol–1). The greatest negative values Etot were obtained for indocarbocyanin 2.9 by docking with NSP15. Dyes with a longer polymethine chain show more moderate values of Etot (see Table 1).

Docking of anionic cyanine dyes with mutant forms of the target protein (S294A, S294T, Y343C, and Y343H) showed almost the same binding energy parameters as for the wild type. For monomethinecyanine 1.1, the total energy values obtained for the four mutant forms practically coincide with the Etot obtained for the original protein: –53.62 ± 0.49 and ‒53.2 ± 0.86 kcal mol–1, respectively. A similar situation is observed in the case of other anionic dyes. Thus, for the pentamethine cyanine oxa dye 3.4, the values of Etot obtained by docking with S294A, S294T, Y343C, and Y343H were –74.3 ± 1.56 kcal mol–1; for the heptamethine oxa dye 4.2, docking with mutant forms gives Etot = –58.9 ± 1.09 kcal mol–1, which, taking into account the confidence interval, corresponds to the data in Table 1. The only exception is the oxacarbocyanine dye 2.3, when docking with mutant forms of the protein for which Etot decreases by 7% (Etot = –62.0 ± 1.29 kcal mol–1).

Anionic cyanine dyes can bind to SARS-CoV-2 NSP15 in various conformations. Monomethine thiacyanines are characterized by a twisted (nonplanar) configuration of their molecules in the bound states; however, for oxa dyes 1.3 and 1.4, almost flat (energetically optimal) configurations are possible. For monomethinecyanine dyes, the analysis of surface binding did not reveal the presence of hydrogen bonds and interatomic conflicts.

Configurations close to planar are characteristic of carbocyanines 2.1, 2.2, and 2.3. In particular, for 2.2, docking with NSP15 (wild type) gives a dye configuration that is close to the trance-isomer configuration (Fig. 3a); in this case, the methyl group is almost perpendicular to the plane of one of the heterocycles, and the other heterocycle is rotated by an angle of ~15°. The interaction of oxacarbocyanin 2.3 and indocarbocyanin 2.8 is also characterized by an almost flat trance-configuration, while meso-substituents are displayed outside the planes. For dyes 2.4 and 2.6, the calculations give perplanar shapes twisted by ~45°. Docking of the 2.9 dye, which has an additional pair of sulfogroups, with NSP15 (wild type) suggests a distorted configuration close to a trance-isomer (Fig. 3d).

Fig. 3.
figure 3

Results of docking of anionic dyes with NSP15: 2.2 (a), 3.4 (b), 4.1 (c), and 2.9 (d).

Docking pentamethinecyanines 3.1 and 3.2 with NSP15 gives strongly bent structures due to the distorted bond angles of the polymethine chain in which the terminal heterocycles are located almost parallel to each other. Dyes 3.4 (Fig. 3b) and 3.7 (they have 4 sulfogroups) acquire twisted (close to perplanar) structures upon interaction with NSP15; the dyes do not form hydrogen bonds and interatomic contacts with the protein.

For the docking of heptamethinecyanine dyes, various variants of molecular structures have been obtained, although mainly curved structures are characteristic of them. During the docking of dye 4.1, the distortions twist the terminal heterocyclic residues, and the substituents in positions 3 and 3' can be removed from the plane of the molecule (Fig. 3c). For dye 4.2, a structure with an almost perplanar position of terminal heterocycles was obtained; the dye does not form hydrogen bonds and interatomic contacts with NSP15. Dyes 4.3 and 4.5 (4 sulfogroups) have sickle-curved structures.

CONCLUSIONS

The results of molecular docking have shown that the noncovalent complexation of cyanine and squarylium dyes with SARS-CoV-2 NSP15 is significantly influenced by the Coulomb interactions in the dye–protein structure. Positive values of the total energy of the complex (Etot > 0) for cationic cyanine (D1–D7) or neutral squarylium (SD1–SD5) dyes may indicate the instability of the complexes of these dyes with NSP15. For many anionic dyes, negative values of Etot were obtained, which can characterize the higher stability of the corresponding complexes. Even more favorable results were obtained for indocarbocyanines with four sulfogroups that increase the negative charge of the dye molecule (2.9, 3.7, and 4.5). In addition, for most anionic dyes, the value Eel significantly exceeds in absolute value EvdW, which also indicates the leading role of Coulomb forces in the formation of complexes. A similar effect of the Coulomb interaction on the stability of the dye–protein complex was found during the docking of cyanine dyes with the SARS-CoV-2 spike protein [16].

The structural differences in the dye molecules significantly affect the stability of the complexes. For oxa dyes, in general, more negative values were obtained for Etot than for their analogs with thiazole and indole terminal heterocycles. Perhaps this is due to the more compact structure of oxa dyes than for the thia- and indo-analogs.

According to the Etot value criterion characterizing the stability of complexes with NSP15, we can distinguish anionic squarylium dyes SD1.2 and SD1.5, monomethinecyanine anionic dyes 1.1–1.5, thia- and oxa-cyanines 2.1–2.4, 2.6, 3.2, 3.4, 3.5, 4.1, and 4.3, indocyanines 2.8 and 3.6, as well as cyanines with four sulfogroups 2.9, 3.7, and 4.5.

When using dyes for the spectral detection of proteins, together with the stability of the dye–protein complexes, an important role is played by the change in the spectral-fluorescent properties of dyes during complexation. Meso-substituted carbocyanines (trimethinecyanines) are characterized by a sharper increase in the fluorescence intensity and more significant changes in the spectra due to the shift in the dynamic cis-trance-equilibrium, which makes it possible to detect very low concentrations of biomolecules when using dyes as fluorescent probes [7, 8, 10, 12, 26]. Considering the points made above, carbocyanines 2.1–2.4 and 2.6 are promising as such probes.

In complexes of carbocyanines with biomolecules, an intensification of the process of interconversion to the triplet state is observed, which leads to the population of the triplet energy levels [2729]. This determines the possibility of photochemical reactions involving the excited triplet states of dyes in these systems and can potentially lead to the formation of reactive oxygen species (photodynamic effect). Thus, cyanines may be promising for light-induced damage to virus components (proteins to which the dye binds) and, thus, inactivation of the virus itself. Note that the UV photoinactivation of the MERS-CoV and SARS-CoV-2 coronaviruses in blood serum by riboflavin is currently being studied [30, 31]. From this point of view, meso-substituted thiacarbocyanines 2.1 and 2.2 can be isolated for further practical research in this area.

It should be noted that when docking cyanine dyes with mutant forms of NSP15—S294A, S294T, Y343C, and Y343H—the value Etot practically coincided with that for the wild type NSP15 (there was a slight decrease in Etot only for dye 2.3 when docking with mutant forms). This may indicate the efficacy of cyanine dye probes not only for the original form of the SARS-CoV-2 virus but also for its mutant forms, which is especially important now, when the mutant forms of SARS-CoV-2 are becoming more widespread in the world.

The molecular graphics and analyses performed with the UCSF Chimera software were developed by the University of California in San Francisco’s Biocomputer, Imaging and Informatics Resource with support from NIH P41-GM103311.