WIDOCK: a reactive docking protocol for virtual screening of covalent inhibitors

Scarpino, Andrea; Petri, László; Knez, Damijan; Imre, Tímea; Ábrányi-Balogh, Péter; Ferenczy, György G.; Gobec, Stanislav; Keserű, György M.

doi:10.1007/s10822-020-00371-5

Open access
Published: 18 January 2021

Volume 35, pages 223–244, (2021)

Journal of Computer-Aided Molecular Design

Andrea Scarpino¹,
László Petri¹,
Damijan Knez²,
Tímea Imre³,
Péter Ábrányi-Balogh¹,
György G. Ferenczy¹,
Stanislav Gobec² &
…
György M. Keserű ORCID: orcid.org/0000-0003-1039-7809¹

Abstract

Here we present WIDOCK, a virtual screening protocol that supports the selection of diverse electrophiles as covalent inhibitors by incorporating ligand reactivity towards cysteine residues into AutoDock4. WIDOCK applies the reactive docking method (Backus et al. in Nature 534:570–574, 2016) and extends it into a virtual screening tool by introducing facile experimental or computational parametrization and a ligand-focused evaluation scheme, together with retrospective and prospective validation against various therapeutically relevant targets. Parameters accounting for ligand reactivity are derived from experimental reaction kinetic data or, alternatively, from computed reaction barriers. The performance of this docking protocol was first evaluated by investigating compound series with diverse warhead chemotypes against KRAS^G12C, MurA and cathepsin B. In addition, WIDOCK was challenged on larger electrophilic libraries screened against OTUB2 and NUDT7. These retrospective analyses showed high sensitivity in retrieving experimental actives and, when compound collections with diverse warheads were investigated, gave superior ROC curves, higher AUC values and better enrichments than the standard covalent docking tool available in AutoDock4. Finally, we applied WIDOCK to the prospective identification of covalent human MAO-A inhibitors acting via a new mechanism by binding to Cys323. The inhibitory activity of several predicted compounds was experimentally confirmed, and the labelling of Cys323 was demonstrated by subsequent MS/MS measurements. These findings demonstrate the usefulness of WIDOCK as a warhead-sensitive, covalent virtual screening protocol.

Introduction

Protein ligands with a covalent mechanism of action have become increasingly popular in both chemical biology and medicinal chemistry applications [1,2,3]. These compounds form covalent bonds with a targetable nucleophilic residue (most often cysteine, but also others such as lysine, serine, threonine or tyrosine) in an appropriate position at the ligand binding site [4]. Many previous studies have described the advantages and disadvantages of covalent enzyme inhibitors [5,6,7]. Potential advantages include increased ligand efficiency, prolonged duration of action leading to less frequent dosing, and the opportunity to target shallow binding sites that were previously considered “undruggable”. The most often cited drawbacks of the covalent mechanism of action relate to the potential for idiosyncratic toxicity, which highlights the importance of a balanced optimization of affinity and reactivity.

These compounds interact with the target first by forming a non-covalent protein–ligand complex, then the covalent bond is formed [8]. The functional moiety responsible for the covalent bond formation, also known as the “warhead”, is in most cases an electrophilic reactive group. Many functional groups in organic chemistry are able to react with thiol groups, and they are potential warheads for compounds intended to modify cysteine residues of biological systems. Since different chemotypes bind via different reaction mechanisms, it is clear that they are characterized by distinct intrinsic reactivities. Furthermore, the intrinsic reactivity of the warheads can be tailored by their substituents. Recently we showed [9] that the intrinsic reactivity of the electrophilic ligands might influence not only enzyme specificity, but also functional specificity (as observed with the endo- versus exo-peptidase activity of cathepsin B), and species specificity (as observed for MurA from Escherichia coli versus MurA from Staphylococcus aureus). These data confirmed that cysteine residues can be labelled by a variety of warheads, which therefore have to be tailored to the reactivity of the specific residue being targeted.

As alternatives to experimental approaches, virtual screening protocols contribute significantly to the identification of viable chemical starting points. However, their application to covalent inhibitors still faces challenges that derive mainly from the description of covalent bond formation. Conventional non-covalent docking methods are designed to describe the first step well, namely the formation of the non-covalent complex. Although various covalent docking and scoring tools have recently been developed, there is no general computational protocol that properly describes the close contact between reacting atoms, the formation of the covalent bond in a chemical reaction, and the conformations and interactions of the resulting complex.

Covalent docking tools follow different strategies to dock and rank covalent binders. For example, GOLD [10,11,12] uses the post-reaction conformation to rank ligands in a set. The best performing covalent docking protocol developed in AutoDock 4.2 [13] (hereafter referred to as AD4) is currently the flexible side chain method [14]. This method uses post-reaction ligand structures, whose conformations are sampled to optimize the interactions in the binding pocket. ICM-Pro [15, 16] generates bound complexes and then ranks ligand poses by excluding the interactions of atoms directly neighboring the newly formed covalent bond. The “Pose Prediction” mode of CovDock [17] combines pre- and post-reaction states. It first performs a non-covalent docking into a binding site where the reactive residue is mutated to Ala in order to avoid close contacts. Next, the rotamer states of the reacting residue are sampled to form the covalent complex with ligand poses occupying beneficial positions according to the previous non-covalent docking step. The final ranking of the ligands is achieved by scoring both pre- and post-reaction states. The “Virtual Screening mode” of CovDock [18] increases the throughput by reducing the number of simulated steps at the expense of somewhat lower binding mode prediction accuracies. A general feature of currently available docking protocols is that they do not explicitly take into account the reaction energy accompanying the covalent bond formation. As a consequence, these docking tools assume that screened ligands have similar intrinsic reactivity, an assumption that cannot be confirmed a priori. Although most of the current covalent docking applications are restricted to a preselected warhead chemotype, intrinsic reactivities are influenced by the substituents at the electrophilic center and are expected to differ considerably.

These limitations of the available covalent docking tools prompted us to develop WIDOCK, a protocol that applies the reactive docking methodology described by Backus et al. [19] and repurposes it into a warhead-sensitive virtual screening solution for diverse electrophilic libraries. The interaction between the ligand and protein atom pairs that are expected to form the covalent bond is modeled by incorporating a pseudo-Lennard–Jones potential into the non-covalent AD4 scoring function. This approach does not involve the formation of a chemical bond between the ligand and the nucleophilic residue, but it rather focuses on the prediction of the non-covalent interactions occurring in the binding pocket before the covalent bond formation and uses a reactivity-scaled reward for compounds able to place the reactive group in the cysteine vicinity. A similar protocol was also described by Forli and Botta [20] to overcome AutoDock’s limitations in treating flexible ring systems. WIDOCK applies the same form of the interatomic potential as in the reactive docking method (see later), with the important difference that we derive the parameters of the potential either from kinetic data measured in reactions of various small compounds against cysteine surrogates, or from calculated quantum chemical reaction barriers. This is a significant simplification with respect to the reactive docking method [19] and its adaptations [21, 22] where parameters were derived from large scale and expensive proteome analysis. Furthermore, while the cited methods were only used to predict cysteines that are most likely to be labeled across the human proteome and to interpret residue ligandability by compounds with limited warhead types, our objective here is to validate and apply WIDOCK as a virtual screening tool. Therefore, we investigate the labeling of reactive cysteine residues of several validated drug targets with compounds having various warheads reacting with diverse chemistries. 
The main advantage of WIDOCK compared to other available virtual screening tools is that a set of ligands with several warhead types and inherently different cysteine reactivities can be screened against the target of choice and prioritized for experimental testing. In contrast to covalent docking in AD4, this protocol does not require the initial modification of all structures in the set into their post-reaction conformation. While WIDOCK, as a virtual screening tool, primarily aims to discriminate actives from inactives, predicting the ligand conformation in the binding pocket is also a key aspect in the prospective design of covalent inhibitors. In a former study, we showed that non-covalent docking can provide good accuracy in the binding mode prediction of covalent binders, in a reduced time-scale as compared to covalent docking [23]. However, assessing the pose prediction accuracy of WIDOCK would require consistent reactivity data not currently available for a large set of covalent complexes [1, 24].

In the forthcoming sections we first show that by deriving the warheads’ reactivity parameters for the interaction potential from kinetic measurements against β-mercaptoethanol (BME), WIDOCK accurately reproduces the observed inhibitory activities found against KRAS^G12C [25]. KRAS is a widely studied GTPase having multiple oncogenic mutations found in almost 30% of human cancers. The G12C mutation has been shown to reduce the GTP/GDP exchange rate and consequently hyperactivates the enzyme causing abnormal cell growth [26]. KRAS^G12C covalent inhibitors targeting Cys12 can selectively bind to the oncogenic variant over the wild-type protein, thus leading to favorable activity modulation [27, 28]. ARS-853 [29] (1, Fig. 1), a potent KRAS^G12C covalent inhibitor, binds to the GDP-bound oncoprotein thus locking it in its inactive state.

Second, we apply WIDOCK to a covalent fragment library equipped with a diverse set of warheads. This set of compounds was recently screened against MurA (UDP-N-acetylglucosamine enolpyruvyl transferase) and cathepsin B (hereinafter also referred to as CatB) [9], and we show that experimental reactivities against glutathione (GSH) can also be used to derive parameters for the pseudo-Lennard–Jones potentials. MurA is a key enzyme in the first step of bacterial peptidoglycan biosynthesis and it is a promising antibacterial target as it has no human orthologue. Despite intense research, relatively few compounds have been described as potent MurA inhibitors [30, 31]. Fosfomycin [32] (2, Fig. 1) is the only clinically available MurA inhibitor that binds covalently to the Cys115 residue in the active site. Cathepsin B belongs to the family of lysosomal cysteine proteases and has been validated as a promising therapeutic target in various oncological diseases [33,34,35]. Although a variety of CatB inhibitors have been developed and investigated for the treatment of different types of cancer, none has yet been approved as a drug [36]. Current covalent inhibitors of CatB are mostly derived from epoxy-succinyl [37], vinyl-sulfone or nitrile warheads [38, 39]. Thus far, one of the most investigated CatB covalent inhibitors is E-64 [40, 41] (3, Fig. 1). However, other studies have also revealed that different Michael acceptors and halomethyl ketones can form a covalent bond with the active site cysteine [42]. As in the case of KRAS^G12C, we were able to accurately reproduce experimental screening results for both targets.

Furthermore, the diverse set of warheads tested against MurA and CatB allowed us to show that calculated reaction barriers against methyl-thiolate can also be used to parametrize the pseudo-Lennard–Jones potentials. This approach provided comparable results to those obtained by potentials derived from experimental reactivity parameters.

The virtual screening performance of WIDOCK was also evaluated on a larger scale, by screening the electrophilic library compiled and tested by Resnick et al. [43] against OTUB2 and NUDT7. The authors reported thiol reactivity data for the electrophiles in the set, thus allowing us to parametrize WIDOCK accordingly. OTUB2 is a member of the large family of deubiquitinating enzymes (DUBs). OTUB2 is linked to several biological pathways, indicating its therapeutic potential for conditions such as viral infections, amyotrophic lateral sclerosis and diabetes [44,45,46]. NUDT7 is a nudix hydrolase involved in the specific degradation of CoA [47, 48]. Since CoA metabolism is key for the regulation of glucose homeostasis, NUDT7 has also raised interest for its potential role in the treatment of diabetes [49]. Notably, WIDOCK was able to retrieve many of the covalent probes identified by Resnick et al. for both of these targets, thus showing its scalability in different settings, although with varying enrichment rates.

Finally, we have validated WIDOCK prospectively by screening electrophilic fragments against human monoamine oxidase A (MAO-A). MAO-A performs oxidative deamination of monoamine substrates, and plays a key role in the metabolism of neurotransmitters and in the detoxification of amine compounds. Selective inhibitors of MAO-A are used in clinical applications as antidepressants [50,51,52] as they lead to increased levels of neurotransmitters in noradrenergic and serotoninergic systems [52]. Several irreversible inhibitors of MAO-A have been developed in previous years, such as clorgyline [53] (4, Fig. 1). To the best of our knowledge, all reported covalent MAO-A inhibitors with a validated labelling position are bound to the FAD cofactor. In contrast, our objective was to identify fragment-sized compounds that bind to an active site cysteine and inhibit the enzyme by blocking access to the active site. Docking electrophilic fragments to the active site Cys323 of MAO-A with WIDOCK predicted several hits that were confirmed in both biochemical and MS/MS measurements.

Overall, WIDOCK demonstrates that the reactive docking method can be applied as a warhead-sensitive virtual screening tool for the prioritization of covalent binders by incorporating cysteine reactivity parameters into AD4. The usefulness of this approach was demonstrated on a number of retrospective and prospective applications. We believe that the availability of the parameter set and the easy implementation of the protocol into AD4 would facilitate a number of further prospective applications to identify new covalent inhibitors for therapeutic targets.

Methods

Compound sets and reactivity data

The electrophilic compounds tested [25] against KRAS^G12C (5–24, structures shown in Supporting Figure S1 and SMILES provided in Supporting Table S1) were evaluated with both standard covalent docking and WIDOCK in order to analyze the predictive power of the two methods against this challenging target. Compound reactivities measured against BME were obtained from [25]. If the reactivity of a particular compound was not explicitly reported, we used that of the closest analogue bearing the same electrophilic warhead.

Electrophilic fragments tested against MurA, CatB and MAO-A (25–53, structures shown in Supporting Figure S2 and SMILES provided in Supporting Table S1) represent eight different warhead chemotypes reacting via two reaction mechanisms: Michael-type nucleophilic addition and nucleophilic substitution. The intrinsic thiol reactivities (see Table 1) were determined by kinetic measurement of adduct formation with L-glutathione (GSH), a widely used cysteine surrogate, by an HPLC–MS methodology, as described in the referred study [9].

Table 1 Experimental results of the investigated electrophilic fragments in kinetic measurements against L-glutathione (GSH) and in single point enzyme activity assays with MurA and CatB, expressed as residual activities (RA) % at 100 μM

In the larger-scale retrospective virtual screening, an electrophilic fragment library of chloroacetamides and acrylamides was screened against OTUB2 and NUDT7 as described by Resnick and colleagues [43]. In order to have homogeneous reactivity data, we inspected the thiol reactivity reported for the set of 993 compounds and selected those that best fit the described kinetic model (630 compounds with R² > 0.8 in all three independent thiol screens). As detailed in the referred study, kinetic data were obtained from a high-throughput thiol-reactivity assay measuring the rate of alkylation of reduced Ellman’s reagent (DTNB, 5,5′-dithio-bis-2-nitrobenzoic acid). The set of 630 electrophiles was then filtered for compounds whose protein labeling was correctly assigned and reported in the cited reference (599 and 616 compounds for OTUB2 and NUDT7, respectively; SMILES are provided in Supporting Table S1).
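The selection of well-fitted compounds described above can be sketched in a few lines of Python. This is only an illustration, not the authors' script: the `ThiolFit` record, the function name and the compound identifiers are hypothetical, and the R² values are made-up placeholders rather than data from the cited study.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ThiolFit:
    """Hypothetical record: one R^2 value per independent thiol screen."""
    compound_id: str
    r_squared: Tuple[float, ...]


def select_well_fitted(fits: List[ThiolFit], threshold: float = 0.8,
                       n_screens: int = 3) -> List[str]:
    """Keep compounds whose kinetic fit exceeds the threshold in every screen."""
    return [
        f.compound_id
        for f in fits
        if len(f.r_squared) == n_screens
        and all(r2 > threshold for r2 in f.r_squared)
    ]


# Placeholder values for illustration only.
fits = [
    ThiolFit("cmpd_A", (0.95, 0.91, 0.88)),
    ThiolFit("cmpd_B", (0.97, 0.79, 0.93)),  # fails the second screen
    ThiolFit("cmpd_C", (0.81, 0.85, 0.90)),
]
selected = select_well_fitted(fits)
print(selected)
```

Requiring the threshold in every screen, rather than on an average, mirrors the "in all three independent thiol screens" wording of the text.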

Ligand preparation

LigPrep [54] from the Schrödinger Suite was used to generate structural isomers, stereoisomers and different protonation states for ligands to be used in both standard and WIDOCK docking simulations. Following the flexible side chain covalent docking protocol available in AD4, ligand structures were modified by attaching the cysteine side chain atoms (Cβ–SG) to the site of alkylation. Modified ligand structures were prepared with LigPrep to generate structural isomers, protonation states and stereoisomers for chiral centers introduced upon cysteine attachment. AutoDockTools was used to prepare structures as PDBQT files for docking calculations.

Protein preparation

Crystal structures in the Protein Data Bank [55] were used for calculations on KRAS^G12C, MurA, CatB, OTUB2, NUDT7 and MAO-A. McGregor and colleagues [25] released two structures in the PDB entries 5V6S and 5V6V, in which Cys12 of KRAS^G12C is covalently bound to an acrylamide-based (5) and an aziridine-based (7) inhibitor, respectively. The two structures exhibit significant differences in the conformations of the highly flexible switch II region surrounding the binding site. Therefore, they were both used for docking calculations. For MurA, two protein conformations were considered. For cysteine reactivity predictions, the protein conformation co-crystallized with the cofactor UNAG and the irreversible inhibitor fosfomycin (PDB 1UAE [56]) was evaluated. This is indeed the conformation adopted by the enzyme in the presence of cofactor prior to addition of electrophilic compounds. For docking calculations, the enzyme’s open conformation in the PDB entry 3KQA [57] was preferred over that in 1UAE [56]. This is mainly due to the higher similarity between the fragment library members experimentally validated as MurA actives and terreic acid, the irreversible covalent inhibitor co-crystallized in 3KQA. It suggests that covalent fragments active against MurA are likely to induce a conformational change upon bond formation causing the opening of the loop containing the active site Cys115. A set of molecular descriptors for fosfomycin, terreic acid and MurA actives is provided in Supporting Table S2. For CatB, the recently deposited PDB entry 6AY2 [58] was selected due to its better resolution as compared to other covalent complexes. The residues in the catalytic diad Cys29-His199 were modeled in the thiolate and in the protonated form, respectively. For the retrospective screening against OTUB2 and NUDT7, we selected the high resolution co-crystal structures deposited by Resnick et al. [43] in the PDB entries 5QIV and 5QHA, respectively. 
For the former, Prime was used to build and optimize the missing side chain of the binding site residue Arg49. The screening was performed by targeting Cys51 in OTUB2 and Cys73 in NUDT7. For the prospective screening against MAO-A, the protein structure was derived from an irreversible complex with clorgyline. Among two available crystal forms, PDB entry 2BXS [59] was preferred over 2BXR [59] as in the latter the targeted cysteine Cys323 is involved in a disulfide bridge with Cys321. Structures were processed by removing co-crystallized inhibitors, water molecules and irrelevant subunits. Protein structures were prepared with the Protein Preparation Wizard in the Schrödinger Suite [60, 61], which was used to add hydrogen atoms, to optimize the H-bond network and to perform a restrained minimization. All targeted cysteines were modelled in the thiolate form, which is acknowledged to be the one participating in the reaction. AutoDockTools was used to prepare structures as PDBQT files for docking calculations.

Docking calculations

AutoDock4 [13] was used for all docking simulations. Each docking job was defined by a maximum of 100 runs, a population size of 150 individuals, 25 × 10⁵ maximum energy evaluations, 27 × 10³ maximum generations and default Lamarckian Genetic Algorithm settings. A grid box of 60 points in each dimension was centered on the targeted cysteine in the case of MurA, CatB and MAO-A, whereas for KRAS^G12C it was placed on the centroid of the ligand co-crystallized in 5V6S, since the same scaffold was present in the set of compounds under investigation. For OTUB2 and NUDT7, the grid was likewise centered on the co-crystallized ligands, as they both belonged to the screening set. The side chain of the targeted cysteine was modelled as flexible during the non-covalent docking while keeping the rest of the structure rigid. The electrophilic libraries were docked into the target proteins by using three approaches: the flexible side chain method developed to simulate covalent docking in AD4 [14], the standard non-covalent docking in AD4, and WIDOCK, a reactive docking protocol incorporating ligand reactivity information into the docking simulation. For WIDOCK, a new atom type was defined for the electrophilic carbon in the ligand and for the reactive cysteine sulfur with the same set of parameters as the respective standard ones and, following Backus et al. [19], a custom 13–7 pseudo-Lennard–Jones potential was introduced for their interaction. The equilibrium distance (r_eq) was set to the length of the covalent bond (1.8 Å); kinetic parameters derived from experimental measurements and from calculated activation energies were scaled to a range between 1.0 and 0.175 to model the reactivity of the different ligands and set as the potential well depth (ε_eq).
For each ligand, the best scoring pose was analyzed with a distance-based criterion: if the atoms involved in the formation of the C–S bond were found at a distance lower than 2.20 Å (additional distance is allowed to account for van der Waals repulsions), then the compound was predicted to be a covalent binder. In case of a compound presenting multiple isomers and/or potential reacting centers, all possibilities were enumerated. For each of these, the best scoring pose was analyzed and the one presenting the shortest interatomic distance was considered. Interatomic distances found for all compounds are reported in Supporting Table S3 for the set evaluated against KRAS^G12C, Supporting Table S4 for the set tested against MurA, CatB and MAO-A, and Supporting Table S5 for the sets screened against OTUB2 and NUDT7. More details on the workflow can be found in the supplementary methods within the Supporting Information.
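To make the reactivity-scaled reward and the distance criterion concrete, the following Python sketch may help. It rests on several assumptions that are not stated in this section: the 13–7 potential is written in the generic n–m Lennard–Jones form with its minimum of −ε at r_eq; the mapping of kinetic data onto the well depth is taken to be linear in the logarithm of the rate constant, with the most reactive compound receiving the deepest well; and all function names are hypothetical.

```python
R_EQ = 1.8                      # Angstrom, equilibrium C-S distance (from the text)
CUTOFF = 2.20                   # Angstrom, distance criterion (from the text)
EPS_MIN, EPS_MAX = 0.175, 1.0   # well-depth range (from the text)


def pseudo_lj(r, eps, r_eq=R_EQ, n=13, m=7):
    """Generic n-m potential (assumed form) with minimum -eps at r = r_eq."""
    x = r_eq / r
    return eps / (n - m) * (m * x**n - n * x**m)


def scale_well_depth(log_k, log_k_min, log_k_max):
    """Assumed linear map of a log rate constant onto [EPS_MIN, EPS_MAX]."""
    frac = (log_k - log_k_min) / (log_k_max - log_k_min)
    return EPS_MIN + frac * (EPS_MAX - EPS_MIN)


def predicted_covalent(best_pose_distances, cutoff=CUTOFF):
    """Apply the distance criterion to the shortest C-S distance found among
    the best-scoring poses of all isomers / reactive centres of a compound."""
    return min(best_pose_distances) < cutoff


# Illustrative values only.
eps = scale_well_depth(log_k=-1.0, log_k_min=-3.0, log_k_max=0.0)
print(round(eps, 3), pseudo_lj(R_EQ, eps))
print(predicted_covalent([2.45, 2.05]))
```

With this form, the reward at the covalent bond length equals the scaled well depth, so a more reactive warhead gains more from placing its electrophilic carbon next to the cysteine sulfur.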

For covalent docking simulations, the flexible side chain method of AD4 was used. The two additional atoms (Cβ–SG) on ligand structures were used for the alignment on the targeted cysteine in the protein through a SMARTS-based definition of overlapping atoms. After alignment, bound ligands were treated as fully flexible residues during the docking simulation. Compounds were ranked by the score calculated using the semi-empirical force field-based scoring function in AD4. Predicted actives were selected as those retrieved in the top N% of the scoring range, where N varied for each protein target (60% for KRAS, 30% for MurA and MAO-A, 10% for CatB, 7% for OTUB2, 5% for NUDT7) to reflect the fraction of compounds experimentally validated as inhibitors. Covalent docking scores for all compounds in the sets are reported in Supporting Tables S3, S4 and S5. In the KRAS^G12C case study, ligands were screened against both protein structures deposited in 5V6S and 5V6V. Following an ensemble approach, WIDOCK and covalent docking poses predicted against the two structures were analyzed. The one with the shorter distance was considered for WIDOCK, and the one with the lower docking score was considered for covalent docking to evaluate virtual screening results.
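The ensemble step and the threshold-based selection of predicted actives can be illustrated as follows. This is a hedged sketch: "top N% of the scoring range" is interpreted here as the best-scoring N% of the ranked compounds, which is an assumption, and the ligand names and scores are invented placeholders.

```python
def ensemble_best(per_structure):
    """Lowest value across receptor structures: the shorter C-S distance
    for WIDOCK, or the lower docking score for covalent docking."""
    return min(per_structure.values())


def top_fraction(scores, fraction):
    """IDs of the best-scoring fraction of compounds (lower score = better)."""
    ranked = sorted(scores, key=scores.get)
    n_keep = max(1, round(fraction * len(ranked)))
    return ranked[:n_keep]


# Placeholder covalent docking scores against the two KRAS G12C structures.
docking_scores = {
    "lig1": {"5V6S": -7.2, "5V6V": -8.1},
    "lig2": {"5V6S": -6.0, "5V6V": -5.4},
    "lig3": {"5V6S": -9.3, "5V6V": -8.8},
    "lig4": {"5V6S": -4.9, "5V6V": -5.1},
    "lig5": {"5V6S": -6.6, "5V6V": -7.0},
}
best_scores = {lig: ensemble_best(v) for lig, v in docking_scores.items()}
predicted_actives = top_fraction(best_scores, 0.60)  # 60% threshold used for KRAS
print(predicted_actives)
```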

ROC curves and the area under the ROC curves (AUC) were used as unbiased standard metrics to evaluate the virtual screening performances against all targets. Screening sets were ranked according to C–S interatomic distances and docking scores obtained by WIDOCK and covalent docking, respectively. Furthermore, virtual screening performances were evaluated in terms of the sensitivity, specificity and accuracy displayed by the protocols at the abovementioned target-tailored classification thresholds.
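For readers who want to reproduce such metrics, a dependency-free Python sketch is given below. It computes the AUC through the rank-based (Mann–Whitney) formulation, treating lower values (shorter C–S distance or lower docking score) as more likely active; the function names and the example data are illustrative assumptions, not values from this study.

```python
def roc_auc(values, labels):
    """AUC as the probability that a random active has a lower value than a
    random inactive; ties count one half."""
    pos = [v for v, y in zip(values, labels) if y]
    neg = [v for v, y in zip(values, labels) if not y]
    wins = sum(1.0 if p < n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def classification_metrics(predicted, actual):
    """Sensitivity, specificity and accuracy from boolean predictions."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    tn = sum(1 for p, a in pairs if not p and not a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / len(pairs),
    }


# Placeholder C-S distances (Angstrom) and experimental labels (1 = active).
distances = [1.9, 2.0, 2.1, 3.5, 4.0, 2.15]
labels = [1, 1, 0, 0, 0, 1]
auc = roc_auc(distances, labels)
metrics = classification_metrics([d < 2.20 for d in distances], labels)
print(auc, metrics)
```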

Cysteine characterization

For electrostatic potential calculations, the quantum mechanical (QM) region was defined by including residues within 4 Å of the targeted cysteine. Single-point DFT B3LYP calculations with the 6-31G* basis set were performed in continuum solvation models, and the electrostatic potential on the van der Waals surface of the atoms included in the QM region was calculated by using QSite [62,63,64]. The web-based platform Cpipe [65] was used to predict pK_a and cysteine reactivity information. Solvent accessible surface area (SASA) calculations were performed on prepared protein structures by using the POPS algorithm [66].

QM calculations

DFT methods were used to calculate reaction energies (ΔG_r) and activation energy barriers (ΔG^‡) against the methyl-thiolate anion (MeS⁻) as a cysteine surrogate. Both quantities have already been validated for predicting reactivities [67,68,69]; however, these studies focused on a limited number of warhead chemotypes. We used the Gaussian 09 software package [70] with the SMD implicit solvation model (water) and the M06-2X functional [71], as it has been shown to be one of the most accurate functionals for calculating these parameters [72, 73]. We performed geometry optimizations and estimated the entropic contributions with the 6-311+G(d,p) basis set to obtain both energies (E) and Gibbs free energies (G). Then we calculated single point energies (E′) with the larger basis set 6-311++G(3df,3pd). From these data we calculated the Gibbs free energies (G′) of the investigated structures (Eq. 1). We optimized the transition state and product geometries (in the case of Michael acceptors we always considered the s-cis transition state geometry [68]) with the 6-311+G(d,p) basis set. QST3 transition state optimization was applied and IRC calculations were performed in order to prove that the transition states connect the two corresponding minima. Frequencies were calculated to ensure that transition states are saddle points having one imaginary frequency, and that reactants and products are local minima having no imaginary frequency. Entropic and thermal corrections were evaluated for isolated molecules using standard rigid rotor harmonic oscillator approximations (i.e. Gibbs free energies were taken as the sum of electronic and thermal free energies of vibrational frequency calculations). The H, G and S values were obtained at standard conditions. The activation energy barriers (ΔG^‡) (Eq. 2) were determined as Gibbs free energy differences of the optimized transition states ($\mathbf{G'}_{\mathbf{TS}}$) and the initial compounds. Analogously, the reaction energies (ΔG_r) (Eq. 3) were obtained as the free energy differences of the optimized products ($\mathbf{G'}_{\text{product}}$) and the initial compounds (data are shown in Supporting Table S6).

$${\mathbf{G^{\prime}}} = {\mathbf{E^{\prime}}} + \left( {{\mathbf{G}} - {\mathbf{E}}} \right)$$

(1)

$$\Delta {\mathbf{G}}^{\ddag } = {\mathbf{G^{\prime}}}_{{{\mathbf{TS}}}} - \left( {{\mathbf{G^{\prime}}}_{0} + {\mathbf{G^{\prime}}}_{{{\mathbf{SH}}}} } \right)$$

(2)

$$\Delta {\mathbf{G}}_{r} = {\mathbf{G^{\prime}}}_{{{\text{product}}}} - \left( {{\mathbf{G^{\prime}}}_{0} + {\mathbf{G^{\prime}}}_{{{\mathbf{SH}}}} } \right)$$

(3)
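As a worked illustration of Eqs. 1–3, the short Python sketch below combines the quantities in the same way. It assumes that all inputs are in hartree and that G′₀ and G′_SH denote the electrophile and the thiolate surrogate, respectively; the function names and the numerical values are hypothetical and are not taken from Supporting Table S6.

```python
HARTREE_TO_KCAL = 627.5095  # kcal/mol per hartree


def corrected_g(e_large, g_small, e_small):
    """Eq. 1: G' = E' + (G - E)."""
    return e_large + (g_small - e_small)


def barrier_and_reaction_energy(g_ts, g_product, g_electrophile, g_thiolate):
    """Eqs. 2 and 3: differences relative to the separated reactants,
    converted from hartree to kcal/mol."""
    reactants = g_electrophile + g_thiolate
    dg_barrier = (g_ts - reactants) * HARTREE_TO_KCAL
    dg_reaction = (g_product - reactants) * HARTREE_TO_KCAL
    return dg_barrier, dg_reaction


# Made-up numbers, chosen only to show the arithmetic.
g_prime = corrected_g(e_large=-10.5, g_small=-10.1, e_small=-10.2)
dg_barrier, dg_reaction = barrier_and_reaction_energy(
    g_ts=-1.000, g_product=-1.050, g_electrophile=-0.600, g_thiolate=-0.420
)
print(round(g_prime, 4), round(dg_barrier, 2), round(dg_reaction, 2))
```

The thermal correction (G − E) is taken from the smaller basis set and added to the larger-basis single point energy, which is exactly what Eq. 1 expresses.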

Inhibitory activity data

Inhibitory activities for compounds 5–24 expressed as % of KRAS^G12C labeling at 100 μM compound concentration were obtained from [25]. Compounds showing > 50% labeling were considered as actives. Inhibitory activities for compounds 25–53 expressed as residual activity of MurA and CatB at 100 μM compound concentration were taken from [9]. Compounds showing < 60% residual activity were considered as actives. Inhibitory activities expressed as % labeling for the virtual screening at 200 μM compound concentration against OTUB2 and NUDT7 were taken from [43]. Compounds showing > 50% labeling were considered as actives.
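The activity definitions above can be collected in a small lookup, sketched here in Python. The dictionary keys and the function name are hypothetical; the thresholds and their direction (labeling above 50%, residual activity below 60%) are those stated in the text.

```python
# (readout type, threshold in %) per target, as stated in the text.
ACTIVITY_RULES = {
    "KRAS_G12C": ("labeling", 50.0),
    "MurA": ("residual_activity", 60.0),
    "CatB": ("residual_activity", 60.0),
    "OTUB2": ("labeling", 50.0),
    "NUDT7": ("labeling", 50.0),
}


def is_active(target, value_percent):
    """Labeling must exceed its threshold; residual activity must fall below it."""
    readout, threshold = ACTIVITY_RULES[target]
    if readout == "labeling":
        return value_percent > threshold
    return value_percent < threshold


print(is_active("KRAS_G12C", 72.0), is_active("MurA", 72.0))
```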

MAO-A activity assay

The effects of the test compounds on MAO-A were investigated using a fluorimetric assay, following a previously described methodology [74]. The inhibitory activity of the compounds was evaluated by their effects on the production of hydrogen peroxide (H₂O₂) from p-tyramine. The production of H₂O₂ was detected using Amplex Red reagent in the presence of horseradish peroxidase, where a highly sensitive fluorescent product, resorufin, is produced in stoichiometric amounts. Recombinant human microsomal MAO-A enzyme expressed in baculovirus-infected insect cells (BTI-TN-5B1-4), horseradish peroxidase (type II, lyophilized powder), and p-tyramine hydrochloride were obtained from Sigma Aldrich. 10-Acetyl-3,7-dihydroxyphenoxazine (Amplex Red reagent) was synthesized as described in the literature [75].

Briefly, 100 µL of 50 mM sodium phosphate buffer (pH 7.4, 0.05% Triton X-114) containing the compounds and MAO-A was incubated for 30 min at 37 °C in a flat-bottomed black 96-well microplate and placed in a dark microplate reader chamber. After the pre-incubation, the reaction was started by adding Amplex Red reagent, horseradish peroxidase and p-tyramine to final concentrations of 200 µM, 2 U/mL and 1 mM, respectively (final volume, 200 µL). The production of resorufin was quantified based on the fluorescence generated (λ_ex = 530 nm, λ_em = 590 nm) at 37 °C over a period of 30 min, during which time the fluorescence increased linearly. For control experiments, DMSO was used instead of the appropriate dilutions of the compounds in DMSO. To determine the blank value (b), phosphate-buffered solution replaced the enzyme solution. The initial velocities were calculated from the trends obtained, with each measurement carried out in duplicate. The specific fluorescence emission used to obtain the final result was calculated after subtraction of the blank activity (b). The inhibitory potencies are expressed as residual activities (RA): $\text{RA} = \frac{\text{v}_{\text{i}} - \text{b}}{\text{v}_{0} - \text{b}}$, where v_i is the velocity in the presence of the test compounds and v₀ is the control velocity in the presence of 1.5% DMSO.
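The residual activity formula can be applied as in the following Python sketch. Averaging the duplicate velocities before blank subtraction is an assumption made for illustration, as are the function name and the numerical values.

```python
from statistics import mean


def residual_activity(v_i_replicates, v_0_replicates, blank):
    """RA = (v_i - b) / (v_0 - b), using the mean of the replicate velocities."""
    v_i = mean(v_i_replicates)
    v_0 = mean(v_0_replicates)
    if v_0 == blank:
        raise ValueError("control velocity equals the blank; RA is undefined")
    return (v_i - blank) / (v_0 - blank)


# Placeholder initial velocities in arbitrary fluorescence units per minute.
ra = residual_activity(v_i_replicates=[34.0, 36.0],
                       v_0_replicates=[108.0, 112.0],
                       blank=10.0)
print(f"RA = {ra:.2f} ({ra * 100:.0f}% residual activity)")
```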

Labelling of human MAO-A

MAO-A (150 μL, 52 µM) stored in 50 mM Hepes, pH = 7.5 with 0.25% Triton X-100 was thawed at 37 °C, and then desalted using a G-25 (fine) Sephadex column into 50 mM K₃PO₄, pH = 7.5 containing 0.25% Triton X-100. The resulting 40 µM sample was divided; 35 µL was taken, and the electrophilic fragment (0.5 µL, 100 mM in DMSO) was added. The mixture was incubated at 4 °C for 24 h.
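For orientation, the concentrations in the labelling mixture follow from the stated volumes. The sketch below assumes that the volumes are simply additive; the helper name is ours.

```python
def mixed_concentration(stock_conc, stock_vol_ul, added_vol_ul):
    """Concentration of a solute after its stock is combined with another volume.

    Assumes additive volumes; the result has the same units as stock_conc.
    """
    return stock_conc * stock_vol_ul / (stock_vol_ul + added_vol_ul)


# Values taken from the labelling protocol above.
protein_um = mixed_concentration(40.0, 35.0, 0.5)        # MAO-A, in µM
fragment_um = mixed_concentration(100_000.0, 0.5, 35.0)  # 100 mM stock, in µM
molar_excess = fragment_um / protein_um
dmso_percent = 100.0 * 0.5 / (35.0 + 0.5)                # % v/v DMSO

print(f"MAO-A: {protein_um:.1f} µM, fragment: {fragment_um / 1000:.2f} mM")
print(f"fragment excess: {molar_excess:.1f}-fold, DMSO: {dmso_percent:.1f}% v/v")
```

Under this assumption the fragment is present at about 1.4 mM, roughly a 36-fold molar excess over the protein.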

Digestion and LC–MS/MS analysis of labelled human MAO-A

The tryptic digestion method was adapted from our former publication [9]. Briefly, 35 μL of MAO-A (40 μM) and 10 μL of 0.2% (w/v) RapiGest SF (Waters, Milford, USA) solution buffered with 50 mM ammonium bicarbonate (NH₄HCO₃) were mixed (pH = 7.8). Then, 3.3 μL of 45 mM DTT (~150 nmol) in 100 mM NH₄HCO₃ was added and the sample was kept at 37.5 °C for 30 min. After cooling the sample to room temperature, 4.16 μL of 100 mM iodoacetamide (416 nmol) in 100 mM NH₄HCO₃ was added and the sample was placed in the dark at room temperature for 30 min. The reduced and alkylated protein was then digested with 10 μL of trypsin (1 mg/mL; enzyme-to-protein ratio 1:10) (Sigma, St. Louis, MO, USA). The sample was incubated at 37 °C overnight. To degrade the surfactant, 7 μL of formic acid (500 mM) solution was added to the digested MAO-A sample to give a final concentration of 40 mM (pH ≈ 2), and the sample was incubated at 37 °C for 45 min. For LC–MS analysis, the acid-treated sample was centrifuged for 5 min at 13 000 rpm.

A QTRAP 6500 triple quadrupole-linear ion trap mass spectrometer, equipped with a Turbo V source in electrospray mode (AB Sciex, CA, USA), and an Agilent 1100 Binary Pump HPLC system (Agilent Technologies, Waldbronn, Germany) including an autosampler were used for LC–MS/MS analysis. Data acquisition and processing were performed using Analyst software version 1.6.2 (AB Sciex Instruments, CA, USA). Chromatographic separation was achieved using a Discovery® BIO Wide Pore C-18-5 column (250 mm × 2.1 mm, 5 μm). The sample was eluted with a gradient of solvent A (0.1% formic acid in water) and solvent B (0.1% formic acid in ACN). The flow rate was set to 0.2 mL/min. The initial conditions for separation were 5% B for 7 min, followed by a linear gradient to 90% B by 53 min; from 60 to 63 min 90% B was retained, and from 64 to 65 min the system returned to the initial conditions, with 5% eluent B retained to 75 min. The injection volume was 10 μL (300 pmol on the column).

An Information Dependent Acquisition (IDA) LC–MS/MS experiment was used to identify the modified tryptic MAO-A peptide fragments. An enhanced MS scan (EMS) was applied as the survey scan and an enhanced product ion (EPI) scan was the dependent scan. The collision energy in EPI experiments was set to rolling collision energy mode, where the actual value was set on the basis of the mass and charge state of the selected ion. Further IDA criteria were: ions greater than 400,000 m/z exceeding 10⁶ counts, with former target ions excluded for 30 s after two occurrences. The scan rate was 1000 Da/s in both EMS and EPI modes. Nitrogen was used as the nebulizer gas (GS1), heater gas (GS2), and curtain gas, with the optimum values set at 50, 40 and 40 (arbitrary units), respectively. The source temperature was 350 °C and the ion spray voltage was set at 5000 V. The declustering potential was set to 150 V. GPMAW 4.2 software was used to analyze the large number of MS–MS spectra and identify the modified tryptic MAO-A peptides.

Results and discussion

Retrospective docking on KRAS^G12C

First, WIDOCK was challenged by a set of compounds covalently labeling Cys12 in KRAS^G12C with a range of electrophilic warheads. McGregor et al. elaborated a reported KRAS^G12C switch II inhibitor scaffold by introducing different warhead types probing Cys12 reactivity. The tested electrophiles included acrylamides, epoxides, aziridines, α-chloroacetamides, β-chloroethylureas, acyl-imidazoles, diazoacetamides and other warheads reacting through nucleophilic substitution. Because the compounds differ only in the warhead, differences in the inhibitory potency can be assigned to differences in the intrinsic reactivities and in the optimal orientations of the reacting groups. This set is therefore well suited to test the docking performance of WIDOCK on a range of warheads. Indeed, the main advantage provided by our method lies in the opportunity to screen and compare covalent ligands bearing multiple warhead types characterized by different intrinsic reactivities. Moreover, the authors assessed the thiol reactivity of the compounds by measuring the extent of covalent adduct formation with β-mercaptoethanol (BME) as a thiol surrogate. Thus, we could use the reported reactivity information to parametrize the customized pseudo-Lennard–Jones potentials for all the compounds. By docking this set of compounds to KRAS^G12C we could inspect the ability of WIDOCK to induce a conformational change in the warhead with respect to docking poses generated by the standard non-covalent docking in AD4. As an illustrative example, Fig. 2 shows the binding modes generated for compound 5, a co-crystallized KRAS^G12C covalent inhibitor. Both standard docking and WIDOCK provided good overall agreement with the experimentally determined binding mode (Fig. 2a). However, focusing on the warhead conformation (Fig. 2b), the best scoring standard docking pose predicted an incorrect geometry of the acrylamide moiety (Fig. 2b-I vs. Fig. 2b-II). 
On the other hand, the reactivity-scaled interaction potential introduced between reacting atoms in WIDOCK induced a flip in the warhead structure, by placing the reactive β-carbon in close proximity to the targeted cysteine sulfur (2.06 Å) (Fig. 2b-III). This result highlights the capability of WIDOCK to predict the correct geometry of the ligand warhead while keeping the correct binding mode in the pocket.

Overall, by using WIDOCK on KRAS^G12C we could retrieve 10 out of the 12 experimental actives (all except the acyl-imidazole 14 and the α-chloro-acetamide 17) within the defined distance cutoff, which represents 83% sensitivity (or True Positive Rate, TPR) (Fig. 3). For comparison, we screened the library using the dedicated covalent docking module of AD4 (flexible side chain method). By analyzing the results based on a target-tailored classification threshold (see “Docking calculations” in “Methods” section), covalent docking provided a TPR of 75% (Fig. 3b), thus somewhat lower than the one achieved by WIDOCK (more details in Supporting Table S3). In addition, covalent docking showed significantly lower specificity and accuracy (13 and 50%) as compared to WIDOCK (100 and 90%), which emphasizes the ability of WIDOCK to discriminate active from inactive compounds in the screening set. Additionally, ROC curves were produced as unbiased performance metrics to evaluate virtual screening results (Fig. 3c). The superior performance of WIDOCK relative to covalent AD4 is shown by both the higher early enrichment of actives (Supporting Figure S4) and the larger AUC values, thus supporting the utility of WIDOCK in handling screening sets composed of diverse warhead chemotypes.
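How a ROC curve and its AUC are obtained from a ranked screening list can be sketched as follows. This is a generic illustration, assuming that compounds are ranked by a single score for which lower values are better; the scores, activity labels and function names are hypothetical and do not correspond to any data set in this work.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) points obtained by walking down the ranked list.

    scores: lower is better (assumption of this sketch).
    labels: 1 for an experimental active, 0 for an inactive.
    Tied scores are not treated specially here.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0])
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    tp = fp = 0
    points = [(0.0, 0.0)]
    for _, is_active in ranked:
        if is_active:
            tp += 1
        else:
            fp += 1
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points):
    """Area under the curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2.0
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical ranking of six compounds, three of them active.
scores = [-9.1, -8.7, -8.2, -7.9, -7.5, -7.0]
labels = [1, 1, 0, 1, 0, 0]

curve = roc_points(scores, labels)
roc_auc = auc(curve)
print(f"AUC = {roc_auc:.3f}")
```

An AUC of 0.5 corresponds to random classification and 1.0 to a ranking that places every active ahead of every inactive.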

Retrospective docking on MurA and CatB

WIDOCK was next evaluated on electrophilic fragments equipped with diverse warheads against MurA and CatB. In Table 1, we report reactivity and inhibitory activity data for the library members. Their reactivities against l-glutathione (GSH) are characterized by the half-life of adduct formation and by the pseudo first order reaction rate constant (k) derived from the former. These fragments were measured in single point enzyme activity assays at 100 μM concentration against MurA and CatB. Reactivity and inhibition data were taken from a recent publication [9].

Biochemical results shown in Table 1 suggest that a large fraction of highly reactive compounds led to a better inhibitory profile (lower remaining activity) when tested against MurA. This is in line with the idea that a more pronounced reactivity improves the chance of covalent binding, and consequently enhances the inhibitory activity. This trend, however, is not seen in the results obtained against CatB. This highlights that reactivity is not the only factor driving the binding of covalent ligands, and that a significant degree of target selectivity can be achieved [43]. The roles of the binding site residues involved in the initial ligand recognition and of the cysteine surroundings affecting its intrinsic nucleophilicity can be interpreted by characterizing the active site cysteines of MurA (Cys115) and CatB (Cys29). We used three tools to obtain reactivity and accessibility descriptors (Table 2). QSite [62,63,64] by Schrödinger was used to perform mixed QM/MM calculations to inspect the reactive center in the protein. It provided the electrostatic potential minima (ESP_min) on the sulfur atom of the cysteines, thus indicating the relative nucleophilicity of the targeted residues. Using the POPS algorithm, we retrieved the cysteine accessibilities in terms of solvent accessible surface areas (SASA) of both the whole residue and its side chain sulfur (SG). Finally, the web-based platform Cpipe was used for reactivity and pK_a predictions for the analyzed cysteines.

Table 2 Parameters indicating reactivity and accessibility of cysteines in MurA and CatB


By investigating the calculated properties, the lower ESP_min and pK_a values of CatB’s Cys29 compared to MurA’s Cys115 suggest a more pronounced nucleophilicity of the former, which is accompanied, however, by a lower accessibility. Overall, the lower accessibility of the catalytic cysteine in CatB could provide an explanation of the lower number of experimental actives found against this target.

The experimental reactivity data measured against GSH (lnk in Table 1) were used to derive parameters for WIDOCK. Then, the ligands of Table 1 were docked against MurA and CatB by targeting, in each protein, the reactive cysteine whose covalent modification modulates the functional activity. Activity predictions were compared to the experimental screening results available for the two enzymes. WIDOCK was able to retrieve most of the validated actives, with few false positives having the reacting atom pair within the defined distance cutoff (Fig. 4a, for detailed dataset see Supporting Table S4). Overall, screening by WIDOCK resulted in 60% and 100% sensitivity against MurA and CatB, respectively (Fig. 4b). The varying performance in terms of true positive rates reflects the different structural framework defined by the binding site residues involved in the non-covalent recognition of covalent ligands. Indeed, the non-covalent interactions formed in the pocket are important to place the warhead in the right position and orientation with respect to the protein nucleophile in order to allow for the chemical reaction [2].

Next, we compared the performance of WIDOCK to that of the dedicated covalent docking algorithm of AD4. In contrast to our protocol, the covalent docking available in AD4 could not retrieve a significant number of experimentally validated covalent inhibitors within the custom classification threshold (Fig. 4a). For MurA, covalent docking showed a sensitivity of 20%, substantially lower than that achieved by WIDOCK (60%) (Fig. 4b). For CatB, no actives were correctly predicted (0% sensitivity). It should be noted that the high sensitivity obtained by WIDOCK for CatB is accompanied by lower specificity owing to the generated false positives. This is in sharp contrast with the results obtained by covalent AD4, which generated a single false positive without identifying any true active within the custom threshold. Although the numbers of investigated compounds and experimental actives are clearly lower than is typical in virtual screening campaigns, it is worth noting that high sensitivity at the expense of modest specificity, as found with WIDOCK, allows the identification of actives, although with increased experimental effort. The ROC curves (Fig. 4c) and the enrichment curves (Supporting Figure S4) clearly demonstrate the advantage that WIDOCK provides in terms of a better TPR to FPR relation and high early enrichment of MurA and CatB actives as compared to covalent AD4, with the latter resulting in AUC values only slightly better than a random classification. It can also be seen that the lower specificity noted for WIDOCK against CatB is due to the incorrect positive classification of an additional 38% of the set following the identification of all true inhibitors (100% TPR already obtained in the top ranked 17% of the set).
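Early enrichment, as read from the enrichment curves, can be quantified as in the sketch below. It uses the common definitions of the fraction of actives recovered and of the enrichment factor in the top-ranked part of a list; the ranked labels are hypothetical, and the function names are ours rather than part of the protocol.

```python
import math


def fraction_recovered(ranked_labels, top_fraction):
    """Share of all actives found in the top-ranked fraction of the list."""
    n_top = math.ceil(top_fraction * len(ranked_labels))
    return sum(ranked_labels[:n_top]) / sum(ranked_labels)


def enrichment_factor(ranked_labels, top_fraction):
    """Hit rate in the top fraction divided by the hit rate of the whole set."""
    n_top = math.ceil(top_fraction * len(ranked_labels))
    hit_rate_top = sum(ranked_labels[:n_top]) / n_top
    hit_rate_all = sum(ranked_labels) / len(ranked_labels)
    return hit_rate_top / hit_rate_all


# Hypothetical ranked list (best first): 12 compounds, 2 actives at the top.
ranked = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]

recovered = fraction_recovered(ranked, 0.25)
ef = enrichment_factor(ranked, 0.25)
print(f"recovered in top 25%: {recovered:.0%}, enrichment factor: {ef:.1f}")
```

In this invented case all actives are recovered in the top quarter of the list, which corresponds to a fourfold enrichment over random selection.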

We also investigated the ability of calculated reactivity descriptors (ΔG_r, ΔG^‡) for the parametrization of the pseudo-Lennard–Jones potentials. Reaction energy (ΔG_r) did not turn out to be useful for this compound set with diverse warhead chemotypes (R² = 0.066 between ΔG_r and the logarithm of the kinetic rate constant). Although the quantum chemical prediction of reactivity for diverse warheads is highly challenging, reasonable correlation was found (R² = 0.505; RMSD = 1.99; N = 29) between ΔG^‡ and the logarithm of the kinetic rate constant by considering all investigated warhead chemotypes (Fig. 5). The correlation was found to be statistically significant at the p = 1.08 × 10^–5 level using a correlation t-test.
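The test statistic behind such a correlation t-test follows directly from the reported R² and N. The sketch below assumes the standard form of the test, t = r·sqrt((N − 2)/(1 − r²)) with N − 2 degrees of freedom; it computes only the statistic, not the p-value.

```python
import math


def correlation_t_statistic(r_squared, n):
    """t statistic for testing a Pearson correlation against zero.

    Assumes the standard form t = r * sqrt((n - 2) / (1 - r^2)),
    evaluated with n - 2 degrees of freedom.
    """
    r = math.sqrt(r_squared)
    return r * math.sqrt((n - 2) / (1.0 - r_squared))


# Values reported in the text: R^2 = 0.505, N = 29.
t_value = correlation_t_statistic(0.505, 29)
degrees_of_freedom = 29 - 2
print(f"t = {t_value:.2f} with {degrees_of_freedom} degrees of freedom")
```

The p-value is then obtained from the two-sided tail of Student's t distribution at this t value.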

Therefore, we applied this model to predict kinetic rate constants for all compounds in the set, which were then used to derive the parameters of the pseudo-Lennard–Jones potentials for WIDOCK. The performance was shown to be highly similar to that obtained by the experimental reactivity parameters. In particular, considering the fraction of compounds within the distance cutoff, it resulted in 53% and 100% sensitivity against MurA and CatB, respectively (Fig. 6). Interestingly, equal (CatB) or similar (MurA) TPR values at distance cutoffs correspond to a higher specificity (1-FPR) for the protocol based on calculated parameters against both targets (100 and 58% for MurA and CatB, respectively).
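The use of such a linear model can be sketched as follows: a line is fitted to computed barriers and experimental ln k values, and then applied to a new barrier. All numbers below are invented for illustration (including the units of the barrier), and the subsequent conversion of the predicted ln k into pseudo-Lennard–Jones parameters is not shown.

```python
def fit_line(x, y):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(x, y, slope, intercept):
    """Coefficient of determination of the fitted line."""
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Hypothetical training data: computed barriers and experimental ln k values.
barriers = [10.0, 15.0, 20.0, 25.0]
ln_k_exp = [-2.0, -5.0, -7.0, -10.0]

slope, intercept = fit_line(barriers, ln_k_exp)
fit_quality = r_squared(barriers, ln_k_exp, slope, intercept)

# Predicted ln k for a compound with a hypothetical computed barrier of 18.
ln_k_pred = slope * 18.0 + intercept
print(f"ln k = {slope:.2f} * barrier + {intercept:.2f} (R^2 = {fit_quality:.3f})")
print(f"predicted ln k: {ln_k_pred:.2f}")
```

The negative slope reflects the expected trend that a higher barrier corresponds to a slower reaction.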

We find it instructive to analyze docking poses obtained with WIDOCK and to compare them with the poses calculated by both non-covalent AD4 and covalent AD4. Figure 7 shows the conformations generated by all the evaluated methods when docking selected experimental MurA actives including maleimide (41), α-halo-acetophenone (46) and acrylamide (32) warheads. These three fragments were all predicted as actives by WIDOCK, while the covalent docking in AD4 only predicted the maleimide 41 among the best scored. While the methods provided different poses for 32, the predicted binding modes of 41 and 46 by the covalent docking and WIDOCK overlapped significantly. In contrast, the standard non-covalent AD4 predicted fragments 32 and 41 into a different subpocket. Interestingly, an overlap is displayed by the phenyl ring in 46, although in a flipped conformation that places the reactive carbon far from the reactive cysteine.

By docking our compound set to CatB, α-bromo-acetophenone was found as the only warhead type in experimentally validated inhibitors. WIDOCK was able to predict all the true actives in the set, although together with a higher number of false positives as compared to the other two targets. In Fig. 8, docking poses predicted for two experimental actives (44 and 46) are shown. As in the case of MurA, significant overlap was found between conformations generated by the covalent docking and WIDOCK, with only a slight deviation of the biphenyl system in 44 toward a different subpocket. In addition, both 44 and 46 clearly show the distinct binding mode that is produced by the non-covalent docking method neglecting the reactivity information.

Interestingly, comparing the distances between the reactive atom pairs in the WIDOCK and non-covalent AD4 poses shows that their difference increased in parallel with the inhibitory activities (Fig. 9). The most striking differences involve those compounds that were predicted as covalent binders by WIDOCK, a significant fraction of which were found to be experimental actives. Altogether, these data clearly show that the improved sensitivity of WIDOCK can be traced back to the ligand reactivity considered within the docking process.
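The distance comparison can be illustrated with a short sketch. The coordinates are hypothetical, and the cutoff value is a placeholder only, since the actual distance cutoff is defined in the Methods section; the function names are ours.

```python
import math


def reactive_pair_distance(ligand_atom_xyz, cys_sg_xyz):
    """Euclidean distance (Å) between the reactive ligand atom and the Cys sulfur."""
    return math.dist(ligand_atom_xyz, cys_sg_xyz)


def distance_shift(widock_atom_xyz, noncovalent_atom_xyz, cys_sg_xyz):
    """How much closer the reactive atom lies to the sulfur in the WIDOCK pose."""
    return (reactive_pair_distance(noncovalent_atom_xyz, cys_sg_xyz)
            - reactive_pair_distance(widock_atom_xyz, cys_sg_xyz))


# Hypothetical coordinates in Å (illustration only).
cys_sg = (0.0, 0.0, 0.0)
widock_atom = (2.0, 0.0, 0.0)       # reactive carbon in the WIDOCK pose
noncovalent_atom = (3.0, 4.0, 0.0)  # same atom in the non-covalent pose

PLACEHOLDER_CUTOFF = 2.5  # Å; placeholder, not the cutoff used in this work

d_widock = reactive_pair_distance(widock_atom, cys_sg)
shift = distance_shift(widock_atom, noncovalent_atom, cys_sg)
predicted_covalent = d_widock <= PLACEHOLDER_CUTOFF
print(f"WIDOCK distance: {d_widock:.1f} Å, shift: {shift:.1f} Å")
```

A large positive shift indicates that the reactivity-based potential pulled the warhead toward the cysteine relative to the pose obtained without reactivity information.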

Retrospective screening against OTUB2 and NUDT7

To analyze the performance in a larger-scale virtual screening scenario, WIDOCK was applied to the set experimentally screened by Resnick et al. [43] against OTUB2 and NUDT7. The screening library consisted of mildly reactive chloroacetamides and acrylamides for which thiol reactivity data were reported, thus enabling the parametrization of WIDOCK. In detail, the logarithm of the average second-order kinetic rate constant (values in Supporting Table S5) was used to derive parameters for the pseudo-Lennard–Jones potentials, similarly to previously described applications. Overall, considering the performance at the usual distance cutoff, WIDOCK predicted 23 of the 41 experimental actives reported in the screening against OTUB2 (TPR = 56%) and 10 of the 29 actives found against NUDT7 (TPR = 34%). Structures of true positive hits for OTUB2 and NUDT7 are shown in Supporting Figure S3. Interestingly, all the true actives predicted by WIDOCK against NUDT7 (10/10) and most of those found against OTUB2 (20/23) were non-promiscuous hits in the screening carried out by Resnick et al. against ten different targets. The library was also screened by the covalent docking module in AD4 to have a direct comparison with a dedicated program. It is worth noting that such a large-scale virtual screening application of covalent docking by AD4 is unprecedented and required automating the generation of post-reaction conformations. The evaluation of WIDOCK and covalent AD4 using the custom cutoffs and an extended cutoff of 10% for covalent AD4 is shown in Fig. 10b (for detailed results see Supporting Table S5). Evaluating the protocols using ROC curves (Fig. 10c), covalent AD4 showed a better performance compared to WIDOCK when applied against OTUB2. The dedicated covalent docking protocol could systematically enrich for more actives at all fractions of the screening set, while also predicting fewer false positives. 
This resulted in AUC values of 0.74 and 0.54 for covalent AD4 and WIDOCK, respectively. A different trend could instead be observed for the virtual screening against NUDT7, where the two docking protocols displayed highly comparable performances as shown by the respective ROC curves and enrichment plots (reported in Supporting Figure S4).

Docking poses predicted by WIDOCK and covalent docking for the compounds co-crystallized in 5QIV (OTUB2) and 5QHA (NUDT7) are shown in Fig. 10a. The targeted cysteines (Cys51 in OTUB2 and Cys73 in NUDT7) are located in pockets that are close to the surface, thus challenging the prediction of accurate conformations. The protocols could reproduce the overall shape and conformation of the experimental structures, although apparent deviations were found mainly in the solvent-exposed terminal of the ligands.

Overall, based on the results gathered on all retrospective studies, WIDOCK outperformed covalent AD4 when applied to libraries that included more diverse warhead chemotypes (KRAS^G12C, MurA and CatB), while a comparable (NUDT7) or worse (OTUB2) performance was observed with compound sets spanning less variability in the reactive groups and thus in the intrinsic reactivities. Since WIDOCK was mainly devised to broaden the scope of electrophiles to be investigated in a virtual screening, these results confidently paved the way for its application in prospective studies to confirm its usefulness in the prediction of diverse covalent binders.

Additionally, WIDOCK was tested against these two targets using the QM-based reactivity parameters described for the retrospective docking on MurA and CatB, in order to examine the applicability of the computational parametrization on larger screening sets. To ensure comparable reactivities, computationally derived parameters were assigned to compounds presenting a related warhead in the reference library screened versus MurA and CatB (for more details, see supplementary methods within the Supporting Information). By screening parametrized compounds against OTUB2, WIDOCK showed a sensitivity of 60%, a specificity of 76% and an overall accuracy of 76% considering compounds predicted within the defined distance cutoff. The screening results obtained for the same set of compounds but with experimentally derived parameters showed a higher sensitivity of 100% at the expense of reduced specificity (59%) and accuracy (60%). However, inspecting the performance via the respective enrichment and ROC curves, it is worth noting how the computational parametrization provided a better enrichment of actives in the initial third of the screening set and an overall higher AUC value compared to the protocol based on experimental parameters (Fig. 11 and Supporting Figure S4). When applied against NUDT7, the protocol with computational parametrization resulted in a sensitivity of 29%, with specificity and accuracy of 86 and 83%, respectively. Screening the same set of compounds with experimentally derived parameters exhibited 64% sensitivity and lower specificity (74%) and accuracy (73%) compared to the screening by computationally derived parameters. In general, WIDOCK with computationally derived parameters resulted in poorer performance for NUDT7, as the experimental parametrization could provide a consistently better enrichment of actives (Fig. 11 and Supporting Figure S4). 
It must also be emphasized that the experimental-based protocol predicted a larger fraction of compounds (~15%) within the distance cutoff against both OTUB2 and NUDT7, thus resulting in the identification of a higher number of actives compared to the computational protocol. However, these results confirmed that promising virtual screening hits can be predicted even by relying solely on a computational parametrization.

Prospective screening against MAO-A by targeting an active site cysteine

Encouraging results obtained during the retrospective validation of WIDOCK prompted us to apply the protocol to identify new covalent MAO-A inhibitors. Although several known MAO-A inhibitors bind covalently to the FAD cofactor, to the best of our knowledge, no validated cysteine-binding covalent inhibitor has been reported yet for MAO-A. Inspecting the residues at the active site, we identified Cys323 in a position where its labelling is likely to block access to the active site. Moreover, since two additional residues, Cys201 and Cys321, are found near the active site, the reactivity and accessibility of these three cysteines were characterized as explained in the previous section.

By comparing the values obtained for these three residues in MAO-A, we observed that Cys323 not only has the most negative ESP_min, but also the largest SASA considering both the whole residue and the side chain sulfur (Table 3). These data suggest that Cys323 is the cysteine residue having the highest nucleophilic character and accessibility among the ones analyzed in MAO-A. Furthermore, Cpipe predicted it to be reactive, together with Cys321, despite their high pK_a values. It is worth noting that several cysteine residues for which labeling was proved by X-ray crystallography and/or MS experiments were found to have high predicted pK_a values [76]. Altogether, these data suggest that Cys323 is potentially targetable and support the hypothesis that Cys323 labeling may lead to MAO-A inhibition with a novel covalent mechanism of action. This hypothesis is supported by a recent report of several cysteine-reactive covalent MAO-A inhibitors; however, their mechanism of action was not confirmed [77]. Figure 12 highlights the location of Cys323 and the surrounding residues (including Cys321) in the binding pocket of MAO-A.

Table 3 Parameters indicating reactivity and accessibility of cysteines in MAO-A


Compounds in Table 1 were docked with WIDOCK into MAO-A by using both experimental and computation-derived pseudo-Lennard–Jones potentials (see above). Virtual screening with standard covalent AD4 was also performed to analyze differences in the predictive power. All of the compounds were experimentally tested in the MAO-A inhibitory assay. As summarized in Fig. 13a, applying WIDOCK with experimental reactivity parameters resulted in eight compounds predicted within the distance cutoff, all of which were experimentally confirmed. Four additional compounds were found to inhibit MAO-A experimentally. These data represent 67% sensitivity, 100% specificity and 86% overall accuracy (Fig. 13b). The performance is slightly lower when the parameters of the potentials are derived from computed reactivities. In this case, one active is left unrecognized (compound 39) and four false positives (30, 31, 33, 51) appear. Altogether, the sensitivity is 58%, the specificity is 76% and the accuracy is 69%. Thus, the hit rate with experimentally derived parameters is 100%, while it is 64% with the computationally derived parameters. Although the assessment of these results is affected by factors like the size and composition of the investigated library and the promiscuity of the identified inhibitors, the high hit rates are remarkable. Considering the performance of standard covalent AD4, the most notable difference is the lower sensitivity of 33%, as only four out of the 12 experimental actives were correctly predicted using the custom classification threshold, with a hit rate of 57%, accompanied by 82% specificity and 62% accuracy. Investigating the overall performance using ROC curves (Fig. 13c) shows the remarkably accurate classification achieved by WIDOCK with experimental reactivity parameters: only active compounds are predicted within the specified distance cutoff, resulting in an AUC value of 0.70. 
On the other hand, the computational parametrization of WIDOCK provided a performance highly comparable to that of covalent AD4 within the first half of the library. Considering the fraction of compounds predicted within the distance cutoff by the purely computational protocol (top 38% of the set), covalent AD4 resulted in 50% sensitivity, 71% specificity and 62% accuracy, hence in a slightly poorer performance. Although covalent AD4 had an overall larger AUC (0.63) than WIDOCK based on computational parameters (0.51), this is the result of a detrimental enrichment in the second half of the set, past the distance cutoff (see also the enrichment plots reported in Supporting Figure S4) and also past the region relevant in most virtual screening applications. Overall, the results collected in this prospective study confirmed (a) the efficacy of WIDOCK in enriching for actives within the defined distance threshold, and (b) its superior virtual screening performance compared to covalent AD4 when dealing with highly diverse warhead libraries.
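The classification metrics quoted above can be reproduced from confusion-matrix counts, as in the following sketch. The counts are our reconstruction: they assume a 29-compound set (the N given for the reactivity correlation) with 12 experimental actives, and the three false positives of covalent AD4 are inferred from its 57% hit rate rather than stated explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Confusion:
    tp: int  # actives predicted within the cutoff
    fp: int  # inactives predicted within the cutoff
    fn: int  # actives missed
    tn: int  # inactives correctly rejected

    @property
    def sensitivity(self):
        return self.tp / (self.tp + self.fn)

    @property
    def specificity(self):
        return self.tn / (self.tn + self.fp)

    @property
    def accuracy(self):
        return (self.tp + self.tn) / (self.tp + self.fp + self.fn + self.tn)

    @property
    def hit_rate(self):
        return self.tp / (self.tp + self.fp)


# Reconstructed counts (assumption: 29 compounds, 12 actives, 17 inactives).
protocols = {
    "WIDOCK, experimental parameters": Confusion(tp=8, fp=0, fn=4, tn=17),
    "WIDOCK, computed parameters": Confusion(tp=7, fp=4, fn=5, tn=13),
    "covalent AD4": Confusion(tp=4, fp=3, fn=8, tn=14),
}

for name, c in protocols.items():
    print(f"{name}: sensitivity {c.sensitivity:.0%}, specificity {c.specificity:.0%}, "
          f"accuracy {c.accuracy:.0%}, hit rate {c.hit_rate:.0%}")
```

With these counts the rounded percentages agree with the values reported for all three protocols, which supports the assumed set size.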

The labelling of MAO-A by compound 32 was confirmed by MS/MS studies. Proteomics studies revealed that 32 forms a covalent bond with Cys323 located at the active site of MAO-A (Supporting Figure S5).

Docking poses predicted for two compounds found to inhibit MAO-A activity (32 and 45) are shown in Fig. 14. They illustrate how WIDOCK was able to predict active ligands bearing different warheads that react through different reaction mechanisms (32 via Michael addition and 45 via nucleophilic substitution). Docking poses generated by the non-covalent docking in AD4 are used again as reference. For the acrylamide-based compound 32, the reactive atom in the WIDOCK pose was found to be within bonding distance from the cysteine sulfur. By contrast, the best scoring pose provided by non-covalent docking placed the warhead farther away from the cysteine, showing a hydrogen bond interaction between the acrylamide-NH and the backbone carbonyl of Val210. The reactive carbon of the α-bromo-acetophenone 45 in the best scoring pose was placed at short distance from the cysteine sulfur by WIDOCK. On the other hand, non-covalent docking led to a flipped binding mode due to an H-bond interaction between the carbonyl oxygen and the hydroxyl group of Ser209. Covalent AD4 docking poses are also included to show differences in the predicted binding modes, although neither of the two was predicted among the best scoring ones by this protocol. Furthermore, poses generated by WIDOCK using experimental and computed reactivity parameters were found to be highly overlapping in the majority of cases, thus further supporting computational parametrization.

We compared the distances between the reactive atom pairs in the poses provided by WIDOCK and by the standard non-covalent AD4. Similarly to what we observed for MurA and CatB (see Fig. 9), the difference increased in parallel with the inhibitory activities (Fig. 15). These data confirm the tendency that the larger the inhibitory activity, the more pronounced the effect of the pseudo-Lennard–Jones potential in producing a short interatomic separation between the reacting atoms. This finding underlines the importance of including reactivity information in a docking protocol for covalent binders.

Conclusion

Virtual screening of covalent inhibitors needs a robust docking-scoring scheme applicable to compounds with a wide range of covalent warheads. We presented WIDOCK as a reactive docking protocol that uses a ligand reactivity-based pseudo-Lennard–Jones potential in AutoDock4 to enable virtual screening of diverse warhead libraries. Ligand reactivities were derived from kinetic data obtained either from experiments or from quantum chemical calculations. WIDOCK was evaluated retrospectively against experimental data obtained for focused sets of diverse electrophiles against three targets, KRAS^G12C, MurA and cathepsin B. Additionally, larger electrophilic fragment libraries with limited warhead diversity were screened against OTUB2 and NUDT7. Results were also contrasted to those obtained by covalent docking in AutoDock4. WIDOCK retrieved experimental actives with high sensitivity (true positive rate) and outperformed the dedicated covalent docking module of AutoDock4 in terms of early enrichments and ROC curves when screening libraries that spanned more diverse warhead chemotypes, while comparable or worse performances were obtained with sets characterized by lower variability in the reacting groups and in the corresponding intrinsic reactivities. When tested prospectively for discovering MAO-A inhibitors with a new mechanism of action targeting Cys323, eight and seven actives (TPR: 67 and 58%) were identified with experimentally and computationally parametrized potentials, respectively. One of these compounds was proven to label Cys323 by subsequent MS proteomics measurements. To the best of our knowledge, this is the first experimentally validated case in which MAO-A inhibition was achieved via direct Cys323 labelling. These results demonstrate that this warhead-sensitive docking protocol can be considered a useful tool for the discovery of cysteine-targeting covalent inhibitors. 
Furthermore, it was shown for compounds acting via Michael addition and nucleophilic substitution that the linear relationship between experimental and computed reactivities makes it possible to use computational parametrization of reactivity-based docking without a significant loss of accuracy. Therefore, we believe that the present parameter set of WIDOCK could be easily extended to new ligands acting with the same reaction mechanism. The warhead-sensitive nature of WIDOCK supports, for the first time, the parallel optimization of non-covalent and covalent interactions, which might contribute to the identification of more specific and safer covalent inhibitors.

References

Baillie TA (2016) Targeted covalent inhibitors for drug design. Angew Chem Int Ed 55:13408–13421. https://doi.org/10.1002/anie.201601091
Singh J, Petter RC, Baillie TA, Whitty A (2011) The resurgence of covalent drugs. Nat Rev Drug Discov 10:307–317. https://doi.org/10.1038/nrd3410
Bauer RA (2015) Covalent inhibitors in drug discovery: from accidental discoveries to avoided liabilities and designed therapies. Drug Discov Today 20:1061–1073. https://doi.org/10.1016/j.drudis.2015.05.005
Shannon DA, Weerapana E (2015) Covalent protein modification: the current landscape of residue-specific electrophiles. Curr Opin Chem Biol 24:18–26. https://doi.org/10.1016/j.cbpa.2014.10.021
Smith AJT, Zhang X, Leach AG, Houk KN (2009) Beyond picomolar affinities: quantitative aspects of noncovalent and covalent binding of drugs to proteins. J Med Chem 52:225–233. https://doi.org/10.1021/jm800498e
González-Bello C (2016) Designing irreversible inhibitors-worth the effort? ChemMedChem 11:22–30. https://doi.org/10.1002/cmdc.201500469
De Cesco S, Kurian J, Dufresne C, Mittermaier AK, Moitessier N (2017) Covalent inhibitors design and discovery. Eur J Med Chem 138:96–114. https://doi.org/10.1016/j.ejmech.2017.06.019
Tuley A, Fast W (2018) The taxonomy of covalent inhibitors. Biochemistry 57:3326–3337. https://doi.org/10.1021/acs.biochem.8b00315
Ábrányi-Balogh P, Petri L, Imre T, Szijj P, Scarpino A, Hrast M, Mitrović A, Fonovič UP, Németh K, Barreteau H, Roper DI, Horváti K, Ferenczy GG, Kos J, Ilaš J, Gobec S, Keserű GM (2018) A road map for prioritizing warheads for cysteine targeting covalent inhibitors. Eur J Med Chem 160:94–107. https://doi.org/10.1016/j.ejmech.2018.10.010
Jones G, Willett P, Glen RC, Leach AR, Taylor R (1997) Development and validation of a genetic algorithm for flexible docking. J Mol Biol 267:727–748. https://doi.org/10.1006/JMBI.1996.0897
Verdonk ML, Cole JC, Hartshorn MJ, Murray CW, Taylor RD (2003) Improved protein-ligand docking using GOLD. Proteins Struct Funct Bioinform 52:609–623. https://doi.org/10.1002/prot.10465
Verdonk ML, Berdini V, Hartshorn MJ, Mooij WTM, Murray CW, Taylor RD, Watson P (2004) Virtual screening using protein−ligand docking: avoiding artificial enrichment. J Chem Inf Comput Sci 44:793–806. https://doi.org/10.1021/ci034289q
Morris GM, Huey R, Lindstrom W, Sanner MF, Belew RK, Goodsell DS, Olson AJ (2009) AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J Comput Chem 30:2785–2791. https://doi.org/10.1002/jcc.21256
Bianco G, Forli S, Goodsell DS, Olson AJ (2016) Covalent docking using Autodock: two-point attractor and flexible side chain methods. Protein Sci 25:295–301. https://doi.org/10.1002/pro.2733
Abagyan R, Totrov M, Kuznetsov D (1994) ICM—a new method for protein modeling and design: applications to docking and structure prediction from the distorted native conformation. J Comput Chem 15:488–506. https://doi.org/10.1002/jcc.540150503
Katritch V, Byrd CM, Tseitin V, Dai D, Raush E, Totrov M, Abagyan R, Jordan R, Hruby DE (2007) Discovery of small molecule inhibitors of ubiquitin-like poxvirus proteinase I7L using homology modeling and covalent docking approaches. J Comput Aided Mol Des 21:549–558. https://doi.org/10.1007/s10822-007-9138-7
Zhu K, Borrelli KW, Greenwood JR, Day T, Abel R, Farid RS, Harder E (2014) Docking covalent inhibitors: a parameter free approach to pose prediction and scoring. J Chem Inf Model 54:1932–1940. https://doi.org/10.1021/ci500118s
Toledo Warshaviak D, Golan G, Borrelli KW, Zhu K, Kalid O (2014) Structure-based virtual screening approach for discovery of covalently bound ligands. J Chem Inf Model 54:1941–1950. https://doi.org/10.1021/ci500175r
Backus KM, Correia BE, Lum KM, Forli S, Horning BD, González-Páez GE, Chatterjee S, Lanning BR, Teijaro JR, Olson AJ, Wolan DW, Cravatt BF (2016) Proteome-wide covalent ligand discovery in native biological systems. Nature 534:570–574. https://doi.org/10.1038/nature18002
Forli S, Botta M (2007) Lennard-Jones potential and dummy atom settings to overcome the AUTODOCK limitation in treating flexible ring systems. J Chem Inf Model 47:1481–1492. https://doi.org/10.1021/ci700036j
Mortenson DE, Brighty GJ, Plate L, Bare G, Chen W, Li S, Wang H, Cravatt BF, Forli S, Powers ET, Sharpless KB, Wilson IA, Kelly JW (2018) “Inverse drug discovery” strategy to identify proteins that are targeted by latent electrophiles as exemplified by aryl fluorosulfates. J Am Chem Soc 140:200–210. https://doi.org/10.1021/jacs.7b08366
Zheng Q, Woehl JL, Kitamura S, Santos-Martins D, Smedley CJ, Li G, Forli S, Moses JE, Wolan DW, Sharpless KB (2019) SuFEx-enabled, agnostic discovery of covalent inhibitors of human neutrophil elastase. Proc Natl Acad Sci USA 116:18808–18814. https://doi.org/10.1073/pnas.1909972116
Scarpino A, Ferenczy GG, Keserű GM (2018) Comparative evaluation of covalent docking tools. J Chem Inf Model 58:1441–1458. https://doi.org/10.1021/acs.jcim.8b00228
Palazzesi F, Grundl M, Pautsch A, Weber A, Tautermann C (2019) A fast ab initio predictor tool for covalent reactivity estimation of acrylamides. J Chem Inf Model 59:3565–3571. https://doi.org/10.1021/acs.jcim.9b00316
McGregor LM, Jenkins ML, Kerwin C, Burke JE, Shokat KM (2017) Expanding the scope of electrophiles capable of targeting K-Ras oncogenes. Biochemistry 56:3178–3183. https://doi.org/10.1021/acs.biochem.7b00271
Prior IA, Lewis PD, Mattos C (2012) A comprehensive survey of ras mutations in cancer. Cancer Res 72:2457–2467. https://doi.org/10.1158/0008-5472.CAN-11-2612
Ostrem JM, Peters U, Sos ML, Wells JA, Shokat KM (2013) K-Ras(G12C) inhibitors allosterically control GTP affinity and effector interactions. Nature 503:548–551. https://doi.org/10.1038/nature12796
Nnadi CI, Jenkins ML, Gentile DR, Bateman LA, Zaidman D, Balius TE, Nomura DK, Burke JE, Shokat KM, London N (2018) Novel K-Ras G12C switch-II covalent binders destabilize Ras and accelerate nucleotide exchange. J Chem Inf Model 58:464–471. https://doi.org/10.1021/acs.jcim.7b00399
Patricelli MP, Janes MR, Li L-S, Hansen R, Peters U, Kessler LV, Chen Y, Kucharski JM, Feng J, Ely T, Chen JH, Firdaus SJ, Babbar A, Ren P, Liu Y (2016) Selective inhibition of oncogenic KRAS output with small molecules targeting the inactive state. Cancer Discov 6:316–329. https://doi.org/10.1158/2159-8290.CD-15-1105
El Zoeiby A, Sanschagrin F, Levesque RC (2002) Structure and function of the Mur enzymes: development of novel inhibitors. Mol Microbiol 47:1–12. https://doi.org/10.1046/j.1365-2958.2003.03289.x
Hrast M, Sosič I, Šink R, Gobec S (2014) Inhibitors of the peptidoglycan biosynthesis enzymes MurA-F. Bioorg Chem 55:2–15. https://doi.org/10.1016/j.bioorg.2014.03.008
Silver LL (2012) Rational approaches to antibacterial discovery: pre-genomic directed and phenotypic screening. Antibiotic discovery and development. Springer, Boston, MA, pp 33–75
Bian B, Mongrain S, Cagnol S, Langlois M-J, Boulanger J, Bernatchez G, Carrier JC, Boudreau F, Rivard N (2016) Cathepsin B promotes colorectal tumorigenesis, cell invasion, and metastasis. Mol Carcinog 55:671–687. https://doi.org/10.1002/mc.22312
Kos J, Mitrović A, Mirković B (2014) The current stage of cathepsin B inhibitors as potential anticancer agents. Future Med Chem 6:1355–1371. https://doi.org/10.4155/fmc.14.73
Olson OC, Joyce JA (2015) Cysteine cathepsin proteases: regulators of cancer progression and therapeutic response. Nat Rev Cancer 15:712–729. https://doi.org/10.1038/nrc4027
Ruan H, Hao S, Young P, Zhang H (2015) Targeting cathepsin B for cancer therapies. Horizons Cancer Res 56:23–40
Murata M, Miyashita S, Yokoo C, Tamai M, Hanada K, Hatayama K, Towatari T, Nikawa T, Katunuma N (1991) Novel epoxysuccinyl peptides selective inhibitors of cathepsin B, in vitro. FEBS Lett 280:307–310. https://doi.org/10.1016/0014-5793(91)80318-w
Vasiljeva O, Reinheckel T, Peters C, Turk D, Turk V, Turk B (2007) Emerging roles of cysteine cathepsins in disease and their potential as drug targets. Curr Pharm Des 13:387–403. https://doi.org/10.2174/138161207780162962
Mirković B, Renko M, Turk S, Sosič I, Jevnikar Z, Obermajer N, Turk D, Gobec S, Kos J (2011) Novel mechanism of cathepsin B inhibition by antibiotic nitroxoline and related compounds. ChemMedChem 6:1351–1356. https://doi.org/10.1002/cmdc.201100098
Gobec S, Frlan R (2006) Inhibitors of cathepsin B. Curr Med Chem 13:2309–2327. https://doi.org/10.2174/092986706777935122
Barrett AJ, Kembhavi AA, Brown MA, Kirschke H, Knight CG, Tamai M, Hanada K (1982) l-Trans-epoxysuccinyl-leucylamido(4-guanidino)butane (E-64) and its analogues as inhibitors of cysteine proteinases including cathepsins B, H and L. Biochem J 201:189–198. https://doi.org/10.1042/bj2010189
Schenker P, Alfarano P, Kolb P, Caflisch A, Baici A (2008) A double-headed cathepsin B inhibitor devoid of warhead. Protein Sci 17:2145–2155. https://doi.org/10.1110/ps.037341.108
Resnick E, Bradley A, Gan J, Douangamath A, Krojer T, Sethi R, Geurink PP, Aimon A, Amitai G, Bellini D, Bennett J, Fairhead M, Fedorov O, Gabizon R, Gan J, Guo J, Plotnikov A, Reznik N, Ruda GF, Díaz-Sáez L, Straub VM, Szommer T, Velupillai S, Zaidman D, Zhang Y, Coker AR, Dowson CG, Barr HM, Wang C, Huber KVM, Brennan PE, Ovaa H, Von Delft F, London N (2019) Rapid covalent-probe discovery by electrophile-fragment screening. J Am Chem Soc 141:8951–8968. https://doi.org/10.1021/jacs.9b02822
Li S, Zheng H, Mao A-P, Zhong B, Li Y, Liu Y, Gao Y, Ran Y, Tien P, Shu H-B (2010) Regulation of virus-triggered signaling by OTUB1- and OTUB2-mediated deubiquitination of TRAF3 and TRAF6. J Biol Chem 285:4291–4297. https://doi.org/10.1074/jbc.M109.074971
Kudo LC, Parfenova L, Vi N, Lau K, Pomakian J, Valdmanis P, Rouleau GA, Vinters HV, Wiedau-Pazos M, Karsten SL (2010) Integrative gene-tissue microarray-based approach for identification of human disease biomarkers: application to amyotrophic lateral sclerosis. Hum Mol Genet 19:3233–3253. https://doi.org/10.1093/hmg/ddq232
Beck A, Vinik Y, Shatz-Azoulay H, Isaac R, Streim S, Jona G, Boura-Halfon S, Zick Y (2013) Otubain 2 is a novel promoter of beta cell survival as revealed by siRNA high-throughput screens of human pancreatic islets. Diabetologia 56:1317–1326. https://doi.org/10.1007/s00125-013-2889-x
Gasmi L, McLennan AG (2001) The mouse Nudt7 gene encodes a peroxisomal nudix hydrolase specific for coenzyme A and its derivatives. Biochem J 357:33–38. https://doi.org/10.1042/0264-6021:3570033
McLennan AG (2006) The nudix hydrolase superfamily. Cell Mol Life Sci C 63:123–143. https://doi.org/10.1007/s00018-005-5386-7
Jackowski S, Leonardi R (2014) Deregulated coenzyme A, loss of metabolic flexibility and diabetes. Biochem Soc Trans 42:1118–1122. https://doi.org/10.1042/BST20140156
Riederer P, Lachenmayer L, Laux G (2004) Clinical applications of MAO-inhibitors. Curr Med Chem 11:2033–2043. https://doi.org/10.2174/0929867043364775
Yanez M, Fernando Padin J, Alberto Arranz-Tagarro J, Camina M, Laguna R (2013) History and therapeutic use of MAO-A inhibitors: a historical perspective of MAO-A inhibitors as antidepressant drug. Curr Top Med Chem 12:2275–2282. https://doi.org/10.2174/1568026611212200011
Finberg JPM, Rabey JM (2016) Inhibitors of MAO-A and MAO-B in psychiatry and neurology. Front Pharmacol 7:340. https://doi.org/10.3389/fphar.2016.00340
Johnston JP (1968) Some observations upon a new inhibitor of monoamine oxidase in brain tissue. Biochem Pharmacol 17:1285–1297. https://doi.org/10.1016/0006-2952(68)90066-X
Schrödinger Inc (2019) Schrödinger release 2019-4: LigPrep. Schrödinger Inc, New York, NY
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucleic Acids Res 28:235–242. https://doi.org/10.1093/nar/28.1.235
Skarzynski T, Mistry A, Wonacott A, Hutchinson SE, Kelly VA, Duncan K (1996) Structure of UDP-N-acetylglucosamine enolpyruvyl transferase, an enzyme essential for the synthesis of bacterial peptidoglycan, complexed with substrate UDP-N-acetylglucosamine and the drug fosfomycin. Structure 4:1465–1474. https://doi.org/10.1016/S0969-2126(96)00153-0
Han H, Yang Y, Olesen SH, Becker A, Betzi S, Schönbrunn E (2010) The fungal product terreic acid is a covalent inhibitor of the bacterial cell wall biosynthetic enzyme UDP-N-acetylglucosamine 1-carboxyvinyltransferase (MurA). Biochemistry 49:4276–4282. https://doi.org/10.1021/bi100365b
Wei B, Gunzner-Toste J, Yao H, Wang T, Wang J, Xu Z, Chen J, Wai J, Nonomiya J, Tsai SP, Chuh J, Kozak KR, Liu Y, Yu S-F, Lau J, Li G, Phillips GD, Leipold D, Kamath A, Su D, Xu K, Eigenbrot C, Steinbacher S, Ohri R, Raab H, Staben LR, Zhao G, Flygare JA, Pillow TH, Verma V, Masterson LA, Howard PW, Safina B (2018) Discovery of peptidomimetic antibody-drug conjugate linkers with enhanced protease specificity. J Med Chem 61:989–1000. https://doi.org/10.1021/acs.jmedchem.7b01430
De Colibus L, Li M, Binda C, Lustig A, Edmondson DE, Mattevi A (2005) Three-dimensional structure of human monoamine oxidase A (MAO A): relation to the structures of rat MAO A and human MAO B. Proc Natl Acad Sci 102:12684–12689. https://doi.org/10.1073/pnas.0505975102
Schrödinger Inc (2019) Schrödinger release 2019-4: Protein Preparation Wizard. Schrödinger Inc, New York, NY
Madhavi Sastry G, Adzhigirey M, Day T, Annabhimoju R, Sherman W (2013) Protein and ligand preparation: parameters, protocols, and influence on virtual screening enrichments. J Comput Aided Mol Des 27:221–234. https://doi.org/10.1007/s10822-013-9644-8
Schrödinger Inc (2019) Schrödinger release 2019-4: QSite. Schrödinger Inc, New York, NY
Philipp DM, Friesner RA (1999) Mixed ab initio QM/MM modeling using frozen orbitals and tests with alanine dipeptide and tetrapeptide. J Comput Chem 20:1468–1494. https://doi.org/10.1002/(SICI)1096-987X(19991115)20:14%3c1468::AID-JCC2%3e3.0.CO;2-0
Murphy RB, Philipp DM, Friesner RA (2000) A mixed quantum mechanics/molecular mechanics (QM/MM) method for large-scale modeling of chemistry in protein environments. J Comput Chem 21:1442–1457. https://doi.org/10.1002/1096-987X(200012)21:16%3c1442::AID-JCC3%3e3.0.CO;2-O
Soylu I, Marino SM (2017) Cpipe: a comprehensive computational platform for sequence and structure-based analyses of cysteine residues. Bioinformatics 33:2395–2396. https://doi.org/10.1093/bioinformatics/btx181
Cavallo L, Kleinjung J, Fraternali F (2003) POPS: a fast algorithm for solvent accessible surface areas at atomic and residue level. Nucleic Acids Res 31:3364–3366. https://doi.org/10.1093/nar/gkg601
Cee VJ, Volak LP, Chen Y, Bartberger MD, Tegley C, Arvedson T, McCarter J, Tasker AS, Fotsch C (2015) Systematic study of the glutathione (GSH) reactivity of N-arylacrylamides: 1. Effects of aryl substitution. J Med Chem 58:9171–9178. https://doi.org/10.1021/acs.jmedchem.5b01018
Lonsdale R, Burgess J, Colclough N, Davies NL, Lenz EM, Orton AL, Ward RA (2017) Expanding the armory: predicting and tuning covalent warhead reactivity. J Chem Inf Model 57:3124–3137. https://doi.org/10.1021/acs.jcim.7b00553
Flanagan ME, Abramite JA, Anderson DP, Aulabaugh A, Dahal UP, Gilbert AM, Li C, Montgomery J, Oppenheimer SR, Ryder T, Schuff BP, Uccello DP, Walker GS, Wu Y, Brown MF, Chen JM, Hayward MM, Noe MC, Obach RS, Philippe L, Shanmugasundaram V, Shapiro MJ, Starr J, Stroh J, Che Y (2014) Chemical and computational methods for the characterization of covalent reactive groups for the prospective design of irreversible inhibitors. J Med Chem 57:10072–10079. https://doi.org/10.1021/jm501412a
Frisch MJ, Trucks GW, Schlegel HB, Scuseria GE, Robb MA, Cheeseman JR, Scalmani G, Barone V, Mennucci B, Petersson GA (2009) Gaussian 09. Gaussian Inc, Wallingford
Zhao Y, Truhlar DG (2008) The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other functionals. Theor Chem Acc 120:215–241. https://doi.org/10.1007/s00214-007-0310-x
Rokob TA, Hamza A, Pápai I (2007) Computing reliable energetics for conjugate addition reactions. Org Lett 9:4279–4282. https://doi.org/10.1021/ol701872z
Goerigk L, Grimme S (2011) A thorough benchmark of density functional methods for general main group thermochemistry, kinetics, and noncovalent interactions. Phys Chem Chem Phys 13:6670. https://doi.org/10.1039/c0cp02984j
Bautista-Aguilera OM, Samadi A, Chioua M, Nikolic K, Filipic S, Agbaba D, Soriano E, de Andrés L, Rodríguez-Franco MI, Alcaro S, Ramsay RR, Ortuso F, Yañez M, Marco-Contelles J (2014) N-Methyl-N-((1-methyl-5-(3-(1-(2-methylbenzyl)piperidin-4-yl)propoxy)-1H-indol-2-yl)methyl)prop-2-yn-1-amine, a new cholinesterase and monoamine oxidase dual inhibitor. J Med Chem 57:10455–10463. https://doi.org/10.1021/jm501501a
von der Eltz H, Guder H-J, Muehlegger K (1990) New hydrolase substrates. US4900822A
Awoonor-Williams E, Rowley CN (2018) How reactive are druggable cysteines in protein kinases? J Chem Inf Model 58:1935–1946. https://doi.org/10.1021/acs.jcim.8b00454
Di Paolo ML, Cozza G, Milelli A, Zonta F, Sarno S, Minniti E, Ursini F, Rosini M, Minarini A (2019) Benextramine and derivatives as novel human monoamine oxidases inhibitors: an integrated approach. FEBS J 286:4995–5015. https://doi.org/10.1111/febs.14994

Acknowledgements

Helpful discussions with Imre Pápai and Dávid Bajusz are gratefully acknowledged. We acknowledge Claudia Binda for providing us with the MAO-A protein.

Funding

Open Access funding provided by ELKH Research Centre for Natural Sciences. This work has been supported by the Marie Sklodowska Curie Action (MSCA) Innovative Training Network grant FRAGNET, by the National Office for Research, Development and Innovation (2017-1.2.1-NKP-2017-00002, K111862 and PD124598 Grants), and by Slovenian Research Agency—ARRS (Grants Z1-1859, P1-0208 and L1-8157).

Author information

Authors and Affiliations

Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt 2, 1117, Budapest, Hungary
Andrea Scarpino, László Petri, Péter Ábrányi-Balogh, György G. Ferenczy & György M. Keserű
Faculty of Pharmacy, University of Ljubljana, Aškerčeva 7, 1000, Ljubljana, Slovenia
Damijan Knez & Stanislav Gobec
MS Metabolomic Research Laboratory, Research Centre for Natural Sciences, Magyar tudósok krt 2, 1117, Budapest, Hungary
Tímea Imre

Corresponding author

Correspondence to György M. Keserű.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (DOCX 1409 KB)

Supplementary file 2 (XLSX 37 KB)

Supplementary file 3 (XLSX 12 KB)

Supplementary file 4 (XLSX 16 KB)

Supplementary file 5 (XLSX 82 KB)

Supplementary file 6 (XLSX 18 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Scarpino, A., Petri, L., Knez, D. et al. WIDOCK: a reactive docking protocol for virtual screening of covalent inhibitors. J Comput Aided Mol Des 35, 223–244 (2021). https://doi.org/10.1007/s10822-020-00371-5

Received: 18 June 2020
Accepted: 30 December 2020
Published: 18 January 2021
Issue Date: February 2021
DOI: https://doi.org/10.1007/s10822-020-00371-5
