An integrated modelling approach for targeted degradation: insights on optimization, data requirements and PKPD predictions from semi- or fully-mechanistic models and exact steady state solutions

Guzzetti, Sofia; Morentin Gutierrez, Pablo

doi:10.1007/s10928-023-09857-9

An integrated modelling approach for targeted degradation: insights on optimization, data requirements and PKPD predictions from semi- or fully-mechanistic models and exact steady state solutions

Original Paper
Open access
Published: 29 April 2023

Volume 50, pages 327–349, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Pharmacokinetics and Pharmacodynamics Aims and scope Submit manuscript

An integrated modelling approach for targeted degradation: insights on optimization, data requirements and PKPD predictions from semi- or fully-mechanistic models and exact steady state solutions

Download PDF

Sofia Guzzetti¹ &
Pablo Morentin Gutierrez¹

3918 Accesses
3 Citations
1 Altmetric
Explore all metrics

Abstract

The value of an integrated mathematical modelling approach for protein degraders which combines the benefits of traditional turnover models and fully mechanistic models is presented. Firstly, we show how exact solutions of the mechanistic models of monovalent and bivalent degraders can provide insight on the role of each system parameter in driving the pharmacological response. We show how on/off binding rates and degradation rates are related to potency and maximal effect of monovalent degraders, and how such relationship can be used to suggest a compound optimization strategy. Even convoluted exact steady state solutions for bivalent degraders provide insight on the type of observations required to ensure the predictive capacity of a mechanistic approach. Specifically for PROTACs, the structure of the exact steady state solution suggests that the total remaining target at steady state, which is easily accessible experimentally, is insufficient to reconstruct the state of the whole system at equilibrium and observations on different species (such as binary/ternary complexes) are necessary. Secondly, global sensitivity analysis of fully mechanistic models for PROTACs suggests that both target and ligase baselines (actually, their ratio) are the major sources of variability in the response of non-cooperative systems, which speaks to the importance of characterizing their distribution in the target patient population. Finally, we propose a pragmatic modelling approach which incorporates the insights generated with fully mechanistic models into simpler turnover models to improve their predictive ability, hence enabling acceleration of drug discovery programs and increased probability of success in the clinic.

A kinetic proofreading model for bispecific protein degraders

Article 22 October 2020

Delivering on the promise of protein degraders

Article 21 February 2023

QSP Toolbox: Computational Implementation of Integrated Workflow Components for Deploying Multi-Scale Mechanistic Models

Article Open access 24 May 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Mechanistic modelling has proven extremely valuable not only in enhancing the understanding of both traditional and new modalities mechanism of action [1,2,3,4], but also in supporting and guiding compound optimization by shedding light on Structure-Activity Relationship (SAR). Especially for emerging new modalities where little is known about the mechanism or where technology lags behind science in generating reliable data on biological or pharmacological quantities of interest, inexpensive high-throughput model simulations can to some extent “bridge the gap” between science and technology by predicting the response of a biological system in scenarios of interest and, more importantly, by identifying which mechanistic parameters are key drivers of the response. Such quantitative understanding can ultimately provide a robust rationale to identify which missing data would be most informative in building an understanding of the pharmacology, and hence which technologies need prioritization for data generation.

On the other hand, though, the use of mechanistic models to explain available data can be impractical and potentially even misleading without a thorough understanding of the amount and type of data that is required to uniquely identify the model parameters. While modelling software is designed to always return parameter estimates, if the model structure is excessively articulated (over-parametrized) for the data, or, conversely, if the amount and/or type of data is insufficient or too simplistic to inform the building blocks of a mechanistic model, multiple parameter estimates can provide an equally optimal data fit. However, the uncertainty on those estimates is typically high, or their reliability low, i.e. the output values are highly unlikely to be truly representative of the biological, chemical or pharmacological quantity they encode (e.g., protein baselines, endogenous turnover rates, on/off binding rates, drug-induced degradation rates, ...). In such case the risk is over-interpreting the parameter values, potentially leading to wrong decisions or scientific conclusions at different levels in a drug discovery cascade, from compound optimization to assessment of therapeutic potential.

Although much has been published on identifiability analysis of differential equations to address this issue [5, 6], the minimal required data on target kinetics can be challenging to generate, and even when this is not the case its generation requires long timelines and significant resources. Actually, in many cases kinetics data provide a level of detail that goes beyond the necessary and sufficient needs for pragmatically understanding the underlying mechanism of action (MoA). On the contrary, the steady state (i.e., dynamical equilibrium) of a biological or pharmacological system can be enough to provide insights on the MoA and hence to build a robust and reliable predictive model from higher throughput and more easily accessible data [7,8,9,10]. Moreover, the steady state can often be mechanistically described by simpler (non-linear) algebraic equations directly derived from a parent set of differential equations, for which an exact solution might even be available (depending on the mathematical structure of the system) [7, 8, 11, 12]. Although the resulting mathematical formulae may still retain some level of complexity (and they typically do), they can facilitate the identification of independent surrogate parameters, i.e. compositions of the original parameters into sums, ratios or other functional forms, which can lead to a reduction of the model parametrization and, ultimately, to improved model reliability. Such approach to identifiability can further be combined with global sensitivity analysis [13] to identify which of the original mechanistic parameters dominate the system response variability, and hence whose experimental measurements would be most impactful.

Whenever an exact steady state solution cannot be calculated or its complexity is still prohibitive, semi-mechanistic models such as indirect-response (turnover) models [14] offer an alternative option. Such models are easier to use in practice due to the smaller number of parameters that require fitting, however while they can successfully explain the data in specific studies or experimental settings, they lack information on binding kinetics or baseline levels, which makes them prone to failure at predicting the response for different compounds or cell lines [15, 16].

In this manuscript we propose an integrated modelling approach which leverages the benefits of each type of model to mitigate the limitations of the others. Firstly we show how exact mechanistic steady state solutions of the MoA of binary- and ternary-complex degraders can (i) provide noise-free insight on the role of each system parameter in driving the pharmacological response regardless of its specific value, (ii) suggest which mechanistic knowledge can be confidently extracted from single time point data and (iii) which additional data needs to be collected to enhance the mechanistic understanding of the system, and (iv) ultimately, inform compound optimization, data generation and resource prioritization.

Secondly, we show how global sensitivity analysis of fully mechanistic models can help to identify the key drivers of the response.

Finally, we propose a pragmatic modelling approach which incorporates the insights generated with fully mechanistic models into simpler turnover models to improve their predictive ability, hence enabling acceleration of drug discovery programs and increased probability of success in the clinic.

Methods

Exact solution of bilinear systems

The life cycle of biological entities and their interaction with chemical compounds can be described via non-linear systems of ordinary differential equations (ODEs), where the type of non-linearity is often bilinear or at most quadratic [17]. Exact steady-state solutions of such systems are challenging to compute and rarely available in closed form, due to potentially many chemical species involved and corresponding interactions, resulting in a large number of non-linear equations. Nevertheless, in some cases implicit relationships can be obtained and solved numerically. Not only, they can inform on the system identifiability, i.e. which parameters (or surrogates thereof) can be uniquely estimated from the data. A mathematical method to obtain an implicit, exact steady state solution to chemical reaction networks with bilinear rate laws is described in [18] and is summarized in Appendix A. Briefly, such method applies thoughtful algebraic manipulations to leverage the linear component of the system while segregating the non-linear part to its core, which can then be solved numerically.

Monovalent degraders

In this paper we refer to compounds that induce degradation of a protein of interest (PoI) by binding solely to it as Monovalent Degraders (MDs). It is very likely that other components are required for degradation of the protein (e.g. the proteasome), nevertheless no assumption is made here on the mechanism of degradation. The only assumption is that the compound needs to bind solely to the PoI to induce its degradation with no requirement to bind any other component of the system (differently from bivalent degraders).

A mechanistic model for monovalent degraders

We assume that under endogenous conditions (i.e. in absence of compound) the target protein synthesis and degradation are zero- and first-order processes, respectively, with corresponding rates $k_{\text {syn}}$, $k_{\text {deg}}$. When compound is added, binding kinetics governed by on/off rates $k^{\text {T}}_{\text {on}}$, $k^{\text {T}}_{\text {off}}$ leads to the formation of a binary complex, which induces degradation at a first order rate $k_{\text {MD}}$. Upon degradation the compound is released and recycled (Fig. 1).

Such mechanism can be mathematically described by the following system of ODEs:

$$\left\{ {\begin{array}{*{20}l} {\frac{{d{\text{T}}}}{{dt}} = k_{{{\text{syn}}}} - k_{{{\text{deg}}}} \cdot {\text{T}} + k_{{{\text{off}}}}^{{\text{T}}} \cdot {\text{TD}} - k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{D}}} \hfill \\ {\frac{{d{\text{D}}}}{{dt}} = - k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{D}} + (k_{{{\text{off}}}}^{{\text{T}}} + k_{{{\text{MD}}}} ) \cdot {\text{TD}}} \hfill \\ {\frac{{d{\text{TD}}}}{{dt}} = k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{D}} - (k_{{{\text{off}}}}^{{\text{T}}} + k_{{{\text{MD}}}} ) \cdot {\text{TD}}} \hfill \\ \end{array} } \right.$$

(1)

with initial conditions $\text {T}(0)=\text {T}_0=k_{\text {syn}}/k_{\text {deg}}$, $\text {D}(0)=\text {D}_0$, $\text {TD}(0)=0$, where $\text {T}$, $\text {D}$, and $\text {TD}$ represent target, drug, and binary complex time-varying concentrations, respectively. Being the MD recycled, the total MD concentration $\text {D}_0= \text {D}+\text {TD}$ is preserved. As a result, the last (or second) equation is redundant as the binary complex (or MD) concentration at any time can be obtained via the linear conservation law as

$$\begin{aligned} \text {TD}(t) = \text {D}_0- \text {D}(t). \end{aligned}$$

(2)

Since we are interested in the steady state, we set the derivatives in (1) to 0 to obtain a steady state model:

$$\left\{ {\begin{array}{*{20}l} {k_{{{\text{syn}}}} - k_{{{\text{deg}}}} \cdot {\text{T}} + k_{{{\text{off}}}}^{{\text{T}}} \cdot {\text{TD}} - k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{D}} = 0} \hfill \\ { - k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{D}} + (k_{{{\text{off}}}}^{{\text{T}}} + k_{{{\text{MD}}}} ) \cdot {\text{TD}} = 0} \hfill \\ {{\text{TD}} + {\text{D}} = {\text{D}}_{0} .} \hfill \\ \end{array} } \right.$$

(3)

Note that, while in system (1) $\text {T}$, $\text {D}$ and $\text {TD}$ are time-dependent variables, they are constant in (3) by definition of steady state (the same notation has been used for the sake of simplicity). Because system (3) is non-linear, calculating an exact solution is not straightforward. Nevertheless, this type of non-linearity (bilinear) falls within the category of tractable systems which can be solved with the mathematical method developed in [18].

PROTACs

Proteolysis targeting chimeras (PROTACs) are a novel drug modality that fosters degradation catalysis by co-opting an E3 ligase (e.g. Cereblon or VHL) to tag the targeted PoI for turnover by the proteasome. Such mechanism of action heavily relies on the formation of a ternary complex with the PoI and E3 ligase, which is known to be liable to auto-inhibition, i.e., impairment of ternary complex formation at high PROTAC concentrations due to an excess of PoI- or E3 ligase-PROTAC binary complex [19, 20]. In other words, while in a two-body binding system the amount of binary complex increases monotonically with the binder concentration up to ligand saturation following a sigmoidal relationship, ternary complex formation can decrease as the PROTAC concentration increases beyond a critical threshold and is hence typically described by a double sigmoidal (bell-shaped) function (Fig. 2, left). From a pharmacological perspective, such binding dynamics in a biological setting can result into the so-called hook effect [19], i.e., a loss of degradation at high PROTAC concentrations, with the implication that maximal effect can only be achieved within a certain concentration window, whose center and breadth are highly specific to each molecule (Fig. 2, right). It is worth noting that not all PROTACs display auto-inhibition in practice. Efforts to reduce the hook effect whenever present have been summarized by Cecchini et al. [21], nevertheless it is fair to say that additional work is required to fully understand how to pragmatically minimize this phenomenon.

PROTACs mechanistic modelling

Ternary complex formation (and subsequent degradation) can happen through two different pathways: from a PROTAC-target ($\text {PT}$) or PROTAC-ligase ($\text {PL}$) binary complex, where the extent of the contribution of each pathway is dictated by binding affinities, PROTAC concentration, and target and E3 ligase baselines (Fig. 3). Once the ternary complex is formed the PoI is degraded while PROTAC and E3 ligase are released and recycled back into the system. It is important to keep in mind that PROTACs are not themselves degraders, rather degradation catalysts. Therefore, the apparent “PROTAC degradation rate” ($k_{\text {PRO}}$) is in fact a surrogate, composite parameter that synthesizes (i) ubiquitin transfer rate (which depends on the stereochemistry), (ii) ubiquitination rate and (iii) proteasomal degradation rate as a whole. While the stereochemistry can be optimized – at least in principle – to facilitate the ubiquitin transfer, the latter two parameters are endogenous to the biological system and can dictate the overall degradation kinetics (i.e. can be rate-limiting). This underscores the importance of understanding biological differences across cell lines, which cannot be conceived separately from the compound’s kinetics.

In vitro data for different targets suggests that three degradation scenarios can occur to the PoI bound into a $\text {PT}$ binary complex:

1.
Endogenous degradation (flat line around the baseline)
2.
Low-moderate degradation (more efficient than endogenous degradation, less efficient than ternary complex degradation), for example when the target warhead is a MD itself (shallow degradation curve)
3.
Stabilization, i.e., the compound inhibits the endogenous degradation machinery (sigmoidal curve above baseline).

Such observation justified the introduction of a binary complex degradation rate ($k_{\text {MD}}$) as an independent model parameter (equal to, greater or less than the endogenous degradation rate, respectively, in the three scenarios described above).

This basic mechanistic model can be further customized to include, e.g., competition with metabolites or endogenous ligands (Fig. 3) – which may be relevant for early PROTACs whose PoI ligand consists of a pre-existing small molecule inhibitor which binds to an active site of the target protein [22], or by unfolding the PROTAC degradation rate in a series of transit compartments to better describe the ubiquitination or de-ubiquitination process, as done, e.g., in [23, 24]. Since the relevance of these model components may be target or chemo-type specific, for the sake of generality they will not be included in this analysis, although the methodology utilized here can be applied to an extended version of the model as well (provided the assumption of total PROTAC and total ligase conservation is met).

The governing equations are derived from mass balancing principles and they read as follows:

$$\left\{ {\begin{array}{*{20}l} {\frac{{d{\text{T}}}}{{dt}} = k_{{{\text{syn}}}} - k_{{{\text{deg}}}} \cdot {\text{T}} + k_{{{\text{off}}}}^{{{\text{PL}}}} \cdot {\text{TPL}} + k_{{{\text{off}}}}^{{\text{T}}} \cdot {\text{PT}} - k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{P}} - k_{{{\text{on}}}}^{{{\text{PL}}}} \cdot {\text{PL}} \cdot {\text{T}}} \hfill \\ {\frac{{d{\text{PL}}}}{{dt}} = k_{{{\text{off}}}}^{{{\text{PL}}}} \cdot {\text{TPL}} + k_{{{\text{PRO}}}} \cdot {\text{TPL}} - k_{{{\text{off}}}}^{{\text{L}}} \cdot {\text{PL}} + k_{{{\text{on}}}}^{{\text{L}}} \cdot {\text{L}} \cdot {\text{P}} - k_{{{\text{on}}}}^{{{\text{PL}}}} \cdot {\text{PL}} \cdot {\text{T}}} \hfill \\ {\frac{{d{\text{PT}}}}{{dt}} = k_{{{\text{off}}}}^{{{\text{PT}}}} \cdot {\text{TPL}} - k_{{{\text{MD}}}} \cdot {\text{PT}} - k_{{{\text{off}}}}^{{\text{T}}} \cdot {\text{PT}} + k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{P}} - k_{{{\text{on}}}}^{{{\text{PT}}}} \cdot {\text{PT}} \cdot {\text{L}}} \hfill \\ {\frac{{d{\text{TPL}}}}{{dt}} = - k_{{{\text{PRO}}}} \cdot {\text{TPL}} - k_{{{\text{off}}}}^{{{\text{PL}}}} \cdot {\text{TPL}} - k_{{{\text{off}}}}^{{{\text{PT}}}} \cdot {\text{TPL}} + k_{{{\text{on}}}}^{{{\text{PL}}}} \cdot {\text{PL}} \cdot {\text{T}} + k_{{{\text{on}}}}^{{{\text{PT}}}} \cdot {\text{PT}} \cdot {\text{L}}} \hfill \\ {\frac{{d{\text{L}}}}{{dt}} = - k_{{{\text{on}}}}^{{\text{L}}} \cdot {\text{L}} \cdot {\text{P}} - k_{{{\text{on}}}}^{{{\text{PT}}}} \cdot {\text{PT}} \cdot {\text{L}} + k_{{{\text{off}}}}^{{\text{L}}} \cdot {\text{PL}} + k_{{{\text{off}}}}^{{{\text{PT}}}} \cdot {\text{TPL}}} \hfill \\ {\frac{{d{\text{P}}}}{{dt}} = - k_{{{\text{on}}}}^{{\text{T}}} \cdot {\text{T}} \cdot {\text{P}} - k_{{{\text{on}}}}^{{\text{L}}} \cdot {\text{L}} \cdot {\text{P}} + k_{{{\text{MD}}}} \cdot {\text{PT}} + k_{{{\text{off}}}}^{{\text{T}}} \cdot {\text{PT}} + k_{{{\text{off}}}}^{{\text{L}}} \cdot {\text{PL}}} \hfill \\ \end{array} } \right.$$

(4)

with initial conditions $\text {T}(0)=\text {T}_0$, $\text {L}(0)=\text {L}_0$, $\text {P}(0)=\text {P}_0$, $\text {PL}(0)=\text {PT}(0)=\text {TPL}(0)=0$, where $\text {T}$, $\text {L}$, $\text {P}$, $\text {PT}$, $\text {PL}$, $\text {TPL}$ stand for target, ligase, PROTAC, PROTAC-target complex, PROTAC-ligase complex, and ternary complex concentration, respectively, and will be referred to as states of the system (dependence on time t has been suppressed to ease the notation). Model parameters are defined in Table 1. Note that on/off rates are correlated via cooperativity $\alpha$ as [25]:

$$\begin{aligned} \dfrac{k^{\text {PT}}_{\text {on}}}{k^{\text {PT}}_{\text {off}}}= \alpha \cdot \dfrac{k^{\text {T}}_{\text {on}}}{k^{\text {T}}_{\text {off}}}, \qquad \dfrac{k^{\text {PL}}_{\text {on}}}{k^{\text {PL}}_{\text {off}}}= \alpha \cdot \dfrac{k^{\text {L}}_{\text {on}}}{k^{\text {L}}_{\text {off}}}, \end{aligned}$$

(5)

and since the proportionality constant $\alpha$ is the same in (5), the relationship between on/off rates can be synthetically expressed as

$$\begin{aligned} \dfrac{k^{\text {T}}_{\text {off}}}{k^{\text {T}}_{\text {on}}}\dfrac{k^{\text {PT}}_{\text {on}}}{k^{\text {PT}}_{\text {off}}}= \dfrac{k^{\text {L}}_{\text {off}}}{k^{\text {L}}_{\text {on}}}\dfrac{k^{\text {PL}}_{\text {on}}}{k^{\text {PL}}_{\text {off}}}. \end{aligned}$$

(6)

This means that, even though the binding kinetics is governed by 8 parameters, only 7 of them are independent. In other words, knowing any 7 on/off rates is sufficient to calculate the remaining one via Eq (6). Adding endogenous synthesis and degradation rates ($k_{\text {syn}}$, $k_{\text {deg}}$) as well as binary and ternary complex degradation rates ($k_{\text {MD}}$, $k_{\text {PRO}}$) gives a total of 11 independent parameters.

Table 1 PROTAC model parameters: nomenclature and description

Full size table

Under the assumption of total PROTAC and total E3 ligase conservation the differential equations for free PROTAC and free E3 ligase are redundant as they can be obtained from conservation laws as

$$\left\{ {\begin{array}{*{20}l} {{\text{L}}(t)\, = \,{\text{L}}_{0} \, - \,{\text{PL}}(t)\, - \,{\text{TPL}}(t)} \hfill \\ {{\text{P}}(t)\, = \,{\text{P}}_{0} \, - \,{\text{PT}}(t)\, - \,{\text{PL}}(t)\, - \,{\text{TPL}}(t),} \hfill \\ \end{array} } \right.$$

(7)

where $\text {L}_0$ and $\text {P}_0$ are the ligase baseline and PROTAC concentration, respectively.

The steady state of the system is obtained by setting each derivative in (4) to 0, and the same method described in [18] previously adopted for MDs can be applied here.

Global sensitivity analysis

Sensitivity analysis is a powerful tool to understand how and to what extent each model parameter (input) affects the response (output) [13]. Local sensitivity analysis studies the system states variability as a single parameter varies, all the others being fixed. While this approach is extremely convenient for its simplicity of implementation and easiness of interpretation, it can be misleading as the sensitivity of the response to one parameter can depend on the values of all the other fixed parameters. In other words, the model output can be sensitive to a parameter $p^\star$ for a given set of the other parameter values and at the same time insensitive to $p^\star$ for a different set of fixed values. Therefore, this approach can be useful and unbiased only if confidence in the fixed parameters is high.

Differently, global sensitivity analysis studies the output variability over the whole parameter space, i.e. as all model parameters are changed simultaneously. As a result, the response variability characterization is more robust as it only depends on the assumed or observed distribution of each parameter on a given feasible range rather than on a single value [26]. At the same time, though, visualizing the response variability and quantifying the contribution of each parameter to it is increasingly challenging with the dimension of the parameter space: it is well known that the number of points to accurately sample a parameter space grows exponentially with the dimension of the parameter space itself (“curse of dimensionality”). In other words, if N samples are sufficient to describe the distribution of a single parameter, an order of $N^P$ points will be required to equivalently accurately sample P parameters. For instance, if 10 samples are used for each parameter of the PROTAC model (4) the total number of required samples would be approximately 100 billions ($10^{11}$). Each set of the 100 billion parameter combinations will generate a model output, and visualizing and interpreting 100 billion model simulations is clearly more challenging than plotting 10 of them (as a single parameter changes), as well as more computationally expensive.

A plethora of mathematical tools to quantify the impact of each parameter on the response and to tackle the curse of dimensionality are available. In this work Sobol indices [27] are used to assess the fraction of total variability associated with each parameter, which is a random variable represented via Polynomial Chaos Expansions (PCEs) [28,29,30,31]. Parameters are assumed to be uniformly distributed around a given mean and standard deviation, and are hence accordingly represented by first-order PCE of Legendre polynomials, which maximize the convergence rate according to the Askey scheme [32, 33]. Parameter uncertainty is propagated in the system via Non-Intrusive Spectral Projection (NISP) [34]. As a result, the stochastic model output can be represented as a PCE as well, whose coefficients can be easily used to calculate Sobol indices. In order to reduce the computational cost of sampling associated with NISP without compromising accuracy, Smolyak sparse quadratures are employed [35]. The sparsity level has been manually increased until no significant change in the Sobol indices estimates was observed. Uncertainty propagation via NISP and Sobol indices calculation was handled with the C++ library UQTk developed at the Sandia National Laboratories [36], embedded in a MATLAB implementation of the mechanistic PROTAC model (4).

Experimental data and modelling

In vitro

Test compounds were evaluated at 12 concentrations obtained with a 1:3 dilution factor, with two replicates per concentration. Remaining PoI levels were measured via Western Blots, ERD9 [37], Immunofluorescence [38] or HiBit technology [39] and expressed as a fraction of protein in DMSO treated cells (i.e. baseline).

The following bi-sigmoidal model describing the remaining fraction of target at steady state ($\widehat{\text {T}}_{\text {SS}}$) as a function of concentration (C) was fitted to each individual dose-response at steady state to capture any potential hook effect and calculate maximal degradation and potency:

$$\begin{aligned} \widehat{\text {T}}_{\text {SS}}(C) = 1 - \text {E}_{max}\dfrac{C^{h}}{C^{h}+\text {EC}_{50}^{h}}\left( 1 - \dfrac{\text {E}_{loss}}{\text {E}_{max}}\dfrac{C}{C+\text {IC}_{50}} \right) . \end{aligned}$$

(8)

Eq (8) describes the response as a combination of two sigmoidal curves with half-maximal concentrations $\text {EC}_{50}$ and $\text {IC}_{50}$, respectively. $\text {E}_{max}$ represents the overall maximal effect, while $\text {E}_{loss}$ can be interpreted as the fraction of degradation lost to the hook effect. To avoid over-parametrization the Hill coefficient of the sigmoid corresponding to the hook effect was fixed to 1, while it was estimated (h) for the sigmoid corresponding to increasing degradation. Because the response is the result of the contribution of two distinct sigmoidal curves, the observed potency ($\text {DC}_{50}$) may not exactly correspond to $\text {EC}_{50}$, therefore it was calculated numerically from (8) as the concentration delivering half-maximal effect. The concentration corresponding to maximal degradation ($\text {DC}_{max}$) was also calculated numerically as the root of the first derivative, and maximal degradation as $\text {D}_{max}= 1 -\widehat{\text {T}}_{\text {SS}}(\text {DC}_{max})$. Note that in absence of the hook effect ($\text {E}_{loss}=0$) Eq. (8) reduces to a simple sigmoidal function where $\text {E}_{max}= \text {D}_{max}$ and $\text {EC}_{50}= \text {DC}_{50}$.

Whenever dose-response time courses were available, model (8) was embedded into the following turnover model describing the fraction of target at any given time ($\widehat{\text {T}}$):

$$\begin{aligned} \dfrac{d\widehat{\text {T}}}{dt} = k_{\text {deg}}\cdot \left( 1 - \dfrac{\widehat{\text {T}}}{\widehat{\text {T}}_{\text {SS}}(C)} \right) , \end{aligned}$$

(9)

where the endogenous fractional turnover rate $k_{\text {deg}}$ was estimated or fixed to experimental data obtained from Stable Isotope Labeling with Amino acids in Cell culture (SILAC) [40].

Experimental data in Sect. 4.1.3 has been generated with AstraZeneca proprietary Selective Estrogen Receptor Degraders (SERDs) [41,42,43,44,45].

Endogenous PoI or E3 ligase levels in different cell lines were assessed via Western Blots.

In vivo

In vivo data was generated in NSG mice implanted with a patient-derived xenograft tumor model. C-PROTAC-006 was dosed orally on a daily schedule for three days at 30 mg/kg, 60 mg/kg or 100 mg/kg. Plasma concentration at different time points was quantified via Liquid Chromatography with tandem Mass Spectrometry (LC-MS/MS). Remaining protein levels relative to vehicle baseline levels were assessed by Western Blots at 6h, 24h, 48h after the last dose.

A one-compartmental pharmacokinetic model with first-order absorption was fitted to the plasma concentration data and used as a driver of the pharmacodynamic model (9) parametrized from in vitro data generated in the same cell line to obtain predictions of in vivo degradation kinetics.