FormalPara Key Points

This study provides a methodology to suppress the individual shrinkage of the usual individual estimates of the pharmacokinetic parameters (empirical Bayes estimate).

This method decreases the estimate's imprecision.

The use of this methodology is illustrated for the estimations of the individual pharmacokinetic parameters of isavuconazole and iohexol.

1 Introduction

Therapeutic drug monitoring (TDM) is used for some drugs to control drug exposure by choosing an appropriate dosage regimen (i.e. drug dosage and frequency of administration) [1].

Classical TDM checks whether or not a concentration is within a concentration range if measured at a predefined time and considers the exposure (i.e. the area under the curve) to be abnormal when the concentration is outside this range. It is very simple to use in practice. However, this classical TDM assumes a perfect correlation between the measured concentration and the associated exposure, which is not the case for all patients. Indeed, two patients could have two very similar concentrations for two samples collected at the same time and still have different pharmacokinetic profiles and hence different exposures. Interpreting a measured concentration without considering the physio-pathological characteristics of the patients on the one hand and by referring to the same range of concentrations for all patients on the other hand can lead to inappropriate dose adjustments. The use of a population pharmacokinetic (PPK) model to analyse the concentrations allows a better evaluation of the patient's exposure than classic TDM, through an estimation of their own pharmacokinetic parameters [2, 34]. These estimates are generally the empirical Bayes estimates (EBEs), computed by assuming that all the PPK model parameters are known without any imprecision [5, 6].

Unfortunately, EBEs can suffer from shrinkage [7]. Shrinkage is an average deviation of the individual pharmacokinetic (IPK) parameter estimate toward the population mean. Because shrinkage is a kind of bias (i.e. a systematic deviation), it should be possible to remove it. For example, Lavielle and Riba [8] proposed a method to limit the shrinkage when validating a PPK model. In the case of TDM, the problem is not an average but an individual deviation, so-called individual shrinkage. This individual shrinkage is specific to each patient (i.e. to their own pharmacokinetic parameter values), to the time when the blood samples are collected, and to the population's parameter values; this is why it is not usually removed [9].

Instead, an average shrinkage is provided for each pharmacokinetic parameter after a PPK model analysis. Usually this shrinkage is expressed as a percentage. When this average shrinkage is large, the general recommendation is to discard or at least not trust the IPK parameters calculated for a patient from only a few concentrations [910]. Thus, a large shrinkage can be a limitation of TDM based on the estimation of the IPK parameters, but it is not the only one. Indeed, it has been shown [1114] that it is not reasonable to expect a precise estimation of the pharmacokinetic parameters for a patient with only a few concentrations obtained from the patient, even using an optimal sampling strategy [151617]. Even without any shrinkage, several combinations of pharmacokinetic parameters can lead to the drug concentrations observed in a patient. One way to quantify this imprecision is thus the length of the interval, or more generally, the volume of the prediction region of all the possible values of pharmacokinetic parameters that are consistent with the observed concentrations. This imprecision is intrinsic to the way the IPK parameters are estimated; in other words, this imprecision is intrinsic to the EBEs. Using the EBEs to estimate the IPK parameters cannot therefore lead to any significant improvements.

The purpose of this work is to propose an estimate of IPK parameters with no shrinkage and with less imprecision than the EBEs.

2 Materials and Methods

2.1 Models and Notations

When a PPK analysis is performed, a model describing the evolution of concentrations over time is used. This model is often written as in Eq. (1).

$$\left\{ {\begin{array}{*{20}l} {\begin{array}{*{20}l} { Y_{ij} } & = & {f\left( {t_{ij} ; \varphi_{i} } \right) + g\left( {t_{ij} ;\varphi_{i} ;b} \right)\varepsilon_{ij} {\text{ where}}\,\, \varepsilon_{ij} \sim_{iid} {\mathbb{N}}\left( {0,1} \right)} \\ {\varphi_{i} } & = & {h\left( {\eta_{i} ,\theta ,A_{{i}}} \right) \quad {\text{where}} \quad \eta_{i} \sim_{iid} {\mathbb{N}}\left( {0,{\Omega }} \right)} \\ \end{array} } \\ \end{array} } \right.$$
(1)

where \(Y_{ij}\) is the jth concentration measured in individual i at the time \(t_{ij}\), \(\varphi_{i}\) is a vector containing the individual pharmacokinetic parameters for the ith individual, \(f\) and g are two known functions, \(\varepsilon_{ij}\) is the jth residual error for individual i at the time \(t_{ij}\) assumed to be independent of \(\varphi_{i}\), h is a known function usually assumed to be \(h\left( {\eta_{i} ,\theta ,A_{{i}}} \right) = A_{{i}}\theta e^{{\eta_{i} }}\), \(A_{i}\) is a known matrix of covariates of the ith individual, \(\theta\) is a fixed effect vector, \(\eta_{i}\) is the vector of the random effects involved in inter-individual variability for the ith individual, \({\Omega }\) is the variance–covariance matrix of the \(\eta_{i}\)s, and \(b\) is the vector of the parameters involved in the residual error model.

Data analysis with a PPK model as described by Eq. (1) provides estimates of \(\theta , \;{\Omega }\), and \(b\), respectively denoted by \(\hat{\theta },\;\; {\hat{\Omega }}\), and \(\hat{b}\). Of course, these estimates are imprecise. However, we will not take into account this uncertainty. In other words, herein, we consider these estimates to be fixed.

Once the model has been created and validated, it can be used daily for hospitalised patients. So, let us consider the concentrations \((Y_{j} )_{j = 1, \ldots , n}\) observed at the time \((t_{j} )_{j = 1,..,n}\) for a hospitalised patient with covariates \(A\) for whom TDM is requested. TDM should be based on the patient’s IPK parameters \(\varphi = h\left( {\eta ,\hat{\theta },A} \right)\), which are unknown when the blood sample is performed. However, a prediction of \(\varphi\) can be computed using both population parameters of the model and the patient’s concentrations \(\left( {Y_{j} } \right)_{j = 1, \ldots ,n}\).

A classical predictor of \(\varphi\) is the EBE obtained as \(\hat{\varphi } = h\left( {\hat{\eta },\hat{\theta },A} \right)\), where (Eq. 2)

$$\hat{\eta } = \mathop {{\text{argsup}}}\limits_{\eta } P\left( {\eta |Y_{1} , \ldots ,Y_{n} } \right)$$
(2)

In a more detailed manner (Eq. 3),

$$\hat{\eta } = \mathop {\arg \inf }\limits_{\eta } \begin{array}{*{20}l} {\underbrace {{\frac{1}{n}\mathop \sum \nolimits_{{j = 1}}^{n} \frac{{\left( {Y_{j} - f\left( {t_{j} ,h\left( {\eta ,\hat{\theta },A} \right)} \right)} \right)^{2} }}{{g^{2} \left( {t_{j} ,h\left( {\eta ,\hat{\theta },A} \right),\hat{b}~~} \right)}} + \ln g^{2} \left( {t_{j} ,h\left( {\eta ,\hat{\theta },A} \right),\hat{b}} \right)}}_{{\left( {T1} \right)}} + ~\underbrace {{\frac{1}{n}\eta ^{\prime } \hat{\Omega }^{{ - 1}} \eta }}_{{\left( {T2} \right)}}~} \\ \end{array} ~$$
(3)

The first term \(\left( {T1} \right)\) of the previous equation is a least squares term. The second term \(\left( {T2} \right)\) corresponds to a penalty. It penalises values deviating far from the population average.

In Eq. (3), when \(n\) increases, \(T2\) decreases, whereas \(T1\) tends towards a fixed value. In this case, the patient’s concentrations govern the \(\varphi\) prediction and the shrinkage should be low. By contrast, when \(n\) is low, \(T2\) dominates \(T1\) and the population parameters dominate the prediction, i.e. the \(\eta\) is closer to the population mean \(0\) than it should be. In this case, one can expect \(\hat{\eta }\) to be “shrunk” toward \(0\). Equation (3) can be interpreted as a special case of a criterion giving a ridge estimator, defined as (Eq. 4):

$$\hat{\eta }\left( \lambda \right) = \,{\text{arginf}}\frac{1}{n}\mathop \sum \limits_{{j = 1}}^{n} \frac{{\left( {Y_{j} - f\left( {t_{j} ,h\left( {\eta ,\hat{\theta },A} \right)} \right)} \right)^{2} }}{{g^{2} \left( {t_{j} ,h\left( {\eta ,\hat{\theta },A} \right),\hat{b}} \right)}} + \ln g^{2} \left( {t_{j} ,h\left( {\eta ,\hat{\theta },A} \right),\hat{b}} \right) + \lambda \frac{1}{n}\eta ^{\prime}\hat{\Omega }^{{ - 1}} \eta .$$
(4)

Actually, \({\text{EBE}}\left( \varphi \right) = \hat{\varphi }\left( 1 \right) = h\left( {\hat{\eta }\left( 1 \right),\hat{\theta },A} \right)\).

The only difference between Eqs. (3) and (4) is the presence of \(\lambda\) in \(T2\). If the predictions \(\hat{\varphi }\left( \lambda \right)\) of \(\varphi\) were good, \(\hat{\varphi }\left( \lambda \right)\) should be close to \(\varphi\), which is unknown. To circumvent this difficulty (i.e. \(\varphi\) being unknown), we evaluate the distance between \(\hat{\varphi }\left( \lambda \right)\) and \(\varphi\) using simulations by proceeding as follows:

  1. 1.

    A set of K vectors \(\eta_{1}^{*} ,\eta_{2}^{*} , \ldots , \eta_{K}^{*}\) are simulated where \(\eta_{k}^{*} \sim_{iid} {\mathbb{N}}\left( {0,{\hat{\Omega }}} \right)\) for \(k \in 1,\ldots,K.\) Then, for each \(k \in 1,\ldots,K\), \({{\varphi }}_{{\text{k}}}^{*} = { }h\left( {\eta_{k}^{*} ,\hat{\theta },A} \right){ }\) are computed.

  2. 2.

    For each \(k \in 1,\ldots,K\), \({Y}_{{{\text{kj}}}}^{*} = { }f\left( {t_{j} ; {{\varphi }}_{{\text{k}}}^{*} } \right) + g\left( {t_{j} ;{{\varphi }}_{{\text{k}}}^{*} ;\hat{b} } \right)\varepsilon_{j}^{*}\) are simulated where \(\varepsilon_{j}^{*} \sim_{iid} {\mathbb{N}}\left( {0,1} \right),\) for j = 1,…,n.

  3. 3.

    For a given \(\lambda\), one computes \(\hat{\eta }_{k}^{*} \left( \lambda \right)\) from Eq. (4) by using \(Y_{kj}^{*}\) instead of \(Y_{j}\).

In other words, for a vector \(A\) containing covariate values for a hospitalised patient, it is possible to simulate K different sets of IPK parameters \(\left( {\varphi_{k}^{*} } \right)\) (line 1). For each set of IPK parameters, it is possible to simulate concentrations \(\left( {Y_{kj}^{*} } \right)\) at the same sampling times as those of the hospitalised patient (line 2). From these concentrations \(Y_{kj}^{*}\), it is possible to estimate IPK parameters \(\left( {\hat{\eta }_{k}^{*} \left( \lambda \right)} \right)\) thanks to a Newton–Raphson algorithm (Eq. 4) for a given \(\lambda\) (line 3). Therefore, for K simulated patients, we have actual IPK parameters \((\eta_{k}^{*} )\) and predicted IPK parameters \(\left( {\hat{\eta }_{k}^{*} \left( \lambda \right)} \right)\) (line 3).

Once this \(\hat{\eta }_{k}^{*} \left( \lambda \right)\) is obtained, one can compute \(\hat{\varphi }_{k}^{*} \left( \lambda \right)\) = \(h\left( {\hat{\eta }_{k}^{*} \left( \lambda \right),\hat{\theta },A} \right)\). We can see whether \(\hat{\varphi }_{k}^{*} \left( \lambda \right)\) is a good predictor of \(\varphi_{k}^{*}\) by drawing \(\varphi_{k}^{*}\) as a function of \(\hat{\varphi }_{k}^{*} \left( \lambda \right)\) or equivalently \(\eta_{k}^{*}\) as a function of \(\hat{\eta }_{k}^{*} \left( \lambda \right).\) If it is a good predictor, the points should be concentrated around the first bisector line. Unfortunately, there may exist a systematic departure from this line, signalling a bias. This is why we look for a function \({\Gamma } \in {\mathcal{L}}\), which minimises the distance between the vectors \(\eta^{*}\) and \({\Gamma }\left( {\hat{\eta }^{*} \left( \lambda \right)} \right)\) (Eq. 5):

$${\Gamma }_{\lambda } = {\text{arginf}}_{{{\Gamma } \in {\mathcal{L}}}} \mathop \sum \limits_{k = 1}^{K} \left\| {\eta_{k}^{*} - {{ \Gamma }}\left( {\hat{\eta }_{k}^{*} \left( \lambda \right)} \right)} \right\|^{2} .$$
(5)

Depending on the space \({\mathcal{L}}\) where the \({\Gamma }\) functions are to be found, there should exist at least one function \({\Gamma }_{{\uplambda }}\) so that the minimum is reached. There exist many possible choices for \({\mathcal{L}}\). As any continuous functions can be locally approximated by polynomials, we chose \({\mathcal{L}}\) as the set of the local polynomial functions. This polynomial function can be estimated, for example, by locally estimated scatterplot smoothing (LOESS) regression. Next, we look for the value \(\hat{\lambda }\), so that (Eq. 6):

$$\hat{\lambda } = {\text{arginf}}_{\lambda } \mathop \sum \limits_{k = 1}^{K} \left\| {\eta_{k}^{*} - {{ \Gamma }}_{\lambda } \left( {{ }\hat{\eta }_{k}^{*} \left( \lambda \right)} \right)} \right\|^{2} .$$
(6)

Because \(\lambda \in {\text{R}}^{ + }\), Eq. (6) can be solved by looking for lambda in an adaptive discretisation of ]0; \(\lambda_{max}\)]. For the practical implementation, \(\lambda_{{{\text{max}}}}\) can be chosen close to 10.

At the end of these two optimisation steps, the best prediction with respect to the mean square error of their actual η for all patients having the same characteristics (i.e. same covariates values) and the same sampling times, is given by \({\Gamma }_{{\hat{\lambda }}} \left( {{ }\hat{\eta }\left( {\hat{\lambda }} \right)} \right)\). Obviously, this predictor is not exact: several values from the same population correspond to a given value of \(\hat{\eta }\left( {\hat{\lambda }} \right)\) obtained from the hospitalised patient (blue double arrow in Fig.1). The length of this range can be used as a prediction interval of \(P\left( {\eta^{*} |\Gamma_{{\hat{\lambda }}} ( \hat{\eta }^{*} \left( {\hat{\lambda }} \right)} \right)\) and so the uncertainty of the \(\eta^{*}\) estimation. A more traditional way of evaluating this uncertainty is to compute empirically a 95% prediction interval defined as the 2.5 and 97.5% percentiles of the \(\eta^{*}\)s corresponding to the \(\hat{\eta }\left( {\hat{\lambda }} \right)\) obtained from the patient of interest. The way to build these intervals (regression quantile methods) is not discussed in this article.

These ideas can be illustrated by the graph as shown in Fig. 1. To simplify, we concentrate on the component of \(\eta_{k}^{*}\) linked to the clearance of the drug. We denote \(\eta_{{{\text{Cl}}, k}}^{*}\) the kth simulated η for the clearance and we restrict \({\Gamma }_{{\uplambda }}\) as a function of clearance only.

Fig. 1
figure 1

The y-axis represents the \(\eta_{{{\text{Cl}}}}^{*}\) that have been used to simulate the concentrations; the x-axis contains the \(\hat{\eta }_{{{\text{Cl}}}}^{*} \left( {\hat{\lambda }} \right)\) obtained from the procedure described in section 2.1. The red line is the first bisector line. For a patient P, having a clearance individual random effect, denoted by \({\eta }^*_{{\text{Cl,P}}}\), far below or above zero or, equivalently, an individual pharmacokinetic parameter far from the typical value, there is a systematic error. The systematic error is represented on the plot by the distance of the green curve from the red curve. The green curve represents \(\Gamma_{{{\hat{\lambda }}}} \left( { \hat{\eta }_{{{\text{Cl}}}} \left( {\hat{\lambda }} \right)} \right)\). The blue band is the 95% prediction interval. Its width, for a given \(\hat{\eta }_{{\text{Cl,P}}} \left( {\hat{\lambda }} \right)\) (i.e., for a given patient) gives an evaluation of the imprecision of our method

The graph in Fig. 1 can be disturbing because intuitively, we would tend to represent the predicted values as a function of the simulated/true values. However, the aim is to minimise the difference between the true value of \(\eta_{Cl}\) and its prediction (i.e. between \(\eta_{{{\text{Cl}}}}^{*}\) and \({\Gamma }_{{\uplambda }} \left( {{ }\hat{\eta }_{{{\text{Cl}}}}^{*} \left( \lambda \right)} \right)\)). This presents itself as a classic nonlinear regression problem, in which we minimise the vertical distance between a point and its predictor.

A crude but global evaluation of the improvement that can be obtained using our prediction instead of the usual EBE can be computed by Eq. (7):

$${\text{Improv}} = 1 - \frac{{\mathop \sum \nolimits_{k = 1}^{K} \left[ {\eta_{{{\text{Cl}},k}}^{*} - \Gamma_{{\hat{\lambda }}} \left( { \hat{\eta }_{{{\text{Cl,}}k}}^{*} \left( {\hat{\lambda }} \right)} \right)} \right]^{2} }}{{\mathop \sum \nolimits_{k = 1}^{K} \left[ {\eta_{{{\text{Cl}},k}}^{*} - \hat{\eta }_{{{\text{Cl}},k}}^{*} \left( 1 \right) } \right]^{2} }}.$$
(7)

This non-negative quantity evaluates how much closer our prediction \({\Gamma }_{{{\hat{\lambda }}}} \left( {{ }\hat{\eta }_{{{\text{Cl}},k}}^{*} \left( {\hat{\lambda }} \right)} \right)\) is to the actual \(\eta_{{{\text{Cl,}}k}}^{*}\) than is the EBE (\(\lambda\)=1) given by \(\hat{\eta }_{{{\text{Cl}},k}}^{*} \left( 1 \right)\). When this improvement is close to 0, the EBE and our predictor are close. As a consequence, our predictor does not improve the estimation of the individual pharmacokinetic parameter. On the contrary, any positive improvement shows that our predictor is better than the EBE. This indicator is a global evaluation that does not give any information on any specific patient located in a specific area of the green curve in Fig. 1.

We also provide estimates of the shrinkage for the EBE and for the estimator that we propose and determine the prediction of the shrinkage based on the method proposed by Combes et al. [14].

Finally, from this prediction, we deduce a prediction for IPK parameters, \(\varphi\) as (Eq. 8):

$$\hat{\varphi } = h\left( {{\Gamma }_{{\hat{\lambda }}} \left( {{ }\hat{\eta }\left( {\hat{\lambda }} \right)} \right),\hat{\theta },A} \right).$$
(8)

Once lambda \(\left( {\hat{\lambda }} \right)\) and gamma \(({\Gamma }_{{\hat{\lambda }}}\)) are defined, they are used to improve predictions made for a given patient (Eq. 8). In a more detailed manner, when we have concentration(s) \((Y_{1} , \ldots ,Y_{n} )\) available for a patient P, simulations allow to compute \(\hat{\lambda }\) and \({\Gamma }_{{\hat{\lambda }}}\). Next, Eq. (4) is solved using the concentrations \((Y_{1} , \ldots ,Y_{n} )\) observed in the patient and with \(\hat{\lambda }\) computed at the previous step instead of \(\lambda .\)

From this computation, we obtain \(\hat{\eta }_{{{\text{Cl}}, P}} \left( {\hat{\lambda }} \right)\) and therefore \({\Gamma }_{{\hat{\lambda }}} \left( {\hat{\eta }_{{{\text{Cl}}, P}} \left( {\hat{\lambda }} \right)} \right)\), which is the de-shrunk estimate of its clearance individual random effect. By taking into account the imprecision observed during the simulations, it is also possible to define the prediction interval from which the individual pharmacokinetic parameter is estimated.

For example, we estimate a \({\Gamma }_{{\hat{\lambda }}} \left( {\hat{\eta }_{{{\text{Cl}}, P}} \left( {\hat{\lambda }} \right)} \right)\) for patient P equal to 0.2. Figure 1 shows that the true median value for the individual pharmacokinetic parameter is 0.18 and the prediction interval is between 0.05 and 0.29.

2.2 Simulations Study

Datasets were simulated using R version 3.5.2.

This methodology was applied to two different models from the literature. The performance evaluation of the proposed methodology was performed on 5000 subjects simulated by the models described in the corresponding articles.

Usually, it is necessary to perform K simulations for each subject; a subject being defined by its sampling times \(t_{j}\) and its covariate values A. Since we simulated patients with exactly the same sample times and covariate values, K = 5000 simulations could be used for all simulated patients.

For the rest of this work, we focus on clearance that controls exposure and use only one concentration to estimate individual clearance as per hospital practice.

For each drug tested, the sampling times were chosen in line with the model itself (i.e. sampling times used for the model construction) and according to a clinical practicability allowing a later clinical use of this methodology.

2.2.1 First Model

The first model used relates to the pharmacokinetics of iohexol in dogs [18]. This molecule allows establishment of the glomerular filtration rate (GFR). As it is an exogenous molecule eliminated mainly by glomerular filtration, its clearance represents GFR. The model was built from iohexol pharmacokinetic data in 49 dogs. Each kinetic profile includes five points (i.e. 5, 15, 60, 90, and 180 min).

The PPK model is a two-compartment model parameterised with clearances and volumes. This implies the \(f\) function of the first equation. The \(g\) function was chosen as \(g\left( {t,{{\varphi }},{\text{b}}} \right) = bf\left( {t,{{\varphi }}} \right)\).

Part of the clearance variability is explained by two covariates: the health status and the plasma creatinine concentration. Health status is a dichotomous covariate defined on the basis of the International Renal Interest Society (IRIS). The plasma creatinine concentration is a continuous covariate.

This results in the h function of Eq. (1) being such that

$$\begin{aligned} {\text{ln(}}\varphi ) & {\text{ = ln(}}h(\eta , \theta , A{))} \\ & {\text{ = ln (Cl, }}V_{{\text{c}}} {, }V_{{\text{p}}} {, }Q){ } \\ & = \left( {{\text{ln}}(\theta_{{{\text{Cl}}}} ) + \theta_{1} 1_{{\left[ {{\text{status}}} \right]}} + {\uptheta }_{2} {\text{creat}} + \eta_{{{\text{Cl}}}} ,{\text{ ln}}(\theta_{{V_{{\text{c}}} }} ) + \eta_{{V_{{\text{c}}} }} ,{\text{ln}}(\theta_{{V_{{\text{p}}} }} ) + \eta_{{V_{{\text{p}}} }} , {\text{ln}}\left( {\theta_{Q} } \right)} \right). \\ \end{aligned}$$

The indicator function \(1_{{\left[ {{\text{status}}} \right]}}\) = 1 for a diseased individual and 0 otherwise (Table 1).

Table 1 Population pharmacokinetic parameters of iohexol published by Baklouti et al. [18]

The variance–covariance matrix \({\Omega }\) of \(\eta = \left( {\eta_{{{\text{Cl}}}} , \eta_{{V_{{\text{c}}} }} ,\eta_{{V_{{\text{p}}} }} } \right)\) is assumed to be a diagonal matrix.

The dose chosen to perform the simulation is equal to 1600 mg. This corresponds to the average dose that was used to build the PPK model in the study by Baklouti et al. [18]. Similarly, we performed the simulations with sick dogs with a plasma creatinine level of 2.25 mg/dL that roughly corresponds to the average concentration reported in diseased animals [18].

We tested this methodology when a single sample is available at 180 min, corresponding to the best sampling time as identified in the study by Baklouti et al. [18].

2.2.2 Second Model

The second model relates the pharmacokinetics of isavuconazole [19]. This molecule is used in the treatment of invasive fungal infections. The model was built from 471 concentrations obtained from 79 patients.

The PPK model has two compartments. The error model for residual variability is a proportional error. Two covariates are included in the model, body mass index (BMI) and sex. BMI explains a part of peripheral distribution volume variability, and sex explains a part of clearance variability.

This results in the h function of Eq. (1) being such that

$$\begin{aligned} \ln (\varphi ) & = {\text{ }}\ln (h(\eta ,\theta ,A)){\text{ }} \\ & = {\text{ }}\ln {\text{ }}(Cl,{\text{ }}V_{c} ,V_{p} ,Q) \\ & = \left( {\ln \left( {\theta _{{Cl}} \times \left( {1 - 1_{{\left[ {sex} \right]}} } \right) + \theta _{1} \times 1_{{\left[ {sex} \right]}} } \right) + \eta _{{Cl}} ,{\text{ }}\ln (\theta _{{V_{c} }} ),} \right. \\ & \quad \left. { \ln (\theta _{{V_{p} }} + \theta _{2} \times \left( {BMI - 25.38} \right) + \eta _{{V_{p} }} ,\ln \left( {\theta _{Q} } \right)} \right). \\ \end{aligned}$$

The indicator function \(1_{{\left[ {{\text{sex}}} \right]}}\) = 1 for a man and 0 for a woman (Table 2).

Table 2 Population pharmacokinetics model of isavuconazole published by Wu et al. [19]

The dose in the simulation was 200 mg, which corresponds to the standard dose administered in hospital. The covariate value for the BMI was 25.4 kg/m2, corresponding to the average value found in the study [19]. The mean clearance selected was the clearance for men because the model was constructed with a majority of men.

The sampling time selected to perform simulations was 195 h because this is when steady state is reached and TDM can be proposed [20].

3 Results

The results for simulated and predicted random effects on clearances are represented in Fig. 2. Figure 2a and b show the results for iohexol, and the Fig. 2c and d are for isavuconazole. On all plots, the vertical axis corresponds to \(\eta_{{{\text{Cl}}}}^{*}\). The horizontal axis represents the EBEs (i.e.\(\hat{\eta }_{{{\text{Cl}}}}^{*} \left( 1 \right)\)) on the left plots (a and c) and \(\hat{\eta }_{{{\text{Cl}}}}^{*} \left( {\hat{\lambda }} \right)\) on the right plots (b and d). Therefore, the left plots correspond to the results before shrinkage correction, and the right plots show the results obtained after our corrections. For the two tested models, the \(\hat{\lambda }\) value is different from 1 (iohexol \(\hat{\lambda } = 0.017\); isavuconazole \(\hat{\lambda } = 0.059\)), so \(\hat{\eta }_{{{\text{Cl}}}}^{*} \left( {\hat{\lambda }} \right)\) are different from the \(\hat{\eta }_{{{\text{Cl}}}}^{*} \left( 1 \right)\) and therefore from the EBEs. The difference between the median curve (obtained using LOESS regression) in green and the first bisector line in red represents the shrinkage.

Fig. 2
figure 2

Graphical representation of \(\eta_{{{\text{Cl}}}}^{*}\) as a function of \(\hat{\eta }_{{{\text{Cl}}}}^{*} \left( 1 \right)\) (i.e. empirical Bayes estimate for clearance) (left) and of \(\eta_{{{\text{Cl}}}}^{*}\) as a function of \(\Gamma_{{\hat{\lambda }}} \left( { \hat{\eta }_{{{\text{Cl}}}}^{*} \left( {\hat{\lambda }} \right)} \right)\) (right). The red curves represent the first bisector, the green curve represents the regression line, and the blue curves represent the prediction interval. After correction, the point clouds are refocused on the first bisector

Figure 2a and c shows that estimations of the IPK parameters provided by EBEs are not good: they are biased (i.e. the two clouds of points are not centred on the first bisector) and imprecise (i.e. the two clouds are thick).

Figure 2a (iohexol) shows that EBEs overestimated the IPK parameters, particularly for the lowest IPK parameter values. In this case, elimination clearance is overestimated and an unjustified dose increase is applied. Conversely, for isavuconazole, elimination clearance is underestimated for the lowest values, leading to an unjustified dose reduction. Thus, for drugs with a narrow therapeutic range, this kind of mistake can be detrimental to the patient; this means a therapeutic failure or, conversely, increased toxicity.

Surprisingly, it can be seen that, for these two experimental designs (covariates, sampling times, PPK model), shrinkage does not affect the high IPK parameter values. However, these results could be radically different for another experimental design and evolve in an unpredictable way. As a consequence, for each experimental design (covariates, sampling times, PPK model), it is necessary to repeat the shrinkage correction.

The second important information observed in these plots is that the IPK parameter estimates are imprecise. Indeed, one can see that a large number of actual IPK parameters are consistent with a single estimated IPK parameter, i.e. the cloud of points is thick. If the role of \(\hat{\lambda }\) is to reduce the thickness of the cloud of points, it seems that this decrease is modest in these two examples. Indeed, there is an incompressible limit to imprecision, mainly governed by the number of available concentrations for the patient. Moreover, the imprecision varies according to the values of η: the further η is from the population average, the greater the imprecision.

Figure 3 shows the improvement obtained with our method (iohexol in Fig. 3a and isavuconazole in Fig. 3b), with respect to EBEs, for different values of departure from median population clearance (i.e. \(\eta_{{{\text{Cl}}}}^{*} ).\) The improvement is maximal for patients with extreme actual clearance (lowest and highest values). Conversely, our method does not improve the clearance estimation for patients with actual clearances close to the median population clearance; this is logical. Finally, one would expect this improvement to be the same for extreme clearance values (small and large). This is true for isavuconazole but surprisingly is not the case for iohexol. To date, we have no explanation for this phenomenon, and it most likely depends on the experimental design used. The overall improvement, as defined by Eq. (7), is 21.2% for iohexol and 15.5% for isavuconazole.

Fig. 3
figure 3

Graphical representation of the evolution of “Improv” of iohexol and isavuconazole. Improvement is much more important for values far from the average than for values close to the mean. The improvement of the estimates following the correction of the empirical Bayes estimate is therefore not always the same. So, this methodology does not provide a great correction for patients who have a pharmacokinetic profile close to the average. However, for pharmacokinetic profiles far from average pharmacokinetic profiles (i.e. patients for whom therapeutic drug monitoring is usually indicated), this methodology greatly improves results

Table 3 presents the results of shrinkage (for the EBE or the proposed estimator) and the prediction of shrinkage provided by the Combes et al. [14] method. The proposed estimator has a lower shrinkage than the EBE, whatever the application (i.e. the PPK model, the sampling times, etc.). Surprisingly, the method proposed by Combes et al. [14] does not provide a good prediction of EBE shrinkage, especially for isavuconazole.

Table 3 Shrinkage estimates calculated from empirical Bayes estimate and the proposed estimator.

4 Discussion

We proposed a method to improve the prediction of IPK parameters using the measured concentration for one patient. The EBE is the traditional predictor; however, this individual prediction suffers from an “attraction” toward the typical population value and moves the prediction away from the actual value of the IPK parameters. This “distance” is called shrinkage [7]. It is usually computed over both different individuals (i.e. with different IPK parameters) and different sampling times. This number gives information on the average departure from the actual value (i.e. actual clearance, volume of distribution, etc.), but it cannot be used as it stands to correct the prediction of the IPK parameters for a particular patient [21, 22].

In contrast, the method we propose allows a specific shrinkage correction for each patient by considering the sampling times, the PPK model, and the patient’s covariate values. Moreover, for each patient, the level of shrinkage correction depends on the proximity of the patient’s pharmacokinetic parameters to the population's pharmacokinetic parameters. Indeed, we have shown that the correction was at its maximum for extreme values. This information is particularly important as, in hospital practice, when TDM is requested, the pharmacokinetic parameters of the patients are quite extreme values (i.e. far from the mean population values). This is the case for example for patients in the intensive care unit and those with cystic fibrosis or obesity, among others [2325]. Thus, this approach allows us to expect a significant improvement in the estimates of the IPK parameters. In addition to TDM, a better estimate of EBEs could help to improve PPK models during their construction and validation steps.

As stated in Eq. (6), \(\lambda\) was chosen to minimise the shrinkage on clearance. Thus, nothing guarantees a shrinkage decrease of other individual pharmacokinetic parameters. This is a not problem because, for drugs with a linear kinetic, clearance drives exposure and then TDM. We estimated this pharmacokinetic parameter with only one concentration both to mimic hospital practice and to guarantee a high shrinkage to be removed. This choice does not exclude the application of the same kind of methodology for all the other pharmacokinetic parameters (i.e. to choose a \(\lambda\) and \(\Gamma\) function specific to the pharmacokinetic parameter of interest).

In this work, the method of Combes et al. [18] poorly predicted EBE shrinkage. We consider that this is mainly due to the assumed asymptotic framework. It is clear that, with only one observation per individual, the variance of \(P\left( {{\Psi }|Y} \right)\) can be quite different from the corresponding Bayesian Fisher information matrix obtained by expanding \({\text{ln }}P\left( {{\Psi }|Y} \right)\) at its mode. Actually, the quality of approximation of \(P\left( {{\Psi }|Y} \right)\) by a Gaussian distribution is difficult to anticipate.

One of the limits to our methodology is the need for a validated PPK model. Indeed, all these results have been obtained conditional on \(\hat{\theta }, \hat{b},\hat{\Omega }\). In other words, an important assumption behind the method is that these estimations are precise enough to be used as if they were the actual value of the parameters. This hopefully happens for estimates obtained for a validated model built with a large number of individuals. If not, a model misspecification could lead to biased predictions. In this case, the shrinkage correction could be negligible compared with the bias. As such, the impact of a model misspecification on the final individual parameter estimate should be investigated.

However, although validation of PPK models for TDM is beyond the scope of this paper, the implicit assumption is that the chosen model properly describes the kinetic profile of the hospitalised patient. From our experience, this is far from the case. An easy way to identify a poor PPK model is to look at the residual terms. The most often used residual error models are the proportional error model (i.e. \(g\left( {t,\varphi ,b} \right) = b_{1} f\left( {t,\varphi } \right)\)) and in some cases a combined error model (i.e. \(g\left( {t,\varphi ,b} \right) = b_{2} + b_{1} f\left( {t,\varphi } \right)\)). A large \(b_{1}\) (e.g. > 40%) indicates a poor PPK model and thus precludes any TDM based on IPK parameter estimations.

To conclude, this innovative methodology is promising since, to the best of our knowledge, no individual shrinkage correction has been proposed and it consistently provided better results than EBEs.