Model-Based Conditional Weighted Residuals Analysis for Structural Model Assessment

Ibrahim, Moustafa M. A.; Ueckert, Sebastian; Freiberga, Svetlana; Kjellsson, Maria C.; Karlsson, Mats O.

doi:10.1208/s12248-019-0305-2

Model-Based Conditional Weighted Residuals Analysis for Structural Model Assessment

Research Article
Open access
Published: 27 February 2019

Volume 21, article number 34, (2019)
Cite this article

Download PDF

You have full access to this open access article

The AAPS Journal Aims and scope Submit manuscript

Model-Based Conditional Weighted Residuals Analysis for Structural Model Assessment

Download PDF

Moustafa M. A. Ibrahim^1,2,
Sebastian Ueckert¹,
Svetlana Freiberga¹,
Maria C. Kjellsson¹ &
…
Mats O. Karlsson¹

2614 Accesses
Explore all metrics

Abstract

Nonlinear mixed effects models are widely used to describe longitudinal data to improve the efficiency of drug development process or increase the understanding of the studied disease. In such settings, the appropriateness of the modeling assumptions is critical in order to draw correct conclusions and must be carefully assessed for any substantial violations. Here, we propose a new method for structure model assessment, based on assessment of bias in conditional weighted residuals (CWRES). We illustrate this method by assessing prediction bias in two integrated models for glucose homeostasis, the integrated glucose-insulin (IGI) model, and the integrated minimal model (IMM). One dataset was simulated from each model then analyzed with the two models. CWRES outputted from each model fitting were modeled to capture systematic trends in CWRES as well as the magnitude of structural model misspecifications in terms of difference in objective function values (ΔOFV_Bias). The estimates of CWRES bias were used to calculate the corresponding bias in conditional predictions by the inversion of first-order conditional estimation method’s covariance equation. Time, glucose, and insulin concentration predictions were the investigated independent variables. The new method identified correctly the bias in glucose sub-model of the integrated minimal model (IMM), when this bias occurred, and calculated the absolute and proportional magnitude of the resulting bias. CWRES bias versus the independent variables agreed well with the true trends of misspecification. This method is fast easily automated diagnostic tool for model development/evaluation process, and it is already implemented as part of the Perl-speaks-NONMEM software.

Population Pharmacokinetics

Nonlinear mixed-effects models for pharmacokinetic data analysis: assessment of the random-effects distribution

Article 13 February 2017

Estimating Costs Associated with Disease Model States Using Generalized Linear Models: A Tutorial

Article Open access 10 November 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

INTRODUCTION

Nonlinear mixed effects (NLME) models are currently advocated to maximize the utilization of gained information throughout all the phases of drug development. These models are adopted for reducing sample size, calculating study power, confirming drug effects, selecting doses, and optimizing trial design as well as supporting final/interim analysis decisions (1). In such settings, the appropriateness of the modeling assumptions is critical in order to draw correct conclusions and the assumptions must be carefully assessed for any substantial violations. Usually, modeling assumptions are assessed from the available knowledge on physiological processes that are to be modeled. However, model misspecifications can occur when the incompatibility of a modeling assumption with the underlying system goes undetected/untested, even though the model appears to give an accurate description of the data (2).

Different numerical and visual techniques had been proposed as reliable model evaluation methods. Numerical diagnostics include assessment of parameters uncertainty, conditional weighted residuals (CWRES) (3), normalized prediction distribution errors (NPDE) (4), posterior predictive checks (PPC) (5), and numerical predictive checks (6). Visual diagnostics based on model predictions include scatterplots of observed versus predicted and residuals versus predicted, while simulation-based diagnostics include visual predictive checks (7) and the graphical versions of NPDE and PPC. The pros and cons of these techniques had been thoroughly discussed by Nguyen et al. (8) where it was clear that even though graphical tools can signal where the model fails to describe the data, none of them can quantify this model misspecification or the gain in goodness of fit upon correction.

Lately, a new diagnostic tool based on residual modeling has been proposed as an easy and fast automated tool for model development/evaluation process (9). Residual modeling showed the superiority of CWRES over other residuals, where CWRES modeling provided guidance for where a potential model misspecification occurred, similar to other visual diagnostics. In addition, it uniquely identified the nature and quantified the magnitude of this misspecifications in terms of objective function value (OFV). In this work, we present a new method based on CWRES modeling to assess structural assumptions as prediction bias in NLME models developed for continuous data, by back-extrapolating a CWRES-based bias using the first-order conditional estimation (FOCE) approximation. First, we introduce CWRES bias calculation, then we derive predication-bias correction based on the calculated CWRES bias. Afterwards, we illustrate the practical use of this method by assessing prediction bias in two integrated NLME models for glucose homeostasis, the integrated glucose-insulin (IGI) model and the integrated minimal model (IMM) (10,11). Both models consist of glucose and insulin sub-models with interconnecting control mechanisms, and were proposed to describe simultaneously the glucose-insulin regulation system following intravenous glucose tolerance test (IVGTT) in healthy subjects.

METHODS

Calculating CWRES Bias

CWRES data outputted from the NLME model execution was treated as the dependent variable (DV) and modeled first by a base model to estimate CWRES distribution mean and variance.

$$ {\overset{\rightharpoonup }{y}}_i={\Theta}_1+{\eta}_i+{\overset{\rightharpoonup }{\varepsilon}}_i $$

(1)

where $ {\overset{\rightharpoonup }{y}}_i $ is a vector of CWRES data from individual i, Θ₁ is the mean of CWRES, η_i is the random unexplained deviation of individual i from the typical value, with variance Ω, and $ {\overset{\rightharpoonup }{\varepsilon}}_i $ is the vector of residual unexplained variability of individual i, with variance Σ and it is assumed to be independent identically distributed. The expected values of Θ₁, Ω, and Σ are 0, 0, and 1, respectively, as CWRES are theoretically expected to follow a normal distribution with mean 0 and variance 1 for a correct model (3). This base model (Eq. 1) was then extended to estimate different means for N number of bins of the independent variable (IDV) at N − 1 cutoff points (X₁,…, X_N − 1) dictated by data density as follow:

$$ \mathrm{If}\ \left(\min \left(\mathrm{IDV}\right)<\mathrm{IDV}<{\mathrm{X}}_1\right)\kern2em {\overset{\rightharpoonup }{y}}_i={\Theta}_1+{\eta}_i+{\overset{\rightharpoonup }{\upvarepsilon}}_i $$

(2)

$$ \mathrm{If}\ \left({\mathrm{X}}_1<\mathrm{IDV}<{\mathrm{X}}_2\right)\kern5.5em {\overset{\rightharpoonup }{y}}_i={\Theta}_2+{\eta}_i+{\overset{\rightharpoonup }{\upvarepsilon}}_i $$

(3)

$$ \mathrm{If}\ \left({\mathrm{X}}_{N-1}<\mathrm{IDV}<\max \left(\mathrm{IDV}\right)\right)\kern2em {\overset{\rightharpoonup }{y}}_i={\Theta}_N+{\eta}_i+{\overset{\rightharpoonup }{\upvarepsilon}}_i $$

(4)

This captured systematic trends in CWRES as well as the magnitude of structural model misspecifications, measured by the difference in objective function values ΔOFV_Bias between base model objective function value OFV_Base and the extended model objective function value OFV_Extended.

$$ {\Delta \mathrm{OFV}}_{\mathrm{Bias}}={\mathrm{OFV}}_{\mathrm{Extended}}-{\mathrm{OFV}}_{\mathrm{Base}} $$

(5)

The estimates of the bin specific means (Θ₁,…, Θ_N) are CWRES bias vector (b) of length N. Another vector $ \overset{`}{b} $ is derived by extending b to have the same dimensions as $ {\overset{\rightharpoonup }{y}}_i $ by repeating each bin specific mean for all observations within this IDV bin. Afterwards,$ \overset{`}{b} $ is used to correct bias in conditional predictions by the inversion of FOCE covariance calculation as follow.

Prediction-Bias Correction

Let y_i be the vector of observation for subject i, E(Y_i) and COV(Y_i) denote the expectation and the covariance-variance of the conditional predictions Y_i calculated by FOCE under the NLME model with CWRES (r):

$$ E\left({Y}_i\right)=f\left(\overset{\rightharpoonup }{\theta },{\widehat{\eta}}_i\right)-{\left.\frac{df}{d{\overset{\rightharpoonup }{\eta}}_i}\right|}_{{\overset{\rightharpoonup }{\eta}}_i={\widehat{\eta}}_i}\cdotp {\widehat{\eta}}_i $$

(6)

$$ COV\left({Y}_i\right)={\left.\frac{df}{d{\overset{\rightharpoonup }{\eta}}_i}\right|}_{{\overset{\rightharpoonup }{\eta}}_i={\widehat{\eta}}_i}\cdotp \Omega \cdotp {\left.\frac{d\overset{`}{f}}{d{\overset{\rightharpoonup }{\eta}}_i}\right|}_{{\overset{\rightharpoonup }{\eta}}_i={\widehat{\eta}}_i}+\mathit{\operatorname{diag}}\left(\kern0.5em {\left.\frac{dh}{d{\overset{\rightharpoonup }{\varepsilon}}_i}\right|}_{{\overset{\rightharpoonup }{\varepsilon}}_i=0}\cdotp \varSigma \cdotp {\left.\frac{d\overset{`}{h}}{d{\overset{\rightharpoonup }{\varepsilon}}_i}\right|}_{{\overset{\rightharpoonup }{\varepsilon}}_i=0}\right) $$

(7)

$$ r\left({Y}_i,{y}_i\right)= COV{\left({Y}_i\right)}^{-\frac{1}{2}}\left({y}_i-E\left({Y}_i\right)\right) $$

(8)

where f denotes individual model predictions, in which $ \overset{\rightharpoonup }{\theta } $ is the vector of population fixed effects, $ {\overset{\rightharpoonup }{\eta}}_i $ is vector of random unexplained individual deviation from the population fixed effects. h is the unexplained residual variability model, $ {\overset{\rightharpoonup }{\varepsilon}}_i $ is the vector of residual errors, and $ {\widehat{\eta}}_i $ is the vector of empirical Bayes estimates. Both random effects, $ {\overset{\rightharpoonup }{\eta}}_i $ and$ {\overset{\rightharpoonup }{\varepsilon}}_i $, are assumed to follow normal distribution with mean 0 and covariance matrix Ω and Σ, respectively. For conditional predictions from the true model $ {Y}_i^{\ast } $:

$$ E\left(r\left({Y}_i^{\ast },{y}_i\right)\right)=0\kern0.75em COV\left(r\left({Y}_i^{\ast },{y}_i\right)\right)=1 $$

(9)

Let $ {Y}_i^{-} $ be conditional predictions from a misspecified model with biased CWRES $ \overset{`}{b} $ :

$$ E\left(r\left({Y}_i^{-},{y}_i\right)\right)=\overset{`}{b} $$

(10)

By defining the distance $ {\delta}_i={Y}_i^{-}-{Y}_i^{\ast } $, we get:

$$ E\left({Y}_i^{-}\right)=E\left({Y}_i^{\ast}\right)+{\delta}_i $$

(11)

$$ E\left(r\left(E\left({Y}_i^{\ast}\right)+{\delta}_i,{y}_i\right)\right)=\overset{`}{b} $$

(12)

$$ E\left( COV{\left({Y}_i^{\ast}\right)}^{-\frac{1}{2}}\left({y}_i-E\left({Y}_i^{\ast}\right)-{\delta}_i\right)\right)=\overset{`}{b} $$

(13)

$$ E\left({y}_i-E\left({Y}_i^{\ast}\right)-{\delta}_i\right)= COV{\left({Y}_i^{\ast}\right)}^{\frac{1}{2}}\bullet \overset{`}{b} $$

(14)

$$ E\left({y}_i\right)-E\left({Y}_i^{\ast}\right)-E\left({\delta}_i\right)= COV{\left({Y}_i^{\ast}\right)}^{\frac{1}{2}}\bullet \overset{`}{b} $$

(15)

$$ 0-{\delta}_i= COV{\left({Y}_i^{\ast}\right)}^{\frac{1}{2}}\bullet \overset{`}{b} $$

(16)

$$ {\delta}_i=- COV{\left({Y}_i^{-}\right)}^{\frac{1}{2}}\bullet \overset{`}{b} $$

(17)

Assuming $ COV\left({Y}_i^{-}\right)= COV\left({Y}_i^{\ast}\right) $, then conditional predictions from misspecified model $ {Y}_i^{-} $ can be corrected by δ_i. The last assumption implies that $ \overset{`}{b} $ explains all of the structural model misspecifications. The distance δ_i and the percentage change of conditional predictions ($ \%\frac{\delta_i}{Y_i} $) can further be binned by same N bins of the IDV and averaged over all subjects to get vectors δ and % δ, respectively, both of N length for graphical purposes.

NLME Models

We chose to demonstrate this method by assessing prediction bias in two integrated NLME models for glucose homeostasis, the IGI model and the IMM; simpler example can be found in the Supplementary materials. Both the IGI model and the IMM claimed an underlying physiologically plausible structure to explain glucose-insulin dynamic interaction while retaining parsimony. The IGI model, shown in Fig. I, was developed for both healthy subjects and patients with type 2 diabetes following labeled IVGTT, thus observations, i.e., dependent variables, included glucose, radiolabeled glucose (tracer), and insulin measurements. The glucose sub-model is a two-compartment model with a central compartment elimination that is divided into insulin-dependent clearance and insulin-independent clearance. The glucose sub-model has two effect compartments accounting for the control mechanisms of glucose on its own production and on second-phase insulin secretion, respectively. The insulin sub-model is a one compartment disposition model with one effect compartment for the effect of insulin on the regulation of glucose clearance. Upon glucose administration, insulin first-phase amount enters insulin first-phase compartment as a system response, then it is released into the insulin central compartment. The IGI model has been widely used in diabetes modeling with applications in exploring drug effects (12), disease progression (13), designing early clinical trials (14), and optimizing IVGTT design (15).

The IMM was developed in healthy subjects following unlabeled insulin-modified IVGTT, so its data was lacking the unique information provided by radiolabeled glucose. The model is divided into two sub-models, glucose and insulin, based on the two-compartment glucose minimal model (16) and insulin minimal model (17), respectively (Fig. II). The glucose sub-model is a two-compartment model with elimination from the central compartment. Transit compartments are used to describe glucose kinetics in the first minutes after glucose dosing. The rate of change of glucose amount in central compartment $ \dot{G_1}(t) $ is the difference between the rate of hepatic glucose production, the rate of glucose disappearance by liver uptake, the rate of glucose disappearance by peripheral tissue, and the distribution between central and peripheral compartments. Since unlabeled IVGTT data did not allow the explicit description of hepatic glucose production, hepatic glucose production and hepatic glucose uptake were lumped into a net hepatic glucose balance, leading to:

$$ \dot{G_1}={S}_G\cdotp {G}_b-\left({S}_G+X\left(\mathrm{t}\right)+{k}_{21}\right)\cdotp {G}_1\left(\mathrm{t}\right)+{k}_{12}\cdotp {G}_2\left(\mathrm{t}\right)\kern4em {G}_1(0)={G}_b $$

(18)

where G₁(t), G₂(t), and G_b are glucose amounts in central compartment, in peripheral compartment, and basal glucose amount, respectively; k₂₁ and k₁₂ are transfer rate parameters; S_G is glucose effectiveness, quantifying the ability of glucose to enhance its own rate of disappearance at basal insulin concentration and is the sum of two parameters: k₅ that describes hepatic glucose uptake as well as the inhibitory effect of glucose on hepatic glucose production, and k₁ that describes peripheral uptake as a function of glucose amount in central compartment; X(t) is the effect of insulin on glucose kinetics. The insulin sub-model consists of a two-compartment disposition model with elimination from the central compartment. A transit compartment was used to describe insulin first-phase secretion, while second-phase insulin secretion rate is derived proportional to glucose concentration. When insulin concentration in the central compartment is higher than its basal steady state concentration, it moves to a remote compartment, representing receptor pool for insulin binding to its target tissues, where it produces its effects to lower glucose concentration. The IMM was proposed to overcome the limitations of the traditional minimal models, while still deriving the important physiological indices: glucose effectiveness S_G and insulin sensitivity S_I, for clinical diagnosis with estimates that are compatible with the traditional minimal model approach.

Settings

One dataset was simulated from each of the IGI model and the IMM according to a standard IVGTT protocol: 0.33 g/kg bolus of glucose with blood sampling at 0, 2, 3, 4, 5, 6, 8, 10, 12, 15, 18, 20, 22, 24, 26, 28, 30, 35, 40, 45, 50, 55, 60, 70, 80, 100,120,140,160,180, 210, and 240 min. Each simulated data set was analyzed with the two models, and visual predictive checks were performed to investigate the goodness of each fit. CWRES outputted from each model fitting was separated based on the two DVs glucose and insulin, where after CWRES for each DV was modeled to calculate ΔOFV_Bias, b, and δ_i as shown in Fig. III. Time, glucose population predictions (PRED), and insulin PRED were the investigated IDV by separate estimations. To evaluate the performance of our method, we calculated the % known bias in conditional predictions of each DV to be the reference bias estimates in Eq. 19, where Y_{i, sim} is the simulated conditional predictions and Y_{i, est} is the estimated conditional predictions of this DV.

$$ \% known\ bias=100\bullet \frac{Y_{i, sim}-{Y}_{i, est}}{Y_{i, sim}} $$

(19)

Also, to avoid bias introduced by binning, a previously recommended random binning technique (18) was implemented by the following specifications, with number of bins being N and the minimum number of observations per bin being M:

Step 1:
Sort CWRES data by the selected IDV.
Step 2:
Generate N − 1 bin boundaries randomly, based on the IDV.
Step 3:
Group CWRES data based on generated bin boundaries.
Step 4:
Estimate b → δ_i.
Step 5:
Repeat steps (2–4) 500 times.

In our investigations, using time as IDV, N was set to 10, otherwise N was set to 5 and M was set to 25. Nonlinear mixed effects analysis, statistical and graphical assessment was performed in PSN (19), NONMEM version 7.3 (20) and R (21). Simulated conditional predictions Y_{i, sim} was outputted from NONMEM using $ETAS and $ESTIMATION with options MCETA = 1 FNLETA = 2.

RESULTS

When either of the two data sets was analyzed with the IGI model or data simulated by the IMM was analyzed by the IMM, ΔOFV_Bias was non-significant for both DVs (glucose and insulin) at $ {\mathcal{X}}_{0.05}^2 $(10 degree of freedom) when time was the IDV, and at $ {\mathcal{X}}_{0.05}^2 $(5) when glucose PRED or insulin PRED was the IDV. When data simulated by the IGI was analyzed with the IMM, ΔOFV_Bias was significant for glucose versus the three IDVs, but not for insulin as shown in Table I.

Table I Calculated ΔOFV_Bias for the Two Dependent Variables Glucose and Insulin for the IGI Model and the IMM Versus the Three Investigated Independent Variables: Time, Glucose PRED, and Insulin PRED. Significant Bias is Indicated in the Table with Italics

Full size table

Plots of estimated bias in conditional predictions calculated by CWRES modeling % δ versus the three investigated IDVs are shown in Figs. IV, V, and VII, where an over prediction bias in glucose sub-model is evident using both fixed or random binning. Visual predictive checks of the IMM when fitted to data simulated from the IGI model are shown in Fig. VII, only glucose sub-model showed an over prediction where the 95% confidence interval around the median of the simulations from the IMM is higher than the median of the data simulated from the IGI model, similar to where this over prediction was captured by the new method. In addition, the new method showed the bias against the interacting predictions of glucose and insulin, which is not routinely checked with visual predictive checks. The over prediction in the IMM glucose sub-model was found at early time points (< 150 min) with binning based on time, at high glucose concentrations (> 90 mg/dl) with binning based on glucose PRED, and at almost all bins with binning based on insulin PRED. The absolute and proportional magnitude of the over prediction versus the three IDVs showed a good agreement between the estimates calculated based on CWRES modeling (% δ) and the reference estimates (% known bias), as presented in Table II and shown in Figs. IV, V, and VI. Finally, these results correctly pointed out a model misspecification in glucose sub-model of the IMM, similar to previously reported results with another analysis methods (22,23).

Table II Estimates of b, % δ, and % Known Bias of the IMM Glucose Sub-model Versus the Three Investigated Independent Variables After Fitting the Model to Data Simulated from the IGI Model

Full size table

DISCUSSION

Nonlinear mixed effects modeling requires assumptions for handling different types of data and the different model components: structural, covariate, and stochastic models; since these assumptions are interconnected with each other such that a violation of one may have consequences for the apparent appropriateness of others, it becomes more challenging to correctly address such violation (8). One of the recently developed methods for model evaluation is CWRES post-processing. By parametric modeling of either the mean or the variance of CWRES distribution, it is possible to identify and quantify if a model misspecification is present and whether this model misspecification arises from the structural model or the stochastic model, in a fast and robust way (9). Based on CWRES modeling, we developed a new method to assess structural assumptions as prediction bias in NLME models developed for continuous data. The new method first calculated the bias in the mean of CWRES distribution, then the deviation between conditional predictions of a misspecified structural model, and expected true structural model, relying on the fact that CWRES under the true structure model is normally distributed with mean 0 and variance 1. We successfully applied the new method to two integrated complex models for glucose homeostasis, the IGI model, and the IMM. Both models claimed an underlying physiologically plausible structure, albeit different, to explain glucose-insulin dynamic interaction with the least possible number of estimated parameters, and so hypothetically both models are less prone to prediction bias. Our method correctly spotted the violation of the underlying structural model assumptions with the highest impact on the IMM performance, similar to % known bias and in agreement to previous investigations (22,23).

Both models use a two-compartment disposition model to describe glucose kinetics with elimination from central compartment. The elimination is divided into two pathways, defined differently in the two models. The IGI models assumes two pathways based on glucose transporters of the uptake tissue, either insulin sensitive transporters, e.g., GLUT4 or insulin insensitive transporters, e.g., GLUT2, while the IMM assumes two pathways based on the anatomy of uptake tissue, either peripheral or hepatic tissue, with each elimination further classified into insulin dependent or not. This difference in elimination as well as the absence of tracer data led to the IMM assumptions regarding net hepatic glucose balance and the hybrid nature of glucose and insulin effect parameters on hepatic tissue. Net hepatic balance is the difference between hepatic glucose production and hepatic glucose uptake, taking positive values when production is dominating and negative values if uptake is dominating and is mathematically derived as the difference between an extrapolated value of net hepatic balance at zero glucose concentrations and the hybrid effects of glucose and insulin to inhibit hepatic glucose production as well as enhancing hepatic glucose uptake (24). Hence, glucose effect parameter S_G is simultaneously measuring both mass flow and control mechanism through k₅, its estimate is unrealistically large. This overestimation of glucose effect on glucose disappearance, constrained insulin effects X(t) on glucose disappearance to take low estimates, creating undesired compensation bias in the rest of glucose sub-model parameters (22). As insulin contributes to glucose elimination only at concentrations higher than basal insulin concentrations, and the hybrid parameter of glucose effects contributes to steady state conditions of glucose, both production and clearance, the impact of these biased parameters cancel out at steady state concentrations of glucose (~ 90 mg/dL). The impact of these biased parameters is magnified on system perturbation where insulin reaches effective concentrations in the remote compartment, but insulin dynamic effects in the model are constrained to underestimate the true consequences of these insulin effective concentrations on glucose disappearance curve. The impact of these biased parameters decreases again as insulin concentrations in remote compartment decrease toward insulin basal concentrations. This behavior explains the captured bias when simulating with the IGI model and estimating with the IMM model, as shown in Figs. 4, 5, and 6. Bias peaks immediately after first-phase insulin secretion then fades away with declining insulin concentrations. This happens at time points before 150 min, when glucose concentrations were higher than 90 mg/dL and when insulin concentrations were above basal insulin. Noting that insulin concentrations peak before glucose concentration, as lower bolus of similar rate of absorptions and central volumes of distributions, this may be the reason behind glucose concentrations higher than 280 mg/dL showed less bias than glucose concentrations in the range 200–280 mg/dL. Also, when simulating with the IMM and estimating with both models, ΔOFV_Bias was lower for the IGI model estimations, as presented in Table I, concluding that the IGI model structural assumptions regarding glucose kinetics were less prone to significant misspecifications. Finally, the magnitude of the IMM glucose sub-model bias peaked to 20% of conditional prediction of glucose, which is a considerably high percentage for such integrated system, and in light of a previous study (14), the utilization of such model in drug development to explore drug effects enhancing glucose disappearance will result in the misleading conclusions of overestimating drug effects on insulin-independent glucose clearance and underestimating drug effects on insulin-dependent glucose clearance.

Regarding insulin kinetics, the IGI model and the IMM assumed different disposition models, none showed significant ΔOFV_Bias, and both models behaved ideally in a sense that when simulating with the IGI model and estimating with both models, ΔOFV_Bias was lower for the IGI model estimations, likewise when simulating with the IMM and estimating with both models, ΔOFV_Bias was lower for IMM estimations, as estimating and simulating with same model in absence of a high impact misspecification should always be almost bias free, unless the used estimation method is inappropriate. We also added a simple PK example as Supplementary material to explain and show the step-by-step implementation of our method in R.

Results from this new method should be interpreted within the context of two main factors and their impact on the purpose of the modeling exercise: the significance of ΔOFV_Bias and the magnitude of the detected bias % δ. For instance, if the purpose of the model was to physiologically describe an underlining system or derive physiological indices for clinical diagnosis as the IMM, then a high % δ in the dynamic relation between glucose and insulin must be addressed even if not accompanied with a significant ΔOFV_Bias. Our new method is generalizable to all NLME models developed for continuous data and is independent of the used estimation method or analysis software if CWRES is available and calculated in the same way. This method inherits the unique merits of CWRES modeling, as being fast, robust, and not suffering from local minima problems. Noting that the method depends on the way of IDV binning. How and where to set the binning is subjective and up to modelers to choose; here we used data density, which was not supporting 10 bins for glucose PRED or insulin PRED as the IDV. Though being time consuming and computing intensive, random binning technique allowed horizontal exploration of additional bins that probably would not be subjectively selected, giving more insight of the present trends in CWRES distribution, and it provided vertical exploration for bins with higher probability of being selected, similar to confidence intervals. How to handle a detected bias is model and purpose dependent with no general recommendations; however, by using different IDVs in the visualization of the bias, clues on which part of the model that is misspecified can be revealed. When model predictions are too close to zero, it will not be possible to calculate %δ, and δ should be used instead.

Unlike residual post-processing (9), which can be applied to other residuals as NPDE and CWRESI, our method was derived only for CWRES as the last outperformed other residuals in residual error model identification. Different derivations will be needed for prediction-bias correction with other residuals and that was not explored in our work.

In conclusion, a new fast and easily automated diagnostic method for structural model assessment was successfully developed, evaluated, and applied to two integrated complex semi-mechanistic models. The new method can identify structural misspecification, wherever this misspecification occurs, and quantify its magnitude and impact on goodness of fit. This method is already implemented in PsN as part of qa tool (available from version 4.8.1) for model development/evaluation process.

References

Lalonde RL, Kowalski KG, Hutmacher MM, Ewy W, Nichols DJ, Milligan PA, et al. Model-based drug development. Clin Pharmacol Ther. 2007;82(1):21–32. https://doi.org/10.1038/sj.clpt.6100235.
Article CAS PubMed Google Scholar
Karlsson MO, Jonsson EN, Wiltse CG, Wade JR. Assumption testing in population pharmacokinetic models: illustrated with an analysis of moxonidine data from congestive heart failure patients. J Pharmacokinet Pharmacodyn. 1998;26(2):207–46.
Article CAS Google Scholar
Hooker AC, Staatz CE, Karlsson MO. Conditional weighted residuals (CWRES): a model diagnostic for the FOCE method. Pharm Res. 2007;24(12):2187–97.
Article CAS Google Scholar
Comets E, Brendel K, Mentré F. Computing normalised prediction distribution errors to evaluate nonlinear mixed-effect models: the npde add-on package for R. Comput Methods Prog Biomed. 2008;90(2):154–66.
Article Google Scholar
Yano Y, Beal SL, Sheiner LB. Evaluating pharmacokinetic/pharmacodynamic models using the posterior predictive check. J Pharmacokinet Pharmacodyn. 2001;28:171–92.
Article CAS Google Scholar
Wilkins JJ, Karlsson MO & Jonsson EN. Patterns and power for the visual predictive check, 2006; PAGE 15, Abstract 1029.www.page-meeting.org/?abstract=1029.
Holford N. VPC: the visual predictive check superiority to standard diagnostic (Rorschach) plots. 2005; PAGE 14, Abstract 738. www.page-meeting.org/?abstract5738.
Nguyen TH, Mouksassi M, Holford N, Al-Huniti N, Freedman I, Hooker AC, et al. Model evaluation of continuous data pharmacometric models: metrics and graphics. CPT Pharmacometrics Syst Pharmacol. 2017;6(2):87–109. https://doi.org/10.1002/psp4.12161.
Article CAS PubMed PubMed Central Google Scholar
Ibrahim MMA, Nordgren R, Kjellsson MC, Karlsson MO. Model-based residual post-processing for residual model identification. AAPS J. 2018;20(5):81. https://doi.org/10.1208/s12248-018-0240-7.
Article CAS PubMed Google Scholar
Silber HE, Jauslin PM, Frey N, Gieschke R, Simonsson US, Karlsson MO. An integrated model for glucose and insulin regulation in healthy volunteers and type 2 diabetic patients following intravenous glucose provocations. J Clin Pharmacol. 2007;47(9):1159–71. https://doi.org/10.1177/0091270007304457.
Article CAS PubMed Google Scholar
Largajolli A, Bertoldo A, Cobelli C & Denti P. An integrated glucose-insulin minimal model for IVGTT. 2013; PAGE 22, Abstract 2762. www.page-meeting.org/?abstract=2762.
Jauslin PM, Karlsson MO, Frey N. Identification of the mechanism of action of a glucokinase activator from oral glucose tolerance test data in type 2 diabetic patients based on an integrated glucose-insulin model. J Clin Pharmacol. 2012;52(12):1861–71. https://doi.org/10.1177/0091270011422231.
Article CAS PubMed Google Scholar
Ghadzi SM. (2017) Pharmacometrics modeling in type 2 diabetes mellitus: implications on study design and diabetes disease progression. (Doctoral dissertation). Uppsala: Acta Universitatis Upsaliensis.
Ibrahim MMA, Ghadzi SMS, Kjellsson MC, Karlsson MO. Study design selection in early clinical anti-hyperglycemic drug development: a simulation study of glucose tolerance tests. CPT Pharmacometrics Syst Pharmacol. 2018;7:432–41. https://doi.org/10.1002/psp4.12302.
Article CAS PubMed PubMed Central Google Scholar
Silber HE, Nyberg J, Hooker AC, Karlsson MO. Optimization of the intravenous glucose tolerance test in T2DM patients using optimal experimental design. J Pharmacokinet Pharmacodyn. 2009;36(3):281–95. https://doi.org/10.1007/s10928-009-9123-y.
Article CAS PubMed Google Scholar
Cobelli C, Caumo A, Omenetto M. Minimal model Sg overestimation and Si underestimation: improved accuracy by a Bayesian two-compartment model. Am J Phys. 1999;277:E481–8.
CAS Google Scholar
Toffolo G, Campioni M, Basu R, Rizza RA, Cobelli C. A minimal model of insulin secretion and kinetics to assess hepatic insulin extraction. Am J Physiol Endocrinol Metab. 2005;290(1):E169–76. https://doi.org/10.1152/ajpendo.00473.2004.
Article CAS PubMed Google Scholar
Pavan Kumar VV, Duffull SB. Evaluation of graphical diagnostics for assessing goodness of fit of logistic regression models. J Pharmacokinet Pharmacodyn. 2011;38(2):205–22.
Article Google Scholar
Lindbom L, Pihlgren P, Jonsson EN. PsN-toolkit—a collection of computer intensive statistical methods for non-linear mixed effect modeling using NONMEM. Comput Methods Prog Biomed. 2005;79(3):241–57. https://doi.org/10.1016/j.cmpb.2005.04.005.
Article Google Scholar
Beal S, Sheiner LB, Boeckmann A & Bauer RJ. NONMEM user’s guides. (1989–2009), Icon Development Solutions, Ellicott City, MD, USA; 2009.
Team RC. R: a language and environment for statistical computing. Vienna, Austria: 2014. Available from: http://www.R-project.org.
Cobelli C, Pacini G, Toffolo G, Saccà L. Estimation of insulin sensitivity and glucose clearance from minimal model: new insights from labeled IVGTT. Am J Phys. 1986;250(5):E591–8.
CAS Google Scholar
Ibrahim MMA, Largajolli A, Kjellsson MC & Karlsson MO. (2016). Translation between two models; application with integrated glucose homeostasis models. WCOP 2 (2016), Abstr 249. http://goo.gl/EWNWqK.
Bergman RN, Ider YZ, Bowden CR, Cobelli C. Quantitative estimation of insulin sensitivity. Am J Phys. 1979;236(6):E667–77.
CAS Google Scholar

Download references

Author information

Authors and Affiliations

Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
Moustafa M. A. Ibrahim, Sebastian Ueckert, Svetlana Freiberga, Maria C. Kjellsson & Mats O. Karlsson
Department of Pharmacy Practice, Helwan University, Cairo, Egypt
Moustafa M. A. Ibrahim

Authors

Moustafa M. A. Ibrahim
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Ueckert
View author publications
You can also search for this author in PubMed Google Scholar
Svetlana Freiberga
View author publications
You can also search for this author in PubMed Google Scholar
Maria C. Kjellsson
View author publications
You can also search for this author in PubMed Google Scholar
Mats O. Karlsson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.M.A.I., S.U., S.F., M.C.K., and M.O.K. wrote the manuscript. M.M.A.I., S.U., S.F., M.C.K., and M.O.K. designed the research. M.M.A.I. performed the research. M.M.A.I., S.U., S.F., M.C.K., and M.O.K. analyzed the data.

Corresponding author

Correspondence to Mats O. Karlsson.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic Supplementary Material

ESM 1

(DOCX 4.55 mb)

ESM 2

(ZIP 71.3 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Ibrahim, M.M.A., Ueckert, S., Freiberga, S. et al. Model-Based Conditional Weighted Residuals Analysis for Structural Model Assessment. AAPS J 21, 34 (2019). https://doi.org/10.1208/s12248-019-0305-2

Download citation

Received: 07 November 2018
Accepted: 30 January 2019
Published: 27 February 2019
DOI: https://doi.org/10.1208/s12248-019-0305-2

KEY WORDS

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Model-Based Conditional Weighted Residuals Analysis for Structural Model Assessment

Abstract