Introduction

Nonlinear mixed effects (NLME) modeling is increasingly used to describe longitudinal data from clinical experiments to increase our understanding of the studied underlying pathophysiological system (1). NLME models allow the pooling of data from different subjects while simultaneously quantifying multiple levels of variability. This ability to identify different sources of variability is particularly critical to many clinical applications and has motivated the implementation of previously developed milestone models of biological systems into the NLME context. For some biological systems, there exist several models with somewhat different features. This may have arisen as the underlying data may have been different allowing a more or less detailed description of the system, the developers of the models may have had different insights or the purpose of the models may have been different. The resulting diversity provides detailed insights into the studied biological system from different perspectives, with learning opportunities from the comparison across models. Here, we present a comparison between two models in the field of diabetes.

In diabetes, Ajmera et al. (2) presented a detailed review of the modeling efforts undertaken to describe glucose homeostasis, diabetes, and related complications. Among the numerous models developed to describe glucose-insulin regulation following system perturbation, the minimal models are regarded as the first breakthrough in diabetes modeling. (3,4,5,6,7) They are not data-driven, rather parsimoniously formulated based on physiology. Glucose minimal model provides two important clinical indices that are able to describe the system in different pathophysiological states, glucose effectiveness index (SG) that quantifies the ability of glucose to enhance its own rate of disappearance at basal insulin concentration and insulin sensitivity index (SI) that quantifies the influence of insulin at steady state to change glucose’s own effect on glucose concentration. To overcome several limitations, glucose minimal model went through many developments driven by its validation versus clinical studies and was adapted to NLME modeling (8). However, these models are conditional models that describe either glucose or insulin and use the other variable as a known input. This feature limited their simulation performance and utilization for clinical drug development. Thus, integrated models that can describe simultaneously glucose and insulin kinetics following system perturbation were in need. Though with different structure model assumptions, the integrated glucose insulin (IGI) model (9) then the integrated minimal model (IMM) (10) have been developed by NLME modeling to fill this gap and describe glucose-insulin regulation system following, for example, intravenous glucose tolerance test (IVGTT).

A major step in the evaluation of NLME models was when the properties of multiple data sets simulated from the model with the original trial design was compared to the original, real data using standard metrics in posterior predictive checks (PPCs) (11,12,13) or graphically in visual predictive checks (VPCs) (14,15,16). The common principle in these evaluations are that any systematic discrepancy between the observed data metric and the distribution given by the multiple simulated data sets would indicate a model deficiency. However, the PPC typically focuses on one or two metrics of clinical interest and the VPC evaluate the match in distribution of data at different times following the start of study but does not provide an evaluation of the relative contribution of the different components creating variability between observations. Also, neither technique is well-developed for multivariable models, like the glucose-insulin models.

In this paper, we propose a new evaluation form of simulated data and that is the analysis of both real and simulated data using a structurally different but similar NLME model and hence we called it “NLMEPC” in analogy with VPC and PPC. As NLME models assess both the expected time-courses and inter- and intra-individual variability as well as interdependence between dependent variables, it will pose a challenging test for the simulated data. We demonstrate NLMEPC using two similar NLME models: the IGI model and the IMM. Both models had been proposed to describe the same biological system with different structural assumptions, and this would allow performing such evaluation crosswise, i.e. the IGI model can be used to assess the IMM and vice versa. Additionally, we present an approach to map parameters carrying similar information between NLME models. We showcase this approach by trying to map the clinically used parameters available in the IMM with their corresponding parameters in the IGI model.

Methods

NLME Models

The IGI model shown in Fig. 1, was developed for both healthy subjects and patients with type 2 diabetes mellitus (T2DM) following labeled IVGTT, using observations of glucose, labeled glucose (tracer) and insulin. Glucose sub-model is a two-compartment model, GC (mg) and GP (mg) are glucose amounts in central and peripheral compartments with their volumes of distributions VG (L) and VP (L) respectively, and inter-compartmental clearance Q (L.min−1). Elimination from the central compartment is divided into insulin-dependent clearance CLGI (L.min−1/mU.L−1) and insulin-independent clearance CLG (L.min−1). Glucose sub-model has two effect compartments, GE1 (mg/dL) and GE2 (mg/dL) are glucose concentrations in these effect compartments accounting for the control mechanisms of glucose on its own production and on the 2nd phase insulin secretion respectively, kGE1 (min−1) and kGE2 (min−1) are their respective rate constants while GPRG and IPRG are power terms mediating these control mechanisms respectively. The labeled glucose sub-model is assumed to have the same characteristics as the glucose sub-model but without hepatic glucose production. Insulin sub-model is a one compartment disposition model, with the volume of distribution VI (L) and the clearance CLI (L.min−1). Insulin sub-model has one effect compartment for the delayed effect of insulin on the regulation of CLGI with kIE (min−1) as the rate constant. Upon glucose administration, insulin 1st phase amount IFST (mU) enters insulin 1st phase compartment as a system’s response, then it is released into insulin central compartment through the rate constant kIS (min−1). GSS (mg.L−1) and ISS (mU.L−1) are the basal concentrations of glucose and insulin respectively. The IGI model was the first proposed integrated semi-mechanistic model to describe both glucose and insulin data simultaneously. One of the appealing features of the IGI model is its ability to distinguish hepatic glucose production from glucose disappearance, facilitated by access to labelled glucose measurements during its development. The model had been successfully applied for exploring drug effects (17), disease progression (18), designing early clinical trials (19) and optimizing IVGTT design (20).

Fig. 1
figure 1

Schematic presentation of the integrated glucose insulin model during IVGTT in healthy subjects. GC and GP , central and peripheral compartments of glucose; GE1 and GE2, effect compartments of glucose on endogenous glucose production and 2nd phase insulin secretion respectively; I, central compartment of insulin; IE, effect compartment of insulin on glucose; IFPS, insulin 1st phase compartment; kGE1, kGE2, kIE and kIS, first order rate parameters; CLG, CLGI and CLI, insulin-independent glucose clearance, insulin-dependent glucose clearance and insulin clearance respectively.

The IMM was recently proposed to overcome the limitations of the traditional minimal models, while still deriving the important physiological indices for clinical diagnosis, SG and SI. The model was developed in healthy subjects following an insulin modified IVGTT, but its data was lacking the unique information provided by labeled glucose. The model is divided into two sub-models, glucose and insulin, based on the two-compartment glucose minimal model (5) and insulin minimal model (7) respectively, as shown in Fig. 2. Glucose sub-model is a two-compartment model, and G1 (mg) and G2 (mg) are glucose amounts in the central and the peripheral compartments with their volumes of distributions VOL (dL) and VOLI (dL) respectively. Elimination is from the central compartment only. Transit compartments are used to cater for glucose kinetics in the first minutes after glucose dosing. The rate of change of glucose amount in the central compartment is the difference between the rate of hepatic glucose production Ra (mg.min−1), the rate of glucose disappearance by liver uptake Rdl (mg.min−1), the rate of glucose disappearance by peripheral tissue Rdp (mg.min−1), and the distribution between the central and the peripheral compartments. Since unlabeled IVGTT data did not allow the explicit description of hepatic glucose production, both hepatic glucose production and glucose liver uptake were lumped together as the net hepatic glucose balance (NHGB). Insulin sub-model consists also of a two-compartment model, VI (L) and VI2 (L) are the volumes of distributions of the central and the peripheral compartments respectively. Elimination from the central compartment is described by the clearance enneCL (L.min−1) through the rate constant kI (min−1). Transit compartments with the mean transit time MTT3 (min) are used to account for the delay in insulin 1st phase amount x0 (pmol.L−1). While 2nd phase insulin secretion rate Y (pmol. Min−1) is proportional to glucose concentration. ALPH (min−1) is a time constant parameter that controls the decrease in Y(t) toward a steady state value, BETA (pmol.min−1 .L−1/mg.dL−1) is the feedback control of glucose concentration in glucose central compartment above basal glucose concentration H (mg.dl−1) on Y. When insulin concentrations in the central compartment are higher than its basal concentrations QUINS (pmol.L−1), it moves to a remote compartment representing the receptor pool for insulin binding to peripheral tissues, where it produces its effect by increasing glucose disappearance. QE1 (L.min−1) and QE2 (L.min−1) are the inter-compartmental clearances describing the distribution of glucose and insulin respectively. k1 (min−1) and k4 (L. pmol −1.min−1) are rate parameters that describe peripheral uptake as a function of glucose amount in the central compartment and insulin concentration in the remote compartment respectively. k2 (min−1) and k3 (min−1) are rate parameters for insulin effect, k5 (min−1) is a rate parameter that describes hepatic glucose uptake as well as the inhibitory effect of glucose on hepatic glucose production, k6 (L. pmol −1.min−1) is a rate parameter that describes the effect of insulin concentration in the remote compartment on glucose to enhance hepatic glucose uptake, as well as the inhibitory effect of insulin on hepatic glucose production. For a priori identifiability of model parameters, the model was reparametrized as:

$$ p1={\mathrm{k}}_1+{\mathrm{k}}_5 $$
(1)
$$ p2={\mathrm{k}}_3 $$
(2)
$$ p3={\mathrm{k}}_2\kern0.3em \bullet \kern0.3em \left({\mathrm{k}}_4+{\mathrm{k}}_6\right) $$
(3)
$$ \mathrm{SI}=\frac{p3}{p2} $$
(4)
$$ {\mathrm{S}}_{\mathrm{G}}=p1\bullet \mathrm{VOL} $$
(5)
$$ {\mathrm{S}}_{\mathrm{I}}=\mathrm{SI}\bullet \mathrm{VOL} $$
(6)
Fig. 2
figure 2

Schematic presentation of the integrated minimal model during IVGTT in healthy subjects. G1 and G2, central and peripheral compartments of glucose; I1 and I2, central and peripheral compartments of insulin; insulin concentrations in the remote compartment; Rdp, rate of glucose disappearance by peripheral tissue uptake; x0, 1st phase insulin concentrations; Y, 2nd phase insulin secretion; NHGB, net hepatic glucose balance; k1 and k5, glucose model parameters; k2, k3, k4 and k6 parameters of insulin action; kI, insulin elimination rate constant.

While SG is an estimated model parameter, SI need to be calculated separately after model estimation from SI and VOL. This integrated model provides estimates of SG and SI that are compatible with the traditional minimal model approaches while allowing the simultaneous characterization of the glucose-insulin regulation system (10). The difference between SG and SI derived by the IMM and those derived by the original minimal model (5) is the use of insulin predictions to drive parameters’ estimates in the IMM, while the original minimal model uses insulin observations. This difference will be minimal when the IMM describes insulin observations adequately. More details on the IMM system of differential equations can be found in the Appendix where we maintained the original minimal models’ notations.

Data

Real data was used from two previously published studies, the first study was a stable isotopically labeled IVGTT in 14 healthy volunteers, where an intravenous bolus dose of glucose, enriched with labeled glucose, was administered, and frequent blood samples were drawn for 240 min following the glucose dose (21). The second study was an insulin-modified stable isotopically labeled IVGTT in 10 healthy volunteers. An intravenous bolus dose of glucose enriched with labeled glucose was administered, and after 20 min, a 5-min insulin infusion was given. Frequent blood samples were drawn for 240 min following the glucose dose (22). In both studies, plasma samples were analyzed to determine the concentration of total glucose, insulin, and labeled glucose. We did not use the labeled glucose data since the IMM was developed by unlabeled IVGTT data. The two models were fitted to the real data to obtain the point estimates that were used later for the simulations and as the initial guess for estimations. Visual predictive checks (VPCs) were performed to investigate the adequacy of each model initial fit.

NLMEPC Procedures

Three hundred data sets were created from three different data sources, one hundred data sets were sampled from the real data by bootstrap, one hundred data sets were simulated from the point estimates of the IMM using the same study design as the real data, and lastly one hundred data sets were simulated from the point estimates of the IGI model using also the same study design as the real data. Then the two models were fitted to each of the created three hundred datasets as shown in Fig. 3. This resulted in three hundred estimates for each model parameter, these are rich enough to investigate each of the parameters distribution properties and provide additional insights to where the model failed through understanding the differences in parameters distributions across the three different data sources, as well as investigating the information content of each model’s simulated data compared to real-life-like data (bootstrap datasets).

Fig. 3
figure 3

A descriptive presentation of information translatability investigation set up. The two models were first fitted to the real data. Afterward, one hundred data sets were sampled from the real data by bootstrap, one hundred datasets were simulated from the IMM, and lastly one hundred datasets were simulated the IGI model. Then the two model were fitted to each of the created three hundred datasets.

Mapping the Clinical Parameters

For mapping glucose effectiveness SG, insulin sensitivity SI and insulin action p2 derived by the IMM with their expected corresponding parameters in the IGI model (CLG, CLGI and kIE), a single large dataset (1200 individuals) was simulated from the IMM. The simulated individual estimates of all IMM parameters were outputted in this large dataset. This was followed by a single estimation step by the IGI model, and the individual estimates of CLG, CLGI, and kIE were outputted as well. Then the simulated individual estimates of all IMM parameters from the simulation step were treated as true covariates and added on the individual parameter estimates of the CLG, CLGI, and kIE from the estimation step (treated as response variables) using generalized additive modeling GAM approach (23) as shown in Fig. 4. GAM is an extension of generalized linear models. Generalized linear models relates the expected value of the response variable (μ) to a linear function of the covariates (Xs) multiplied by their respective coefficients (β):

$$ \mu ={\beta}_0+{\beta}_1\bullet {X}_1+{\beta}_2\bullet {X}_2+{\beta}_3\bullet {X}_3 $$
Fig. 4
figure 4

A descriptive representation of the setup for mapping physiological indices derived from the IMM to their corresponding parameters in the IGI model. A large data set (1200 Individuals) was simulated from the IMM, then estimated by the IGI model. Thereafter the simulated individual estimates of all IMM parameters were treated as true covariates and added on the estimated individual estimates of the CLG, CLGI, and KIE using GAM approach. CLG, insulin-independent glucose clearance; CLGI, insulin-dependent glucose clearance; KIE, insulin effect rate constant.

The generalization here is the ability to drive such relations for response variables that are not normally distributed, through transforming these response variables by link function g(.), e.g. natural logarithm can be used as a link function for a response variable of a Poisson distribution with mean and variance μ:

$$ {\displaystyle \begin{array}{c}g\left(\mu \right)={\beta}_0+{\beta}_1\bullet {X}_1+{\beta}_2\bullet {X}_2+{\beta}_3\bullet {X}_3\\ {}\mathit{\ln}\left(\mu \right)={\beta}_0+{\beta}_1\bullet {X}_1+{\beta}_2\bullet {X}_2+{\beta}_3\bullet {X}_3\end{array}} $$

An extra layer of generalization is then obtained by GAM that allows the testing of nonlinear functions f(.) of the covariates to establish their relationship with the response variable:

$$ \mu ={\beta}_0+f\left({X}_1\right)+f\left({X}_2\right)+f\left({X}_3\right) $$

Thus, generalized additive models are more flexible with better fits to the data than generalized linear models. For each response variable (CLG, CLGI, and kIE), we used stepwise GAM for selection among the different parameters of the IMM using Akaike information criterion (AIC). This means that for each response variable, all the IMM parameters (IMMparameter) were fitted separately in the GAM (one fit for each IMMparameter) and AIC was calculated for each fit. The IMMparameter that was accompanied by the highest improvement in fit was added first in the GAM, then the rest of parameters were tested again on the new generalized additive model (that included the first IMMparameter) to identify the IMMparameter with the highest improvement of fit in the new generalized additive model that would be the second IMMparameter added by the GAM and so on. Afterwards, the first IMMparameter to be added by the GAM, was incorporated as a covariate on the population parameter of CLG, CLGI, and kIE in a second re-estimation step by the IGI model for the same simulated large dataset as shown in (Eq. 7) using CLG as an example, where \( {\uptheta}_{{\mathrm{CL}}_{\mathrm{G}}} \) is the population parameter for CLG, θEFFECT is a measure of change in CLG for each unit change in the added IMMparameter, and \( {\eta}_{{\mathrm{CL}}_{\mathrm{G}}} \) is the unexplained individual deviation from the population parameter \( {\uptheta}_{{\mathrm{CL}}_{\mathrm{G}}} \). This explored how much of the inter-individual variability in each of these parameters of the IGI model was explained by incorporating these covariates. Nonlinear mixed effects analysis, statistical and graphical assessment was performed in PSN, (24) NONMEM version 7.3 (25) and R (26).

$$ {\mathrm{CL}}_{\mathrm{G}}\kern0.5em =\left({\uptheta}_{{\mathrm{CL}}_{\mathrm{G}}}+{\uptheta}_{\mathrm{EFFECT}}\ast {\mathrm{IMM}}_{\mathrm{parameter}}\right)\ast {\mathit{\exp}}^{\eta_{{\mathrm{CL}}_{\mathrm{G}}}} $$
(7)

Results

NLMEPC

As expected, both models described the data well as shown by their respective VPCs in Fig. 5, providing the evidence for adequate parameters’ estimates from the initial fit. The IMM described insulin observations adequately at the individual level as shown by a goodness of fit plot of insulin observations versus insulin predictions (Supplementary material). Plots of the 300 estimates of each parameter of the IGI model, grouped by the three different sources of the data are shown in Fig. 6. The simulation performance of the two models can be judged by comparing to the bootstrap data which is assumed to represent the truth or real-life data, i.e. looking at GSS or kGE2 parameter plots, both models’ simulations contained enough information to estimate these parameters with a similar distribution to bootstrap data. There was no difference in any of parameters distribution between the IGI model simulations and bootstrap data even though the medians did not overlap in some cases, however, 75% spread of the IMM simulations was below or above 75% spread of bootstrap in CLG, CLGI, GPRG, IPRG, ISS, kIE, kIS, Q, VG, and VI parameters. The estimated distribution of GPRG from the data simulated by the IMM was shifted toward zero. That concluded an obvious difference in the information content of the IMM simulations and bootstrap for these parameters. Additionally, simulations from the IMM showed large uncertainty with the remaining two parameters kGE1 and VP where kGE1 estimates were also close to zero. In fact, GPRG and kGE1 estimated distributions from the IMM are conveying a message about the implicit description of endogenous glucose production in the IMM. In general, the precision was good for bootstrap and when the same model is used for simulation and estimation, i.e. the IGI model, while the lowest precision is seen with the IMM simulations.

Fig. 5
figure 5

Visual predictive checks of the two dependent variables (glucose and insulin) from each model initial fit. The IGI model on the right-hand side and the IMM on the left-hand side, the lines represent the median of the data, the dashed lines are the 5th, 95th percentiles of the data. The shaded areas are the confidence intervals around the same percentiles in the simulated data. Both models described the data well as the median, 5th, 95th percentiles of the data are within the shaded areas.

Fig. 6
figure 6

Boxplots of the estimated distributions of each parameter of the IGI model, grouped by the three different sources of the data: one hundred bootstrap datasets (red), one hundred datasets simulated from the IGI model (green) and one hundred datasets simulated from IMM model (blue). The boxplot length is the interquartile range (the middle half of the distribution) while each whisker represents one quarter of the distribution. BS, bootstrap; IGI, integrated glucose insulin model; IMM, integrated minimal model;CLG, CLGI and CLI, insulin-independent glucose clearance, insulin-dependent glucose clearance and insulin clearance; GPRG, power parameter for glucose effect on endogenous glucose production; Gss, glucose basal concentrations; IFST, 1st phase insulin amount; IPRG, power parameter for glucose effect on 2nd phase insulin secretion; Iss, insulin basal concentrations; kGE1, kGE2, kIE and kIS, first order rate parameters; Q, glucose inter-compartmental clearance; VG, VI, VP, volumes of distribution of glucose and insulin central compartments, glucose peripheral compartment.

Similarly, plots of the 300 estimates of each parameter of the IMM, grouped by the three different sources of the data are shown in Fig. 7. Simulations from both models contained enough information to estimate SG, VOL, SI, QE1, enneCL, BETA, H, and QUINS parameters with a similar distribution as bootstrap data. Surprisingly, the 50% spread of the IMM simulations were above 75% spread of bootstrap data for parameters p2 and VI, so there is likely different information content between these two data sources. The same problem was found for parameter ALPH, p2 and x0 with the IGI model simulations whose 75% spread was below 50% spread of bootstrap and thus their information content is likely to be different, however for parameter VI, there was no difference between these two data sources. The 75% spread of the IGI model simulations were below 75% spread of bootstrap for parameters VOLI, QE2 and MTT3 and so there is an obvious difference in the information content of these two data sources, but no difference can be claimed between the IMM simulations and the bootstrap data for these three parameters. Interestingly, parameter VI2 was well estimated from the IGI model simulations even though this model has only one compartment disposition model for insulin kinetics. Finally, the precision for all parameters was good with the three different sources, with the highest precision with the IMM as expected.

Fig. 7
figure 7

Boxplots of the estimated distributions of each parameter of the IMM model, grouped by the three different sources of the data: one hundred bootstrap datasets (red), one hundred datasets simulated from the IGI model (green) and one hundred datasets simulated from IMM model (blue). The boxplot length is the interquartile range (the middle half of the distribution) while each whisker represents one quarter of the distribution. BS, bootstrap; IGI, integrated glucose insulin model; IMM, integrated minimal model; ALPH, time constant parameter for insulin provision; BETA, the feedback control of glucose on 2nd phase insulin secretion; enneCL, insulin clearance; SG, glucose effectiveness; H, glucose basal concentration; MTT3, insulin 1st phase mean transit time; p2, rate parameter for insulin effect; QE1 and QE2, inter-compartmental clearance of glucose and insulin respectively; QUINS, insulin basal concentration; SI, insulin sensitivity parameter; VI, VI2, VOL and VOLI, volumes of distributions of insulin central compartment, insulin peripheral compartment, glucose central compartment and glucose peripheral compartment respectively; x0,insulin 1st phase concentrations.

Mapping the Clinical Parameters

Each of the individual simulation parameter values for SG, SI and p2 was the first parameter to be added by the GAM as a true covariate on CLG, CLGI, and kIE individual estimates, respectively. These were the strongest correlations of the physiological indices and insulin action with the IGI model parameters. On incorporating these relations in the IGI model and refitting it to the large simulated data by the IMM, inter-individual variability assigned to CLG, CLGI, and kIE parameters were reduced by 98%, 68%, and 98% respectively as shown in Table I. Since most of the inter-individual variability on CLG and kIE were explained, but not on CLGI; the second IMMparameter added by the GAM on CLGI was incorporated as well, which was the volume of glucose central compartment VOL. That resulted in explaining almost all the remaining inter-individual variability on CLGI.

Table I Mapping of Interesting Parameters in the IMM to their Corresponding in the IGI Model

Discussion

Several mathematical models had been developed to improve our understanding and learning about the glucose-insulin regulatory system, and so that is where modeling has a major impact in the therapeutic area of diabetes. The progress in diabetes modeling was further acknowledged in 2008 with the approval of the first model based computer simulator of human metabolic system for pre-clinical testing of new treatment strategies for type I diabetes (27). In our work, we successfully applied our proposed evaluation procedure on two products of modeling research in diabetes over the past years, the IGI model and the IMM. The results of our work highlighted the pros and cons of the underlying structural assumptions behind each model, and accurately identified existing violations in agreement to previous findings (28).

Both models use a two-compartment disposition model to describe glucose kinetics with elimination from the central compartment only. The parameters of the IMM involved in this were glucose effectiveness SG, the central compartment volume of distribution VOL, the peripheral compartment volume of distribution VOLI, the intercompartmental clearance QE1, and basal glucose concentrations H. All these parameters were well estimated from bootstrap data and the IGI model simulated data except for VOLI which showed underestimation bias with IGI model simulated data. On the other hand, the parameters of the IGI model involved in two compartment description of glucose kinetics were the central compartment volume of distribution VG, the peripheral compartment volume of distribution VP, the intercompartmental clearance Q, insulin-independent glucose clearance CLG, insulin-dependent glucose clearance CLGI and basal glucose concentrations GSS. None of these parameters, except GSS, were well estimated from the IMM simulated data. These results question the simulation performance of IMM, as apparently, the IGI model offers a more informative approach.

For insulin kinetics, the IMM used a two-compartment disposition model with the central compartment volume of distribution VI, the peripheral compartment volume of distribution VI2, the intercompartmental clearance QE2, basal insulin concentrations QUINS and insulin clearance enneCL. Again, all these parameters were well estimated from the IGI model simulated data except for QE2 as expected since insulin kinetics in the IGI model are described with a one compartment disposition model. Interestingly the IGI model simulated data resulted in a more real-life-like estimated VI distribution compared to the IMM itself, and VI2 was precisely estimated from the IGI model simulations which should not be informative for this parameter similar to QE2. The one compartment disposition model of insulin kinetics in the IGI model is described by the central compartment volume of distribution VI, insulin clearance CLI, and basal insulin concentrations ISS. All of these parameters showed differences in the estimated distributions between bootstrap data and the IMM simulated data. Regarding insulin 1st and 2nd phase secretions, bootstrap data and the IGI model simulated data showed similar estimated distributions for the IMM parameter BETA, likely different estimated distributions for IMM parameters x0 and ALPH, and obviously different estimated distributions for IMM parameter MTT3. This could be attributed to the fact that the IGI model assumes no delay in insulin 1st phase secretion until it reaches the insulin 1st phase compartment IFPS and so its simulations resulted in lower values for mean transit time MTT3 and x0. In the IGI model, insulin secretions are defined by parameters IFST, IPRG, kGE2 and kIS. Only kGE2 was accurately estimated from the IMM simulated data, which is expected as the IMM assume delay in the effect of glucose on insulin 2nd phase secretions through ALPH. Again, the IMM delay in insulin 1st phase secretion led to the higher values of IFST and kIS. It’s important to note that parameters describing insulin secretions in both models were well estimated from bootstrap data and so the two models provided two different ways of capturing the same information. Insulin effect on glucose kinetics in the IMM is described through a remote compartment with different values for the binding rate constant k2 (lumped in SI) and the dissociation rate constant p2. The distribution of the insulin sensitivity parameter SI was well estimated from the IGI model simulated data, even though it was not explicitly described in this model, however, the distribution of p2 was not well estimated from the IGI model simulated data, as the IGI model assumed one value for the delay rate constant kIE. This might also explain the overestimated distribution of the IGI model parameter kIE with the IMM simulated data, as here kIE should be capturing information about two of the IMM parameters k2 and p2.

The major difference between the two models is their approach of describing endogenous glucose production, this is attributed to the type of data used in the development of each model. Since the IMM adopted the concept of net hepatic glucose balance NHGB and did not explicitly describe a time-varying endogenous glucose production, none of its parameters can reflect on changes in endogenous glucose after system perturbation. While in the IGI model, time-varying endogenous glucose production is explicitly given by an empirical function of basal glucose elimination rate GSS ∗ (CLG + CLGI ∗ ISS), basal glucose concentration GSS, glucose concentrations in effect compartment GE1 and the feedback power term GPRG. All these parameters were well estimated by bootstrap data but not by the IMM simulated data except for GSS as shown in Fig. 6. The overestimation of insulin-independent glucose clearance and the underestimation of insulin-dependent glucose clearance by the minimal models was reported previously (4) as a consequence of their implicit way of describing endogenous glucose production. Additionally, the IMM simulated data lacked the needed information for the estimation of the feedback power term GPRG and the delay rate parameter kGE1. These results may suggest that the IMM uses a constant direct input of endogenous glucose for healthy subjects following system perturbation, this structure misspecification in the original glucose minimal model was early addressed and found to result in a biased parametric description of the whole system (28). Follow-up extensive investigations were performed resulting in the development of the hot glucose two-compartment minimal model (6).

The evaluations using the NLMEPC approach thus revealed some shortcomings of the IMM. Most of these are expected given the structure of the model and the general knowledge of the physiology. However, while these shortcomings were not evident in the VPC, the NLMEPC with the IGI model analysis of the IMM simulated data clearly displayed the difference between these and real data. Also the IGI model simulated data showed shortcomings, although to a lower extent. These results concerning the IGI model can then be considered in the use or update of the IGI model. Thus, primarily the NLMEPC can provide a detailed answer to the question whether the model can generate real-life like data. Secondarily, it may provide further insight into the model deficiencies, when such are present. The information provided on deficiencies may well vary dependent on a number of factors such as the similarity between, and complexity of, the models and the understanding of the underlying system. The data used in this exercise was unlabeled IVGTT data, on purpose selected to be the same type of experimental data as was used to develop the IMM. The IMM model also showed an adequate fit, as judged by the VPC (Fig. 5). The IGI model was developed using labeled IVGTT data and clamp data and the present unlabeled data was only part of the information supporting it. While it may have been an advantage to use for this exercise part of the data from which the model was developed, it is noteworthy that the model appear robust despite much of the data used to develop it was not part of the present data set.

It should be noted that in this exercise we also re-analyzed data with the same model that was used to simulate them. This is not part of the NLMEPC concept, but a procedure sometimes performed to assess whether the estimation method employed is appropriate for the model and data of the analysis in question (29,30).

An important conclusion of our NLMEPC exercise is the fact that the IGI model simulated data contains similar information about the clinical parameters SG and SI as the IMM simulations and bootstrap data, which opens the door for the possibility of deriving these parameters from the IGI model. This fact was further supported by the results from the mapping exercise. Both glucose effectiveness SG in the IMM and insulin-independent glucose clearance CLG in the IGI model represented the same glucose elimination mechanism and were derived by insulin-independent rate constants and volumes of glucose central compartment. Thus, the individual estimates of SG were the first parameter to be added by the GAM on the individual estimates of CLG, and the inclusion of the simulated individual estimates of SG on population parameter CLG in the IGI model resulted in explaining almost all of the unexplained interindividual variability as expected. It must be noticed that even though both parameters carry similar information, only CLG measures glucose per se effect on glucose utilization while SG is derived from the hybrid rate constant k5 that contains also the inhibitory effect of glucose on endogenous glucose production, and this might be the reason for the overestimation of CLG from the IMM simulated data. As insulin sensitivity parameter SI in the IMM is derived from rate constants only (Eq.4), its inclusion alone did not explain all the inter-individual variability of CLGI, that is derived from both the elimination rate constant and the volume of glucose central compartment. In the stepwise addition to the GAM, the 2nd parameter to be added on CLGI was indeed the volume of glucose central compartment VOL in the IMM. Then on including the individual estimates of both SI and VOL as covariates on population parameter CLGI, most of the inter-individual variability was explained. One more interesting parameter derived from the IMM was insulin action p2, this parameter was fully mapped with kIE parameter in the IGI model.

From the GAM estimates of our mapping exercise (Eq. 7), we derived (Eq. 8) for the calculation of the individual estimates of SG from the IGI model, where CLG is the individual estimate of insulin-independent glucose clearance. Similarly, we derived (Eqs. 911) for calculating the individual estimates of VOL, SI, and p2 from the IGI model parameters. Noting that the product of SI and VOL as obtained from (Eqs. 910) will result in the insulin sensitivity index SI provided by the IMM.

$$ {\displaystyle \begin{array}{c}{\mathrm{CL}}_{\mathrm{G}}\kern0.5em =\left({\uptheta}_{{\mathrm{CL}}_{\mathrm{G}}}+{\uptheta}_{\mathrm{EFFECT}}\cdotp {\mathrm{IMM}}_{\mathrm{parameter}}\right)\cdotp {\mathit{\exp}}^{\eta_{{\mathrm{CL}}_{\mathrm{G}}}}\\ {}{\mathrm{CL}}_{\mathrm{G}}\kern0.5em =\left(0+0.0829\cdotp {\mathrm{S}}_{\mathrm{G}}\right)\cdotp {\mathit{\exp}}^{\eta_{{\mathrm{CL}}_{\mathrm{G}}}}\\ {}{\mathrm{S}}_{\mathrm{G}}\kern1.0em ={\mathrm{CL}}_{\mathrm{G}}/0.0829\end{array}} $$
(8)
$$ \mathrm{VOL}=\left({\mathrm{V}}_{\mathrm{G}}-0.9\right)/0.0899 $$
(9)
$$ \mathrm{SI}=\left(1000\cdotp {\mathrm{CL}}_{\mathrm{GI}}-0.009\cdotp \mathrm{VOL}\right)/\left(5094+502\cdotp \mathrm{VOL}\right) $$
(10)
$$ p2={\mathrm{k}}_{\mathrm{IE}}/0.938 $$
(11)

In conclusion, we demonstrated the NMLEPC as a new approach to investigate models’ ability to simulate real-life-like data, and the information captured in each model in comparison to real data, using two glucose homeostasis NLME models. In addition, we proposed a mapping approach for parameters carrying similar information between different NLME models.