Operational characteristics of full random effects modelling (‘frem’) compared to stepwise covariate modelling (‘scm’)

Amann, Lisa F.; Wicha, Sebastian G.

doi:10.1007/s10928-023-09856-w

Operational characteristics of full random effects modelling (‘frem’) compared to stepwise covariate modelling (‘scm’)

Original Paper
Open access
Published: 21 April 2023

Volume 50, pages 315–326, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Pharmacokinetics and Pharmacodynamics Aims and scope Submit manuscript

Operational characteristics of full random effects modelling (‘frem’) compared to stepwise covariate modelling (‘scm’)

Download PDF

Lisa F. Amann¹ &
Sebastian G. Wicha¹

1628 Accesses
1 Citation
Explore all metrics

Abstract

An adequate covariate selection is a key step in population pharmacokinetic modelling. In this study, the automated stepwise covariate modelling technique (‘scm’) was compared to full random effects modelling (‘frem’). We evaluated the power to identify a ‘true’ covariate (covariate with highest correlation to the pharmacokinetic parameter), precision, and accuracy of the parameter-covariate estimates. Furthermore, the predictive performance of the final models was assessed. The scenarios varied in covariate effect sizes, number of individuals (n = 20–500) and covariate correlations (0–90% cov-corr). The PsN ‘frem’ routine provides a 90% confidence intervals around the covariate effects. This was used to evaluate its operational characteristics for a statistical backward elimination procedure, defined as ‘frem_posthoc’ and to facilitate the comparison to ‘scm’. ‘Frem_posthoc’ had a higher power to detect the true covariate with lower bias in small n studies compared to ‘scm’, applied with commonly used settings (forward p < 0.05, backward p < 0.01). This finding was vice versa in a statistically similar setting. For ‘frem_posthoc’, power, precision and accuracy of the covariate coefficient increased with higher number of individuals and covariate effect magnitudes. Without a backward elimination step ‘frem’ models provided unbiased coefficients with highly imprecise coefficients in small n datasets. Yet, precision was superior to final ‘scm’ model precision obtained using common settings. We conclude that ‘frem_posthoc’ is also a suitable method to guide covariate selection, although intended to serve as a full model approach. However, a deliberated selection of automated methods is essential for the modeller and using those methods in small datasets needs to be taken with caution.

Operating characteristics of stepwise covariate selection in pharmacometric modeling

Article 24 April 2019

A Modified Hybrid Wald’s Approximation Method for Efficient Covariate Selection in Population Pharmacokinetic Analysis

Article 03 March 2021

Impact of covariate model building methods on their clinical relevance evaluation in population pharmacokinetic analyses: comparison of the full model, stepwise covariate model (SCM) and SCM+ approaches

Article 09 April 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Over the past decades, population pharmacokinetic modelling with nonlinear mixed effects (NLME) approaches efficiently supported drug development. During model development covariates are analysed to establish a relationship between a model parameter and a patient specific variable. A covariate can be any variable on patient-level (not time varying) that influences the pharmacokinetics (PK) or pharmacodynamics (PD) of a drug. If informative, it reduces unexplained inter-individual PK or PD variability. To guide dose adjustments in special patient populations (e.g. elderly, adipose, hepatically or renally impaired patients), a covariate analysis is also of interest to regulatory authorities [1]. To date, a number of automated covariate selection techniques are available [2]: these include e.g. stepwise covariate modelling (‘scm’) [3], or least absolute shrinkage and selection operator (lasso) [4]. The stepwise procedure tests predefined covariates on structural PK or PD parameters of interest. Automated covariate selection methods are statistically driven methods. The ‘scm’ includes covariates by the highest drop of objective function (dOFV) with a predefined p-value during the forward inclusion. In one of the more common implementations covariates are included until the likelihood ratio test identifies no significant covariate parameter relationship anymore. Afterwards, the backward elimination reduces the covariate model to obtain the final model, by applying a stricter p-value. This method has been evaluated on their properties and compared to other established methods before [5, 6]. In contrast to that, the ‘frem’ is a full model approach and includes all covariates of interest as observations (i.e., explicitly defining the likelihood of the covariate values) [7]. A full covariance matrix quantifies the random effects of PK parameters and describes parameter covariate relationships [8]. With the matrix, covariances of covariates can inform for other covariates so that this method is less sensitive to collinearity. Covariate coefficients are obtained from the ratio of covariance between parameter and covariate variability to the covariate variance [7].

The novel ‘frem’ method has not been applied to many clinical datasets yet [9,10,11]. Although ‘scm’ and ‘frem’ are techniques that are rather complementary in nature due their inherently different way to approach covariate modelling, a structured comparison of the operational characteristics using a simulation study is lacking. The aim of this study was to compare the ‘scm’ and ‘frem’ as automated covariate analysis methods. In order to enable a comparison, we here introduce the ‘frem_posthoc’ that offers a covariate selection step from the final ‘frem’ model using the confidence intervals around the estimated covariate effect sizes in the final ‘frem’ model. In the present study, the following aspects between ‘scm’, ‘frem’ and ‘frem_posthoc’ were compared: (1) the power to identify the true covariate (here defined as the covariate with the highest correlation with the PK parameter), (2) accuracy and precision of the estimated relationship, as well as (3) the predictive performance. To enable a thorough comparison, we investigated the impact of dataset size (n = 20–500), and covariate correlation (0–90%) for three covariate effect sizes in sparse simulated datasets using the commonly used (‘scm’) or predefined (‘frem’/’frem_posthoc’) settings of both approaches as well as statistically equal settings.

Methods

The workflow of this simulation study is shown in Fig. 1. The simulation dataset contained three covariates sampled from a multivariate normal distribution. The dataset was used to simulate with a one compartment model including the true covariate relationship on clearance. These simulated clinical datasets served for ‘scm’ and ‘frem’ analyses (n = 1000 for each scenario). Based upon the final models, power, precision, and accuracy were evaluated. The following section describes the single steps in detail.

Software

NLME modelling was applied with NONMEM^® 7.5.0 [12], controlled through PsN 5.0.0 [13]. The software R (version 3.6.0) [14] was used for automated run executions and data analysis. The NONMEM^® models as well as relevant R code are provided in Supplement 1.

Generation of datasets and simulation of PK data

Continuous covariates

Three vectors of three covariates (i.e., covariate_I, covariate_II and covariate_III) with defined means, and variances were drawn from a multivariate normal distribution (Supplement 3, Figure S3-1). The datasets included various correlations of covariate_I (cov_true) and covariate_II from 0 to 90%. Covariate_III represented pure “noise” and was independent from cov_true and covariate_II. All simulations used individually simulated datasets with 20, 50, 100 or 500 virtual patients (n) including 2 (sparse) concentration-time points per individual. The sparse sampling datasets included samples in the sixth and twelfth dosing interval (1 and 11.5 h time after last dose, respectively). PK profiles of the scenarios (1-CMT PK model, i.v. short infusion, linear elimination) were obtained via Monte Carlo simulations. The true PK model (run001) is described in Supplement 1.1. The simulated dose was 100 mg q12 h with 30 min infusion. The PK model parameters were clearance (CL) of 18 L/h with inter-individual variability on CL (IIV_CL: 0.1 variance, log-normal distribution), central volume of distribution (V1) of 400 L and a residual proportional error (%CV) of 15%. Cov_true was implemented as an exponential covariate on CL ($\theta_{CL }$) with the $\theta_{cov }$ as covariate coefficient (Eq. 1):

$$CL = \theta_{CL } \, \cdot \,e^{\left( {\theta_{cov } \, \cdot \,\left( {COV - COV_{mean} } \right)} \right)} \, \cdot \,e^{\eta_i } .$$

(1)

The individual covariate value (cov) was normalized by the mean of the covariate distribution $\left( {cov - cov_{mean} } \right)$. The remaining unexplained inter-individual variability ($\eta_i$) described the individual deviation from the typical parameter $\theta_{CL{ }} { }$ for the ith individual (Eq. 1).

The observed concentration $Y_{observed,i,j}$ was calculated by the predicted concentration $Y_{predicted,i,j}$ multiplied by the proportional residual unexplained variance per individual i at each time point j (Eq. 2). No inter-occasion variability was included:

$$Y_{observed,i,j} = Y_{predicted, i,j} \, \cdot \,\left( {1 + \varepsilon_{prop,i,j} } \right).$$

(2)

The simulated covariate effect magnitudes varied between $\theta_{cov}$ = 0.026, 0.032 and 0.045, respectively. This resulted in relative effect sizes of − 18 to + 22%, − 22 to + 27% and − 29 to + 41% on CL at the 5th − 95th percentile of covariate values.

Evaluation using ‘scm’ or ‘frem’ models

Parameter estimation was performed using first order conditional estimation with interaction (FOCE+I), allowing three minimum retries on each simulated dataset (for each scenario, n = 1000). The structural model used for estimation is described in Supplement 1.2 (run002). The ADVAN 1 subroutine was used as analytical solution of the 1-CMT model. All three previously simulated covariates in the simulated dataset were provided to the ‘scm’, as well to ‘frem’ for analysis. The ‘scm’ and ‘frem’ were executed on each simulated dataset. The final ‘scm’ model results were either obtained in the last forward/backward step, or if the covariate identification failed, no covariate model was obtained. The ‘frem’ is a full model approach that includes all provided covariates simultaneously. Thereby, results cannot be compared to ‘scm’ without restrictions. To address the fundamental differences of these methods we evaluated the results in three settings:

(i)
Scenario 1 evaluated the operational characteristics of ‘frem_posthoc’. A covariate backward elimination from final ‘frem’ models was performed via the 90% confidence intervals of the estimated covariate effect and compared to final ‘scm’ models obtained with commonly used settings (forward inclusion, p < 0.05 and a backward elimination p < 0.01).
(ii)
Scenario 2 assessed a statistical ‘head-to-head’ comparison of ‘frem_posthoc’ and ‘scm’ cov_true coefficients with only forward inclusion (p < 0.1)
(iii)
Scenario 3 showed a comparison of all estimated ‘frem’ cov_true covariate coefficients without a selection step compared to ‘scm’ results of Scenario 1.

Scenario 1

A forward selection with a p-value of < 0.05 and a backward elimination (p < 0.01) was used reflecting the commonly used settings of the ‘scm’. We compared those ‘scm’ runs, which selected cov_true to those ‘frem’ runs that identified cov_true with a covariate effect significantly different from zero. The significance was interpreted by the 90% confidence interval obtained from sampling importance resampling (SIR) [15]. The results were extracted from the PsN provided results files (PsN 5.0.0), and the effect sizes (5th – 95th percentile of the covariate effect, 90% confidence interval) reflect the default setting of the ‘frem’ PsN routine. Since this setting evaluated a backward elimination, we define this use case of the ‘frem’ as ‘frem_posthoc’. We furthermore defined power (1–type II error) as the frequency of selecting cov_true in the covariate model (‘scm’), or as frequency to identify cov_true as a covariate with the highest effect size different from zero and non-overlapping 90% confidence interval (‘frem_posthoc’). For the ‘frem_posthoc’, the estimated univariate θ_cov coefficient was evaluated (PsN ‘frem_results.csv’), which represents the effect of a single covariate in isolation [7]. Conditional accuracy and conditional precision, expressed as rbias (Eq. 3) and rrmse (Eq. 4), were calculated as follows for significant cov_true coefficients:

$$rBIAS (\% ) = \frac{1}{N}\, \cdot \,\mathop \sum \limits_1^i \frac{(estimated_i - true_i )}{{true_i }}\, \cdot \,100,$$

(3)

$$rRMSE(\% ) = \sqrt {\frac{1}{N}\, \cdot \,\mathop \sum \limits_1^i \frac{(estimated_i - true_i )^2 }{{true_i^2 }} } \, \cdot \,100.$$

(4)

The denominator (N) was different across the simulated scenarios and methods, as the number of simulations for which cov_true coefficients was evaluated changed accordingly.

Moreover, true alpha values (Type-I error rate) were evaluated based on cov_III inclusion in the forward ‘scm’ models and the final ‘frem_posthoc’ models. Cov_III is independent of the others and represents pure noise without having any simulated relationship between the pharmacokinetics and cov_III. The alpha values in the final ‘frem_posthoc’ models were defined as the frequency of runs in which the cov_III effect was not overlapping with zero.

According to Ribbing et al., we calculated the fraction of predictive models by assuming an estimated covariate coefficient between zero and two times cov_true to be likely to improve the predictive performance of a model [16]. For each scenario the fraction of predictive models was calculated (Eq. 5), where e represents the covariate effect size, c the correlation between cov_true and cov_II and N the dataset size varying from n = 20–500. S_ecnN represented the models which included cov_true (‘scm’) for the respective scenario. For ‘frem_posthoc’ coefficients, s_ecnN represented all runs or those including a significant cov_true relationship with the highest effect of all three covariates in the models for comparison to the ‘scm’.

Fraction of predictive model:

$$s_{ecN} = 100 \, \cdot \, \frac{{\sum_{n = 1}^{1000} \left( {P_{ecnN} \, \cdot \, s_{ecnN} } \right)}}{{\sum_{n = 1}^{1000} s_{ecnN} }}(\% ),$$

(5)

where,

$$P_{ecnN} = \left\{ {\begin{array}{*{20}l} 1 \hfill & {\quad if \left| {\frac{{\hat{\theta }_{ecnN} - \theta_{ecnN} }}{{\theta_{ecnN} }}} \right| < 1} \hfill \\ 0 \hfill & {\quad otherwise} \hfill \\ \end{array} } \right..$$

Scenario 2

For a comparison of equal selection criteria, ‘scm’ runs with only forward inclusion (p-value < 0.1) were compared to ‘frem_posthoc’ results (which evaluates overlap/non-overlap with zero of the 90% confidence interval). Settings for ‘frem_posthoc’ were not changed compared to scenario 1. Power, conditional accuracy and precision were calculated for those runs, where the included cov_true was statistically significant. Similar to scenario 1, the predictive performance of final ‘scm’ and frem_posthoc’ models was evaluated according to Ribbing et al. [16]. As the number of significant runs changed across the simulated scenarios (e.g. n, covariate effect magnitude, cov-corr) the denominator to calculate these evaluation metrices changed between also between both methods.

Scenario 3

In this scenario, conditional accuracy, and precision, but also the predictive performance of all estimated ‘frem’ cov_true coefficients (i.e. no posthoc selection step from the final ‘frem’ model) were compared to ‘scm’ models obtained in scenario 1.

Categorical covariates

Additionally, a simulation study (n = 500) with a true dichotomous categorical covariate was performed. The dataset size varied from n = 20–500 and covariate correlation to a continuous covariate was 0% or 80%. The third covariate (continuous) was independent of the others. The true model included the categorical covariate as a fractional change of clearance with an effect size of either − 20% or − 40%. IIV_CL, but also inter individual variability on central volume of distribution (IIVV_c) was included in the model. More details on this study are described in Supplement 2.

Results

Power of cov_true inclusion for ‘scm’ and ‘frem_posthoc’

The power to include the cov_true throughout the investigated scenarios was highly variable. Overall, the power to select cov_true increased with dataset size or covariate effect and decreased in presence of covariate collinearity.

In scenario 1, the simulations and estimations showed that ‘frem_posthoc’ power was higher compared to the ‘scm’ throughout all scenarios (Fig. 2), likely due to the higher value for alpha of 0.1 in the ‘frem_posthoc’ (non-overlapping 90% confidence interval of the covariate effect size) vs. 0.01 in the ‘scm’. The dataset size (n = 20 to n = 100) strongly increased power for both methods. The presence of covariate correlation reduced the power of ‘frem_posthoc’ from 82 to 59% (n = 50, $\theta_{cov_{true} }$ = 0.032) whereas the ‘scm’ power was less affected by correlation in the simulated scenarios (Table 1). Moreover, with an increasing covariate effect on clearance, we observed an increase of power from 28% (‘scm’, n = 50, 0% cov-corr, $\theta_{cov_{true} } = 0.026$) to 80% $(\theta_{cov_{true} } = 0.0{45)}$ and from 64 to 96% for ‘frem_posthoc’. Scenarios with n = 500 showed a power of > 91%, independent of covariate effect magnitude and were less influenced by covariate collinearity (Fig. 2 and Table 1).

Table 1 Simulation and estimation results of ‘scm’ and ‘frem_posthoc’ in scenario 1

Full size table

Moreover, the frequency of a significant cov_II effect in the final ‘frem_posthoc’ models was > 77% in presence of ≥ 80% cov-corr (n ≥ 100). In contrast to that, cov_II was significantly included in < 16% of ‘scm’ runs.

In scenario 2, a ‘head-to-head’ comparison with a statistically similar setting was performed (same setting for the ‘frem_posthoc’ as in scenario 1 and ‘scm’ with sole forward inclusion using an alpha value of 0.1): the less strict alpha value in combination with only forward inclusion led to an increase of power for the ‘scm’, resulting in above 53% and with that being superior to ‘frem_posthoc’ (Fig. 3). More details are described in Supplement 3.

No comparison of power is possible for scenario 3 due to the missing selection step in the ‘frem’.

Conditional accuracy and precision of $\theta_{cov_{true} }$ estimates

In scenario 1, an overestimation in small n datasets was more pronounced for ‘scm’ than for ‘frem_posthoc’ (Fig. 2). Thus, ‘frem_posthoc’ covariate coefficients were more accurate and precise. (Fig. 2, Supplement 3, Figure S3-2). We observed for both methods a power-dependent increase in conditional accuracy up to unbiased estimates, see Table 1 and Fig. 2. For example, the rbias of ‘scm’ coefficients was reduced from 50% ($\theta_{cov_{true} } = 0.026$) to 8% ($\theta_{cov_{true} } = 0.045$) in small datasets (n = 50) in presence of 90% cov-corr.

The conditional precision of the estimated coefficients in scenario 1 showed the same trend: Imprecision steeply decreased with increasing power (Table 1). With both methods, we obtained imprecise estimates in small n datasets (n = 50, $\theta_{cov_{true} } = 0.032$, ‘frem_posthoc’: 35%, ‘scm’ 42%), independent of correlation.

In scenario 1, CL and V_c were accurately (rrmse < 10%) and precisely (rbias < 3%) estimated in the final ‘scm’ as well as the ‘frem_posthoc’ model. The proportional error model estimate trended to underestimation (rbias > − 11%) and was less precise with rrmse < 27%.

In scenario 2, the higher alpha value of 0.1 in scenario 1 for ‘scm’ forward selection strongly reduced overestimation of coefficients to a rbias below 48%. As a result, conditional accuracy was higher compared to ‘frem_posthoc’, whereas conditional precision of ‘scm’ coefficients was similar to ‘frem’ coefficients throughout the scenarios (Fig. 3, Supplement 3 Table S3-1). Additional details are described in Supplement 3.

Furthermore, scenario 3 compared all ‘frem’ cov_true estimates without a selection step to those of the final ‘scm’ models obtained after backward elimination. This analysis quantitatively shows the effect of selection bias if compared to scenario 1 results. In sum all ‘frem’ coefficients were unbiased. Moreover we observed still a high imprecision of ‘frem’ coefficients in small n datasets (n < 100) which was independent of the selection step, but ‘frem’ showed a superior precision compared to ‘scm’ especially in small n datasets, (Fig. 4). Further details are described in Supplement 3.

The simulation study using a true categorical covariate showed the same trend of power, conditional rbias and rrmse for scenario 1 and scenario 2, whereas the differences of our evaluation criteria were smaller between ‘scm’ and ‘frem_posthoc’, if compared to the simulation study using a true continuous covariate. Supplement 2 provides a detailed description of all obtained results.

Predictive performance of ‘scm’ and ‘frem_posthoc’ models

Scenario 1 evaluated the estimated covariate coefficients of the final ‘scm’ and ‘frem_posthoc’ models for their predictivity, i.e., were termed predictive when estimated between zero and two times the true value. The results are shown in Fig. 5. The predictive performance of the cov_true estimates was a function of power for the ‘scm’, but also for ‘frem_posthoc’. The ‘frem_posthoc’ showed a higher power in small n datasets, thus the fraction of predictive models was more than twice as high compared to ‘scm’. On the other hand, the fraction of predictive ‘scm’ models increased more steeply with increasing power. At a power value of > 28% more than 90% of the final models were likely to improve the predictivity (‘frem_posthoc’ > 47% power).

As power is a composite of dataset size, covariate effect size and correlation, we analysed the individual components on their relation to influence the fraction of predictive models (Supplement 3 Figure S3-3). We observed that dataset size, covariate effect size, rbias and rrmse most influenced the fraction of predictive models and that predictive performance was less impacted by covariate correlation.

The fraction of predictive ‘scm’ and ‘frem_posthoc’ models in scenario 2 were similar (scm: 97.0% ‘frem_posthoc’: 97.5%, n = 50, cov-corr = 80%, $\theta_{cov_{true} } = 0.026$) and reached both 100% in the scenario with the highest simulated covariate effect magnitude, $\theta_{cov_{true} } = 0.045,$ n > 50), see Supplement 3 Figure S3–4.

Overall, final ‘frem’ models (scenario 3) were providing highly predictive covariate coefficient estimates, which were mainly driven by covariate effect magnitude and independent of the dataset size (Supplement 3 Figure S3-6).

Type 1 error

For scenario 1, the true alpha values are displayed in Fig. 6. Overall, ‘frem_posthoc’ indicated a false significant covariate effect of the dummy covariate cov_III in more cases, than the given 10% confidence intervals of the covariate effects would imply, i.e., an inflated type 1 error rate was observed. The ‘scm’ also displayed inflated type 1 error rates for small datasets. For n ≥ 100 both methods approached the set alpha value of 10% (‘frem_posthoc’) or 5% (‘scm’).

In scenario 2, the true ‘scm’ alpha values were between 6 and 11% and with that close to the expected 10% value.

Discussion

In the present study, we compared operational characteristics of the novel ‘frem’ technique to ‘scm’ as automated covariate analysis methods. As the ‘frem’ method is a full model approach and does not originally comprise a selection step, we introduced the ‘frem_posthoc’ step to account for a covariate backward elimination based on significant covariate effect sizes. This reflects an additional application of a ‘frem’ model in an exploratory analysis. Overall, this study gave insights in operational characteristics of the ‘frem’ method, but also showed the ability of ‘frem_posthoc’ to guide covariate selection. Yet, for ‘frem_posthoc’ the same caution as for the ‘scm’ should be applied since this posthoc step also can introduce selection bias in scenarios with low power (i.e. small covariate effect size, small sample size). Of note, an evaluation of precision and accuracy of all cov_true ‘frem’ estimates without a selection step showed that the covariate effect estimates were unbiased and showed lower imprecision as those determined using the ‘scm’, which were biased due to the selection step, in particular in scenarios with low power. This underlines the value of the ‘frem’ method. It has the additional advantage of interpreting the covariate effect simultaneously to statistical significance without the need for further evaluate the parameter uncertainty, which is needed for ‘scm’ to evaluate clinical relevance (e.g. bootstrap, llp-sir [17]). In large datasets, both methods provided precise and accurate inference on covariate effects in our simulation study. Moreover, Yngman et al. described an advantage of ‘frem’ model, that it can provide covariate coefficients for any subset of the examined covariates and thus be applied to different covariate datasets [7]. In addition, a model reduction of the full model could be done in a stepwise manner, if a more parsimonious model is desired [2, 7]. This simulation study comprised an investigation of final ‘frem’ model subsets for the purpose of covariate backward elimination, presented in scenarios 1–2.

The statistical power to detect true covariate effects is important to guide clinical study design. Ribbing et al. described that dataset size, magnitude of collinearity, and covariate effect size influence the power of the ‘scm’ method [16]. Ahamadi et al. investigated the operating characteristics of ‘scm’ using different complexities of true models (i.e. 1–4 true covariates). Those scenarios with one true covariate (n = 300, cov-corr 32% or 89%, 250 simulations) reached a high power [18]. This is in line with our results in datasets n ≥ 100. Beyond that, our observed power increase, as a result of increased dataset- and covariate effect size, as well as a reduction of power caused by collinearity of covariates are in line with Ribbing et al. [19]. In comparison to that, the ‘frem_posthoc’ showed an up to three-fold higher power in the worst-case scenario with high correlation in small cohort studies (scenario 1, $\theta_{cov_{true} } = 0.026$), likely as a result of the different alpha values in the selection step. In scenario 2, power differences of the two methods were smaller, rather favouring ‘scm’. We however think that ‘scm’ with only forward inclusion and an alpha value of 0.1 does not represent common practice. Moreover, it is known that ‘scm’ suffers from multiple testing which is not the case of the ‘frem_posthoc’ method, which makes this an interesting comparison.

In this study cov_II carries up to 90% of the information of cov_true. ‘Frem_posthoc’ accounts for correlation and the high frequency significant cov_II inclusions in high correlation scenarios represents its ability to account for correlation. In contrast to that, ‘scm’ with forward selection (p value < 0.05) and backward elimination (p value < 0.01), but also with applying only forward inclusion (p-value < 0.1) is intrinsically not able to capture the true present correlation. However, the model prediction using a wrong, but highly correlated covariate, that carries information of the true covariate could be comparable to including the true covariate. One the one hand, the inclusion would lead to interpretation difficulties, on the other hand, an exclusion of correlated covariates could also cause confounded interpretation of covariate effect estimates, as the correlated covariate carries parts of the true covariate information. Thereby pharmacological understanding is key for decision making.

We also investigated a scenario with a true categorical covariate with and without an additional level of variability, the IIVV_c. The results showed a similar behaviour as observed for continuous covariates. Scenarios 1 and 2 showed only minor differences in power for ‘scm’ and ‘frem_posthoc’ in cases when the covariate has a strong effect size. The additional level of variability decreased power by up to ca. − 5%. The simulation study using a true continuous covariate did not include IIVV_c, so we assume an overall worsening effect of the presented continuous covariate study results in presence of IIVV_c here.

Moreover, conditional accuracy and precision of the covariate coefficients were investigated in case cov_true was selected in the final models. In scenario 1 bias was present in both methods, however slightly lower when using the ‘frem_posthoc’ (especially in low power scenarios). In scenario 2 the findings were vice versa, so that overestimation was less pronounced for ‘scm’, resulting from a less strict alpha value in the selection step. According to Wahlby et al. selection bias is only moderate in typical PK modelling dataset [5], but this was only confirmed for covariates with high effect sizes [16]. In scenario 3 unbiased ‘frem’ estimates were obtained, as no covariates were selected, and all estimated coefficients were considered for the evaluation.

Conditional precision was more precise for ‘frem_posthoc’ compared to ‘scm’ in scenario 1 and equally high in scenario 2. Precision was improved by a less strict alpha value (‘scm’ in scenario 1 vs. scenario 2). As precision is a function of power, we assume that the increased precision is caused by increased ‘scm’ power.

Beyond that, in scenario 1 we evaluated the predictive performance of the final models and used the range of zero to two times the true coefficient value as a predictor for improvement of the model fit, according to Ribbing et al. [16]. The present study confirmed the predictive performance of ‘scm’ models being a function of power and we confirmed this for ‘frem_posthoc’ estimates. Compared to ‘scm’ models, the fraction of predictive ‘frem_posthoc’ models was higher, especially in scenarios which achieved power < 50%. The predictive performance was positively correlated with rbias, rrmse and number of study individuals. Interestingly, covariate collinearity did not impact the predictive performance (Supplement 3 Figure S3-3).

The type 1 error rate was evaluated with cov_III being independent from the other two available covariates. The previously described inflated type 1 error rate in the ‘scm’ approach [5] was confirmed in this study but was also observed for the ‘frem_posthoc’. The ‘frem_posthoc’ alpha values were decreasing with increasing study size but were still slightly inflated. The confidence interval of the covariate effect is calculated by SIR in the PsN implementation of ‘frem’ [15]. The confidence interval served for the calculation of the frequency in how many of the performed runs the cov_III effect size was estimated to be significantly different from zero. Broeker et al. found, that especially in small n datasets the SIR-derived confidence interval tends to be underestimated, in particular for the omega values [17]. This underestimation might explain the inflated alpha values, as zero is less often included in the SIR-based confidence intervals if they are too narrow.

‘Frem’ is mathematically equivalent to FFEM, which has been suggested as an alternative to stepwise procedures [2]. Although a backward elimination is not originally intended by the full model approach, as this may curtail its benefits, a guidance for this backward elimination step has been proposed by Gastonguay et al. [2]. A model reduction based on covariate effect size, has also been applied to clinical data [20]. A reduction of a full model for predictive purposes can be done via exclusion of non-statistically significant (CI includes null value) and non-clinically important (entire CI contained within no effect range) covariate effects. Covariates which are clinically important and statistically significant, or are not statistically significant but may be clinically important should be retained in the model [2]. The clinical relevance criteria was not considered in our study evaluation, as this additional filter is subjective in a simulation study and driven by the pharmacological considerations. Furthermore, statistically significant effects are clearly defined, whereas the often used clinical relevance threshold of 20% is not. This threshold may apply for clearance; however, it can be different for other PK parameters related to a covariate effect. Moreover, this threshold can be dependent on the indication, pharmacometric question to be answered or substance itself, e.g. a narrow therapeutic window could reduce the threshold. These factors cannot be fully reflected in a simulation study.

A few more limitations shall be mentioned: the here evaluated scenarios only display a portion of the complexity of covariate analysis in real clinical datasets. The here simulated cov_true effect magnitudes were chosen around the often-used clinical significance threshold of 20% on clearance [12] displaying a weak, moderate, and strong effect as it could be expected in a real clinical dataset. However, neither collinearity between more than one covariate, nor the presence of more than one true covariate carrying information was investigated.

To calculate the fraction of predictive models amongst the evaluated runs in each scenario, we assumed an estimated covariate coefficient between zero and two times cov_true to be likely to improve the predictive performance of a model [16]. This in other words accounts for up to 100% overestimation, so that even in presence of a strong selection bias, coefficients were rated as predictive. Highly biased covariate coefficients make the model less adequate for predictive purposes and could ultimately cause misleading clinical interpretation on e.g., clearance if the covariate coefficient originates from small n datasets (< n = 100). However, as ‘frem_posthoc’ has not been applied to clinical data yet, this needs to be further evaluated.

Besides that, this simulation study investigated only covariates on clearance, but usually clinical covariates are also found on other model parameters. Moreover, interindividual variability on central volume of distribution is very common in clinical datasets but was not included in the analysis using a true continuous covariate. Based on prior knowledge and confirmatory results obtained in the simulation study with categorical data, we assume a reduction of power in presence of more levels of variability. Moreover, the covariate coefficients directly obtained via the PsN ‘frem’ routine, represent exponential covariate parameterization in fixed effects models [8]. Other implementations might be of interest, too, and could be explored in subsequent studies.

Conclusion

Overall, this study contributed to the understanding of the ‘frem’ and showed properties and characteristics of the methods for continuous but also categorical covariates. We introduced with ‘frem_posthoc’ a possibility to guide covariate selection, mimicking how ‘frem’ could be additionally used in practise. With that, covariate effect size interpretation and selection can be done simultaneously and a predictive model with capturing correlation in the datasets can be obtained. Using the commonly applied settings of ‘scm’ and ‘frem’, in small n datasets the power of ‘frem_posthoc’ was substantially higher, leading to a lower bias, compared to ‘scm’ in scenario 1. In datasets with n > 100 power, precision, and accuracy of ‘frem_posthoc’ were comparable to ‘scm’. However, the simulated scenarios still highlight the need for thoughtful choice of the method to answer the underlying pharmacometric question in small datasets.

References

U.S. Food and Drug Administration. Guidance for industry: Pharmacokinetics in patients with impaired hepatic function: Study design, data analysis, and impact on dosing and labeling, U.S.Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation and Research (CDER),Center for Biologics Evaluation and Research (CBER), May 2003.Clinical Pharmacology. Available from http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/ucm072123.pdf. Accessed 23 Jun 2022
Gastonguay MR. Full covariate models as an alternative tomethods relying on statistical significance for inferences aboutcovariate effects: a review of methodology and 42 case studies. Twentieth Meeting, Population Approach Group in Europe; 2011 Jun 7–10; Athens. Available at https://www.page-meeting.org/pdf_assets/1694-GastonguayPAGE2011.pdf. Accessed 06 Jan 2023
Jonsson EN, Karlsson MO (1998) Automated covariate model building within NONMEM. Pharm Res 15:1463–1468. https://doi.org/10.1023/a:1011970125687
Article CAS PubMed Google Scholar
Ribbing J, Nyberg J, Caster O, Jonsson EN (2007) The lasso—a novel method for predictive covariate model building in nonlinear mixed effects models. J Pharmacokinet Pharmacodyn 34:485–517. https://doi.org/10.1007/s10928-007-9057-1
Article PubMed Google Scholar
Wahlby U, Jonsson EN, Karlsson MO (2002) Comparison of stepwise covariate model building strategies in population pharmacokinetic-pharmacodynamic analysis. AAPS PharmSci 4:1–12. https://doi.org/10.1208/ps040427
Article Google Scholar
Hutmacher MM, Kowalski KG (2015) Covariate selection in pharmacometric analyses: a review of methods. Br J Clin Pharmacol 79:132–147. https://doi.org/10.1111/bcp.12451
Article PubMed Google Scholar
Yngman G, Bjugård Nyberg H, Nyberg J et al (2022) An introduction to the full random effects model. CPT Pharmacomet Syst Pharmacol 11:149–160. https://doi.org/10.1002/psp4.12741
Article CAS Google Scholar
Karlsson MO (2004) A full model approach based on the covariance matrix of parameters and covariates. Aaps 20:207–253
Google Scholar
Bruno R, Vivier N, Vergniol JC et al (1996) A population pharmacokinetic model for docetaxel (Taxotere^®): model building and validation. J Pharmacokinet Biopharm 24:153–172. https://doi.org/10.1007/BF02353487
Article CAS PubMed Google Scholar
Brekkan A, Lopez-Lazaro L, Yngman G et al (2018) A population pharmacokinetic-pharmacodynamic model of pegfilgrastim. AAPS J 20:1–14. https://doi.org/10.1208/s12248-018-0249-y
Article CAS Google Scholar
Novakovic AM, Krekels EHJ, Munafo A et al (2017) Application of item response theory to modeling of expanded disability status scale in multiple sclerosis. AAPS J 19:172–179. https://doi.org/10.1208/s12248-016-9977-z
Article CAS PubMed Google Scholar
Boeckmann AJ, Sheiner LB, Beal SL, Bauer RJ (2011) NONMEM Users Guide. NONMEM Project Group, University of California at San Francisco, Ellicott City
Google Scholar
Lindbom L, Pihlgren P, Jonsson N (2005) PsN-Toolkit—a collection of computer intensive statistical methods for non-linear mixed effect modeling using NONMEM. Comput Methods Programs Biomed 79:241–257. https://doi.org/10.1016/j.cmpb.2005.04.005
Article PubMed Google Scholar
R Core Team (2019) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Dosne A-G, Bergstrand M, Harling K, Karlsson MO (2016) Improving the estimation of parameter uncertainty distributions in nonlinear mixed effects models using sampling importance resampling. J Pharmacokinet Pharmacodyn 43:583–596. https://doi.org/10.1007/s10928-016-9487-8
Article PubMed PubMed Central Google Scholar
Ribbing J, Niclas Jonsson E (2004) Power, selection bias and predictive performance of the population pharmacokinetic covariate model. J Pharmacokinet Pharmacodyn 31:109–134. https://doi.org/10.1023/B:JOPA.0000034404.86036.72
Article CAS PubMed Google Scholar
Broeker A, Wicha SG (2020) Assessing parameter uncertainty in small-n pharmacometric analyses: value of the log-likelihood profiling-based sampling importance resampling (LLP-SIR) technique. J Pharmacokinet Pharmacodyn 47:219–228. https://doi.org/10.1007/s10928-020-09682-4
Article PubMed PubMed Central Google Scholar
Ahamadi M, Largajolli A, Diderichsen PM et al (2019) Operating characteristics of stepwise covariate selection in pharmacometric modeling. J Pharmacokinet Pharmacodyn 46:273–285. https://doi.org/10.1007/s10928-019-09635-6
Article PubMed Google Scholar
Bonate PL (1999) The effect of collinearity on parameter estimates in nonlinear mixed effect models. Pharm Res 16:709–717
Article CAS PubMed Google Scholar
Hu C, Adedokun O, Ito K et al (2015) Confirmatory population pharmacokinetic analysis for bapineuzumab phase 3 studies in patients with mild to moderate Alzheimer’s disease. J Clin Pharmacol 55:221–229. https://doi.org/10.1002/jcph.393
Article CAS PubMed Google Scholar

Download references

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Clinical Pharmacy, Institute of Pharmacy, University of Hamburg, Bundesstraße 45, 20146, Hamburg, Germany
Lisa F. Amann & Sebastian G. Wicha

Authors

Lisa F. Amann
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian G. Wicha
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.F.A conceived, designed and performed the analysis, wrote the main manuscript text and prepared all figures. S.G.W. conceived, designed and supervised the project. All authors reviewed the manuscript.

Corresponding author

Correspondence to Sebastian G. Wicha.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 66 kb)

Supplementary file2 (DOCX 1059 kb)

Supplementary file3 (DOCX 2671 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Amann, L.F., Wicha, S.G. Operational characteristics of full random effects modelling (‘frem’) compared to stepwise covariate modelling (‘scm’). J Pharmacokinet Pharmacodyn 50, 315–326 (2023). https://doi.org/10.1007/s10928-023-09856-w

Download citation

Received: 13 June 2022
Accepted: 21 March 2023
Published: 21 April 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s10928-023-09856-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Operational characteristics of full random effects modelling (‘frem’) compared to stepwise covariate modelling (‘scm’)

Abstract

Similar content being viewed by others

Operating characteristics of stepwise covariate selection in pharmacometric modeling

A Modified Hybrid Wald’s Approximation Method for Efficient Covariate Selection in Population Pharmacokinetic Analysis

Impact of covariate model building methods on their clinical relevance evaluation in population pharmacokinetic analyses: comparison of the full model, stepwise covariate model (SCM) and SCM+ approaches

Introduction

Methods

Software

Generation of datasets and simulation of PK data

Continuous covariates

Evaluation using ‘scm’ or ‘frem’ models

Scenario 1

Scenario 2

Scenario 3

Categorical covariates

Results

Power of covtrue inclusion for ‘scm’ and ‘fremposthoc’

Conditional accuracy and precision of \(\theta_{cov_{true} }\) estimates

Predictive performance of ‘scm’ and ‘fremposthoc’ models

Type 1 error

Discussion

Conclusion

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 66 kb)

Supplementary file2 (DOCX 1059 kb)

Supplementary file3 (DOCX 2671 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Power of cov_true inclusion for ‘scm’ and ‘frem_posthoc’

Predictive performance of ‘scm’ and ‘frem_posthoc’ models