1 Introduction

Prediction models for clinical outcomes can greatly aid clinicians with early diagnosis, cost-effective management and primary prevention of many medical conditions. Clinical prediction models typically estimate risk scores for an outcome of interest based on factors such as age, sex, ethnicity, body mass index (BMI), smoking habit and genetic history [1]. A well-known example is the QRISK® risk calculator, a well-established cardiovascular disease (CVD) risk assessment tool for the UK population [2]. It estimates the risk of developing CVD over the next 10 years based on current measures of age, sex, ethnicity, smoking status, cholesterol and blood pressure levels, among others. Although these prediction models have been tremendously useful in day-to-day clinical practice, most conventional risk models use predictors measured at a single time point, e.g. at baseline or at the most recent hospital visit or follow-up [3]. Predictors measured at a single time point do not reflect the changes in longitudinal predictor profiles or the way these changes influence the risk of an event. This runs counter to the way health professionals assess their patients’ health states, progressively updating a prognosis based on the total history of available information. Conventional prediction models are also poorly placed to exploit the full potential of patient-level longitudinal data available from modern electronic health record systems.

Dynamic prediction has recently emerged as an alternative and more appealing prediction technique that can fully utilise the longitudinal changes in prediction variables [3]. Dynamic prediction in time-to-event analysis is the computation of a predictive distribution at a given moment in time, given the history of event(s) and covariates [4]. The predictions can be updated as additional information is collected at further follow-ups or visits, and access to updated risks can be very useful for clinical decision-making, for example by enabling clinicians to gain a better understanding of the disease dynamics and to make the optimal decision at a specific time point.

There are two approaches commonly used for dynamic prediction based on longitudinally measured biomarkers [5]: (i) joint models for longitudinal and time-to-event data [6, 7], and (ii) landmarking in time-to-event analysis [8]. The use of joint models for dynamic prediction is based on a formal and theoretically rigorous statistical framework that jointly characterises the evolution of the longitudinal biomarkers and the time-to-event process. Predictions from correctly specified joint models are expected to be efficient [5], although their implementation is generally computationally demanding and requires specialised software [9, 10] that has not yet been incorporated into mainstream standard statistical software packages.

The landmarking approach to dynamic prediction does not specify a formal statistical model for the longitudinal markers and is therefore based on a pragmatic rather than a formal statistical model of the joint distribution of the longitudinal markers and the time-to-event process [4, 8]. Because it does not provide a model-based continuum of the predicted longitudinal markers, the landmarking approach is generally less efficient than joint modelling, and its predictions can be somewhat biased for intermittently observed longitudinal markers, as it relies on last observation carried forward to impute unobserved values of the longitudinal biomarkers. The method remains popular, however, because of its ease of implementation: dynamic prediction based on landmarking can be obtained using commonly used standard statistical software.

The predictive performance of the joint modelling and landmarking approaches has recently been compared in the statistical literature. Using various functional forms for the association structure between a longitudinal marker and the time-to-event process, Rizopoulos et al. [1] demonstrated that the prediction accuracy of the joint model is better than that of the landmarking approach. Suresh et al. [11] compared the joint modelling and landmarking approaches in the context of a binary longitudinal marker (representing an illness-death model). The paper did not conclude that prediction by one approach is uniformly more accurate than the other, but suggested that joint modelling is likely to perform better when the stochastic process of the longitudinal markers can be well characterised from the available data. When this is harder (e.g. with sparse longitudinal data), or when there are many longitudinal markers, prediction by landmarking may provide a good enough approximation.

Most of the comparisons in the literature are based on model-based data simulations, meaning that data are simulated according to specific statistical models. This poses no problem for predictions based on joint models, as they completely specify a data generation mechanism. It may, however, be problematic for the landmarking approach, as the method does not provide a complete data generation mechanism for the longitudinal process. Data generated according to joint models are therefore likely to be more favourable to prediction using joint models than to prediction using the landmarking approach. To overcome this issue, and to facilitate a fairer comparison of the two methods, we use real data for the performance comparison via bootstrap simulation. The bootstrap simulation also corrects for any potential “optimism” which often results from evaluating model performance on the same data that are used to train the model. The main motivation for the bootstrap simulation-based comparison is to ensure that the data generation mechanism does not depend on either of the underlying statistical models, so that the comparison is impartial to predictions based on both the joint modelling and landmarking approaches.

2 Methods

2.1 Dynamic Prediction via Joint Modelling

The basic principle of joint modelling for time-to-event and longitudinal data is to couple a survival model for the time-to-event process with a suitable model for the longitudinal process that accounts for any interdependence between the two processes [7]. Joint modelling fully specifies the joint distribution that characterises and links the two models for the longitudinal marker process Y(t) and the time-to-event process T through a statistical framework involving shared parameters that model the interdependence [6, 7, 12]. Using notation similar to [7], we denote the observed value of the longitudinal marker for subject i (\(i=1,\ldots , n\)) at time point t by \(y_{i}(t)\) and, more specifically, the observed longitudinal marker for subject i at occasion \(t_{ij}\) by \(y_i(t_{ij})\). The observed longitudinal marker process for subject i is therefore \(\varvec{y}_{i}=\left\{ y_i(t_{ij}), j=1,\ldots ,n_i\right\} \). The longitudinal sub-model can be written as [1],

$$\begin{aligned} y_{i}(t) &= \mu _{i}(t)+\epsilon _{i}(t) \\ &= \varvec{x}_{i}'(t) \varvec{\beta } + \varvec{z}_{i}'(t)\varvec{b}_{i} + \epsilon _{i}(t), \end{aligned}$$
(1)

where \(\varvec{x}_{i}(t)\) and \(\varvec{z}_{i}(t)\) are the corresponding design vectors for the fixed effects \(\varvec{\beta }\) and random effects \(\varvec{b}_i\), respectively, with \(\varvec{b}_i \sim N(\varvec{0}, \varvec{D})\). The corresponding random error terms, \(\epsilon _{i}(t)\), are assumed to be independent of the random effects, with \(\epsilon _{i}(t) \sim N(0, \sigma ^2)\) and \(\mathrm{cov}\{\epsilon _{i}(t), \epsilon _{i}(\tilde{t})\}=0\) for \(\tilde{t} \ne t\).
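
For concreteness, a minimal sketch of fitting the longitudinal sub-model (1) in R follows. The data frame longDat and its columns id, mmse and time are hypothetical placeholders, and the quadratic time trend mirrors the specification used later in the paper for the MMSE scores.

```r
# A minimal sketch of the longitudinal sub-model in Eq. (1); the data frame
# longDat and its columns (id, mmse, time) are hypothetical placeholders.
library(nlme)

lmeFit <- lme(mmse ~ time + I(time^2),   # fixed effects: x_i'(t) beta
              random = ~ time | id,      # random intercept and slope: z_i'(t) b_i
              data = longDat)
```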

To formulate the survival sub-model, we denote the true event time for the ith subject by \(T_i\) and the corresponding observed event time by \( T_{i}^{*}=\min (T_i, C_i)\), where \(C_i\) is the potential censoring time and \(\delta _i=1(T_{i}\le C_i)\) is the event indicator. The time-to-event sub-model relates \(\mu _{i}(t)\) to the event time \(T_{i}\) via

$$\begin{aligned} \lambda _{i} \left\{ t|\mathcal {M}_i(t), w_i\right\} &= \lim _{\Delta t \downarrow 0}\Pr \left\{ t<T_{i} \le t+\Delta t| T_{i}>t,\mathcal {M}_{i}(t), w_i\right\} /\Delta t \\ &= \lambda _{0}(t)\exp \left\{ \varvec{\gamma }'w_{i}+\alpha \mu _{i}(t) \right\} , \end{aligned}$$
(2)

where \(\mathcal {M}_i(t)=\{ \mu _i(s), 0 \le s < t \}\) represents the accumulated history of the true unobserved longitudinal marker up to the time point t, \(w_i\) is a vector of baseline covariates and \(\lambda _{0}(\cdot )\) is the baseline hazard function. The vector of regression coefficients \(\varvec{\gamma }\) represents the effects of the baseline covariates, and the parameter \(\alpha \) quantifies the effect of the underlying longitudinal marker.

Although the survival sub-model (2) looks similar to an extended Cox regression, and the risk of an event at time t appears to depend only on the current value of the longitudinal marker \(\mu _i(t)\), the estimation of joint models depends on the whole history of the longitudinal marker, \(\mathcal {M}_i(t)\). This is evident from the definition of the survival function for model (2), which can be expressed as

$$\begin{aligned} S_{i} \left\{ t|\mathcal {M}_i(t), w_i\right\} &= \Pr (T_{i} > t| \mathcal {M}_{i}(t), w_i) \\ &= \exp \left( - \int _{0}^{t} \lambda _{0}(s)\exp \left\{ \varvec{\gamma }' w_{i}+\alpha \mu _{i}(s) \right\} \mathrm{d}s \right) , \end{aligned}$$
(3)

and the fact that the survival function (3) is a part of the likelihood function for the joint models [7].

The dynamic predictions of survival probabilities can be obtained after fitting a joint model on a training sample \(D_{n}=\left\{ T_{i}^{*}, \delta _{i}, w_i,y_{i} ; i=1,\ldots ,n\right\} \). The probability that subject i survives at least up to a horizon time \(t_{\mathrm{hor}}\), given survival up to \(t_{s}\) (\(t_{\mathrm{hor}}>t_{s}\)), is estimated by [7]

$$\begin{aligned} \hat{\pi }_{i}(t_{\mathrm{hor}}|t_{s})=\mathrm{Pr}(T_{i}>t_{\mathrm{hor}}|T_{i}>t_{s}, \mathcal {Y}_{i}(t_{s}),w_{i},D_{n};\hat{\theta }), \end{aligned}$$
(4)

where \(w_i\) is a vector of baseline predictors, \(\hat{\theta }\) is the vector of estimated parameters of the joint model, and \(\mathcal {Y}_{i}(t_s)=\left\{ y_{i}(u):0\le u<t_{s} \right\} \) is the observed accumulated history of the longitudinal marker.
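
The conditional survival probability in Eq. (4) can be expressed, following [7], as a posterior expectation over the random effects, which makes explicit how the observed marker history \(\mathcal {Y}_{i}(t_s)\) enters the prediction; the display below restates this standard identity from [7] rather than introducing anything new:

$$\begin{aligned} \pi _{i}(t_{\mathrm{hor}}|t_{s})=\int \frac{S_{i}\left\{ t_{\mathrm{hor}}\mid \mathcal {M}_{i}(t_{\mathrm{hor}},\varvec{b}_{i}),w_{i}\right\} }{S_{i}\left\{ t_{s}\mid \mathcal {M}_{i}(t_{s},\varvec{b}_{i}),w_{i}\right\} }\, p\left\{ \varvec{b}_{i}\mid T_{i}>t_{s},\mathcal {Y}_{i}(t_{s});\theta \right\} \mathrm{d}\varvec{b}_{i}, \end{aligned}$$

where \(\mathcal {M}_{i}(t,\varvec{b}_{i})\) denotes the marker history implied by a given value of the random effects.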

Various methods have been suggested in the literature for estimating the parameters of the joint model. We use the maximum likelihood approach of Rizopoulos [7], which is based on the conditional independence assumption, i.e. given the random effects (\(b_i\)), the time-to-event process and the longitudinal marker process are independent of each other. More precisely, this implies that the random effects account for the covariance between the longitudinal and time-to-event processes as well as for the pairwise correlation between the repeated measurements of the longitudinal marker. More detailed expressions for the likelihood function and the formula for the dynamic prediction in Eq. (4) can be found in Rizopoulos [7].
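
Continuing the sketch above, a joint model of this form can be fitted and used for the dynamic prediction in Eq. (4) with the JM package [9]. The data frame survDat (one row per subject, with hypothetical columns Time, status, age and apoe4), the subject id and the landmark/horizon values are illustrative assumptions, not the paper's exact specification.

```r
library(JM)   # joint modelling of longitudinal and survival data [9]

# Survival sub-model ingredient: a Cox fit on baseline covariates;
# x = TRUE is required so that jointModel() can retain the design matrix.
coxFit <- coxph(Surv(Time, status) ~ age + apoe4, data = survDat, x = TRUE)

# Joint model linking the two sub-models through the current value mu_i(t).
jointFit <- jointModel(lmeFit, coxFit, timeVar = "time",
                       method = "piecewise-PH-aGH")

# Dynamic prediction (Eq. 4): survival beyond t_s = 1.5 years for subject 1,
# conditional on the marker history observed up to the landmark. newDat must
# contain both the marker rows and the baseline covariates for that subject.
newDat <- subset(longDat, id == 1 & time <= 1.5)
survfitJM(jointFit, newdata = newDat, idVar = "id",
          last.time = 1.5, survTimes = 3.5)
```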

2.2 Dynamic Prediction via Landmarking

Although joint modelling is a natural framework and one of the most rigorous methods for dynamic prediction, its implementation often involves demanding computational procedures; for example, the likelihood construction requires numerical integration over multiple dimensions, which can be computationally intensive or even infeasible with more than one longitudinal predictor. Dynamic prediction by landmarking is considered a more flexible and easier alternative. The idea of the landmarking approach, introduced by van Houwelingen [4], is to fit standard survival models on the subsample of subjects at risk at the landmark time \(t_s\), and use these as the basis for dynamic prediction.

Dynamic prediction via landmarking is obtained by preselecting a landmark time \(t_{s}\) and selecting the subjects at risk at \(t_{s}\) from the original sample \(D_n\), i.e. constructing the adjusted risk set at \(t_{s}\): \(\mathcal {R}(t_{s})=\left\{ i: T_i>t_s \right\} \), and then using a standard survival model to predict the probability of survival up to a horizon \(t_{\mathrm{hor}}>t_{s}\). Typically, standard Cox regression models are used, which can be written as

$$\begin{aligned} \lambda _{i}(t)=\lambda _{0}(t)\exp \left\{ \gamma 'w_{i}+\alpha \tilde{y}_{i}(t_s) \right\} ;\quad t_s\le t\le t_{\mathrm{hor}}, \end{aligned}$$
(5)

where \(\tilde{y}_i(t_s)\) is the last observed value of the longitudinal marker at or before \(t_s\) (i.e. last observation carried forward), \(w_{i}\) is the vector of baseline covariates, and the time origin is reset at the landmark, \(t-t_{s}\). Based on this Cox regression model, the landmark estimate of \(\pi _{i}(t_{\mathrm{hor}}|t_{s})\) is obtained as

$$\begin{aligned} \hat{\pi }_{i}^{LM}(t_{\mathrm{hor}}|t_{s})= \exp \left[ -\hat{\Lambda }_{0}(t_{\mathrm{hor}})\exp \left\{ \hat{\gamma }'w_{i}+\hat{\alpha }\tilde{y}_{i}(t_s) \right\} \right] , \end{aligned}$$
(6)

where \(\hat{\Lambda }_{0}(t_{\mathrm{hor}})\) is the estimate of the baseline cumulative hazard. A commonly used choice is the Breslow estimator, given by

$$\begin{aligned} \hat{\Lambda }_{0}(t_{\mathrm{hor}})=\sum _{i \in \mathcal {R}(t_{s})} \frac{I(T_{i}^{*} \le t_{\mathrm{hor}})\, \delta _{i}}{\sum _{l \in \mathcal {R}(T_{i}^{*})} \exp \left\{ \hat{\gamma }'w_{l}+\hat{\alpha }\tilde{y}_{l}(t_s) \right\} }, \end{aligned}$$
(7)

where \(\mathcal {R}(T_{i}^{*})=\left\{ l: T_{l}^{*} \ge T_{i}^{*} \right\} \) is the risk set at the observed event time \(T_{i}^{*}\).
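
A minimal sketch of this landmark analysis in R, under the same hypothetical data layout as in Sect. 2.1 (longDat with columns id, mmse, time; survDat with one row per subject and columns Time, status, age, apoe4), might look as follows; the landmark and horizon values are illustrative.

```r
library(survival)

t_s   <- 1.5          # landmark time (years)
t_hor <- t_s + 2      # prediction horizon

# Adjusted risk set R(t_s): subjects still event-free at the landmark.
lmDat <- subset(survDat, Time > t_s)

# LOCF value of the marker: last MMSE observed at or before t_s.
lmDat$mmse_locf <- sapply(lmDat$id, function(i) {
  obs <- subset(longDat, id == i & time <= t_s)
  if (nrow(obs) == 0) NA else obs$mmse[which.max(obs$time)]
})

# Reset the time origin to t_s and censor administratively at the horizon.
lmDat$T_lm      <- pmin(lmDat$Time, t_hor) - t_s
lmDat$status_lm <- ifelse(lmDat$Time <= t_hor, lmDat$status, 0)

# Landmark Cox model (Eq. 5) and predicted survival at the horizon (Eq. 6).
lmFit <- coxph(Surv(T_lm, status_lm) ~ age + apoe4 + mmse_locf, data = lmDat)
sf    <- survfit(lmFit, newdata = lmDat[1, ])
summary(sf, times = t_hor - t_s)$surv   # estimate of pi_i(t_hor | t_s)
```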

2.3 Criteria for Comparison of Prediction Performance

As recommended for clinical prediction models [13], we assessed the performance of both approaches for dynamic prediction in terms of discrimination and calibration. We use their extended definitions in the context of dynamic prediction [1]. The estimation of joint models, calculation of dynamic predictions, discrimination and calibration were implemented using the R packages JM [9] and JMbayes [10].

2.3.1 Discrimination

Discrimination measures the ability of a prediction model to distinguish between subjects with positive and negative outcomes (e.g. the presence and absence of dementia) [13,14,15]. Discrimination measures for dynamic predictions are typically based on an extended definition of the area under the receiver operating characteristic curve, termed the dynamic AUC (dynAUC). We use the definition of dynAUC based on the principle that, at any given time point, a good prediction tool assigns a higher predicted risk of the event to subjects who experience the event within the prediction window than to subjects who do not [1, 16].
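
As an illustration, the dynAUC over a given landmark/window pair can be computed for a fitted joint model with aucJM() from the JM package; the validation data frame valDat and the time values below are assumptions for the sketch.

```r
library(JM)

# Dynamic AUC at landmark Tstart = 1.5 years with a 2-year window
# (Thoriz = 3.5); valDat is an assumed validation set in the same
# long format as the training data.
aucJM(jointFit, newdata = valDat, Tstart = 1.5, Thoriz = 3.5)
```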

2.3.2 Calibration

Calibration measures the agreement between observed and predicted outcomes [17, 18]. Calibration assessment for time-to-event outcomes is typically based on comparing observed Kaplan–Meier survival probabilities with average predictions across groups of patients; this does not directly carry over to dynamic predictions from joint models, as the aim and nature of the predictions differ in the latter context. For dynamic predictions from joint models, the interest lies in predicting the subject-specific conditional risk of events at a landmark time \(t_s\) over a prediction window, given survival and the history of the longitudinal marker up to \(t_s\) [19]. Rizopoulos [1] proposed an estimator of this calibration measure, termed the expected prediction error (PE), which is implemented in the R package JMbayes [10].
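
A corresponding sketch for the expected prediction error uses prederrJM() from the same packages, again with the assumed objects from the discrimination sketch above:

```r
library(JM)

# Expected prediction error (PE) at the horizon, conditional on survival
# to the landmark; lossFun = "square" selects the squared-error loss.
prederrJM(jointFit, newdata = valDat, Tstart = 1.5, Thoriz = 3.5,
          lossFun = "square")
```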

2.4 The ADNI Data

Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). The ADNI was launched in 2003 as a public–private partnership, led by Principal Investigator Michael W. Weiner, MD. We used the longitudinal Mini-Mental State Examination (MMSE) data from ADNI to conduct the bootstrap simulation study. In total, 863 MCI (mild cognitive impairment) participants aged 55–90 years at baseline were included. The frequency and interval of follow-up visits varied over the course of the study, but overall, participants were initially assessed every 6 months for the first two years (i.e. at baseline, 6, 12, 18 and 24 months) and then on a yearly basis. The version of the dataset we used has up to 12 follow-up visits and a maximum follow-up duration of 9 years. Of the 863 MCI subjects, 295 (34%) progressed to Alzheimer’s disease (AD) during the follow-up period. The survival outcome of interest is the time to conversion from MCI to AD, and the repeat MMSE scores are considered the time-varying exposure variable (or the longitudinal outcome in the context of joint models). The analyses using both joint models and Cox regression considered several baseline covariates, including demographic (age, sex, education), behavioural (smoking status, alcohol abuse), medical history (family history, body mass index (BMI), blood pressure) and a genetic marker (APOE4 genotype) for Alzheimer’s disease.

2.5 Results

The MMSE profiles of the participants (converters and non-converters) in the ADNI analysis sample are shown in Fig. 1. The plot is stratified by event status: the right-hand panel shows the MMSE profiles of the participants who converted to AD, and the left-hand panel shows those of participants who did not develop AD before the end of follow-up or death.

Fig. 1 MMSE profiles of the ADNI sample stratified by event status

As might be expected, the profile plot of MMSE scores suggests that, on average, participants who went on to develop AD experienced deterioration in cognitive performance at a much faster rate than those who had not converted to AD by the end of the follow-up period or by the date of censoring (i.e. the date of death or of loss to follow-up).

The predicted survival probabilities based on the Cox regression model at several given MMSE levels are displayed in Fig. 2.

Fig. 2 Predicted survival probabilities based on the Cox regression model

The plot of survival probabilities by MMSE level (Fig. 2) clearly indicates that global cognitive performance, as measured by MMSE scores, has a substantial impact on the development of AD. People with lower MMSE scores (poorer cognitive performance) appear more susceptible to developing AD. The evidence in Figs. 1 and 2, along with the statistically significant association (\(\hat{\alpha }=-0.19\), \(p<0.001\), 95% CI: \(-0.22\) to \(-0.17\)) between the longitudinal MMSE scores and the development of AD in the estimated joint model (full results given in “Appendix”), indicates that repeat cognitive performance measured on the MMSE scale plays an important role in predicting the risk of developing Alzheimer’s disease.

2.6 Optimism Corrected Performance Estimates

As explained in the Introduction, the main motivation of this study is to compare the performance of dynamic prediction via joint models and landmarking using bootstrap simulation, as model-based simulation of data is likely to favour predictions based on joint models. We use bootstrap simulation to ensure that the data generation mechanism does not depend on either of the underlying statistical models, so that the comparison is impartial to predictions based on both the joint modelling and landmarking approaches. As is typically done when evaluating prediction models, we compare the methods in terms of discrimination and calibration, but using the extended definitions of these criteria suitable for dynamic predictions, namely dynAUC and PE, respectively. We generated \(B=500\) bootstrap samples from the ADNI sample described earlier in the Methods section. A total of 500 bootstrap replications estimated the relevant parameters with reasonably high precision. For example, the standard deviations of the dynAUC estimates over the 500 replicates for the joint modelling and landmarking approaches were very small (0.017 and 0.018), about 50 and 45 times smaller than the corresponding means, respectively. We then used optimism corrected estimates of dynAUC and PE to account for any potential “optimism” which often results from evaluating model performance on the same data that are used to fit the model.

We estimated the optimism corrected dynAUC and PE by following the algorithm suggested in Steyerberg [20]. Specifically, the steps for estimating dynAUC are given below.

  (a) calculate dynAUC on the original data, dynAUC\(_{\mathrm{orig}}\);

  (b) take B bootstrap samples and calculate dynAUC\(_{b, \mathrm{boot}}\), \(b=1,\ldots , B\);

  (c) calculate dynAUC\(_{b, \mathrm{orig}}\), \(b=1,\ldots , B\), by (i) fitting models to each bootstrap sample, and (ii) using them to calculate performance on the original data;

  (d) calculate the estimate of optimism:

  $$\begin{aligned} O =(1/B) \sum _{b=1}^{B}\left\{ \text{dynAUC}_{b,\mathrm{boot}}-\text{dynAUC}_{b,\mathrm{orig}} \right\} \end{aligned}$$

  (e) calculate the optimism corrected estimate of dynAUC as \(\text{dynAUC}_{\mathrm{orig}} -O\).

The same steps apply to the calculation of the optimism corrected PE. A sketch of the procedure in R is given below.
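
The following sketch implements steps (a)-(e) in R. It relies on a hypothetical helper fit_and_auc(fitData, evalData), assumed to fit both sub-models on fitData and return the dynAUC evaluated on evalData; resampling is done at the subject level so that each subject's repeated measurements stay together.

```r
set.seed(2023)   # illustrative seed
B <- 500

# fit_and_auc() is a hypothetical helper, not from the paper: it fits the
# model on its first argument and returns the dynAUC computed on its second.
auc_orig <- fit_and_auc(adniDat, adniDat)              # step (a)

optimism <- replicate(B, {
  # Resample subjects (not rows) with replacement, re-labelling ids so
  # that each bootstrap draw of a subject is treated as distinct.
  ids  <- sample(unique(adniDat$id), replace = TRUE)
  boot <- do.call(rbind, lapply(seq_along(ids), function(k) {
    d <- subset(adniDat, id == ids[k]); d$id <- k; d
  }))
  auc_boot <- fit_and_auc(boot, boot)                  # step (b)
  auc_bo   <- fit_and_auc(boot, adniDat)               # step (c)
  auc_boot - auc_bo                                    # summand in (d)
})

auc_corrected <- auc_orig - mean(optimism)             # step (e)
```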

We summarise the simulation results obtained from the 500 bootstrap samples of the ADNI data in Table 1 and Fig. 3. The dynAUC and PE were compared at landmarks \( t_{s}=1.0, 1.5, 2.0\,\text{ and }\,2.5\) years and within a 2-year window from each landmark time point, i.e. the time horizon corresponding to each landmark was set at \( t_{\mathrm{hor}}=t_{s}+2\). The results in Table 1 and Fig. 3 show that the values of dynAUC for the joint modelling approach are slightly higher than those for landmarking. In terms of discrimination (dynAUC), the joint model performed 1.26%, 3.22%, 2.76% and 0.12% better at the four landmark time points 1.0, 1.5, 2.0 and 2.5 years, respectively. Apart from the final landmark point (at 2.5 years), dynamic prediction based on joint models also performed somewhat better in terms of calibration: the expected prediction errors (PE) for joint models were 0.70%, 2.56% and 2.04% lower at the first three landmark time points (1.0, 1.5 and 2.0 years, respectively) than for the landmarking approach.

Table 1 Optimism corrected estimates of performance measures in terms of discrimination (dynAUC) and calibration (expected prediction error, PE) for dynamic prediction using joint modelling and landmarking
Fig. 3 The dynAUC and PE for joint modelling and landmarking at different landmark time points

Overall, the joint modelling approach appears to have performed slightly better than landmarking in terms of both discrimination and calibration in the context of dynamic prediction, although the magnitude of the difference was very small.

3 Discussion and Conclusion

With the advent of richer patient-level longitudinal biomarker data available from modern electronic health record systems, dynamic risk prediction of clinical outcomes has recently emerged as an appealing prediction technique because of its ability to utilise the full history as well as the temporal changes in prediction variables. The prediction is dynamic in the sense that the predicted risk can be updated as additional information becomes available from further follow-up visits, which may help healthcare professionals make the optimal decision at a specific point in time.

In this article, we have compared the prediction performance of two well-known approaches for dynamic prediction of survival probabilities, namely joint modelling and landmarking, using bootstrap simulation. Unlike other comparative studies of the two methods in the literature, we focused on real data-based bootstrap simulation using the ADNI data, in an attempt to be as impartial as possible to both methods. This is motivated by the fact that, while the joint model is based on a proper statistical framework and provides a complete data generation mechanism, landmarking is a pragmatic approach that does not specify a statistical model for the distribution of the longitudinal markers; any comparison based on model-based data simulation may therefore be advantageous to the joint modelling approach.

Based on optimism corrected estimates of prediction performance in terms of discrimination (dynAUC) and calibration (expected prediction error, PE) in predicting Alzheimer’s disease, the joint modelling approach was, in general, found to perform better than the landmarking approach. Although this finding is consistent with previous comparative studies in the literature, e.g. [1], the margin of gain in performance from using joint models over landmarking was relatively small (the highest gains in dynAUC and PE were 3.2% and 2.6%, respectively). Overall, the findings from our analysis suggest that joint modelling performed a little better, but the landmarking approach came close, despite not having a precise statistical model characterising the evolution of the longitudinal markers.

The above conclusion echoes that of another comparative study by Suresh et al. [11] in the context of a binary longitudinal marker, which suggested that joint modelling is likely to perform better when the evolution of the longitudinal markers can be well characterised statistically. When model misspecification is a concern, prediction by landmarking may provide a good enough approximation.

In the current work, we have considered only the standard form of joint models and the landmarking approach. In the joint model, a quadratic function of time was used to characterise the longitudinal MMSE scores within a linear mixed model, and for the landmarking approach any missing MMSE scores at the time points of interest were approximated via last observation carried forward (LOCF), which is often considered the main source of bias in the landmarking approach. Several extensions of landmarking have been considered in the literature which may potentially improve prediction performance [8]. It may be sensible for future comparative studies to consider extended landmarking approaches as well as recent extensions of joint models [21], which may overcome some of the limitations of the standard methods.