Appendices
A simulation study design
A.1 Scenario I
Scenario I simulates 500 patients with a maximum of 15 repeated measurements per patient. We included \(K = 2\) continuous longitudinal outcomes and one survival outcome. The k longitudinal outcomes each had form:
$$\begin{aligned} y_{ki}(t)&= \eta _{ki}(t) +\epsilon _{ki}(t)\\&= \beta _{k0} + \beta _{k1} \times \text {time} + \beta _{k2}\times \text {group}\\&\quad + \beta _{k3}\times \text {interaction} \\&\quad +\, b_{ki0}+ b_{ki1}\times \text {time}+\epsilon _{ki}(t), \end{aligned}$$
with \(\epsilon _{ki}(t) \sim N(0,\sigma ^2_{k})\) and \(\mathbf{b }_{ki} = (b_{ki0}, b_{ki1})^T\), with \(\mathbf{b }_i = (\mathbf{b }_{1i}^T, \mathbf{b }_{2i}^T)^T \sim MVN({\mathbf {0}},{{\varvec{D}}})\). The variance-covariance matrix \({{\varvec{D}}}\) has general form:
$$\begin{aligned} {{\varvec{D}}} = \left[ \begin{array}{cccc} {{\varvec{D}}}_{1} &{}{\cdots } &{} D_{\mathrm{else}} \\ \cdots &{} {{\varvec{D}}}_{2} &{} {\cdots } \\ {\cdots } &{} \ddots &{} {\cdots } \\ {\cdots } &{} &{}{{\varvec{D}}}_{k} \end{array}\right] , \quad {{\varvec{D}}}_k =\left[ \begin{array}{cccc} D_{k11} &{} D_{k12} \\ D_{k21} &{} D_{k22} \end{array}\right] \end{aligned}$$
and for Scenario I:
$$\begin{aligned} {{\varvec{D}}}_{1} = {{\varvec{D}}}_{2} = \begin{bmatrix} 0.68 &{}-0.08 \\ -0.08 &{}0.28 \\ \end{bmatrix}, \text {with} \; D_{\mathrm{else}} = 0.10 \end{aligned}$$
Time was simulated from a uniform distribution between 0 and 25. For the survival outcome, adjusting for group allocation, we used:
$$\begin{aligned} h_i (t)&= h_0(t) \exp \biggl [\gamma _0 + \gamma _1 \times \text {group} + \sum \limits _{k = 1}^K \alpha _k \eta _{ki}(t) \biggr ]\\&=h_0(t) \exp \biggl [\gamma _0 + \gamma _1 \times \text {group} + \alpha _1 \eta _{1i}(t) + \alpha _2 \eta _{2i}(t)\biggr ]. \end{aligned}$$
The baseline risk was simulated from a Weibull distribution \( h_0(t) = \phi t^{\phi -1}\), with \(\phi = 1.65\). For the simulation of the censoring times, an exponential censoring distribution was selected, with mean \(\mu = 15\), such that the censoring rate was between \(60\%\) and \(70\%\). More details are presented in Table 6.
A.2 Scenario II
Scenario II is an extension of Scenario I such that we now have \(K = 6\) continuous longitudinal outcomes. We again simulate 500 patients with a maximum of 15 repeated measurements per patient. The k longitudinal outcomes each had form:
$$\begin{aligned} y_{ki}(t)&=\eta _{ki}(t) +\epsilon _{ki}(t)\\&= \beta _{k0} + \beta _{k1} \times \text {time} + \beta _{k2}\times \text {group} \\&\quad + \beta _{k3}\times \text {interaction} \\&\quad + \, b_{ki0}+ b_{ki1}\times \text {time}+\epsilon _{ki}(t), \end{aligned}$$
with \(\epsilon _{ki}(t) \sim N(0,\sigma ^2_{k})\) and \(\mathbf{b }_{ki} = (b_{ki0}, b_{ki1})^T\), with \(\mathbf{b }_i = (\mathbf{b }_{1i}^T, \mathbf{b }_{2i}^T, \ldots , \mathbf{b }_{6i}^T)^T \sim MVN({\mathbf {0}},{{\varvec{D}}})\),
$$\begin{aligned} {{\varvec{D}}}_1&= {{\varvec{D}}}_2 = \begin{bmatrix} 0.97 &{}0.78 \\ 0.07 &{}0.032 \\ \end{bmatrix}, {{\varvec{D}}}_3 = {{\varvec{D}}}_4 = \begin{bmatrix} 0.13 &{}-0.009 \\ -0.009 &{}0.002 \\ \end{bmatrix},\\ {{\varvec{D}}}_5&= {{\varvec{D}}}_6 = \begin{bmatrix} 0.56 &{}0.05 \\ 0.05 &{}0.03 \\ \end{bmatrix}, \, \text {and} \,\, {{\varvec{D}}}_{\mathrm{else}} = 0.00 \end{aligned}$$
Time was simulated from a uniform distribution between 0 and 25. For the survival outcome, adjusting for group allocation as in Scenario I, we used:
$$\begin{aligned} h_i (t)= & {} h_0(t) \exp \biggl [\gamma _1 \times \text {group} + \sum \limits _{k = 1}^K \alpha _k \eta _{ki}(t) \biggr ]. \end{aligned}$$
The baseline risk was simulated using B-splines with knots specified a priori. An exponential censoring distribution was used for the simulation of the censoring times, with mean \(\mu = 15\), such that the censoring rate was between \(60\%\) and \(70\%\). Further details are again available in Table 6.
A.3 Scenario III
In Scenario III, we simulate 500 patients with a maximum of 15 repeated measurements per patient, including three continuous and three binary longitudinal outcomes such that:
$$\begin{aligned}&g_k \bigl [ E \{ y_{ki}(t) \mid {\varvec{b}}_{ki} \} \bigr ]= \eta _{ki}(t) \\&\quad = \beta _{k0} + \beta _{k1} \times \text {time} + \beta _{k2}\times \text {group} + \beta _{k3}\times \text {interaction} \\&\qquad +b_{ki0}+ b_{ki1}\times \text {time}, \end{aligned}$$
where \(g_k(\cdot )\) denotes the canonical link function appropriate to the response type (identity and logit for the Gaussian and binomial outcomes, respectively), and \(\mathbf{b }_i = (\mathbf{b }_{1i}^T, \mathbf{b }_{2i}^T, \ldots , \mathbf{b }_{6i}^T)^T \sim MVN({\mathbf {0}},{{\varvec{D}}}),\) with
$$\begin{aligned} {{\varvec{D}}}_1&= {{\varvec{D}}}_2 = \begin{bmatrix} 0.97 &{}0.78 \\ 0.07 &{}0.032 \\ \end{bmatrix}, {{\varvec{D}}}_3 = {{\varvec{D}}}_4 = \begin{bmatrix} 0.13 &{}-0.009 \\ -0.009 &{}0.002 \\ \end{bmatrix},\\ {{\varvec{D}}}_5&= {{\varvec{D}}}_6 = \begin{bmatrix} 0.56 &{}0.05 \\ 0.05 &{}0.03 \\ \end{bmatrix}, \, \text {and} \, {{\varvec{D}}}_{\mathrm{else}} = 0.00 \end{aligned}$$
For the survival outcome, adjusting for group allocation, we again used:
$$\begin{aligned} h_i (t) = h_0(t) \exp \biggl [\gamma _1 \times \text {group} + \sum \limits _{k = 1}^K \alpha _k \eta _{ki}(t) \biggr ]. \end{aligned}$$
Scenario III maintains the use of the uniform distribution between 0 and 25 for time and the use of B-splines for the simulation of the baseline hazard. The censoring times were simulated using an exponential censoring distribution as before, with mean \(\mu = 15\). Table 6 provides additional information.
B Example R Code
The below code fits a multivariate joint model for \(K = 3\) longitudinal outcomes: \(y_1, y_2\) and \(y_3\), where \(y_1\) is binary and both \(y_2\) and \(y_3\) are continuous. We fit a linear mixed model for \(y_1\) with random intercept and slope (time is futime) and use natural cubic splines with two knots, (at \(futime = 6\) and \(futime = 15\) respectively) in both the fixed and random parts of the models for \(y_2\) and \(y_3\). The survival submodel adjusts for continuous baseline predictors \(x_1\) and \(x_2\).
C Tables
See Tables 4, 5, 6 and 7.
Table 4 Simulation results for parameters of the longitudinal submodel (Scenario II) Table 5 Simulation results for parameters of the longitudinal submodel (Scenario III) Table 6 Simulation scenarios Table 7 Parameter estimates and 95% credibility intervals under the joint modeling analysis for the Bio-SHiFT data (Model 1)
D Figures
See Figs. 4, 5, 6 and 7.