Skip to main content

Jointly modeling time-to-event and longitudinal data: a Bayesian approach

Abstract

This article explores Bayesian joint models of event times and longitudinal measures with an attempt to overcome departures from normality of the longitudinal response, measurement errors, and shortages of confidence in specifying a parametric time-to-event model. We allow the longitudinal response to have a skew distribution in the presence of measurement errors, and assume the time-to-event variable to have a nonparametric prior distribution. Posterior distributions of the parameters are attained simultaneously for inference based on Bayesian approach. An example from a recent AIDS clinical trial illustrates the methodology by jointly modeling the viral dynamics and the time to decrease in CD4/CD8 ratio in the presence of CD4 counts with measurement errors to compare potential models with various scenarios and different distribution specifications. The analysis outcome indicates that the time-varying CD4 covariate is closely related to the first-phase viral decay rate, but the time to CD4/CD8 decrease is not highly associated with either the two viral decay rates or the CD4 changing rate over time. These findings may provide some quantitative guidance to better understand the relationship of the virological and immunological responses to antiretroviral treatments.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

References

  • Acosta EP, Wu H, Walawander A, Eron J, Pettinelli C, Yu S, Neath D, Ferguson E, Saah AJ, Kuritzkes DR, Gerber JG (2004) Comparison of two indinavir/ritonavir regimens in treatment-experienced HIV-infected individuals. J Acquir Immune Defic Syndr 37:1358–1366

    Article  Google Scholar 

  • Arellano-Valle RB, Genton MG (2005) On fundamental skew distributions. J Multivar Anal 96:93–116

    Article  MATH  MathSciNet  Google Scholar 

  • Arellano-Valle RB, Bolfarine H, Lachos VH (2007) Bayesian inference for skew-normal linear mixed models. J Appl Stat 34:663–682

    Article  MathSciNet  Google Scholar 

  • Azzalini A, Capitanio A (2003) Distributions generated by perturbation of symmetry with emphasis on a multivariate skew \(t\) distributions. J Roy Stat Soc Ser B 65:367–389

    Article  MATH  MathSciNet  Google Scholar 

  • Brooks SP, Gelman A (1998) General methods for monitoring convergence of iterative simulations. J Comput Graph Stat 7(4):434–455

    MathSciNet  Google Scholar 

  • Carroll RJ, Ruppert D, Stefanski LA, Crainiceanu CM (2006) Meas Error Nonlinear Models Mod Persp, 2nd edn. Chapman and Hall, London

    Book  Google Scholar 

  • DeGruttola V, Tu XM (1994) Modeling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics 50:1003–1014

    Article  Google Scholar 

  • Ferguson TS (1973) A Bayesian analysis of some nonparametric problems. Ann Stat 1:209–230

    Article  MATH  MathSciNet  Google Scholar 

  • Gelman A, Rubin DB (1992) Inference from iterative simulation using multiple sequences. Stat Sci 7(4):457–511

    Article  Google Scholar 

  • Gelman A (2006) Prior distributions for variance parameters in hierarchical models. Bayesian Anal 1:515–533

    Article  MathSciNet  Google Scholar 

  • Hougaard P (1999) Fundamentals of survival data. Biometrics 55:13–22

    Article  MATH  Google Scholar 

  • Huang Y, Liu D, Wu H (2006) Hierarchical Bayesian methods for estimation of parameters in a longitudinal HIV dynamic system. Biometrics 62:413–423

    Article  MATH  MathSciNet  Google Scholar 

  • Huang Y, Dagne G (2011) A Bayesian approach to joint mixed-effects models with a skew-normal distribution and measurement errors in covariates. Biometrics 67:260–269

    Article  MATH  MathSciNet  Google Scholar 

  • Huang Y, Dagne G, Wu L (2011) Bayesian inference on joint models of HIV dynamics for time-to-event and longitudinal data with skewness and covariate measurement errors. Stat Med 30:2930–2946

    Article  MathSciNet  Google Scholar 

  • Ishwaran H, James L (2002) Dirichlet process computing in finite normal mixtures: smoothing and prior information. J Comput Graph Stat 11:508–532

    Article  MathSciNet  Google Scholar 

  • Liu W, Wu L (2007) Simultaneous inference for semiparametric nonlinear mixed-effects models with covariate measurement errors and missing responses. Biometrics 63:342–350

    Article  MATH  MathSciNet  Google Scholar 

  • Lunn DJ, Thomas A, Best N, Spiegelhalter D (2000) WinBUGS: a Bayesian modelling framework: concepts, structure, an extensibility. Stat Comput 10:325–337

    Article  Google Scholar 

  • Ntzoufras I (2009) Bayesian modeling using WinBUGS. Wiley, New Jersey

    Book  MATH  Google Scholar 

  • Pawitan Y, Self S (1993) Modeling disease marker processes in AIDS. J Am Stat Assoc 88:719–726

    MATH  Google Scholar 

  • Rubin DB (1976) Inference and missing data. Biometrika 63:581–592

    Article  MATH  MathSciNet  Google Scholar 

  • Sahu SK, Dey DK, Branco MD (2003) A new class of multivariate skew distributions with applications to Bayesian regression models. Can J Stat 31:129–150

    Article  MATH  MathSciNet  Google Scholar 

  • Tsiatis AA, Davidian M (2004) Joint modeling of longitudinal and time-to-event data: an overview. Stat Sin 14:809–834

    MATH  MathSciNet  Google Scholar 

  • Wu H, Zhang J-T (2002) The study of long-term HIV dynamics using semi-parametric non-linear mixed-effects models. Stat Med 21:3655–3675

    Article  Google Scholar 

  • Wu H, Ding AA (1999) Population HIV-1 dynamics in vivo: applicable models and inferential tools for virological data from AIDS clinical trials. Biometrics 55:410–418

    Article  MATH  Google Scholar 

  • Wu L (2009) Mixed effects model for complex data. Chapman and Hall, London

    Book  Google Scholar 

  • Wu L, Liu W, Hu XJ (2010) Joint inference on HIV viral dynamics and immune suppression in presence of measurement errors. Biometrics 66:327–335

    Article  MATH  MathSciNet  Google Scholar 

Download references

Acknowledgments

The authors are extremely grateful to the Editor, an Associate Editor and an anonymous reviewer for their insightful comments and suggestions that led to a marked improvement of the article. This research was partially supported by NIAID/NIH Grant R03AI080338 and USF Proposal Enhancement Grant to Huang, Natural Sciences and Engineering Research Council of Canada discovery Grant to Hu, and NIMH/NIH Grant R01MH040859-22 to Dagne.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Yangxin Huang.

Appendix: multivariate skew distributions

Appendix: multivariate skew distributions

Different versions of the multivariate skew–normal(SN) and skew-\(t\) (ST) distributions have been considered and used in the literature (Arellano-Valle and Genton 2005; Arellano-Valle et al. 2007; Azzalini and Capitanio 2003; Sahu et al. 2003 and others). A new class of distributions by introducing skewness in multivariate elliptically distributions were developed in publication (Sahu et al. 2003). The class, which is obtained by using transformation and conditioning, contains many standard families including the multivariate SN and ST distributions. For completeness, this appendix briefly summarizes the multivariate SN and ST distributions that will be used in defining the joint models with skew distributions considered in this paper. Assume an \(m\)-dimensional random vector \(\varvec{Y}\) follows an \(m\) variate SN or ST distribution with location vector \(\varvec{\mu },\,m \times m\) positive (diagonal) dispersion matrix \(\varvec{\Sigma }\) and \(m \times m\) skewness matrix \(\varvec{\Delta }(\varvec{\delta })=\textit{diag}(\delta _1, \delta _2,\ldots , \delta _m)\) or the degrees of freedom \(\nu \), where skewness parameter vector \(\varvec{\delta }=(\delta _1,\ldots ,\delta _m)^T\). In what follows, we briefly discuss multivariate SN and ST distributions introduced by Sahu et al. (2003), which are suitable for Bayesian inference since they are built using the conditional method. For detailed discussion on properties of SN and ST distributions, see reference (Sahu et al. 2003).

Skew-t distribution

An \(m\)-dimensional random vector \(\varvec{Y}\) follows an \(m\)-variate ST distribution if its probability density function (pdf) is given by

$$\begin{aligned} f(\varvec{y}|\varvec{\mu },\varvec{\Sigma },\varvec{\Delta }(\varvec{\delta }),\nu )= 2^m t_{m,\nu }(\varvec{y}|\varvec{\mu }, \varvec{A})P(\varvec{U}>\mathbf{0}), \end{aligned}$$
(15)

where \(\varvec{A}=\varvec{\Sigma }+\varvec{\Delta }^2(\varvec{\delta })\), we denote the \(m\)-variate \(t\) distribution with parameters \(\varvec{\mu },\,\varvec{A}\) and degrees of freedom \(\nu \) by \(t_{m,\nu }(\varvec{\mu }, \varvec{A})\) and the corresponding pdf by \(t_{m,\nu }(\varvec{y}|\varvec{\mu }, \varvec{A})\) henceforth, \(\varvec{U}\) follows the \(t\) distribution \(t_{m,\nu +m}(\cdot )\). We denote this distribution by \(ST_{m,\nu }(\varvec{\mu },\varvec{\Sigma },\varvec{\Delta }(\varvec{\delta }))\). In particular, when \(\varvec{\Sigma }=\sigma ^2 \varvec{I}_m\) and \(\varvec{\Delta }(\varvec{\delta })=\delta \varvec{I}_m\), equation (15) simplifies to

$$\begin{aligned} f(\varvec{y}|\varvec{\mu },\sigma ^2,\delta ,\nu )&= 2^m (\sigma ^2+\delta ^2)^{-m/2}\frac{\Gamma ((\nu +m)/2)}{\Gamma (\nu /2)(\nu \pi )^{m/2}}\nonumber \\&\quad \times \left\{ 1+\frac{(\varvec{y}-\varvec{\mu })^T(\varvec{y}-\varvec{\mu })}{\nu (\sigma ^2+\delta ^2)} \right\} ^{-(\nu +m)/2} \nonumber \\&\quad \times \, T_{m,\nu +m} \left[ \left\{ \frac{\nu +(\sigma ^2+\delta ^2)^{-1}(\varvec{y}-\varvec{\mu })^T(\varvec{y}-\varvec{\mu })}{\nu +m}\right\} ^{-1/2}\right. \nonumber \\&\qquad \qquad \qquad \qquad \qquad \left. \frac{\delta (\varvec{y}-\varvec{\mu })}{\sigma \sqrt{\sigma ^2+\delta ^2}}\right] \!, \end{aligned}$$
(16)

where \(T_{m,\nu +m}(\cdot )\) denotes the cumulative distribution function (cdf) of \(t_{m,\nu +m}(\mathbf{0},\varvec{I}_m)\). However, unlike the SN distribution below, the ST density can not be written as the product of univariate ST densities. Here \(\varvec{Y}\) are dependent but uncorrelated.

The mean and covariance matrix of the ST distribution \(ST_{m,\nu }(\varvec{\mu },\sigma ^2 \varvec{I}_m,\varvec{\Delta }(\varvec{\delta }))\) are given by

$$\begin{aligned} E(\varvec{Y})&= \varvec{\mu }+(\nu /\pi )^{1/2}\frac{\Gamma ((\nu -1)/2)}{\Gamma (\nu /2)}\varvec{\delta },\nonumber \\ cov(\varvec{Y})&= \left[ \sigma ^2\varvec{I}_m+\varvec{\Delta }^2(\varvec{\delta })\right] \frac{\nu }{\nu -2}-\frac{\nu }{\pi }\left[ \frac{\Gamma \{(\nu -1)/2\}}{\Gamma (\nu /2)}\right] ^2\varvec{\Delta }^2(\varvec{\delta }). \end{aligned}$$
(17)

It is noted that when \(\varvec{\delta }=\mathbf{0}\), the ST distribution reduces to the usual \(t\) distribution. In order to have a zero mean vector, we should assume the location parameter \(\varvec{\mu }=-(\nu /\pi )^{1/2}\frac{\Gamma ((\nu -1)/2)}{\Gamma (\nu /2)}\varvec{\delta }\), which is what we assume in the paper. In order to better understand the shape of an ST distribution, plots of an ST density as a function of the skewness parameter with \(\delta = -3, 0, 3\) are shown in Fig. 5a.

Fig. 5
figure 5

The univariate skew-\(t\) (df \(\nu =4\)) and skew-normal density functions with variance \(\sigma ^2=1\) and skewness parameter \(\delta =0, -3\) and 3, respectively

By the proposition of Sahu et al. (2003), the ST distribution of \(\varvec{Y}\) has a convenient stochastic representation as follows.

$$\begin{aligned} \varvec{Y}=\varvec{\mu }+\varvec{\Delta }(\varvec{\delta })|\varvec{X}_0|+\varvec{\Sigma }^{1/2}\varvec{X}_1, \end{aligned}$$
(18)

where \(\varvec{X}_0\) and \(\varvec{X}_1\) are two independent random vectors following \(t_{m,\nu }(\mathbf{0},\varvec{I}_m)\). Note that the expression (18) provides a convenient device for random number generation and for implementation purposes. Let \(\varvec{w}=|\varvec{X}_0|\); then \(\varvec{w}\) follows an \(m\)-dimensional standard \(t\) distribution \(t_{m,\nu }(\mathbf{0},\varvec{I}_m)\) truncated in the space \(\varvec{w}>\mathbf{0}\) (i.e., the standard half-\(t\) distribution). Thus, a hierarchical representation of (18) is given by

$$\begin{aligned} \varvec{Y}|\varvec{w}\sim t_{m,\nu +m}(\varvec{\mu }+\varvec{\Delta }(\varvec{\delta })\varvec{w}, \omega \varvec{\Sigma }),\; \varvec{w}\sim t_{m,\nu }(\mathbf{0},\varvec{I}_m)\varvec{I}(\varvec{w}>\mathbf{0}), \end{aligned}$$
(19)

where \(\omega =(\nu +\varvec{w}^T\varvec{w})/(\nu +m)\).

Skew-normal distribution

We briefly discuss a multivariate SN distribution introduced by Sahu et al. (2003) in this section. An \(m\)-dimensional random vector \(\varvec{Y}\) follows an \(m\)-variate SN distribution, if its pdf is given by

$$\begin{aligned} f(\varvec{y}|\varvec{\mu },\varvec{\Sigma },\varvec{\Delta }(\varvec{\delta }))= 2^m|\varvec{A}|^{-1/2}\phi _m \{\varvec{A}^{-1/2}(\varvec{y}-\varvec{\mu })\} P(\varvec{U}>\mathbf{0}), \end{aligned}$$
(20)

where \( \varvec{U}\sim N_m\{\varvec{\Delta }(\varvec{\delta }) \varvec{A}^{-1}(\varvec{y}-\varvec{\mu }), \varvec{I}_m-\varvec{\Delta }(\varvec{\delta }) \varvec{A}^{-1}\varvec{\Delta }(\varvec{\delta })\}\), and \(\phi _m (\cdot )\) is the pdf of \(N_m(\mathbf{0},\varvec{I}_m)\). We denote the above distribution by \(SN_m (\varvec{\mu },\varvec{\Sigma },\varvec{\Delta }(\varvec{\delta }))\). An appealing feature of equation (20) is that it gives independent marginal when \(\varvec{\Sigma }=\textit{diag}(\sigma ^2_1, \sigma ^2_2,\ldots , \sigma ^2_m)\). The pdf (20) thus reduces to

$$\begin{aligned} f(\varvec{y}|\varvec{\mu },\varvec{\Sigma },\varvec{\Delta }(\varvec{\delta }))=\prod _{i=1}^{m}\left[ \frac{2}{\sqrt{\sigma ^2_i+\delta ^2_i}}\phi \left\{ \frac{y_i-\mu _i}{\sqrt{\sigma ^2_i+\delta ^2_i}}\right\} \Phi \left\{ \frac{\delta _i}{\sigma _i}\frac{y_i-\mu _i}{\sqrt{\sigma ^2_i+\delta ^2_i}}\right\} \right] \!,\qquad \end{aligned}$$
(21)

where \(\phi (\cdot )\) and \(\Phi (\cdot )\) are the pdf and cdf of the standard normal distribution, respectively.

The mean and covariance matrix are given by \(E(\varvec{Y}) = \varvec{\mu }+\sqrt{2/\pi }\varvec{\delta },cov(\varvec{Y})=\varvec{\Sigma }+(1-2/\pi )\varvec{\Delta }^2(\varvec{\delta })\). It is noted that when \(\varvec{\delta }=\mathbf{0}\), the SN distribution reduces to usual normal distribution. In addition, the SN distribution is a special case of the ST distribution. That is, the ST distribution reduces to the SN distribution when the degrees of freedom \(\nu \rightarrow \infty \). In order to have a zero mean vector, we should assume the location parameter \(\varvec{\mu }=-\sqrt{2/\pi }\varvec{\delta }\). In order to better understand the shape of an SN distribution, plots of an SN density as a function of the skewness parameter with \(\delta = -3, 0\), and 3 are shown in Fig. 5b.

According to Sahu et al. (2003), if \(\varvec{Y}\) follows \(SN_m(\varvec{\mu },\varvec{\Sigma },\varvec{\Delta }(\varvec{\delta }))\), it can be expressed by a convenient stochastic representation as follows.

$$\begin{aligned} \varvec{Y}=\varvec{\mu }+\varvec{\Delta }|\varvec{X}_0|+\varvec{\Sigma }^{1/2}\varvec{X}_1, \end{aligned}$$
(22)

where \(\varvec{X}_0\) and \(\varvec{X}_1\) are two independent random vectors with \(N_m(\mathbf{0},\varvec{I}_m)\). Let \(\varvec{w}=|\varvec{X}_0|\); then, \(\varvec{w}\) follows an \(m\)-dimensional standard normal distribution \(N_m(\mathbf{0},\varvec{I}_m)\) truncated in the space \(\varvec{w}>\mathbf{0}\). Thus, a two-level hierarchical representation of (22) is given by

$$\begin{aligned} \varvec{Y}|\varvec{w}\sim N_m(\varvec{\mu }+\varvec{\Delta }\varvec{w}, \varvec{\Sigma }),\; \varvec{w}\sim N_m(\mathbf{0},\varvec{I}_m)\varvec{I}(\varvec{w}>\mathbf{0}). \end{aligned}$$
(23)

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Huang, Y., Hu, X.J. & Dagne, G.A. Jointly modeling time-to-event and longitudinal data: a Bayesian approach. Stat Methods Appl 23, 95–121 (2014). https://doi.org/10.1007/s10260-013-0242-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10260-013-0242-7

Keywords

  • Accelerated failure time model
  • Dirichlet process
  • Semiparametric linear/nonlinear mixed-effects model
  • Skew-elliptical distribution
  • Time-to-event