Bayesian dynamic regression models for interval censored survival data with application to children dental health

Abstract

Cox models with time-varying coefficients offer great flexibility in capturing the temporal dynamics of covariate effects on event times, which could be hidden from a Cox proportional hazards model. Methodology development for varying coefficient Cox models, however, has been largely limited to right censored data; only limited work on interval censored data has been done. In most existing methods for varying coefficient models, analysts need to specify which covariate coefficients are time-varying and which are not at the time of fitting. We propose a dynamic Cox regression model for interval censored data in a Bayesian framework, where the coefficient curves are piecewise constant but the number of pieces and the jump points are covariate specific and estimated from the data. The model automatically determines the extent to which the temporal dynamics is needed for each covariate, resulting in smoother and more stable curve estimates. The posterior computation is carried out via an efficient reversible jump Markov chain Monte Carlo algorithm. Inference of each coefficient is based on an average of models with different number of pieces and jump points. A simulation study with three covariates, each with a coefficient of different degree in temporal dynamics, confirmed that the dynamic model is preferred to the existing time-varying model in terms of model comparison criteria through conditional predictive ordinate. When applied to a dental health data of children with age between 7 and 12 years, the dynamic model reveals that the relative risk of emergence of permanent tooth 24 between children with and without an infected primary predecessor is the highest at around age 7.5, and that it gradually reduces to one after age 11. These findings were not seen from the existing studies with Cox proportional hazards models.

This is a preview of subscription content, log in to check access.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

References

  1. Biller C (2000) Adaptive Bayesian regression splines in semiparametric generalized linear models. J Comput Graph Stat 9(1):122–140

    MathSciNet  Google Scholar 

  2. Biller C, Fahrmeir L (2001) Bayesian varying-coefficient models using adaptive regression splines. Stat Model 1(3):195–211

    MATH  Article  Google Scholar 

  3. Brezger A, Lang S (2006) Generalized structured additive regression based on Bayesian P-splines. Comput Stat Data Anal 50(4):967–991

    MathSciNet  Article  Google Scholar 

  4. Brooks SP, Giudici P (1999) Convergence assessment for reversible jump MCMC simulations. In: Bernardo JM, Berger JO, Dawid AP, Smith A (eds) Bayesian statistics 6—proceedings of the sixth valencia international meeting. Clarendon Press, Oxford, pp 733–742

  5. Cai T, Betensky RA (2003) Hazard regression for interval-censored data with penalized spline. Biometrics 59(3):570–579

    MathSciNet  MATH  Article  Google Scholar 

  6. Cai Z, Sun Y (2003) Local linear estimation for time-dependent coefficients in Cox’s regression models. Scand J Stat 30(1):93–111

    MathSciNet  MATH  Article  Google Scholar 

  7. Celeux G, Forbes F, Robert CP, Titterington DM (2006) Deviance information criteria for missing data models (Pkg: P651–706). Bayesian Anal 1(4):651–674

    MathSciNet  Article  Google Scholar 

  8. Cox DR (1972) Regression models and life-tables (with discussion). J R Stat Soc B 34:187–220

    MATH  Google Scholar 

  9. De Iorio M, Johnson WO, Müller P, Rosner GL (2009) Bayesian nonparametric nonproportional hazards survival modeling. Biometrics 65(3):762–771

    MathSciNet  MATH  Article  Google Scholar 

  10. Denison DGT, Mallick BK, Smith AFM (1998) Automatic Bayesian curve fitting. J R Stat Soc B 60:333–350

    MathSciNet  MATH  Article  Google Scholar 

  11. Dimatteo I, Genovese CR, Kass RE (2001) Bayesian curve-fitting with free-knot splines. Biometrika 88(4):1055–1071

    MathSciNet  MATH  Article  Google Scholar 

  12. Fahrmeir L, Lang S (2001) Bayesian inference for generalized additive mixed models based on Markov random field priors. J R Stat Soc C 50(2):201–220

    MathSciNet  Article  Google Scholar 

  13. Fan J, Gijbels IA, King M (1997) Local likelihood and local partial likelihood in hazard regression. Annals Stat 25(4):1661–1690

    MathSciNet  MATH  Article  Google Scholar 

  14. Fine JP, Yan J, Kosorok MR (2004) Temporal process regression. Biometrika 91(3):683

    MathSciNet  MATH  Article  Google Scholar 

  15. Geisser S, Eddy WF (1979) A predictive approach to model selection (Corr: V75 p765). J Am Stat Assoc 74:153–160

    MathSciNet  MATH  Article  Google Scholar 

  16. Gilks WR, Wild P (1992) Adaptive rejection sampling for Gibbs sampling. Appl Stat 41:337–348

    MATH  Article  Google Scholar 

  17. Goetghebeur E, Ryan L (2000) Semiparametric regression analysis of interval-censored data. Biometrics 56(4):1139–1144

    MathSciNet  MATH  Article  Google Scholar 

  18. Goggins WB, Finkelstein DM, Schoenfeld DA, Zaslavsky AM (1998) A Markov chain Monte Carlo EM algorithm for analyzing interval-censored data under the Cox proportional hazards model. Biometrics 54:1498–1507

    MATH  Article  Google Scholar 

  19. Gómez G, Calle ML, Oller R, Langohr K (2009) Tutorial on methods for interval-censored data and their implementation in R. Stat Modell 9(4):259

    Article  Google Scholar 

  20. Gray RJ (1992) Flexible methods for analyzing survival data using splines, with application to breast cancer prognosis. J Am Stat Assoc 87:942–951

    Article  Google Scholar 

  21. Green PJ (1995) Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82:711–732

    MathSciNet  MATH  Article  Google Scholar 

  22. Hastie T, Tibshirani R (1993) Varying-coefficient models. J R Stat Soc B 55(4):757–796

    MathSciNet  MATH  Google Scholar 

  23. Hennerfeind A, Brezger A, Fahrmeir L (2006) Geoadditive survival models. J Am Stat Assoc 101(475):1065–1075

    MathSciNet  MATH  Article  Google Scholar 

  24. Ibrahim JG, Chen M-H, Sinha D (2001) Bayesian survival analysis. Springer, New York

    Google Scholar 

  25. Jara A, Lesaffre E, De Iorio M, Quitana F (2010) Bayesian semiparametric inference for multivariate doubly-interval-censored data. Ann Appl Stat 4(4):2126–2149

    MathSciNet  MATH  Article  Google Scholar 

  26. Kauermann G (2005) Penalized spline smoothing in multivariable survival models with varying coefficients. Comput Stat Data Anal 49(1):169–186

    MathSciNet  MATH  Article  Google Scholar 

  27. Kim S, Chen M-H, Dey DK, Gamerman D (2007) Bayesian dynamic models for survival data with a cure fraction. Lifetime Data Anal 13(1):17–35

    MathSciNet  MATH  Article  Google Scholar 

  28. Kneib T (2006) Mixed model-based inference in geoadditive hazard regression for interval-censored survival times. Comput Stat Data Anal 51(2):777–792

    MathSciNet  MATH  Article  Google Scholar 

  29. Kneib T, Fahrmeir L (2007) A mixed model approach for geoadditive hazard regression. Scand J Stat 34(1):207–228

    MathSciNet  MATH  Article  Google Scholar 

  30. Kooperberg C, Clarkson DB (1997) Hazard regression with interval-censored data. Biometrics 53:1485–1494

    MATH  Article  Google Scholar 

  31. Kooperberg C, Stone CJ, Truong YK (1995) Hazard regression. J Am Stat Assoc 90:78–94

    MathSciNet  MATH  Article  Google Scholar 

  32. Martinussen T, Scheike TH (2002) A flexible additive multiplicative hazard model. Biometrika 86(2):283–298

    MathSciNet  Article  Google Scholar 

  33. Murphy SA, Sen PK (1991) Time-dependent coefficients in a Cox-type regression model. Stoch Process Appl 39:153–180

    MathSciNet  MATH  Article  Google Scholar 

  34. Pan W (1999) Extending the iterative convex minorant algorithm to the Cox model for interval-censored data. J Comput Graph Stat 8:109–120

    Google Scholar 

  35. Pan W (2000) A multiple imputation approach to Cox regression with interval-censored data. Biometrics 56(1):199–203

    MATH  Article  Google Scholar 

  36. Peng L, Huang Y (2007) Survival analysis with temporal covariate effects. Biometrika 94(3):719–733

    MathSciNet  MATH  Article  Google Scholar 

  37. Satten GA (1996) Rank-based inference in the proportional hazards model for interval censored data. Biometrika 83:355–370

    MATH  Article  Google Scholar 

  38. Sinha D, Chen M-H, Ghosh SK (1999) Bayesian analysis and model selection for interval-censored survival data. Biometrics 55(2):585–590

    MathSciNet  MATH  Article  Google Scholar 

  39. Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A (2002) Bayesian measures of model complexity and fit (Pkg: P583–639). J R Stat Soc B 64(4):583–616

    MATH  Article  Google Scholar 

  40. Tian L, Zucker D, Wei L (2005) On the Cox model with time-varying regression coefficients. J Am Stat Assoc 100(469):172–183

    MathSciNet  MATH  Article  Google Scholar 

  41. Vanobbergen J, Martens L, Declerck D, Lesaffre M (2000) The Signal Tandmobiel (R) Project: A longitudinal intervention oral health promotion study in Flanders (Belgium): Baseline and first year results. Eur J Paediatric Dentistr 2:87–96

    Google Scholar 

Download references

Acknowledgments

We would like to thank the Editor-in-Chief, an Associate Editor, and two referees for their helpful comments and suggestions, which led to an improved version of the article. This research was partially supported by NIH grants GM70335 and CA74015 and NSF grant DMS0805965.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Jun Yan.

Appendix: Sampling details in the update move

Appendix: Sampling details in the update move

The values of \(\mu _i\) and \(\sigma _i\) in (5) are calculated as follows.

When \(J=1\), it corresponds to time-independent coefficient model. There is only one piece of \(\theta (t)\) to sample, \(\mu _1 = 0\) and \(\sigma _1^2 = \infty \).

When \(J>1\), for \(j=1\),

$$\begin{aligned} \mu _1&= \sigma _1^2 \Big \{ \sum \limits _i\sum \limits _k {I\big (s_k \in (0, \tau _1]\big ) Z_i dN_{i,k}} \Big \} + a_0 \theta (\tau _2) / (1+a_0) , \nonumber \\ \sigma _1^2&= a_0 \omega / (1 + a_0) ; \end{aligned}$$

for \(j = 2,\ldots , J - 1\),

$$\begin{aligned} \mu _j&= \sigma _j^2 \Big \{ \sum \limits _i\sum \limits _k {I\big (s_k \in (\tau _{j-1}, \tau _j]\big ) Z_i dN_{i,k}} \Big \} + \theta (\tau _{j-1}) / 2 + \theta (\tau _{j+1}) / 2, \nonumber \\ \sigma _j^2&= \omega / 2; \end{aligned}$$

and for \(j=J\),

$$\begin{aligned} \mu _J&= \sigma _J^2 \Big \{ \sum \limits _i\sum \limits _k {I\big (s_k \in (\tau _{J-1}, \tau _J]\big ) Z_i dN_{i,k}} \Big \} + \theta (\tau _{J-1}), \nonumber \\ \sigma _J^2&= \omega . \end{aligned}$$

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Wang, X., Chen, M. & Yan, J. Bayesian dynamic regression models for interval censored survival data with application to children dental health. Lifetime Data Anal 19, 297–316 (2013). https://doi.org/10.1007/s10985-013-9246-8

Download citation

Keywords

  • Cox model
  • Latent variables
  • Markov chain Monte Carlo
  • Reversible jump
  • Semiparametric
  • Time-varying coefficient