Skip to main content
Log in

Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective

  • Published:
Lifetime Data Analysis Aims and scope Submit manuscript

Abstract

A treatment regime at a single decision point is a rule that assigns a treatment, among the available options, to a patient based on the patient’s baseline characteristics. The value of a treatment regime is the average outcome of a population of patients if they were all treated in accordance to the treatment regime, where large values are desirable. The optimal treatment regime is a regime which results in the greatest value. Typically, the optimal treatment regime is estimated by positing a regression relationship for the outcome of interest as a function of treatment and baseline characteristics. However, this can lead to suboptimal treatment regimes when the regression model is misspecified. We instead consider value search estimators for the optimal treatment regime where we directly estimate the value for any treatment regime and then maximize this estimator over a class of regimes. For many studies the primary outcome of interest is survival time which is often censored. We derive a locally efficient, doubly robust, augmented inverse probability weighted complete case estimator for the value function with censored survival data and study the large sample properties of this estimator. The optimization is realized from a weighted classification perspective that allows us to use available off the shelf software. In some studies one treatment may have greater toxicity or side effects, thus we also consider estimating a quality adjusted optimal treatment regime that allows a patient to trade some additional risk of death in order to avoid the more invasive treatment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Bai X, Tsiatis AA, O’Brien SM (2013) Doubly-robust estimators of treatment-specific survival distributions in observational studies with stratified sampling. Biometrics 69:830–839

    Article  MathSciNet  MATH  Google Scholar 

  • Brinkley J, Tsiatis AA, Anstrom KJ (2009) A generalized estimator of the attributable benefit of an optimal treatment regime. Biometrics 21:512–522

    MathSciNet  MATH  Google Scholar 

  • Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297

    MATH  Google Scholar 

  • Hubbard AE, van der Laan MJ, Robins JM (1999) Nonparametric locally efficient estimation of the treatment specific survival distributions with right censored data and covariates in observational studies. In: Halloran E, Berry D (eds) Statistical models in epidemiology: the environment and clinical trials. Springer, New York, pp 134–178

    Google Scholar 

  • Moodie E, Richardson TS, Stephens D (2007) Demystifying optimal dynamic treatment regimes. Biometrics 63:447–455

    Article  MathSciNet  MATH  Google Scholar 

  • Murphy SA (2003) Optimal dynamic treatment regimes (with discussion). J R Stat Soc Ser B 65:331–366

    Article  MathSciNet  MATH  Google Scholar 

  • Orellana L, Rotnitzky A, Robins JM (2010) Dynamic regime marginal structural mean models for estimation of optimal treatment regimes, part I: main content. Int J Biostat. doi:10.2202/1557-4679.1242

  • Robins JM (2004) Optimal structured nested models for optimal sequential decisions. In: Lin DY, Heagerty PJ (eds) Proceedings of the second seattle symposium on biostatistics. Springer, New York, pp 189–326

    Chapter  Google Scholar 

  • Robins J, Orellana L, Rotnitzky A (2008) Estimation and extrapolation of optimal treatment and testing strategies. Stat Med 27:4678–4721

    Article  MathSciNet  Google Scholar 

  • Rubin DB (1978) Bayesian inference for causal effects: the role of randomization. Ann Stat 6:34–58

    Article  MathSciNet  MATH  Google Scholar 

  • Tsiatis AA (2006) Semiparametric theory and missing data. Springer, New York

    MATH  Google Scholar 

  • van der Laan MJ, Luedtke AR (2014) Targeted learning of the mean outcome under an optimal dynamic treatment rule. U.C. Berkeley Division of Biostatistics Working Paper Series. Working Paper 325

  • Zhang B, Tsiatis AA, Laber EB, Davidian M (2012a) A robust method for estimating optimal treatment regimes. Biometrics 68:1010–1018

    Article  MathSciNet  MATH  Google Scholar 

  • Zhang B, Tsiatis AA, Davidian M, Zhang M, Laber EB (2012b) Estimating optimal treatment regimes from a classification perspective. Stat 1:103–114

    Article  MATH  Google Scholar 

  • Zhao L, Tian L, Cai T, Claggett B, Wei LJ (2013) Effectively selecting a target population for a future comparative study. J Am Stat Assoc 108:527–539

    Article  MathSciNet  MATH  Google Scholar 

  • Zhao Y, Zeng D, Rush AJ, Kosorok MR (2012) Estimating individualized treatment rules using outcome weighted learning. J Am Stat Assoc 107:1106–1118

    Article  MathSciNet  MATH  Google Scholar 

  • Zhao Y, Kosorok MR, Zeng D (2009) Reinforcement learning design for cancer clinical trials. Stat Med 28:3294–3315

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgments

This research was supported by NIH Grants R01 HL118336 and P01 CA142538. The authors gratefully acknowledge Dr. Sean O’Brien, Dr. William Weintraub, Dr. Fred Edwards, and the ASCERT investigator team for assembling the study database.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaofei Bai.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 325 KB)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bai, X., Tsiatis, A.A., Lu, W. et al. Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective. Lifetime Data Anal 23, 585–604 (2017). https://doi.org/10.1007/s10985-016-9376-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10985-016-9376-x

Keywords

Navigation