Predictive Analytics in Long Term Care

Zail, Howard

doi:10.1007/978-3-030-05660-5_13

Part of the book series: Springer Actuarial (SPACT)

Abstract

This chapter introduces machine learning and artificial intelligence techniques for analyzing long term care (LTC) risks. The focus is on incidence rates, but the methods can be extended to termination and lapse rates. Four algorithms are described: the generalized linear model, the Lasso, neural networks and extreme gradient boosting. We provide practical advice on how to clean and prepare data and how to train and validate models, and we show how to test and compare the accuracy of the different algorithms. The chapter provides the tools for developing LTC risk rates from either industry data or an insurer’s proprietary database.

Notes

1. Murphy [1].
2. “Long Term Care Intercompany Experience Study” [2].
3. The full set of data adjustments to date is found in the clean_incidence function on the GitHub site.
4. “Caveats for Use of Long Term Care Experience Basic Tables”, 2015, Society of Actuaries.
5. These claim count fields are called Claims_NH, Claims_ALF, and Claims_HHC, respectively, in the database.
6. The selection of ActiveExposure as the measure, instead of TotalExposure, is described in Aalen [3], Chap. 5.
7. See (i) Holford [4] and (ii) Rodriguez [5].
8. This same loss function can be derived solely from a piecewise constant hazard function assumption and does not require the \( y_{i} \) to have a Poisson distribution. See Aalen [3], Chap. 5.
9. Each of the field names is described in the SOA Report [2].
10. Akaike [6].
11. We use the term “LAMBDA” to designate the constant in order to be consistent with the R package glmnet, which will be used to execute the Lasso. This LAMBDA is not the same as, and should not be confused with, the Poisson mean parameter \( \lambda \).
12. Tibshirani [7].
13. “Deviance” is a measure of the accuracy of the model (lower is better); see the glmnet package vignette for more details.
14. Unfortunately, at the time of writing, setting up Keras on a GPU is not for the faint of heart. Only Nvidia GPUs are currently supported. Help on installing the various components can be found on the RStudio, TensorFlow and Nvidia websites. An alternative is to use a cloud server, for example through Google or Amazon, that comes pre-installed with R and Keras.
15. Chen [9].
16. A lengthy discussion of tuning parameters can be found at http://xgboost.readthedocs.io/en/latest///parameter.html.
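
The claim in Note 8 can be checked numerically. The sketch below is an illustration only, not code from the chapter: the function names, the single exposure cell (3 claims over 250 life-years) and the candidate rates are all hypothetical. It compares the Poisson log-likelihood of a claim count with mean rate × exposure against the log-likelihood implied by a constant hazard over the same exposure.

```python
import math


def poisson_loglik(y, exposure, rate):
    """Log-likelihood of y claims when y ~ Poisson(rate * exposure)."""
    mu = rate * exposure
    return y * math.log(mu) - mu - math.lgamma(y + 1)


def constant_hazard_loglik(y, exposure, rate):
    """Log-likelihood of y events over `exposure` under a constant hazard `rate`.

    No Poisson assumption is made here: the likelihood is rate**y * exp(-rate * exposure).
    """
    return y * math.log(rate) - rate * exposure


# Hypothetical cell: 3 claims observed over 250 life-years of exposure.
y, exposure = 3, 250.0

# Difference between the two log-likelihoods at several candidate rates.
gaps = [
    poisson_loglik(y, exposure, r) - constant_hazard_loglik(y, exposure, r)
    for r in (0.005, 0.012, 0.03)
]
```

Every entry of `gaps` equals `y * log(exposure) - log(y!)`, which does not depend on the rate. The two log-likelihoods therefore differ only by a constant, so minimizing either one over the rate parameters gives the same fitted rates.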

References

1. Murphy, K.P.: Machine Learning: A Probabilistic Perspective (Adaptive Computation and Machine Learning series). Massachusetts Institute of Technology (2012)
2. Long Term Care Intercompany Experience Study - Aggregate Database 2000–2011 Report, January 2015, Society of Actuaries
3. Aalen, O.O., Borgan, O., Gjessing, H.: Survival and Event History Analysis: A Process Point of View. Springer (2008)
4. Holford, T.: The analysis of rates and of survivorship using log-linear models. Biometrics 36 (1980)
5. Rodriguez, G.: Generalized Linear Models. Princeton University, http://data.princeton.edu/wws509/notes/c7s4.html
6. Akaike, H.: A new look at the statistical model identification. IEEE Trans. Autom. Control. (1974)
7. Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodological) 58(1), 267–288 (1996)
8. Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)
9. Chen, T., Guestrin, C.: XGBoost: A Scalable Tree Boosting System. https://arxiv.org/pdf/1603.02754.pdf (2016)
10. McCullagh, P., Nelder, J.A.: Generalized Linear Models. Chapter 13, Chapman & Hall (1989)

Author information

Authors and Affiliations

Elucidor, LLC, 305 East 40th St, Suite 21F, New York, NY, 10016, USA
Howard Zail

Corresponding author

Correspondence to Howard Zail.

Editor information

Editors and Affiliations

Bellows Falls, VT, USA
Etienne Dupourqué
ISFA, Université Lyon, Lyon, France
Frédéric Planchet
ISUP, New York, NY, USA
Néfissa Sator

About this chapter

Cite this chapter

Zail, H. (2019). Predictive Analytics in Long Term Care. In: Dupourqué, E., Planchet, F., Sator, N. (eds) Actuarial Aspects of Long Term Care. Springer Actuarial. Springer, Cham. https://doi.org/10.1007/978-3-030-05660-5_13

DOI: https://doi.org/10.1007/978-3-030-05660-5_13
Published: 29 May 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05659-9
Online ISBN: 978-3-030-05660-5
eBook Packages: Mathematics and Statistics (R0)
