Statistical Prediction of High-Cost Claimants Using Commercial Health Plan Data
Among Cigna’s claimant population with at least one year of continuous medical or pharmacy eligibility over 2014–2015 (N = 2.7 million), our objective was to accurately identify high-cost claimants and identify clinical and demographic cost drivers among individuals with commercial health plan benefits. High-cost claimants were defined as those with annual costs over $100,000. We collected 800+ potential risk factors and utilized multivariable weighted logistic regression on an oversampled model dataset to estimate odds ratios for clinical and demographic factors available in claims data. We used decision tree methodology to assist in variable selection/reduction. High-cost claimants (n = 17,702) comprised only 0.6% of the 2015–2016 population, but accounted for over 20% of 2015–2016 total costs. Our optimized maximum likelihood estimation model identified cost drivers including: actuarial prospective episode-related group (ERG) risk score, prior-year medical claim costs, prior-year pharmacy claim costs, gaps in care/noncompliance score, hemophilia, short stature, and end-stage renal disease. Our findings show that weighted logistic regression modeling with oversampling techniques can be used to identify high-cost claimants in the upcoming year more accurately than traditional maximum likelihood estimation. Managed care decision makers should use prospective claims data analyses to target and implement intervention programs, with the goal of managing care among those at risk for incurring catastrophic costs.
KeywordsData science Rare event statistical prediction Managed care administration
The authors gratefully acknowledge the technical and editorial assistance provided by Jeffrey Young, MS, Janki S. Bhatt, PhD, Stuart Lustig, MD, Joshua Barrett, MS, Jeanne Fox, BNS, Qun Wang, PhD, Nick Tschaika, BS and Lillian Thomas, BS.
- 3.Wilson, D.M., Troy, T.D., Jones, K.L.: American Health Policy Institute: High Cost Claimants: Private vs. Public Sector Approaches (2016)Google Scholar
- 6.United States Census Bureau: Health Insurance Coverage in the United States: 2015, Report P60-257 (2016)Google Scholar
- 10.Optum Insight: Symmetry Episode Treatment Groups: Measuring Health Care with Meaningful Episodes of Care. White Paper (2012). https://www.optum.com/content/dam/optum/resources/whitePapers/symmetry_episode_treatment_groups_wp_06_2012.pdf
- 11.Sun Life Financial: Third Annual Sun Life Stop-Loss Research Report, Spring 2015. Stop Loss Insurance Brokers, Ltd. https://www.stoplossins.com/wp-content/uploads/2015/07/Catastrophic-Claims-Report-2015.pdf. Accessed 24 Apr 2018