Segmentation of High-Cost Adults in an Integrated Healthcare System Based on Empirical Clustering of Acute and Chronic Conditions

Davis, Anna C.; Shen, Ernest; Shah, Nirav R.; Glenn, Beth A.; Ponce, Ninez; Telesca, Donatello; Gould, Michael K.; Needleman, Jack

doi:10.1007/s11606-018-4626-0


Original Research
Published: 04 September 2018

Volume 33, pages 2171–2179, (2018)

Journal of General Internal Medicine


Anna C. Davis PhD^{1, 2, 3},
Ernest Shen PhD⁴,
Nirav R. Shah MD^{3, 5},
Beth A. Glenn PhD²,
Ninez Ponce PhD²,
Donatello Telesca PhD⁶,
Michael K. Gould MD⁴ &
…
Jack Needleman PhD²


Abstract

Background

High-cost patients are a frequent focus of improvement projects based in primary care and other settings. Efforts to characterize high-cost, high-need patients are needed to inform care planning, but such efforts often rely on a priori assumptions, masking underlying complexities of a heterogeneous population.

Objective

To define recognizable subgroups of patients among high-cost adults based on clinical conditions, and describe their survival and future spending.

Design

Retrospective observational cohort study.

Participants

Within a large integrated delivery system with 2.7 million adult members, we selected the top 1% of continuously enrolled adults with respect to total healthcare expenditures during 2010.

Main Measures

We used latent class analysis to identify clusters of similar patients based on 53 hierarchical condition categories. Prognosis as measured by healthcare spending and survival was assessed through 2014 for the resulting classes of patients.

Results

Among 21,183 high-cost adults, seven clinically distinctive subgroups of patients emerged. Classes included end-stage renal disease (12% of high-cost population), cardiopulmonary conditions (17%), diabetes with multiple comorbidities (8%), acute illness superimposed on chronic conditions (11%), conditions requiring highly specialized care (14%), neurologic and catastrophic conditions (5%), and patients with few comorbidities (the largest class, 33%). Over 4 years of follow-up, 6566 (31%) patients died, and survival in the classes ranged from 43 to 88%. Spending regressed to the mean in all classes except the ESRD and diabetes with multiple comorbidities groups.

Conclusions

Data-driven characterization of high-cost adults yielded clinically intuitive classes that were associated with survival and reflected markedly different healthcare needs. Relatively few high-cost patients remain persistently high cost over 4 years. Our results suggest that high-cost patients, while not a monolithic group, can be segmented into a few subgroups. These subgroups may be the focus of future work to understand appropriateness of care and design interventions accordingly.


INTRODUCTION

There is national interest in improving care for patients with complex needs, for whom there are both humanitarian and financial imperatives to do better.^{1, 2} In an environment of increasing value-based payment and risk-sharing, strong incentives encourage the development of complex care models, which often rely on enhancing the relationship between high-needs patients and their ambulatory care providers.^{3, 4, 5} Yet few interventions have demonstrated reliable evidence of success. The reasons are myriad, but the heterogeneity of high-cost patients (and their needs) is one important factor.

A better understanding of complex patients is foundational to understanding and harnessing improvement opportunities.¹ Published descriptions of patients with high costs are mixed in terms of rigor and frequently are limited by simplified frameworks, such as using decision trees to assign patients to hypothesized groups. These approaches can fail to recognize subgroups that are not intuitive.

One pathway forward is the application of patient segmentation methods. Segmentation methods define subtypes within a heterogeneous population, using data both to identify the groups that exist and to assign individuals to groups. This family of approaches is being deployed in varied industries,⁶ and is increasingly recognized as a key strategy to improve healthcare.⁷

Recently, several authors have used segmentation methods to cluster patients based on clinical conditions^{8, 9, 10} or patterns of utilization.^{11, 12} However, many of the studies focused on high-need patients have been restricted to specific populations or settings,¹³ such as Medicare^{8, 11, 12, 14} or Medicaid^{15, 16} beneficiaries, or patients of a safety net hospital,^{9, 17} and have used self-reported data or limited data sets to create patient clusters.

This paper goes beyond the existing literature by presenting a data-driven characterization of high-cost adults using comprehensive clinical data from a diverse, all-payer population of health plan members. We hypothesized that interventions for high-cost patients should be tailored based on clinical needs, and therefore focused our segmentation on patterns of chronic and acute conditions. While we did not set out to understand appropriateness of high spending in our population, we establish a granular description of high-cost adults that lays the foundation for subsequent work.

METHODS

Study Design, Setting, and Data Sources

This is a retrospective observational cohort study of patients from Kaiser Permanente Southern California (KPSC). KPSC is an integrated healthcare delivery system and health plan with a comprehensive electronic health record (EHR). In 2010, KPSC cared for over 4.4 million members in 14 hospitals and over 220 medical offices across Southern California.¹⁸ The population of members is generally representative of the underlying population¹⁹ and includes Medicare and Medicaid beneficiaries as well as people with commercial and employer-sponsored coverage. We drew existing data from 2010 through 2014 from Kaiser Permanente’s internal data infrastructure (see eMethods in the online supplement).

Participants

We defined the high-cost population as the top 1% of continuously enrolled adult KPSC members with respect to total healthcare expenditures during 2010. To obtain the cohort, we first limited the adult membership of KPSC in 2010 (N = 2,714,005) to those with continuous coverage (N = 2,118,343) and then selected the top 1% (N = 21,183; eFigure 1).
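As a rough illustration of the selection rule described above (rank continuously enrolled members by total annual spending and keep the top 1%), the following Python sketch works on a hypothetical mapping from member ID to spending. The function name, the toy data, and the choice to round the cutoff down are assumptions made for illustration; this is not the study's code.

```python
from typing import Dict, List


def select_top_spenders(spending: Dict[str, float], percent: int = 1) -> List[str]:
    """Return member IDs in the top `percent` of total spending, highest first."""
    # Integer arithmetic keeps the cutoff exact and rounds down.
    n_top = len(spending) * percent // 100
    ranked = sorted(spending, key=lambda member: spending[member], reverse=True)
    return ranked[:n_top]


# Synthetic example: 200 hypothetical members with spending of 1..200 dollars.
toy_spending = {f"m{i}": float(i) for i in range(1, 201)}
top_members = select_top_spenders(toy_spending)  # the two highest spenders
```

Applied to the cohort sizes reported above, the same round-down rule gives 2,118,343 × 1 // 100 = 21,183 members.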

Variables

We used the Centers for Medicare and Medicaid Services’ (CMS’s) publicly available ICD-9 grouping methodology, the Hierarchical Condition Categories (HCCs) system,²⁰ to develop indicator variables that captured the presence or absence of acute and chronic conditions based on diagnostic codes for each patient. We collapsed some very rare condition categories (CCs) together (eMethods), resulting in a final list of 53 CCs used to define patient classes. We did not apply the hierarchies available in the CMS-HCC algorithm, but rather leveraged only the validated ICD-9 grouping function.
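To make the idea of presence/absence indicators concrete, here is a minimal Python sketch that turns a patient's diagnosis codes into 0/1 flags per condition category. The three-entry code map and the category labels are hypothetical placeholders, not the CMS-HCC mapping; as in the text, no hierarchy is applied, so a patient can be flagged for several categories at once.

```python
from typing import Dict, Iterable

# Hypothetical toy mapping from ICD-9 codes to condition-category labels.
# The real CMS grouping covers thousands of codes; this is illustration only.
TOY_CODE_TO_CATEGORY = {
    "250.00": "diabetes",
    "428.0": "heart_failure",
    "585.6": "renal",
}
CATEGORIES = sorted(set(TOY_CODE_TO_CATEGORY.values()))


def condition_indicators(codes: Iterable[str]) -> Dict[str, int]:
    """Return a 0/1 flag per category; repeated or unmapped codes are ignored."""
    present = {TOY_CODE_TO_CATEGORY[c] for c in codes if c in TOY_CODE_TO_CATEGORY}
    return {category: int(category in present) for category in CATEGORIES}


flags = condition_indicators(["250.00", "250.00", "999.9"])
```

Each patient's row of flags is the kind of input a latent class model would then take as its manifest variables.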

Statistical Methods

Our primary analysis used a multivariate structural equations modeling technique called latent class analysis (LCA). LCA is based on the theory that an unobservable (“latent”) variable (i.e., class membership) can be measured through patterns of association in a set of observable “manifest” variables (in our case, the set of 53 CCs capturing patients’ acute and chronic conditions). The investigator must identify the set of observed data on which to compose the classes.

LCA categorizes a population into underlying subgroups,^{11, 21, 22} with the goal of arriving at the most “parsimonious and interpretable set of classes”.^{8, 23, 24} Increasingly complex models are generated by adding to the number of latent classes, and model-fit indexes are used to compare models until the best-fitting model (i.e., with the lowest Bayesian information criterion, and best interpretability and parsimony) is found.²¹
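The statistical part of this comparison can be sketched in a few lines of Python. The sketch assumes an unconstrained latent class model with binary indicators and no covariates, so that a model with C classes and J indicators has (C − 1) + C × J free parameters, and it uses the standard definition BIC = −2 log L + k ln(n). The function names and the log-likelihood values are invented for illustration, and the judgment about interpretability and parsimony described above is not something this code captures.

```python
import math
from typing import Dict


def lca_free_parameters(n_classes: int, n_indicators: int) -> int:
    """Class proportions (C - 1) plus one item probability per class and indicator."""
    return (n_classes - 1) + n_classes * n_indicators


def bic(log_likelihood: float, n_parameters: int, n_observations: int) -> float:
    """Bayesian information criterion; lower values indicate better fit."""
    return -2.0 * log_likelihood + n_parameters * math.log(n_observations)


def best_class_count(log_likelihoods: Dict[int, float],
                     n_indicators: int, n_observations: int) -> int:
    """Return the number of classes whose fitted model has the lowest BIC."""
    scores = {
        c: bic(ll, lca_free_parameters(c, n_indicators), n_observations)
        for c, ll in log_likelihoods.items()
    }
    return min(scores, key=scores.get)


# Made-up log-likelihoods for 3-, 4-, and 5-class fits (not study results).
toy_fits = {3: -1000.0, 4: -900.0, 5: -895.0}
chosen = best_class_count(toy_fits, n_indicators=10, n_observations=1000)
```

In the toy example the five-class model fits slightly better than the four-class model, but the penalty for its extra parameters outweighs the gain, so the four-class model has the lowest BIC.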

Class membership is probabilistic. The model generates posterior probabilities of class membership, which sum to one for each individual. Once the best-fitting model is selected, individuals are assigned to the specific class for which their probability of membership is greatest.^{21, 25} The average of the posterior probabilities among members of each class can be used to assess quality of model fit. The classes are mutually exclusive and collectively exhaustive (eMethods).
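The assignment step can be illustrated with a small Python sketch. It assumes the usual latent class formulation with binary indicators that are independent within a class, so that the posterior probability of class c is proportional to the class proportion times the product of the item probabilities. The two-class parameters below are invented for illustration and all names are hypothetical.

```python
import math
from typing import List, Sequence


def class_posteriors(y: Sequence[int], class_priors: Sequence[float],
                     item_probs: Sequence[Sequence[float]]) -> List[float]:
    """Posterior class probabilities for one patient's 0/1 indicator vector."""
    log_weights = []
    for prior, probs in zip(class_priors, item_probs):
        # Work in log space; probabilities must lie strictly between 0 and 1.
        log_w = math.log(prior)
        for present, p in zip(y, probs):
            log_w += math.log(p if present else 1.0 - p)
        log_weights.append(log_w)
    peak = max(log_weights)
    weights = [math.exp(lw - peak) for lw in log_weights]
    total = sum(weights)
    return [w / total for w in weights]  # sums to one


def modal_class(posteriors: Sequence[float]) -> int:
    """Index of the class with the greatest posterior probability."""
    return max(range(len(posteriors)), key=lambda c: posteriors[c])


# Invented two-class model with two indicators (not study estimates).
PRIORS = [0.5, 0.5]
ITEM_PROBS = [[0.9, 0.8], [0.1, 0.2]]
post = class_posteriors([1, 1], PRIORS, ITEM_PROBS)
assigned = modal_class(post)
```

Averaging each patient's posterior probability for their assigned class, class by class, gives the kind of in-class summary used to judge classification quality.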

After completing the LCA, we used demographic, health behavior, utilization, spending, and survival data to describe and interpret the resulting classes (but not to compose the classes). For comparison with the top 1% of adults, we also drew a 10% simple random sample from the bottom 99% of spenders (“all other adults”). We used the Mplus and SAS statistical software packages for all analyses. This study was approved by the appropriate institutional review boards.

RESULTS

In aggregate, the top 1% of patients accounted for 22% of total healthcare spending on continuously enrolled adult KPSC members during 2010. Table 1 shows basic descriptive statistics for high-cost adults versus all other adults, for contextual understanding. Annual per patient spending among the top 1% ranged from $40,000 to $3.6 million. Mean annual spending was $77,600 (median $57,500), which was more than 26 times higher than the mean among the bottom 99% of spenders ($2900; median $1200).

Mean age among high-cost adults was 62 years, compared to a mean of 49 years in all other adults (Table 1). Forty-eight percent of high-cost adults were female (vs. 53% of all other adults), 51% were White (vs. 39%), and 51% were Medicare beneficiaries (vs. 16%). Diabetes was the most common condition among high-cost adults (34% of the top 1% vs. 7% of all other adults).

Table 1 Characteristics of Patients with High Costs Versus All Other Adults


Segmentation by Clinical Conditions

In the primary analysis, we tested LCA models with three to nine classes (eMethods) and selected a seven-class model as the best-fit solution based on fit statistics, interpretability, and parsimony.^{21, 23} In the final model, class sizes ranged from 5% (N = 1124) to 33% (N = 7014) of the high-cost population (eTable 1). The average of the posterior probabilities of class membership ranged from 0.79 to 0.89 (eTable 1), indicating good model performance.²⁶

Figure 1 displays the prevalence of the model indicators conditional on class membership; eTable 2 presents the same data in tabular format. Conditions were considered distinctive if they were highly prevalent within a class or highly concentrated in a specific class (even if not highly prevalent).

Some conditions were ubiquitous across all of the classes: diabetes, vascular diseases, and heart conditions were present to some extent across nearly every class. Within this context, three classes contained distinct groups of patients with specific constellations of chronic conditions: the end-stage renal disease (ESRD), cardio-vascular and pulmonary diseases, and diabetes with multiple comorbid conditions (MCCs) classes represented 12%, 17%, and 8% of the population, respectively (Fig. 1, eTable 1). The ESRD class contained patients with dialysis status and accompanying chronic comorbidities (Fig. 1, eTable 2). The cardio-vascular and pulmonary disease group had a high prevalence of heart conditions (e.g., congestive heart failure, coronary artery disease, vascular disease) and chronic obstructive pulmonary disease and low prevalence of diabetes. In contrast, patients in the diabetes with MCCs class had diabetes and commonly co-occurring comorbidities (e.g., dialysis status, congestive heart failure), as well as complications of diabetes (e.g., wound-healing and amputation codes, retinopathy).

Another class had acute exacerbations of chronic illness. This class made up 11% of the sample (eTable 1) and was the oldest class with a mean age of 71 years (eTable 3). In this class, we observed common chronic conditions, plus high prevalence of acute illnesses that are frequently observed as patients deteriorate and are proximate causes of death, such as sepsis and shock, acute renal failure, cardio-respiratory failure and arrest, intestinal obstruction/perforation, and artificial openings for feeding or elimination (Fig. 1, eTable 2). This class had the greatest number of individual members with ten or more comorbid conditions (Fig. 2). A somewhat larger class contained patients receiving highly specialized treatments (14% of the sample) such as organ transplantation and patients with immune disorders and cancers (Fig. 1, eTable 2). The smallest class contained patients with neurologic and catastrophic conditions (5% of the sample) such as stroke or post-stroke paralysis, seizure disorders, trauma such as severe head injuries and burns, and paralysis and coma (Fig. 1, eTable 2).

The largest class (33% of the sample) had few comorbidities (eTable 1). They were the youngest class with a mean age of 54 years and included more women and people with commercial insurance (eTable 3). The preponderance of patients in this class (68%) had two or fewer co-occurring conditions (Fig. 2), which represented a markedly lower burden of illness than among the other six classes.

Survival

After 4 years, 69% of high-cost adults from 2010 were alive, compared to 98% of all other adults (Fig. 3). Survival among high-cost adults varied widely between the classes. Four-year survival was best (88%) for those with few comorbidities and worst for the acute exacerbations of chronic illness (43%) and diabetes with MCCs (46%) classes. The remaining classes had survival in the range of 62 to 73% at 4 years (Fig. 3).

Healthcare Spending

Figure 4 shows 2010 spending in aggregate for each class as a whole (panel A) and as an average (spending per person) within each class (panel B), broken down by type of care. While average spending was lowest in the few comorbidities class ($65,000 per person in 2010), their total aggregate spending as a class ($456 million) was the highest because of the large class size (33% of the sample).
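The relationship between the two panels is simple arithmetic: aggregate spending for a class is its size times its average per-person spending. The short Python check below uses the figures reported in the text for the few comorbidities class (N = 7014 and $65,000 per person); because the reported average is presumably rounded, the product should be read as approximate. The function name is hypothetical.

```python
def aggregate_spending(n_patients: int, mean_per_person: float) -> float:
    """Total class spending implied by class size and average spending."""
    return n_patients * mean_per_person


few_comorbidities_total = aggregate_spending(7014, 65_000)
print(f"${few_comorbidities_total / 1e6:.0f} million")  # prints "$456 million"
```

This shows how a class with the lowest per-person average can still carry the largest aggregate total when it contains a third of the cohort.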

In contrast, the acute exacerbation of chronic illness and neurologic and catastrophic classes had the highest average per person spending ($109,900 and $98,200 respectively). Patients in the ESRD and cardio-vascular and pulmonary classes had lower average per person spending ($66,000 and $67,500 respectively). For all classes except ESRD, hospital services accounted for most of total spending (Fig. 4).

Persistence of high spending was limited over four subsequent years. Quarterly median individual spending by class is shown in Figure 5. Substantial regression to the mean after 2010 is apparent in all classes except the ESRD and diabetes with MCCs classes. In every class, there remained individual patients with very high spending at points in time. However, only 2126 patients (10%) sustained top 1%-level spending over all 5 years (data not shown).
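One way to operationalize persistence, sketched here under the assumption that a set of top-1% member IDs is available for each year, is to intersect those yearly sets. The function name and the toy IDs are hypothetical, and the study's exact definition of sustained top 1%-level spending may differ.

```python
from typing import Dict, Set


def persistently_high_cost(top_by_year: Dict[int, Set[str]]) -> Set[str]:
    """Members who appear in the high-cost set in every year supplied."""
    if not top_by_year:
        return set()
    return set.intersection(*top_by_year.values())


# Toy example with three hypothetical members over three years.
toy_years = {
    2010: {"a", "b", "c"},
    2011: {"a", "c"},
    2012: {"a", "b"},
}
persistent = persistently_high_cost(toy_years)  # only "a" is present every year
```

With the counts reported above, 2126 of 21,183 patients corresponds to roughly 10% of the original cohort.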

DISCUSSION

A rich understanding of the segmentation of high-cost patients is foundational to any strategy to improve their care.¹ In this analysis of the top 1% in a large integrated healthcare system, we took advantage of an all-payer population with comprehensive data across the continuum of care. We developed a detailed characterization of high-cost adults using a person-centered, data-driven approach. Our analysis is the largest to our knowledge, demonstrating the feasibility of LCA with a large population and more than 50 variables used in fitting the model.

We identified seven classes of high-cost patients, clustering based on categories of diagnosis codes alone. The constellations of conditions present in each of the latent classes showed clinically intuitive patterns of comorbidity. As opposed to classifications based on expert judgment, LCA directly reflects the patterns of comorbidity that exist in the data, and offers quantitative information about the quality of the model fit. Of the 21,183 participants in our study, more than 85% had 0.70 or greater posterior probability of membership in the class to which they were assigned, and all classes exceeded the in-class average membership probability threshold established in the literature. Compared to alternative empirical approaches, LCA has advantages such as the option to include covariates and the ability to prospectively assign other subjects to proven clusters if they have measurements for the same set of manifest data.

Our results both confirm the prior literature and add nuance to the clinical segmentation of high-cost adults. In particular, we demonstrate that chronically ill high-cost patients are not monolithic, but rather segment into distinctive subtypes. We suggest that it is the combinations of comorbidities, rather than specific individual conditions, that characterize these high-cost patients. One potential policy action supported by our findings is continued movement toward clinical practice guidelines that address or incorporate multi-morbidity, to improve clarity about evidence-based management of these complex patients.²⁷

Several of the classes of high-cost patients that arose in our analysis align with the existing literature, such as groups characterized by catastrophic injury or illness, neurological disease, renal disease, and minimal comorbidity.^{8, 17} However, our work makes an important contribution as we used an all-payer population (vs. Medicare-only data) and an expansive list of condition indicators (vs. a limited list) based on a comprehensive inventory of diagnosis codes (vs. self-report). Closest to our findings are the subgroups reported by Whitson et al. in an LCA of 14,052 Medicare beneficiaries based on self-reported variables capturing 13 chronic conditions.⁸ However, the model reported by Whitson et al. had high misclassification error, while ours did not. A comparison of the subtypes of patients based on clinical conditions in their analysis and ours is provided in the online supplement (eTable 4).

In most of the classes we identified, average spending rapidly decreased after the high-cost year. This pattern may leave little opportunity (or cause) to intervene, since for many patients, costs resolve on their own over time. Failure to recognize this pattern could lead evaluators of pilot programs to incorrectly attribute cost reductions to their interventions. Future work should better understand individual trajectories of spending and their predictors, so that interventions can be planned for groups of patients who are expected to have sustained patterns of high spending.

Spending patterns over time were more persistent in the ESRD and diabetes with MCCs classes in our analysis, both of which contained substantial numbers of patients receiving dialysis. Dialysis costs made up a large share of total spending for the ESRD class, but hospital costs dominated in the diabetes with MCCs class. These findings suggest a need both for upstream work to prevent or delay the transition to dialysis (and to ensure optimal starts for those who do begin) and for focused effort to optimize care for these complex patients in terms of coordination, efficiency, and shared decision-making. Future work should investigate the level of diabetes control in this population, to understand whether interventions to improve self-management might be indicated.

Our study has limitations. We focused on the top 1% of patients, which is a narrow definition of patients with complex needs (but yielded a very large cohort in our population). We lack comparable cost data from prior to 2010 because of a transition to a new medical records and billing system and therefore cannot discern whether 2010 was the first high-cost year for patients in our study. Kaiser Permanente’s cost data may not be readily generalizable to other providers, but are based on a consistent cost accounting methodology (see eMethods) and can be used to understand relative differences in spending within our membership. Furthermore, our population, while more general and representative than many studied previously (which are often payor-specific), may not generalize directly to the broader US population because members receive care within an integrated system.

The LCA methodology used in our analysis is a data-driven segmentation approach,²¹ but it does rely on the investigator to select the set of manifest data for fitting the model. We used diagnosis-based condition indicators, a clinician-oriented approach. Future analyses should test other sets of manifest data for identifying clusters of complex patients. Other industries have begun assembling broader datasets, which could add substantially to our understanding of this population if new data (e.g., social needs and care preference data) and existing cross-sectoral data (e.g., from correctional authorities and social services) could be leveraged.²⁸

To minimize ascertainment bias, we required continuous enrollment during the study period. While this approach might lack sensitivity for conditions untreated during the study year, it should identify chronic conditions that persist over time, particularly in our population of high-use patients. It also identifies acute events such as trauma or heart attacks.

We did not set out to distinguish appropriate from inappropriate care or spending, and our results cannot be used to make direct inferences about opportunities for improvement. Much of the care reflected in our analysis may represent high-quality care that is concordant with patient preferences. However, a recent study focused on the top 10% of Medicare fee-for-service beneficiaries concluded that there is some potentially preventable spending within the high-cost population, particularly among frail elders.²⁹ In our analysis, many patients have advanced illness or experience an overwhelming cascade of illness, and survival is poor. Thus, we would suggest that interventions in this population should focus on improving the delivery of person-centered, high-quality, and goal-concordant care that maximizes dignity, rather than the ubiquitous focus on costs as the primary outcome. Given the dynamic nature of this population, carefully controlled studies are essential to demonstrate the impact of any intervention.^{1, 28}

In summary, our study offers a detailed descriptive characterization of high-cost patients, distinguishing seven distinctive classes of patients with various combinations of acute and chronic conditions. While the classes were composed on the basis of clinical conditions alone, they differed substantially with respect to four-year survival and future spending. This work builds on and extends conceptual patient taxonomies and analyses driven by ex ante categorization, and it lays the foundation for future work to explore actionable insights. Additional efforts are needed to differentiate appropriate and inappropriate spending and design intervention strategies accordingly, to meet the needs of these challenging populations.

References

Blumenthal D, Chernof B, Fulmer T, Lumpkin J, Selberg J. Caring for High-Need, High-Cost Patients — An Urgent Priority. N Engl J Med. 2016;375(10):909–11. https://doi.org/10.1056/NEJMp1608511
Bodenheimer T, Berry-Millett R. Follow the Money — Controlling Expenditures by Improving Care for Patients Needing Costly Services. N Engl J Med. 2009;361(16):1521–3. https://doi.org/10.1056/NEJMp0907185
Porter ME, Pabo EA, Lee TH. Redesigning primary care: a strategic vision to improve value by organizing around patients’ needs. Health Aff (Millwood) 2013;32(3):516–25. https://doi.org/10.1377/hlthaff.2012.0961
Wajnberg A, Long J. Transforming Care for High Cost, High Needs Patients to Improve Experience, Quality, and Spending. SGIM Forum 2017;40(11):7–10.
Ganguli I, Orav EJ, Weil E, Ferris TG, Vogeli C. What Do High-Risk Patients Value? Perspectives on a Care Management Program. J Gen Intern Med 2018;33(1):26–33. https://doi.org/10.1007/s11606-017-4200-1
Esri. Tapestry Segmentation. 2016. http://www.esri.com/landing-pages/tapestry. Accessed July 18 2018.
Vuik SI, Mayer EK, Darzi A. Patient Segmentation Analysis Offers Significant Benefits For Integrated Care And Support. Health Aff (Millwood) 2016;35(5):769–75. https://doi.org/10.1377/hlthaff.2015.1311
Whitson HE, Johnson KS, Sloane R, Cigolle CT, Pieper CF, Landerman L, et al. Identifying Patterns of Multimorbidity in Older Americans: Application of Latent Class Analysis. J Am Geriatr Soc 2016;64(8):1668–73. https://doi.org/10.1111/jgs.14201
Rinehart DJ, Oronce C, Durfee MJ, Ranby KW, Batal HA, Hanratty R, et al. Identifying Subgroups of Adult Superutilizers in an Urban Safety-Net System Using Latent Class Analysis: Implications for Clinical Practice. Med Care 2018;56(1):e1-e9. https://doi.org/10.1097/mlr.0000000000000628
Gould MK, Munoz-Plaza CE, Hahn EE, Lee JS, Parry C, Shen E. Comorbidity Profiles and Their Effect on Treatment Selection and Survival among Patients with Lung Cancer. Ann Am Thorac Soc 2017;14(10):1571–80. https://doi.org/10.1513/AnnalsATS.201701-030OC
Hastings SN, Whitson HE, Sloane R, Landerman LR, Horney C, Johnson KS. Using the past to predict the future: latent class analysis of patterns of health service use of older adults in the emergency department. J Am Geriatr Soc 2014;62(4):711–5. https://doi.org/10.1111/jgs.12746
Lee NS, Whitman N, Vakharia N, Taksler GB, Rothberg MB. High-Cost Patients: Hot-Spotters Don’t Explain the Half of It. J Gen Intern Med 2016;32(1):28–34. https://doi.org/10.1007/s11606-016-3790-3
Aldridge MD, Kelley AS. Appendix E: Epidemiology of Serious Illness and High Utilization of Health Care. In: Institute of Medicine, editor. Dying in America: Improving Quality and Honoring Individual Preferences Near the End of Life. Washington, DC: The National Academies Press; 2015.
Clough JD, Riley GF, Cohen M, Hanley SM, Sanghavi D, DeWalt DA, et al. Patterns of care for clinically distinct segments of high cost Medicare beneficiaries. Healthc (Amst). 2016;4(3):160–5. https://doi.org/10.1016/j.hjdsi.2015.09.005
Billings J, Raven MC. Dispelling an urban legend: frequent emergency department users have substantial burden of disease. Health Aff (Millwood) 2013;32(12):2099–108. https://doi.org/10.1377/hlthaff.2012.1276
Coughlin TA, Long SK. Health care spending and service use among high-cost Medicaid beneficiaries, 2002-2004. Inquiry. 2009-2010;46(4):405–17.
Johnson TL, Rinehart DJ, Durfee J, Brewer D, Batal H, Blum J, et al. For Many Patients Who Use Large Amounts Of Health Care Services, The Need Is Intense Yet Temporary. Health Aff (Millwood) 2015;34(8):1312–9. https://doi.org/10.1377/hlthaff.2014.1186
Kaiser Permanente. About KP. 2015. https://share.kaiserpermanente.org/article/fast-facts-about-kaiser-permanente/. Accessed July 18 2018.
Koebnick C, Langer-Gould AM, Gould MK, Chao CR, Iyer RL, Smith N, et al. Sociodemographic Characteristics of Members of a Large, Integrated Health Care System: Comparison with US Census Bureau Data. Perm J 2012;16(3):37–41.
Centers for Medicare and Medicaid Services. Risk Adjustment. 2015. https://www.cms.gov/Medicare/Health-Plans/MedicareAdvtgSpecRateStats/Risk-Adjustors.html. Accessed July 18 2018.
Muthen B, Muthen LK. Integrating person-centered and variable-centered analyses: growth mixture modeling with latent trajectory classes. Alcohol Clin Exp Res 2000;24(6):882–91.
Lanza ST, Collins LM, Lemmon DR, Schafer JL. PROC LCA: A SAS Procedure for Latent Class Analysis. Struct Equ Model 2007;14(4):671–94.
Huh J, Riggs NR, Spruijt-Metz D, Chou CP, Huang Z, Pentz M. Identifying patterns of eating and physical activity in children: a latent class analysis of obesity risk. Obesity (Silver Spring). 2011;19(3):652–8. https://doi.org/10.1038/oby.2010.228
Nagin DS, Odgers CL. Group-based trajectory modeling in clinical research. Annu Rev Clin Psychol 2010;6:109–38. https://doi.org/10.1146/annurev.clinpsy.121208.131413
Schmiege SJ, Meek P, Bryan AD, Petersen H. Latent variable mixture modeling: a flexible statistical approach for identifying and classifying heterogeneity. Nurs Res 2012;61(3):204–12. https://doi.org/10.1097/NNR.0b013e3182539f4c
Nagin DS. Group-based modeling of development. Cambridge: Harvard University Press; 2005.
Wilson KC, Gould MK, Krishnan JA, Boyd CM, Brozek JL, Cooke CR, et al. An Official American Thoracic Society Workshop Report. A Framework for Addressing Multimorbidity in Clinical Practice Guidelines for Pulmonary Disease, Critical Illness, and Sleep Disorders. Ann Am Thorac Soc 2016;13(3):S12–21. https://doi.org/10.1513/AnnalsATS.201601-007ST
Raven MC, Romm I, Ajayi T. Evaluating Complex Care Programs: Is It a Zero-Sum Game? NEJM Catalyst. 2017. Available at: https://catalyst.nejm.org/evaluating-complex-care-programs. Accessed 22 Aug 2018.
Figueroa JF, Joynt Maddox KE, Beaulieu N, Wild RC, Jha AK. Concentration of Potentially Preventable Spending Among High-Cost Medicare Subpopulations: An Observational Study. Ann Intern Med 2017;167(10):706–13. https://doi.org/10.7326/m17-0767


Acknowledgements

We gratefully acknowledge the following individuals who provided essential support in the completion of this work: Janet S Lee, Stephen Zuniga, and Kenneth Martinez, for their support in data assembly, and Dana C Barnes for her advice regarding data visualization. Dr. Davis and Dr. Shen had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.

Dr. Davis was supported by NIH/National Center for Advancing Translational Science (NCATS) UCLA CTSI Grant Number TL1TR000121 and by a UCLA Graduate Division dissertation year fellowship.

A preliminary version of this work was presented at the 2016 AcademyHealth Annual Research Meeting in Boston, MA.

Author information

Authors and Affiliations

Kaiser Permanente Center for Effectiveness and Safety Research, Pasadena, CA, USA
Anna C. Davis PhD
Department of Health Policy and Management, University of California Los Angeles Fielding School of Public Health, Los Angeles, CA, USA
Anna C. Davis PhD, Beth A. Glenn PhD, Ninez Ponce PhD & Jack Needleman PhD
Kaiser Permanente Southern California Clinical Operations Support, Pasadena, CA, USA
Anna C. Davis PhD & Nirav R. Shah MD
Kaiser Permanente Southern California Department of Research and Evaluation, Pasadena, CA, USA
Ernest Shen PhD & Michael K. Gould MD
Stanford University School of Medicine, Stanford, CA, USA
Nirav R. Shah MD
Department of Biostatistics, University of California Los Angeles Fielding School of Public Health, Los Angeles, CA, USA
Donatello Telesca PhD


Corresponding author

Correspondence to Michael K. Gould MD.

Ethics declarations

Conflict of Interest

The authors declare that they do not have a conflict of interest.

Additional information

Dr. Anna C Davis is no longer affiliated with Department of Health Policy and Management, University of California Los Angeles Fielding School of Public Health, or with Kaiser Permanente Southern California Clinical Operations Support. Dr. Nirav R Shah is no longer affiliated with Kaiser Permanente Southern California Clinical Operations Support.

Electronic Supplementary Material

ESM 1

(DOCX 130 kb)


About this article

Cite this article

Davis, A.C., Shen, E., Shah, N.R. et al. Segmentation of High-Cost Adults in an Integrated Healthcare System Based on Empirical Clustering of Acute and Chronic Conditions. J GEN INTERN MED 33, 2171–2179 (2018). https://doi.org/10.1007/s11606-018-4626-0


Received: 26 February 2018
Revised: 21 June 2018
Accepted: 02 August 2018
Published: 04 September 2018
Issue Date: December 2018
DOI: https://doi.org/10.1007/s11606-018-4626-0
