INTRODUCTION

More than two-thirds of Medicare beneficiaries have two or more chronic conditions, and more than 15 % present with six or more.1,2 As stated in the 2010 US Department of Health and Human Services “Strategic Framework on Multiple Chronic Conditions”,3 the aging of the U.S. population warrants a detailed investigation into specific combinations of conditions associated with adverse health outcomes to identify the most vulnerable subgroups of older individuals and their health care needs. Such an inquiry would also advance patient-centered outcomes research in important ways,4 by moving away from a paradigm based on the identification of an index disease.5,6

The study of multimorbidity faces numerous challenges, including the lack of a uniform definition.5,79 Given the ubiquitous use of administrative data, however, multimorbidity has often been equated with multiple chronic conditions, especially when these conditions are identified from claims data.1012

In a previous study,13 we argued that multimorbidity should be redefined to include the co-occurrence of chronic conditions, functional limitations, and geriatric syndromes. We demonstrated the strong cumulative effects of their co-occurrence on adverse health outcomes, including self-report of fair/poor health, self-rated worse health at 2 years, and 2-year mortality.

In this study, we expand on the above-referenced study, by identifying specific combinations of conditions within and across the broad rubrics of chronic conditions, functional limitations, and geriatric syndromes that are particularly relevant in predicting poor health outcomes. We address important limitations noted in previous studies on multimorbidity:

First, rather than relying on administrative claims data, which are not optimal when gauging functional limitations and geriatric syndromes, we use data from the U.S. Health and Retirement Study (HRS). The HRS includes a rich array of longitudinal data that can be used to characterize, in detail, both multimorbidity and health outcomes.

Second, given the inadequacies presented by traditional statistical methods to study combinations of numerous conditions, we rely on a tree-based analytic approach—the Classification and Regression Tree analysis (CART), a modern version of the original CART by Breiman et al.,14 described in detail below. CART allows for an exhaustive search of all possibilities by identifying splitting variables, and groups individuals in empirically emerging homogeneous, but mutually exclusive categories relative to a given outcome of interest.15

We hypothesize that the combinations of conditions that are relevant in predicting health outcomes include functional limitations and/or geriatric syndromes, rather than chronic conditions only.

METHODS

Overview

This study uses the 2010–2012 HRS public use files to identify specific combinations of chronic conditions, functional limitations, and geriatric syndromes that are most prominent in predicting the following health outcomes: current fair/poor health status, self-rated worse health at 2 years, and 2-year mortality.

This study was approved by the Case Western Reserve University Institutional Review Board.

Data Source and Study Population

Sponsored by the National Institute on Aging, the HRS is an ongoing, nationally representative longitudinal survey of approximately 30,000 older adults. The HRS collects biennial data on a broad array of measures, including demographics; chronic conditions; self-reported health status; functional status (e.g., upper/lower body limitations, activities of daily living (ADL) limitations); and geriatric syndromes (e.g., depressive symptoms, and incontinence).

The 2010 wave of the HRS contains data on 19,924 living respondents 50 years of age or older, including data on 1382 proxy respondents (typically the spouse) who provided answers for the focal respondents. In the 2012 wave, 2279 respondents were lost to follow-up (not interviewed and not deceased). We excluded 555 respondents with missing information on relevant variables. Those excluded were sicker than those who remained in the study population, as measured by the percentage of individuals who rated their health as fair/poor (38.8 % vs. 28.8 %, p < 0.01), for example.

The study population included 16,640 respondents.

Variables of Interest

Outcomes

We created the following three binary outcomes:

Self-rated fair or poor health status in 2010 (vs. excellent, very good or good).

Self-rated worse health, coded as ‘1’ if the respondent rated his/her health as “worse” (vs. “better” or “about the same”) in 2012 compared to when s/he was interviewed in 2010.

Deceased status in 2012, usually reported by a spouse or other relative or as identified through a probabilistic National Death Index match.

Primary Predictors

Chronic Conditions

Chronic conditions were identified by whether the respondent was ever told by a physician that s(he) had hypertension, heart disease (including subtype of the disease—e.g., myocardial infarction or congestive heart failure), lung disease, diabetes, stroke, arthritis, cancer, or psychiatric conditions. Additional questions assessed whether the respondent was being treated for a given condition (e.g., medication for hypertension, or oxygen for lung disease), and/or that a condition caused limitations in usual activities. For each condition, we categorized respondents as no disease (=0), self-reported diagnosis as mild (=1), and active treatment/limitations as severe (=2). We also identified whether a respondent was ever told by a physician that she or he had Alzheimer's disease or other dementia.

For functional performance, the respondent was asked to assess whether s(he) has difficulty with a specific task, using validated questions.16 Functional limitations were grouped in the following categories: a) Strength limitations (e.g., pulling/pushing a large object; lifting ten pounds); b) upper body limitations (e.g., picking up a dime; reaching overhead); c) lower body limitations (e.g., walking one block; climbing stairs); d) Activities of Daily Living (ADL: e.g., bathing; dressing); and Instrumental Activities of Daily Living (IADL: e.g., preparing meals, managing money). For parsimony, some of our analyses incorporated broader, rather than more detailed, categories constituting functional limitations (e.g., IADL, instead of each of the items contained in that rubric).

Geriatric Syndromes

Geriatric syndromes are conditions commonly experienced by older individuals,17 including: a) visual impairment (self-rated as fair or poor when wearing corrective lenses as usual, or legally blind); b) hearing impairment (self-rated as fair or poor when wearing hearing aid as usual); c) moderate or severe depressive symptoms (≥ 4 symptoms on the eight-item Center for Epidemiologic Studies Depression Scale [CES-D]); d) urinary incontinence; e) low cognitive performance (< 11, based on the 35-point Telephone Interview Cognitive Status scale),18 or a proxy reporting that the respondent’s cognitive performance was fair or poor; f) persistent dizziness or lightheadedness; and g) severe pain (“often troubled by”). Conditions for which we observed relatively high percentages of missing values (e.g., depressive symptoms), we created a category of ‘missing.’

Other Variables of Interest

Sociodemographic variables: Age was grouped in 5-year increments. All respondents older than 85 years of age were grouped in the ≥ 85 category to protect privacy. Race/Ethnicity included four categories: White non-Hispanic, Black non-Hispanic, Hispanic, and Other. Marital status was identified as Married, Divorced, Widowed, or Never Married. Years of education were grouped in six categories: < 9, 9–11, 12, 13–15, 16, and ≥ 17. Income adjusted for household size was expressed as the ratio of household income to the federal poverty level, as follows: < 100, 100–199, 200–299, and ≥ 300 %. Smoking status was grouped in three categories (never smoked, former smoker, and current smoker), as was alcohol use (none, moderate, heavy and missing). We characterized body mass index (BMI, measured as kg/m2) as underweight (BMI ≤ 18.5), normal/overweight (BMI of 18.5–30), obese (BMI ≥ 30) or missing (2 % of respondents). Vigorous exercise was measured as a dichotomous variable for engagement in vigorous sports or activities more than once a week. Self-rated fair or poor health status in 2010 was also used as predictor for prospective health outcomes.

Statistical Analyses

We used modern generalizations of the classification and regression tree (CART) analysis to identify combinations of predictors associated with the outcomes of interest. CART is a nonparametric, machine-learning method that recursively partitions the values of each of the predictor variables into two sets so that the values of the outcome variable are as homogeneous as possible in each set.14 Each predictor variable is considered as a potential split, including every grouped value of a numerical predictor. The optimal split is the one with the largest reduction in an impurity index, which measures the extent of misclassification at a given node. We used the following stopping criteria (based on model-tuning described below): a maximum tree depth of five splits, a minimum node size of 50 subjects (25 for mortality outcome), requiring a split to increase the complexity parameter by a minimum of 0.001, and using the information impurity index to determine node splits.

To build our model, we partitioned the study data into training data sets (70 % of the data) and validation data sets (remaining 30 %) using random sampling within each class of the outcome variables. We used ten-fold cross-validation repeated three times on the training data set testing to build the CART models. Since mortality is highly unbalanced, we weighted the “cost” of a false negative to be higher than a false positive in order to improve sensitivity and produce a more meaningful model. We then tested the accuracy of our models on the validation data set. We chose our final model for each outcome based on accuracy and interpretability. Additionally, we compared our final models with those from a random forest model to see if they were in agreement on variables that are the most important predictors (data not shown). Random forest is a bootstrap aggregation method that creates multiple decision trees using random variable selection. Breiman et al. provide a detailed description of random forest.19 We used SAS for data management; for our statistical analyses, we used R version 3.1.3 and the ‘rpart’ (CART), partykit (tree graphics), ‘randomForest’ (random forest), and ‘caret’ (model tuning and cross-validation) packages.

RESULTS

Table 1 presents the distribution of sociodemographic characteristics and behavioral factors for the total study population and by self-reported health, self-rated worse health at 2 years, and 2-year mortality. Twenty-nine percent of respondents reported fair/poor health, 23 % reported 2-year decline, and 5 % died by the end of the 2-year follow-up period. These proportions varied considerably across subgroups of the population. Self-reported fair/poor health status was highest among the oldest respondents (39 %); those of Hispanic origin (48 %); persons with low incomes or educational attainment (53 and 57 %, respectively); current smokers (40 %); those who are underweight (48 %); and among those with proxy respondents (54 %). Self-rated worse health at 2 years followed a pattern similar to that of self-reported fair/poor status, and 2-year mortality was markedly higher among those who were older, widows, underweight; and those with proxy respondents.

Table 1 Sociodemographic Characteristics and the Prevalence of the Outcomes of Interest Across Subgroups of the Study Population

Table 2 presents the percent of respondents with chronic conditions, functional limitations, and geriatric syndromes. The percent of respondents presenting with our outcomes of interest varied markedly by the presence of these conditions. We observed the highest levels of fair/poor self-reported health among respondents with severe lung disease (75 %); limitations in ADLs or IADLs (76 and 65 %, respectively); severe pain (77 %), depressive symptoms (62 %), and visual impairment (61 %).

Table 2 Prevalence of the Outcomes of Interest in the Presence of Chronic Conditions, Functional Limitations, and Geriatric Syndromes

The percent of respondents with self-rated worse health at 2 years was highest among those with severe lung disease (42 %); limitations in ADLs and IADLs (40 and 37 %); severe pain (44 %) and depressive symptoms (40 %).

Two-year mortality was highest among respondents with Alzheimer’s disease (35 %), limitations in ADLs and IADLs (24 and 16 %, respectively), and poor cognitive functioning (27 %).

Figures 1, 2, and 3 show the trees obtained from our Classification and Regression Tree (CART) analysis for each of the three outcomes: fair/poor health status, self-rated worse health at 2 years, and 2-year mortality. See the Appendix for the more detailed analysis.

Figure 1.
figure 1

Classification and regression tree analysis for self-reported fair/poor health. IADL: Instrumental activities of daily living.

Figure 2.
figure 2

Classification and regression tree analysis for self-rated worse health at 2 years.

Figure 3.
figure 3

Classification and regression tree analysis for 2-year mortality. IADL: Instrumental activities of daily living.

Figure 1 shows that the combination of specific conditions associated with the highest percentage of fair/poor health consists of difficulty walking several blocks, depressive symptoms, and severe pain (nodes 1, 22, and 28, leading to the second bar from the right). Indeed, over 80 % of respondents with this combination of conditions reported fair/poor health. Conversely, the lowest prevalence of fair/poor health was observed among respondents with no difficulty walking several blocks, no impaired vision, but reporting incomes at or exceeding 300 % of the FPL (nodes 1, 2, and 3, leading to a prevalence of fair/poor health of less than 10 %).

Figure 2 shows that the highest percentage of self-rated worse health at 2 years (about 60 %) was observed among respondents who had difficulty walking several blocks, who self-rated their health as fair/poor, and who were 68.5 years of age or older (nodes 1, 23, and 31). When difficulty walking several blocks co-occurred with reporting fair/poor health in individuals who are younger than 68.5 years of age, it was the presence of depressive symptoms that placed them at a high risk of reporting self-rated worse health, at nearly 50 % (nodes 1, 23, 31, and 32). In contrast, the lowest percentage of self-rated worse health (less than 10 %) was observed among those who had no difficulty walking several blocks, rated their health as Excellent, Very Good, or Good, and had no arthritis (nodes 1, 2, and 3, leading to the first bar at the left hand side).

Figure 3 shows that the most important element in predicting mortality is age, which is split at 80.5 years. Thus, the highest percentage of 2-year mortality (slightly over 40 %) was observed among those who were 80.5 years of age or older, and who had limitations in both ADLs and IADLs (nodes 1, 19, and 27). The second highest percentage (at slightly lower than 40 %) was observed among respondents who had limitations in IADLs, difficulty pulling/pushing a large object, and were 84.5 years of age or older (nodes 1, 19, 28, and 32, leading to the second bar from the right).

Both Figs. 1 and 2 identified difficulty walking several blocks as the most important variable predicting both self-reported fair/poor health and 2-year worse health. The models seldom identified chronic conditions as splitting variables at critical junctures when predicting any of our three outcomes.

DISCUSSION

In this study, we analyzed health outcomes (self-reported fair/poor health, self-rated worse health at 2 years, and 2-year mortality) in a U.S. representative sample of older adults, and identified combinations of conditions that are most relevant in predicting these outcomes. The most remarkable finding was that, rather than chronic conditions, the combination of functional limitations and geriatric syndromes was most important in predicting health outcomes. Indeed, when all of these conditions were accounted for, and when CART conducted an exhaustive search of all the possibilities,15 the model determined the most important “splitting” variables to be age, functional limitations, and geriatric syndromes, rather than chronic conditions. This is akin to the disablement process described by Verbrugge and Jette.20

These findings have important implications for research and clinical practice. Methodologically, these findings imply that accounting for chronic conditions alone may not be as informative as accounting for the co-occurrence of functional limitations and geriatric syndromes, which appear to drive health outcomes in older individuals. However, research in this realm has relied on International Coding of Diseases 9th Clinical Modification (ICD-9-CM) diagnosis codes documented in administrative data, which present a challenge for identifying disability21 and geriatric syndromes. As a result, multimorbidity has often been equated to multiple chronic conditions, and the importance of functional limitations and geriatric syndromes in predicting health outcomes has seldom been examined in relation to health outcomes.

With regard to clinical practice, the findings call for the use of such instruments as the Comprehensive Geriatric Assessment to evaluate the health status and clinical presentation of older individuals in a way that makes it possible to identify subgroups of older adults that are most vulnerable to experience adverse outcomes,2225 especially when making clinical decisions. For example, our findings document the importance of visual impairment in predicting health outcomes, and yet this condition has been mostly ignored in the literature. As such, this analysis constitutes an important step towards a more personalized approach when characterizing patients with multimorbidity.

To our knowledge, this is the first study to identify combinations of specific conditions that are associated with adverse health outcomes using modern CART. The main advantage of using this analytic approach lies in our ability to learn about empirically emerging combinations of conditions instead of having our queries guided solely by a priori hypotheses. In addition, an important strength of this study consists of characterizing multimorbidity based on a rich array of nationally representative data from the HRS, including sociodemographic variables, behavioral factors, and a number of clinically pertinent measures, which we were able to characterize by level of severity. Finally, we note that rather than just including those 65 years of age and older, our study population included those 50–64 years of age, given that a sizable percentage (18.2 %) present with co-occurring chronic conditions, functional limitations, and geriatric syndromes.13 This compares to 33.3 % in their older counterparts.13 In addition, our age-stratified analysis yielded CART trees that also underscored the importance of functional limitations and geriatric syndromes (rather than chronic conditions) driving health outcomes.

This study has several limitations: First, all of the conditions accounted for are self- or proxy-reported, and the chronic conditions examined are limited to the ten conditions reported in the HRS. However, our access to the linked HRS-Medicare data allowed us to repeat our analysis using claims-derived chronic conditions as per the Chronic Conditions Data Warehouse (CCW) that are flagged in the Medicare Beneficiary Summary File. The CART trees obtained using the CCW conditions instead of the ten self-reported chronic conditions did not change markedly, even though this change in analytic strategy also required that we limit our study population to Medicare fee-for-service beneficiaries. Second, while the CART uses the entire dataset, it produces a single tree. This is in contrast with the random forest approach, which uses subsets of the data, as well as bootstrapping, to identify the most important predictors. On the other hand, it is not possible to use the random forest approach to identify combinations of conditions, which was the primary objective of this study. Third, because of the randomness in splitting data into particular training and testing subsets involved with building trees, our figures may look somewhat different if they are regenerated by others. However, we conducted thorough sensitivity analysis; and the trees presented herein are the ones that produced the best statistics relative to their accuracy and predictive abilities (see Online supplemental data). Finally, we note that our study population was exclusive of 555 respondents who differed significantly from others (e.g., in self-reported health status). However, given that they represented only 3 % of the population, it is unlikely that their exclusion affected our results, except possibly by yielding more conservative estimates.

In conclusion, this study highlighted the importance of characterizing multimorbidity in broader terms, rather than limiting its definition to the co-occurrence of chronic conditions alone. Future studies should analyze additional outcomes, including health care utilization and costs.