Introduction

Currently, approximately 900 million individuals worldwide are aged 60 years or older and this number is expected to nearly double by 2050 [1]. Amongst those reaching this age in good health, future years of life will commonly be marked by functional decline involving physical disability or cognitive impairment, leading to a loss of independence and significant costs to society [2].

The ability to prolong the time spent in good health, living without dependence on others, has become a fundamental societal goal [3,4,5]. For this reason, disability-free survival has emerged as an important geriatric research outcome in contrast to disease-specific outcomes such as cardiovascular disease or dementia [4]. Particularly in older people, the maintenance of good health requires the avoidance of multiple interacting co-morbidities and chronic conditions, many with shared risk factors and management strategies [6,7,8]. Understanding the major preventable determinants, not only of individual diseases but also of ongoing good health [9], can be facilitated by development of prediction models for disability-free survival.

Previous studies have identified risk factors for a range of specific geriatric outcomes including frailty, physical disability, dementia and death. In addition to age, these have included abnormal body mass index (BMI), smoking, diabetes, abnormal blood lipids, chronic kidney disease, low level of physical activity, low gait speed, presence of depression, subclinical cardiovascular disease, specific diets and lack of social support [10,11,12,13,14,15,16,17,18,19]. These studies have typically been limited to the evaluation of pre-specified relationships between predictors and specific outcomes, rather than a composite measure of functional independence. Similarly, no previous study has employed a machine-learning approach to enable objective variable selection [20].

The ASPirin in Reducing Events in the Elderly (ASPREE) placebo-controlled trial of low-dose aspirin was the first large-scale clinical trial that utilized disability-free survival as the prespecified primary endpoint. Measures of physical disability and cognition were assessed systematically throughout follow-up in all subjects in detail, which is rarely available in a non-trial setting. In this analysis, we used the ASPREE data to develop and validate a prediction model for disability-free survival in a population of relatively healthy individuals aged 65 or more when recruited. Machine-learning techniques allowed the relative importance of key predictors to be determined in an unbiased and objective fashion. Specifically, we sought to identify modifiable risk factors that could identify individuals at risk and prioritize preventive policies.

Methods

Study population and trial design

ASPREE was a large, randomized, double-blind, placebo-controlled trial investigating the efficacy of 100mg aspirin on extending disability-free survival in a healthy elderly population. The trial design and primary results have been published previously [21,22,23,24]. Briefly, 19,114 community-dwelling individuals without prior cardiovascular events, dementia or physical disability were randomized to low-dose enteric-coated aspirin or placebo and followed for an average of 4.6 years. All individuals were at least 70 years old (≥ 65 years for US minorities). Exclusion criteria included a previous diagnosis of cardiovascular events (including myocardial infarction, heart failure, angina pectoris, stroke, transient ischemic attack, 50% carotid artery stenosis or previous carotid endarterectomy or stenting, coronary artery angioplasty or stenting, coronary artery bypass grafting or abdominal aortic aneurysm), atrial fibrillation, evidence of dementia or major cognitive impairment, inability to perform independently any basic Katz activity of daily living (ADL), or a serious illness with a life expectancy of less than 5 years. All participants provided written informed consent. The trial was approved by the local ethics committees and is registered on clinicaltrials.gov (NCT01038583). Requests for data access will be via the ASPREE Principal Investigators with details for applications provided through the web site, www.ASPREE.org.

Baseline collection, definition and selection of potential predictors

A total of 25 candidate predictors were selected on the basis of previous outcome studies related to dementia, disability and death [11, 17,18,19, 25] and their likely availability in typical clinical practice. The preselection of candidate predictors was also performed to minimize the possibility of including noise variables in the final model. The variables were collected as part of the initial standardized screening process prior to participant enrolment in ASPREE and included:

  • demographics (age, living status, years of education, ethnicity/race),

  • prevalent conditions or risk factors (diabetes, smoking history, alcohol consumption, family history of myocardial infarction),

  • laboratory results (high-density lipoprotein cholesterol [HDL-c], non-HDL-c, estimated glomerular filtration rate [eGFR], haemoglobin),

  • physical measurements (systolic and diastolic blood pressure, BMI, abdominal circumference, grip strength, gait speed),

  • medication use (antihypertensive agents, lipid-lowering agents and in-trial randomization to aspirin treatment),

  • measures of cognitive function (Modified Mini-Mental State Examination [3MS] score) and depression (Centre for Epidemiological Studies-Depression-10 questions [CES-D] score, as a binary variable ≥ 8).

Details related to all predictors are provided in the Supplementary Material.

Outcome ascertainment

The primary end point of disability-free survival was defined as survival free from dementia or persistent physical disability and was used as a surrogate for functional independence [21]. Physical disability and dementia were combined in the primary end point, as they represent the important reasons why individuals lose the ability to live independently. Participants without the documented outcome were censored at 5 years or at the last date they were known to be event-free, whichever came first. To assess this composite endpoint, all participants had serial in-person visits, which included standardized assessments of cognitive function with a cognitive battery, and dementia assessment if triggered, and physical disability, defined as an inability to perform, having severe difficulty in performing, or requiring assistance to complete at least one of the six basic ADL by self-report [26]. This was considered to be persistent if the same ADL loss was confirmed six months later.

Dementia was adjudicated based on the Diagnostic and Statistical Manual of Mental Disorders-IV (DSM-IV) criteria [27]. The ascertainment of death has been described before and was based on information collected from at least two sources including close contacts, physicians, public death notices and by linkage to the Australian National Death Index and US National Death Index [23]. Dementia and cause of death were adjudicated and confirmed by the respective endpoint committees blinded to treatment allocation.

Statistical analyses

Prognostic model development

Separate models were developed for men and women due to their known differing risk factor profiles. The statistical model employed was the Cox proportional hazards regression model. Prior to selection of variables, we tested for non-linearity of continuous variables. Specifically, a complete case analysis was initially fitted with all candidate predictors, in which each continuous variable was modelled with a restricted cubic spline function with 2 degrees of freedom. Statistically significant non-linear relations were found for BMI, HDL-c, waist circumference and gait speed in the model for women; and non-HDL-c, eGFR, BMI and waist circumference in the model for men.

Data was missing for 11 variables with the proportion ranging from very low, <0.1% (for years of education, haemoglobin and CES-D) to as high as 8% (for family history of myocardial infarction). The total proportion of missing values was 1% but using a complete case analysis would have discarded about 15% of the sample size (see Table S1). Hence, multiple imputation was performed using chain equations, assuming data were missing at random [28]. The imputation model included all candidate predictors and other potentially relevant variables. Non-linear BMI, waist circumference, gait speed, eGFR and non-HDL-c were imputed passively [29]. The survival outcomes were included as a combination of Nelson-Aalen estimator of the cumulative hazard function and the outcome status [30]. Five data sets were imputed for men and women separately, based on the amount of missing data [31].

Variable selection was performed using the machine-learning technique “group least absolute shrinkage and selection operator” (group-lasso) in combination with bootstrapping [32]. The group-lasso model ensures that categorical variables, or linear and non-linear terms of continuous variables, are included or excluded in the model altogether. The penalty parameter, which regulates the amount of shrinkage, was selected by optimizing the 10-fold cross-validation prediction error. We chose a suboptimal penalty based on the “one-standard-error” rule to obtain a more parsimonious model [33].

The combination with bootstrap can be described as follows: for each imputed data set, we randomly drew 100 bootstrap samples with replacement, and performed group-lasso selection on each bootstrap sample. The variable inclusion frequency (VIF) over 100 × 5 = 500 models was calculated for each predictor. A variable with VIF ≥ 60% was then included in the final model. The final model was derived by refitting the selected predictors to each of the 5 imputed datasets, and combining parameter estimates using Rubin’s rule [34]. Variable selection based on VIF has previously demonstrated good performance in the presence of imputed data [35].

Model performance

The area under the cumulative/dynamic ROC curve (AUC) at 5 years was used to assess discrimination [36]. AUC ranges from 0.5 to 1 with a higher value indicating better ability to discriminate those who developed events and those who did not. Harrell’s calibration plot at 5 years was used to assess the agreement between predicted and observed risks [37]. The apparent performances were obtained by evaluating the final model on the development samples, which were used to build the models (averaged across 5 imputed data sets).

Model validation

To quantify the degree of optimism due to overfitting in performance measures, we implemented internal validation using the enhanced bootstrap resampling procedure [37, 38]. The optimism was calculated as follows. From the original data, 100 random bootstrap samples were drawn. For each sample, we repeated the model development procedure (including imputation and lasso selection) as outlined above to obtain a bootstrap final model. We then calculated the difference between bootstrap apparent performance (averaged across bootstrap imputed data) and bootstrap test performance (averaged across original imputed data). Finally, these differences were averaged across 100 bootstrap samples to obtain the single estimate for the optimism. The procedure is illustrated in Figure S1. The estimated optimism was then subtracted from the apparent performance to obtain the bias-corrected predictive performance.

All analyses were conducted using R version 4.0.2 with the companion R packages: mice, grpreg, rms, survival, timeROC, pec.

Results

Population characteristics

A total of 19,114 community-dwelling Australian and US ASPREE participants were included in this study. The median age of the trial population was 74 years (interquartile range [IQR] 71.6–77.7) and 10,782 (56.4%) were women. Baseline characteristics for the overall population of men and women are reported in Table 1.

Table 1 Baseline characteristics of the trial population

During the 5-year follow-up (median (IQR) 4.6 (3.5–5.6) years), 795 (7.3%) female participants and 799 (9.6%) male participants developed dementia or persistent physical disability or died. Death accounted for 54% of the total events in men and 43% in women (Table S2).

Model development for risk of the primary composite outcome

Results of the variable inclusion frequency are shown in Table S3. The multivariable Cox regression models which included variables with VIF ≥ 60% are presented in Table 2. Specifically, increasing age, lower 3MS score, lower gait speed, lower grip strength and abnormal (low or elevated) BMI were identified as risk factors in both sexes. In men, current smoking and eGFR were additionally selected as predictors. In women, the presence of diabetes and a CES-D score ≥ 8 were also selected as predictors.

Table 2 Multivariable Cox regression analyses based on the selected predictors of disability-free survival. Estimates were pooled across multiple imputed data sets using Rubin’s rule

The final models (Table 2) show that a 5-year increase in age corresponded to 49% and 50% increased risk of not maintaining disability-free survival in men and women respectively, while a 5-unit increase in 3MS corresponded to about 30% decreased risk in both sexes. A decrease in gait speed from 1.0 to 0.5 m/s corresponded to 64% increased risk in men, and nearly 3-fold higher risk in women. Figure S3 illustrates the non-linear relation between gait speed and the primary outcome for women. Importantly, gait speed less than 1 m/s was associated with a substantially greater negative impact on risk of disability-free survival relative to the lesser beneficial impact of faster gait speed. The relationship between BMI and the outcome was non-linear in both models (Figure S2, S3). The risk was minimal with a BMI of around 27 kg/m2, whereas both elevated and lower BMI values were associated with an unfavourable outcome.

In women, having diabetes and depression (CES-D ≥ 8) were associated with 30% and 47% increased risk of the primary composite outcome, respectively. In men, current smokers had almost 2-fold higher risk of earlier onset of death, dementia or disability compared to former/never smokers. The non-linear relation of eGFR (Figure S2) indicated that compared to a reference value of 74, a higher or lower value of eGFR was associated with a higher risk.

Model performance and calibration

The discriminative performance of each predictor and their combinations for the prediction of disability-free survival are presented in Figure 1 and Table S4. The AUCs of age alone were 0.66 and 0.65 in women and men, respectively. Adding 3MS increased the discriminative performance by about 23% in both sexes. Compared to the simplified model which consisted of age and 3MS, the final models improved the predictive performance by 24% (AUC 0.76 vs. 0.71) in women and 21% (AUC 0.73 vs. 0.69) in men (Table S4). After correcting for optimism, the final models showed good discrimination, with an AUC of 0.72 for males and 0.75 for women (Table S4).

Figure 1
figure 1

Discrimination of each selected predictor, of their combination when added sequentially in order of their AUC, and of the final models. Orange dots show the AUC at 5 years of models made with each predictor of not maintaining disability-free survival individually. Green dots show the AUC of models made by incrementally adding each predictor along the x-axis. The percentages show the added value of the current model against the previous model. Predictors are arranged by their inclusion frequency. Abbreviation: AUC, area under the curve

The final models also showed good agreement between the observed and the predicted risk of the overall outcome, although the risk for men was slightly overestimated in the higher risk categories (Figure 2). The predicted probability of the endpoint at 5 years can be calculated using the formula provided in Table 3. The illustrations for two specific participants are displayed in the supplementary material (Table S5).

Figure 2
figure 2

Calibration graph comparing the observed risk (based on Kaplan-Meier estimates) and the predicted risk with the final models, by tenths of the predicted probability. The red asterisk represents the bias-corrected predicted risks

Table 3 Sex-specific formulas for calculation of the risk prediction model for disability-free survival within 5 years

The risk distribution within each 5-year increment age group was further investigated in Figure S4. As expected, the figure demonstrates the strong association between advanced age and event risk in both sexes. Compared to older age groups (75–85+), younger age groups (65–74) appear to have a lower proportion of people in the lower risk categories.

Discussion

We utilized a machine-learning variable selection approach to identify the determinants of survival free of persistent physical disability and dementia in the ASPREE study population [21]. Disability-free survival is a surrogate measure of functional independence and was assessed by regular in-person surveillance of a group of 19,114 apparently healthy individuals participating in a large-scale trial of low-dose aspirin. All participants were free of prior cardiovascular events, disability and dementia at the commencement of the study.

During a median follow-up of 4.7 years, the average age increased from 74.0 to 78.8 years and 1,594 individuals became disabled or died. By choosing a population relatively healthy at baseline and regularly monitoring each individual for the onset of dementia or physical disability, this study provided a unique opportunity to identify and prioritize the characteristics of those most likely and those least likely to continue living independently with increasing age. A particular strength of these analyses was the objective approach used for variable selection, which was based on a machine-learning technique applied to a range of clinical variables measured at study entry, together with previously recognized risk factors for physical and cognitive decline.

Out of 25 potential biomedical and social predictors tested, 7 contributed substantially to the prediction of disability-free survival in women and 7 were predictors in men. As expected, advanced age was the most important predictor, which alone resulted in an AUC of 0.65. Cognitive function was the second most important contributor followed by gait speed. Importantly, these three predictors alone resulted in the highest AUC increase, which empathizes their importance. Additional predictors in men included grip strength, smoking, BMI and renal function. In women, diabetes, and depression scores also contributed to the model. In the case of eGFR and BMI, the association was non-linear and U-shaped, similar to that noted in prior studies of mortality and other geriatric outcomes.

Previous studies have identified similar risk factor profiles as predictors of cognitive decline, frailty, functional decline and a range of other chronic diseases of ageing [10,11,12,13,14,15,16,17,18,19]. For example, a high BMI, slow gait speed and/or the presence of diabetes or hypertension has been identified as predictors of cognitive decline. Similarly slowing gait speed, weaker handgrip, increased BMI and diabetes have been identified as predictors of physical decline and frailty. The interdependence of several of these predictors creates a web of causation in which physical disability, dementia or death can be the end result of several pathways.

The dominant impact of cognitive decline and reduced physical function suggests the possibility that interventions designed to improve cognition or physical function might alter the trajectory of decline in otherwise healthy older subjects. Recently published trials suggested that modest improvement may be possible. Amongst these, first, the LIFE study showed that the initiation of a moderate-intensity physical program in sedentary individuals aged 70–89 years modestly reduced progression to major mobility disorder, (assessed as the ability to walk 400metres). The program involving aerobic, resistance and flexibility training is provided in centre twice weekly and twice weekly at home [39]. The intervention over 2.6 years did not affect cognitive decline or reduce the incidence of cardiovascular disease [40]. Second, the SPRINT-MIND trial showed a reduction in the combined outcome of mild cognitive impairment plus probable dementia amongst older individuals on intensified blood pressure management [41]. The study involved 9,361 hypertensive older adults with increased cardiovascular risk whose mean systolic BP was 121.4 mmHg in the intensive-treatment group and 136.2 mmHg in the standard group. Third, the Finnish Geriatric Intervention Study to Prevent Cognitive Impairment and Disability (FINGER) trial investigated the efficacy of a multidomain lifestyle intervention including dietary guidance, physical activity, cognitive training, and monitoring and management of cardiovascular risk factors. The control group received general health advice. After 2 years, the intervention group improved more in executive function, processing speed and complex memory tasks regardless of genetic risk or baseline risk factor levels.

From a clinical perspective, the identification of cognitive decline and physical limitations indicate that an individual is tracking towards an earlier loss of independence. The results of the trials described above suggest that the loss of independence in older individuals may be slowed by a multifactor intervention. However, the likelihood of these interventions producing a significant slowing of the trajectory towards loss of independence is as yet unclear.

From a population perspective, the results also emphasize the relevance of addressing the determinants of physical and cognitive decline at an earlier stage in the life course. Although diet and lipids were not included as predictive factors in the final model, their role as shared determinants in the development of cognition and physical capacity at earlier stages has been well established, and they should be included in any strategy to promote healthy ageing. Furthermore, BMI and diabetes were part of the final models. A 2015 forecast of future trends in disability and life expectancy in the UK predicted an increase of 25% (over 10 years) in the number with care needs, with one quarter of the remaining life expectancy of those aged 65 years or above expected to involve a significant disability [42]. Although the increase in prevalence is largely a reflection of population ageing, it emphasizes the importance of strategies to extend disability-free survival. However, as our study results were based on an older population, future longitudinal studies are warranted to evaluate the impact of these determinants.

Strengths of this study include the extensive data collected from a large population with relatively small amounts of missing data. The measurement of disability-free survival required data concerning the time of onset of physical disability, dementia and death, with diagnoses of dementia confirmed by a specialist adjudication panel; systematic collection of information of this type is rarely available outside the context of a clinical trial. The outcome data was accompanied by measurement of a wide range of clinically relevant and recognized geriatric potential prediction variables (biomedical and social) collected near the time of study initiation.

The limitations include that the majority of the study population were white Caucasians, most of whom lived in Australia, a high-income country with universal healthcare services. As a whole, the cohort had extensive use of preventive medications including statins (in 30%) and antihypertensive agents (in 35%), indicating a high level of preventive care. The generalizability of the study is also restricted to individuals reaching the 70-plus age group in relatively good health free of prior cardiovascular disease, dementia or significant physical limitations. It is also possible that there is residual confounding, and future research may identify new, relevant predictors. Finally, we were not able to perform an external validation, as comparable datasets are missing.

In conclusion, in a population of healthy older people, both modifiable and non-modifiable characteristics successfully identified individuals at higher risk of dying or developing dementia or physical disability within 5 years. After age, a low 3MS score was the strongest predictor followed by slow gait speed with a lesser contribution by other risk factors. Recent trials suggest that interventions aimed at slowing cognitive and physical decline are effective but additional trials will be required to demonstrate a meaningful impact in prolonging the time individuals remain functionally independent.