Introduction

To date, more than 55 million people are living with dementia and the tendency is expected to rise to nearly 140 million in the next twenty-five years (1). Currently, available treatments can improve cognitive function but do not provide a cure for cognitive decline (2). Risk reduction interventions are a significant area of interest in dementia research, aligned with the strategic goal 15 of the recent blueprint for dementia research by the World Health Organization (WHO) (3).

While there are non-modifiable risk factors such as aging and the Apolipoprotein E genotype, Livingston and colleagues identified a set of twelve modifiable risk factors (1). Together, those risk factors account for up to 40% of the risk for dementia. Apart from low education and detrimental health behaviors such as smoking, alcohol consumption or physical inactivity, other relevant factors include social isolation, depression, air pollution and health issues like hearing loss, brain injury, hypertension, obesity and diabetes (1).

Sex differences can be observed in the prevalence of dementia, which is higher in women (4). Also, risk factors for cognitive decline differ between men and women in both the severity as well the as the effect on cognition (57). Well-known examples of those differences include hypertension, which is more prevalent in men (8), and stress, which is stronger in women (7). Furthermore, certain factors are specific to one sex, such as andropause for men and early menopause for women, respectively (7). Evidence suggests additional age-related differences. Notably, depression and sleep disturbances are stronger risk factors in women at late-life (9).

The last two decades have seen a growing trend towards multi-domain lifestyle interventions targeting cognitive decline. A first approach was the Finnish Geriatric Intervention Study to Prevent Cognitive Impairment and Disability (FINGER) (10). The study reported small but positive intervention effects on global cognition, domain-specific cognitive function and physical functioning (11). To adjust the approach to different economic, cultural and regional environments, the WW-Fingers network was founded. A trial tailored to the German health care setting was the AgeWell. de-trial. Although no significant effect on global cognition performance was observed in AgeWell.de, the intervention trial demonstrated improvements in health-related quality of life in the overall sample (12).

Evidence indicates sex differences in the risk of dementia and corresponding lifestyle factors, as described above. Moreover, a review by Zülke and colleagues suggests modest benefits of one or two-domain lifestyle interventions on cognition specifically for women (13). Furthermore, sex differences are also observed in previous research for aspects of the intervention. For example are older men found to be more likely to be physical active than older women (14). Women, however, tend to eat healthier than men (15). Regarding social activity, there is a consensus, that social isolation and disengagement are risk factors for cognitive decline16, whereby recent studies suggest higher benefits of social engagement for women (17). Indeed, much less is known, however, about the differences between women and men regarding the effectiveness of multi-domain lifestyle interventions. Results of the FINGER did not find differences between women and men in the effectiveness of the intervention on cognitive outcomes (18).

The objective of this study was to investigate the relationship between sex and the effectiveness of the AgeWell.de intervention. The secondary analyses of the trial followed three research questions: 1) what are the differences between men and women in the effectiveness of the AgeWell.de-intervention on cognitive performance and secondary outcomes at follow-up? 2) How effective was the AgeWell.de intervention on cognitive performance and secondary outcomes at follow up for different age groups within women and men?

Methods

Study design & participants

AgeWell.de was a 2-year multi-domain cluster-randomized intervention trial to preserve cognitive function in older adults at increased risk for dementia. Sample size was calculated based on assumptions of differences in change between groups, statistical power, cluster size and intra-cluster correlation and a dropout rate of ≤ 10% in both groups. Participants were recruited from five study sites across Germany (Leipzig, Greifswald, Halle, Kiel and Munich) by general practitioners (GP). Participating GP practices were used as clusters for randomization, which was carried out by the data management centre at the Institute for General Practice of the Hannover Medical School using a computerized block-randomization algorithm. Study design has been described in detail preciously including statistical power analysis, sample size, recruitment, randomization, intervention procedure and baseline characteristics (19, 20).

The intervention was composed of several components: nutritional counselling, enhancement of physical activity, social activity and cognitive training, management of cardiovascular risk factors, optimization of medications and intervention in bereavement, depressiveness and grief following loss experiences. Intervention components were instructed by trained study nurses during the baseline visit at the participants’ homes. Participants of the control group (CG) received the usual GP treatment and additional written health advice on the intervention components. Randomization and intervention procedure have been particularized in detail previously (12, 20).

The intervention trial was conducted with older adults at risk for dementia with respect to the inclusion criteria of a CAIDE Dementia Risk Score ≥9 (assessed by the GP; (21)) and an age of 60 years and older. Exclusion criteria were diagnosed dementia, severe impairments of hearing, vision or mobility, diseases, which might hinder the implementation of the intervention, insufficient command of the German language or participation in another intervention. Of initially 1,030 participants who completed the baseline assessment, 819 (n=433 women; n=386 men) completed the follow-up assessment 24 months after baseline and were thus used for analysis.

The responsible ethics boards of the coordinating study center of AgeWell.de (Ethics Committee of the Medical Faculty of the University of Leipzig; ethical vote number: 369/17-ek) and of all participating study sites approved the AgeWell.de-study. Participants provided written informed consent to participate at their respective study center. AgeWell.de is registered at the German Clinical Trials Register (DRKS; ID: DRKS00013555).

Outcomes

The main outcome was cognitive performance at follow-up using the neuropsychological test battery mainly based on a subset of the Consortium of Establish a Registry for Alzheimer’s Disease (CERAD) (22) at both baseline and follow-up according to the DSM-V-criteria (23). The test battery consists of measurements of the following cognitive domains: executive function using (Trail Making Test, TMT-B – TMT-A (24)), learning (CERAD Wordlist Memory (22)), language (CERAD Verbal Fluency Test “Animals” (22)), perceptual-motor skills (CERAD Constructional Practice (22)) and social cognition (Reading the Mind in the Eyed test, revised version (25)), which were operationalised by one composite z-score.

Secondary outcomes were mortality, nursing home placement, functioning in activities of daily living, functioning of instrumental activities of daily living, quality of life, health-related quality of life, depressive symptoms, social inclusion and resource utilization and costs. Comparative analyses of nursing home placement (n=1) and mortality (n=11) were not carried out due to insufficient cases; cost-effectiveness will be part of the economic analysis which is still pending. Activities of daily living was compiled by two instruments, the Barthel Index (26) (Activity of Daily Living, ADL) and the Amsterdam-IADL-scale (27) for instrumental activities of daily living. Due to a highly left skewed distribution, the Barthel-Index was dichotomized (100 vs <100). Quality of life was analysed using the World Health Organization Quality of Life questionnaire (WHQOL-OLD) and health-related quality of life (HR-QOL) using the EuroQol visual analogue scale (EQ-VAS; (28)), depressive symptoms using the Geriatric Depression Scale-15 (GDS; (29, 30)) and social inclusion using the Lubben Social Network Scale (LSNS; (31)). Details about the instruments used in AgeWell.de were described previously by Zülke and colleagues (20).

Statistical analysis

Intention-to-treat (ITT) analyses were performed including all participants who completed the follow-up assessment. Examination of incomplete data revealed that missingness was at random (MAR). No evidence of systematic missingness was found using an extragenerated index variable to indicate missingness of information of any variable at baseline. Missing data were therefore imputed by chained equations using multiple imputations (32, 33). All analyses contain pooled estimates of 50 imputed data.

The outcomes were transformed into z-scores by standardizing to the baseline mean and standard deviation, apart from depressive symptoms (used with original scale) and the Barthel-Index (dichotomized). Overall cognitive performance was composed of the z-standardized scores of the six domain specific tests (range −2.711 to 1.394). To have a good trade-off between validity of outcomes and a minimized risk of selection bias, a minimum of three out of the six tests used for the composite score had to be available to calculate the primary outcome.

The models were run using generalized linear regression models (GLM). All outcomes were normally distributed except for activities of daily living (ADL) which was used dichotomized and thus calculated with binomial distribution and depressive symptoms, for which we used GLMs with negative binomial distribution errors. All other models were calculated with identity link and Gaussian distribution errors. The models were all controlled for age (except for the age-stratified models), education (using the Comparative Analysis of Social Mobility in Industrial Nations (CASMIN)-scale (34)), and familial status. Further, all analysis were run using cluster-robust standard errors. Reported results are treatment effects as average marginal effects (AME) with a significance alpha-level set at 0.05 (two-tailed). Data management and analysation were performed using STATA, Version 16 (StataCorp, College Station, TX, USA).

In a first step, we ran GLMs with interaction terms between the indicators of the research questions (1) sex, (2) age-group within women and (3) age-group within men (all interacting with the intervention group (IG)) to assess whether a potential intervention effect varied by subgroup on each outcome, respectively. Second, we performed GLMs on the outcomes for women and men after stratifying the sample into female and male subsamples. For the analysis within women and men, were further stratified the male and female samples by age (60–69 and 70–77 years old). In addition, two-way interaction analyses were carried out with each subgroup indicator and treatment group. The results of the interaction analysis and the subgroup analyses are depicted together for better interpretability (tables 2 & 3).

Results

Characteristics of male and female participants

Participant characteristics are summarized in Table 1. Of the 819 participants who completed the follow-up assessments, 52.9% (n=433; IG: n=199; CG: n=234) were female and 47.1% (n=386; IG: n=179; CG: n=207) were male; the participants age ranged between 60 and 77 years. Differences between women and men were observed for both groups for intermediate education that was bigger in women, whereby the proportion of participants with a high level of education was bigger in men. In addition, differences were found in the family status. More women than men were widowed, while more men than women were married or cohabitating. Only a very small proportion of male participants was widowed or single. Differences in the outcomes were observed for cognitive performance, which was higher for female participants and only in the intervention group, as well as instrumental activities of daily living, which was higher in men for the intervention recipients.

Table 1 Baseline characteristics of AgeWell.de-study participants separated by sex
Table 2 Between-group differences in primary and secondary outcomes at follow-up, stratified by sex; including results of GLMs with interaction-term of intervention group×sex on each outcome
Table 3 Between-group differences in primary and secondary outcomes at follow-up, stratified by age for women and men; including results of GLMs with interaction-term of intervention group×age for the sex-stratified models on each outcome

Differences in the effectiveness between men and women

No differences were found in the effectiveness for cognitive performance between women and men (ßwomen=0.009, p=0.892; βmen=0.037, p=0.590); accordingly, no effect was found for the interaction of treatment group and sex (p=0.663). Regarding secondary outcomes, significant interaction terms for sex and intervention on quality of life (p=0.038) and depressive symptoms (p=0.045) indicate differences between women and men. The results of the additional models reveal, nevertheless, that there was no further difference in the effectiveness of the intervention on quality of life between women and men. However, we found a difference in the effectiveness between women and men for depressive symptoms. While women benefitted from the intervention respective depressive symptoms (β=− 0.325; p=0.003), no effect was found for men (β=0.004; p=0.978). In addition, a positive intervention effect on health-related quality of life was found for men (β=0.057; p=0.010), which, however, was not significantly different from women related to the p-value of the interaction (p=0.514).

Differences in the effectiveness between age groups in women

The results of the age-stratified analysis of the female group are set out in Table 3. Women between 60–69 years benefitted from the intervention in their perceived health-related quality of life (β=0.434, p<0.001), whereas women 70 years and older did not (β=−0.095, p=0.513; p-value of interaction=0.005). We did not find further differences between the age groups for women. Strong effectiveness of the intervention for women in reducing depressive symptoms was also found in both age-stratified groups, whereby the effect of the intervention on depressive symptoms was slightly stronger for women aged 70 years and older (60–69 years: β=0.343; p=0.017 vs. 70–77 years: β=0.352; p=0.020). Furthermore, no statistical significance of the interaction term supports the result that there are no further differences between age groups for the effectiveness of the intervention on depressive symptoms in women.

Differences in the effectiveness between age groups in men

No significant differences of the intervention were found between the age groups for the male group on cognitive performance. Respective secondary outcomes, we found a negative effect for quality of life for men between 70 and 77 (β=−0.288; p=0.014). However, no effect was found for men between 60–69 for quality of live (−0.010, p=0.915). Instead, men between 60–69 years reported significantly better health-related quality of life when participating in the intervention (β=0.402, p= 0.001) compared with men who were 70 years and older (β=0.012, p=0.924; p-value of interaction=0.05). The significant interaction term for health-related quality of life for men strongly supports the assumption of a difference (p=0.050).

Discussion

The objective of this study was to examine differences in the effectiveness of the AgeWell.de -intervention between women and men, as well as within each sex group.

The primary outcome of the AgeWell.de-trial was global cognitive performance. No significant effects were observed for either men or women, or within the subgroups. Hence, no differences between the groups were found for cognitive performance. However, a difference between women and men was found regarding depressive symptoms. The intervention had a positive effect for female intervention group participants but not for men. Generally, participants in the younger age group (60–69) benefited in terms of better health-related quality of life in both men and women.

While several lifestyle interventions against cognitive decline based on the FINGER have been implemented10, to our knowledge, there is currently limited evidence regarding sex differences in the effectiveness of multidomain lifestyle interventions so far. Consistent with the findings of the FINGER-multidomain intervention, our observations also indicate no sex differences in the effectiveness on cognitive performance (18).

However, we identified variations in the effectiveness on depressive symptoms. It must be taken into account, that the level of depressiveness was moderate at the beginning of the intervention. Nevertheless, evidence on sex differences in depression among older adults reports higher scores in women (35) while evidence offers mixed findings about sex differences in response to treatments of depressiveness (36). Main reasons for depressive symptoms among adults who are 60 and older are widowhood or living alone, poor health or poverty (35). It seems possible that this result is due to a better-targeted intervention for women in terms of depressive symptoms. First, even if no differences between depressive symptoms were found between men and women at baseline, women and men strongly differed in being widowed which is seen as risk factor. The analyses were adjusted for family status but it would be of interest if having a partner or living alone had an influence on the adherence and effectiveness of the intervention. Also, two of the intervention components were composed of the enhancement of social activity, which is seen as directly linked to reduce depressive symptoms (37, 38), and secondly, individual intervention for bereavement, grief & depressive symptoms. Analyses towards the intervention components and respective adherence might reveal further insights about the reasons, why women benefited more than men did regarding depressive symptoms. In fact, we are actually analysing the effect on depressiveness and the association between the change and the components of the intervention in greater depth which is to be reported in a more detailed manuscript. However, the result is gratifying, since depressiveness as risk factor is stronger in women at late-life (9).

No sex difference was found regarding intervention effects on health-related quality of life. Interestingly, we found differences between younger and older intervention participants in both women and men. This is in line with the main results of the AgeWell.de-trial, where a positive effect was found for the overall sample with a negative interaction with continuous age (12). Surprisingly, we found a negative effect on quality of life in men between 70–77 years. Together with the results on health-related quality of life, this might suggest, that treatments are more efficient in younger participants. A qualitative study about aspects of lifestyle change in older adults reported that retirement was seen as auspicious moment to implement lifestyle changes – which would support the assumption that younger persons benefitted more than older ones (39).

The need of more targeted interventions is clearly supported by the current findings. Differences in the effectiveness between both men and women and further different ages reveal those needs. Considerably more work will be needed to determine the differences between women and men, for example in the qualitative analysis of adherence to lifestyle interventions. Qualitative studies could further be a promising approach to analyse the tangible differences between women and men in the effectiveness but also the adherence of interventions. Besides, meta-analysis of several multidomain randomized clinical trials as quantitative approach could be worthwhile. Another strategy might be the inclusion of potential participants in designing future intervention studies to enhance effectiveness.

Limitations

This study has several limitations, some of which were bound to the intervention itself. As discussed in the main trial paper, a limitation is the trial duration, which might be too short for significant effects on cognitive performance (12). Furthermore, the control group received GP treatment as usual. Different check-ups for pre-existing diseases or certain aspects of prevention are usually covered in GP treatment, so that this has as a quite high standard in Germany. At last, AgeWell.de was completed during the COVID-19 pandemic, whereby perceived restrictions were more frequently reported for participants of the intervention, especially for nutrition (12).

Conclusion

This study has identified beneficial effects of the AgeWell.de intervention on depressive symptoms in women and on health-related quality of life in women and men between 60 and 69 years. Concluding, this study supports evidence of stronger effects of lifestyle interventions for women between 60 and 69 years. Thus, this study supports the need of intervention designs that are closer tailored for specific subgroups.