Introduction

Dementia is one of the key non-motor features of Parkinson’s disease (PD), which is associated with increased mortality [1], poorer quality-of-life [2], worse carer burden [2], and higher rates of institutionalization [3]. A systematic review showed the prevalence of dementia in PD was about 24–31% [4]. However, there are relatively few reliable data on the annual or cumulative incidence of dementia in PD, especially over the long term. Widely varying annual incidence rates (5–11 per 100 person-years) [5] and cumulative proportions (3–28%) have been quoted up to 4 years post-diagnosis [6], whilst at 20 years, 20–80% of PD survivors have been found to have dementia [6, 7]. Some of the differences relate to the nature and age of the cohorts studied, many being selected, largely hospital-based cohorts with relatively young ages of onset, which have limited generalizability. Very few studies consider the incidence of dementia in unselected incident, population-based PD cohorts with prospective long-term follow-up [5, 8,9,10,11,12]. Those that do have mostly reported short-term results with incidence rates between 2 and 6 per 100 patient-years in the first 5 years, rising with duration of follow-up, and a cumulative proportion of about 10% at 3 years rising to 46% at 10 years (Supplementary Table 1). The rates of dementia in PD were over twice that in the general population [10, 12].

The limited reliable data on the incidence of PD dementia (PDD) also mean that there are relatively limited data on the early clinical factors that predict the subsequent development of dementia. Previous studies have suggested numerous factors such older age at onset, male sex, fewer years of education, worse baseline motor and cognitive scores, mild cognitive impairment, REM sleep disturbance, visual hallucinations, orthostatic hypotension, vascular risk factors, obesity and genetic polymorphisms in the apolipoprotein E (ApoE), β-glucocerebrosidase (GBA), and microtubule-associated protein tau (MAPT) gene are associated with subsequent dementia [6, 13,14,15], but some of these are inconsistent associations. In addition, some risk factors for other dementias such as subjective cognitive decline have not been studied in PDD [6, 16]. Therefore, there is a need for more robust data on early risk factors from unselected population-based cohorts.

We aimed to assess the incidence of PDD by age and sex in a well-characterized, unselected, prospective population-based incident cohort of people with PD compared to a concurrent control group and to identify baseline predictors for PDD and outcomes following the development of PDD.

Methods

The Parkinsonism Incidence in North-East Scotland (PINE) study used multiple overlapping search strategies to identify all patients with a newly diagnosed degenerative or vascular parkinsonian syndrome from primary care practices (baseline population about 315,000) in and around Aberdeen, Scotland, in two phases (2002–04, 2006–09) [17]. Parkinsonism was defined as two or more cardinal motor signs (bradykinesia, rigidity, rest tremor, otherwise unexplained postural instability). Eligible patients were offered life-long yearly follow-up (still ongoing) by a consultant neurologist with an interest in movement disorders (CC) or a supervised clinical research fellow with linkage to the national death register to establish survival. Clinical care was not altered by participation in the study. For each eligible patient who consented to follow up, we tried to identify an age–sex-matched community-based control without PD by approaching up to four people closest in age to the patient from the same primary care practice or from a register of elderly people who had taken part in a previous community-based screening project [18].

At each annual review, the cause of parkinsonian syndrome was classified using all available information (clinical syndrome, atypical features, response to dopamine replacement therapy, development of motor complications, and results of structural (CT or MRI) or dopamine transporter SPECT imaging where undertaken) using clinical expertise guided by the appropriate research criteria (the UK Brain bank criteria for PD [19]) or from pathology in those who died and had given consent for post-mortem. For the purposes of this report, we only included patients with PD and all controls.

Assessments

Patients and controls who gave consent had a standardized baseline (diagnostic) visit and yearly review including clinical examination looking for atypical features and assessment of Unified Parkinson’s Disease Rating Scale [UPDRS], disability [Schwab & England (S&E) and Barthel index], cognitive function (mini-mental state examination (MMSE) and mini-mental Parkinson’s [MMP] [20]), mood (Geriatric Depression Scale-15 item version), and a brief questionnaire for non-motor complications including symptoms of REM sleep behaviour disorder (i.e. dream enactment) and subjective cognitive symptoms (“do you feel that your memory or thinking processes are worse than they should be?”). Some patients only consented to limited assessment including UPDRS motor score, S&E score, MMSE, and the checklist of motor and non-motor complications. Patients who consented to all aspects of the study gave blood for genotyping including testing of MAPT H1 versus H2 haplotype, ApoE ε4 allele, and GBA gene variants using previously described methods [21], whereas those who consented to more limited assessment did not have blood taken for genetic testing. Those who were unable to come to clinic were visited at home. Controls were able to consent to remote follow-up (annual medical records review, questionnaires, and linkage to national death register) or in-person review in which case they completed the same assessments as patients, performed by a research nurse.

Definition of dementia

Dementia in patients and controls was diagnosed clinically according to DSM-IV criteria by an experienced clinician with expertise in diagnosing dementia, namely either the study neurologist or a consultant psychiatrist if the latter had seen the patient as part of their clinical care (identified by screening primary and secondary care medical records for each patient and control). There needed to be evidence of progressive cognitive decline (detected from prospective follow-up), involving at least two cognitive domains (from attention, executive function, visuospatial function, verbal and visual memory, language, agnosia, and apraxia) assessed from the history, examination, and MMSE and MMP cognitive tests, which was limiting daily activities, social or occupational function in the absence of delirium. We did not apply strict cut-off scores from cognitive tests or require formal neuropsychological testing. All PD patients who developed dementia were presumed to have PDD (all met clinical criteria for probable or possible PDD [22]). The date of dementia was defined as the date it was first diagnosed clinically.

Data analysis and statistical methods

Data were extracted after follow-up until 31st May 2020 and double-checked prior to analysis. Follow-up time was calculated using the reverse Kaplan–Meier method [23]. Incidence rates of dementia were calculated by dividing the number of cases by the total number of person-years at risk, stratified by age and sex. Confidence intervals were calculated assuming a Poisson distribution. Kaplan–Meier curves of time-to-dementia were plotted censoring for last follow-up or death and the hazard ratio (HR) for dementia in patients compared to controls was calculated using Cox proportional-hazards modelling controlling for baseline age, sex, years of education, and socioeconomic status [24]. We tested whether there was a significant interaction between patient/control status and sex. We also did a sensitivity analysis excluding controls who consented to remote rather than in-person follow-up to investigate whether this altered the incidence rate or hazards ratio comparing patients and controls.

Several potential baseline variables that might predict subsequent dementia in PD were identified from the previous literature (supplementary Table 2) and were entered into a Cox proportional-hazards model to identify the statistically significant predictors of PDD. The primary analysis (model 1) included ten baseline variables (measured at, or shortly after, diagnosis) with minimal missing data: age, sex, years of education, smoking history, body mass index, history of vascular disease (defined as history of previous ischemic heart disease, stroke, transient ischemic attack or peripheral vascular disease) or diabetes, presence of subjective cognitive symptoms, history of dream enactment behaviour, MMSE, and UPDRS part 3. The number who developed dementia per variable was just under 10 [25]. Secondary analyses included other variables which were not collected in all patients (due to different levels of study participation). Model 2 also included National Adult Reading Test (NART) score as an alternative measure for premorbid intelligence than years of education [26], GDS-15 as depression measure, and MMP instead of MMSE as a measure more likely to capture cognitive deficits in PD. Model 3 added the three genetic variables to the model 1 variables. In model 1, missing predictor variable data were imputed with multiple imputation, assuming that data were missing at random. We used a predictive mean matching algorithm to impute data using all the predictor variables in the model, the dementia variable, the Nelson–Aalen estimator of the cumulative hazard function, and the year 1 MMSE values (because some baseline MMSE values were missing). For each model, 20 imputed datasets were combined using Rubin’s rules. The missing data in the additional variables included in models 2 and 3 were all missing in most cases with lower levels of study participation, so we only included patients with complete data in these models. We excluded one person with longstanding learning disability but no dementia from the prediction models, because this was unusual clinically and their low baseline scores (MMSE 18/30, MMP 15/32) were highly influential on the modelling.

We investigated the risk of institutionalization and death after dementia. We plotted Kaplan–Meier probabilities of remaining (i) living at home (not in an institution) and (ii) alive after dementia in both PD and controls. Participants were censored at death (in the institutionalization analysis) or when last seen in those with ongoing follow-up. Hazard ratios comparing these outcomes in PD versus controls were calculated using Cox regression adjusted for age at baseline and sex. We investigated the influence of dementia on both these outcomes in time-dependent Cox regression models with the development of dementia as a time-varying covariate (i.e., dementia coded as 0 until time of development of dementia and as 1 thereafter). These models were adjusted for baseline age, sex, and the other significant predictors of dementia in Model 1. The proportional-hazards assumption was verified by visual inspection of log–log plots and a formal test based on Schoenfeld residuals (the estat phtest command in Stata). Functional form was assessed with fractional polynomials [27]. Data were analysed using Stata version 16.1.

Results

The recruitment, follow-up, and baseline characteristics of 201 patients and 260 controls are shown in Fig. 1 and Table 1. Four people with PD (with delayed diagnosis from motor onset) had dementia at baseline assessment and were excluded from all survival analyses. Controls were slightly older than patients as we included all controls, not just those age-matched to people with PD.

Fig. 1
figure 1

PINE study flowchart

Table 1 Baseline characteristics of patients and controls

Ninety-three patients and 46 controls developed dementia over a median follow-up of 9.5 years (inter-quartile range [IQR] 5.4–11.4). Onset of dementia was at a younger age in PD (Table 1). The overall incidence rate of dementia in PD was 7.4 cases per 100 person-years (95% CI 6.1–9.1) compared to 2.1 cases per 100 person-years (95% CI 1.6–2.8) in controls, but it was significantly dependant on baseline age, especially in PD (1.6 per 100 person-years under 60 versus 12.2 over 80) (Table 2). Kaplan–Meier dementia-free survival curves are shown in Fig. 2 Median time-to-dementia in PD was 8.5 years (95% CI 7.0–11.0) and was not reached in controls. In PD, 25.8% developed dementia by 5 years, 58.1% by 10 years, and 79.9% by 15 years compared to 5.9%, 16.7%, and 39.1%, respectively, in controls.

Table 2 Dementia incidence and outcomes after dementia
Fig. 2
figure 2

Kaplan–Meier curves for survival free of dementia in incident Parkinson’s disease and controls

The HR of dementia in PD versus controls adjusted for baseline age, sex, years of education, and socioeconomic deprivation score was 6.0 (95% CI 4.1–8.7), but there was a significant interaction between sex and patient/control status and the risk of dementia (p value for interaction was 0.03). There was no sex difference in the risk of dementia in PD, whereas in controls, dementia was more common in women (Fig. 3). Hence, the adjusted HR for dementia was higher in men (8.7, 95% CI 5.2–14.7) than in women (3.9, 95% CI 2.3–6.5). In the sensitivity analysis excluding 18 controls who only consented to remote follow-up, the incidence rate in controls was slightly lower (1.8 cases per 100 person-years) and the adjusted HR comparing PD with controls was slightly higher [6.7 (95% CI 4.5–10.0)] than in the main analyses including all controls.

Fig. 3
figure 3

Kaplan–Meier curves for survival free of dementia in incident Parkinson’s disease (PD) and controls by sex

Significant baseline prognostic factors for PDD in the multivariable analyses were age, presence of subjective cognitive symptoms, history of dream enactment behaviour, motor UPDRS score, MMSE score, and the ApoE ε4 allele (Table 3). None of the other prognostic factors studied were significantly associated when adjusted for confounders. The proportional-hazards assumption was satisfied and there was no evidence of non-linearity of interval variables.

Table 3 Predictors of Parkinson’s disease dementia

Kaplan–Meier survival curves of the risk of death and institutionalization after dementia are shown in Fig. 4, but should be interpreted cautiously more than 4 years after dementia onset due to the very small number of surviving patients and controls who were event-free. The rate of institutionalization after dementia was 26.9 per 100 person-years (95% CI 19.6–36.8) in PD and 21.4 per 100 person-years (95% CI 13.3–34.4) in controls (see Table 2). Median survival free from institutionalization after dementia was lower in PD (2.7 years, 95% CI 2.1–3.2) than controls (4.3 years, 95%CI 1.7 to undefined). The mortality rate after dementia was 32.0 per 100 person-years (95% CI 25.9–39.7) in PD and 23.7 per 100 person-years (95% CI 16.5–33.8) in controls. Median survival after dementia was similar in PD and controls: 3.1 years (95%CI 2.3–3.6) in PD and 3.1 years (95% CI 2.2–5.8) in controls. The risk of institutionalization and death after dementia were non-significantly higher in PD than controls (HR 1.8, 95%CI 0.9–3.7 and 1.5, 95% CI 0.9–2.5, respectively), adjusted for baseline age, sex, cognitive symptoms, dream enactment behaviour, and MMSE.

Fig. 4
figure 4

Kaplan–Meier curves for (a) death and (b) survival free of institutionalization following dementia in PD and controls (shaded areas = 95% confidence intervals)

In PD, the development of dementia (as a time-varying covariate in Cox model, adjusted for the same potential confounders) was associated with an increased risk of death (HR 2.5, 95% CI 1.6–3.7) and institutionalization (HR 5.6, 95% CI 3.1–10.0).

Discussion

We have shown that dementia in PD has a high incidence (7.4% per year), significantly more than in a control population, and had similar rates in men and women unlike controls where it was more common in women, probably because Alzheimer’s is more common in women [28]; came on at a younger age in PD than controls; rose sharply with age at PD diagnosis; and was associated with worse outcomes (annual rates of institutionalization and death 27 and 30 per 100 person-years, respectively), some of which were worse than in controls who developed dementia, probably due to the co-existent motor difficulties of PD. Our incidence rate of PDD was higher than most previous estimates (supplementary Table 1), probably due the higher baseline age and longer follow-up duration. The cumulative proportion of people with PD who had developed dementia up to 15 years after diagnosis was nearly 80%.

We also confirmed some of the previously identified early predictors of PDD [age, dream enactment (a proxy for REM sleep behaviour disturbance), motor and cognitive function, ApoE ε4 allele]. However, for the first time, we identified that subjective concern about cognitive symptoms at baseline was also a predictor of subsequent PDD, independent of objective cognitive testing, as described in other dementias [16]. This means that clinicians need to be cautious about dismissing people with Parkinson’s concerns about their memory, even if routinely applied cognitive testing is normal. Interestingly, the baseline MMSE cognitive score was more predictive of subsequent dementia than the more PD-specific MMP, despite concerns that the MMSE does not adequately capture cognitive deficits in PD [29].

This study has several strengths, particularly in its design and length of follow-up. It was based on an incident population-based cohort of people with PD with prospective standardized long-term follow-up with no losses to mortality, dementia, and institutionalization follow-up in patients and very few in controls. Hence, the risk of selection and attrition bias was minimized, and the results should be generalizable to demographically similar populations. There was active surveillance for dementia and the diagnosis of both PD and dementia was made by expert clinicians. PD was confirmed pathologically in 25% of the patients. Consistency of follow-up gave the opportunity to confirm progressive cognitive decline impacting on daily function. It is, therefore, unlikely that cases of clinically significant PDD were missed (unless they developed it in the maximum 1 year period between their last clinical visit and their death) and the diagnosis of PD was as accurate as it could be. However, it is possible that some of the controls who consented to remote notes/questionnaire-based follow-up and were not seen in-person may have developed dementia and been missed, especially if they were not referred for assessment to secondary care by their primary care physician, but the sensitivity analysis excluding these controls did not suggest this was the case.

There are some other weaknesses in the current study. The numbers were relatively small, so the confidence intervals for incidence rates in specific age groups were wide. It also meant we were limited in how many predictive variables we could use in the models to avoid overfitting [25] and there would have been limited power for identifying weaker predictors. We did not assess anticholinergic medication burden as an independent risk factor for PD dementia, because previous work had not shown it to be a predictive factor in our cohort [30]. In addition, we did not have complete baseline data for some predictor variables, again limiting sample size in the secondary analyses. Some potentially important baseline predictors were not available. For example, we did not formally diagnose mild cognitive impairment, a known baseline predictor of PDD [13], although baseline cognition was captured objectively and subjectively. The primary cognitive score was the MMSE which is not recommended for use in PD [29], but the more PD-specific MMP measure was found to be less predictive of dementia. We did not use the Montreal Cognitive Assessment, because this had not been developed when we started the PINE study in 2002. We also did not have the resources to formally diagnose REM sleep behaviour disorder and so a clinical history of dream enactment was used as a proxy. Although the control group was population-based, it is never truly randomly selected, and so, there is a risk it may have been unrepresentative of the general population and may have underestimated the risk of dementia in controls. In terms of generalizability, the PINE study included few people with young-onset PD due to its low incidence and almost all were Caucasian.

This study has several implications. First, in clinical practice, it helps to provide long-term age–sex-specific incidence rates of PDD to guide patient information on prognosis and management. It will also help guide those who plan health care to provide adequate resources in the future for dementia care in PD. Finally, improved understanding of the rates of PDD and its baseline predictors will help researchers design better, adequately powered trials of potential preventative treatments for PDD.