Background

Up to 40% of non-demented patients with Parkinson’s disease (PD) present with cognitive impairment [1] within both non-instrumental functions—i.e. attention and executive functioning—and instrumental domains—i.e. memory, visuo-spatial skills and language [2]. Since such dysfunctions detrimentally impact on patients’ functional outcomes [3] and prognosis [4], to screen for them via clinimetrically sound and feasible performance-based tests is clinically pivotal [1, 2] and thus highly advisable [5].

To this aim, according to the 2018 Movement Disorders Society (MDS) guidelines [5], the Montreal Cognitive Assessment (MoCA) [6] is—amongst those tests that are disease-nonspecific—strongly recommended. Such a screener has indeed received major support for use in this population by the International literature as far as its psychometrics, diagnostics as well as both cross-sectional and longitudinal feasibility are concerned [7], being also recommended within clinical trials as an outcome measure [8]. In fact, the MoCA samples from all of the abovementioned cognitive functions and domains are typically involved in PD [5,6,7].

However, with specific regard to the Italian scenario, the only two studies that focussed on the clinimetrics MoCA in non-demented PD patients—the first by Biundo et al. [9] and the second by Federico et al. [10]—are both lacking in relevant information and outdated.

Indeed, first, neither of these reports [9, 10] delivers evidence on the construct validity of the MoCA in this population—as both merely focussing on its diagnostic properties.

Moreover, although both Biundo et al. [9] and Federico et al. [10] overall provided data supporting the diagnostic value of the MoCA in non-demented PD patients, such studies either preceded the availability of demographically adjusted norms for the Italian MoCA (which were first delivered in 2015 [10,11])—as is the case for Biundo et al.’s [9] report—or addressed a limited sample size [6]—as is the case for Federico et al.’s [10] investigation, which included N = 43 patients. Moreover, Federico et al.’s [9, 10]  study, which dates back to 2015, of course could not address the most recent normative dataset for the MoCA—which has been updated in 2021 [12]. Taken together, such stances imply that, in Italy, no up-to-date, generalizable PD-specific cut-off is available for the MoCA [13].

Given the relevance of delivering comprehensive and up-to-date clinimetric information for a given cognitive screener, to the aim of increasing their level of recommendation for clinical and research use [14], the present study aimed at: (1) assessing, in an Italian cohort of non-demented PD patients, the construct validity of the MoCA against both first- and second-level cognitive measures; (2) delivering an exhaustive and updated evaluation of its diagnostic properties.

Methods

Participants

Data on N = 237 PD patients—diagnosed according to the UK Brain Bank criteria [15]—consecutively referred to IDC-Hermitage Capodimonte, Napoli, Italy between 2013 and 2019 and having undergone the MoCA were retrospectively collected. Patients did not present with dementia according to the DSM-5 criteria for a Major Neurocognitive Disorder due to PD [16]. Moreover, patients were free from (1) further neurological/psychiatric diseases, (2) severe and/or unstable general-medical conditions that could have possibly affected cognition (i.e. system/organ failures and uncompensated metabolic/internal conditions) 3) uncorrected hearing/vision deficits.

Materials

All patients were administered the MoCA [13]. Patients assessed up to 2016 (N = 169) further underwent the Mini-Mental State Examination (MMSE) [17], whilst those assessed since 2017 (N = 68) were screened via the Parkinson’s Disease Cognitive Rating Scale (PD-CRS) [18]—the latter being is a 9-test, performance-based cognitive screening battery assessing attention/executive functioning, memory, language and visuo-spatial abilities/constructional praxis (range = 0–134). Out of patients assessed up to 2016 (N = 169), a subsample (N = 60) also underwent a second-level cognitive battery (Table 1) encompassing measures of attention and executive functioning (Trail-Making Test; Stroop Colour-Word Test; Phonemic Verbal Fluency; Backward Digit Span), memory (Rey Auditory Verbal Learning Test; Babcock Memory Test), language (Noun- and Verb-naming tasks from the Esame Neuropsicologico per l’Afasia; Semantic Verbal Fluency), praxis (Design Copy) and visuo-spatial abilities (Benton Judgment of Line Orientation) [19,20,21,22,23,24,25]. Figure 1 displays the abovementioned patient subsamples, defined according to which test they were administered.

Table 1 Second-level cognitive battery
Fig. 1
figure 1

Flowchart displaying the tests administered to each patient subsample. MoCA Montreal Cognitive Assessment, MMSE Mini-Mental State Examination, PD-CRS Parkinson’s Disease Cognitive Rating Scale

Statistics

Skewness and kurtosis statistics were addressed to check for Normality on raw variables (with values >|1| and |3| being judged as abnormal, respectively) [26]; thus, based on such an assumption being met or not, either parametric or non-parametric techniques were employed to test associations/predictions of interest. Skewness and kurtosis values for each cognitive measure are shown in Supplementary Table 1.

The construct validity of the MoCA against the PD-CRS was tested via a Pearson’s coefficient by addressing age- and education-adjusted scores on both tests [13, 18], whilst that against each measure of the second-level cognitive battery via a set of Bonferroni-corrected Spearman’s correlations that addressed test raw scores and covaried for age, education and sex.

The diagnostics of both raw and adjusted [13] MoCA scores were examined via receiver-operating characteristics (ROC) analyses by addressing a below-cut-off, age- and education-adjusted MMSE score [17] as the positive state. Such an operationalization was indeed supported both by a previous Italian report demonstrating the suitability of the MMSE for cognitive screening aims in this population [6], and by the fact that, within the present cohort, the effect size for the association between both raw (rs(169) = 0.64; p < 0.001) and adjusted scores (rs(169) = 0.53; p < 0.001) on the two tests was large—according to Cohen’s [27] benchmark (i.e., ≥ 0.5). Sensitivity (Se), specificity (Sp), positive and negative predictive values (PPV; NPV) and likelihood ratios (LR + ; LR-) were computed at the optimal cut-off identified via Youden’s J statistic. The number needed for screening utility (NNSU) was also computed—with a value 1.02 ≤ being deemed as indexing an optimal screening performance [28].

Analyses were run with R 4.1 (R Core Team, 2012), jamovi 2.3 (the jamovi project, 2022) and IBM SPSS 27 (IBM Corp., 2020). The Partial Correlation macro of jamovi 2.3 was employed to compute partial Spearman’s coefficients.

Results

Table 2 summarises patients’ background, clinical and cognitive measures.

Table 2 Patients’ background and cognitive measures

The MoCA was significantly associated with both PD-CRS scores (r(68) = 0.68; p < 0.001) and to the vast majority of second-level cognitive measures (Table 3).

Table 3 Convergence of the MoCA against second-level cognitive measures

Twenty-five out of 169 patients performed defectively on the MMSE (14.8%). Both raw and adjusted MoCA scores proved to be highly accurate to the aim of identifying patients with MMSE-confirmed cognitive dysfunctions (Fig. 2), also coming with sound diagnostics at the optimal cut-offs of ≤ 18 (J = 0.66; Se = 0.88; Sp = 0.78; PPV = 0.41; NPV = 0.97; LR +  = 3.96; LR- = 0.15; NNSU = 0.89) and < 19.015 (J = 0.72; Se = 0.92; Sp = 0.80; PPV = 0.44; NPV = 0.98; LR +  = 4.57; LR- = 0.10; NNSU = 0.84), respectively.

Fig. 2
figure 2

ROC curves for raw and adjusted MoCA against a defective MMSE score. Notes. ROC = receiver-operating characteristics; MoCA = Montreal Cognitive Assessment; MMSE = Mini-Mental State Examination. Raw scores (cut-off: ≤ 18): AUC = .89; SE = .04; CI 95% [.80, .97]; adjusted scores (cut-off: < 19.015): AUC = .91; SE = .04; CI 95% [.84, .99]. MoCA scores were adjusted according to Aiello et al. [9]

When applied to the whole sample, the two cut-offs yielded two moderately consistent classifications—with their agreement rate being of 88% (Cohen’s k = 0.74; z = 11.38; p < 0.001). Discrepancies were accounted for by 14 patients being classified as impaired by the raw cut-off but not by the adjusted one and by other 14 patients classified as impaired by the adjusted cut-off but not by the raw one.

Notably, when exploratively testing the diagnostic properties of the optimal cut-off values previously derived on raw scores by Biundo et al. [9]—i.e., ≤ 25—and by Federico et al. [10]—i.e., ≤ 24— for detecting mild cognitive impairment in PD, accuracy was low (Biundo et al. [9]: 29%; Federico et al. [10]: 36%) and nominal metrics were excessively biassed towards Se as well as overall unacceptable (Biundo et al. [9]: Se = 0.96; Sp = 0.17; PPV = 0.17; NPV = 0.96; LR +  = 1.16; LR- = 0.23; NNSU = 3.06; Federico et al. [10]: Se = 0.96; Sp = 0.26; PPV = 0.18; NPV = 0.97; LR +  = 1.29; LR- = 0.16; NNSU = 2.35). Consistently, an unlikely extreme proportion of patients was classified as impaired according to such cut-offs (Biundo et al. [9]: 88%; Federico et al. [10]: 81%).

Discussion

The present study provides Italian practitioners and clinical researchers with updated, and mostly unprecedented, evidence on the validity and diagnostic value of the MoCA in non-demented PD patients.

Indeed, the MoCA herewith proved to be an accurate estimate of cognitive efficiency in this population—as (1) being associated with a disease-specific measure of global cognition (i.e. the PD-CRS) and (2) converging with several second-level measures of both instrumental and non-instrumental cognitive domains/functions. Notably, such findings align with the international literature [5, 7].

As to its diagnostics, sound evidence has been herewith delivered on the fact that the MoCA is able to detect overall cognitive inefficiency (operationalized as a below-cut-off MMSE score) in non-demented PD patients. In this respect, it is worth noting that the cut-off derived on demographically adjusted MoCA scores slightly—but systematically—outperformed the “raw” cut-off. Such a finding is hardly surprising, given the undoubtable relevance of controlling for demographic confounders when interpreting test scores [14]. Hence, a MoCA score adjusted for age and education according to the most recent normative dataset [13] and falling below 19.015 should be addressed in the view of detecting cognitive impairment in this population.

In this respect, it is also worth noting that the raw cut-off herewith derived also clearly outperformed those previously suggested by Biundo et al. [9] and Federico et al. [10]—this strengthening, a posteriori¸ the rationale underlying the present study (i.e. the need of delivering up-to-date and generalizable thresholds).

The present study is of course not free from limitations.

The first, and the most relevant, issue lies in the operationalization herewith adopted for the positive state—i.e. a below-cut-off MMSE score. Indeed, albeit evidence on its feasibility as a cognitive screener in PD patients has been delivered in Italy [10], it has to be borne in mind that the MMSE should not represent the preferred choice to the aim of screening for cognitive impairment in this population—as recently highlighted by the MDS [5]. Indeed, the MMSE is mostly unable to capture executive deficits—which, however, typically characterise the cognitive profile of PD [31, 32]. Hence, it is mandatory that future studies aim at replicating the present findings by employing a different operationalization of the positive state—e.g. by means of a second-level cognitive battery or via a disease-specific cognitive screener (such as the PD-CRS). With that being said, the strong association detected between MMSE and MoCA scores within the current cohort, as well as the sound diagnostics herewith yielded, suggests that a sufficient degree of generalizability might be granted to the present cut-offs.

As to the second, it should be noted that PPVs proved to be relatively poor—even more if addressed within the context of the remaining metrics. However, such a finding is not surprising and does not in any case undermine the overall optimal diagnostic performance of the MoCA herewith detected. Indeed, at variance with the LR + , PPVs are heavily influenced by the prevalence of the positive state in the study sample: the lower the prevalence (such is the case for this investigation), the lower the PPV.

A final limitation lies in fact that the normative dataset herewith addressed, by Aiello et al. [13], has been derived from individuals in Northern Italy—whilst the present patient cohort was recruited in Southern Italy. In this respect, Aiello et al. [13] themselves noted that, in healthy subjects, the three normative datasets that are currently available for the MoCA in Italy, which comes from individuals residing in different Italian regions, might classify individuals’ cognitive status differently [13]. However, it should be noted that Aiello et al.’s [13] normative dataset includes a larger number of individuals and covers a wider range of age and education when compared to the previous ones published by Conti et al. [11] and Santangelo et al. [12] between 2014 and 2015—this having led the present Authors to opt for Aiello et al.’s [13]. Whilst it would be far beyond the aim of this study to do so, future investigations might aim at exploring these three normative datasets classify PD patients differently as to their normative status—similarly to what Salvadori et al. [33] recently did in stroke patients. Finally, it is worth mentioning the present PD cohort was not particularly heterogeneous as far as their age and level of educational attainment were concerned; hence, the cut-offs herewith provided might not be representative of all ranges of age and education, and future investigations should be aimed at filling such a gap.

In conclusion, the Italian MoCA is a valid and diagnostically sound screener for global cognitive inefficiency in non-demented PD patients, whose adoption is recommended within both clinical and research settings. Further studies are nevertheless needed that confirm its diagnostic values against a measure that is more appropriate than the MMSE.

Ethics

Participants provided informed consent. This study was approved by the Ethics Committee the University of Campania “Luigi Vanvitelli”.