Introduction

Cognitive impairment and dementia are common and debilitating non-motor symptoms of Parkinson’s Disease (PD). Cognitive impairment in PD carries distinct diagnostic challenges, a higher burden of care, worse functioning, and a lower quality of life1. Cross-sectional population studies show that ~30% of PD cases have dementia, with 20–25% of patients presenting with mild cognitive impairment (MCI)2 as early as at diagnosis3. Longitudinal studies report that, on average, 50% of PD patients develop dementia within 10 years4,5. Despite this high prevalence, significant cognitive impairment in the early stage of the disease is underdiagnosed in most clinical settings6, in part due to the complex and multi-domain nature of cognitive dysfunction in PD7. Several demographic and clinical measures have been shown to be predictive of cognitive impairment in PD, including age, visual hallucinations, REM sleep behavior disorder, and severity of parkinsonism, in particular non-tremor symptoms1. Moreover, considerable research interest has focused on identifying objective biomarkers, including structural and functional imaging, biofluid measures, and genetic risk8,9,10.

A major challenge for predicting cognitive outcome in PD is the high level of heterogeneity inherent in the condition, with large interindividual variation in clinical presentation and progression11. A potential solution is to use algorithms that combine multiple measures for individual-level cognitive outcome prediction12,13,14. Employing multivariate panels of data, however, brings limitations inherent in the complexity of multi-modal data. Compared to classical statistical methodology, learning-based methods benefit from being able to process high-dimensional and complex data, capturing both linear and nonlinear associations and extracting meaningful variables of interest15,16. Therefore, a growing area of research uses machine learning (ML) approaches both to identify data-driven subtypes of disease17,18 and to predict disease progression19,20,21, including future cognitive outcomes14,22.

In the present study, we assessed longitudinal records of cognitive diagnoses in the Parkinson’s Progression Markers Initiative (PPMI)23, a well-characterized cohort of early PD patients, and used multiple ML methods to predict cognitive outcome from baseline variables. We assessed prediction of two outcome measures over an 8-year period: (i) development of cognitive impairment (MCI or dementia) and (ii) development of dementia. Variables were split into three subsets: clinical measures, biofluid (CSF, serum) assays, and genetic/epigenetic markers measured in blood. These subsets were tested separately and in combination to assess the performance of ML methods.

For prediction, we applied four different machine learning algorithms (Random Forest [RF], ElasticNet, Support Vector Machines [SVM], and Conditional Inference Forest [Cforest]) and assessed the performance of each to determine whether particular learning approaches show better overall predictive accuracy. By applying multiple outcome measures, subsets of predictor variables, and ML algorithms, we aimed to test which combination showed the best overall predictive performance, establish powerful multivariate predictive models, and highlight important predictive variables included in these models.

Results

Prediction of cognitive outcomes

Using records of cognitive diagnosis over an 8-year period (Fig. 1), we defined two cognitive outcomes. The first tested development of overall cognitive impairment, comparing a group showing solely normal cognition or subjective cognitive decline (SCD) (n = 127) with a group developing MCI or dementia (n = 82). The second tested dementia development, comparing a dementia conversion group (n = 43) with a combined group of normal, SCD, and MCI cases (n = 166) (Fig. 1). Four ML algorithms were used for prediction from baseline variables, with each evaluated on metrics of overall accuracy. Descriptive statistical summaries of each cognitive outcome group are shown in Table 1. Baseline variables were binned into subsets of genetic/epigenetic (47 variables), biofluid (12 variables), and clinical (64 variables) measures (summarized in Supplementary Table 1) and tested individually and collectively. The accuracy of each ML algorithm for each variable subset and outcome is summarized in Fig. 2 and Table 2.

Fig. 1: Flow diagram of case subsetting criteria.

Samples retained at each stage are shown as black lines between boxes; samples excluded are shown as dotted gray lines and boxes. Case numbers for each selection stage are overlaid on the diagram. Final subset groups (Normal, SCD, MCI, and Dementia) are shown at the bottom of the flow diagram. MDS Movement Disorder Society, MoCA Montreal Cognitive Assessment, MCI Mild Cognitive Impairment, SCD Subjective Cognitive Decline.

Table 1 Summary statistics of demographic and selected clinical measures.
Fig. 2: Receiver operating characteristic plots for predicting cognitive impairment and dementia using selected clinical, genetic/epigenetic, and biofluid variables.

ROC curves are displayed in a grid with rows as cognitive outcome and columns as variable subset, colored by ML algorithm, with the highest AUC for each outcome and variable set displayed as a thicker line. AUC and MCC metrics are displayed as text for each plot. ROC receiver operating characteristic, AUC area under the curve, MCC Matthews Correlation Coefficient, ML machine learning, SVM Support Vector Machines, Cforest Conditional Inference Random Forest, RF Random Forest.

Table 2 Summary of predictive accuracy for individual ML methods.

Comparing the two outcomes, prediction of the cognitive impairment outcome showed better accuracy than prediction of dementia conversion, reflected by higher area under the receiver operating characteristic curve (AUC) and Matthews Correlation Coefficient (MCC) metrics for all variable subsets. The one exception was the biofluid variables, which, when evaluated solely on AUC, appeared to predict dementia conversion better than cognitive impairment. However, the MCC (Cforest = 0.38, SVM = 0.32, ElasticNet = 0.55, RF = 0.25) and sensitivity metrics show poor overall prediction of true dementia converters using biofluid variables (Table 2).

Overall, across both outcomes and variable sets, the best prediction was achieved for the cognitive impairment outcome using a combination of biological and clinical variables, reflected by high and balanced AUC and MCC values (Table 2). This represented a marginal improvement over prediction of the cognitive impairment outcome using the clinical variable subset alone. Combining biological and clinical variable types improved sensitivity over the clinical-only models, represented by a higher number of true cognitive impairment predictions (Table 2).

The genetic/epigenetic variables alone showed minimal predictive accuracy irrespective of cognitive outcome and ML algorithm tested, with near-random prediction (AUC between 0.40 and 0.65, MCC below 0.19; Fig. 2, Table 2).

Predictive variables for cognitive impairment outcome

Given that the best overall prediction was achieved using a combination of biological and clinical variables for the cognitive impairment outcome (predicting development of either MCI or dementia), we further investigated individual variable contributions using Shapley values. Shapley values can be interpreted as the additive relative importance of a particular variable to a model’s prediction (Methods). Variables included by at least three ML algorithms are shown in Fig. 3. Cognitive tests were heavily represented among overlapping models, with the Hopkins Verbal Learning Test-Revised (HVLT-R) Immediate/Total Recall and Delayed Recall scores, Symbol Digit Modalities (SDM), and Semantic Fluency Test (SFT) included in all four ML methods, and Benton Judgment of Line Orientation (BJLO), the HVLT-R Discrimination Score, Montreal Cognitive Assessment (MoCA), and the SFT—Vegetable subscore included in at least three (Fig. 3a).

Fig. 3: Variable importance in predicting cognitive impairment outcome.

Variables included across three or more ML models for prediction of the cognitive impairment outcome using combined clinical and biological variables. a A heatmap of global Shapley importance. Darker blue reflects higher Shapley value and more important variables in the model. Variables not included in a particular model are shown in gray. b Dual violin and box plots of raw values of each variable between groups. Average global Shapley value importance for each variable is shown in brackets next to each variable name. Boxes represent median, Q1 and Q3 of the interquartile range (IQR) and whiskers display 1.5× IQR below and above Q1 and Q3, respectively. HVLT Hopkins Verbal Learning Test, MOCA Montreal Cognitive Assessment, CSF cerebrospinal fluid, STAI State-Trait Anxiety Inventory, UPSIT University of Pennsylvania Smell Identification Test, SFT semantic fluency test, ML machine learning.

Noncognitive clinical measures included in multiple models were age of symptom onset, State-Trait Anxiety Inventory (STAI) scores (total and state subscore), and the University of Pennsylvania Smell Identification Test (UPSIT) for olfactory impairment. In these combined models, three biological variables showed consistently high contribution across multiple models: the CSF ratios of phospho-tau to amyloid-β (1–42) and of total-tau to amyloid-β (1–42), as well as blood DNA methylation at cg13953978 (Fig. 3a). Differences in overlapping variables are shown in Fig. 3b, highlighting the direction of effect for each variable between cognitively intact and impaired groups.

Examining correlation between the top predictive variables included across multiple models, we found that eight show collinearity (Pearson’s correlation > 0.7): HVLT Immediate and Delayed Recall, SFT total score and SFT—Vegetable subscore, STAI total and state subscores, and the CSF ratios of phospho-tau to amyloid-β (1–42) and total-tau to amyloid-β (1–42). By contrast, eight variables (SDM, age of symptom onset, BJLO, methylation at cg13953978, HVLT Discrimination Score, LNS, MoCA, and UPSIT) show a higher degree of independence (all Pearson’s correlations < 0.6).

Genetic variables were conspicuous in their absence from the overlapping contributing variables but were present in certain models; for example, GBA nonsynonymous mutations were included for both Cforest and ElasticNet. Summarized Shapley value contributions across all tested algorithms are shown in Supplementary Figs. 1–4. As a graphical representation of prediction in our best performing model (Cforest), Supplementary Fig. 5 displays a surrogate decision tree, built by aggregating the best performing decision trees within the forest, containing a mix of biological and clinical variables. Note that this representation does not contain all variables included in the entire decision forest.

The effect of cognitive tests on predictive accuracy

Since a large proportion of the top predictive variables were cognitive tests (9 of 16, Fig. 3a), we tested the sensitivity of predictions made without cognitive variables. As Cforest models performed best on the clinical subset, we explored the sensitivity of predictions with and without cognitive variables using this algorithm. Clinical variables were split into cognitive-only and noncognitive subsets as annotated in Supplementary Table 1. As shown in Fig. 4, cognitive variables alone (AUC = 0.90, MCC = 0.54) performed better than noncognitive variables alone (AUC = 0.86, MCC = 0.46). Combining the two subsets into an overall clinical model showed a marginal increase in AUC (from 0.90 to 0.93) but a larger increase in sensitivity, reflected by an increase in MCC from 0.54 to 0.70.

Fig. 4: Sensitivity analysis of cognitive variables.

ROC showing prediction of the cognitive impairment outcome using Cforest applied on clinical subsets. Noncognitive variables: dotted line, cognitive variables: dashed line, all clinical variables: solid line. Summary of AUC and MCC metrics for each subset shown in plot text. AUC area under the curve, MCC Matthews Correlation Coefficient, Cforest Conditional Inference Random Forest, ROC receiver operating characteristic.

Stratification of PD-MCI from PD-dementia

Given that MCI represents an intermediate stage between normal cognition and dementia, we next tested whether ML methods could accurately distinguish the 43 PD-dementia cases from the 39 PD-MCI cases without recorded progression to dementia over the same follow-up period (Supplementary Fig. 9). Across variable subsets, clinical variables showed the best individual performance, which increased in combination with biological variables. However, the models lacked overall accuracy (AUC 0.69–0.75), with low MCC values (0.177–0.470) in particular, reflecting a low specificity (0.4–0.6) of dementia prediction. Together, these results indicate that the generated ML models lack the accuracy to resolve dementia from MCI over the timescale tested.

Discussion

In the present study, we tested the prediction of two cognitive outcome measures in newly diagnosed PD subjects within 8 years, using multiple variable subsets and ML algorithms. The generated models were assessed on metrics of prediction accuracy and the importance of contributing variables. We found that combining biological and clinical variables produced the best performing models, with a marginal improvement in predictive performance compared to models using clinical variables alone. We interpret this as evidence of a synergistic contribution of multivariate data types producing the most accurate predictions. Of the individual variable subsets, the most accurate and balanced prediction was achieved when testing for cognitive impairment (MCI and dementia combined) using clinical data, giving the highest AUC and MCC metrics and the best balance of sensitivity and specificity. When evaluated individually, nonclinical measures (biofluid and genetic/epigenetic) showed poor predictive performance, regardless of the outcome tested and ML algorithm used.

Comparing outcomes, prediction of combined cognitive impairment (merging cases developing either MCI or dementia) consistently outperformed prediction of dementia conversion alone, which we interpret as being driven by poor differentiation of MCI individuals when predicting dementia conversion. Indeed, models tested to stratify MCI from dementia cases performed poorly, with low specificity. MCI is a well-established risk factor for future dementia development4, and previous studies show that higher dropout within PPMI is associated with worse cognitive performance24. Given this, the overall progression profiles of the MCI and dementia subsets within this study might not differ substantially, with MCI patients potentially converting to dementia in unobserved events. This further supports the use of a combined cognitive impairment group, for which the best prediction was observed.

Unsurprisingly, a high number of contributing variables were cognitive assessments, indicating that a degree of cognitive change was already present at baseline. This highlights a potential limitation of including these variables, as they are highly associated with the outcome we aimed to predict. However, these measures reflect assessments made 1–7 years before a clinically diagnosed conversion to either MCI or dementia. Sensitivity analysis confirmed that cognitive variables had a large contributory effect on predictions, although sensitivity increased with the inclusion of noncognitive clinical variables. Top contributing noncognitive variables included age at onset of PD, anxiety, and olfactory impairment. Older age of PD onset, which we observe within the cognitive decline group, is a well-established and validated risk factor for PD-cognitive decline4. Olfactory impairment has been increasingly associated with cognitive impairment in PD25,26,27,28. Although anxiety is less established as a predictive variable for cognition within PD29, it has been reported as a predictor of worse cognitive prognosis in general population studies30.

Within the combined models utilizing both biological and clinical variables, ratios of the CSF protein measures total-tau, phospho-tau, and amyloid-β (1–42) had a high contributory effect across multiple ML algorithms. Additionally, one measure of blood DNA methylation, cg13953978, was included in multiple combined models. This locus has previously been associated with multiple neurodegenerative diseases and, of note, we observe the same direction of effect between cognitively impaired and preserved individuals in this study as in previously reported findings31.

Several studies have aimed to create an accurate model to predict cognitive outcome in PD using the PPMI cohort13,22,32. Compared to previous studies, we included a larger range of biological variables, including polygenic scores for multiple related traits and epigenetic measures. We used MDS criteria for defining cognitive performance at each follow-up in place of the commonly used MoCA. Additionally, we included a long follow-up period and excluded reverters from the modeling.

To improve the accuracy and generalizability of our models compared to previously reported models, we employed a multi-objective model optimization procedure using three criteria (AUC, MCC, and number of variables). Although AUC is commonly used for model interpretation, it is insensitive to class imbalance. Therefore, to prevent inaccurate assessment of prediction, we included MCC, as this metric evaluates accuracy while accounting for class balance. This, along with recursive feature elimination (RFE)33,34 and k-fold cross-validation, further reduced the risk of overfitting and addressed the high number of variables in this dataset. We applied multiple ML algorithms, covering a range of different learning strategies, and applied RFE and multi-objective optimization uniformly to each.

A potential limitation of this study is the curated manner in which cognitive groups were defined and the relatively small sample size available. We justify our approach to defining cognitive groups as we aimed to represent individuals with clinically relevant diagnoses confirmed by multiple observations over time. However, due to data missingness and attrition within PPMI, a number of de novo cases enrolled at baseline were not tested within our models.

A potential caveat of this study is its broader applicability to samples outside of PPMI. Replication efforts in additional cohorts are hampered by the unique nature of PPMI as a cohort: how thoroughly these individuals are assessed, the early de novo stage at which they were enrolled, and the longitudinal observations available, in particular the MDS-criteria cognitive diagnosis used as the outcome here. To our knowledge, a viable cohort covering these domains is not currently available.

PPMI’s de novo stage has important implications for broader applicability when comparing against prediction models of cognitive progression at later disease stages. Phongpreecha et al.14, using cases from the Pacific Udall Center (PUC) cohort, tested multitask models for prediction of future yearly incidence of MCI and dementia diagnoses. They report the highest accuracy for prediction of dementia and retained normal cognition, with the lowest accuracy for MCI prediction, largely consistent with our findings in the PPMI cohort. Furthermore, they highlight cognitive measures as the most important variables in their model, in line with our findings following RFE. However, they report higher AUC measures for their dementia conversion predictions than we observed here. This may be attributable to the different distribution of disease stages among PUC PD patients compared to the newly diagnosed PPMI patients.

Salmanpour and colleagues35 have also employed machine learning to predict cognitive outcomes in PPMI. Our studies differ, however, in the cognitive outcome tested: we used conversion of MDS-criteria cognitive diagnosis, whereas they used MoCA at year 4. We also explored a larger range of biological measures and restricted predictor input solely to baseline, while Salmanpour and colleagues also incorporated year 1 measures in their models. Differences in methodology and outcome measure make direct comparison difficult; nevertheless, some interesting consistencies between the two studies are evident, in particular the finding of baseline state-trait anxiety as a predictive measure.

A previous study by Liu et al.13 developed a multivariate predictor of global cognitive impairment in a large multi-cohort analysis. The predictive score showed high performance, with high positive predictive value (0.87) and negative predictive value (0.92), utilizing solely age at onset, MMSE, education, motor exam score, gender, depression, and GBA mutation status. This predictive model benefits from generalizability, both as a result of the high number of samples used to validate it and the low number of variables required for prediction. However, the multi-center design of the study introduces a high level of heterogeneity, both in the disease stages included and in the outcome measures used to define cognitive impairment36, whereas both are highly consistent within our study. Furthermore, due to the range of variables included in PPMI, we were able to explore a broader range of biological and clinical predictors in the present study.

Our finding of DNA methylation at cg13953978 as a predictive variable requires further replication to ensure it is not the result of unknown cryptic stratification in this cohort. Previous associations of this locus with neurodegenerative disease across multiple cohorts do, however, support it as a potential biomarker. Expanding the number of genetic and epigenetic variables in future studies to the genome-wide level, in cohorts designed around cognitive decline prediction, is also essential to truly uncover potential predictive efficacy. However, due to the challenge of handling the high number of variables implicit in multi-omics data24,37,38,39, this was outside the scope of the current study.

Although not explored in this study, neuroimaging represents an important additional data modality for incorporation into cognitive predictive models in future work. A number of studies have highlighted structural underpinnings of PD-MCI and dementia40,41, and in the present study we highlight four cognitive tests consistently incorporated across multiple ML algorithms. Taking these measures of perturbed cognitive domains as indicative of structural changes in the brain, executive dysfunction, as measured by the semantic fluency score, can be interpreted as evidence of associated frontal lobe atrophy41. Some studies have associated verbal memory, here measured by the HVLT, with differences in the inferior frontal gyrus42 and, in the context of PD, with functional changes in the anterior cingulate and orbitofrontal cortex43. Our finding that the attentional test assessed by SDM contributes to prediction supports studies relating attentional effects to striatal dopamine in dopamine active transporter (DAT) imaging44 and to microstructural changes in the anterior cingulate and frontal cortex using diffusion tensor imaging (DTI)45.

In summary, after evaluating multiple predictive variable types and outcomes, we established a model that accurately predicted cognitive impairment and preserved normal cognition over an 8-year follow-up. This prediction was largely driven by clinical measures, both known risk factors and more novel measures, but also variably included biological variables. This work supports evidence for anxiety and olfactory impairment as potential predictors of cognition in PD and highlights epigenetic measures of DNA methylation as biological predictive variables requiring further investigation.

Methods

Participants and cognitive assessment

All data used in this study were obtained from the PPMI18 database (https://ida.loni.usc.edu/). All participating PPMI sites received approval from an ethical standards committee before study initiation, and written informed consent was obtained from all individuals participating in the study. The study is registered at clinicaltrials.gov (NCT01141023). Participants were selected from the de novo PD cohort, defined by a diagnosis of the disease within 2 years and no medication for motor symptoms at baseline (n = 423). Subjects underwent yearly cognitive diagnosis in accordance with Movement Disorder Society (MDS) recommended criteria for dementia and MCI, as previously reported19,20,21. In brief, a confirmed MCI diagnosis was based on impaired performance on at least two test scores, >1–2 standard deviations below a standardized mean46. Dementia diagnosis alongside clinical annotation required impaired performance in at least two cognitive domains coinciding with significant functional impairment resulting from cognitive state47.

Records of cognitive diagnoses from baseline to year 8 were sourced from PPMI, following their routine application of the above criteria, and used to create three groups of PD patients with distinct cognitive outcomes (Fig. 1 and Supplementary Fig. 6):

PD-Dementia

Cases with any diagnosis of dementia over the 8-year time span were annotated as dementia conversion cases, excluding one individual who reverted to normal cognition after an annotation of dementia (n = 43).

PD-MCI

Cases with any record of MCI and no annotation of a subsequent dementia diagnosis (n = 39) were annotated as PD-MCI conversion cases. This group excludes 14 cases that reverted to normal cognition following MCI annotation.

Cognitively intact (CI)

To avoid any effect of attrition and of cognitive decline in unobserved events, cases defined as cognitively intact required a minimum of five records of normal cognition or subjective cognitive decline (SCD) during recorded visits up to year 8 (n = 127). This excluded 175 cases with missing values or indeterminate diagnoses.

Subsequently, we used these groups to define two separate binary outcomes for machine learning-based prediction as follows:

Cognitive impairment outcome

Conversion to cognitive impairment within the 8-year time span, comparing the CI group (n = 127) to an impaired group created by combining the PD-Dementia and PD-MCI groups (n = 82).

Dementia conversion outcome

Conversion to dementia within the 8-year time span, comparing the PD-Dementia group (n = 43) to a non-converting group created by combining the PD-MCI and CI groups (n = 166).

Epigenomic and genomic profiling

Genotyping and polygenic scores calculation

Whole blood DNA genotyping was previously performed on the NeuroX SNP array by PPMI investigators using published methods48. Raw data from 423 individuals covering 267,607 variants were quality control (QC) assessed following published recommendations49. In brief, data were excluded on the following criteria: (1) variants and individuals with missingness >0.1; (2) individuals with discordant reported and inferred sex (X chromosome homozygosity F-value >0.8 for males, <0.2 for females); (3) variants with minor allele frequency <0.01 or >0.05; (4) variants deviating from Hardy–Weinberg equilibrium (p < 1e-3); (5) individuals with a heterozygosity rate beyond ±3 standard deviations from the mean; (6) individuals with evidence of cryptic relatedness (pi-hat >0.2). Following initial QC, autosomal data were extracted, PLINK files were converted to VCF format and uploaded to the Michigan Imputation Server. Imputation was conducted using Eagle2 for haplotype phasing and Minimac4 with the 1000 Genomes reference panel (phase 3, version 5). An R2 filter for imputation quality was set at 0.3. Following imputation, data were downloaded, converted to PLINK format, and quality assessed against the same criteria. Finally, genetic principal components were generated together with reference data from the 1000 Genomes Project, and non-European cases were removed based on qualitative assessment of clustering on the first two principal components. Five hundred and eighty-two cases passed QC (total variants post-imputation n = 2,287,446).

Polygenic risk scores (PRS) were calculated using summary statistics from recent genome-wide association studies (GWAS) of Alzheimer’s disease (AD)50, PD51, educational attainment (EA)52, schizophrenia (SCZ)53, major depressive disorder (MDD)54, and coronary artery disease (CAD)55. For AD, the effect of the APOE region was excluded by removing the region chr19:45,116,911–46,318,605. For PD, the effect of the GBA region was excluded by removing the region chr1:154,600,000–156,600,000. PRSice-2 software56, which automates clumping and p-value thresholding to generate a “best-fit PRS” for a target phenotype of interest, was used for PRS calculation. Briefly, clumping was performed to retain the most significant GWAS variants in a linkage disequilibrium (LD) block (250 kb window, r2 threshold = 0.1). The PRS model is tested over an increasing set of p-value thresholds (5e-08 to 1), with the optimal threshold selected as that which generates a score explaining the maximum phenotypic variance in the target phenotype. Phenotype was coded as a binary factor of 0 (Control) and 1 (PD) for this analysis, with the first eight genetic principal components used as covariates57.

DNA methylation data processing

Whole blood genome-wide DNA methylation in the PPMI cohort at baseline was profiled on the Illumina EPIC array, as previously reported58. Methylation-derived variables included previously associated methylated loci as well as epigenetic age predictions. Raw IDAT files were downloaded from the PPMI database (https://ida.loni.usc.edu/) in April 2020 and processed using the R package wateRmelon59. For epigenetic age prediction and age acceleration analysis, non-normalized beta values were uploaded to the web-based tool https://dnamage.genetics.ucla.edu, selecting the “normalize data” and “advanced analysis” options. For inclusion of specific epigenetic loci, data were quality controlled and normalized following established pipelines59. Briefly, samples with low signal intensities, a low bisulfite conversion rate, mismatched reported and imputed sex, or cryptic relatedness were excluded. P-filtering was applied using the ‘pfilter’ function in the wateRmelon package, excluding samples with >1% of probes with a detection p-value > 0.05 and probes with >1% of samples with a detection p-value > 0.05. Beta values for each probe were quantile normalized using the ‘dasen’ function.
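A minimal sketch of this filtering and normalization flow, assuming IDATs are loaded with minfi into an RGChannelSet; the exact methods dispatched by ‘pfilter’ and ‘dasen’ depend on the object class and package versions:

```r
library(minfi)       # read.metharray.exp(), getBeta()
library(wateRmelon)  # pfilter(), dasen()

# Load raw EPIC IDATs into an RGChannelSet (directory path is illustrative)
rgSet <- read.metharray.exp(base = "PPMI_idats/")

# P-filter with wateRmelon defaults, matching the thresholds described:
# drop samples with >1% of probes at detection p > 0.05 and probes
# failing (detection p > 0.05) in >1% of samples
rgSet.pf <- pfilter(rgSet)

# 'dasen' quantile normalization, then extract normalized betas
betas <- getBeta(dasen(rgSet.pf))
```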

Baseline data

Baseline data for all 423 PD cases were sourced from PPMI and processed into four sets of variables (Supplementary Table 1). Clinical variables: these included demographic variables (sex, age of onset, years in education, duration of disease, family history of PD), motor symptoms (MDS-UPDRS Part 2 and 3 total scores, rigidity score, tremor-dominant/postural instability and gait disorder classification, Hoehn and Yahr [H&Y] scale, Modified Schwab & England Activities of Daily Living [ADL] Score), psychiatric symptoms (MDS-UPDRS Part 1 subscores, Geriatric Depression Scale [GDS], Questionnaire for Impulsive-Compulsive Disorders, State-Trait Anxiety Inventory), autonomic symptoms (SCOPA-autonomic subscores), sleep disorders (Epworth Sleepiness Scale Score [ESS], categorical REM Sleep Behavior Disorder Questionnaire subscore, MDS-UPDRS Part 1 subscores), and olfactory function measured by the University of Pennsylvania Smell Identification Test (UPSIT). Assessments of cognition (Semantic Fluency Test [SFT], Symbol Digit Modalities [SDM], MDS-UPDRS Part 1 subscores, Montreal Cognitive Assessment [MoCA], Hopkins Verbal Learning Test-Revised [HVLT-R] subscores, Benton Judgment of Line Orientation [BJLO]) were also included.

Biofluid variables

CSF measures for amyloid-β (1–42), phospho-tau181, total-tau, and α-synuclein were included, after removing cases showing high levels of CSF hemoglobin (>200 ng/mL) as previously described60,61. Ratios of each measure were also included as independent predictive variables. Total serum uric acid was also included as previously described62.

Genetic and epigenetic variables

Genetic variables included individual APOE genotype, MAPT haplotype, and the SNPs rs1241121663, rs35618164, and rs391010565. GBA mutation status was included as a binary factor for the presence of any nonsynonymous coding mutation within the GBA region. PRS for PD (GBA region excluded), AD (APOE region excluded), EA, SCZ, MDD, and CAD were also included.

After stringent quality control and normalization of the whole-genome DNA methylation data measured in baseline blood, 21 loci were selected based on previously reported differentially methylated positions associated with cognitive decline in PD66 or across neurodegenerative diseases31. Epigenetic age acceleration measures from the GrimAge clock67, the BloodAndSkin clock68, and the modified Hannum clock, which includes measures of both intrinsic epigenetic age acceleration (IEAA) and extrinsic epigenetic age acceleration (EEAA, incorporating intrinsic measures as well as blood cell proportions)69, were included as additional epigenetic variables.

Combined biological and clinical variables

This variable set collated all previously listed variables across the clinical, biofluid and epigenetic/genetic subsets into one combined total set.

Summary lists of measures used for predictive modeling are shown in Supplementary Table 1 and descriptive statistics in Table 1. All measures highlighted in this summary table were carried forward for multivariate modeling.

Data processing

Imputation

Each baseline variable was evaluated for the proportion of missing observations, and missing values were imputed using the available data for that variable. For ordinal and categorical variables, the mode was used for imputation; for continuous variables, the median. Median/mode imputation was chosen based on a simulation analysis showing better accuracy compared to k-nearest neighbors (KNN), Multivariate Imputation via Chained Equations (MICE), and hot-deck algorithms (Supplementary Fig. 10). The full dataset was assessed for missing values, generating the missing-value fraction per variable (Supplementary Table 1). Samples containing any missing value were removed to produce a dataset with complete observations for all available variables, hereafter called the reference dataset. Missing values were then induced at random in the reference dataset, according to the proportion of missing values per variable, to generate a test dataset. The imputation methods median/mode, ‘knn’70, ‘hotdeck’71, and ‘mice’72 were used to impute the missing values in the test dataset. Root mean square error (RMSE) between the test and reference datasets was calculated per variable and summed to give an overall performance error score. This process was repeated 100 times, randomizing which values were flagged as missing in each loop, to assess the stability of the imputation. The total RMSE (mean + sd) was displayed per variable subset to indicate which methods perform best per variable type. Additionally, the proportion of missing values was compared to the average RMSE per variable.

The total error showed the same pattern across variable subsets (Supplementary Fig. 9): median/mode imputation showed the least average error, followed by knn, hotdeck, and mice. Comparing the proportion of missing values to the average RMSE showed that a higher proportion of missing values contributes to a higher average RMSE. Median/mode imputation was therefore applied to the actual data, as it showed the best performance in minimizing average imputation error.
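A minimal sketch of the median/mode imputer and the RMSE simulation described above, assuming a complete-case data frame `reference` and a named vector `miss_frac` of observed per-variable missingness (both names are illustrative):

```r
# Median/mode imputation, applied column-wise: continuous columns get the
# median of observed values, ordinal/categorical columns get the mode
impute_median_mode <- function(df) {
  for (v in names(df)) {
    miss <- is.na(df[[v]])
    if (!any(miss)) next
    if (is.numeric(df[[v]])) {
      df[[v]][miss] <- median(df[[v]], na.rm = TRUE)
    } else {
      tab <- table(df[[v]])
      df[[v]][miss] <- names(tab)[which.max(tab)]
    }
  }
  df
}

# Simulation: induce missingness in the reference data at the observed
# per-variable rates, impute, and score by summed per-variable RMSE
# (continuous variables only, for simplicity); repeated 100 times
set.seed(1)
rmse_total <- replicate(100, {
  test <- reference
  for (v in names(test)) {
    n_miss <- round(miss_frac[v] * nrow(test))
    test[[v]][sample(nrow(test), n_miss)] <- NA
  }
  imputed <- impute_median_mode(test)
  num <- vapply(reference, is.numeric, logical(1))
  sum(sqrt(colMeans((imputed[num] - reference[num])^2)))
})
c(mean = mean(rmse_total), sd = sd(rmse_total))
```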

Stratification

Due to imbalance in the size of the outcome groups, stratified sampling was used to account for potential training imbalance and testing bias73, using the ‘stratified’ function from the splitstackshape R package (version 1.4.8). Sampling considered the proportion of outcome groups, the proportion of MCI and dementia cases, as well as sex and categorical age (1: <56 years, 2: 56–65 years, 3: >65 years). A 60/40 train/test split was chosen to increase the number of samples in the test set and thus improve evaluation of the final models.
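A sketch of this split, assuming a data frame `dat` with illustrative column names for the outcome and stratification factors (SAMP1/SAMP2 are the element names returned by ‘stratified’ with bothSets = TRUE):

```r
library(splitstackshape)  # provides stratified()

set.seed(42)
# Sample 60% within each stratum; bothSets = TRUE returns both partitions
split <- stratified(dat,
                    group    = c("outcome", "subtype", "sex", "age_cat"),
                    size     = 0.6,
                    bothSets = TRUE)
train_dat <- as.data.frame(split$SAMP1)  # 60% training split
test_dat  <- as.data.frame(split$SAMP2)  # remaining 40% for testing
```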

Data transformation

The baseline data contain three types of variables: categorical, ordinal, and continuous. To ensure each variable had a similar influence during the ML process, Z-score normalization was performed on the continuous variables using the base R function ‘scale’, with parameters computed on the training set74,75. The ‘center’ and ‘scale’ parameters were stored per variable and used to rescale the training and testing data accordingly.
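A sketch of this training-set-based scaling, reusing `train_dat`/`test_dat` from above (column selection is illustrative and assumes plain data frames):

```r
# Z-score continuous columns using statistics from the training set only
cont <- names(train_dat)[vapply(train_dat, is.numeric, logical(1))]

scaled  <- scale(train_dat[cont])            # centers and scales on training data
centers <- attr(scaled, "scaled:center")     # stored per-variable 'center'
scales  <- attr(scaled, "scaled:scale")      # stored per-variable 'scale'

train_dat[cont] <- as.data.frame(scaled)
test_dat[cont]  <- as.data.frame(scale(test_dat[cont],
                                       center = centers, scale = scales))
```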

Machine learning

Training and selected algorithms

The R package caret (version 6.0.90) was used to establish the machine learning workflow and tune hyperparameters76. We used four different classifiers from three machine learning families: RF (‘rf’) and conditional inference forest (‘cforest’) from the random forest family, SVM with a linear kernel (‘svmLinear’) from the support vector machine family, and ElasticNet (‘glmnet’) from the generalized linear model family. RF and Cforest are information-based learning algorithms, whose behavior is determined by concepts from information theory77. RF algorithms are based on a majority vote of a collection of different decision trees. Cforest differs from RF in that it does not select variables by maximizing an information measure but by a permutation test for significance78. SVM and ElasticNet are error-based learning algorithms, whose behavior is explained by minimizing total error during training77. SVM algorithms are based on generating the best possible separation between classes of interest in a hyperdimensional plane. ElasticNet is a generalized linear model with L1 and L2 regularization, able to shrink or drop coefficients to achieve a better model fit.
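A minimal caret training sketch for one of the four classifiers; the other three swap the `method` string (‘rf’, ‘svmLinear’, ‘glmnet’). Object names are illustrative, and reading “a maximum of 100 tunes” as a `tuneLength` budget is our assumption:

```r
library(caret)

# 10-times repeated 10-fold cross-validation with class probabilities
ctrl <- trainControl(method          = "repeatedcv",
                     number          = 10,
                     repeats         = 10,
                     classProbs      = TRUE,
                     summaryFunction = twoClassSummary)

fit <- train(outcome ~ .,
             data       = train_dat,
             method     = "cforest",   # conditional inference forest
             metric     = "ROC",       # optimized during hyperparameter tuning
             trControl  = ctrl,
             tuneLength = 100)         # assumed reading of "maximum of 100 tunes"

probs <- predict(fit, newdata = test_dat, type = "prob")
```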

Tuning

To avoid overfitting during training, 10-times repeated 10-fold cross-validation was used. During the training process, hyperparameter tuning was enabled with a maximum of 100 tuning candidates to promote model accuracy. To prevent optimistically inflated results on imbalanced datasets, we used MCC alongside AUC to evaluate model accuracy75,79.
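MCC is computed from the confusion matrix; a small helper written from the standard definition (not a caret built-in):

```r
# Matthews Correlation Coefficient from predicted and true class labels;
# unlike raw accuracy, it remains informative under class imbalance
mcc <- function(pred, truth, positive = levels(truth)[1]) {
  tp <- sum(pred == positive & truth == positive)
  tn <- sum(pred != positive & truth != positive)
  fp <- sum(pred == positive & truth != positive)
  fn <- sum(pred != positive & truth == positive)
  denom <- sqrt(tp + fp) * sqrt(tp + fn) * sqrt(tn + fp) * sqrt(tn + fn)
  if (denom == 0) return(0)  # conventional value for an empty margin
  (tp * tn - fp * fn) / denom
}
```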

Variable selection and model generalization

Recursive feature elimination (RFE) was applied as the variable selection algorithm. In brief, RFE iterates through generations of models on a shrinking training set, eliminating the worst contributing variable at each iteration80. The first model was trained using all available variables, and the resulting evaluation metrics were extracted and stored. Variable importance was calculated for the generated model using the ‘varImp’ function in caret, and the least contributing variable was flagged for removal in the next iteration. The updated training data were used to train a new model, and the process was repeated until one variable remained, resulting in a series of models with decreasing numbers of variables.
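A sketch of this backward-elimination loop around caret, reusing `ctrl` from above (`train_dat` and the stored metrics are illustrative):

```r
vars    <- setdiff(names(train_dat), "outcome")
history <- list()

while (length(vars) >= 1) {
  fit <- train(x = train_dat[vars], y = train_dat$outcome,
               method = "cforest", metric = "ROC", trControl = ctrl)

  # store the evaluation metrics for this model size
  history[[length(vars)]] <- list(n_vars = length(vars),
                                  roc    = max(fit$results$ROC),
                                  vars   = vars)

  if (length(vars) == 1) break
  imp  <- varImp(fit)$importance                             # caret variable importance
  vars <- setdiff(vars, rownames(imp)[which.min(imp[, 1])])  # drop worst variable
}
```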

Optimal model selection

To reduce generalization error, a multi-objective optimization procedure was applied, utilizing the MCC, AUC, and number of variables of the model from each RFE iteration81. MCC and AUC were chosen because MCC is calculated on binary class assignments while AUC is calculated from class probabilities, allowing model selection to benefit from the properties of MCC and the resolution of AUC, and ensuring model generalization with higher accuracy. Moving averages of these metrics (window = 5) were calculated and ranked (Supplementary Figs. 7 and 8). Taking the mean rank of the moving averages provides a scale comparable to the number of variables in each ith model. From this, we calculated an optimal model score by adding the number of variables to the average rank, as shown in Eq. (1). This results in an optimization curve highlighting the best performing model with the lowest number of variables. The model with the lowest score was selected as the optimal model, as it shows the highest accuracy, most balanced prediction, and fewest variables.

$$\mathrm{Optimal\ model\ score}_i = \mathrm{number\ of\ variables}_i + \frac{\mathrm{rank}\left(\mathrm{MA}(\mathrm{MCC}_i)\right) + \mathrm{rank}\left(\mathrm{MA}(\mathrm{AUC}_i)\right)}{2}$$
(1)
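A sketch of Eq. (1) over the per-iteration metrics, assuming vectors `n_vars_i`, `mcc_i`, and `auc_i` collected during RFE; ranking with a negative sign (best metric gets rank 1) is our reading of the rank direction, since the lowest total score is selected:

```r
# Centered moving average with window 5, falling back to the raw value
# where the window is incomplete at the edges
ma <- function(x, w = 5) {
  f <- stats::filter(x, rep(1 / w, w), sides = 2)
  ifelse(is.na(f), x, f)
}

# Eq. (1): number of variables plus the mean rank of the moving-average
# metrics; the model with the lowest score is selected as optimal
optimal_score <- n_vars_i + (rank(-ma(mcc_i)) + rank(-ma(auc_i))) / 2
best <- which.min(optimal_score)
```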

Testing

The optimal model was used for class prediction on the test dataset, yielding several evaluation metrics (AUC, MCC, accuracy, sensitivity, specificity) as well as other evaluation elements (such as confusion matrices, receiver operating characteristic (ROC) curves, and individual variable difference plots).

Variable importance calculation

Shapley values were used to assess the importance of variables included in the models following RFE. Shapley values are a concept from cooperative game theory, interpreted in the context of ML as a variable’s contribution to a prediction. Shapley values were calculated for the interpretation of individual variables included in the best performing models. Using the package iml (version 0.10.1), a predictor object was generated containing the model of interest and the test dataset. This predictor object was used to calculate Shapley values per sample, with 10,000 Monte Carlo simulations. The resulting absolute Shapley values were averaged over all samples, yielding a global Shapley contribution per variable82.
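A sketch of this calculation with iml, reusing `fit` and `test_dat` from above; note that looping over the full test set at 10,000 Monte Carlo samples is computationally heavy, and the per-class layout of `$results` is an assumption about iml’s output shape:

```r
library(iml)

X <- test_dat[, setdiff(names(test_dat), "outcome")]

# Wrap the fitted model and test data; type = "prob" uses class probabilities
predictor <- Predictor$new(fit, data = X, y = test_dat$outcome, type = "prob")

# Shapley values per sample (10,000 Monte Carlo samples), then global
# importance as the mean absolute Shapley value per variable
global_shap <- sapply(seq_len(nrow(X)), function(i) {
  s <- Shapley$new(predictor, x.interest = X[i, ], sample.size = 10000)
  tapply(abs(s$results$phi), s$results$feature, mean)  # average over classes
})
sort(rowMeans(global_shap), decreasing = TRUE)  # global contribution per variable
```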

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.