Predicting disease severity in multiple sclerosis using multimodal data and machine learning

Andorra, Magi; Freire, Ana; Zubizarreta, Irati; de Rosbo, Nicole Kerlero; Bos, Steffan D.; Rinas, Melanie; Høgestøl, Einar A.; de Rodez Benavent, Sigrid A.; Berge, Tone; Brune-Ingebretse, Synne; Ivaldi, Federico; Cellerino, Maria; Pardini, Matteo; Vila, Gemma; Pulido-Valdeolivas, Irene; Martinez-Lapiscina, Elena H.; Llufriu, Sara; Saiz, Albert; Blanco, Yolanda; Martinez-Heras, Eloy; Solana, Elisabeth; Bäcker-Koduah, Priscilla; Behrens, Janina; Kuchling, Joseph; Asseyer, Susanna; Scheel, Michael; Chien, Claudia; Zimmermann, Hanna; Motamedi, Seyedamirhosein; Kauer-Bonin, Josef; Brandt, Alex; Saez-Rodriguez, Julio; Alexopoulos, Leonidas G.; Paul, Friedemann; Harbo, Hanne F.; Shams, Hengameh; Oksenberg, Jorge; Uccelli, Antonio; Baeza-Yates, Ricardo; Villoslada, Pablo

doi:10.1007/s00415-023-12132-z

Predicting disease severity in multiple sclerosis using multimodal data and machine learning

Original Communication
Open access
Published: 22 December 2023

Volume 271, pages 1133–1149, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Neurology Aims and scope Submit manuscript

Predicting disease severity in multiple sclerosis using multimodal data and machine learning

Download PDF

Magi Andorra¹^na1,
Ana Freire²^na1^nAff18,
Irati Zubizarreta¹,
Nicole Kerlero de Rosbo^3,4,
Steffan D. Bos^5,6,
Melanie Rinas⁷,
Einar A. Høgestøl^5,6,
Sigrid A. de Rodez Benavent^5,6,
Tone Berge^6,8,
Synne Brune-Ingebretse^5,6,
Federico Ivaldi⁹,
Maria Cellerino³,
Matteo Pardini^3,4,
Gemma Vila¹,
Irene Pulido-Valdeolivas¹,
Elena H. Martinez-Lapiscina¹,
Sara Llufriu¹,
Albert Saiz¹,
Yolanda Blanco¹,
Eloy Martinez-Heras¹,
Elisabeth Solana¹,
Priscilla Bäcker-Koduah¹⁰,
Janina Behrens¹⁰,
Joseph Kuchling¹⁰,
Susanna Asseyer^10,11,
Michael Scheel¹⁰,
Claudia Chien^10,11,
Hanna Zimmermann^10,11,
Seyedamirhosein Motamedi¹⁰,
Josef Kauer-Bonin¹⁰,
Alex Brandt¹⁰,
Julio Saez-Rodriguez⁷,
Leonidas G. Alexopoulos^12,13,
Friedemann Paul^10,11,
Hanne F. Harbo^5,6,
Hengameh Shams¹⁴,
Jorge Oksenberg¹⁴,
Antonio Uccelli^3,4,
Ricardo Baeza-Yates¹⁵ &
…
Pablo Villoslada ORCID: orcid.org/0000-0002-8735-6119^16,17

4599 Accesses
3 Citations
186 Altmetric
25 Mentions
Explore all metrics

Abstract

Background

Multiple sclerosis patients would benefit from machine learning algorithms that integrates clinical, imaging and multimodal biomarkers to define the risk of disease activity.

Methods

We have analysed a prospective multi-centric cohort of 322 MS patients and 98 healthy controls from four MS centres, collecting disability scales at baseline and 2 years later. Imaging data included brain MRI and optical coherence tomography, and omics included genotyping, cytomics and phosphoproteomic data from peripheral blood mononuclear cells. Predictors of clinical outcomes were searched using Random Forest algorithms. Assessment of the algorithm performance was conducted in an independent prospective cohort of 271 MS patients from a single centre.

Results

We found algorithms for predicting confirmed disability accumulation for the different scales, no evidence of disease activity (NEDA), onset of immunotherapy and the escalation from low- to high-efficacy therapy with intermediate to high-accuracy. This accuracy was achieved for most of the predictors using clinical data alone or in combination with imaging data. Still, in some cases, the addition of omics data slightly increased algorithm performance. Accuracies were comparable in both cohorts.

Conclusion

Combining clinical, imaging and omics data with machine learning helps identify MS patients at risk of disability worsening.

The role of machine learning in developing non-magnetic resonance imaging based biomarkers for multiple sclerosis: a systematic review

Article Open access 15 September 2022

Predicting multiple sclerosis disease progression and outcomes with machine learning and MRI-based biomarkers: a review

Article Open access 12 September 2024

Machine learning classifier to identify clinical and radiological features relevant to disability progression in multiple sclerosis

Article Open access 10 May 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Developing personalised health care for people with multiple sclerosis (MS) is hindered by our limited understanding of the biological processes underlying the disease, by the lack of validated prognostic or predictive biomarkers and by the clinical heterogeneity between patients [1,2,3,4]. At present, clinical decisions are taken based on outcomes identified in natural history cohort studies and randomised clinical trials, such as the disease subtype (relapsing vs. progressive course); age (above ~ 45 years old); the time to reach disability milestones like the expanded disability status scale (EDSS) 4.0 or 6.0; the Evidence of Disease Activity (EDA) [5]; lesion activity (presence of gadolinium-enhancing lesions) and lesion load (presence of new or enlarging T2 lesions and T2 lesion volume) [6]. Indeed, retinal atrophy monitored by optical coherence tomography (OCT) is able to predict the risk of disability worsening [7, 8]. Moreover, the use of disease-modifying drugs (DMDs) and, specifically, high-efficacy therapies, is also associated with a more severe disease course, not the least because they are currently restricted to patients with evidence of a highly active disease [9].

Amongst the biomarkers associated with MS, some have been shown to have a reliable predictive value of a more severe disease course, such as the presence of oligoclonal IgM bands [10, 11], the levels of neurofilaments light [12, 13] or chitinase-3 [14] in the cerebrospinal fluid (CSF) and serum. Although many omic-based biomarkers have been proposed, none has been validated to the level of becoming useful at the individual patient level [2]. Nevertheless, most of these approaches were based on group analysis, which limits their application to individual patients when personalised risk assessment is desired. Accordingly, defining the prognosis of individual patients with MS remains a significant unmet need when considering the application of personalised medicine [3, 15].

In this study, we set out to search for algorithms that stratify MS patients based on a differential risk of disease severity. As such, we combined clinical data with that obtained from neuroimaging and different omics techniques (genomics, cytomics and proteomics) to identify predictors of disease severity [13, 16, 17]. We took advantage of the machine learning tools that tolerate unbalance and overfitting such as random forest algorithms to search in a stepwise manner for the combinations of clinical, imaging and omics variables which identify predictors that are accurate when predicting each clinical outcome [18,19,20,21,22].

Materials and methods

Ethical statement

The Sys4MS project was approved by the Institutional Review Boards at each participating institution: Hospital Clinic of the University of Barcelona, IRCCS Ospedale Policlinico San Martino IRCCS, Oslo University Hospital, and Charité—Universitätsmedizin Berlin University. The Barcelona MS cohort study was approved by the Ethical Committee for Clinical Research of the Hospital Clinic Barcelona. Patients were invited to participate by their neurologists, and they provided signed informed consent prior to their enrolment in the study. De-identified data were collected in a REDCap database at the Barcelona centre. All methods were performed in accordance with the relevant guidelines and regulations.

Patients

The Sys4MS cohort [13, 23] was composed of 322 consecutive MS patients according to 2010 McDonald criteria [24] and 98 healthy controls (HC) at the four academic centres: Hospital Clinic, University of Barcelona, Spain (n = 93); Ospedale Policlinico San Martino, Genova, Italy (n = 110); Charité University, Berlin, Germany (n = 96); and the Oslo University Hospital, Oslo, Norway (n = 121). The inclusion criteria were being diagnosed with MS based on 2010 criteria, not having had a relapse in the previous 3 months and patients were required to be stable on the same DMD treatment over the preceding 6 months. RRMS patients were required to have < 10-year disease duration, whereas PMS patients were required to have EDSS 2.0–7.0. The exclusion criteria were use of corticosteroids in the last 30 days, a relapse in the previous 3 months, inability to perform brain MRI, chronic diseases (AIDS, hepatitis B or C, insulin-dependent diabetes, cardiovascular, renal, respiratory or liver insufficiency), pregnancy, breastfeeding or plans to conceive during the course of the study (women only) and participation in any other clinical therapeutic study at or within 30 days of screening visit. We collected clinical information [demographics, relapses, disability scales and use of disease-modifying drugs (DMD)], imaging data (brain MRI and OCT), and blood samples at the same visit. Patients were followed up for 2 years, and the same clinical, disability scales and imaging data (brain MRI and OCT) were collected at the 2-year follow-up visit.

The second cohort was recruited at the Hospital Clinic of Barcelona without overlap with the patients participating in the Sys4MS cohort. The cohort was composed of 271 patients with RRMS or SPMS according to 2010 McDonald criteria [24] and 54 HC without previous or present history of neurological or psychiatric condition. Patients were prospectively recruited at the MS Unit of the Hospital Clinic of Barcelona, as described recently [25].

Clinical variables

Each patient was assessed on the following disability scales: Expanded Disability Status Scale (EDSS); timed 25 feet walking test (T25WT), 9-hole peg test (9HPT), Symbol Digit Modality Test (SDMT), 2.5% low contrast visual acuity (SL25), and high contrast vision (HCVA, using best corrected acuity, EDTRS charts and logMar transformation) using the conditions indicated in the OCT section. Disability scales were obtained 3 months after any new relapses or use of corticosteroids during the follow-up. We calculated the MS Severity Score (MSSS) and the age-related MS Severity Score (ARMSS) as described elsewhere [26]. No Evidence of Disease activity (NEDA) was defined as no evidence of clinical relapses, new or enlarging T2 lesions and not changes on EDSS [27]. We collected the information regarding the patients’ DMD use, categorised as low-efficacy therapy: interferon-beta, glatiramer acetate and teriflunomide); mid- to high-efficacy therapy: fingolimod, dimethyl-fumarate, natalizumab; or other monoclonals like alemtuzumab, rituximab, daclizumab and ocrelizumab [28]. EDSS and the other clinical scales were confirmed at the end of follow-up based on the results of the 6-month previous clinical visit to define Confirmed Disability Accumulation (CDA). EDSS-based CDA was defined as an increase of one point on the EDSS (for EDSS at baseline between 0 and 5.5) or 0.5 points for patients with EDSS at baseline ≥ 5.5 confirmed at 6 months. For 9HPT, T25WT and SL25, CDA was defined as a 20% change in each score, whereas it was four points for the SDMT confirmed at 6 months [29].

Imaging

MRI studies were performed on a 3 T scanner at each centre as described before [17], using a standard operating procedure (SOP) to optimise the volumetric analysis. We used the three-dimensional (3D) structural T1-weighted voxel magnetization-prepared rapid gradient echo (T1-MPRAGE) protocol (voxel size: 0.9 × 0.9 × 0.9 mm³), with 3D T2-fluid-attenuated inversion recovery (T2-FLAIR) images using the same voxel size to quantify changes in brain volume. Briefly, T2-FLAIR images were registered to T1-MPRAGE scans by a trained technician to ease the manual segmentation of the lesions. Subsequently, lesion in-painting of T1-MPRAGE scans allowed the volume of the whole brain, grey matter and thalamus to be quantified using SIENAX. In addition, we used post-gadolinium T1 axial images (voxel size: 0.7 × 0.6 × 3.0 mm³) to quantify gadolinium-enhancing lesions (Gad +). Presence of contrast-enhancing lesions, T2 lesion volume, new or enlarging T2 lesions and volumetric analysis were done at the Berlin centre and by the same operator and were estimated using the lesion in-filled MPRAGE images by FSL SIENAX [30].

Retinal OCT scans were performed in eye-tracking mode by trained technicians under standard ambient light conditions (lighting level of 80–100-foot candles) and without pupillary dilatation, using the same Spectralis device in three centres or a Nidek RS-3000 in Oslo centre. Correction for spherical errors was adjusted prior to each measurement, and the technicians performing OCT scans were blind to the patient’s clinical history. The peri-papillary Retinal Nerve Fibre Layer thickness (pRNFL, μm) was measured with a 12-degree diameter ring scan automatically centred on the optic nerve head (100 ART, 1536 A scans per B scan). The macular scan protocol involved a 20 × 20-degree horizontal raster scan centred on the fovea, including 25 B scans (ART ≥ 9, 512 A scans per B scan). A single grader at Berlin centre at the reading centre in Berlin performed intra-retinal layer segmentation using Orion software^® (Voxeleron Inc, Berkeley, US) to quantify the macular ganglion cell plus inner plexiform layer (GCIPL) and the macular inner nuclear layer thicknesses (μm) in the 6 mm ring area as previously described [31]. All OCT scans fulfilled OSCAR-IB criteria [32] and APOSTEL guidelines [33]. Eyes with severe myopia, optic neuropathies or retina diseases were excluded for analysis. We included only eyes without previous optic neuritis (in case both eyes have no previous optic neuritis, the mean of both eyes was used). Scans with an insufficient signal-to-noise ratio, or when the retinal thickness algorithm failed were repeated, or the data were ultimately excluded.

Genotyping

Genotyping of the samples was performed by Finland Institute of Molecular Medicine Genomics (University of Helsinki, Finland) for the Sys4MS cohort and at the University of California, San Francisco for the Barcelona cohort, using the Illumina HumanOmniExpress-24 v1.2 array (713,599 genotypes from 396 samples). Single-nucleotide polymorphisms (SNPs) imputation was conducted against the 1000-genomes reference (quality of imputation r² > 0.5; 6,817,000 genotypes for 396 samples), which allowed us to extract MS-associated SNPs [152 out of 200 known non-HLA MS-associated SNPs available and 17 out of 31 known MS-associated HLA alleles available (HLA*IMP programme)] as described elsewhere [34]. The MS Genetic Burden Score (MSGB) is used as cumulative genetic risk estimations for MS patients. The MSGB for the HLA and non-HLA alleles and their combination were calculated as described previously [35]. Briefly, the MSGB is computed based on a weighted scoring algorithm using one SNP per MS-associated genomic region as found by trend-test association (meta-) analysis. This statistic is an extension of the log additive model, termed “Clinical Genetic Score”, with weights given to each SNP based on its effect size as reported in the literature. The MSGB is obtained by summing the number of independently associated MS risk alleles weighted by their beta coefficients, obtained from a large GWAS meta-analysis, at 177 (of 200) non-MHC (major histocompatibility complex) loci and 18 (of 32) MHC variants, which includes the HLA-DRB1*15:01-tagging single-nucleotide polymorphism (SNP) rs3135388 [36,37,38,39,40].

Cytomics

Cytomics was performed on fresh peripheral blood mononuclear cells (PBMCs) using 17 antibodies that covered 11 subpopulations of T, B and NK cells as described in detail elsewhere [23]. The following cell populations were studied: Effector cells: Th1 classic: CD3 + CD4 + CxCR3 + CCR6-CD161−; Th17: CD3 + CD4 + CxCR3 + CCR6-CD161 + CCR4 +; Th 1/17: CD3 + CD4 + CCR6-CD161 + CxCR3highCCR4low; Regulatory T cells: CD3 + CD4 +: T reg CD25 + CD127-, T naive CD45RA + CD25low; CD3 + CD8 +: T reg CD28− and T naive CD28-CD45RA +; B cells: B memory: CD19 + CD14-CD24 + CD38−; B mature: CD19 + CD14-CD24 + CD38low; B regulatory: CD19 + CD24highCD38high and NK cells: Effector: CD3-CD14-CD56dim: Regulatory: CD3-CD-CD56bright (reg).

Phosphoproteomics

The phosphorylation levels of 25 kinases participating in pathways associated with MS [41] (AKT1, AKTS1, CREB1, GSK3AB, HSPB1, IKBA, JUN, KS6B1, LCK, MK12, MK03/01, MK09, MP2K1, NRF2, P53, PGFRB, PTN11, RS6, SRC, STAT1, STAT3, STAT5, STAT6, TF65, WNK1) were assessed by xMAP assays in PBMCs and quantified as previously described [42].

Machine learning analysis

The search for predictors of clinical outcomes (see the list of outcomes on supplementary file) was performed through a machine learning analysis using Python and the Scikit Learn library (scikit-learn.org). The analysis included 100 features: clinical, demographics, disability scales, DMD use, MRI, OCT, MSGB, cytomics and phosphoproteomics (supplementary file). Initially, we calculated the Pearson correlation matrix for the different groups of variables (clinical, MRI, OCT, MSGB, cytomics and phosphoproteomics) to select the most informative features based on showing correlation >|0.6| to exclude co-linear variables. In this way, multidimensionality was balanced between the number of features and the number of samples, maintaining a ratio of 1:5 [43]. Besides, we explore further reducing dimensionality using principal component analysis on the features selected. The search of classifiers was done using Random Forest algorithms, considering they are better in handling unbalanced data, high dimensionality, multi-collinear features and have a lower risk of overfitting, which is a common problem in biomedical datasets [44], when studying complex disorders such as MS [20]. For a classification of the clinical endpoints, we calculated the entropy, defined as the measure of impurity, following the formula:

$${\text{Entropy}} = \sum \limits_j p_j \log_2 p_j$$

where $p_j$ is the probability of class j.

During training, several random forest parameters were automatically optimised based on the: (1) number of estimators (number of trees in a random forest): the best value among 10 equally spaced values between number_features/4 and number_features/2; (2) maximum depth (levels in the tree); (3) minimum number of samples required to split a node; and (4) minimum number of samples needed for each leaf node: the best value amongst [19, 44].

We conducted a feature selection process using the feature importance algorithm [44] for selecting the most informative variables and, in this way, increase accuracy, reduce overfitting and reduce training time. Feature importance was calculated as the decrease in node impurity (Gini index) weighted by the probability of reaching that node as defined in the following formula

$$\sum_{i = 1}^C {f_i \left( {1 - f_i } \right)}$$

where f_i is the frequency of label i at a node, and C is the number of unique labels.

The node probability was calculated by the number of samples that reach the node, divided by the total number of samples; therefore, the higher the value, the more critical the feature.

Unbalanced data were addressed by applying cost-sensitive learning, wherein classes were automatically weighted inversely proportional to how frequently they appear in the data [43]. Missing data were addressed as follows: (1) by removing features with more than 20% of missing data when studying the effect of features different from the previous ones; (2) by eliminating observations (patients) with missing data in features with more than 20% of missing data, when studying the effect of these features; and (3) for the remaining missing data, we build a regression model using the random forest for each variable with tenfold cross-validation with all data, and we used a random grid to search for hyper-parametrization. Regarding the dynamic programming problem of “curse of dimensionality”, we applied the rule that there should be at least five training examples for each dimension in the representation (the minimum for each category should be at least 5 cases). For these classification problems, we calculated balanced recall (sensitivity), precision (positive predictive value), and F1 (harmonic mean of recall and precision) measures. The area under the receiver operating curve (AUC) was calculated for the predictors with accuracy above 70%.

Results

The Sys4MS cohort

We recruited 322 consecutive MS patients (age 41 ± 10 years, 71% female), of which 271 (82%) had Relapsing–Remitting MS (RRMS), and 57 (18%) had Progressive MS (PMS; 28 had SPMS and 29 had PPMS), as well as 98 healthy controls matched by sex and age with the RRMS group (Table 1). The patients had a mean disease duration of 10 years, a median EDSS of 2.0 (range 0–8), and mean MSSS of 3.6. Regarding the use of therapies at baseline, 70% of patients were being treated with DMDs, 44% with low-efficacy therapies and 26% with high-efficacy therapies. Clinical and imaging (MRI and OCT) characteristics of the subjects at baseline are summarised in Table 1 and supplementary file S1.

Table 1 The Sys4MS cohort: clinical and imaging variables at baseline

Full size table

By the end of follow-up (mean follow-up 1.98 ± 0.94 years, n = 274), 2 RRMS cases had progressed to SPMS, 22 patients had started DMDs (Cladribine: 1; Fingolimod: 2; Glatiramer acetate: 4; Ocrelizumab: 9; Rituximab: 2; Teriflunomide: (4) and 17 had changed from low to high-efficacy therapies. The number of cases with confirmed disability accumulation (events) for each of the scales was as follows: EDSS: 52, T25WT: 30, 9HPT: 11, SDMT: 27 and SL25: 75; and 122 patients remained as NEDA. Table 1 summarises the frequency of each therapy and means disability scales at the follow-up visit.

Omics analysis

From the HumanOmniExpress-24 v1.2 array, we imputed 152 SNPs outside the HLA region associated with and17 HLA-class II alleles. The MSGB, only the HLA alleles genetic burden score (MSGB^HLA), and the non-HLA genetic burden score (MSGB^non-HLA) were calculated. As expected, the MSGB was significantly higher in the MS patients than in the HC group for the MSGB (MS = 4.23 and HC = 3.2; p = 3.4 × 10^–8); MSGB^HLA (MS = 1.57 and HC = 0.95; p = 1.6 × 10^–4); and MSGB^non-HLA (MS = 2.6 and HC = 2.2; p = 6.8 × 10^–5). Of the 322 patients and 98 HCs recruited, a flow cytometry analysis was carried out on the first 227 consecutive patients and 82 HCs, which did not differ from the overall cohort in the baseline characteristics. Results of the cytomics analysis in this cohort are described in detail elsewhere [23]. Briefly, significantly higher frequencies of Th17 cells in the RRMS population compared with HC and lower frequencies of B memory/B regulatory cells as well as higher percentages of B mature cells in patients with PMS compared with HCs were found. In addition, we observed higher percentages of B mature cells in patients with PMS compared with HCs. Fingolimod treatment induced a decrease in total CD4 + T cells and in B mature and B memory cells and increases in CD4 + and CD8 + T regulatory and B regulatory cells [23]. Finally, the phosphoproteomic analysis was carried out on the first 148 consecutive MS patients, which did not differ from the overall cohort in the baseline characteristics. Patients showed higher levels of phosphorylated IKBA, JUN, KSGB1, MK03, RS6, STAT3 and STAT6 in MS patients compared to controls. See supplementary file for aggregated results for each variable.

Predictors of disease activity

We searched for algorithms predicting clinical outcomes at follow-up, such as 6-month confirmed disability accumulation using the EDSS, T25WT, 9HPT, SDMT or SL25 scales, as well as maintaining the NEDA status or starting or changing DMDs, using random forests algorithms (see supplementary file for list of outcomes and features). We compared algorithm performance based on the use of clinical data alone, or by adding imaging, genetics, and the other omics information sequentially to learn how much the prediction improves by including additional tests. This stepwise approach was chosen for prioritising algorithms based on the accuracy but also the consequent burden for patients and health systems depending on the tests required (genetics was analysed separately form the other omics because the accessibility of genotyping at present). We found algorithms with AUC higher than 60% for most of the outcomes, and AUC above 80% for SL25 and NEDA status during follow-up (Table 2). We tested also the performance of using support vector machines without finding improvement on the accuracy of the classifiers (data not shown).

Table 2 Algorithm performance for predicting clinical outcomes at 2-year follow-up for the Sys4MS cohort

Full size table

At present, the gold standard for defining disease progression in patients with MS is by probing 3- or 6-month confirmed disability accumulation of the EDSS (EDSS-CDA) [29]. We found random forest algorithms predicting which MS patients would achieve EDSS-CDA by 6-month on the EDSS 2 years later with precision (positive predictive value): 71%, recall (sensitivity): 73%, and F1 (harmonic mean): 72% (Table 2 and Fig. 1a, b). The accuracy of the predictor did not increase from using clinical features alone to adding imaging, genetic or omics information (a representative decision tree of including clinical and imaging features is shown in Fig. 1c). The predictors always included disability scales, age or disease duration as the most informative features, followed by brain volume and T2 lesion load, whereas the only omics that contributed to predictors was phosphoproteomics, including the levels of phosphorylated MP2K1, a kinase of the MAPKinase pathway associated with MS [42].

Regarding the prediction of confirmed disability accumulation for other disability scales, the algorithm for predicting 9HPT using clinical and imaging features achieved a precision 90%, recall 93% and F1 92%, with disability scales, brain volume and T2 lesion volume (T2LV) being the most informative (Fig. 2a). The algorithm for predicting the T25WT achieved a precision 75%, recall 80% and F1 77%, by combining clinical, imaging and omics data, with several kinase phosphorylation levels (JUN, STAT6, MP2K1, AKT1, PTN1 and GSK3B), disease duration and disability scales being the top predictors (Fig. 2b). The algorithm for predicting SDMT also achieved a precision 83%, recall 87% and F1 85%, when using clinical and imaging variables, with disability scales, brain volume and T2LV being the most informative ones (Fig. 2c). Moreover, the algorithms for predicting the SL25 achieved a precision 82%, recall 82%, and F1 82%. In this case, the informative variables were visual acuity at baseline, retina thickness (pRNFL), and several kinase levels (Fig. 2d).

We also searched for algorithms predicting maintaining the NEDA status by the end of follow-up, obtaining a precision 76%, recall 76%, and F1 76%. The algorithm for predicting NEDA included as top features several disability scales, age, disease duration and disease subtype (Fig. 3a).

Predictors of the use of disease-modifying drugs

At present, therapeutic decisions are based on the label of approved drugs, but personalization of the therapeutic recommendation is a sought for goal. We analysed whether MS patients changing from the low- to high-efficacy therapies or who started DMD during the 2-year follow-up period. We found a random forest algorithm for the change from the low- to high-efficacy therapies using only clinical variables, with precision 89%, recall 94% and F1 91%, and such performance did not improve by adding imaging, genetics or other omics (Table 2). The most informative variables for the change to high-efficacy therapies were the treatment line at baseline, treatment duration, disease duration, time since last relapse, age and disability scales (Fig. 3b). Indeed, we searched for algorithms predicting the onset of new DMD for treatment-naïve MS patients with precision 70%, recall 71% and F1 71% by including only clinical data (Table 2). The most informative variables were treatment duration and high- versus low-efficacy therapy, disease duration, time since the last relapse, age and disability scales (Fig. 3c).

Assessment of algorithm accuracy in the Barcelona cohort

In order to test the accuracy of the predicting algorithms, we repeated the analysis in an independent prospective cohort from a single centre that includes clinical, imaging and genomic features similar to that of the Sys4MS cohort (cytomics and phosphoproteomics data were not available from this cohort). The cohort was composed of 271 patients with RRMS or SPMS and 54 HC (Table 3).

Table 3 The Barcelona cohort: clinical and imaging variables at baseline

Full size table

The analysis was conducted by training the random forest algorithm with the new data, considering that differences in the calculation of several clinical, imaging and MSGB variables would prevent the direct use of the trained algorithm. We found comparable accuracy for the confirmed EDSS worsening as well as for most of the outcomes (Table 4). For example, the change on EDSS, 9HPT, T25WT and SDMT CDA achieved similar AUC, but it was slightly higher (from 62 to 77%) for the EDSS-CDA and smaller for the SL25 CDA (from 81 to 67%) in the Barcelona cohort.

Table 4 Random forest algorithm performance for predicting clinical outcomes at 2-year follow-up

Full size table

Discussion

In this study, we searched for predictors of future disease activity in MS by combining longitudinal clinical and imaging, with omics information, and applying machine learning algorithms such as random forest. We were interested in identifying predictors for each of the outcomes, as well as establishing the contribution of each type of variable (clinical, imaging, omics) to the predictors to assess the feasibility of the algorithms in clinical practice. We found predictors with mid- to high-accuracy for several disability outcomes, such as confirmed disability progression on the EDSS, 9HPT, SDMT and SL25. The main variables contributing to such predictors were always disability scales at baseline, followed by brain or retina atrophy variables, and proteomics variables. Such level of accuracy was assessed in a second and independent cohort.

Recent studies have addressed the ability of brain MRI to predict the course of MS using deep learning, finding good accuracy for predicting clinical worsening [45]. Regarding the use of DMD as a surrogate marker of disease activity, we analysed the ability to predict the start of the DMD or the switch to high-efficacy therapies, two relevant milestones in MS care. It is well described that disease activity and age are strong predictors of response to therapy [46], but also differences in cell populations, such as B (CD19 + CD5 +) and CD8 (perforin +) T cells, are associated with a differential response to some therapies, such as INFB [47], natalizumab or fingolimod [23]. Indeed, the recently developed Individual Treatment Response (ITR) score for MS therapies also identified clinical disability, quality of life and some imaging outcomes as the main predictors of response to therapy [48]. Our machine learning study identified algorithms with high accuracy for predicting the escalation of therapy from the first-line to high-efficacy DMD.

An in-depth analysis of molecular changes by omics analysis offers the promise of providing a comprehensive picture of the pathways altered in complex diseases and consequently improve our prediction of the course of the disease [49, 50]. In the case of MS, other omics approaches have been tested for predicting disease prognosis or response to therapy including pharmacogenetics [51, 52], gene expression [53], proteomics [21, 53], metabolomics [54, 55] or phosphoproteomics [42] analysis aimed to interrogate signalling pathways driving tissue damage and clinical phenotype [2, 41]. By examining signalling pathways by phosphoproteomics and making use of systems biology modelling, it has been possible to identify signalling networks associated with the use of MS therapies at the individual patient level [56]. However, most of such approaches have not achieved very high accuracy and has not been validated to be of use in clinical practice [2]. For this reason, validation of the biomarkers identified so far, combined with prospective multicentric studies, will be required for generating the evidence to be applied in personalised medicine.

In this study, we have applied random forest algorithms for searching the combination of variables that better explain the outcome 2 years later because they better tolerate data unbalance and overfitting. Random forest allows developing algorithms for classification (dichotomous outcomes) or regression (continuous outcomes) by constructing decision trees, ranking variables by importance, and without overfitting the training set. For these reasons, they are being applied to omics and imaging classification problems [18, 19] and are the most commonly used in MS [20,21,22]. Other machine learning techniques can be applied to this type of datasets, such as neural networks, linear regression or least absolute shrinkage and selection operator (LASSO) regression methods, support vector machines or Bayesian networks, which may differ in their performance depending on the size of the dataset and quality of the data as well as on the type of prediction or clinical question [16, 57,58,59,60]. However, the main limitation, in addition to the sample size, is having variables sensitive to the outcome to be predicted [61]. Indeed, we tested support vector machines in this dataset without achieving higher accuracies compared to random forest algorithms. Informative variables are quite difficult to obtain in brain diseases because current assessments may not be sensitive to minor changes in the evolution of the illness, due to the lack of specificity for the biological substrate or lack of spatial and temporal resolution. Whilst machine learning can be effectively used to model well-defined systems, its application to complex diseases dictates a much more careful approach, including high-quality data, expert knowledge and significant customization to the specific medical question being addressed. Finally, differences between centres in terms of patient population, use of DMD or methods for collecting and calculating clinical or imaging variables are other sources of noise for this type of analysis, even if we made significant efforts to standardised data collection between centres.

Physicians would benefit for their natural Bayesian thinking by updating the prior probabilities (e.g. risk of progression or response to therapy based on clinical judgement) with the likelihood ratios (based in the sensitivity and specificity of the biomarkers) obtained from clinical monitoring, imaging or omics to improve their predictions (posterior probabilities) [62]. One formal application already available for MS patients management is the Bayesian Risk Estimate for MS (BREMS) [63], which updates the prior probabilities based on age and disability scales (EDSS) for predicting the MSSS and the conversion to SPMS [64]. Further refinement of these algorithms based on decision trees or Bayesian networks would help support the reasoning and decision-making process for the management of care for people with MS.

The main limitation of the study is the limited sample size considering the heterogeneity, noise and missing data for the machine learning approach. Although we collected a prospective multicentric cohort of more than 300 cases with a 2-year follow-up with a comprehensive assessment with clinical information, disability scales, quantitative imaging and omics information, the sample size was far from being big data, and a follow-up of 2 years is limited to identify enough events for the outcome variables. In addition, some patients dropped out, or some assessment was not completed, creating data gaps that impaired the algorithm performance. Our study did not include relevant CSF-based biomarkers such as IgM oligoclonal bands or chitinase because lack of CSF samples and to avoid requesting a lumbar tap as inclusion criteria to facilitate recruitment. More, spinal cord MRI were also not collected, missing the presence of spinal cord lesions as a predictor. Finally, due to the differences in how some features were calculated between both cohorts (e.g. different method for the imaging analysis and MSGB calculations), this prevented to validate the algorithm in the second cohort. Indeed, the study includes imaging biomarkers but not molecular biomarkers such as oligoclonal bands or neurofilaments that may have improved the algorithm performance. However, even with such limitations, we were able to identify algorithms with fair to good accuracy for predicting relevant clinical outcomes that can be of help to patients and clinicians for the management of their care. Another limitation is that not all currently available biomarkers were included in this analysis, such as the presence of IgG or IgM oligoclonal bands, neurofilaments light chain or chitinase-3 from CSF samples, which may have contributed to improving the accuracy of the prognosis algorithms.

In summary, we found that machine learning algorithms for predicting relevant clinical outcomes in the short term for MS patients achieve intermediate to good accuracy using data that is commonly collected at the outpatient clinic, such as disability scales or imaging. Although omics improved the accuracy slightly in some cases, at present, the information they provide is not worth the cost and efforts they will imply. Future studies with more informative biomarkers might improve the accuracy for predicting disease course.

Data availability

Sequence data have been deposited at the European Genome-phenome Archive (EGA), under accession number EGAS00001007145 (https://ega-archive.org/studies/EGAS00001007145). The code with the trained random forest algorithms is available at https://github.com/anafreire/sys4ms

References

Kotelnikova E, Kiani NA, Abad E et al (2017) Dynamics and heterogeneity of brain damage in multiple sclerosis. PLoS Comput Biol 13:e1005757
Article PubMed PubMed Central Google Scholar
Pulido-Valdeolivas I, Zubizarreta I, Martinez-Lapiscina E, Villoslada P (2017) Precision medicine for multiple sclerosis: an update of the available biomarkers and their use in therapeutic decision making. Expert Rev Precis Med Drug Dev 2:1–17
Article Google Scholar
Villoslada P (2021) Personalized medicine for multiple sclerosis: How to integrate neurofilament light chain levels in the decision? Mult Scler 2021:13524585211049552
Article PubMed Google Scholar
Pitt D, Lo CH, Gauthier SA et al (2022) Toward precision phenotyping of multiple sclerosis. Neurology(R) Neuroimmunol Neuroinflammat 2022:9
Google Scholar
Giovannoni G, Bermel R, Phillips T, Rudick R (2018) A brief history of NEDA. Multiple Sclerosis Related Disord 20:228–230
Article Google Scholar
Thompson AJ, Baranzini SE, Geurts J, Hemmer B, Ciccarelli O (2018) Multiple sclerosis. Lancet 391:1622–1636
Article PubMed Google Scholar
Martinez-Lapiscina E, Arnow S, Wilson J et al (2016) Retinal thickness measured by optical coherence tomography and risk of disability worsening in multiple sclerosis. Lancet Neurol 15:574–584
Article PubMed Google Scholar
Lin TY, Vitkova V, Asseyer S et al (2021) Increased serum neurofilament light and thin Ganglion cell-inner plexiform layer are additive risk factors for disease activity in early multiple sclerosis. Neurology(R) Neuroimmunol Neuroinflammat 2021:8
Google Scholar
University of California SFMSET, Cree BA, Gourraud PA et al (2016) Long-term evolution of multiple sclerosis disability in the treatment era. Ann Neurol 80:499–510
Article Google Scholar
Villar LM, Casanova B, Ouamara N et al (2014) Immunoglobulin M oligoclonal bands: biomarker of targetable inflammation in primary progressive multiple sclerosis. Ann Neurol 76:231–240
Article CAS PubMed Google Scholar
Huss A, Abdelhak A, Halbgebauer S et al (2018) Intrathecal immunoglobulin M production: a promising high-risk marker in clinically isolated syndrome patients. Ann Neurol 83:1032–1036
Article CAS PubMed Google Scholar
Kuhle J, Kropshofer H, Haering DA et al (2019) Blood neurofilament light chain as a biomarker of MS disease activity and treatment response. Neurology 92(10):e1007–e1015
Article CAS PubMed PubMed Central Google Scholar
Brune S, Hogestol EA, de Rodez Benavent SA et al (2022) Serum neurofilament light chain concentration predicts disease worsening in multiple sclerosis. Mult Scler 28:1859–1870
Article CAS PubMed PubMed Central Google Scholar
Canto E, Tintore M, Villar LM et al (2015) Chitinase 3-like 1: prognostic biomarker in clinically isolated syndromes. Brain 138:918–931
Article PubMed Google Scholar
Gafson A, Craner MJ, Matthews PM (2017) Personalised medicine for multiple sclerosis care. Mult Scler 23:362–369
Article PubMed Google Scholar
Pellegrini F, Copetti M, Sormani MP et al (2019) Predicting disability progression in multiple sclerosis: Insights from advanced statistical modeling. Mult Scler 2019:1352458519887343
Google Scholar
Rise HH, Brune S, Chien C et al (2022) Brain disconnectome mapping derived from white matter lesions and serum neurofilament light levels in multiple sclerosis: a longitudinal multicenter study. Neuroimage Clin 35:103099
Article PubMed PubMed Central Google Scholar
Touw WG, Bayjanov JR, Overmars L et al (2013) Data mining in the life sciences with random forest: A walk in the park or lost in the jungle? Brief Bioinform 14:315–326
Article PubMed Google Scholar
Sarica A, Cerasa A, Quattrone A (2017) Random forest algorithm for the classification of neuroimaging data in Alzheimer’s disease: a systematic review. Front Aging Neurosci 9:329
Article PubMed PubMed Central Google Scholar
Hossain MZ, Daskalaki E, Brustle A, Desborough J, Lueck CJ, Suominen H (2022) The role of machine learning in developing non-magnetic resonance imaging based biomarkers for multiple sclerosis: a systematic review. BMC Med Inform Decis Mak 22:242
Article PubMed PubMed Central Google Scholar
Kosa P, Barbour C, Varosanec M et al (2022) Molecular models of multiple sclerosis severity identify heterogeneity of pathogenic mechanisms. Nat Commun 13:7670
Article ADS CAS PubMed PubMed Central Google Scholar
Jokubaitis VG, Campagna MP, Ibrahim O et al (2022) Not all roads lead to the immune system: the genetic basis of multiple sclerosis severity. Brain 2022:1
Google Scholar
Cellerino M, Ivaldi F, Pardini M et al (2020) Impact of treatment on cellular immunophenotype in MS: a cross-sectional study. Neurol Neuroimmunol Neuroinflammat 7:e693
Article Google Scholar
Polman CH, Reingold SC, Banwell B et al (2011) Diagnostic criteria for multiple sclerosis: 2010 revisions to the McDonald criteria. Ann Neurol 69:292–302
Article PubMed PubMed Central Google Scholar
Solana E, Martinez-Heras E, Montal V et al (2021) Regional grey matter microstructural changes and volume loss according to disease duration in multiple sclerosis patients. Sci Rep 11:16805
Article ADS CAS PubMed PubMed Central Google Scholar
Manouchehrinia A, Westerlind H, Kingwell E et al (2017) Age related multiple sclerosis severity score: disability ranked by age. Mult Scler 23:1938–1946
Article PubMed PubMed Central Google Scholar
Giovannoni G, Turner B, Gnanapavan S, Offiah C, Schmierer K, Marta M (2015) Is it time to target no evident disease activity (NEDA) in multiple sclerosis? Multiple Sclerosis Related Disord 4:329–333
Article Google Scholar
Samjoo IA, Worthington E, Drudge C et al (2021) Efficacy classification of modern therapies in multiple sclerosis. J Comp Eff Res 10:495–507
Article PubMed Google Scholar
Goldman MD, LaRocca NG, Rudick RA et al (2019) Evaluation of multiple sclerosis disability outcome measures using pooled clinical trial data. Neurology 93:e1921–e1931
Article PubMed PubMed Central Google Scholar
Rasche L, Scheel M, Otte K et al (2018) MRI markers and functional performance in patients with CIS and MS: a cross-sectional study. Front Neurol 9:718
Article PubMed PubMed Central Google Scholar
Oertel FC, Havla J, Roca-Fernandez A et al (2018) Retinal ganglion cell loss in neuromyelitis optica: a longitudinal study. J Neurol Neurosurg Psychiatry 89:1259–1265
Article PubMed Google Scholar
Schippling S, Balk L, Costello F et al (2014) Quality control for retinal OCT in multiple sclerosis: validation of the OSCAR-IB criteria. Mult Scler 2014:1
Google Scholar
Aytulun A, Cruz-Herranz A, Aktas O et al (2021) APOSTEL 2.0 recommendations for reporting quantitative optical coherence tomography studies. Neurology 97:68–79
Article PubMed PubMed Central Google Scholar
International Multiple Sclerosis Genetics C, Beecham AH, Patsopoulos NA et al (2013) Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat Genet 45:1353–1360
Article Google Scholar
Harbo HF, Isobe N, Berg-Hansen P et al (2014) Oligoclonal bands and age at onset correlate with genetic risk score in multiple sclerosis. Mult Scler 20:660–668
Article PubMed Google Scholar
Gourraud PA, McElroy JP, Caillier SJ et al (2011) Aggregation of multiple sclerosis genetic risk variants in multiple and single case families. Ann Neurol 69:65–74
Article PubMed PubMed Central Google Scholar
Isobe N, Keshavan A, Gourraud PA et al (2016) Association of HLA genetic risk burden with disease phenotypes in multiple sclerosis. JAMA Neurol 73:795–802
Article PubMed PubMed Central Google Scholar
Shams H, Shao X, Santaniello A et al (2022) Polygenic risk score association with multiple sclerosis susceptibility and phenotype in Europeans. Brain 2022:1
Google Scholar
Jia X, Madireddy L, Caillier S et al (2018) Genome sequencing uncovers phenocopies in primary progressive multiple sclerosis. Ann Neurol 84:51–63
Article CAS PubMed PubMed Central Google Scholar
International Multiple Sclerosis Genetics C (2019) Multiple sclerosis genomic map implicates peripheral immune cells and microglia in susceptibility. Science 2019:365
Google Scholar
Kotelnikova E, Bernardo-Faura M, Silberberg G et al (2015) Signaling networks in MS: a systems-based approach to developing new pharmacological therapies. Mult Scler 21:138–146
Article PubMed Google Scholar
Kotelnikova E, Kiani NA, Messinis D et al (2019) MAPK pathway and B cells overactivation in multiple sclerosis revealed by phosphoproteomics and genomic analysis. Proc Natl Acad Sci USA 116:9671–9676
Article ADS CAS PubMed PubMed Central Google Scholar
Koutroumbas K, Theodoridis S (2009) Pattern recognition. Elsevier, London
Google Scholar
Breiman L (2001) Random forests. Mach Learn 45:5–32
Article Google Scholar
Storelli L, Azzimonti M, Gueye M et al (2022) A deep learning approach to predicting disease progression in multiple sclerosis using magnetic resonance imaging. Invest Radiol 57:423–432
Article PubMed Google Scholar
Kalincik T, Manouchehrinia A, Sobisek L et al (2017) Towards personalized therapy for multiple sclerosis: prediction of individual treatment response. Brain 140:2426–2443
Article PubMed Google Scholar
Villarrubia N, Rodriguez-Martin E, Alari-Pahissa E et al (2019) Multi-centre validation of a flow cytometry method to identify optimal responders to interferon-beta in multiple sclerosis. Clin Chim Acta 488:135–142
Article CAS PubMed Google Scholar
Pellegrini F, Copetti M, Bovis F et al (2019) A proof-of-concept application of a novel scoring approach for personalized medicine in multiple sclerosis. Mult Scler 2019:1352458519849513
Google Scholar
Price ND, Magis AT, Earls JC et al (2017) A wellness study of 108 individuals using personal, dense, dynamic data clouds. Nat Biotechnol 35:747–756
Article CAS PubMed PubMed Central Google Scholar
Chen R, Mias GI, Li-Pook-Than J et al (2012) Personal omics profiling reveals dynamic molecular and medical phenotypes. Cell 148:1293–1307
Article CAS PubMed PubMed Central Google Scholar
Pappas DJ, Oksenberg JR (2010) Multiple sclerosis pharmacogenomics: maximizing efficacy of therapy. Neurology 74(Suppl 1):S62–S69
CAS PubMed Google Scholar
Grossman I, Knappertz V, Laifenfeld D et al (2017) Pharmacogenomics strategies to optimize treatments for multiple sclerosis: Insights from clinical research. Prog Neurobiol 152:114–130
Article CAS PubMed Google Scholar
Paul A, Comabella M, Gandhi R (2019) Biomarkers in multiple sclerosis. Cold Spring Harb Perspect Med 9:a029058
Article CAS PubMed PubMed Central Google Scholar
Bhargava P, Calabresi PA (2016) Metabolomics in multiple sclerosis. Mult Scler 22:451–460
Article CAS PubMed Google Scholar
Villoslada P, Alonso C, Agirrezabal I et al (2017) Metabolomic signatures associated with disease severity in multiple sclerosis. Neurol(R) Neuroimmunol Neuroinflammat 4:e321
Article Google Scholar
Bernardo-Faura M, Rinas M, Wirbel J et al (2021) Prediction of combination therapies based on topological modeling of the immune signaling network in multiple sclerosis. Genome Med 13:117
Article CAS PubMed PubMed Central Google Scholar
Sargent DJ (2001) Comparison of artificial neural networks with other statistical approaches: results from medical data sets. Cancer 91:1636–1642
Article CAS PubMed Google Scholar
Bose G, Healy BC, Lokhande HA et al (2022) Early predictors of clinical and MRI outcomes using LASSO in multiple sclerosis. Ann Neurol 2022:1
Google Scholar
Eshaghi A, Young AL, Wijeratne PA et al (2021) Identifying multiple sclerosis subtypes using unsupervised machine learning and MRI data. Nat Commun 12:2078
Article ADS CAS PubMed PubMed Central Google Scholar
Zhao Y, Wang T, Bove R et al (2020) Ensemble learning predicts multiple sclerosis disease course in the SUMMIT study. NPJ Digit Med 3:135
Article PubMed PubMed Central Google Scholar
Ngiam KY, Khor IW (2019) Big data and machine learning algorithms for health-care delivery. Lancet Oncol 20:e262–e273
Article PubMed Google Scholar
Gill CJ, Sabin L, Schmid CH (2005) Why clinicians are natural bayesians. BMJ 330:1080–1083
Article PubMed PubMed Central Google Scholar
Bergamaschi R, Berzuini C, Romani A, Cosi V (2001) Predicting secondary progression in relapsing-remitting multiple sclerosis: a Bayesian analysis. J Neurol Sci 189:13–21
Article CAS PubMed Google Scholar
Bergamaschi R, Montomoli C, Mallucci G et al (2015) BREMSO: a simple score to predict early the natural course of multiple sclerosis. Eur J Neurol 22:981–989
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank Mark Sefton for his assistance in project management and manuscript editing. This work was supported by the European Commission Sys4MS project of the Eracosysmed programme (Grant ID-43) and the Instituto de Salud Carlos III, Madrid, Spain (AC1500052).

Funding

Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. This work was supported by the European Commission (ERACOSYSMED ERA-Net programme, Sys4MS project, id:43), Instituto de Salud Carlos III, Spain (AC1500052); the Italian Ministry of Health (WFR-PER-2013-02361136), the German Ministry of Science (Deutsches Teilprojekt B “Förderkennzeichen: 031L0083B) and the Norwegian Research Council (project 257955).

Author information

Ana Freire
Present address: UPF Barcelona School of Management, Balmes 132, 08008, Barcelona, Spain
Magi Andorra and Ana Freire have contributed equally as first authors.

Authors and Affiliations

Institut d’Investigacions Biomediques August Pi Sunyer (IDIBAPS) and Hospital Clinic Barcelona, Barcelona, Spain
Magi Andorra, Irati Zubizarreta, Gemma Vila, Irene Pulido-Valdeolivas, Elena H. Martinez-Lapiscina, Sara Llufriu, Albert Saiz, Yolanda Blanco, Eloy Martinez-Heras & Elisabeth Solana
School of Management, Pompeu Fabra University, Barcelona, Spain
Ana Freire
Department of Neurosciences, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health, University of Genoa, Genoa, Italy
Nicole Kerlero de Rosbo, Maria Cellerino, Matteo Pardini & Antonio Uccelli
IRCCS Ospedale Policlinico San Martino, Genoa, Italy
Nicole Kerlero de Rosbo, Matteo Pardini & Antonio Uccelli
University of Oslo, Oslo, Norway
Steffan D. Bos, Einar A. Høgestøl, Sigrid A. de Rodez Benavent, Synne Brune-Ingebretse & Hanne F. Harbo
Oslo University Hospital, Oslo, Norway
Steffan D. Bos, Einar A. Høgestøl, Sigrid A. de Rodez Benavent, Tone Berge, Synne Brune-Ingebretse & Hanne F. Harbo
Institute for Computational Biomedicine, Heidelberg University Hospital, and Heidelberg University, Heidelberg, Germany
Melanie Rinas & Julio Saez-Rodriguez
Oslo Metropolitan University, Oslo, Norway
Tone Berge
Department of Internal Medicine, University of Genoa, Genoa, Italy
Federico Ivaldi
Charité Universitaetsmedizin Berlin, Berlin, Germany
Priscilla Bäcker-Koduah, Janina Behrens, Joseph Kuchling, Susanna Asseyer, Michael Scheel, Claudia Chien, Hanna Zimmermann, Seyedamirhosein Motamedi, Josef Kauer-Bonin, Alex Brandt & Friedemann Paul
Max Delbrueck Center for Molecular Medicine, Berlin, Germany
Susanna Asseyer, Claudia Chien, Hanna Zimmermann & Friedemann Paul
ProtATonce Ltd, Athens, Greece
Leonidas G. Alexopoulos
School of Mechanical Engineering, National Technical University of Athens, Zografou, Greece
Leonidas G. Alexopoulos
Department of Neurology, University of California, San Francisco, USA
Hengameh Shams & Jorge Oksenberg
School of Engineering, Pompeu Fabra University, Barcelona, Spain
Ricardo Baeza-Yates
Department of Medicine and Life Sciences, Pompeu Fabra University, Barcelona, Spain
Pablo Villoslada
Hospital del Mar Research Institute, Barcelona, Spain
Pablo Villoslada

Authors

Magi Andorra
View author publications
You can also search for this author in PubMed Google Scholar
Ana Freire
View author publications
You can also search for this author in PubMed Google Scholar
Irati Zubizarreta
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Kerlero de Rosbo
View author publications
You can also search for this author in PubMed Google Scholar
Steffan D. Bos
View author publications
You can also search for this author in PubMed Google Scholar
Melanie Rinas
View author publications
You can also search for this author in PubMed Google Scholar
Einar A. Høgestøl
View author publications
You can also search for this author in PubMed Google Scholar
Sigrid A. de Rodez Benavent
View author publications
You can also search for this author in PubMed Google Scholar
Tone Berge
View author publications
You can also search for this author in PubMed Google Scholar
Synne Brune-Ingebretse
View author publications
You can also search for this author in PubMed Google Scholar
Federico Ivaldi
View author publications
You can also search for this author in PubMed Google Scholar
Maria Cellerino
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Pardini
View author publications
You can also search for this author in PubMed Google Scholar
Gemma Vila
View author publications
You can also search for this author in PubMed Google Scholar
Irene Pulido-Valdeolivas
View author publications
You can also search for this author in PubMed Google Scholar
Elena H. Martinez-Lapiscina
View author publications
You can also search for this author in PubMed Google Scholar
Sara Llufriu
View author publications
You can also search for this author in PubMed Google Scholar
Albert Saiz
View author publications
You can also search for this author in PubMed Google Scholar
Yolanda Blanco
View author publications
You can also search for this author in PubMed Google Scholar
Eloy Martinez-Heras
View author publications
You can also search for this author in PubMed Google Scholar
Elisabeth Solana
View author publications
You can also search for this author in PubMed Google Scholar
Priscilla Bäcker-Koduah
View author publications
You can also search for this author in PubMed Google Scholar
Janina Behrens
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Kuchling
View author publications
You can also search for this author in PubMed Google Scholar
Susanna Asseyer
View author publications
You can also search for this author in PubMed Google Scholar
Michael Scheel
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Chien
View author publications
You can also search for this author in PubMed Google Scholar
Hanna Zimmermann
View author publications
You can also search for this author in PubMed Google Scholar
Seyedamirhosein Motamedi
View author publications
You can also search for this author in PubMed Google Scholar
Josef Kauer-Bonin
View author publications
You can also search for this author in PubMed Google Scholar
Alex Brandt
View author publications
You can also search for this author in PubMed Google Scholar
Julio Saez-Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Leonidas G. Alexopoulos
View author publications
You can also search for this author in PubMed Google Scholar
Friedemann Paul
View author publications
You can also search for this author in PubMed Google Scholar
Hanne F. Harbo
View author publications
You can also search for this author in PubMed Google Scholar
Hengameh Shams
View author publications
You can also search for this author in PubMed Google Scholar
Jorge Oksenberg
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Uccelli
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Baeza-Yates
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Villoslada
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MA: Analysed the data and drafted the manuscript for intellectual content. AF: Designed the ML algorithms and analysed the data. IZ: Major role in the acquisition of data, analysed the data; drafted the manuscript for intellectual content. NKDR: Major role in the acquisition of data, design and conceptualised study; analysed the data; drafted the manuscript for intellectual content. SDB: Major role in the acquisition of data, analysed the data. MR: Major role in the acquisition of data, analysed the data. EAH: Major role in the acquisition of data. SADRB: Major role in the acquisition of data. TB: Major role in the acquisition of data, analysed the data. SB-I: Major role in the acquisition of data. FI: Major role in the acquisition of data. MC: Major role in the acquisition of data. MP: Major role in the acquisition of data. GV: Major role in the acquisition of data. IP-V: Major role in the acquisition of data. EHM-L: Interpreted the data; revised the manuscript for intellectual content. SL: Major role in the acquisition of data. YB: Major role in the acquisition of data. EM-H: Major role in the acquisition of data. ES: Major role in the acquisition of data. PB-K: Major role in the acquisition of data. JB: Major role in the acquisition of data. JK: Major role in the acquisition of data. SA: Major role in the acquisition of data. MS: Major role in the acquisition of data. CC: Major role in the acquisition of data. HZ: Major role in the acquisition of data. SM: Major role in the acquisition of data. JK-B: Major role in the acquisition of data. AB: Design and conceptualised study; analysed the data; drafted the manuscript for intellectual content. JS-R: Design and conceptualised study; analysed the data; drafted the manuscript for intellectual content. LGA: Design and conceptualised study; analysed the data; drafted the manuscript for intellectual content. FP: Design and conceptualised study; analysed the data; drafted the manuscript for intellectual content. HFH: Design and conceptualised study; organised acquisition of the Oslo data; analysed the data; drafted the manuscript for intellectual content. HS: analysed the data. JO: analysed the data; drafted the manuscript for intellectual content. AU: Design and conceptualised study; analysed the data; drafted the manuscript for intellectual content. RB-Y: Designed the ML algorithms and analysed the data. PV: Design and conceptualised study; analysed the data; drafted the manuscript for intellectual content.

Corresponding author

Correspondence to Pablo Villoslada.

Ethics declarations

Conflicts of interest

Magi Andorra is an employee of Hoffman-La Roche AG. Yet this article is related to his activity at the Hospital Clinic of Barcelona. Ana Freire reports no disclosures. Irati Zubizarreta received reimbursement from Genzyme, Biogen, Merck, and Bayer-Schering. Irene Pulido-Valdeolivas is currently an employee of UCB pharma. Yet this article is related to her activity at the Hospital Clinic of Barcelona. She has received travel reimbursement from Roche Spain and Genzyme-Sanofi, European Academy of Neurology, and European Committee for Treatment and Research in Multiple Sclerosis for international and national meetings over the last 3 years; she holds a patent for an affordable eye-tracking system to measure eye movement in neurologic diseases, and she holds stock in Aura Innovative Robotics. Elena H Martinez-Lapiscina is an employee of the European Medicines Agency (Human Medicines) since 16 April 2019. Yet this article is related to her activity at the Hospital Clinic of Barcelona and consequently. It does not in any way represent the views of the Agency or its Committees. Sara Llufriu received compensation for consulting services and speaker honoraria from Biogen Idec, Novartis, TEVA, Genzyme, Sanofi, and Merck. Albert Saiz received compensation for consulting services and speaker honoraria from Bayer-Schering, Merck-Serono, Biogen-Idec, Sanofi-Aventis, TEVA, Novartis, and Roche. Eloy Martinez-Heras reports no disclosures. Elisabeth Solana received travel reimbursement from Sanofi and ECTRIMS and reports personal fees from Roche Spain. Melanie Rinas reports no disclosures. Julio Saez-Rodriguez reports no disclosures. Steffan Bos reports no disclosures. Maria Cellerino reports no disclosures. Federico Ivaldi reports no disclosures. Matteo Pardini received research support from Novartis and Nutricia and honoraria from Merk and Novartis. Gemma Vila reports no disclosures. Sigrid A. de Rodez Benavent reports no disclosures. Synne Brune Ingebetsen has received honoraria for lecturing from Biogen and Novartis. Priscilla Bäcker-Koduah is funded by the DFG Excellence grant to FP (DFG exc 257) and is a Junior scholar of the Einstein Foundation. Tone Berge has received unrestricted research grants from Biogen and Sanofi-Genzyme. Einar Høgestøl received honoraria for lecturing and advisory board activity from Biogen, MS-union, Merck, and Sanofi-Genzyme and unrestricted research grant from Merck. Friedemann Paul received honoraria and research support from Alexion, Bayer, Biogen, Chugai, Merck Serono, Novartis, Genzyme, MedImmune, Shire, Teva, and serves on scientific advisory boards for Alexion, MedImmune, and Novartis. He has received funding from Deutsche Forschungsgemeinschaft (DFG Exc 257), Bundesministerium für Bildung und Forschung (Competence Network Multiple Sclerosis), Guthy Jackson Charitable Foundation, EU Framework Program 7, National Multiple Sclerosis Society of the USA. Hanne F. Harbo reports no disclosures. Nicole Kerlero de Rosbo reports no disclosures. Claudia Chien received honoraria for speaking from Bayer and research funding from Novartis, unrelated to this study. Susanna Asseyer received a conference grant from Celgene and honoraria for speaking from Alexion, Bayer, and Roche. Janina Behrens reports no disclosures. Alex Brandt has a patent pending for Perceptive visual computing-based postural control analysis, Multiple sclerosis biomarker, Perceptive sleep motion analysis, and Fovea morphometry; consulted for Motognosis; is on the executive board of IMSVISUAL; received research support from Novartis, Biogen, BMWi, BMBF, and the Guthy-Jackson Charitable Foundation; and holds stock or stock options in Motognosis. Leonidas G Alexopoulos is founder and hold stocks at ProtATonce. Antonio Uccelli received grants and contracts from FISM, Novartis, Biogen, Merck, Fondazione Cariplo, Italian Ministry of Health, received honoraria, or consultation fees from Biogen, Roche, Teva, Merck, Genzyme, Novartis. Ricardo Baeza-Yates reports no disclosures. Pablo Villoslada has received consultancy fees and hold stocks from Accure Therapeutics SL, Attune Neurosciences Inc, Spiral Therapeutics Inc, QMenta Inc, CLight Inc, NeuroPrex Inc, Oculis SA and Adhera Health Inc. Other authors do not have competing interests.

Statistical analysis

The machine learning analysis was conducted by Magi Andorra, Ana Freire, and Ricardo Baeza-Yates, and supervised by Pablo Villoslada.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 60 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Andorra, M., Freire, A., Zubizarreta, I. et al. Predicting disease severity in multiple sclerosis using multimodal data and machine learning. J Neurol 271, 1133–1149 (2024). https://doi.org/10.1007/s00415-023-12132-z

Download citation

Received: 24 June 2023
Revised: 28 October 2023
Accepted: 22 November 2023
Published: 22 December 2023
Issue Date: March 2024
DOI: https://doi.org/10.1007/s00415-023-12132-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Predicting disease severity in multiple sclerosis using multimodal data and machine learning

Abstract

Background

Methods

Results

Conclusion

Similar content being viewed by others

The role of machine learning in developing non-magnetic resonance imaging based biomarkers for multiple sclerosis: a systematic review

Predicting multiple sclerosis disease progression and outcomes with machine learning and MRI-based biomarkers: a review

Machine learning classifier to identify clinical and radiological features relevant to disability progression in multiple sclerosis

Introduction

Materials and methods

Ethical statement

Patients

Clinical variables

Imaging

Genotyping

Cytomics

Phosphoproteomics

Machine learning analysis

Results

The Sys4MS cohort

Omics analysis

Predictors of disease activity

Predictors of the use of disease-modifying drugs

Assessment of algorithm accuracy in the Barcelona cohort

Discussion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of interest

Statistical analysis

Supplementary Information

Supplementary file1 (XLSX 60 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation