Introduction

Cognitive disorders are common in patients with multiple sclerosis (MS) and occur in all MS subtypes with a prevalence of between 40% and 70% [1]. They are not only most frequent in progressive forms but are also present at the early stage of the disease [2, 3]. However, their pathophysiology is poorly understood. The cognitive domains most frequently altered include attention, speed of information processing, and working memory [4]. The path to diagnose cognitive disorders is often complicated and requires the use of a battery of tests, including at least 1 h of cognitive assessment [5, 6]. Moreover, these batteries can only be implemented by experienced neuropsychologists and are not easy to use in daily neurological practice.

Many magnetic resonance imaging (MRI) parameters are known to be correlated with cognitive impairment in MS, especially global brain atrophy (in particular the width of the third ventricle [V3]) or the basal ganglia volume [7, 8]. The importance of the involvement of the cortical grey matter volume was more recently demonstrated [9]. Specific brain areas were more atrophied in cognitively impaired patients with relapsing–remitting (RR) MS, especially the thalami, insula, the prefrontal cortex, and regions in the temporo-occipital lobes [10, 11]. In modern connectivity-based approaches, researchers in diffusion-tensor imaging have demonstrated that white matter thalamocortical connections, which constitute the executive system of working memory, were structurally altered in patients with MS [12]. More recently, it has been shown that functional connectivity changes in the thalamocortical network occurred in patients with MS and correlated with cognitive performance [13]. Contradictory data concerned the link between cognitive impairment and T1 and T2 lesion loads [14, 15].

This study was designed to establish and report correlations between neuropsychological and conventional MRI data. The aim was to improve early detection of cognitive impairment in MS with a simple test based on clinical examination and brain MRI, directly usable during a physician consultation.

Methods

Compliance with Ethics Guidelines

The study was approved by the Nice Ethics Committee. All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2000 and 2008. Informed consent was obtained from all patients and controls for being included in the study.

Subjects

We prospectively enrolled patients with RRMS followed at the tertiary MS center in Nice, France. Patients with a history of psychiatric disorders, major depression, or substance abuse were excluded. Demographic data were recorded, including disease duration, first symptoms, and disease-modifying treatment. The level of education and laterality were noted. Disability assessment was based on validated scales: Multiple Sclerosis Functional Composite (MSFC), Expanded Disability Status Scale (EDSS), and Multiple Sclerosis Severity Scale (MSSS). Fatigue was assessed with the validated French fatigue scale “Echelle de Mesure de l’Impact de la Fatigue sur la SEP” (EMIF-SEP) [16].

A group of 20 control patients with no history of neurological or psychiatric diseases was also recruited. Each control patient was matched in age, gender, laterality, and level of education with a subgroup of patients with MS.

Cognitive Assessment

All patients and controls performed the Paced Auditory Serial Addition Test (PASAT) 3, which was used as a screening test for cognitive assessment [17]. It is a complex test of executive function, working memory, information processing speed, and divided attention. Results were expressed as z scores to normalize data by age, gender, and level of education by comparison with a standard population [18]. Low performance at this test was defined as a z score less than two standard deviations (SD) below the normalized PASAT mean.

Brain MRI

All subjects completed a 1.5 Tesla MRI according to the same protocol. T1-weighted (T1w), T2-weighted spin echo, fluid-attenuated inversion recovery (FLAIR), and T1 with gadolinium sequences were systematically performed.

SepINRIA software was used to quantify the T2, T1, and gadolinium lesion loads, and grey and white matter volumes [19]. Global atrophy was indirectly measured on FLAIR sequences by the third ventricle (V3) width, lateral ventricles width, bicaudate ratio, and brain diameter, as described in previous studies [7, 20, 21].

Measures were done manually by two independent evaluators using OsiriX software (Geneva, Switzerland) [22]. Each measure was controlled twice by each evaluator and the average value was conserved for statistical analysis.

To quantify the change in regional volumes, we created a template T1w-MR image by averaging the 3D T1w-MR images from the control group using the methodology presented by Guimond [23]. This average anatomical model of the human brain was then fused with a digital brain atlas from the Montreal Neurological Institute to identify each structure together and with the International Consortium for Brain Mapping (ICBM) atlas to identify white matter tracts. Each patient’s MRI was then compared with the average brain model using Jacobian integration to quantify regional volumetric changes. Analysis was conducted using anatomical or functional segmentation (Broadmann areas). As some studies have demonstrated the structures involved in the execution of the PASAT, we specifically focused on the variation in volume of these structures [24].

Statistical Analysis

Statistical analysis was performed using the XLSTAT® software (Addinsoft, France). For the univariate analysis, the Chi square test was used for categorical data. The Student’s t test was used for quantitative data. When the distribution was not normal (analysis of variance), the Mann–Whitney test was used. When samples were too small the Fisher’s exact test was used for comparison. The level of significance was set at P < 0.05. Threshold V3 and composite score threshold EDSS + V3 width (HV3 score, chapter results), predictive of low PASAT performance, was calculated using the receiver operating characteristic (ROC) curve and analyzed at the Youden plot. The Pearson’s correlation test was performed for continuous variables to determine the presence of a correlation between cognitive performance and clinical and imaging data.

Results

Demographic and Cognitive Data

Seventy-five patients with RRMS were enrolled, 57 female and 18 male. All patients were treated with beta interferon. The patients characteristics were mean (±SD) age 36.1 ± 9.2 years, mean (±SD) level of education 12.8 ± 2.4 years, mean (±SD) disease duration 6.9 ± 7.3 years, median (range) EDSS 1.5 (0–5.5), mean (±SD) standardized PASAT 41.65 ± 12.3, and mean EMIF-SEP 30.5 ± 21.2.

Low performance on the PASAT was detected in 16/75 (21.3%) of patients with MS. The mean (±SD) PASAT score in the MS group with a low PASAT performance was 22.17 ± 6.27 against 47.48 ± 7.92 in the MS group with normal PASAT performance.

The demographic, clinical, and imaging data of both groups were compared to identify specific parameters associated with a low PASAT performance. There was no difference in age, disease duration, treatment duration or type of treatment, gender, or level of education. EDSS was significantly higher in the group with low PASAT performance, with a median (range) of 2.0 (1–5.5) against 1.5 (0–5) in the group with normal performance (P = 0.019). There was no difference in the analysis of the MSSS. There was no difference between the groups for the annualized relapse rate. Patients with MS who demonstrated low performance in the PASAT showed no increase in fatigue. In particular, there was no difference in the cognitive dimension assessed with the EMIF-SEP. All results are presented in Tables 1 and 2.

Table 1 Demographic characteristics of patients with MS and controls
Table 2 Demographic characteristics of patients with multiple sclerosis with low PASAT performance compared to normal performers

Brain MRI Analysis

MS Patients vs. Controls

Patients with MS showed a significant enlargement of the V3 width compared with controls (P = 0.0001). Similarly, brain width was significantly reduced in the MS group (P = 0.0003). There was no difference between the two groups regarding the bicaudate ratio and the width of the lateral ventricles (Tables 3, 4).

Table 3 Indirect measurement of global atrophy: comparison between the MS group and control group
Table 4 Indirect measurement of global atrophy: comparison between the cognitively impaired patients with MS and the unimpaired patients with MS

Impaired vs. Unimpaired Patients with MS

There was no significant difference in the T1, T2, and gadolinium lesion loads, and white matter volume and grey matter volume between the both MS groups.

The width of the V3 was significantly increased in the group with low PASAT performance (P = 0.044). There was no difference in the width of the lateral ventricles, the brain diameter, and bicaudate ratio between the two MS groups. The cutoff score of the V3 width predictive of cognitive impairment was over 4.13 mm with a sensitivity (95% CI) of 67.9% (55.6; 80.1) and a specificity (95% CI) of 68.8% (46; 91.5). The positive predictive value (PPV) was 88.4%.

Correlation Analysis

A correlation analysis between the PASAT score and the available clinical and imaging data was performed. Significant correlation between the PASAT and V3 width (r = −0.271, P = 0.021), EDSS (r = −0.260, P = 0.028), and 9 Hole Peg Test score (r = −0.294, P = 0.012) was demonstrated.

The HV3 Score (Global Handicap Measured by the V3)

The data were used to define a composite clinical and radiological score easily done in consultation. We statistically analyzed several combinations and selected the one with better correlation with the PASAT score. By adding the EDSS and the V3 width, we obtained significant correlation with the PASAT (r = −0.325, P = 0.006; Fig. 1). This combination of EDSS and V3 width (in mm) was called HV3 (global handicap measured by the V3). The HV3 score correlated better with PASAT than the EDSS or V3 width separately (r = −0.325, P = 0.006 against r = −0.271 for V3 width and r = −0.260 for EDSS).

Fig. 1
figure 1

The HV3 score. HV3 is obtained by summing the V3 width (in mm; technique of determining V3 width detailed by Benedict et al. [7]) with the EDSS. EDSS Expanded Disability Status Scale, V3 third ventricle

At Youden plot (HV3 >5.5), HV3 showed the best properties to detect low PASAT performance (sensitivity 66.1%, specificity 81.3%, PPV 92.5%, negative predictive value 40.6%, accuracy 70.1%; Fig. 2).

Fig. 2
figure 2

Cutoff HV3 score predictive of low PASAT performance. A ROC curve was used to determine the cutoff HV3 score predicting a cognitive impairment. A score of over 5.5 was associated with low PASAT performance with a sensitivity (95% CI) of 66.1% (53.7; 78.5) and a specificity (95% CI) of 81.3% (62.1; 100). PASAT Paced Auditory Serial Addition Test, ROC receiver operating characteristic

The MS group with low PASAT performance had a significantly higher mean HV3 score (6.81 ± 2.34) than the group without impairment (5.3 ± 2.36; P = 0.027). Conversely, for 81% of patients with MS with cognitive impairment, the HV3 score was over 5.5.

Local Atrophy

Compared with patients with MS without cognitive impairment, right-handed patients with low PASAT performance had a more pronounced atrophy of specific brain structures: left supplementary motor area (SMA; P = 0.03), inferior parietal lobules (P < 0.036), cingulate gyri (P < 0.039), and right thalamus (P = 0.027).

A correlation was found between the PASAT score and the local volume change of the left SMA (r = 0.295, P = 0.027), the right parietal lobe (r = 0.277, P = 0.039), left and right lower parietal lobules (r = 0.291, P = 0.03 and r = 0.332, P = 0.012, respectively).

Discussion

Many clinical and MRI data are associated with cognition assessment. Although the concept of cognitive impairment is defined as an alteration in at least two cognitive domains, we have chosen, in our study, to use the PASAT as a screening test for cognitive impairment. This test is generally considered a sensitive measure of cognitive dysfunction patients with MS who have mild disability [1]. It has demonstrated its ability to evaluate several cognitive domains simultaneously and is widely used because of its incorporation in the MSFC score.

As with previous studies, our study demonstrated that the V3 width was of interest as both a marker of global atrophy and cognitive decline [7, 25]. Cognitive impairment in patients with MS may be related, at least in part, to thalamic atrophy [26]. The existence of connections between the thalamus and many brain structures involved in limbic functions and, in particular, the neocortex supports this hypothesis. Enlargement of V3 is explained by the fact that the thalami are part of the lateral wall of this structure. Our study objectified atrophy of the right thalamus in patients with MS who had a low PASAT performance. The thalamus is involved in the working memory network and is activated in functional MRI (fMRI) during the PASAT [12, 20]. Moreover, Houtchens et al. [26] showed that the EDSS was the clinical scale that correlated with thalamic atrophy. These data demonstrate that the width of the V3 was closely linked to cognitive impairment as well as disability. We also demonstrated that the left SMA, lower parietal lobules, and the cingulate gyri were more atrophied in the population with a pathological PASAT score (P < 0.05). These structures have been described to be activated in fMRI when performing the PASAT. Frontal lobe atrophy has been demonstrated several times in MS and is associated with cognitive impairment [11, 27, 28]. For other structures (lower parietal lobules and cingulate gyri) our study is the only one, to our knowledge, which demonstrated a link between atrophy of these structures and cognitive impairment. Audoin et al. [29] showed early atrophy of the cingulate gyri in clinically isolated syndrome but their study did not take into account cognitive impairment. Unlike other authors [10, 30], we did not find any significant volume variation in the callosum or in temporal lobes between patients with MS with and without cognitive impairment.

In the clinical practice, the width of the V3 is a structure that is workable and easily measurable by a physician using an axial FLAIR or 3D T1w sequence on brain MRI. The purpose of HV3 score is not to replace the cognitive assessment but to better identify patients requiring a full cognitive assessment. It overcomes bias linked to the evaluator, the test execution conditions, the patient’s fatigue, or the retest effect, and can be done at each visit. However, with a sensitivity score of 66.1% (negative predictive value 40.6%), while HV3 proves effective in diagnosing cognitive impairment in most patients, it is not enough to suspect all of them. Moreover, this score was defined for patients with an EDSS less than 5.5. It therefore remains to be seen whether it can be useful for patients with a more severe disability. It should be interesting to design the same study using the Symbol Digit Modalities Test or its computerized version [3133].

Conclusion

We have established a HV3 composite score from the EDSS and the V3 width measured by MRI. From the beginning of the disease, even in patients with low T2 lesion load, deep grey matter atrophy can be detected indicating that a degenerative process is already underway. Knowing that thalamic atrophy correlates with disease progression, the use of this easy and reliable tool can be helpful for detecting progression in patients with MS, treated or not, even if they seems to be clinically asymptomatic. In these patients, the challenge is to precociously begin therapy to reduce clinical worsening. This score is an additional and complementary tool to suspect easily cognitive impairment in MS. It cannot replace a full cognitive assessment but can help to better identify patients with MS who have a risk of cognitive impairment.