Background

Theory of mind (ToM) refers to the capacity to understand someone else’s emotions and mental state. ToM can be separated into two systems: the affective and cognitive components. The affective ToM is involved in drawing inferences about other people’s emotions and feelings; the cognitive ToM is involved in drawing inferences about other people’s beliefs and intentions [1]. Researchers have developed experimental tests to assess this capacity. The main tests used in the literature are the first- and second-order false belief tasks, the Faux Pas Recognition (FPR) test, the Reading the Mind in the Eyes (RME) test and Ekman’s Facial Emotion Recognition test [2]. Neuroimaging and lesion studies have tried to establish the neural correlates of ToM. In their review of the literature, Carrington and Bailey [3] found 40 functional imaging studies about ToM. Prefrontal regions, especially the medial prefrontal and orbitofrontal cortices, were associated with ToM in 93 % of studies. The anterior cingulate cortex, superior temporal sulcus (STS) and temporoparietal junction (TPJ) were involved in 50–60 % of studies. Poletti et al. [4] proposed a model for ToM processing with common brain areas (precuneus, TPJ and STS) for both the affective and cognitive components of ToM, as well as for specific areas for each component. Indeed, the ventromedial prefrontal region would be preferentially involved in affective ToM [5], while cognitive ToM would be supported by the dorsolateral prefrontal cortex [6].

The first publications about ToM reported studies on patients with autism [7] or schizophrenia [8]. In the 2000s, researchers began to assess ToM in neurological diseases. One of the first of these studies was by Gregory et al. [9], who showed ToM impairments in patients with frontotemporal lobe degeneration. These results were confirmed in other studies (for review, see [10]) that revealed a substantial, early and global deficit of both components of ToM in this disease. However, this is not the case in other neurodegenerative diseases, such as Alzheimer’s disease (AD) and Parkinson’s disease (PD). In AD, patients are relatively preserved from ToM impairment at the beginning of the disease, with subtle difficulties in high-demand cognitive tests [9, 11, 12]. In PD, it has been reported that patients develop ToM impairments as the disease progresses, particularly involving the cognitive component [1315]. Studies on ToM have been carried out in many other neurological diseases, including Huntington’s disease [16], amyotrophic lateral sclerosis [17], corticobasal degeneration [18], semantic dementia [19], progressive supranuclear palsy [20] and multiple sclerosis [21], but no studies have assessed ToM in dementia with Lewy bodies (DLB). DLB is the second most common cause of neurodegenerative dementia after AD in people aged over 65 years. DLB is characterized by intra-neuronal inclusions of α-synuclein, called Lewy bodies. In series of demented patients, 15–25 % of patients had pathological lesions of DLB [22]. Patients with DLB sometimes have social behavioural disorders, even in the first stage of the disease [23]. We hypothesised that these disorders are caused by ToM impairments. The only publication evoking ToM in DLB is Modinos et al.’s study [24]. They assessed ToM in five patients with different types of dementia: mild cognitive impairment, AD, frontotemporal dementia, vascular dementia and DLB. Their patient with DLB performed worse than the patient with AD and the one with mild cognitive impairment but better than the patient with frontotemporal dementia. This result is in line with our hypothesis, but a prospective study was needed to confirm this case report.

The aim of our study was to assess affective and cognitive ToM in patients with DLB in the early stage of the disease, and to search for the neural correlates of these deficits. We hypothesised first that patients with DLB would have cognitive and affective ToM impairments and second that any such cognitive deficit would be correlated predominantly to atrophy in the prefrontal cortex and the TPJ.

Methods

Selection of participants

We conducted a prospective, single-centre, multi-site study between June 2012 and January 2015 with patients with probable DLB (DLB group) or probable AD (AD group), in all cases in the early stage of the disease, and with healthy elderly control subjects (HC group). Patients were included by six neurologists and geriatricians, experts in dementia, at the Memory Resource and Research Centre of the University Hospital of Strasbourg in France. Healthy elderly subjects were recruited by means of an advertisement for volunteers in a local newspaper. The study was approved by the local ethics committee (Comité de Protection des Personnes Est IV, Strasbourg, France), and all participants gave their written informed consent.

Inclusion criteria for patients with DLB and patients with AD were as follows: age 45 years or older, presence of probable DLB as defined by McKeith’s criteria revised in 2005 [25] and the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition [26], or probable AD as defined by Dubois’s criteria [27] in the early stage of the disease, as defined in both cases by impairment according to results of at least two neuropsychological tests and a Mini Mental State Examination (MMSE) score higher than 20. Patients were excluded in the case of diagnostic doubt, association of AD and DLB, other central neurological or psychiatric disease, visual or hearing disorders or difficulty in speaking French.

Inclusion criteria for healthy subjects were age between 55 and 90 years, absence of neurological or psychiatric history, a normal neurological examination and absence of impairment on neuropsychological tests (excluding ToM tests).

First, the three groups were examined by clinicians with expertise in dementia in order to perform a complete anamnesis and medical examination. Using the Unified Parkinson’s Disease Rating Scale III score [28], akinesia, rigidity and tremor at rest were rated from 0 (no symptoms) to 4 (serious impairment). Fluctuations were assessed with the Mayo Clinic Fluctuation Scale [29] and the Clinician Assessment of Fluctuation [30]. The medical check-up was completed with cerebral magnetic resonance imaging (MRI) for the three groups and lumbar puncture with AD biomarkers (tau, phosphorylated tau and amyloid-β42) for the DLB and AD groups. DaTscan (GE Healthcare/Medi-Physics, Arlington Heights, IL, USA), a radiopharmaceutical agent used in single-photon emission computed tomography (SPECT) for striatal dopamine transporter visualization of the brain, was employed if the extrapyramidal syndrome was unclear.

Second, cognitive functions were evaluated using the following tests: MMSE [31] for general cognitive function; the French version of the Free and Cued Selective Reminding Test (FCSRT) [32, 33], the 48-item Delayed Matching to Sample (DMS48) test [34] and the digit span test for memory function; the 80-item Dénomination Orale (DO80; an oral naming test) [35] for language; the Frontal Assessment Battery (FAB) [36], the Trail Making Test (TMT) A and B [37], the Digit Symbol Substitution Test (DSST) [38] and ‘formal and semantic lexical evocation’ [39] for executive function; the praxis set of Mahieux [40] and the Rey-Osterrieth complex figure test [41] for praxis; and number localisation and cube analysis of the Visual Object and Space Perception (VOSP) battery [42] for visuoperceptive functions.

ToM tasks

ToM was assessed by have participants perform three tasks: the FPR test, the RME test and Ekman’s Facial Emotion Recognition test.

The FPR test [43] is frequently used to assess cognitive and affective components of ToM. We used the shortened version, in which ten stories are read to participants: five containing a social faux pas (faux pas stories) and five control stories without a social faux pas (control stories). For each of the faux pas stories, participants were asked the following questions (faux pas questions): (1) ‘Did anyone say something he or she shouldn’t have said’? (2) ‘Who said something he or she shouldn’t have said’? (3) ‘Why shouldn’t he or she have said it’? (4) ‘Why do you think he or she said it’? (5) ‘Did X know that Y…’? (6) ‘How did X feel’? One point was given for each faux pas question (i.e., a total possible score of 30 points for the 5 faux pas stories). Two points were given for correct rejection of control stories (i.e., a total possible score of ten points for the five control stories). For each story (faux pas and control stories), two control questions were asked to check that participants understood the details of the story. One point was given for each control question (i.e., a total possible score of 20 points for the 10 stories). In our results, the scores for the following questions were also considered separately: questions 3 and 4, assessing the participant’s comprehension of the faux pas; question 5, assessing the participant’s comprehension of the character’s intention; and question 6, assessing the participant’s capacity for attributing emotions to the victim of the faux pas. This last question is used to evaluate affective ToM, and the other questions are used to evaluate cognitive ToM.

The French version of the RME test [44] is used to assess affective ToM on the basis of comprehension of complex mental states. Thirty-six photographs of the eye region of the faces of actors and actresses were presented to participants. Participants were required to choose which of four adjectives best described the mental state. A glossary with all of the proposed adjectives could be used during the assessment.

Ekman’s Facial Emotion Recognition test [45] is also used to assess affective ToM, but with a lower level of difficulty than the RME test. Thirty-five photographs of faces were presented to participants. The participants were asked to choose which of the following emotions corresponded to the facial expression: ‘happiness’ , ‘fear’ , ‘anger’ , ‘disgust’ , ‘surprise’ , ‘sadness’ or ‘neutral’.

Finally, we calculated the global score for the mini-Social cognition and Emotional Assessment (mini-SEA) battery [46] using the FPR test score and Ekman’s Facial Emotion Recognition test score.

For each test, we compared the patients’ results with standardized scores and classified them into successful or failed groups. We used the standardized scores of the mini-SEA [46] for the FPR test and Ekman’s Facial Emotion Recognition test, and Baron-Cohen’s standardised scores [44] were used for the RME test. The score was considered pathological for a z-score less than −1.6.

Scanning protocol

Twenty-four patients with DLB, nine patients with AD and fifteen healthy control subjects underwent cerebral MRI in a 3-T MAGNETOM Verio scanner (Siemens Healthcare, Erlangen, Germany). T1-weighted spin echo sequences were collected using the following parameters: repetition time = 1900 ms, echo time = 2.53 ms, inversion time = 900 ms, field of view = 192 mm and slice thickness = 1 mm. The voxel size was 1 mm × 1 mm × 1 mm.

Data processing

For image processing, we used SPM12 software (Statistical Parametric Mapping; Wellcome Trust Centre for Neuroimaging, London, UK; http://www.fil.ion.ucl.ac.uk/spm) running on MATLAB R2010a (MathWorks, Natick, MA, USA). Images of each patient were spatially normalised to the Montreal Neurological Institute space. T1-weighted images were segmented into grey matter, white matter, cerebrospinal fluid (CSF), bone, fat and air. Grey matter images were modulated and smoothed with a Gaussian kernel of 8 mm.

Voxel-based morphometry

Atrophy of grey matter was analysed using voxel-based morphometry (VBM) in SPM12 in an exploratory whole-brain analysis. First, two-sample one-tailed t tests between the HC group and the DLB group and between the HC group and the AD group were performed in order to confirm that brain atrophy in the DLB group and the AD group was congruent with previous reports in the literature. Second, we performed an analysis in the DLB group to search for neural correlates of ToM impairment. For this purpose, we performed a one-sample, one-tailed t test to investigate the correlation between each ToM score and grey matter volume, and a two-sample, one-tailed t test to compare patients with DLB who were impaired with those who were non-impaired for each ToM test. Age and global grey matter volume were considered as nuisance covariates for all of the analyses. Statistical maps were thresholded at P < 0.001, uncorrected, and with a minimum cluster size of 25 voxels. Statistical maps were analysed with xjView (http://www.alivelearn.net/xjview8/), which enabled us to identify the brain regions that were associated with the detected clusters.

Statistical analysis

IBM SPSS version 21.0 software (IBM, Armonk, NY, USA) was used for further statistical evaluation as required. Where appropriate, differences in demographic and clinical data were assessed using parametric (analysis of variance, t tests) and non-parametric tests (Kruskal-Wallis test) with post hoc analysis. For categorical measures, χ2 tests were applied. Correlations between clinical data, neuropsychological tests and ToM tests were assessed using Spearman’s test. For each test statistic, P < 0.05 was regarded as significant.

Results

Population characteristics

Ninety-six participants were recruited to perform ToM tasks. Twenty-eight patients were excluded for the following reasons: too severe stage of the disease (n = 9), presence of another neurological or psychiatric disease (n = 10), possible DLB or AD but without criteria for probable DLB or AD (n = 5), association of DLB and AD (n = 1), subjective memory complaint without objective impairments on neurological tests (n = 2) and insufficient knowledge of French (n = 1). Four healthy subjects were excluded because of psychiatric disorder (n = 1), incomplete file (n = 1), presence of hallucinations (n = 1) and presence of parkinsonism (n = 1). Ultimately, 64 participants were included in the study: 33 patients in the DLB group, 15 patients in the AD group and 16 elderly healthy subjects in the HC group.

There were no significant differences between the DLB, AD and HC groups for age, sex, handedness or level of education (all P > 0.05). The three groups’ demographic and clinical characteristics are presented in Table 1. There were no significant differences between the DLB and AD groups for disease duration, mean MMSE score or mean instrumental activities of daily living (IADL) score. Eighty-two percent of patients with DLB and 80 % of patients with AD had a lumbar puncture. AD biomarkers were always negative in the patients with DLB and always positive in the patients with AD. Fourteen patients with DLB underwent DatScan SPECT, and dopaminergic denervation was found in 64 % of cases.

Table 1 Clinical and demographic features of patients with dementia with Lewy bodies, patients with Alzheimer’s disease and healthy elderly control subjects

The results of the general neuropsychological tests are presented in Table 2. Patients with DLB were in a stage of mild cognitive impairment, but they performed poorly compared with the HC group in the majority of tasks. The mean MMSE scores were 27.2 for patients with DLB, 27 for patients with AD and 29.3 for HC. Patients with DLB had significantly more memory impairments than HC (FCSRT and DMS48) and more difficulty with executive, speed processing, praxis and attentional tasks (TMT A and B, DSST, digit span and lexical evocation). The DLB and AD groups performed similarly on all these tests except the FCSRT, on which patients with AD performed worse than patients with DLB, and also abstract praxis, on which the DLB group performed worse than the AD group.

Table 2 Neuropsychological features of patients with dementia with Lewy bodies, patients with Alzheimer’s disease and healthy elderly control subjects

ToM tasks

Table 3 displays data for ToM tasks.

Table 3 Theory of mind tests of patients with dementia with Lewy bodies, patients with Alzheimer’s disease and healthy elderly control subjects

Ekman’s Facial Emotion Recognition test

We found no statistical difference between the three groups for Ekman’s Facial Emotion Recognition test. A similar percentage of subjects failed this test in the two groups of patients (28.6 % in the DLB group versus 23.1 % in the AD group and 12.5 % in HC subjects; P = 0.66). Mean scores were similar between the three groups (27.1 in the DLB group versus 26.6 in the AD group and 28.6 in the HC group; P = 0.303). All the subjects had difficulty in identifying fear, anger and sadness, in the same proportions (P > 0.05). The only difference between the three groups concerned the recognition of surprise (4.1 in the DLB group and the AD group versus 4.9 in the HC group, P = 0.001).

Reading the Mind in the Eyes test

We found a significant difference between patients with DLB and the HC group for the mean score on this test (20.6 in the DLB group versus 23.9 in the HC group; P = 0.015). Forty-eight percent of patients with DLB failed this test versus 14 % of the HC group. Thirty-three percent of patients with AD failed this test, but there were no significant differences between the AD group and the HC group or between the AD group and the DLB group.

Faux Pas Recognition test

According to the mini-SEA standardized score for the FPR test, 42 % of patients with DLB and 43 % of patients with AD failed this test versus 12 % of HC (P = 0.071). Patients with DLB performed significantly worse than HC for faux pas stories (P = 0.033). There were no significant differences between the AD group and the HC group or between the AD group and the DLB group. No significant difference was observed between the three groups for the number of correct hits (P = 0.271), question 3 (P = 0.415) and question 4 (P = 0.213), intentionality (P = 0.271), or emotional attribution (P = 0.064). Patients with DLB performed worse than the other groups for this last question, but the difference was not significant. Patients with DLB and patients with AD correctly rejected the control stories. The comprehension of stories was preserved, with high scores for control questions in the three groups (P = 0.053).

Mini-Social cognition and Emotional Assessment score

A significant difference was found between the DLB group and the HC group for the global mini-SEA score (P = 0.008) (Table 3). This score is based on Ekman’s Facial Emotion Recognition test score and the composite score of the FPR test. Patients with AD had a lower score than HC, but the difference was not significant. There was no significant difference between the DLB group and the AD group.

We found a significant correlation between education level and scores on Ekman’s Facial Emotion Recognition test, the RME test and the FPR test (all P < 0.05). The score on the RME test was correlated with executive function (FAB, TMT B and semantic lexical evocation), the Rey-Osterrieth complex figure test score and the IADL score. We did not find any correlations between the RME test, the FPR test, the mini-SEA score and either age or disease duration. Ekman’s Facial Emotion Recognition test and FPR test scores were not correlated with any of the neuropsychological tests (Table 4).

Table 4 Spearman’s correlations between clinical and neuropsychological data and performances on the theory of mind tasks

VBM analysis

Comparison of atrophy between patients with DLB, patients with AD and HC group

VBM analysis revealed some regions of atrophy in the DLB group, compared with HC, in the frontal lobe (medial, middle and inferior frontal gyri; anterior cingulate cortex; and orbital gyrus), the temporal lobe (bilateral superior and left inferior temporal gyri) and the bilateral insula (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Fig. 1 and Additional files 1 and 2).

Fig. 1
figure 1

Voxel-based morphometric analysis of global atrophy in the dementia with Lewy bodies group compared with the healthy control group. Comparison between the dementia with Lewy bodies group and the healthy control group shows relative grey matter atrophy in frontal, temporal and insular regions (P < 0.001, uncorrected, minimum cluster size of 25 voxels)

Patients with AD were particularly atrophied in the bilateral medial temporal lobe (fusiform and parahippocampal gyri, hippocampus) compared with HC (P < 0.05, family-wise error–corrected, minimum cluster size of 25 voxels). They also presented more atrophy than HC in the frontal lobe (superior, middle and medial frontal gyri and cingulate gyrus), the right superior temporal gyrus and the right insula (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Fig. 2 and Additional files 1 and 2).

Fig. 2
figure 2

Voxel-based morphometric analysis of global atrophy in the Alzheimer’s disease group compared with the healthy control group. Comparison between the Alzheimer’s disease group and the healthy control group shows relative grey matter atrophy in medial temporal lobes and frontal regions (P < 0.001, uncorrected, minimum cluster size of 25 voxels)

Compared with the DLB group, the AD group was more atrophied in the left middle frontal gyrus, the bilateral parietal lobe (superior and inferior parietal gyrus, precuneus), the right cuneus and the left lingual gyrus, and especially in the bilateral medial temporal lobe (fusiform gyrus, amygdala, hippocampus and parahippocampal gyrus). Compared with the AD group, the DLB group was more atrophied in the left cingulate gyrus and the right middle frontal gyrus (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Additional files 1 and 2).

Correlation analysis for Ekman’s Facial Emotion Recognition test

In the DLB group, impairment on Ekman’s Facial Emotion Recognition test was correlated with atrophy in the right middle frontal gyrus (Brodmann’s area [BA] 10) and the left precuneus (BA 31) (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Table 5 and Fig. 3).

Table 5 Voxel-based morphometric analysis for correlations between grey matter volume and theory of mind tasks in the dementia with Lewy bodies group, and group analysis between impaired and non-impaired patients with dementia with Lewy bodies
Fig. 3
figure 3

Voxel-based morphometric analysis for correlations between grey matter volume and theory of mind tasks in the dementia with Lewy bodies (DLB) group, and group analysis between impaired and non-impaired patients with DLB. a Ekman’s Facial Emotion Recognition test: correlations between scores and grey matter atrophy in the DLB group. b Faux Pas Recognition test: correlations between scores and grey matter atrophy in the DLB group (yellow) and relative atrophy in impaired patients with DLB compared with non-impaired patients (red). c Reading the Mind in the Eyes test: correlations between scores and grey matter atrophy in the DLB group (yellow) and relative atrophy in impaired patients with DLB compared with non-impaired patients (red). P < 0.001, uncorrected, minimum cluster size of 25 voxels

Owing to the limited number of MRI images for impaired patients with DLB for this test (n = 4), no significant results were found in the group analysis comparing impaired and non-impaired patients in the DLB group.

Correlation analysis and group analysis for the RME test

Impairment on the RME test in the DLB group was correlated with diminished volume of the right middle (BA 9) and inferior frontal (BA 10) gyri (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Table 5 and Fig. 3).

Group analysis comparing impaired and non-impaired patients with DLB revealed atrophy in impaired patients in the bilateral superior and middle frontal gyri (BA 6, 9, 10, 11 and 46), the left orbital gyrus (BA 11), the right inferior parietal lobule (BA 40) and the right fusiform gyrus (BA 19) (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Table 5 and Fig. 3).

Correlation analysis and group analysis for the FPR test

Impairment on the FPR test in the DLB group was correlated with atrophy in the right medial frontal gyrus (BA 10), the right cingulate gyrus (BA 32), the right supramarginal gyrus (BA 40), the right precuneus (BA 7) and the left insula (BA 13) (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Table 5 and Fig. 3). Group analysis comparing impaired and non-impaired patients with DLB showed diminished volume in impaired patients in frontal regions (superior, medial and inferior frontal gyri), the left orbital gyrus and the left insula (P < 0.001, uncorrected, minimum cluster size of 25 voxels) (Table 5 and Fig. 3).

Discussion

ToM impairments in dementia with Lewy bodies

This study is the first to assess ToM performance in patients with DLB. The results confirm the hypothesis of ToM impairments in patients with DLB, with deficits on the RME test and on the FPR test. The patients with DLB in our study were mildly impaired, some with normal activity of daily living, with a mean MMSE score of 27 and relatively preserved performance on other neuropsychological tasks. Thus, ToM seems to be impaired at the early stage of the DLB.

Patients with DLB had preservation of low-demand affective ToM tasks such as Ekman’s Facial Emotion Recognition test but were impaired for high-demand affective ToM tasks such as the RME test. In Ekman’s Facial Emotion Recognition test, patients and HC were similarly impaired in recognizing negative emotions such as fear, anger, disgust and sadness. In their meta-analysis, Ruffman et al. [47] showed that older healthy adults performed worse than young adults in identifying these negative emotions. Their finding may explain why all three groups in our study were impaired for the recognition of negative emotions. On the RME test, nearly 50 % of our patients with DLB failed versus only 15 % of HC; therefore, the deficit was not due to ageing, and DLB does indeed seem to cause impairments of high-demand affective ToM. The difference in score between the AD group and the HC group was not significant, but we were not able to show a significant difference between AD and patients with DLB.

Patients with DLB were also impaired for cognitive ToM tasks such as the FPR test. Forty-two percent of the patients with DLB failed on this test. They performed worse than the HC group on faux pas stories, which reflect the detection and explanation of faux pas. The score of the AD group was better than that of the DLB group but lower than that of the HC group, though neither of these differences reached significance. The most marked deficiency in the DLB group, compared with the HC and AD groups, was on the question about emotional attribution, which assesses the affective component of ToM. Impairments on this test were not caused by an inability to understand the stories as all the subjects performed well on the control questions.

Thus, both affective and cognitive components were impaired in DLB, patients having difficulty in interpreting the intentions of others, recognising complex feelings and imagining another’s feelings in a story. The distinction between these two components of ToM is interesting because different pathways would be implicated for each of them [5, 6, 48, 49]. In their review, Poletti et al. [4] proposed that each neurodegenerative disease would lead to a specific pattern of ToM deficits depending on its neuropathological course and these different patterns could help physicians to diagnose and distinguish between these diseases.

The majority of studies that assessed ToM in AD found impairments on high-cognitive-demanding ToM tasks whereas low-cognitive-demanding ToM tasks were preserved [9, 11, 5054]. As we observed in our study, Gregory et al. [9] and Narme et al. [55] found lower scores in patients with AD than in healthy control subjects for the FPR test, but without significant difference. For the RME test, the data are more heterogeneous. Gregory et al. [9] found that the performance of patients with AD was preserved for this test, but Castelli et al. [53] and Laisney et al. [54] found the opposite. In these latter two studies, patients with AD were older and had lower mean scores on the MMSE than those in the study by Gregory et al. [9], which might explain the different performance on the RME test. In our study, patients with AD were more similar to the patients with AD in Gregory et al.’s study. Overall, the ToM capacities of patients with AD in the early stage of the disease seem to be preserved, because they seem to experience difficulties only with high-demand ToM tasks. In DLB, ToM deficits seem to be more global and occur earlier than those reported in AD, but we did not find any significant differences between these two groups.

The relationship between ToM and executive function is a matter of debate. In their review of the literature, Aboulafia-Brakha et al. [56] showed a congruent association between ToM and executive function in 64 % of studies and incongruent associations in 29 % of studies. In this review, both FPR and RME tests were strongly correlated with executive function. In our study, we found some correlations between executive function tests and the RME test but not with Ekman’s Facial Emotion Recognition test or the FPR test. In the DLB group, ToM and other cognitive abilities (executive and visuoperceptive functions) seemed to be relatively independent. Even if ToM can be linked with executive function in some circumstances, it seems to be a distinct cognitive function and its exploration would be useful in clinical practice. However, education level plays a role in success on ToM tasks.

The social impact of ToM impairments needs to be investigated further. Few studies have been done on the clinical implications of ToM. However, Gregory et al. [9] found a correlation between ToM impairments and the Neuropsychiatric Inventory score, which reflects behavioural disorders. Deficits of ToM in DLB could explain social interaction disorders such as loss of empathy, egocentricity or aggressiveness.

Neural correlates of ToM impairments

Compared with HC, patients with DLB were atrophied in the frontal and lateral temporal lobes and in the bilateral insula. These results are consistent with previous data [5759], particularly for the insula. Indeed, in their meta-analysis, Zhong et al. [59] showed that the insula is a core region for atrophy in DLB and could have a role in autonomic dysfunction. As expected, the patients with AD in our study were more atrophied in the medial temporal lobes than were the HC and patients with DLB. These patterns of atrophy demonstrate the homogeneity of each group.

VBM analysis showed correlations between ToM tasks and cerebral atrophy in different cerebral regions. Prefrontal cortex atrophy was correlated with impairments on the three ToM tests; specific cerebral regions were then found for each test. The prefrontal cortex is known for its role in executive function, such as attention, planning, decision-making and working memory [60]. This region is classically associated with ToM [7, 46, 6163]. In their review, Carrington and Bailey [3] reported an involvement of the medial prefrontal cortex and orbitofrontal cortex in 93 % of studies and an involvement of the lateral prefrontal cortex in 35 % of studies. According to lesion and neuroimaging studies, the medial prefrontal cortex is implicated in affective ToM [5] while the lateral prefrontal cortex is implicated in cognitive ToM [6]. In our study, RME impairments were particularly correlated with atrophy in the orbitofrontal cortex. This region has often been reported to be involved in affective ToM [6466]. Its role in regulation of emotions is known, and patients with lesions in the orbitofrontal cortex have social behaviour impairments. Hynes et al. [64] stated that the orbitofrontal cortex has a role in the automatic response to a social stimulus, without explicit judgement about this stimulus, and allows us to adapt our behaviour. In our study, the FPR test was also correlated with the medial prefrontal cortex and orbitofrontal cortex. Even if the FPR test assesses cognitive ToM, some studies found a link between the FPR test and the medial prefrontal cortex [48, 67, 68]. Moreover, patients with DLB failed especially on the last question of the FPR test, concerning emotional attribution. The atrophy in the medial prefrontal cortex may explain the difficulty in this empathic question, and consequently the FPR test impairments.

The inferior frontal cortex is part of the mirror neuron system and is engaged in both the execution of actions and the observation of these actions performed by someone else. According to the theory of simulation, other people’s mental states are represented by simulating their states with resonant states of one’s own [69]. This process implies mirror neurons and explains the role of the inferior frontal cortex in the RME and FPR tests.

The RME and FPR tests were also correlated with regions of the right TPJ (inferior parietal lobule). This region is strongly associated with ToM for both the affective and the cognitive components [3, 70]. Uddin et al. [71] demonstrated the role of the right TPJ in self–other distinction. In their study, subjects failed to discriminate self-faces and other familiar faces when repetitive transcranial magnetic stimulation was applied over this region. Decety and Lamm [72] performed a meta-analysis to attempt to understand the role of the TPJ in ToM. In their view, the activation in the TPJ during ToM tests relies on a ‘lower-level computational mechanism involved in generating, testing, and correcting internal predictions about external sensory events’ and is not due to a specific role in ToM.

The precuneus, which correlated with FPR and Ekman’s Facial Emotion Recognition tests in our study, is a core region of ToM [3, 70]. It is known for its role in visuospatial perception and episodic memory. According to Cavanna and Trimble [73], the precuneus plays a role in ToM by using mental imagery to represent the perspective of another person.

Patients with RME test impairments had relatively more atrophy in the fusiform gyrus. This region was reported to be related to face and object perception [74]. Furthermore, Gallagher et al. [75] found an association between the fusiform gyrus and understanding the meaning of visual jokes. In a similar manner, Baglio et al. [63] studied performances on the RME test in patients with amnestic mild cognitive impairment and found activation of the bilateral fusiform gyrus during the test. ToM capacities required facial examination mediated by the fusiform gyrus.

Patients with FPR test impairments had relatively more atrophy in the bilateral insula. The role of the insula in emotional response is well known. Bertoux et al. [46] also found a correlation between the insula and the FPR test, particularly with the explanation of faux pas. They supposed that this region was involved in impairment of the last question, about emotional attribution. This hypothesis is congruent with our findings because the results for this question were the most impaired in the DLB group compared with HC.

To summarise, ToM capacities are based not on a single cerebral region but on a neuronal network, including prefrontal regions and the TPJ, precuneus and insula. Our findings in patients with DLB are consistent with those reported in healthy subjects, psychiatric patients and patients with other neurodegenerative cognitive diseases, and they support the existence of such a network. Further studies, including functional connectivity (functional and arterial spin labelling MRI) and anatomical connectivity (diffusion tensor imaging), are now needed to better characterize this network.

Our study has some limitations. The main limitation is that patients with DLB were included on the basis of clinical criteria. Today, the gold standard for the diagnosis of DLB is pathology, and there are no biochemical tests to confirm the diagnosis during the lifetime of the patient, unlike the CSF biomarkers that exist for AD. There is therefore a risk of including patients with other neurodegenerative diseases or with the association of AD and DLB. However, McKeith’s criteria have very good specificity (98 %) [76]), and the majority of our patients had a lumbar puncture, which excluded AD. Another limitation is that we did not perform specific evaluations to determine the social impact of the ToM impairments. It would be interesting in a further study to investigate this impact on daily living using questionnaires.

Conclusions

In this study, we demonstrate ToM impairments in patients with DLB. These deficits were present at an early stage of the disease and affected both the affective and cognitive components of ToM. Impairments were correlated with atrophy in regions classically involved in ToM (prefrontal and orbitofrontal cortex, TPJ, precuneus and insula), which reinforces the hypothesis of a neuronal network of ToM. Further studies are now needed to confirm these first results and to better understand the neural correlates of ToM impairments and their functional and anatomical connectivity.