Background

Classical galactosemia (CG; OMIM #230400) is a rare disorder affecting the galactose metabolism. It is autosomal-recessively inherited and caused by a profound deficiency of the enzyme galactose-1-phosphate uridyltransferase (GALT; EC 2.7.7.12) [1]. Together with two other enzymes: galaktokinase (GALK) and UDP-galactose epimerase (GALE), GALT is part of the Leloir pathway that metabolizes ingested galactose into glucose-1-phosphate used for energy and into UDP-galactose which is used for glycosylation of complex molecules. In untreated CG patients, galactose, galactose-1-phosphate, galactitol and galactonate accumulate in body tissues and fluids [2]. So far over 180 different mutations in the gene encoding for GALT have been identified and associated with CG [3]. Other mutations are known to cause only mild deficiency, including the so-called Duarte-2 mutation (N314D), the most common of them. Compound-heterozygotes for a Duarte and a classical mutation normally have a residual GALT activity of 14–25% and a good prognosis without treatment [4].

Newborns with CG develop a life-threatening intoxication syndrome with acute liver failure, renal tubular dysfunction, sepsis and cerebral oedema. Symptoms resolve within a few days after establishing a galactose-restricted diet [1, 2]. Even when typical clinical signs of galactosemia are present, the diagnosis may be missed. In order to ensure rapid diagnosis and adequate management, newborns are screened for galactosemia in many countries. Newborn screening can decrease the morbidity and mortality caused by the acute complications of galactosemia in the neonatal period.

However, even strict adherence to the diet cannot prevent the long-term complications that may occur in CG, such as deficits in cognitive functions, speech and language impairments and neurological deficits including tremor and other extrapyramidal motor abnormalities, as well as premature ovarian insufficiency and low bone mineral density [1, 5, 6]. Most studies researching the neuropsychological impairments have focused on global measures of IQ, reporting overall IQ scores within the low to below average range, with a great variability between individual patients [7,8,9,10,11,12,13,14]. The cognitive impairments are thought to result from a broad set of deficits [10]. Few other studies also described visual-perceptual difficulties and less well-developed executive functions [7, 12, 15]. Many of the patients suffer from speech and language impairments [16, 17]. Higher incidence of psychiatric disorders such as depression or anxiety and problems with social interactions are also known in CG [17]. The biomarker galactose-1-phosphate, correlates only loosely with long-term neuropsychological and motor outcome [5, 18, 19].

The exact pathomechanisms of these neuropsychological long-term impairments remain unclear. The brain may already be damaged in utero, as elevated levels of galactose-metabolites were found in foetuses from the age of 20 weeks of gestation [20]. Others suggested that the endogenous galactose production might lead to a toxic accumulation of galactose metabolites even when patients are on a galactose-restricted diet [1]. The currently prevalent theory is that abnormal galactosylation of complex molecules, including myelin, may contribute to the pathology [21, 22]. Neuroimaging studies showed poor myelination and other white matter abnormalities, as well as cerebral and cerebellar atrophy [10, 23, 24].

In order to further characterize the neuropsychological profile of adult patients with CG, especially aspects of executive and visual perceptual functions, we used selected tasks from the Cambridge Neuropsychological Test Automated Battery (CANTAB), a computerized assessment tool. This battery is highly standardised and easy to conduct, and the results highly reproducible.

Results

Characteristics of patients, Duarte subjects and controls (Table 1)

Twenty-two patients, i.e. 58% of the known Swiss CG population ≥ 16 years of age, were enrolled in the study. Thirteen (59%) were females. Mean age was 30 years (SD 11), median 29, range 16–59 years. Of the 22 CG patients all except one were diagnosed through newborn screening. Most of them nevertheless exhibited symptoms of intoxication before treatment initiation. All patients had two known classical mutations (Q188R, K285 N, L195P, H319Q, A320T and M142K), except two patients with one classical and one known slightly milder (R258C) mutations and one patient with the genotype Q188R/L264 V and residual GALT activity of 1.5%. Mean IQ score of the CG patients was 77 (SD 17), the median also 77, and the range 49 to 112. Comparison of patient and control groups did not show significant differences concerning age (p = 0.475) and gender (p = 0.903). The education levels of controls and the parents of the patients were comparable (p = 0.383). But, as expected, they differed significantly between patients and controls (p = 0.028). The three subjects with mild “Duarte” galactosemia were compound-heterozygous for a classical and the Duarte-2 (N314D) mutations. They were clinically normal.

Table 1 Characteristics of Patients, Duarte and Controls

Validation of controls

Mean Z-scores of controls were compared to the CANTAB norms by one sample t-test. Overall, the control group was comparable to the normative data cohort of CANTAB (see Additional file 1: Table S1). We therefore compared the test results of CG patients with the results of our control group.

CANTAB testing

Table 2 presents the descriptive data for all measures of the CANTAB. In the Motor Screening Task (MST), CG patients did not show a significantly longer mean latency or made more errors than controls. In the Paired Associates Learning (PAL) task, CG patients performed worse compared to controls. They needed more trials and made more errors until successful completion of a stage. Patients also made more errors in the stage with six shapes and needed more trials in total. However, none of these measures reached significance in nonparametric testing. In the Spatial Span (SSP) task, CG patients had a significantly shorter span length, which means they could remember shorter sequences than controls. Patients also made significantly more usage errors. These errors are made when the subject selected a box that did not change colour. In the Reaction Time (RTI) task, CG patients performed equally well as controls in the simple-choice part. In the five-choice part movement time was slower for CG patients compared to controls, again without reaching significance, because of the important variation and large overlap with controls. In the Rapid Visual Information Processing (RVP) task, CG patients performed significantly worse in all but one outcome measure. They were less likely to identify the target sequence but did not have more false alarms than controls (Fig. 1). The results of the Emotion Recognition Task (ERT) are displayed in Fig. 2: Recognition of the emotions happiness and sadness was not significantly different between CG patients and controls. However, CG patients had significantly lower percentages for the recognition of the emotions anger, disgust, fear and surprise. They also needed more time to answer than controls.

Table 2 CANTAB results
Fig. 1
figure 1

Rapid visual information processing (RVP) is a measure of sustained attention. Subjects had to recognise target sequences of three digits from numbers appearing in a pseudo-random sequence at a rate of 100 digits per minute. The number of total hits is significantly lower (** = p < 0.01) in Galactosemia patients compared to controls, whereas the probability of false alarms is not different in both groups

Fig. 2
figure 2

Emotion Recognition Task (ERT). The participants had to recognise facial expressions of six different emotions presented for 200 ms. Recognition of basic emotions, such as happiness and sadness, was not significantly different between patients and controls, whereas emotions considered more complex, including surprise, anger, disgust and fear, appeared significantly more difficult for patients than for controls. Percentage of correct recognition of each of the six emotions listed. NS = not significant, * = p < 0.05

Importance of outcome measures

A random forest model consisting of 5000 trees was trained and validated on the CANTAB data to quantify the relative importance of each outcome measure for the discrimination of group membership (control vs. galactosemia patients; Additional file 2: Figure S1). The higher the MDA (mean decrease in accuracy) of a given measure the more important its contribution to group discrimination. Several global and individual outcome measures of the ERT (including total number correct, mean latency, as well as the emotions surprise and fear) and RVP tasks were most important for group discrimination. Bootstrap estimate of error rate: 35.14%.

Correlations

A priori significant correlations of ERT and RVP outcome measures are shown in Table 3. When corrected for multiple comparisons (p-value FDR), only three measures, all from the ERT (total correct, recognition of sadness and disgust), correlated significantly with the overall IQ score of the patients but none with their level of education. In contrast, no significant a priori correlation at all was observed with the education of controls (not shown) or the maximum education of the patients’ parents. Note, that no correlation with IQ, educational level or age was found for the ERT fear, which was also important for group discrimination (see above). Two outcome measures, PAL total errors at the stage of six shapes and RTI five-choice reaction time showed a significant correlation with the age of the controls, but not with the age of patients (not shown). No significant correlations between CANTAB outcome measures and any biochemical marker, such as galactose-1-phosphate or residual GALT activity (not shown) were found.

Table 3 Correlation of ERT and RVP with other patient characteristics

Discussion

In this study, we aimed at deepening the neuropsychological phenotype of classical galactosemia patients by administering a series of tasks from the Cambridge Neuropsychological Test Automated Battery (CANTAB) to a Swiss cohort of 22 adult CG patients. This cohort represents 58% from totally 38 known patients in Switzerland.

In our cohort, we found the most robust deficits in facial emotion recognition (ERT) and rapid visual information processing (RVP). Most of the ERT and RVP outcome measures were correlated to overall IQ and education of the patients, much less to age. Apart from three instances (see Table 3), these correlations were no longer significant when the p-value was adjusted for multiple comparisons. Bad performance could not be explained by comprehension problems of the subjects with low IQ, since the recognition of ‘happiness’ was not different between patients and controls. Comparison of overall IQ and ERT performance in a scatter plot is shown in Additional file 3: Figure S2.

The patients performed also worse on several other outcome measures of the four other tasks, MOT, PAL, SSP and RTI. Probably due to the relatively small number of participants, only the Spatial Span (SSP) remained significant, when nonparametric testing and for multiple testing adjusted p-values were used (see Table 2).

To our best knowledge, this study is the first to examine facial emotion recognition in CG patients using the ERT of the CANTAB. Our results show that CG patients were able to correctly identify the basic emotions happiness and sadness but performed significantly worse on the more complex emotions anger, fear, disgust and surprise. Previous studies reported that CG patients have problems with social interactions and that they exhibit internalizing symptoms such as depression and anxiety [12, 17, 25]. In other studies, children with CG were also described as shy and withdrawn in social relationships [8, 26]. Interestingly, there seems to be a gap between the parents view and the patients’ self-perception of their emotional state: while the parents report considerable psychosocial difficulties, the patients themselves often fail to recognize them [12, 27]. Data concerning the patients’ behaviour and emotional state was mostly collected by means of questionnaires filled out by parents, teachers or patients themselves, but so far, little is known about the neuropsychological basis of the psychosocial impairment. Our results suggest that patients with CG have difficulties reading the facial expressions of their opposite correctly and therefore may not always react appropriately. Considering that the ERT is based on photographs of people acting the target emotions, it cannot be excluded that social conventions concerning these emotions play a role in the difficulties of the patients to recognize them. It is possible that CG patients also have difficulties to express their emotions or even to perceive and identify their own emotions. This could explain the differing opinions of parents and patients mentioned above. Importantly, these emotion recognition deficits may also be related to the observation that many galactosemia patients manifest autistic traits, but generally without fulfilling the diagnostic criteria.

Deficits in emotion recognition have also been described in other inborn errors of metabolism, such as Wilson’s disease [28] and tyrosinemia type I [29]. As in both studies different tests were used, conclusions by comparing the results can only be drawn cautiously. In the study with patients with Wilson’s disease, the most significant deficit was found in recognizing “anger”, while in our galactosemia patients, the most important difficult emotion was “fear”. The authors argue that patients with Wilson’s disease tend to react more aggressively to ambiguous social situations than healthy controls [28]. In contrast, galactosemia patients, as mentioned before, tend to be shy and withdrawn. In tyrosinemia type I patients, in turn, Van Ginkel and colleagues found less specific and less pronounced deficits in emotion recognition [29]. Nevertheless, similar to our findings, these were not completely explained by the correlation with IQ.

The RVP task revealed another weakness of CG patients. Our results showed that rapid visual information processing, a measure of sustained attention, is impaired in adult CG patients. Widhalm et al. postulated that galactosemic patients suffer from cognitive slowing and evaluated this outcome by means of reaction time tasks [30]. In their study, children with CG showed reduced ability to sustain visual attention, as well as attention deficits in central processing stages indicating a reduced processing capacity. Additionally, they had a remarkable impairment of information processing speed [30]. The significantly longer mean latency of patients in our study also suggests reduced velocity of visual information processing. In a study from Taiwan, RVP was administered to a relatively large cohort of adolescents with autism spectrum disorders (ASD) [31]. Compared to healthy controls, they performed significantly worse, even after adjusting for full IQ. The authors propose that RVP could serve as a trait marker for ASD. These findings are interesting in two respects: first, RVP abnormality in galactosemia patients may be another link to autism as discussed above for the ERT and second, this measure appears to be independent of IQ, at least in the normal IQ range. To our knowledge, no study has systematically investigated the correlation of RVP performance with low full IQ.

In the study of Widhalm et al. the CG children also performed significantly slower than controls on a task of simple reaction time [30]. Our patients however performed equally well in both measures of the simple-choice part and the reaction time of the five-choice part of the RTI. In contrast, the movement time in the five-choice part was slower, although this difference only reached significance when means were compared. This may be due to poor visual-motor integration described by previous studies [10, 32]. In addition, motor difficulties alone may have an influence, as it is easier to learn a uniform movement than an unpredictable one.

Results on PAL revealed problems with visual memory, which is most likely due to impaired visual perception as described by other researchers before [7, 15]. Reduced working memory capacity seems to be involved, too, as indicated by a significantly shorter span length of the CG patients compared to the controls on the SSP task. These findings are in line with previous studies reporting working memory scores of CG patients in the low average range [12, 13]. Furthermore, a recent study performed resting-state functional MRI on CG patients in order to assess the organization of core processing systems of the brain [33]. They found abnormalities in networks linked to spatial orientation, attention, sensory-motor integration and motor planning. In addition, altered connectivity was found in networks involved in visuospatial capacity and working memory. The alterations correlated with some neurocognitive tests which indicates a relation with the clinical phenotype [33].

Conclusions

In conclusion, our study showed that CG patients have impaired visual perception, sustained visual information processing and visual-motor integration, thus confirming findings of previous studies. More interestingly, however, our study showed a deficit of facial emotion recognition in CG patients. To our best knowledge, this is the first time that this specific impairment has been demonstrated in the context of CG. The difficulty to recognise emotions correctly may have a considerable impact on patients’ social life. The selected CANTAB tasks proved useful to detect specific deficits of CG patients. Especially the ERT and the RVP appeared to be important for group discrimination. They could therefore be used in future studies such as functional MRI studies aiming to find neuronal correlates of the cognitive long-term complications, as well as surrogate markers of efficacy for potential new treatments. Finally, the findings of this study could also help to design programs for galactosemia patients aiming at the development of effective strategies to cope with the everyday consequences of these specific deficits in emotion recognition, in visual information processing and sustained attention.

Methods

Subjects and controls

The current study was approved by the local Ethics Committee and all subjects gave informed consent. The International Galactosemia Registry of the European Galactosemia Network (EGN Registry) was implemented in Switzerland in 2015, aiming at the inclusion of all CG patients, most of whom have been diagnosed by new-born screening since the mid-1960ies. All 38 known patients with CG in Switzerland who are ≥16 years of age, were contacted and invited to participate in this study. CG was defined by a known genotype of classical galactosemia or a residual GALT enzyme activity below 10%. Some patients declined to participate in the study because they did not feel well enough (n = 5). One of these suffered from a second condition (Down’s syndrome) and three were born before newborn screening. Others declined because they were not available, mainly for professional reasons (n = 5). For the remaining, the reason is not known (n = 6). The mean age of the non-included patients was 35.3 years (SD 13.1; range 18–59). The final sample consisted of 22 CG patients (59% females). Fifteen controls also completed the CANTAB test battery. They were recruited from a laboratory, administrative and medical staff. It was taken care that their level of education was similar to the level of education of the patient’s parents and male-female proportion was close or identical to the patient cohort, in order to reduce selection bias. In addition, three subjects with mild “Duarte” galactosemia were enrolled. They underwent identical testing as the patient group but were analysed separately. Due to their neuropsychological deficits, patients with CG often achieve lower education than their parents and non-affected siblings. In order to get a measure of the cognitive and psychosocial functioning of the patient families and the controls, of which no full IQ scores were available, we assessed and classified their level of education, as well as of the patients, as follows: “School without qualification”, i.e. either regular schooling not completed or special needs schooling. “School with qualification”, i.e. completed regular schooling and some additional professional training. “Vocational”, i.e. full professional education after obligatory school at age 16 and parallel to high school and college (this is the main educational path in Switzerland, with a good professional standing). “Undergraduate” and “Postgraduate” are the two University levels of the European Bologna system.

Cognitive assessment

The study was part of a larger study conducted at the University Hospital of Bern in Switzerland, which included a full IQ assessment using the Wechsler Adult Intelligence Scale, Fourth Edition (WAIS-IV). Six tasks from the Cambridge Automated Neuropsychological Test Battery (CANTAB) were administered to all subjects including the controls in a session of approximately 60 min using German and French translations of the standardised test instructions and a Windows Surface touch-screen tablet. Thus, each task was explained to the subjects in a thorough and standardized way, and it was made sure that he/she had understood the instructions.

The following tasks were selected (see Additional file 1: Table S2 for a description of the tests):

  1. 1.

    Motor Screening Task (MOT)

  2. 2.

    Paired Associates Learning (PAL)

  3. 3.

    Spatial Span (SSP)

  4. 4.

    Reaction Time (RTI)

  5. 5.

    Rapid Visual Information Processing (RVP)

  6. 6.

    Emotion Recognition Task (ERT)

For reference, see http://www.cambridgecognition.com/cantab/cognitive-tests/ (Cambridge Cognition Ltd., 2017).

Statistical analysis

Baseline characteristics were presented in a descriptive format, showing mean with standard deviation and median with range for age. The Student’s t-test was applied for mean, the Wilcoxon rank-sum test for median comparison between patients and controls. Frequencies with percentages were shown for categorical variables and the Chi-Squared test was used for comparison. The one sample t-tests was used to compare z-scores of controls to CANTAB normative data. For the comparison of the outcomes of the CANTAB sub-tests we again used the t-test for mean and the Wilcoxon rank-sum test for median, as well as linear models to compute p-values corrected for the influence of age, gender and the maximal educations of parents. As multiple statistical tests were performed, these p-values were adjusted using the method of Benjamini and Yekutieli to reduce the false discovery rate. The relative importance of outcome measures for predicting CG status was assessed by a random forest model consisting of 5000 trees, implemented in the randomForest R package. The random forest bootstrap estimate of error rate was 35.14%. The Spearman correlation was used and p-values were computed using Spearman’s rho statistics. All analyses were performed in version 3.4.1 of the R statistical environment.