Adult cognitive outcomes in phenylketonuria: explaining causes of variability beyond average Phe levels

Romani, Cristina; Manti, Filippo; Nardecchia, Francesca; Valentini, Federica; Fallarino, Nicoletta; Carducci, Claudia; De Leo, Sabrina; MacDonald, Anita; Palermo, Liana; Leuzzi, Vincenzo

doi:10.1186/s13023-019-1225-z

Adult cognitive outcomes in phenylketonuria: explaining causes of variability beyond average Phe levels

Research
Open access
Published: 28 November 2019

Volume 14, article number 273, (2019)
Cite this article

Download PDF

You have full access to this open access article

Orphanet Journal of Rare Diseases Aims and scope Submit manuscript

Adult cognitive outcomes in phenylketonuria: explaining causes of variability beyond average Phe levels

Download PDF

Cristina Romani¹,
Filippo Manti²,
Francesca Nardecchia²,
Federica Valentini³,
Nicoletta Fallarino³,
Claudia Carducci⁴,
Sabrina De Leo⁵,
Anita MacDonald⁶,
Liana Palermo⁷ &
…
Vincenzo Leuzzi²

4189 Accesses
30 Citations
12 Altmetric
Explore all metrics

Abstract

Objective

The objective was to deepen the understanding of the causes of individual variability in phenylketonuria (PKU) by investigating which metabolic variables are most important for predicting cognitive outcomes (Phe average vs Phe variation) and by assessing the risk of cognitive impairment associated with adopting a more relaxed approach to the diet than is currently recommended.

Method

We analysed associations between metabolic and cognitive measures in a mixed sample of English and Italian early-treated adults with PKU (N = 56). Metabolic measures were collected through childhood, adolescence and adulthood; cognitive measures were collected in adulthood. Metabolic measures included average Phe levels (average of median values for each year in a given period) and average Phe variations (average yearly standard deviations). Cognition was measured with IQ and a battery of cognitive tasks.

Results

Phe variation was as important, if not more important, than Phe average in predicting adult outcomes and contributed independently. Phe variation was particularly detrimental in childhood. Together, childhood Phe variation and adult Phe average predicted around 40% of the variation in cognitive scores. Poor cognitive scores (> 1 SD from controls) occurred almost exclusively in individuals with poor metabolic control and the risk of poor scores was about 30% higher in individuals with Phe values exceeding recommended thresholds.

Conclusions

Our results provide support for current European guidelines (average Phe value = < 360 μmol/l in childhood; = < 600 μmo/l from 12 years onwards), but they suggest an additional recommendation to maintain stable levels (possibly Phe SD = < 180 μmol/l throughout life).

Public significance statements

We investigated the relationship between how well people with phenylketonuria control blood Phe throughout their life and their ability to carry out cognitive tasks in adulthood. We found that avoiding blood Phe peaks was as important if not more important that maintaining average low Phe levels. This was particularly essential in childhood. We also found that blood Phe levels above recommended European guidelines was associated with around 30% increase in the risk of poor cognitive outcomes.

Long-Term Follow-Up of Cognition and Mental Health in Adult Phenylketonuria: A PKU-COBESO Study

Article Open access 03 August 2017

The effect of improved dietary control on cognitive and psychiatric functioning in adults with phenylketonuria: the ReDAPT study

Article Open access 18 January 2021

Total choline intake and working memory performance in adults with phenylketonuria

Article Open access 29 July 2023

Background

Phenylketonuria (PKU; OMIM#261600) is an inherited metabolic disease where a genetic error results in a partial or complete de-activation of the enzyme phenylalanine hydroxylase (PAH) which normally metabolizes the amino acid phenylalanine (Phe; E.C. 1.14.16.1) into tyrosine (a precursor of dopamine). Phe accumulation results in several and still incompletely known negative effects on the postnatal development of the brain as well as on the functioning of the mature brain [1]. Fortunately, these negative consequences can be controlled by adopting, since birth, a Phe-restricted diet and protein supplementation. There is no question that a low Phe diet must be followed throughout childhood to achieve good cognitive outcomes [2]. However, several questions remain open [3]. We need to know more about: 1. which measures are most important to consider for dietary control (Phe average vs Phe fluctuations); 2. the impact of dietary control on different cognitive functions and possible interactions with age; and 3. which Phe value should be considered safe at different developmental age; there is uncertainty especially regarding the levels which are safe after early childhood. The purpose of this study is to provide some evidence relevant to these questions by analysing the performance of a mixed group of English and Italian early treated adults with PKU (from now on AwPKU) in relation to the current and historical blood phenylalanine control.

Which metabolic measure? (average Phe levels vs Phe variation)

Blood Phe levels are usually measured with the assumption that they correlate with levels in the brain (see Leuzzi et al. [4]; Pietz et al. [5]; Rupp et al. [6], but also Brumm et al. [7], Moats et al. [8]; Schindeler et al. [9] for no relationship. Different measures of blood Phe have been found to correlate with cognitive performance, but their relative contribution is unclear (from now on Phe without qualification refers to blood Phe).

Most studies have assessed the impact of dietary control by considering either current Phe levels or average levels across a time period (also referred to as IDC- index of dietary control). Average levels have generally been calculated as a mean of yearly median values or, more rarely as a mean of half-year median values (for examples of this latter measure see Pietz et al. [10]; Vilaseca et al. [11]). These studies have shown that current Phe levels as well as average Phe levels are good predictors of cognition (for examples of positive associations in adults across cognitive functions see Brumm et al. [7]; Romani et al. [12]; for effects on IQ see Manti et al. [13]; Weglage et al. [14]; for effects on IQ in children, see Waisbren et al. [2]). Note, however, that effects are limited when only a restricted set of tasks is used [15, 16] and/or when only current Phe level has been considered; for example, effects of current Phe on IQ have been inconsistent across studies (see Jahja et al. [17]; Moyle et al. [18] for positive and/or marginal results; see Koch et al. [19]; Feldmann et al. [20]; Pietz et al. [10], for no correlation).

Phe variation (also referred to as Phe fluctuation by some authors) has also been shown to predict cognition. Phe variation has generally been measured as a mean of yearly SD of Phe values [21,22,23]. Most studies have considered children and found that indexes of variation predict IQ (Burgard et al. [24]; Hood et al. [25]; marginally significant results in Anastasoaie et al. [21]; see also Vilaseca et al. [11] for results with a mixed-age group), executive functions [22, 24], motor control [26], white matter integrity [27]; for a review across functions, see Cleary et al. [28]. There is more limited evidence that Phe variation predicts cognitive outcomes long-term, since studies on adult patients are lacking.

Viau et al. [23] studied a mixed sample of children and young adults (N = 55) and assessed the impact of current and historical Phe on cognition. They reported limited correlations with Phe averages and no correlations at all with Phe SD. However, cognition was measured only with limited subtests from the WAIS and the WISC (Block design, Symbol Search and Verbal IQ or Verbal comprehension). Our previous study on a sample of 37 English AwPKU, early treated and with good metabolic control, showed significant effects of both historical Phe average and Phe SD (0–10, 11–16, 17+) on adult cognitive performance measured through IQ and an ad-hoc PKU battery of cognitive tasks [12].^{Footnote 1} Importantly, however, these results did not provide information on the relative contribution of Phe average and Phe SD to cognitive outcomes. These two measures are, in principle, independent of one another. Two individuals can maintain the same average Phe level, but one may show little variation around the mean, with values very similar to one another, while another may show a lot of variation. Thus, both average Phe and Phe variation may contribute independently to good cognitive outcomes. However, in practice, these two measures are highly correlated in PKU populations, because individuals who maintain a lower Phe average, also maintain a more consistent low Phe diet [11, 12, 23, 25].

Hood et al. [25] reported some independent contributions of Phe SD, but they only assessed relationships in children and with limited cognitive measures (they found an independent contribution of childhood SD 5–10 at years or after 10 years on matrix reasoning and number of non-responses in an N-back task). In our study, we aim to assess an independent contribution of Phe SD on adult cognitive outcomes assessed more comprehensively.

Individual variation in cognitive outcomes

While it is clear that cognitive outcomes depend on metabolic control, the extent of this dependence is debatable.

One question relates to whether all effects of having PKU can be eliminated through dietary control [1]. We know that most early-treated AwPKU perform within the norm, but that, as a group, their performance is worse than the controls. What we do not know, however, is whether the whole distribution of cognitive scores is shifted so that even performance at the high-end of the distribution is affected, or, instead, it is only the lower end of the distribution which is affected, where individuals are likely to have maintained poor dietary control. The first option will indicate that there are some fixed costs of having PKU which are not avoidable even maintaining a low Phe diet following current treatment guidelines. The second option instead, will indicate that a strict diet can completely eliminate the cognitive impact of having PKU.

A second, related question concerns the safe target range for blood Phe control at different ages. Current European guidelines advise to maintain Phe average levels below 360 μmol/L, before 12 years of age and below 600 μmol/L thereafter [29, 30]. American guidelines are even more strict recommending 120–360 μmol/L throughout life (American College of Medical Genetics and Genomics, ACMG) [31]. However, even the European guidelines have been criticized for being over stringent [32]. This is because there is little evidence of ill-effects when guidelines are relaxed in adulthood [13] and even the evidence to advocate childhood Phe < 360 is not strong [33,34,35,36]. A way to examine this question is to examine the distributions of cognitive scores within the PKU group in relation to metabolic control (see Waisbren et al. [2] for analyses of children data). This will allow us to examine if there are discontinuities in the distributions of cognitive scores, with pathological scores starting to appear and/or become more frequent when a given metabolic value is exceeded and whether these boundaries are consistent with current guidelines. Additionally, the cost of not following guidelines can be quantified by comparing the rates of poor cognitive scores in individuals which have or have not followed guidelines.

A final, related question, is whether there are individuals who have maintained poor metabolic control, but still have escaped cognitive impact. This will show that there is variability on how negatively PKU affects cognition (see van Vliet et al. [37] for a review of extreme case).

In conclusion, our study has two related aims: 1. To compare the effects of protracted exposure of brain to Phe –best measured through average Phe levels— with the effects of Phe peaks –best measures through SD from the mean--, and possible interactions with age. We want to see whether both average Phe and Phe SD contribute to adult outcomes and whether these two measures have a different weight in childhood and adolescence/adulthood. 2. To assess cognitive variability in a population of adults with PKU to see a) whether effects are pervasive or limited to a portion of individuals, b) whether the Phe boundaries identified by current European guideline are meaningful and c) whether there are exceptional cases where good cognition is achieved in spite of poor metabolic control.

To achieve aims, we have combined results from English and Italian AwPKU tested with the same battery of tasks (N = 56). Italian and English sub-samples show similar patterns of cognitive impairments and relationships with current and historical Phe measures, justifying accruing results (Romani et al., unpublished data). The resulting sample is larger and more varied in terms of metabolic control than most sets reported in the literature allowing better assessment of correlations between metabolic and cognitive variables (current Phe range is 54–2081; SD = 403; compared, for example, to: Brumm et al. [7]: 157–1713; SD = 338; Channon et al. [38]: 221–1233; SD = 261; Jahjia et al. [17]: 66–1550; SD = 342; Smith et al. [39]: 200–1879).

Method

Recruitment

Fifty six early-treated adult PKU participants were tested: 19 Italian and 37 English. They were all diagnosed soon after birth as result of national newborn screening programs.

The 19 Italian AwPKU were recruited from the Clinical Centre for Neurometabolic Diseases Department of Human Neuroscience, Child Neurology and Psychiatry Unit, Sapienza University of Rome. Three participants were currently treated with Kuvan. Nineteen Italian control participants were recruited among friends and students of the researchers. They were matched to the Italian PKU participants for age and education. Among the Italian participants, 4 had a diagnostic Phe level > 600 μmol/L but < 1200 μmol/L; 15 participants had Phe > 1200 μmol/L at birth.

The 37 English AwPKU participants were recruited from the Department of Inherited Metabolic Disorders at the University Hospitals Birmingham. They all had Phe > 1200 μmol/L at birth. The performance of this sample on a larger set of tasks as been described in previous publications [12, 40, 41]. Thirty English healthy controls were recruited through an advertising volunteering website. They were matched to the English PKU participants for age and education.

All AwPKU treated in the English and Italian centres were invited to participate and were accepted in the study on a first come, first served basis. The English study received NHS ethical approval. The Italian study was approved by the local ethics committee. All participants provided informed consent to the study.

Metabolic measures

For both the English and the Italian PKU participants blood spots for blood Phe were taken regularly since diagnosis in early infancy and extensive records were available although there were limited data for a few participants (6 UK participants lacked or had very limited childhood data). We averaged Phe control in three age bands: childhood: 0–10 years old, adolescence: 11–16 years old, and adulthood: 17 years to present. We have also averaged measures throughout the life-time and considered current Phe level (for the Italian group, Phe has been measured immediately before the testing session/s or close to it; for the UK group, Phe has been measured immediately before the two testing sessions and averaged). We considered two types of measures: Phe average and Phe variation. Phe average in each band was calculated by taking the median values for each year and, then averaging the yearly values. The median is the value set halfway in a distribution of scores; it is generally used in the PKU literature rather than the mean because the median is not influenced by Phe variations. It is particularly, important to use the median in our study since we want to contrast a measure of central tendency (median, mean) with a measure of variation. Phe variation in each band was calculated by taking the SD for each year and then averaging yearly values in the band.

Cognitive assessment

Cognitive assessments were carried out in a quiet room at the clinical centres in Birmingham and Rome by one the psychologist on the team. The testing session for the Italian participants lasted between 2 and 3 h. The English participants were tested in two separate sessions of similar length (a less extensive set of tasks was administered to the Italian participants because of resource limitations). A few PKU participants were not able to attend the second testing session which resulted in some data points missing for some tests (N = 31 instead of 37).

IQ was measured using, the Wechsler Adult Intelligence Scale-Revised (WAIS-R, [42]) with the Italian participants and the Wechsler abbreviated scale of intelligence (WASI, [43]) with the English participants, which includes the following subtests: Vocabulary, Block Design, Similarities, and Matrix Reasoning. In addition, participants were given a set of tasks chosen from the larger set of tasks administered in our previous studies [12, 40]. We chose tests which either showed a strong difference between participants with PKU and controls and/or strong correlations with metabolic measures. We also gave precedence to tasks with non-linguistic stimuli which did not need adapting across languages. Therefore, we did not include tests of picture naming, reading, spelling and orthographic knowledge (spoonerisms, phoneme deletions). Accuracy in these tasks was very good and not related to metabolic measures [12]. Speed of processing was assessed with visual search tasks. To reduce the number of tasks tapping similar functions, we also did not administer the Tower of Hanoi, the lexical learning task, the Stroop, and nonword repetition. Measures of STM (digit span and Corsi span) and a baseline measure of peripheral speed of processing were included for completeness and because of mixed results from the literature (for impairments in digit span and nonword repetition see Palermo et al. [40]; for contrasting results see Brumm et al. [7], and Moyle et al. [18]; see also Jahja et al. [17], for deficits with increasing working memory load).

The following cognitive areas were assessed:

1.
Visual Attention. This was assessed with four tasks [12, 40]: 1.Simple Detection: Press a response button as soon as a ladybird appears on the screen; 2. Detection with Distractors: Press a button when a ladybird appears on the screen alone or with a green bug; in the second part of the task the instruction was changed to press a button when a green bug appears on the screen alone or with a ladybird; 3. Feature Search: Detect a target among distractors not sharing features by pressing a ʽyes’ or ʽno’ button (e.g., a red ladybird among green bugs); 4. Conjunction Search: Detect a target among distractors sharing features (e.g., red ladybird among red bugs and green bugs). Both reaction times (RT from now on) and accuracy measures (error rates) were taken.
2.
Visuo-motor Coordination. This was assessed with two tasks: 1. Grooved Pegboard Test [44]: Put pegs into the holes of a board using only one hand as quickly as possible (short version with two trials one with the dominant and one with the non-dominant hand to match Italian and English samples) and 2. Digit Symbol Task [42]: Fill as many boxes as possible with symbols corresponding to numbers (key with associations remains visible) in 90 s. Trail Making Test A (TMT A) [45, 46]: connect circles containing numbers in ascending order of the numbers as quickly as possible.
3.
Complex Executive Functions. This was assessed with four tasks tapping skills such as planning, flexibility and abstract thinking: 1. The Wisconsin Card Sorting Test (WCST) 64 card version [47]: Discover the rules to match cards from a deck with four reference cards according to the shape, number or colour of the symbols on the card; feedback is provided to allow learning. Flexibility is required when the sorting rule is changed unknown to the participant and the new rule must be discovered. We used three different scores: total errors, number of perseverative responses and number of completed categories. 2. Difference in speed between Trail Making Test B-A (TMT B-A) [45, 46]. A involves connecting circles containing numbers in ascending order; B also involves connecting circles in ascending order, but alternating between circles containing numbers and letters . Only completion time is considered in this test; when, occasionally, an error is made, it is corrected by the examiner and this affects time to complete the task. 3 Fluency: For letter fluency: generate as many words as possible starting with a given letter in one minute of time (for Italian: P, F and L; Novelli et al. [48]; for English: C, F and L; Benton et al. [49]); for semantic fluency [50, 51]: generate as many names of animals as possible in one minute of time. This requires planning an efficient search through the lexicon.
4.
Short-term Memory/Working Memory. This was assessed with two tasks: 1. Digit Span: Repeat a sequence of digits spoken by the examiner, soon after presentation; 2. Corsi Block Tapping Test [52]: The examiner taps a sequence of blocks and the participant must reproduce the sequence in the same order.
5.
Sustained Attention – This was assessed with the Rapid Visual Information Processing task (RVP; adapted from Sahakian et al. [53]): detect three target sequences of 3 digits by pressing the response key when the last number of the sequence appears on the screen. Scores are percentage correct.
6.
Verbal Memory and Learning. This was assessed with The Rey Auditory Verbal Learning Test [54, 55] which asks for learning, immediate recall, and delayed recall of a list of 15 words. The list is presented five times and participants are asked to recall the words immediately after each presentation. After the 5th presentation (A5), an interfering list (B1) is presented and participants are asked to recall this list and then, once again, the original list (A6) without a further presentation. Finally, participants are asked to recall the original list after a 20-min filled interval. Our scores include total number of errors across the five learning trials (A1–5); errors in recalling the words after an interfering list (A6); and, again, errors in delayed recall of the original list.
7.
Visual Memory and Learning. This was assessed with the Paired Associates Visual Learning [56]: Learn to associate objects with locations.

Demographics and preliminary analyses

Data analysis

For each participant, we computed z scores for each task using the relative (Italian or English) control group as reference. We also averaged z scores across tasks as a measure of overall cognitive performance. We report results of the PKU group using z-scores. Group differences of PKU from controls is examined through t-tests. Relationships between cognitive scores and Phe is examined with Pearson bivariate correlations. To reduce the number of variables per task, we did not carry out correlations with accuracy measures in search task (which are not impaired), and we only correlated for the TMT, the B-A condition; for the WCST, the total errors; and for the Rey, performance over 1–5 trials (learning) and in delay recall.

Participants

Table 1 shows demographic variables for age, gender, years of education and Phe control across age. Average Phe level increased across ages (diet became more relaxed), Phe variation remained more stable (see also Hood et a [25]., for similar results in children up to 18 years old).

Table 1 Demographic and metabolic information for English and Italian PKU groups matched for age, gender and education, and for the whole group. Blood Phe measured in μmol/L

Full size table

Cognitive outcomes

Cognitive performance across tasks is shown in Table 2. Patterns of results are very similar to those reported previously with an overlapping sample of 37 AwPKU [40], except for the visual paired-associate learning which shows a modest group impairment. The tasks with the largest differences from controls were tasks of visual search measured in terms of speed of processing and task involving visuo-motor coordination (pegboard, digit symbol, TMT A). Executive functions in terms of flexibility and planning, (TMT B, verbal fluency^{Footnote 2}) and sustained attention were also impaired consistent with previous results (see for speed of processing: Albrecht et al. [57]; visuo-motor coordination: Griffiths et al. [58]; Pietz et al. [10]; executive functions: Smith et al. [39]; Brumm et al. [7]; sustained attention: Schmidt at al [55].; Bik-Multanowski et al. [59]; Weglage et al. [14]; Jahja et al. [17]).

Table 2 Cognitive performance of the PKU group (English and Italian PKU participants; N = 56). Z scores calculated from respective control groups (N = 30 and N = 19). To facilitate interpretation, for all scores, higher Z-score reflect worse performance. Scores in bold are significantly higher than expected. ms. = milliseconds; sec. = seconds

Full size table

Cognitive outcomes in relation to metabolic control

Table 3 shows bivariate Pearson r correlations between cognitive and metabolic measures. Correlations were extensive both for Phe average and Phe variations. Correlations were significant both with current and historical measures and for all tasks (except the Corsi span), although they were not systematic across all ages and types of metabolic measures. Significant correlations with lifetime measures (either average or SD) were found with IQ, speed in visual search, tasks tapping visuo-motor coordination, EF (WCST, TMT-B-A and semantic fluency), sustained attention, Rey words delayed recall, and paired visual learning.

Table 3 Pearson r correlations between Phe measures taken at different points in time and adult cognitive performance (N participants = 51–56; N tasks = 16). Significant correlations are in bold. ^a = significant <.05; ^b significant <.01. To facilitate interpretation, positive correlations always indicate that high Phe was associated with worse performance. Thus, for IQ, digit span, Corsi span, and semantic fluency correlations were reversed

Full size table

Consistent with previous results [12], tasks tapping visuo-attentional speed were associated with blood Phe early in life, but less with adult blood Phe and not at all with current Phe level. AwPKU who had maintained a more constant control in early childhood (0–10 years) still showed positive effects many years later, in adulthood, with faster RTs. In contrast, other tasks correlated strongly even with current Phe level. FSIQ, visuo-motor coordination (digit symbol), sustained attention, TMT B-A and learning are all strongly affected by current Phe level (as well as by levels at previous years).

Phe average vs Phe SD

Data analyses

Effects of Phe average and Phe SD were compared with different analyses. We compared the effect of these measures at different ages by contrasting correlations between Phe average/Phe SD in either childhood or adulthood and adult cognitive outcomes. We compared the number of significant correlations through χ² tests and average size of correlation with t-tests.

Furthermore, we compared the relative contribution of Phe average and Phe SD to cognition by carrying out regression analyses where cognition was measured with either IQ or mean z-score in our cognitive battery as a summary measure of performance (contribution of individual measures is shown in the previous section with correlation analyses). We carried out three types of regressions. First of all, we compared the effects of Phe average and Phe variation across the lifespan. We carried out a two-steps regression where education was entered in the first step (to partial out any contribution) and both Phe average and Phe variation were entered together in the second step (forward method where the variables making the strongest contribution is considered first and, then, any other variable which makes an additional significant contribution is added). Note that entering education at a first step is a conservative choice, not only because there is a mutual relationship between IQ and education (with education influencing IQ, but also IQ influencing education), but also because Phe levels may influence education. In a second analysis, we assessed directly the contribution of Phe SD after Phe average was considered. Therefore, Phe average was forced in the first step and Phe variation was entered in the second step. Finally, we carried out a third type of regression to consider the contribution of metabolic measures at different ages. Based on the correlation results, we contrasted Phe average and Phe variation taken in childhood with the same measures taken either in adolescence or adulthood. All measures were entered together in the regression equation to see which combination predicted cognition best (SPSS forward method). In this analysis the order in which the variables are entered in the equation is identified by the regression model. The variable making a stronger contribution is entered first following by any other variable making an additional, significant contribution. We considered either adult or adolescent values in separate analyses because of their high correlation (for Phe average r = .74; for Phe variation r = .50) and becaue we wanted to avoid power with more variables.