Introduction

With a prevalence of 2.6 per 10,000 births [1], congenital diaphragmatic hernia (CDH) is a rare condition, which is life-threatening and often requires a prolonged stay on the intensive care unit (ICU) [2, 3]. Fortunately, survival rates of children born with CDH are increasing, but so is the occurrence of developmental delays, which may stretch into adulthood and beyond [4, 5]. Delays concern both motor function and neurocognitive domains which impact daily life; particularly memory and attention deficits have been reported at school age, possibly contributing to academic difficulties [5,6,7]. Of note, those educational problems tend to become overt only around 8 years of age, possibly because of the stronger appeal on a diversity of brain functions that are needed for cognitive processing. The latter phenomenon has been coined the “growing-into-deficit” phenomenon, as deviant brain development in early life is revealed not until those higher cognitive brain functions are demanded [8]. While intelligence has been investigated in CDH-survivors at preschool age [9, 10], more specific neurocognitive functions such as memory have not been studied extensively. It is therefore currently unknown whether these deficits are already detectable at an earlier stage. If so, timely detection is important, because this would provide the opportunity to intervene and possibly prevent children from educational delay [11]. Moreover, clinical variables predictive of cognitive deficits are not yet available, while those can be of additional value in the search for possible early interventions. We hypothesized that memory and attention problems are present earlier in life and can be detected by using comprehensive neurocognitive assessments [12,13,14]. We aimed to investigate intelligence, memory, inhibition, and attention together with clinical predictors in preschool children born with CDH.

Methods

Inclusion criteria

We included data of CDH-survivors born between February 2010 and November 2015 who had joined the structural follow-up program at the Erasmus MC-Sophia Children’s Hospital. This multidisciplinary program includes assessments of neurocognitive function, motor function, lung function, and exercise capacity. At 30 months, and at 5, 8, 12, and 17 years of age, a psychologist evaluates neurodevelopment and cognitive functioning. In case of emerging problems, children are offered extra help [5, 15].

Exclusion criteria

Children were excluded if they had been diagnosed with CDH later than 7 days post-partum, when the anomaly appeared to be a para-esophageal hernia or an eventration of the diaphragm, when diagnosed with a genetic syndrome known to affect neurodevelopment, or in case of severe neurologic or developmental impairment, due to which assessments could not be administered.

Neuropsychological assessment

All cognitive tests were age-appropriate and offered to all participants in our follow-up program as part of routine care (Appendix). They were administered by certified and experienced psychologists. Norm-scores of all tests are based on a representative sample of children without disabilities. Intelligence was assessed with the Dutch version of the Wechsler Preschool and Primary Scale of Intelligence–III (WPPSI-III-NL) [16], which has an average population mean score for intelligent quotient (IQ) of 100, with a standard deviation (SD) of 15. The WPPSI-III-NL covers three domains of intelligence, i.e., performance IQ (3 subtests), verbal IQ (3 subtests), and processing speed (2 subtests). Total IQ is based on seven subtests. To assess both inhibition and attention, several subtests from the second version of A Developmental NEuroPSYchological Assessment (NEPSY-II-NL) were used [17], i.e., Inhibition-Naming (NEPSY-II-NL-IN), Inhibition-Inhibition (NEPSY-II-NL-II), and Auditory Attention (NEPSY-II-NL-AA). During Inhibition-Naming, children are given a sheet with target pictures and are asked to name the shape or direction of the target pictures as accurately and rapidly as possible. Performance on the task is reported as a norm score, with a mean score of 10 and an SD of 3, representing the time needed to finish the task. Low scores indicate slow processing speed or poor naming ability.

For Inhibition-Inhibition, children must rapidly and accurately name the opposite shape or direction of the target pictures. Completion time and the number of mistakes made are expressed in percentile categories (0–10 and > 10). Many mistakes indicate an impulsive approach, whereas a longer completion time suggests that requirements for inhibition slow down processing speed, indicating a problem with inhibition. The number of mistakes made in both Inhibition-Naming and Inhibition-Inhibition is added up, resulting in a percentile category representing the level of performance.

The Auditory Attention task was used to assess selective and sustained attention. In this task, children must selectively respond to auditory target words. The result is represented by the number of correct responses and is expressed in percentile categories. Lower scores indicate increased distractibility.

As for memory testing, two subtests of the first edition of the Kaufman Assessment Battery for Children (K-ABC, Dutch version) [18] were administered. The Hand Movements subtest assesses visual-motor memory. In this subtest, the child is shown different hand positions and must reproduce these. In the Number Recall subtest for verbal memory, the child must repeat a sequence of numbers of increasing length. Both tests have a mean population mean of 10 and an SD of 3. Higher scores represent better performance.

Patient characteristics

We recorded perinatal characteristics such as the following: fetoscopic endoluminal tracheal occlusion (FETO, yes/no), inborn (yes/no; yes if born in our hospital or another CDH center), observed-to-expected lung-to-head ratio (o/e LHR) (recorded at 32-week gestational age, or measurement closest to that moment), primary closure of defect (yes/no), open repair versus minimal access surgery (MAS) (yes/no), duration of initial stay at ICU (days), cardiac malformations (yes/no; recorded if follow-up by a pediatric cardiologist was necessary), cardiac malformations for which surgical intervention is needed (yes/no), maximum vasoactive inotropic score (VIS) during first 2 months of initial stay, veno-arterial extra corporeal membrane oxygenation (ECMO) treatment (yes/no), maternal education level (MEL, low-to-middle or high, based on International Standard Classification of Education (ISCED)) [19].

Statistical analyses

Mann–Whitney U tests were used to compare continuous data of participants and those of non-participants lost to follow up; chi-square tests and Fisher’s exact tests were used for categorical data. The one-sample binomial test was used to compare outcome proportions in our sample with normative proportions, and the one sample t-test was used to compare continuous test results with normative scores. Effect sizes were interpreted according to Cohen’s d. When assumptions for linear regression were met, univariable regression analyses were performed to assess associations between characteristics and neuropsychological outcome. In case of a nonparametric distribution, the Mann–Whitney U test was used to investigate associations between dichotomous determinants and cognitive scores and Kendall’s tau was used in case of continuous determinants. Characteristics used for association analyses were birthweight, inborn versus out-born, primary closure of defect versus patch repair, open repair versus MAS, initial ICU stay in days, maximum VIS, and MEL, as we aimed to cover perinatal characteristics, degree of illness, effects of surgical approach, and socioeconomic status. Determinants that were significantly associated with cognitive performance in univariable analyses were used in multivariable linear regression analyses. We used continuous raw scores for association analyses with NEPSY-outcome (e.g., completion time in seconds and number of mistakes) instead of percentile categories, to perform linear regression. In case variables were strongly skewed, log-transformation was considered. All statistical tests were two-sided with a significance level of 0.05. Analyses were performed by using SPSS version 25 (IBM SPSS Statistics, IBM Corporation, Armonk, NY).

Results

Population

We included 63 children who had been assessed with at least one cognitive test, either the WPPSI-III-NL, NEPSY-II-NL-AA, NEPSY-II-NL-IN, or the K-ABC visual and verbal memory (For a flowchart of inclusions, refer to Fig. 1). Fifty-one (81%) children had been assessed with the WPPSI-III-NL; fifty-two (83%) with both the NEPSY-II-NL-IN and the NEPSY-II-NL-II; fifty-three (84%) with the NEPSY-II-NL-AA; and 61 (97%) with the K-ABC. Forty-seven of the 63 children (75%) were born in a CDH center, and in 42 (89%) o/e LHR had been recorded prenatally, with a median of 47.9%. Comparison of the baseline characteristics of participants and non-participants revealed one significant difference: a higher number of children with non-Dutch ethnicity among the non-participants (Table 1).

Fig. 1
figure 1

Flowchart of inclusion and exclusion of study participants. CDH, congenital diaphragmatic hernia; ECMO, extracorporeal membrane oxygenation; WPPSI, Wechsler Preschool and Primary Scale of Intelligence–III; NEPSY I-N, NEPSY Inhibition-Naming; NEPSY I-I, NEPSY Inhibition-Inhibition; K-ABC, Kaufman Assessment Battery for Children. *Chromosomal duplication, Sotos syndrome

Table 1 Background characteristics

Test results

Performance on all tests did not significantly deviate from the norm, except for scores on performance IQ and visual memory, which were significantly higher (Cohen’s d respectively 0.45 and 0.56). For details on results on all tests, refer to Fig. 2 and Supplementary Tables 1 and 2 (Table S1 Continuous scores compared to reference scores, Table S2 Percentile scores compared to reference scores).

Fig. 2
figure 2

Results on neurocognitive tests in our study population compared to reference population. *p < 0.05, **p < 0.001, for an overview of scores refer to Supplemental Tables S1 and S2

Univariable analysis

Longer ICU stay was associated with lower performance IQ (B −3.37, p = 0.04 (Confidence Interval (CI) −6.65 to −0.09)). Low-to-middle MEL was associated with lower verbal IQ (B −17.38, p < 0.01 (CI −26.79 to −7.98)) and total IQ (B −14.30, p < 0.01 (CI −23.41 to −5.19)) compared to high MEL.

Children who had undergone open surgery made significantly more mistakes on the inhibition-inhibition task (NEPSY-II-NL-II) than the children who had undergone minimal access surgery (MAS) (p < 0.001), and more mistakes were associated with lower birthweight (τ =  −0.25, p = 0.02), longer ICU stay (τ = 0.25, p = 0.01), and higher maximum VIS (τ = 0.28, p = 0.01).

The total number of mistakes during naming and inhibition (NEPSY-II-NL-IN and NEPSY-II-NL-II) was significantly higher in children treated with open surgery versus MAS (p < 0.001), and more mistakes were significantly associated with longer ICU stay (τ = 0.22, p = 0.03) and higher maximum VIS (τ = 0.27, p = 0.01).

For auditory attention, we found significantly lower outcome scores in relation to low-to-middle MEL versus high MEL (p = 0.04).

Determinants significantly associated with verbal memory were not identified, while low-to-middle MEL was associated with lower visual memory scores (B =  −1.52, p = 0.04, (CI −2.97 to −0.07)).

Multivariable analysis

For the multivariable analysis, birthweight, ICU stay in days, maximum VIS, and open surgery versus MAS served as independent variables; inhibition and naming task scores served dependent variables.

In multivariable analysis, only open repair remained significantly associated with the number of mistakes in the inhibition task (NEPSY-II-NL-II; B = 5.52, p = 0.04, (CI 0.32–10.72)) and number of mistakes in inhibition and naming tasks combined (B = 7.60, p = 0.01, (CI 1.77–13.44)).

For further details regarding analyses, refer to supplemental files (Table S3 Univariate analyses Intelligence; Table S4 Univariate analyses Memory; Table S5 Association analyses Inhibition, Naming and Auditory Attention; Table S6 Multivariable analysis Inhibition-Inhibition, total mistakes; Table S7 Multivariate analysis Inhibition-Inhibition and Inhibition-Naming, total mistakes).

Discussion

In this study, we tested cognitive functioning and explored possible predictors in 5-year-old CDH-survivors. Given the abundant evidence that any neurocognitive problems of CDH-survivors will eventually appear around school age [5,6,7, 20, 21], we hypothesized that cognitive deficits could be detected earlier in life. However, we found no significant deviations from normative scores at this age on intelligence, attention, inhibition, and verbal memory, while visual memory scores were significantly higher than normative scores. The only clinical variable that remained significant in the multivariable analysis was type of surgery; i.e., children who had undergone open repair surgery scored lower on inhibition tasks than those treated with MAS. The absence of neurodevelopmental problems in our study population may support the growing-into-deficit theory, which has it that neurocognitive problems become functionally evident with increasing age. An alternative explanation is that the tools currently available are not sensitive enough to detect these problems.

Studies on neurodevelopment in this patient population at toddler age generally reported neurodevelopment to be within normal ranges [22,23,24], or only slightly below [25, 26]. Most of these studies, however, used a version of the Bayley Scales of Infant Development, which is not devised for assessing specific neurocognitive functions, but rather for general development [27]. Yet, several domains of neurodevelopment around 5 years of age can be assessed with the WPPSI. Danzer and colleagues, for example, found total, verbal, and performance intelligence to be within normal ranges in 35 5-year-olds born with CDH, who, however, had significantly more low and borderline scores on at least one domain compared to normative cohorts [9]. This discrepancy with our findings might be explained by the greater number of children treated with ECMO in their group. Interestingly, the cohort studied showed favorable scores on academic achievement using assessment instruments to estimate school readiness [9]. Another study found that 44% of their CDH-patients (born 2001–2005) aged 4 to 7 scored < 80 on any of the tests of both the WPPSI and a language assessment [10]. Yet, the sample size was small (n = 16), and four patients had previously been diagnosed with neurodevelopmental impairment. In our study, children with intellectual disabilities or with comorbid syndromes affecting cognition were excluded.

Given the results from our and the aforementioned studies, it remains a topic of debate whether it is even possible to detect problems with higher order neurocognitive functions in children this young. As higher neurocognitive functions such as working memory gradually develop between 4 and 18 years [28], it might not be surprising that we found no obvious abnormalities at 5 years of age.

Determinants

In univariable analysis, longer ICU-stay was negatively associated with performance IQ, which has been demonstrated before [22, 29]. Moreover, lower birthweight, higher maximum VIS, and open repair were related to poorer inhibition. In a previous study, maximum VIS was found to be related to memory performance, possibly as an indicator for severity of illness or rather as a proxy for altered brain perfusion [6]. In this study, open repair was the only clinical variable that remained significantly associated with poorer inhibition in multivariable analysis. The effect of open repair versus MAS on brain development is still a topic of debate. Yet, as open repair is mostly performed in clinically unstable patients, including the ones treated with ECMO, whereas MAS is only done in relatively stable patients with smaller defects, the association we found might reflect the extent of critical illness rather than the effect of the surgical technique on the brain. Indeed, children treated with laparotomy had a significant longer stay on the PICU (p < 0.01).

Strengths and limitations

We were able to collect data from a relatively large group of children born with a rare anatomical malformation who had been prospectively included in a follow-up program. However, several limitations must be mentioned. First, o/e LHR was not included in the association analyses because up to 25% of our patients were out-born, indicating that the defect was not diagnosed prenatally, and that, therefore, o/e LHR had not been measured. The measurements that we do have show a relatively mild cohort, as nearly no children with severe lung hypoplasia are included (o/e LHR < 25%), which must be taken into account while interpreting our results. In addition, the defect size could not be reliably retrieved for every participant, as structural recording of defect size only started in recent years. Second, we excluded patients with severe genetic syndromes, but we do not know whether the others might have had small genetic defects, as exome sequencing was not yet offered in this study period [30]. Therefore, we cannot say for certain whether the possible existence of small genetic defects might have influenced our findings. Of note, recent studies have demonstrated that de novo deletions in CDH-survivors are associated with worse neurodevelopmental outcome [31]. Third, several eligible children were not assessed due to fatigue, decreased attention, or lack of motivation. Therefore, our findings might overestimate performance, as those children might have performed worse on several tasks. In addition, we lost significantly more children with a non-Dutch ethnicity to follow-up compared to the ones with a Dutch ethnicity. Fourth, although we examined a relatively large cohort of CDH-survivors, the sample might not be fully representative of the general population of CDH-survivors. In a previously studied cohort of children born with CDH (born 2006–2009), more neonates had been treated with ECMO and maximum VIS scores were higher than in our cohort [6]. The question remains whether this difference is due to a new standardized treatment protocol [32] or a natural fluctuation in disease severity [2]. As we cannot rule out that the current study cohort had a relatively mild level of disease severity, given the relatively high O/E LHR measurements and the clinical characteristics, we assume that conclusions should be interpreted with caution and that more studies in other cohorts of children with CDH are needed.

Future directions

There is consensus that being born with CDH has an impact on neurodevelopment at school age. Yet, there is a need for predictive factors. While we merely focused on clinical characteristics, social factors such as socioeconomic background should be further explored in coming studies. Maternal education level (MEL) was relatively high in our cohort in comparison to normative data, although quite a few numbers on MEL were missing. Considering the assumed correlation of MEL and intelligence of offspring [33], we would have expected a generally high IQ among our population, instead of average. This argues for an actual effect of having survived CDH on brain development, despite the mild clinical characteristics. Another field of interest is neuroimaging markers. Recently, prenatal cerebellar diameters of CDH-patients on ultrasound were found to be smaller than the cerebellar diameters of healthy fetuses [34], which is of importance as the cerebellum is involved in many neurocognitive functions [35]. Also, given that intelligence in CDH-survivors is generally within the normal range [36], but problems occur in specific neurodevelopmental domains [6], possibly a subtle form of brain injury is reflected in those problems. Longitudinal magnetic resonance imaging (MRI) may inform us about the underlying mechanisms of growing into those deficits. For instance, previous MRI studies have shown that hippocampal volume is decreased in survivors of critical illness at school age and is associated with decreased memory function [37]. Thus, brain imaging, together with close neuromonitoring during critical moments, enables investigation of the mechanisms and trajectories of deviant brain development. This will eventually result in possible targets for intervention. Moreover, age-appropriate validated assessment instruments including up-to-date population specific reference data will always be a challenge for multicenter studies and outcome registries.

Conclusion

We found neurodevelopmental outcome to be within normal ranges at 5 years of age within our cohort of CDH-survivors. However, the knowledge that the neurocognitive performance of these children around 8 years of age is generally not normal underscores the growing-into-deficit hypothesis in this group of children. Multiple cohort studies are imperative to account for the natural fluctuation in clinical course, and to determine whether some subgroups show signs of neurocognitive developmental problems as early as preschool age, for which timely intervention is desirable. Therefore, future research should focus on both more sensitive assessments and specific determinants of neurodevelopment in a structured follow-up program to prevent or mitigate any future neurocognitive problems in these children.