Introduction

Infantile hemangiomas (IH) are the most common vascular tumors of childhood, with estimated incidences ranging from 2.0 to 4.5% [1, 2]. A substantial proportion of otherwise healthy infants with IH requires treatment with beta-blockers to prevent or treat complications, such as ulceration, functional impairment, or disfigurement [3]. Concerns have been raised about the long-term impact of propranolol, a lipophilic beta-blocker, due to possible treatment effects on the central nervous system (CNS) at a vulnerable age [4]. Previous clinical studies have shown that atenolol, a hydrophilic beta-blocker, is as effective as propranolol but seems to be associated with fewer CNS effects during IH treatment [5, 6]. Since 2014, propranolol has been the only worldwide approved beta-blocker to treat IH. Atenolol has been frequently prescribed for IH, though off-label [7, 8].

To date, no long-term neurocognitive problems in children treated with beta-blockers for IH have been reported [9,10,11,12]. However, the generalizability of previous studies was limited due to small sample sizes (n = 23 [11] and n = 27 [12]). Furthermore, previously used outcome measures such as general intelligence or broad neurodevelopmental milestones are not sensitive to subtle deviations in complex neurocognitive functions, e.g., working memory, processing speed, and attention [9, 10]. Also, previous research did not compare the long-term effects between propranolol and a hydrophilic beta-blocker, such as atenolol. Therefore, the aim of this study was to investigate and compare long-term neurocognitive outcomes (i.e., working memory, processing speed, and attention) in school-aged children who had been treated with either propranolol or atenolol for IH during infancy.

Materials and methods

Design

This two-center cross-sectional study was conducted at the vascular anomaly centers of the Erasmus MC, University Medical Center Rotterdam (Erasmus MC, Rotterdam, the Netherlands), and the University Medical Center Utrecht (UMCU, Utrecht, the Netherlands) [13]. Both centers introduced propranolol treatment in 2008. UMCU switched to atenolol treatment in 2009 and Erasmus MC switched to atenolol treatment in 2013, based on positive clinical experience and before propranolol was globally approved [14]. This enabled us to study an internationally unique cohort of school-aged children, who had received either propranolol or atenolol, independent of their disease characteristics.

Children were assessed during an outpatient visit, consisting of a neuropsychological assessment by a psychologist (MH), a standard physical examination by a pediatrician (PdL, JB, MR), and a dermatological examination by a pediatric dermatologist (MdG, SP). Children’s parents completed questionnaires about the child’s psychological, neurocognitive, and physical development. Information on IH treatment was retrieved from medical records.

Participants

Prior to recruitment, we screened records of all patients born between 2008 and 2014 who were treated for IH at either center to identify any eligible children. Children were actively recruited between April and December 2019; the last recruited child was assessed in February 2020.

The inclusion criteria were (1) age ≥ 6 years upon participation in neuropsychological assessment; (2) IH previously treated with either oral propranolol at ≥ 2 mg/kg/day or oral atenolol at ≥ 1 mg/kg/day; (3) treatment duration ≥ 6 months; (4) treatment initiated before the age of 1 year; (5) IQ estimated > 55 (no moderate to severe intellectual disability); and (6) child and parent(s)/legal guardian(s) having sufficient comprehension of the Dutch language to understand study materials. The exclusion criteria were (1) prematurity < 37 weeks of gestation; (2) low birth weight (< 2.5 SD for gestational age); (3) complicated neonatal period with hospitalization; (4) suspected PHACE syndrome; (5) other treatment than oral propranolol or atenolol for IH (such as other oral beta-blockers, oral corticosteroids, vincristine, interferon alpha, topical beta-blockers, intralesional corticosteroids, imiquimod, rapamycin, laser, surgery, and cryotherapy); (6) documented psychological or neurocognitive problems before starting beta-blockers; (7) medication that could negatively affect psychological or neurocognitive functioning (including multiple general anesthesia); (8) genetic syndromes known to affect cognitive performance; (9) concomitant or successive use of propranolol and atenolol; and (10) participation in a study or compassionate use program with ID V0400SB.

This study was exempt from the Dutch Medical Research Involving Human Subjects Act according to the institutional review boards of Erasmus MC (MEC-2019–0268) and UMCU (19–115/C). All parent(s)/legal guardian(s) provided written informed consent.

Measurements

We included those measures of neurocognitive functions that have been documented to be affected by beta-blockers [4]. All measures are standardized for children aged 6 to 12 years, have age-corrected normed scores based on the general Dutch population, and have sufficient psychometric properties [15,16,17,18].

The primary outcome measure was the Cognitive Proficiency Index (CPI), a subscale of the Wechsler Intelligence Scale for Children-V, Dutch version (WISC-V-NL). The CPI comprises four subtests that measure working memory and processing speed. Attention may be inferred based on demands required to complete these subtests. The CPI composite score is more reliable than the individual subtests. CPI standardized scores have a mean of 100 and a standard deviation of 15 points [15].

Secondary outcomes were general intelligence (full-scale intelligence quotient (FSIQ) and general ability index (GAI) of the WISC-V-NL), as well as auditory, visuospatial, and narrative memory. Auditory memory was measured with the Rey Auditory Verbal Learning Test (RAVLT). This included immediate recall (auditory working memory and attention) and delayed recall based on scores at immediate recall (long-term auditory memory). Raw scores were age-corrected and converted into Z-scores (mean 0, standard deviation 1) [16]. Visuospatial memory was evaluated with the Dutch version of the Developmental Neuropsychological Assessment-II (NEPSY-II-NL), subtest Memory for Designs (MD) and Memory for Designs Delayed (MDD). MD assesses spatial memory for novel visual material, while MDD assesses long-term visuospatial memory. Verbal narrative memory was assessed with the NEPSY-II-NL subtest Narrative Memory. Total scores of all NEPSY-II-NL subtests were converted into age-corrected percentiles [17]. Percentiles ≤ 10 were considered to be in the clinical range.

Parents reported on their child’s executive functioning in daily life and sleep habits. Executive functioning was assessed with the Behavioral Rating Inventory of Executive Function (BRIEF), which resulted in T-scores (mean 50, standard deviation 10) [18]. Sleep habits were assessed with the Child Sleep Habits Questionnaire (CSHQ) [19]. The CSHQ allows for a total score, which reflects the major sleep disorders in children aged 4 to 11 years old. The mother’s highest completed education level was used for its association with socioeconomic status and parent intelligence and was categorized according to the International Standard Classification of Education (ISCED) [20].

We retrieved the following information from patient records: the child’s sex, age at treatment initiation (months), treatment duration (months, excluding temporary treatment interruptions), maximum dose (mg/kg/day), average dose (mg/kg/day), and cumulative dose (total exposure corrected for weight, mg/kg).

One certified psychologist performed all neuropsychological assessments, blinded to the type of beta-blocker treatment the child had received as an infant and the treatment practices in both centers.

Data analysis

All test assumptions were checked prior to data analysis. This included tests for normality of continuous data, using inspection of plots, means and medians, kurtosis and skewness, and Shapiro–Wilk testing.

Comparisons between propranolol and atenolol groups

We used independent samples t tests to analyze differences in our primary outcome (CPI) between children treated with propranolol and those treated with atenolol. A multivariable linear regression, with CPI as the dependent variable and beta-blocker type as predictor, was performed controlling for the child’s sex, the mother’s education level, the child’s age at treatment initiation, treatment duration, and cumulative dose. For normally distributed secondary outcomes at interval level, we used the same procedure as for the CPI. Non-normally distributed or ordinal outcomes were analyzed with Mann–Whitney U tests and multivariable linear regression. Dichotomous variables were analyzed with Fisher’s exact tests and logistic linear regression.

Comparisons to general population norms

Differences between sample scores and general population norms were analyzed using one-sample t tests or one-sample Wilcoxon signed rank tests for skewed data. We compared dichotomous data with expected proportions using chi-square tests. If any differences were observed between the beta-blocker groups, analyses were performed independently for each beta-blocker group.

All data were entered into an online OpenClinica 3.12.2. database and analyzed by authors MH, AR, and RS using SPSS 25.0. As missing data were rare (< 5% of data), we used complete case analysis. A two-sided p < 0.05 was considered statistically significant in primary analyses. Accounting for multiple comparisons, a two-sided p < 0.002 was considered statistically significant in secondary analyses (Dunn–Šidák correction). Effect sizes were calculated corresponding to each statistical method and interpreted according to Cohen’s guidelines [21].

Results

Participant characteristics

Figure 1 shows the flow chart of the inclusion process. Consent to participate was obtained for 109 of the 162 potentially eligible children. After inclusion, 4 children were considered screen failures, resulting in a final sample of 105 children (66% of the 158 eligible children; 78% of the 134 successfully contacted eligible children), consisting of 36 children treated with propranolol and 69 children treated with atenolol.

Fig. 1
figure 1

Recruitment flowchart. Abbreviations: np, number of patients treated with propranolol; na, number of patients treated with atenolol; np&a, number of patients treated with both propranolol and atenolol [13]

Participant characteristics are described in Table 1. There was a female predominance (81%), consistent with the literature [22]. The sex ratio was the same in both beta-blocker groups. As propranolol was initiated before atenolol in both treatment centers, children treated with propranolol were significantly older than children treated with atenolol. All outcomes were age-corrected; thus, this difference did not affect our results. The prevalence of attention-deficit/hyperactivity disorder (ADHD) (5.7%) was in line with the estimated population prevalence among children and adolescents (5.9–7.1%) [23]. Four children (3.8%) were treated with methylphenidate. In accordance with test criteria of the WISC-V-NL, children with ADHD (with or without methylphenidate) were included in the study [15].

Table 1 Participant characteristics

Corresponding to the transition to atenolol at UMCU in 2009 and at Erasmus MC in 2013, almost all atenolol-treated children were treated at UMCU, and most propranolol-treated children were treated at Erasmus MC. To avoid multicollinearity, we did not control the analyses for treatment center.

Given the standard maintenance dose of 2 mg/kg/day for propranolol and 1 mg/kg/day for atenolol, significant differences in average, cumulative, and peak dose between children treated with propranolol and atenolol were as expected. Treatment duration differed significantly between both beta-blocker groups; the median duration was almost 6 months longer for children treated with propranolol. Given that all IH were treated until sufficient clinical improvement was achieved, this difference in treatment duration may reflect higher severity of IH treated with propranolol compared to IH treated with atenolol.

Comparison between propranolol and atenolol

The primary outcome measure CPI was normally distributed and did not differ between children treated with propranolol (M = 98.9, SD = 17.4) and children treated with atenolol (M = 101.6, SD = 12.8; p = 0.38) (Table 2). Similarly, analysis corrected for confounders showed no significant effect of beta-blocker type on CPI scores (p = 0.81). None of the secondary outcomes differed between the two groups.

Table 2 Analyses of the difference in neurocognitive functioning between children treated with propranolol and atenolol for IH

Comparison between beta-blockers and norm data

The sample mean of the primary outcome CPI (M = 100.7, SD = 14.5) was not significantly different from norm data (M = 100, SD = 15; p = 0.64); the effect size was small (Table 3). Scores on all secondary outcomes, with the exception of the BRIEF, did not differ from norm scores. Parents of children treated with beta-blockers scored significantly lower (i.e., better) than test norms on the BRIEF, with a large effect size.

Table 3 Univariate analyses of the difference in neurocognitive functioning between children treated with beta-blockers for IH and normed scores based on the general Dutch population

Post hocanalysis: sex differences

Having included sex as a confounder in corrected analyses of the primary outcome measure, we found a significant association between sex and CPI scores, seemingly independent of beta-blocker type. Post hoc analyses showed that the CPI mean of males (M = 92.4, SD = 10.3) was 10.3 IQ points lower than the CPI mean of females (M = 102.7, SD = 14.7). Considering the small number of males (n = 20), we performed a Mann–Whitney U test to compare medians. The sex difference was significant (p = 0.001), with a medium effect size (r = 0.32). A sex difference with a small effect size (r = 0.25) was also observed for FSIQ (p = 0.009) but not for any other secondary outcomes (Supplementary Table S1). Sex differences were not found for variables such as age, mother’s education level, IH location, treatment type, dose, or treatment duration. To compare the sample of males more precisely with males from the general population, we obtained original Dutch normative data 1:1 matched by sex, mother’s education, and age from the authors of the WISC-V-NL. The CPI mean of males in the study sample was 12.4 IQ points lower than the CPI mean of matched males from the general population (M = 104.8, SD = 13.6), which was significant (Mann–Whitney U test, p = 0.003) with a medium effect size (r = 0.46).

Discussion

This two-center study is the largest study to date investigating the long-term neurocognitive functioning of otherwise healthy children (age ≥ 6 years old) who, as infants, had received propranolol or atenolol for IH. Considering the drug characteristics of propranolol, we expected that children treated with propranolol would have lower scores on a pre-specified outcome measure for working memory, processing speed, and attention (CPI) in comparison to children treated with atenolol. Our results show no differences between the two groups for the CPI and secondary outcomes. Furthermore, neurocognitive outcomes did not differ between the total sample and children from the general population. However, in post hoc analyses, males had substantially lower CPI scores.

The finding that the level of neurocognitive functioning at school age was not different between children treated with either propranolol or atenolol during infancy is not in line with expectations based on the pharmacological characteristics and side effect profiles of these beta-blockers. Although propranolol passes the blood–brain barrier, it does not seem to affect neurocognitive development during infancy. If, nevertheless, any disruption of CNS development occurs under the influence of beta-blockers during infancy, the neurocognitive consequences of this disruption may be resolved by brain plasticity [24].

The lack of difference in the level of neurocognitive functioning between children treated with beta-blockers and children in the general Dutch population is in line with previous studies of children treated with propranolol for IH [11, 12]. These earlier studies, however, had small sample sizes, which limits the ability to draw conclusions. Our study analyzed 105 children and provides further evidence that beta-blockers are generally safe as far as long-term neurocognitive functioning is concerned. We found that scores on parent-reported executive functioning were better than norm scores. Since parents were aware of the research hypothesis, expectation bias may have influenced reporting. Nonetheless, the combined results illustrate that the studied children perform at an adequate level in both a research setting and daily life.

The results regarding the 20 males in our sample should be interpreted with caution. Males had substantially lower CPI scores, both compared to females treated with beta-blockers and compared to males from the normative sample (1:1 matched for sex, age, and mother’s education). This difference was considered clinically significant, as it may have implications for the educational attainment of these males [25]. In our sample, we found that females represented the full range of the normal distribution, whereas males only represented the lower range of the normal distribution. A similar distribution of scores was observed in raw data published by González-Llorente and colleagues (2017) [11]. Underlying mechanisms may be sex differences in brain plasticity and neurological vulnerability during infancy, pharmacokinetic differences between males and females, or unknown pathology leading to both IH and cognitive problems in males [26,27,28]. Given these results, we cannot be certain about the long-term safety of beta-blocker treatment in male infants until further research has been done with a larger sample. Working together, the clinician and parents should weigh the risks and benefits before starting treatment of IH with beta-blockers, especially when the child is male.

A strength of the current study is the substantial size of our unique cohort of children who received either propranolol or atenolol independent of disease characteristics. The large sample size enabled us to control for covariates such as sex, mother’s education level, and dose-related variables. Additionally, we applied measures that are sensitive to subtle deviations in neurocognitive functioning.

We maintained strict inclusion criteria. Therefore, the results cannot as yet be extrapolated to the entire population of children who have received beta-blockers for IH, e.g., preterm infants or children who have been treated for less than 6 months. Given our negative findings and in this strictly defined sample, we cannot exclude a type II error, although the CPI difference found between both beta-blocker groups is not considered clinically relevant. Additionally, as in previous research, the current research was limited by a lack of a suitable control group of children with complicated IH not receiving beta-blocker treatment, since withholding beta-blocker treatment for complicated IH is considered unethical.

In conclusion, this study provides robust evidence that for children with IH, treatment with propranolol or atenolol is not associated with long-term deficits in neurocognitive functioning. Although beta-blockers thus appear to be a safe treatment for IH with regard to long-term neurocognitive functioning, there are concerns about possible effects of this treatment on the long-term neurocognitive functioning of males.