Introduction

Very low birth weight infants (VLBWI) are a high-risk population exposed to high-morbidity rates and a wide spectrum of long-term neurodevelopmental abnormalities. Although advances in perinatal medicine have led to an increase in the survival rates at extreme gestational ages [1,2,3,4,5], the long-term neurologic outcome of these patients remains a matter of concern due to high rates of neurodevelopmental disorders including intellectual deficits, behavioral disorders, cerebral palsy, and epilepsy [3, 4, 6,7,8,9,10]. These sequelae of prematurity have a great impact on the child’s future health, with notable family and social impact.

Brain imaging using ultrasound (US) and magnetic resonance imaging (MRI) is a valuable tool during neonatal admission to diagnose brain injury, assisting the clinician to predict the long-term outcome of preterm infants. The increasing research interest in the developing brain together with the technological improvement of these tools has led to a better understanding of the impact of prematurity on the immature brain. MRI is considered the gold standard but cannot rival US for its role in sequential and incubator-based neurological assessment in the preterm infant. Aside from brain injury, US has the potential to estimate brain volumes as reliably as MRI [11] allowing the study of early brain growth patterns. Moreover, we have previously demonstrated that those preterm infants born at lower gestational age and exposed to an increasing number of comorbidities have a deviated pattern of brain growth [12]. However, our study was previously based on the results of term-equivalent MRI while our aim in this study is to identify the pattern of early brain growth in the preterm infants in relation to the 2-year neurodevelopmental outcome.

Materials and methods

Study population

This longitudinal study included VLBWI admitted to the Hospital Puerta del Mar, Cádiz, Spain, from May 2018 to January 2021. We consecutively enrolled those VLBWI with a birth weight equal or less than 1500 g and/or a gestational age at birth equal or less than 32 weeks. The exclusion criteria were defined as the presence of congenital or chromosomal anomalies, metabolic disorders, and central nervous system infections. We also excluded those preterm infants who presented posthemorrhagic ventricular dilatation or died. This study was approved by the Research and Ethics Committee, and all parents or guardians of the participants provided informed consent.

Perinatal and postnatal variables were prospectively collected (see Table S1 in Supplemental material for a detailed definition of the clinical variables).

2D and 3D brain US

Weekly 2D and 3D brains US were carried out while the included patients were admitted to the neonatal intensive care unit (NICU), with the infant lying supine and their head turned to the right. Volume acquisition was carried out using the 3D option of the 3D/4D Voluson S8 BT18 (General Electric Healthcare, Buckinghamshire, United Kingdom) as explained elsewhere [11].

TBV was measured by manual tracing the brain contour on 6 slices at 30 degrees rotation on the vertical axis using VOCAL (Virtual Organ Computer-Aided Analysis) feature of the 4D View software (version 17.0; GE Healthcare). This technique provides a reliable measure of TBV as previously published by our group [11].

Brain MRI

At term-corrected age, all patients underwent a cranial MRI. MRI scans were performed using 1.5 T scanner Magneton Symphony (Siemens Health Care, Erlangen, Germany) located in the radiology unit. T1-weighted images were obtained using a three-dimensional spoiled gradient [repetition time 1660 (RT)/echo time 5.16(ET)] and transverse T2-weighted turbo spin-echo imaging (4180.00/98.00).

Term-MRI scans were evaluated using the scale published by Kidokoro et al. [13], which separately grades the development and injury of the cortical and deep gray matter, white matter and cerebellum. Those with a score of less than 8 points were considered to have normal/mild abnormalities at term-MRI, while those with a score equal or greater than 8 points were classified as having moderate/severe abnormalities at term-MRI.

Assessment at two years of age

All the included patients were reviewed after discharge as part of the neonatal neurology follow-up program. Assessments at two years of corrected age were performed using the Bayley Scales of Infant and Toddler Development, Third Edition (Bayley-III). The scores obtained on the cognitive, motor, and language scales are standardized with a mean of 100 and a standard deviation of 15. We considered both the quantitative data and further dichotomized the scores considering a good neurodevelopmental outcome if the score was greater than or equal to 85 and adverse outcome if they scored under 85 for each scale.

Statistical analysis

Clinical characteristics and demographic variables were described as frequency and percentage if categorical, or mean and standard deviation (sd), or median and interquartile range [IQR] according to their distribution. Bivariate analysis was performed using Pearsonʼs chi-squared test or Fisherʼs exact test for categorical data and Studentʼs t-test or Mann–Whitney U test for continuous variables after testing for normality. Multilevel linear regression models were used to study the relationship between Bayley-III scores, TBV, and clinical variables accounting for repeated measurements and time. The included variables were selected based on the theoretical background, and a backward stepwise approach was performed to exclude the non-significant variables if not considered to be variables for which an adjustment was needed.

Statistical analysis was conducted using Stata 16.0 (Stata Statistical Software: Release 16. College Station, TX: StataCorp LP). A result was considered statistically significant at p < 0.05.

Results

Clinical characteristics of the studied population and 2-year neurodevelopmental outcomes

For this study, we included those VLBWI who were admitted to the NICU at Puerta del Mar Hospital from May 2018 to January 2021, including a total population of 163 patients. Nineteen (11.7%) of these patients died during the neonatal period. Six (3.7%) patients were excluded: one patient for congenital cytomegalovirus infection (CMV), a patient with Down syndrome, and four patients for developing posthemorrhagic ventricular dilatation (PHVD). Of the remaining 138 patients, our final sample size included 105 (76.1%) that completed the assessment at 2 years of corrected age. These patients had a combined total of 719 brain US during their stay in the NICU (see Fig. S1 in Supplemental material).

Our population had a mean gestational age at birth of 29.3 (± 2.3) weeks and mean birth weight of 1168.6 (± 363.1) grams. Seventeen patients (16.19%) were small for gestational age (SGA). A detailed description of the perinatal variables, socioeconomic status, and comorbidities related to the 2-year neurodevelopmental outcome is shown in Table S2 in Supplemental material.

Motor outcome

Those with an adverse motor outcome had a lower proportion of exposure to prenatal steroids compared to those with a good motor outcome (7/13 (53.85%) vs. 75/88 (85.23%); p = 0.015), a higher proportion of severe retinopathy of prematurity (ROP) (4/13 (30.77%) vs. 3/91 (3.3%); p = 0.004), moderate/severe bronchopulmonary dysplasia (BPD) (6/13 (46.15%) vs. 13/90 (14.44%); p = 0.014), intraventricular hemorrhage (IVH) grade 3 (3/13 (23.08%) vs. 3/92 (3.26%); p = 0.024), moderate/severe white matter injury (WMI) (2/13 (15.38%) vs. 0/92 (0%); p = 0.014), higher scores on the Kidokoro scale (2 [0–10] vs. 0 [0–2]; p = 0.03), and a greater number of comorbidities (1 [0–3] vs. 0 [0–1]; p = 0.006), respectively (see Table S2 in Supplemental material).

Cognitive outcome

The proportion of multiple births 7/9 (77.78%) was higher in those with an adverse cognitive outcome when compared to those with a good cognitive outcome (34/96 (35.42%); p = 0.026). No other differences were found in the baseline characteristics of the studied population related to cognitive outcome (see Table S2 in Supplemental material).

Language outcome

Sex was associated with language scores at 2 years with females having better outcomes: 47 females/83 (56.63%) had good vs. 6 females/22 (27.27%) with an adverse language outcome; p = 0.001. Adverse language outcome was related to severe ROP (4/22 (18.88%) vs. 3/82 (3.66%); p = 0.035) and moderate/severe WMI (2/22 (9.09%) vs. 0/83 (0%); p = 0.042) (see Table S2 in Supplemental material).

Total brain volume (TBV) during early postnatal life related to 2-year neurodevelopmental outcome

We studied the association of sequential measurements of TBV, gestational age (GA) at birth, and PMA at the time of US, with the 2-year neurodevelopmental outcome (see Table 1). TBV was related to cognitive and language outcome, while no association was found with motor outcome.

Table 1 Total brain volume (TBV), GA at birth, and PMA related to 2-year neurodevelopmental outcomes

TBV showed a slower increased related to PMA in those VLBWI who had an adverse cognitive outcome, with mean TBV differences between both outcome groups being significant from 28 weeks PMA onwards and ranging from 4.56 cm3 at 28 weeks PMA to 42.58 cm3 at 43 weeks PMA (see Table 2 and Fig. 1).

Table 2 TBV differences in VLBWI with good versus adverse cognitive outcome by PMA
Fig. 1
figure 1

Total brain volume (TBV) during early postnatal life related to cognitive outcome

Similarly, TBV showed a slower increase related to PMA in those VLBWI who had an adverse language outcome, with mean TBV differences between both outcome groups being significant from 28 weeks PMA onwards and ranging from 2.21 cm3 at 28 weeks PMA to 26.98 cm3 at 43 weeks PMA (see Table 3; Fig. 2).

Table 3 TBV differences in VLBWI with good versus adverse language outcome by PMA
Fig. 2
figure 2

Total brain volume (TBV) during early postnatal life related to language outcome

Association of perinatal factors, maternal level of education, comorbidities, and brain injury with the prognostic scores

Motor outcome

Motor outcome at 2 years was related to GA at birth (β coef = 0.52; p = 0.008), sex (β coef (female) = 2.23; p = 0.025), being SGA (β coef = -4.4; p = 0.0001), and moderate/severe findings in term-equivalent MRI (β coef = −14.87; p = 0.0001). We did not find a statistically significant association of brain volumes during NICU admission and socioeconomic status (SES) on motor scores (see Table 4).

Table 4 Association of perinatal factors, SES, comorbidities, and brain injury with prognostic outcome

Cognitive outcome

Cognitive outcome was associated to GA at birth (β coef = 0.56; p = 0.005), sex (β coef (female) = 2.13; p = 0.033), SES (low SES group β coef = −5.7; p = 0.0001), being SGA (β coef = −2.67; p = 0.028), moderate/severe abnormalities on term-MRI (β coef = −12.87; p = 0.0001), and TBV (β coef = 0.02; p = 0.037) (see Table 4).

Language outcome

Language outcome at 2 years was associated with GA at birth (β coef = 0.86; p = 0.0001), SES (low SES group β coef = −11.85; p = 0.0001; medium SES group β coef = −10.51; p = 0.0001), moderate/severe abnormalities at term MRI (β coef = −14.26; p = 0.0001) while brain volume in early postnatal life was not related to language scores (see Table 4).

Brain growth rate related to cognitive outcome

As TBV was more consistently related to cognitive outcome, we explored brain growth rate related to the 2-year cognitive outcome. We calculated brain growth rate as the difference in TBV between two consecutive ultrasound scans divided by time (cm3/week). In those VLBWI with good cognitive outcome, brain growth rate was as follows: TBV growth rate (cm3/week) = 18.78 + (0.33 × (PMA-33)) while in those with an adverse cognitive outcome, it was adjusted to the following equation: TBV growth rate = 13.73 + (0.64 × (PMA-33)) (see Fig. 3 and Table S3 in Supplemental material).

Fig. 3
figure 3

Brain growth rate during early postnatal life related to cognitive outcome

TBV during early postnatal life in VLBWI with normal cognitive outcome related to PMA as a reference for clinical use

To facilitate establishing routine TBV monitoring during NICU admission of VLBWI, we estimated the TBV percentiles by PMA and sex in those patients with good cognitive outcomes (see Table 5 and Fig. 4). TBV can be accurately estimated using 2D US measurements of three orthogonal axes (biparietal diameter, vertical axis, and anteroposterior axis) as we have previously shown [11].

Table 5 TBV percentiles (cm3) by postmenstrual age in preterm infants with normal 2-year cognitive outcome
Fig. 4
figure 4

TBV percentiles (cm3) by postmenstrual age in preterm infants with normal 2-year cognitive outcome

A detailed table of mean TBV by PMA and the population 95% confidence interval by PMA and sex is shown on Tables S4 in supplemental material.

Discussion

The study of TBV through serial US during NICU admission of VLBWI has allowed us to identify an early deviated pattern of brain growth related to 2-year neurodevelopmental outcome. While we have previously shown that TBV can be monitored accurately through 3D and 2D US [11] and we have shown that those preterm infants born at lower gestational ages and exposed to a greater number of comorbidities had smaller brain volumes related to moderate/severe findings in term-equivalent MRI, this study adds insights into the usefulness of early brain growth monitoring [12].

We found TBV was independently associated with the 2-year neurodevelopmental outcome, with those having good cognitive outcomes showing greater brain volume and brain growth during early postnatal life. In line with other studies, good cognitive outcome was also associated with GA at birth, being female, and the maternal level of education; being SGA, and having moderate to severe MRI findings had a negative impact on the 2-year cognitive outcome [14,15,16,17,18].

Similarly, TBV evolution during early postnatal life was associated with language scores in our preterm study population. Nevertheless, other variables were more importantly related to language outcome, with TBV losing significance when studied with such variables. Thus, language outcome showed a positive association with GA at birth and maternal level of education, while those children with moderate or severe brain injury had lower language scores at two years of age as shown elsewhere [19,20,21,22,23].

In turn, we found no association of TBV and brain growth during the first postnatal weeks and later motor scores at 2 years. We did find that motor outcome was affected by GA at birth, sex (with better motor scores in females), being SGA, and the presence of moderate to severe brain injury on term-MRI, as also supported by previous evidence in the literature [16, 24,25,26,27].

We have estimated the population mean of TBV per week of PMA and calculated TBV centiles in those preterm with good cognitive outcome to assist clinicians with normative reference values. This could help incorporating routine TBV monitoring during NICU admission.

The relationship between structural alterations on MRI at term corrected age and adverse long-term neurodevelopmental outcome has been extensively studied [28,29,30]. Some studies have related total and regional brain volumes measured on MRI at term to motor, cognitive, language, executive, and behavioral functioning in childhood [31,32,33,34,35]. Soria et al. [36] found reduced regional white and gray matter volumes and decreased intellectual functioning in their cohort of low-risk preterm newborns. Arhan et al. [37], similarly, studied regional brain volumes in low-risk preterm infants, identifying smaller volumes than their term controls, with these smaller regional volumes being associated with worse cognitive scores. Bolk et al. [38] found a positive association of volumes in specific brain areas, fine motor skills and visuomotor integration. Kelly et al. [39] found a relationship between white and gray matter volumes with cognitive and language outcomes, with no differences found in motor or behavioral scores.

Few studies have explored brain volumes earlier than term corrected age and through US, as we have seen, volume segmentations have been mostly performed at term equivalent MRI. In recent years, some authors have been interested in investigating early brain volume and their possible association with short- and long-term neurological prognosis. Graça et al. [40] studied a cohort of 128 infants (72 very preterm infants at term equivalent age and 56 term infants during their first postnatal week) in which they estimated brain volumes from intracranial diameters measured on brain US at term, finding that, even in the absence of structural brain damage or major cerebral lesions, preterm infants had smaller brain volumes than term infants. Similarly, they found that smaller brain volumes at term were associated with lower GA at birth, lower birth weight and being SGA. While they were one of the first research groups to study brain volume from ultrasound images, they did not perform early brain volume estimation, nor did they recruit a longitudinal cohort as these preterm infants were assessed after term corrected age. Moreover, in contrast to our previous report [11], their model was not validated in relation to manual segmentation or MRI based TBV estimation. Simsek et al. [41] developed a similar model of estimating brain volume from intracranial diameters measured on ultrasound images based on an ellipsoid, studying brain volume longitudinally in a cohort of 121 preterm infants from the first postnatal days until 34 weeks of corrected age. Subsequently, they related lower brain volumes to poorer neurodevelopment outcomes assessed at two years of age. This was one of the first studies to investigate the relationship between brain volumes measured by US and neurodevelopment in VLBW preterm infants, although it was also based on indirect measurements of brain volume. Furthermore, Cuzzilla et al. [42] evaluated brain growth using sequential cUS regional linear measures from birth to term-equivalent age in a cohort of 139 infants born at < 30 weeks and related it to cognitive, language, and motor outcome at two years of age. They found a positive relationship between the growth of the corpus callosum, cerebellum, and vermis with cognitive and language scores; in contrast, no relationship was demonstrated between tissue measurements and motor scores. This study, unlike the previous ones, did not estimate total brain volume, but instead assessed brain growth through a series of multiple linear measurements at different levels of brain tissue, directly studying the relationship of these isolated measurements with the prognosis at two years. Our study provides new insights into the study of brain growth, thanks to the use of 3D ultrasound, directly assessing total brain volume sequentially from the time of birth, and subsequently relating it to long-term neurological prognosis.

Our study suggests that we can identify an early deviation of the trajectory of brain growth in those preterm infants who will have worse cognitive scores in the long term. We have established reference values that would enable the clinician to identify preterm infants who, despite not necessarily showing brain injury, have an altered brain growth pattern and, therefore, are at a higher risk of presenting adverse neurodevelopmental outcomes.

This study has some limitations that should be acknowledged. Firstly, we had a small number of patients who had adverse neurological outcomes, which has limited us from more robust statistical analysis. Regional volumes may further explain the relationship between early brain growth and neurodevelopmental outcome. We measured TBV and not regional brain volume, which is warranted in future research.

Conclusions

Measurement of smaller TBV by serial ultrasound during the first weeks of life and up to term age is associated with poor cognitive prognosis at two years of age. Using a sequence of ultrasound scans, we can detect a deviation of brain growth in patients who will have worse cognitive outcomes. We propose normal values for TBV that can serve as a reference as part of the overall assessment in NICU incubators.