Introduction

Every year and worldwide, more than 10% of babies are born preterm (PT; i.e. before 37 weeks gestation) (WHO, 2018). Concomitantly, the increased survival rate of preterm born infants also results in greater prevalence of associated sequelae that can be present in infancy or manifest later during childhood or adolescence (Maitre et al. 2014, 2020). Multiple studies have shown that PT children present more delays in sensory and cognitive development than their full-term (FT) peers (Aarnoudse-Moens et al. 2009; Barre et al. 2011; Bhutta et al. 2002; Dimitrova et al. 2018; Johnson and Marlow 2011; Turpin et al. 2019; Nevalainen et al. 2008). Sensory processing deficits following PT birth can in turn contribute to subsequent cognitive difficulties such as fine motor delays, global cognitive impairment, visual processing issues and atypical language development (Spittle & Treyvaud, 2016; Hövel et al., 2015; Vandormael et al., 2019; Chorna et al., 2014). This cascade might lie at the core of developmental challenges, school readiness and poorer outcomes later on (Ayres and Robbins 2005; Maitre et al. 2017, 2020). Therefore, it is important to characterise any links between low-level sensory and high-level cognitive processing patterns in PT children, as these will likely help to devise appropriate interventions to rehabilitate or reinforce these processes early on.

PT children have a disadvantage when processing sensory information at least in early childhood (e.g. Maitre et al. 2017; Maitre et al. 2020). Standardized tests suggest atypical developmental patterns across sensory modalities, some of which persist into later childhood (Bucci et al. 2015; Jackson et al. 2003; Jongmans et al. 1996; Wickremasinghe et al. 2013). PT infants have increased risk for hearing loss than FT children and they frequently exhibit auditory processing deficits in early childhood (Gallo et al. 2011). These deficits can be attributed to a combination of the extenuated immaturity of the brain at birth of preterm infants as well as their acoustically atypical environments (Wachman & Lahav 2011; Blackburn 1998). This may make them less able to fully benefit from postnatal experience during the first 4 months of life, such as the presence of speech sounds in the environment to which they may have diminished/absent access depending on their NICU experience (Key et al. 2012).

In the auditory domain, hearing deficits have been reported in PT children at 4–7 years of age (Gallo et al. 2011) and central auditory processing deficits are observed in 7–13 year-old PT children (Amin et al., 2015). Other studies of PT infants (6–12 months of age (corrected age) showed impaired perceptual narrowing (i.e. preferential processing of sounds of their native language) (Jansson-Verkasalo et al. 2003, 2010), suggestive of impaired or dysfunctional semantic representations of the acoustic environment. Auditory processing differences between PT and FT infants and young children have also been documented using auditory evoked potentials (AEPs). These AEP differences manifest not only in terms of amplitude and latency of responses, but also in terms of AEP topography (Mahmoudzadeh et al., 2018; Chorna et al. 2018; Key et al. 2012; Bisiacchi et al., 2009; Fellman et al., 2004; Maitre et al. 2014). Such topographic modulations are forcibly due to differences in the active brain networks. In the case of PT versus FT children, then, different circuits may be recruited to solve the same perceptual task. AEPs are highly sensitive to differences in sensory processing capacity and can even assess small effects of sensory interventions during infancy (Chorna et al. 2018). Most previous AEP studies investigated responses to speech stimuli (Chorna et al. 2018; Jansson-Verkasalo et al. 2010; Key et al. 2012; Paquette et al. 2015) and/or focused on paradigms eliciting a mismatch negativity (MMN) response. Some evidence would suggest that while speech processing is particularly impaired in PT infants, processing of non-speech sounds (i.e. stimuli based on the 2nd and 3rd formants of the speech stimuli) remains unimpaired (Paquette et al. 2015). Smaller auditory MMN amplitude was observed in 4-year-old children that had been born preterm and with very low birthweight (Jansson-Verkasalo et al., 2003). A similar effect was observed in an auditory distraction paradigm in children at age of 5 years (Mikkola et al. 2010). However, paradigms used in different studies investigating auditory processing in PT children and adolescents have produced inconsistent results. For example, Gomot et al. 2007 showed a normal MMN in 9-year old PT children, albeit reduced amplitude of the later N250 component. These authors hypothesized that this N250 difference may affect development of high-order processes such as language acquisition. Another MMN study on 10-year-old PT children showed shorter N100 latencies and larger P2 amplitudes, suggesting a more stimulus-driven response mode (Lindgren et al. 2000). In contrast, Lavoie et al. (1997), showed normal N100 and P2 responses in 5-year-old PT children, but decreased amplitudes of P3a to rare tones. More recently, P300 differences between PT and FT 8–10 year-old children were observed in terms of latency, amplitude and morphology (Durante et al., 2018). More generally, differences in semantic language skills between FT and PT born children have been observed that persist in adolescence (16 years old) (Mullen et al., 2011), albeit diminishing with age (Saavalainen et al., 2006).

The above evidence indicates that auditory processing in PT infants and children differs from that of their FT peers and that such deficits might extend to higher-order auditory processes. Until now, the overwhelming majority of research has focused on differences in the sensory and semantic processing of speech stimuli. By contrast, limited research has considered to what extent auditory semantic representations of non-speech stimuli – i.e. sounds of environmental objects – are comparable between PT and FT children. Discrimination and categorization of auditory objects is fundamental in everyday life, as it englobes everyday interactions that we have with the world (Brefczynski-Lewis & Lewis, 2017). In fact, semantic processing of environmental visual objects has been shown to differ between PT and FT children. More specifically, differences have been documented in the processing of animal and tool stimuli between PT and FT adolescents between 13 and 15 years (Klaver et al. 2015). Thus, there is reason to hypothesize that semantic representations in the auditory modality are also impacted by preterm birth.

The aim of the present study is to clarify this and investigate the likely presence of low-level auditory impairments and higher-level cognitive differences in auditory object processing. To this end, 10 year-old children born preterm performed an auditory oddball task involving the discrimination of sounds of living and sounds of manmade objects. This is a well-established task (e.g. Murray et al. 2006) that allows us to explore the interaction between the lower-level sensory and higher-level semantic processes that result in auditory object recognition. AEPs were concurrently recorded in order to compare the brain responses to living and manmade sounds in terms of response strength and topography. Previous studies in healthy adults employing this living/manmade oddball task have shown that differential processing of living versus manmade sounds starts approximately 70ms post-stimulus in terms of response strengths and 150ms in terms of topography (Murray et al. 2006). Furthermore, in terms of topography, two stages in the processing of sounds of living and manmade objects have been demonstrated: an early stage, behavior independent, around 100ms post-stimulus and a later stage linked to behavior and decision making processes, around 270ms post-stimulus (De Lucia et al. 2012). This is the first study to our knowledge investigating auditory object processing in both PT and FT children.

Methods

Participants

Thirty-two children participated in the study. The children belonged to two groups: a very preterm group and a full-term group. The VPT group was composed of 17 children (9 female, mean age ± SD = 10.42 ± 0.70 years). VPT children in this sample were recruited from a longitudinal clinical cohort study that investigated how neonatal and parental stress affects children’s development. All VPT children were born at < 33 weeks gestational age (GA) (mean GA ± SD = 29.68 weeks ± 1.92) between 2005 and 2007, and were hospitalized in the NICU of the Lausanne University Hospital. The mother was informed about the study after the child’s birth. In the beginning of the study, infants with malformation or chromosomal abnormalities or parents with psychiatric illness, drug abuse or without fluency in French were excluded. The FT group was composed of 15 children (7 female, mean age ± SD = 10.28 ± 0.98 years). Inclusion criteria were being born at the 37th week or later GA and parental absence of psychiatric illness. FT children were recruited using an advertisement posted in the Lausanne University Hospital and a sports club. Children belonging to the FT group participated only in the current study. This sample size of ~ 15 per group is highly comparable to that of prior EEG studies in school-aged PT children (e.g. Koripilahti et al., 2016; Durante et al., 2018). No differences between groups were observed in age, gender proportions, or socio-economic status (SES) score (Table 1). SES was assessed based on an adapted version of the Hollingshead Four Factor Index of Socioeconomic status (Pierrehumbert et al. 1996). The total score combined parents’ education level and work position. Higher score means higher SES. For all participants, parental informed written consent was obtained before testing. The procedure was evaluated and approved by the Vaudois Cantonal Ethics Committee (Ethics approval number:256/14).

Table 1 Demographic data from the preterm (PT) and full-term (FT) groups as well as their statistical comparison

Apparatus and Stimuli

The participants were seated at the center of a sound-attenuated booth (WhisperRoom model 102126E), and acoustic stimuli were delivered over inset earphones (Etymotic model ER-4P; www.etymotic.com). Stimulus intensity was approximately 75dB SPL at the ear (measured via a CESVA audiometer). Auditory stimuli were complex, meaningful sounds (16 bit stereo; 22,500 Hz digitalization) taken from Murray et al. (2006). The sound set was composed of sounds of living objects and sounds of manmade objects (see Table 2). In total, there were 120 different sound files: 60 sound files of living objects (20 different living objects presented in three exemplars) and 60 sound files of manmade objects (20 different manmade objects presented in three exemplars). Each sound was 500ms in duration including an envelope of 50ms decay time that was applied to the end of the sound file. The E-prime software controlled stimulus delivery and recorded the participants’ behavioural performance (www.pstnet.com/eprime).

Table 2 Sound objects used in this study (see fuller details including psychoacoustics in Murray et al. (2006). Note that sounds of humans were all non-speech vocalizations. Note also that sounds of musical instruments entailed a few notes, but not any melody or rhythms. For all sounds there were three acoustically distinct versions used

Procedure

Participants performed an auditory ‘oddball’ detection task with living vs. manmade sounds. Every participant completed four blocks. The ‘target’ category of stimuli to which participants responded occurred 10% of the time for a given block. The remaining 90% of stimuli (‘distracters’) were comprised of the other sound category. Therefore, two out of the four blocks had manmade sounds as targets, and two blocks had living sounds as targets. Every block contained 150 trials with a variable inter-stimuli interval ranging between 1500 and 2000ms. As all sounds had duration of 500ms, each block lasted approximately 5 min. A short instruction in the screen, accompanied by a more detailed verbal explanation of the task by the experimenter, was presented before the start of each block. In addition, examples of living and manmade sounds were presented to children to verify their understanding of the task at the beginning of the experiment. The children’s task was to press as quickly and accurately as possible a button on the response box when the target stimuli appeared. The same response button was used in all blocks. Children were encouraged to take short breaks between blocks in order to minimize fatigue.

EEG Recording and Pre-Processing

Continuous EEG was acquired at 1024 Hz through a 64-channel Biosemi ActiveTwo AD-box (www.biosemi.com), referenced to the common mode sense (CMS, active electrode) and grounded to the driven right leg (DRL; passive electrode), which functions as a feedback loop driving the average potential across the electrode montage to the amplifier zero. Prior to epoching, the EEG was filtered (low-pass 40 Hz; high-pass 0.1 Hz; removed DC; 50 Hz notch; using a second-order Butterworth filter with − 12dB/octave roll-off that was computed linearly in both forward and backward directions to eliminate phase shifts). Peri-stimulus epochs from distracter trials, spanning 100ms pre-stimulus to 500ms post-stimulus onset, were averaged from each subject for each condition to compute AEPs. EEG responses to target stimuli were not analysed as they were too few for sufficient signal quality. Epochs were rejected based on automated artefact rejection criterion of ± 80µV as well visual inspection for eye blinks and movement or other sources of transient noise. The average (± SD) number of accepted EEG epochs for the living sounds condition, for FT children was 159 ± 27 and for VPT children was 158 ± 25. The average number of accepted EEG epochs for the manmade sounds condition, for FT children was 164 ± 27 and for VPT children was 156 ± 21. No differences in these values were observed. Neither significant main effect of Group or Category nor the interaction between the two factors was observed (F(1,30) = 0.26, p = 0.69; F(1,30) = 0.13, p = 0.78; and F(1,30) = 0.99, p = 0.48, respectively). Bad channels were identified before averaging and excluded from the artefact rejection. These data at artefact electrodes from each participant were interpolated using 3-D splines prior to group averaging (Perrin et al. 1987). The average number of interpolated channels was 4.8 ± 1.3. In addition, data were baseline corrected using the 100ms pre-stimulus period and recalculated against the average reference.

AEP Analyses

Differences in the processing of AEPs to living and manmade sounds between FT and VPT children were assessed using a multi-step analysis procedure, referred to as electrical neuroimaging, which involves both local and global measures of the electric field on the scalp. These methods have been described in detail previously (Koenig et al. 2011, 2014; Michel and Murray 2012; Michel et al. 2004; Murray et al. 2008; Tzovara et al. 2012).

First, we analysed the AEP voltage waveform data from each scalp electrode as a function of time using a two-way mixed-model repeated measures analysis of variance (rmANOVA) with the between-subject factor of children group (2 levels: FT vs. PT) and the within-subject factor of type of semantic Category (2 levels: Living vs. Manmade). For this analysis we used an average reference as well as a temporal criterion for the detection of statistically significant effects (> 10ms continuously at 1024 Hz sampling rate) in order to correct for temporal auto-correlation at individual electrodes (Guthrie and Buchwald 1991). Similarly, a spatial criterion (effects were considered statistically significant only if they entailed > 10% of the electrodes of the 64-channel montage at a given latency) was applied in order to address spatial correlation.

Second, a topographic cluster analysis based on a hierarchical clustering algorithm was performed on the post-stimulus group-average AEPs across experimental conditions (Murray et al. 2008). This clustering (“segmentation”) identifies stable electric field topographies (“template maps”). The clustering is insensitive to pure amplitude modulations across conditions as the data are first normalised by their instantaneous GFP. The optimal number of template maps that explained the whole group-averaged data set was determined using a modified Krzanowski-Lai criterion (Murray et al. 2008). The clustering makes no assumption regarding the orthogonality of the derived template maps (De Lucia et al. 2010; Koenig et al. 2014; Pourtois et al. 2008). The pattern of maps that was identified in the group-average AEPs was then submitted to a fitting procedure wherein each time point of each individual participant’s ERP is labelled according to the template map with which it best correlated spatially (Murray et al. 2008). This yielded a measure of relative map presence over a fixed time window (in milliseconds) that was then submitted to a mixed-model rmANOVA with factors of Group, Map, and Category. This fitting procedure revealed whether a given experimental condition for a given group was more often described by one map versus another, and therefore whether different intracranial generator configurations better accounted for particular experimental conditions or groups. That is, biophysical laws indicate that a change in the topography of the electric field at the scalp surface forcibly originate from changes in the configuration of the underlying intracranial sources (reviewed in Michel and Murray 2012).

Subsequently, changes in the strength of the electric field at the scalp as a function of semantic category and group of children were assessed using global field power (GFP) for each participant and condition (Lehmann 1987). GFP is equivalent to the standard deviation of the voltage potential values across the electrode montage (Lehmann and Skrandies 1980). A stronger GFP value is indicative of greater and/or more synchronised brain activity, though the root cause (increased neural firing rate, increased numbers of active neurons, etc.) cannot be unequivocally asserted based on this measure alone. However, a modulation of GFP in the absence of reliable evidence for topographic modulations can most parsimoniously be interpreted as a modulation in the strength of responses originating from statistically indistinguishable sources or set of sources. GFP data were analysed using area measures. These measures were calculated (vs. the 0µV baseline) using time periods of stable scalp topography that were defined in the previous analysis steps and were statistically tested using rmANOVAs to investigate the effects of Group and Category. Next, we assessed whether differences between semantic categories in GFP area values over an identified time period could reliably classify participants according to their group assignment (PT vs. FT). To do this we conducted a receiver operating characteristic (ROC) and area under the curve (AUC) analysis. ROC analyses were performed in SPSS.

Finally, we estimated the underlying intracranial sources of the AEPs in response to the two different types of sound in the two groups of children using a distributed linear inverse solution (minimum norm) combined with the LAURA (local autoregressive average) regularisation approach (Grace de Peralta Menendez et al. 2001, 2004; see also Michel et al. 2004 for a comparison of inverse solution methods). The solution space was calculated on a realistic head model that included 3000 nodes, selected from a grid equally distributed within the grey matter of the Montreal Neurological Institute’s average brain (available from https://sites.google.com/site/cartoolcommunity/downloads). The head model and lead field matrix were generated with the Spherical Model with Anatomical Constraints (SMAC; Spinelli et al. 2000 as implemented in Cartool version 4.10 (Brunet et al. 2011), using a 4-shell model (skull, scalp, cerebral spinal fluid, and brain) as well as with an upper skull thickness (5.1 mm) and mean conductivity (0.027 S/m) values based on the mean age of our sample. As an output, LAURA provides current density measures; their scalar values were evaluated at each node. Statistical analysis of source estimations was performed by first averaging the AEPs across time for each participant and condition over the first and second time windows emerging from the analyses described above. Specifically, during the first time window there was a main effect of Group, and therefore the source modelling was performed after first collapsing the AEP data across factor of Category. For the second time window there was a reliable Category x Group interaction in the ERP analysis and thus a rmANOVA was performed on the source modelled data. The statistical significance criterion at an individual solution point was set at p < 0.0.05. Only clusters with at least 10 contiguous significant nodes were considered reliable in an effort to correct for multiple comparisons and was based on randomization thresholds (see also De Lucia et al. 2012; Knebel and Murray 2012; Retsa et al. 2018; Retsa et al. 2020 for similar implementations).

Results

Behavioural Results

Children in both FT and VPT groups performed the task accurately with no significant main effects of Group (F(1,30) = 0.49; p = 0.492; η2 = 0.144) or Category (F(1,30) = 3.69; p = 0.064; η2 = 0.461), nor an interaction between factors (F(1,30) = 0.844; p = 0.366; η2 = 0.144; see Table 3). The results of the mixed rmANOVA on the reaction times showed no main effect of Group (F(1,30) = 0.46; p = 0.500; η2 = 0.101) nor an interaction between Group and Category (F(1,300) = 3.81; p = 0.060; η2 = 0.472 (though a non-significant trend). However, a main effect of Category was observed (F(1,30) = 46.38; p < 0.002; η2 = 1.0). Overall RTs to manmade objects were faster than RTs to living objects (917ms vs. 1032ms). This difference in RTs was observed for both FT (t(14) = 5.02, p < 0.001) and PT children (t(16) = 4.5, p < 0.001) and is consistent with prior observations in healthy adults (Bergerbest et al. 2004; Murray et al. 2006; Saygın et al. 2003).

Table 3 Group-averaged accuracy and reaction time for each group and each semantic category. Asterisks indicate significant differences between groups

AEP Results

Group-averaged AEPs from FT and VPT children in response to sounds of living and manmade objects from an exemplar midline electrode (FCz) are displayed in Fig. 1. The results of the two-way mixed-model ANOVA across the full electrode montage as a function of time showed significant main effects of both children’s Group and sound Category, starting as early as 50ms post-stimulus onset (see Fig. 1). The main effect of Group followed from the generally larger magnitude of responses in the FT group than VPT group. There was a general difference between responses to Category starting at 50ms post-stimulus. In addition, there was a significant interaction between Group and Category ~ 140-250ms post-stimulus.

Fig. 1
figure 1

A. Group-averaged AEPs at an exemplar fronto-central midline electrode, shown separately for each group and sound-type

A topographic clustering analysis was then conducted over the full 500ms post-stimulus time period in order to identify time intervals of stable electric field distributions at the scalp and to determine whether the above response differences between conditions followed from single or multiple topographic configuration changes. This analysis provides a set of so-called “template maps”. Eight different template maps (shown in Fig. 2) accounted for the collective group-averaged dataset with a global explained variance of 95.5% across the cumulative group-averaged data. During the first interval (36-107ms), two template maps were identified, during a second interval (108-224ms) four maps were identified, and during a third time interval (225-400ms) three template maps were identified. Across the different time intervals shown, the same set of topographies was observed for both living and manmade sounds. That is, the same maps appeared to represent the responses to living sounds and manmade sounds during all the time periods. In contrast, across all the three time intervals (36–107, 108–224, 225-400ms), different maps appeared to predominate the responses of the FT group and the responses of the VPT group. This pattern observed in the group-averaged AEPs was statistically assessed in the single-subject data using the spatial correlation based fitting procedure. This was done separately for each of the three different time periods and their respective template maps. The values of the fitting procedure were then submitted to a mixed rmANOVA (one for each time period) using Group, Category and template Map as factors (see Fig. 2 for the number and topography of template maps identified over each time period).

Fig. 2
figure 2

The topographic pattern analysis identified eight stable topographies (template maps) for all the conditions around 500ms post-stimulus period. The time period when each map was observed is indicated. At the group-average level, two different template maps were identified over the (1) 36-107ms post-stimulus period, four different template maps over the (2) 108-224ms period and three maps over the (3) 225-400ms post-stimulus period. Maps are shown with the front upwards and left hemiscalp on the left. The bar graphs on the bottom show the results of the single-subject fitting procedure, indicated the group-averaged duration each template map was ascribed to each of the two groups. All the three mixed ANOVAs (one for each time period) on these duration values revealed an interaction between template map and group. Post-hoc comparisons in the all the three time periods demonstrated the maps that are predominant for each group

For the first time period (TW1 = 36-107ms), there was a significant interaction between Group and Map (F(1,30) = 8, p < 0.001, ηp2 = 0 0.211). That is, one map predominated the responses of the FT group and the second map predominated the responses of the VPT group (see Fig. 2).Also, over the second time window (TW2 = 108-224ms), there was a significant interaction between Group and Map (F(3, 90) = 5.8, p < 0.001, ηp2 = 0 0.162), indicating that different template maps predominated responses in each of the two groups. As there was no interaction between Category and maps nor a three-way interaction, we averaged the data across responses to living and manmade sounds for each map in order to test which maps predominantly represent the FT and the VPT groups. The results of this analysis showed that one template map predominated the FT group and another map the VPT group (see Fig. 2). Over the third time period (TW3 = 225-400ms), there was again an interaction between Group and Map (F(2,60) = 6.64, p < 0.01,, ηp2 = 0 0.181). Similarly to the second time window, the data from living and manmade semantic categories were collapsed in order to investigate the contribution of the three template maps on the two groups. Two template maps found to predominate responses from the FT group, whereas the third map was found to predominate responses from the VPT group (see Fig. 2).

Prior data from adults indicates that semantic processing of sounds of objects involves a latency difference expressed as the difference in the duration over time when a given map offset (i.e. the last time point when a template map best correlated spatially with AEPs from each semantic category) (Murray et al. 2006). We therefore performed a similar analysis here using the last offset of template maps over the 108-224ms post-stimulus period. We observed a significant three-way interaction between Group, Category and Map (F(3,87) = 9.65, p < 0.001, ηp2 = 0 0.286). Post-hoc comparisons showed that there was an effect of Category only in the FT group. Specifically, the predominant map for FT in this time window was found to end significantly later in response to living sounds (204ms±0.76(SE) compared to manmade sounds (182ms±3.7(SE).

Based on the results of the topographic clustering analysis, we also defined the time windows for the analysis of GFP waveforms (Fig. 3A). Specifically, we investigated the main effects of Group and Category as well as their interaction on GFP area for each of the three identified time windows. Neither the main effect of Group nor Category was found to be significant in any of the three time windows. The Group × Category interaction was significant only for TW2 (108-224ms): F(1,30) = 11.4, p < 0.01, ηp2 = 0 0.275. Post-hoc t-tests indicated that in the FT group children responded significantly stronger to sounds of living objects (2.87µV) compared to sounds of manmade objects (2.4µV): t(14) = 2.83, p < 0.05. In contrast, in the VPT group, there was a trend for the children to respond stronger to the sounds of manmade objects (2.85µV) than to the sounds of living objects (2.55µV): t(16) = 1.992, p = 0.055. (Fig. 3B).

Fig. 3
figure 3

A. Global field power (GFP) waveforms from each group and sound-type. B. Mean GFP values shown separately for each group and type of sound object in the three identified time periods during the topographic analysis. There is an interaction between group and type of object only in TW2. Asterisks indicate significant differences between types of objects within a group. C. ROC curve with AUC score for the GFP differences between living and manmade objects during TW2.

Next, ROC analysis was performed, using the area under the curve (AUC) versus a null hypothesis of chance classification, to determine if VPT and FT children could be reliably classified based on the GFP area measures over TW2, which also exhibited a significant Group x Category interaction (i.e. when we observed differences between the processing of living and manmade objects as a function of prematurity). The ROC analysis showed statistically reliable classification of VPT from FT children, using the differences between the GFP of responses to living and manmade objects, with an AUC of 0.788 (p < 0.001 based on the null hypothesis of an AUC of 0.5; see Fig. 3C). In other words, differences in brain responses to semantic categories were sufficient to reliably classify schoolchildren as having been born preterm or full-term. For completion, we also assessed whether reaction time differences between semantic categories could also reliably classify schoolchildren according to whether they had been born preterm or full-term. This was indeed the case (AUC = 0.706; p = 0.036). By contrast, there was no evidence that the abovementioned GFP difference and reaction time differences were themselves correlated (r(30) = 0.062; p = 0.74).

Finally, LAURA-distributed source estimations were calculated separately over TW1 (36-107ms) and TW2 (108-224ms). For this purpose, AEPs for each participant and experimental condition were averaged over the periods 36-107ms (TW1) and 108-224ms (TW2). For TW1, only overall differences between the two groups of children were investigated. This t-test showed significant differences between FT and PT children in the estimated source activity within the left inferior lingual gyrus (local maximum: -4,-80,-4 mm (BA18)), the left superior temporal gyrus (local maximum: -57, -19, 4 mm (BA41)), the left dorsolateral prefrontal cortex (local maximum: -50, 27, 19 mm (BA46)) and the right anterior prefrontal cortex (local maximum: 11, 57, -11 mm (BA10) (Fig. 4). Stronger activity in PT children was observed only in the occipital cortex cluster. In the other three clusters, FT children exhibited stronger activity.

Fig. 4
figure 4

LAURA source estimations over the TW1 (36-107ms). Mean source activations are shown for Full-term and Preterm children independently of semantic group. Significant differences between the two groups of children were observed in four main areas. Stronger activation in the left inferior occipital cortex is observed in PT children, whereas stronger activation in the left primary auditory cortex, left dorsolateral prefrontal cortex and right anterior prefrontal cortex was observed in Full-term children

For TW2, differences between categories and groups were investigated. An rmANOVA was performed and it revealed a significant interaction between Group and Category on the estimated source activity. Differential source activity was localised to clusters within the right inferior occipital cortex (lingual gyrus, local maximum: 25, -72, -2 mm (BA 19)) and the right dorsolateral prefrontal cortex (local maximum: 19, 42, 31 mm (BA 9)) (Fig. 5). Greater differences in the activity between living and manmade objects in the right occipital cortex were observed in PT children, whereas greater differences in the activity between living and manmade objects were observed in FT children in the right prefrontal cortex.

Fig. 5
figure 5

LAURA source estimations over the TW2 (108-224ms). Mean source activations are shown for Living and Manmade objects in Full-term and Preterm children. A significant Group × Condition interaction is observed in two main areas. Greater activation in the right inferior occipital cortex is observed in Preterm children, whereas greater activation in the right superior frontal gyrus is observed in Full-term children

Discussion

We investigated the long-term effects of preterm birth on sensory and semantic auditory object processing. An electrical neuroimaging analysis framework applied to the recorded AEPs allowed us to differentiate between effects due to modulations in response strength and effects due to modulations in response topography. We observed general topographic differences in auditory responses between groups as early as 50ms post-stimulus onset, indicative of altered sensory responsiveness and more specifically the engagement of at least partially distinct brain networks by each group in the processing of sounds. Source reconstructions over the 36-107ms post-stimulus period identified differential patterns of activation between FT and VPT children within the primary auditory cortex and prefrontal cortices as well as the occipital cortex. Over the 108-224ms post-stimulus period, we also observed different patterns of semantic discrimination as a function of preterm status. Full-term children exhibited stronger responses to sounds of living than manmade objects, whereas preterm children exhibited the reverse. This index of semantic discrimination accurately classified a child’s preterm status. Finally, source reconstructions over the 108-224ms post-stimulus period identified interactions between Group and Category within brain circuits also typically implicated in visual object discrimination, consistent with the notion of multisensory, if not altogether amodal and multisensory object representations whose functional organization is impacted in a long-term manner by preterm birth. Collectively, our results indicate there to be persistent differences in sensory and semantic processing in healthy preterm-born children.

We provide evidence that preterm birth can entail longstanding and persist effects (at least) until 10 years of age. We also provide some of the first evidence that differences in auditory information processing extend beyond linguistic stimuli to include the semantic analysis of sounds of environmental objects. These sequelae are undoubtedly the consequence, at least indirectly, of both the atypical auditory sensory experiences of the NICU, which are noisy, together with the immaturely developed sensory systems of the PT born infant (Vitale et al. 2021; Carvalhais et al. 2017; Pineda et al. 2017). Another set of contributing factors likely includes the family environment and in particular the reactions of the parents to the preterm birth, including their stress and consequent manner of interacting with their child (e.g. Muller-Nix et al., 2004; Bozkurt et al., 2017; Turpin et al. 2019).

Several ERP studies in infants have reported differential auditory processing in PT versus FT infants (e.g. de Regnier, 2008; Key et al., 2012; Maitre et al. 2014; Bisicacchi et al., 2009; Fellman et al., 2004). Most of these have shown impaired processing of speech stimuli (e.g. Maitre et al., 2014), including speech-elicited mismatch negativity (MMN) responses that are either absent (e.g. Fellman et al., 2004) or followed by a decreased N200 component (Bisiacchi et al., 2009). In particular, one study demonstrated impaired processing of complex consonant sounds both in terms of AEP amplitude and also topography, indicative of differences in the underlying neural processes for speech sounds between FT and PT infants (Key et al. 2012), starting mainly from 400ms post-stimulus. By contrast, vowel discrimination was found unimpaired (Key et al. 2012). These differences in the processing of consonant sounds between FT and PT infants were interpreted as an additional requirement of the engagement of attentional processes in the case of PT infants whose sensory system is more immature and struggles more with the accurate discrimination of more complex sounds. It should be noted that early auditory responses (before 150ms post-stimulus) were not investigated.

Most of the studies investigating auditory processing in PT children have focused on infancy and early childhood. Very few studies have investigated the long-term effects of preterm birth on sound processing. Two studies indicated that the auditory processing in 9-year-old PT children differs from that of their full-term peers (Gomot et al. 2007; Korpilahti et al. 2016). Gomot et al. observed an unimpaired MMN in the 9-year-old PT children, but a reduced N250 instead. They interpreted this reduction as indicative of dysfunction in auditory processing as a result of delayed brain maturation in PT infants. Korpilahti et al., observed impaired N200 and N400 in the PT group in response to auditory words and pseudowords. Additionally, these differences in N200 in PT children correlated with auditory attention deficits. More generally, these differences were interpreted to reflect impaired lexical access in PT children. The present data are consistent with this interpretation and extend such beyond lexical access to instead reflect semantic analysis of acoustic signals.

Our results showed that despite the absence of behavioural performance differences between FT and VPT children, there are striking neurophysiological differences in auditory processing between the two groups. These sensory processing differences were identified starting as early as 50ms post-stimulus onset, independently of the semantic category of the sound object being heard. VPT children had generally smaller amplitude of AEP responses and exhibited differences in terms of AEP topography. Topographic differences between FT and VPT children were identified throughout the post-stimulus period. That is, different maps – different cortical networks- represented the responses of VPT children compared to the responses of FT children. VPT born children, even 10 years after birth, exhibit fundamental differences in the way that they process sounds.

This is the earliest post-stimulus difference in terms of auditory processing that has been identified so far in PT born schoolchildren. Previous studies have identified later differences starting around 200ms post-stimulus (Korpilahti et al. 2016; Key et al. 2012; Gomot et al. 2007). However, these prior studies used different analysis frameworks (i.e. only voltage waveform analyses in contrast to our electrical neuroimaging analyses) and stimuli and did not focus on early sensory processing. Source modelling of the early AEP differences identified group-wise effects within a network of temporal, occipital and prefrontal regions. Activations within the superior temporal gyrus and prefrontal cortices have been previously shown to be linked to auditory processing, sound discrimination and category discrimination (Husain & Horwitz, 2006). Specifically, activations within superior temporal gyrus and bilateral prefrontal cortices have been observed in a MEG study investigating the processing of natural and manmade sounds (Salvari et al. 2019). Activations within the superior temporal gyrus and prefrontal cortices network were mainly found in FT children compared to VPT children suggesting long-term effects of preterm birth in these regions. In contrast, significantly stronger activations within left inferior occipital cortices were observed in VPT children comparted to FT children. In agreement, a recent fMRI study has indeed shown ventral occipital cortex activation associated with sounds of objects (as well as images of objects), suggesting that this region can be activated in object processing independently of modality Mattioni et al. 2020; Amedi et al. 2005). Interestingly in our study this occipital activation is observed quite early (already at the first time window(36-107ms post-stimulus) and is more evident in VPT than FT children. One possibility, therefore, is that stronger occipital activity in VPT children could reflect a compensatory mechanism where the auditory processing related areas seem to respond more weakly.

Aside from sensory processing differences, we also observed general, group-independent differences in semantic processing. Semantic discrimination of living versus manmade objects was observed starting as early as 50ms post-stimulus, which is comparable to the findings observed in adults (Murray et al. 2006; De Lucia et al. 2010). Additionally, we observed a subsequent Group × Category interaction, indicating differential processing of living and manmade sounds between FT and VPT children. More specifically, FT children exhibited larger amplitudes of GFP in response to living objects, whereas VPT children exhibited larger amplitude of responses to manmade objects in a time window approximately between 100-200ms post-stimulus (Fig. 3). It is interesting to note that the differential pattern we observed here in VPT resembles what has been previously reported in adults (Murray et al. 2006). This suggests that in the typical developmental trajectory, responses are initially stronger in response to living objects and at some point, that reverses. Indeed, FT infants show a preference for voices compared to other sounds, whereas the same is not observed in PT infants (Therien et al. 2004). It would be interesting to identify when (and why) this switch happens as well as what is driving this earlier shift in PT children. One speculative possibility is that PT children have stronger responses to manmade objects in part because of their early life exposure to sounds and noises of machines during their stay in NICU (Carvalhais et al. 2017; Therien et al. 2004). A corollary of this speculation would be that such experiences result in augmented representations, perhaps via statistical learning or similar mechanisms, of the kinds of objects to which the preterm infant is exposed; notably sounds of manmade objects. In this regard, it is noteworthy that we could reliably classify a child’s preterm status based on their differential responses to semantic categories (see Fig. 3C). It will likely be informative for future research to ascertain to what extent this may also manifest at earlier ages and effectively indicate the exposure of the child and potential efficacy of any remediation through exposure or experience with more typical sensory environments (see e.g. Chorna et al. 2014 for the example of suckling-driven auditory feedback of the infant’s mother’s voice).

There was no evidence for category-wise topographic differences between the processing of living and manmade sounds in either FT or VPT children. Previous data from healthy adults have identified different AEP topographies that characterize responses to sounds of living and manmade objects during the 155-257ms post-stimulus window (Murray et al. 2006). In this regard, and in contrast to the abovementioned GFP results, the topographic patterns in children remain (somewhat) immature. More specifically, whereas distinct brain networks contribute to adults’ semantic processing of environmental sounds, strength modulations within a statistically indistinguishable brain network account for children’s semantic processing. Nonetheless, we did also observe temporal offset modulations in the pattern of AEP topographies only in FT children, which resembles the temporal pattern observed in adults (Murray et al. 2006). Specifically, the predominant AEP topography over the 108-224ms post-stimulus time window in FT children exhibited a protracted offset in response to living versus manmade sounds of objects. No such temporal difference was observed for VPT children.

With regard to the implicated brain circuits in semantic processing of environmental sounds, we observed a common network of brain areas responding to both categories of sounds and both groups of children. This network included regions of the right temporal cortex, the right dorsolateral prefrontal cortex and the right inferior occipital cortex that are active over the 108-224ms post-stimulus window (see Fig. 5). Source activity within the right inferior occipital cortex exhibited larger differences between living and manmade objects in VPT children than in FT children. In contrast, within the right dorsolateral prefrontal cortex, we observed significant differences in source strength between living and manmade objects for FT children only. The differences between living and manmade objects within the identified frontal regions (as well as the temporal one) are consistent with previous studies in healthy adults (Murray et al. 2006; Husain & Horwitz, 2006; Lewis et al. 2005; Lewis et al. 2004; Salvari et al. 2019). Of particular note is the involvement of visual cortices in the representation of sounds of auditory objects, particularly in VPT children, which was also observed over both the 108-224ms as well as the earlier 36-107ms post-stimulus time window (see Figs. 4 and 5). However, studies have shown that the ventral occipital-temporal cortex (VOTC) is involved in the representation of different categories of auditory objects in a similar way to their visual counterparts (Mattioni et al. 2020; Amedi et al. 2005). In the case of visual cortical involvement in the auditory object recognition, several propositions have been advanced that are not mutually exclusive. On the one hand, it has been proposed that object representations in visual cortices are multisensory in nature (Murray et al. 2004, 2005; Mahon et al. 2009). On the other hand, it has been proposed that auditory object recognition may involve mental imagery in terms of visualizing the referent object (Lewis et al. 2005). Which specific processes or recognition strategies are operating in FT and PT children will require additional research, but may benefit from analytical approaches such as representational similarity analysis (e.g. Tovar et al. 2020).

Employing our paradigm and studying auditory object processing in both younger PT children and teens as well as adults would be critical for understanding the developmental trajectory of auditory object processing and how it is affected by preterm birth. The identification of long-lasting effects of preterm birth in early auditory processing and auditory object processing is a first step in understanding the neurophysiological mechanisms underlying the difficulties that PT children encounter. Here we demonstrated important differences in early auditory processing in general as well as differences in processing of different semantic categories between FT and VPT 10-year-old children. An optimal sensory system is essential in order to process information from the surroundings and to have a favourable cognitive development. The next step is to link the sensory and semantic processing differences in PT children we identified here to measures of cognitive performance and school success. Korpilahti et al. 2016 for example showed differential processing of words and pseudowords only in 9 year old PT children and not FT children; this distinct pattern of processing in PT children was furthermore linked to lower cognitive performance. To the extent that similar associations can be assayed with non-linguistic stimuli, like those of the present study, will not only allow for studying children from a more diverse range of backgrounds but also studying children at earlier neurodevelopmental stages.

The investigation of long-term impacts of preterm birth on developmental outcomes is essential, considering the increased number of preterm birth survivors and the associated risks and difficulties for preterm born infants that often persist to childhood and affect school performance (Maitre et al. 2014, 2020; Turpin et al. 2019). Therefore, the identification of the potential difficulties combined with the development of strategies in order to tackle them as early as possible is crucial. For example, recent studies have shown that preterm birth negatively affects school readiness as it increases the risk for various cognitive and sensory abilities difficulties (Taylor et al., 2022; Maitre et al. 2020). PT children have lower IQ scores compared to FT children (albeit in the normal range), and this persists even when tested in later childhood (Kim et al., 2021) or early adolescence (Turpin et al. 2019). PT children often exhibit school difficulties and lower academic achievement levels. Preterm born children present more cognitive, emotional and behavioral difficulties (e.g. anxiety, inattention, and ADHD) and are at a higher risk for internalizing and externalizing behaviors, including during early childhood, compared to their full-term born peers (Taylor et al., 2022; Maitre et al. 2020; Aarnoudse-Moens et al. 2009; Barre et al. 2011). Preterm birth is also considered a high risk factor for subsequent development of neuropsychiatric disorders such as ADHD, inattention, anxiety, autistic spectrum disorders as well as increased risk for psychosis, depression and bipolar disorder (Nosarti et al., 2012; Johnson and Marlow 2011).