Several theories of adolescent development posit that adolescence may be a time of heightened sensitivity to emotionally salient events (Casey, 2015; Somerville et al., 2010; Steinberg et al., 2008). Indeed, adolescents tend to report greater sensitivity to threat—one type of emotionally salient event—compared with children (O’Brien & Bierman, 1988; Vervoort et al., 2010). In contrast, some work suggests that adolescents may have lower threat sensitivity compared with children (Humphreys et al., 2016; McCormick & Telzer, 2017). See Ernst et al.’s (2006) Triadic Model, which posits that adolescents have a strong reward system but a weak harm-avoidant system. Thus, the relationship between threat sensitivity and development remains unclear. Of concern, heightened sensitivity to threats has been found to be associated with anxiety (Balle et al., 2013; Bar-Haim et al., 2007; Johnson et al., 2003; Katz et al., 2020; Pérez-Edgar et al., 2010, 2011; Vervoort et al., 2010); therefore, it is critical to advance our understanding of threat sensitivity among youth.

There are a variety of different situations, however, that youth may find threatening, such as receiving negative feedback, making mistakes, and seeing angry faces. In survey research, these threatening situations are generally combined into one overall measure of threat sensitivity [e.g., Behavioral Inhibition Scale (BIS; Carver & White, 1994); Sensitivity to Punishment Scale (Torrubia et al., 2001)]. For example, the BIS includes questions assessing responsiveness to negative feedback (e.g., “Criticism or scolding hurts me quite a bit”), making mistakes (e.g., “I worry about making mistakes”), and worrying about whether someone is angry at you (“I feel pretty worried or upset when I think or know somebody is angry at me”). Thus, in survey research, the assumption is that people who are high on sensitivity to threat in one situation also tend to be high in other situations. Within the neuroscience literature, however, different threatening situations are treated as distinct events, each being investigated in isolation from each other. For example, a task where someone receives negative feedback about their performance (e.g., a gambling task) is not compared to a task where someone makes mistakes (e.g., during a go/no-go task); yet both of these situations are captured within the same self-report survey. Research has yet to investigate (1) individual differences in the consistency of ERP activation across different threat-related events (e.g., Do some participants have consistently high neural activation across different threat-related events, while others show heightened neural activation to only one or two events?) and (2) what demographic and self-report factors are associated with individual differences in neural activation across different tasks (e.g., do adolescents have more consistently high neural activation to threats than children? Do individuals who self-report greater threat sensitivity have consistently higher activation across these tasks?).

ERPs to threatening situations

One way to investigate how an individual reacts to different types of threats is to consider their neural activation directly after a threatening event occurs. Event-related potentials (ERPs: an averaged EEG response that is time-locked to an event; Luck, 2005) can provide a sensitive measure of neural activation directly after a threatening event happens (e.g., after an individual receives negative feedback). We have elected to investigate three different types of threatening events that are consistent with self-report measures of threat sensitivity: receiving negative feedback, making mistakes, and viewing angry faces. Below, we discuss three ERPs that are elicited in response to these types of events and have previously been associated with self-reported threat sensitivity.

Negative feedback (P3)

The P3 is an ERP component that is associated with paying attention to feedback (Huang et al., 2015; Luck, 2005). Previous research has found that individuals who have greater sensitivity to threat tend to have larger P3 amplitudes to negative feedback than those with lower sensitivity to threat (De Pascalis et al., 2004; Heffer & Willoughby, 2020; Miltner et al., 2005). Furthermore, Reeb-Sutherland et al. (2009) found a trend whereby high sensitivity to negative feedback and larger P3 amplitudes were associated with greater anxiety.

Making mistakes (ERN)

The error-related negativity (ERN) is an ERP that is associated with performance monitoring, specifically when making mistakes during an inhibitory control task. Indeed, this ERP corresponds to the motivational significance of errors, whereby a larger ERN is associated with greater motivation to avoid errors (Hajcak & Foti, 2008; Meyer, 2017). Individuals with greater threat sensitivity or anxiety tend to have larger ERNs when making errors than those with lower threat sensitivity or anxiety (Boksem et al., 2008; Chong & Meyer, 2019; Hajcak et al., 2003; Ladouceur et al., 2006; Meyer, 2017; Meyer & Hajcak, 2019; Weinberg et al., 2010).

Viewing angry faces (N170)

N170 is an ERP that is elicited to faces (e.g., angry faces). Previous research has found that the N170 is larger in response to angry faces compared to other emotional expressions (e.g., neutral or happy Denefrio et al., 2019; Hinojosa et al., 2015; Jetha et al., 2013; Kolassa et al., 2009, Kolassa et al., 2007; Rossignol et al., 2005). Furthermore, individuals with greater threat sensitivity (or anxiety) tend to have a larger N170 to angry faces than those with less sensitivity to threat or anxiety (Bechor et al., 2019; Kolassa & Miltner, 2006; O’Toole et al., 2013; Wieser et al., 2010).

Age-related differences in ERPs

Of note, there has been some work identifying age-related differences in these ERPs across development (Downes et al., 2017). Specifically, there is evidence that the ERN (Davies et al., 2004; DuPuis et al., 2015; Kim et al., 2005; Santesso et al., 2006; Wiersema et al., 2007) and the P3 (Heffer & Willoughby, 2020; van Dinteren et al., 2014) tend to be larger among older participants. Other studies, however, have not found this pattern of age-related changes among the ERN or the P3 (Eppinger et al., 2009; Grose-Fifer et al., 2014). Age-related changes in the N170 are less consistent. Some studies have found that this ERP also gets larger with age (Hileman et al., 2011), whereas others have shown that the N170 fluctuates across development, showing more positive values around late childhood and early adolescence followed by greater negativity into adulthood (Batty & Taylor, 2006; Kuefner et al., 2010; Taylor et al., 2004). Overall, this research highlights that age may be an important factor to consider when investigating the ERN, P3, and N170. This research, however, has rarely taken into consideration individual differences that may also affect ERP amplitudes. Indeed, youth who are in the same age group have a larger N170s, ERNs or P3 when they have greater anxiety-related symptoms than when they do not have anxiety-related symptoms (Chong & Meyer, 2019; Heffer & Willoughby, 2020; O’Toole et al., 2013). Thus, it is unlikely that these changes in amplitude across development are simply the result of age.

To date, these ERPs have been investigated in isolation and no study has investigated whether individuals have neural consistency across these ERPs. In other words, it is not clear whether the same individuals have a high neural sensitivity across these different threat-related events (i.e., are individuals consistently sensitive to different threats?). Based on self-report measures of sensitivity to threat—that incorporate these different threatening situations all into one measure—we might expect that individuals will be consistently sensitive (or not) across these different situations. From a neuroscience perspective, however, these threat-related events are examined in separate lines of research and each of these ERPs are thought to have different neural generators. For example, studies that have combined fMRI and EEG have suggested that the ERN is generated in the anterior cingulate cortex (Debener et al., 2005; Mathalon et al., 2003)—an important region involved in goal-directed behavior (Holroyd & Yeung, 2012). The N170, however, is thought to be activated in face processing regions (e.g., superior temporal sulcus or the fusiform gyrus; Sadeh et al., 2010), while the P3 is a neural indicator associated with attention (Huang et al., 2015; Luck, 2005). Thus, within the neuroscience field, these different threatening events are thought to be distinct and related to different neural processes. No study has investigated individual differences in consistency of neural activation across these different tasks. Importantly, cumulative neural-level sensitivity across these different threats (e.g., across a number of situations) may be important to consider when targeting individuals at risk factors for anxiety, given that these individuals would have a high reactivity to threat across multiple situations (and multiple neural generators).

At the same time, there may be some children and adolescents who are not concerned with these threatening situations (e.g., they may be consistently low on these ERPs). Indeed, some individuals may be less bothered by these types of threats and therefore may pay less attention to negative feedback, errors, or angry faces. For instance, impulsive individuals tend to be less concerned with threatening situations and instead engage in nonreflective, stimulus-driven responses (Nigg, 2017). Previous research has found that impulsive individuals tend to have smaller P3 amplitudes (Justus et al., 2001; Ruchsow et al., 2008) and smaller ERN amplitudes (Checa et al., 2014; Pailing et al., 2002; Ruchsow et al., 2005; Stahl & Gibbons, 2007; Taylor et al., 2018) compared with individuals who are less impulsive. Less is known about whether impulsive individuals have smaller N170 activation to angry faces; thus, this latter analysis is more exploratory.

Current study

The current study seeks to assess whether consistently high neural activation to threats across different tasks is associated with both demographic and self-report factors (sensitivity to threat, impulsivity, age, pubertal status, sex, and parental education). While this analysis is exploratory, we expect that individuals who self-report higher sensitivity to threat and lower impulsivity will have consistently higher neural activation to threats. We also predict that adolescents (those with more advanced pubertal development and older age) will be more likely to have consistently higher neural activation to threats.

In a follow-up analysis, we further investigate whether the results will replicate when using a difference score for each of these threatening events (i.e., P3 loss - P3 wins; N170 angry - N170 neutral; ERN - CRN). A difference score specifically examines whether individuals have higher neural activation to threats than to non-threatening events. One difference between this analysis and the previous analysis is that difference scores offer a way to investigate whether individuals have neural activation that is specific to threats. In other words, difference scores provide a method for checking if individuals have greater activation to threats than non-threats. At the same time, however, only investigating difference scores can sometimes make interpretation of the results unclear. For example, an individual who has high neural activation to receiving negative feedback and high activation to receiving positive feedback would have a low difference score, but their score could be identical to a person who has low neural activation to negative feedback and low neural activation to positive feedback. Given the different strengths and weaknesses of these analyses, we include both an analysis using only the threat-related ERPs and an analysis using the difference score.

Method

Participants

Participants (N = 228, Mage = 10.57, SD = 1.77; age range = 8-14 years, 49.36% female) were drawn from several elementary and high schools in southern Ontario, Canada, and were part of a larger study examining the associations between wellbeing and youth health-risk behaviors. Parent report indicated that 82.96% of the children and adolescents were white, 1.89% were black, 1.42% were Asian, 2.36% were Hispanic, 0.47% indigenous, and 9.43% were mixed (0.47% of parents indicated that they preferred not to answer the question).

Procedure

Students were invited to participate in the study through visits to schools. Surveys were completed in classrooms during school hours, and all participants received gifts (e.g., backpacks) as compensation. Participants also completed a Mobile Lab component in which EEG data were recorded. Parents were asked to identify if their child had any illnesses or disabilities (either physical or mental). One participant was excluded because of a diagnosis of autism. Eleven people were excluded because of equipment issues (e.g., the event markers did not show up) on at least one of the tasks. Fifteen people were excluded, because EEG data were not usable (e.g., contained a larger number of muscle/movement artifacts) on at least one of the tasks. Two participants did not complete one of the tasks, and five participants did not follow the instructions (e.g., they were off task). We also had 33 participants who had less than 6 trials on the ERN, which can be cause for concern (Olvet & Hajcak, 2009). Thus, we removed these participants from our analyses. The final sample included 161 participants. Of note, the final sample was fairly equally distributed among children aged 8-11 years (N = 91) and adolescents aged 12-14 years (N = 70). The University Ethics Board approved this study. Participants provided informed assent, and their parents provided informed consent.

Missing Data Analysis

Missing data occurred because some participants did not finish the questionnaire (average missing data = 5.09%) and because some participants were absent during the time of the survey. The percentage of students who completed the survey was 93.43%. Missing data were primarily due to absenteeism, but also occasionally due to time conflicts, RA mistakes (e.g., not inviting a child to complete the survey), or students moving to another school district with no contact information. Missing data were imputed using the expectation-maximization algorithm (EM). EM retains cases that are missing survey waves and thus avoids the biased parameter estimates that can occur with pairwise or listwise deletion (Schafer & Graham, 2002).

Measures

Demographics

Pubertal status, age, sex, and parental education were collected. Parental education was measured with one item per parent on the following scale: 1 (did not finish high school); 2 (high school diploma); 3 (some university/college); 4 (associate degree/diploma); 5 (undergraduate degree); 6 (graduate degree). The average level of parental education for this sample was a 4, “completed an associate degree and/or technical diploma.” Pubertal status was assessed using the Puberty Development Scale (PDS; Petersen et al., 1988). The PDS is a self-report measure that assesses body hair, facial hair, and voice development in boys, and body hair, menarche, and breast development in girls. All items were rated on a 4-point scale from 1 (not yet started changing) to 4 (change seems complete). The PDS scale exhibits good reliability and validity (Carskadon & Acebo, 1993; Petersen et al., 1988).

Sensitivity to Threat

Participants reported the extent to which they agreed with three items specifically examining sensitivity to threat from the Behavioral Inhibition Scale (Carver & White, 1994; “Criticism hurts me quite a bit”; “I feel worried when I think I have done poorly at something”; “I feel pretty worried or upset when I think or know somebody is angry at me”) on a scale ranging from 1 (strongly disagree) to 4 (strongly agree). Higher scores indicate higher levels of threat sensitivity. Cronbach’s alpha was 0.73.

Impulsivity

Impulsivity was measured using 4 items (“I do not consider the consequences before I act”; “I say things without thinking”; “I often act on the spur of the moment”; “I do things without thinking”; Baars et al., 2015; Barratt, 1959; Patton et al., 1995; Van der Elst et al., 2012). Items were assessed on a 4-point scale from 1 (almost never) to 4 (almost always). Higher scores indicate higher impulsivity. Cronbach’s alpha for this scale was 0.79.

Go/No-go task

Participants completed the go/no-go task (DuPuis et al., 2015) while EEG was recorded. Participants were instructed to continuously push a button every time a stimulus appeared (a Go trial) unless the newly presented stimulus matched the previously presented stimulus (i.e., the same stimulus appeared twice in a row), in which case the participant needed to refrain from pushing the button on that trial (a No-go Trial). We were particularly interested in the ERN, an ERP elicited when participants make mistakes during this task. Stimuli were presented 1,000-ms apart, and there were a total of 225 trials. On average, participants committed 17 errors (SD = 8.25) on no-go trials. The average reaction time to a no-go trial was 362 ms (SD = 47.89 ms). To create the difference score, we also extracted the correct-response negativity (CRN)—an ERP elicited when participants correctly push a button during a go trial.

Balloon Analogue Risk Task

The Balloon Analogue Risk Task (BART) is a behavioral task that has been used to measure risky decision-making (Lejuez et al., 2002). We used a modified version of the BART to use this task for an ERP study (Heffer & Willoughby, 2020). Participants were instructed to inflate a series of balloons to earn points. Participants indicated the number of pumps they wanted to inflate the balloon at the beginning of the trial (Euser et al., 2013; Pleskac et al., 2008; Yau et al., 2015). Participants then observed the balloon as it either safely reached the inflation number they picked (i.e., they won the points for that trial), or the balloon burst before reaching that point (i.e., they lost the points for that trial). Given that this task provides feedback associated with losing (i.e., when the balloon pops and points are lost), it facilitates the examination of sensitivity to negative and positive feedback using ERPs (Chandrakumar et al., 2018; Fein & Chang, 2008; Gu et al., 2018; Takács et al., 2015).

The task consisted of 90 trials with a maximum breaking point of 20 pumps. The probability of the balloon popping increased as the number of pumps chosen increased (e.g., choosing to pump the balloon up to “15” had a greater likelihood of it popping compared to pumping the balloon up to “5”). After feedback was presented, a new balloon appeared after 1,000 ms. Participants earned one point for every pump of the balloon, and points for all the “win” trials were summed to calculate their total points. Participants were instructed that the goal of the task was to earn as many points as possible.

Face-processing Task

Participants also completed a face-processing task. During this task, participants were shown pictures of different emotional faces (happy, neutral, fear, and anger), as well as other stimuli (e.g., butterflies, houses, and checkerboards). Participants were instructed that the point of the task was to “catch the butterflies” by clicking a button whenever a butterfly appeared on the screen. This instruction was given to keep children and adolescents’ attention during the task; however, our main goal was to investigate face-processing to angry faces and angry compared with neutral faces. Four blocks were included in this task. The angry face was presented 60 times throughout the task. Overall, there were 496 trials: 240 face trials and 256 nonface trials (checkerboards, houses, and butterflies).

Electrophysiological Recording

Electroencephalography (EEG) was recorded continuously from a BioSemi ActiveTwo system using a 96-channel montage and 7 face sensors. The data were digitized at a sampling rate of 512 Hz. Pre-processing was conducted to identify (1)channels/components that were unreliable within a given time-period, (2)time-periods that were unreliable, (3) and channels/components that were unreliable throughout the recording.

Pre-processing (Channels)

Pre-processing was automated (using MATLAB 2012b scripts) to be performed using EEGLAB (Delorme & Makeig, 2004) version 13.6.5b and was then executed using Octave on Compute Canada’s high performance computer cluster (Cedar: see Desjardins & Segalowitz, 2013; Desjardins et al., 2020; van Noordt et al., 2017, van Noordt et al., 2015 for more details). The data were first separated into 1 second nonoverlapping time windows. For each time window, the voltage variance across each channel was calculated (a 20% trimmed mean was used). Channels were flagged as unreliable if they had a z-score six times greater than the voltage variance across all channels. Time-periods (i.e., the 1-second time windows) were considered unreliable if more than 10% of the channels were identified as having extreme voltage variances. Finally, any channels that were flagged in more than 20% of the time-periods were considered unreliable throughout the recording.

The data were re-referenced to an interpolated average of 19 sites, excluding flagged channels. The data were filtered with a 1-Hz high pass and 30-Hz low pass filter given that cortical activity would not be expected to exceed 30 Hz. After this step, the data were again checked for the same issues reported above: (1) channels that are unreliable within a given time-period, (2)time-periods that are unreliable, (3) and channels that are unreliable throughout the recording. Specifically, any channels that were unlike its neighbouring channels (e.g., had a low correlation with channels around it), were flagged. A channel was flagged as unreliable if it had a z-score that was 2.326 times greater than the mean of the 20% trimmed distribution of correlation coefficients. Time-periods were considered unreliable if more than 10% of the channels within the window were flagged as unreliable. Any individual channels that were flagged in more than 10% of time-periods were considered unreliable across the entire recording. Bridged channels (i.e., channels that are highly correlated with invariable signal) were identified after dividing the average maximum correlation by the standard deviation of the distribution of correlation coefficients. Channels that had a positive z-score that was eight times greater than the 40% trimmed distribution of coefficients were flagged as bridged channels.

Pre-Processing (Components)

After pre-processing the channel data, all data that had not been flagged as unreliable were concatenated back into continuous data. These data were then submitted to an initial Adaptive Mixture of Independent Component Analysis (AMICA) to identify different components of the EEG data (e.g., heart rate components, eye blink components, cortical components, etc.). This process helps to separate brain activity (neural components) from non-neural activity (e.g., muscle movement).

During this procedure, the data were windowed into 1 second time epochs. Unreliable components were detected by comparing each individual component to the variance among all components. Components were flagged if they had a z-score that was 2.326 times greater than the trimmed mean. Time-periods that had more than 10% of its components flagged were considered unreliable. The data were then concatenated into the continuous time course and submitted to three simultaneous AMICA decompositions to assess whether components were replicable (i.e., is muscle movement consistently being classified as muscle movement when the process is repeated multiple times). The procedure above for identifying unreliable components (within 1-second epochs) was completed again using the continuous time series data. Next, a dipole (which identifies the position and orientation for the distribution of positive and negative voltages) was fit using the dipfit plugin in Matlab (Oostenveld et al., 2011). Components with a dipole fit residual variance greater than 15% were flagged. Finally, components were classified using the ICMARC plugin. This process assesses each component against a crowd-sourced database to identify activation consistent with five different categories: eye blinks, neural, heart, lateral eye movements, muscle contamination, and mixed signal.

After pre-processing, a manual quality control review was completed to ensure that the decisions made during pre-processing were appropriate. This procedure was completed by one trained research assistant who assessed the accuracy of the independent component classifications. For example, the research assistant would identify whether cortical components were correctly distinguished from non-cortical components (e.g., muscle, eye blinks, etc.) based on topographical projection, continuous activation, dipole fit, and power spectrum profile. Thus, the quality control review involved using the independent components to help with artifact correction.

EEG post-processing

EEG data were then segmented into single trials and time-locked to the onset of the (1)no-go response (and correct response) from the Go/No-Go task, (2) negative feedback (and positive feedback) from the BART task, and (3) angry faces (and neutral faces) from the face-processing task. A final quality check was completed to identify (and remove) channels that had extreme voltage fluctuations (±50 mV). Channels that were removed during pre-processing were interpolated (i.e., rebuilt using the remaining channel data) to the full montage of 103 channels (96 scalp, 7 exogenous) using spherical spline. The current study used frontocentral midline sites (FCz: electrodes A8 and B8 on our montage) to identify the ERN and CRN during the no-go/task and epochs were baseline corrected at −600 to −400. Similar to previous studies (Fein & Chang, 2008; Hassall et al., 2013; Heffer & Willoughby, 2020), the current study used central midline sites (Cz: electrodes A19 and B19 on our montage) to identify the P3 activation during the BART task; epochs were baseline corrected at −200 to 0. Finally, posterior-temporal sites (P7 and P8; electrodes C2, C3, C12, and C13 on our montage) were used to identify the N170 during the face processing task; epochs were baseline corrected at −200 to 0.

Plan of Analysis

We used STATSLAB, an open-source toolbox that implements robust statistics for analysis of EEG data to extract the ERPs for each task (Campopiano et al., 2018). This software allows for testing using percentile bootstrap and trimmed means, a technique that is robust to distribution characteristics such as skew, outliers, uneven tails, and various model assumption violations (Wilcox, 2017).

In STATSLAB, single trial data for our channels were extracted and averaged together. For each subject, the single trial data were resampled, with replacement, to generate a surrogate sampling distribution. The 20% trimmed mean was taken across trials, at each time point (i.e., removing the most extreme voltages at each time point), to generate a robust bootstrapped ERP. Iterating this process of resampling, trimming, and scoring the difference wave was performed 1,000 times (see Campopiano et al., 2018 for details). The P3 was extracted at the most positive points (315 ms for losses; 307 ms for wins); all other ERPs were extracted at the most negative points (176 ms for anger N170; 174 ms for neutral N170; 35 ms for ERN; 10 ms for CRN). See Fig. 1 for the ERPs and corresponding topographies.

Fig. 1
figure 1

Waveforms and topographical maps show the ERN and CRN during the go/no-go task (0-100 ms), the P3 to losses and to wins during the BART task (250-400 ms), and the N170 to angry faces and to neutral faces during the face-processing task (140-200 ms). Black dots on topographical maps indicate the channel cluster used for analysis. Of note, the N170 does not cross zero in our sample, due to a large/dominating P1 amplitude; this finding is typical among children and adolescent populations (Kuefner et al., 2010; Taylor et al., 2004). The orange waveforms represent neural activation to threatening situations; the blue waveforms represent neural activation to nonthreatening situations

We next created a variable to identify consistently high neural activation across the tasks. For the BART task, given that the P3 is a positive waveform, scores above the mean reflect having high neural activation to threats. For both the go/no-go task and the face processing task, given that the ERN and the N170 are negative waveforms, scores below the mean reflect having high neural activation to threats. We created a consistency variable, whereby a score of 4 represented having consistently high neural activation on all three ERPs, a score of 3 represented having high neural activation on two out of three tasks, a score of 2 represented having high neural activation on one out of three tasks, and a score of 1 represented having high neural activation on none of the tasks (i.e., low neural activation across all three tasks). Thus, higher scores represent more situations where individuals had high neural activation to threats, whereas lower scores represent less situations where individuals had high neural activation to threats.

In a follow-up analysis, we replicated this analysis using the difference score for each ERP (P3 loss- P3 win; angry N170- neutral N170; ERN- CRN). For the BART task, participants with scores above zero would have a larger P3 amplitude to loss feedback compared with win feedback. For both the go/no-go task and the face processing task, scores below zero reflect having larger ERP activation to the threat (mistakes and angry faces) compared to the nonthreatening situations (successful button presses and neutral faces). In this case, we created a consistency variable whereby a score of 4 represented having consistently higher neural activation to threats than nonthreats on all three ERPs, a score of 3 represented having higher neural activation to threats than nonthreats on two out of three tasks, a score of 2 represented having higher neural activation to threats than nonthreats on one out of three tasks, and a score of 1 represented having higher neural activation to threats than nonthreats on none of the tasks (i.e., lower neural activation to threats than to nonthreats across all three tasks). Thus, a higher difference score reflects more situations in which an individual has higher neural activation to threats than to nonthreats.

Results

Means and standard deviations for all study variables are reported in Table 1. Of interest, the mean score on neural consistency to threats was 2.47 (SD = 0.85), suggesting that on average, participants had high neural activation to threats on one or two tasks. Approximately, 11.8% of participants had consistently high neural activation to all three tasks, 34.8% of participants had high neural activation to two out of three tasks, 41.6% of participants had high neural activation to one out of three tasks, and 11.8% of participants had low neural activation on all three tasks. For the difference score, the average was 2.86 (SD = 0.84); higher scores represented consistently higher neural activation to threats than nonthreats. Approximately, 24.8% of participants had consistently higher neural activation to threats than nonthreats on all three tasks, 41% of participants had higher neural activation to threats than nonthreats on two out of three tasks, 29.8% of participants had higher neural activation to threats than nonthreats on one out of three tasks, and 4.4% of participants had no tasks whether they had higher neural activation to threats than nonthreats.

Table 1 Means and standard deviations of study variables

What factors predict consistently high neural activation to threats?

A linear regression was used investigate what factors (sensitivity to threat, impulsivity, age, pubertal status, sex, and parental education) predict consistently high neural activation to threats. Table 2 contains the results of the linear regression. Sensitivity to threat [β = 0.236, p = 0.017], impulsivity [β = −0.374, p = 0.002], and sex [β = −0.284, p = 0.043] were the only significantly predictors of consistent neural activation to threats. Specifically, higher self-reported threat sensitivity and lower self-reported impulsivity predicted having consistently higher neural activation to threats. Additionally, males had consistently higher neural activation to threats than females.

Table 2 Regression results using stability of ERPs to threats as the criterion

What factors predict consistently higher neural activation to threats than to nonthreats (i.e., using the difference score)?

A linear regression was used investigate what factors (sensitivity to threat, impulsivity, age, pubertal status, sex, and parental education) predict consistently higher neural activation to threats than nonthreats (i.e., difference score). Table 3 contains the results of the linear regression. Sensitivity to threats [β = 0.208, p = 0.036] was the only significantly predictor of the difference score. Specifically, higher self-reported threat sensitivity predicted having consistently higher neural activation to threats than to nonthreats.

Table 3 Regression results using stability of ERPs difference score as the criterion

Discussion

Threat sensitivity frequently has been characterized as a risk factor for the development of anxiety (Bar-Haim et al., 2007). Self-report measures of threat sensitivity often combine different threat-related situations (e.g., making a mistake, receiving negative feedback, worrying about someone being angry with you) into one measure. Neuroscience research, on the other hand, often investigates these threat-related situations separately using different tasks. The current study used three EEG tasks to investigate consistency of neural activation to threats.

We were interested in what demographic and self-report factors are associated with consistently high neural activation to threats (e.g., Do adolescents have more consistently high neural activation to threats than children? Do individuals who self-report greater threat sensitivity have consistently high neural activation across these tasks?).

First, we created a measure of neural consistency to threats. Using three different ERPs (P3, N170, and ERN) and three different tasks (the BART, a go/no-go task, a face-processing task), we found that it was quite common for youth to have high neural sensitivity to threats on at least one or two tasks (M = 2.4, SD = 0.85). It was less common, however, to have consistently high neural activation to threats on all three tasks (~12% of the sample). Our results show that although self-report measures of threat sensitivity group these different threat-related events together, not all youth have consistently high neural activation to receiving negative feedback, angry faces, and making mistakes.

Our results show that self-reported impulsivity was a predictor of consistently low neural activation, while self-reported threat sensitivity was a predictor of consistently high neural activation to threats. We also found that males had consistently higher neural activation to threats than females. The latter finding was not part of our original predictions. It is not entirely clear why males would have higher neural consistency to threats than females. Given that this is the first study investigating neural consistency across multiple tasks, future research is needed to tease apart this association.

On the other hand, our findings regarding impulsivity and threat sensitivity were in line with our predictions. Indeed, individuals with high self-reported impulsivity may be less troubled by different types of threats, given that they are more likely to engage in nonreflective, stimulus-driven responding (Nigg, 2017). At the same time, individuals with high self-reported threat sensitivity seem to have higher neural sensitivity to a variety of different threats. Therefore, investigating consistent neural activation across different tasks may be an important way to identify youth who are at the greatest risk for impulsivity and threat sensitivity. Indeed, if individuals have a consistent response across three different threats, it would be more likely that the individuals’ threat sensitivity is being accurately classified. This result is in line with the cumulative risk hypothesis (i.e., a greater number of risk factors is associated with more problem behaviors; Appleyard et al., 2005). Thus, individuals who have consistently high (or low) neural activation across multiple situations would be the most likely to self-report high levels of sensitivity to threat (or impulsivity). Identifying youth with consistent neural activation to different threats may be an important avenue for researchers interested in the development of anxiety. Indeed, given that many youth show high neural activation to at least one threatening event, cumulative neural activation to a variety of different threats may be a promising approach to identify youth who are truly at risk for the development of anxiety.

Surprisingly, we did not find that adolescents (i.e., individuals with more advanced pubertal status or older age) had consistently higher neural activation to threats. We expected that adolescence may be a time of heightened sensitivity to emotionally salient events, such as threatening events (Casey, 2015; Somerville et al., 2010; Steinberg et al., 2008), and therefore, we thought that adolescents would have heightened neural activation across a variety of different threatening events. This nonsignificant finding, however, suggests that children also show consistently high neural sensitive to different threats (see Chong & Meyer, 2019; Heffer & Willoughby, 2020; O’Toole et al., 2013, for other studies showing that children with anxiety-related symptoms can show high ERP activation to these different threatening situations). Thus, youth who demonstrate consistently high neural sensitivity to different threats should be further investigated, especially when considering early interventions aimed at identifying youth who may self-report sensitivity to threats.

In a follow-up analysis, we also wanted to investigate whether our results were consistent when using the difference scores for each ERP (i.e., P3 loss - P3 wins; N170 angry - N170 neutral; ERN - CRN). The main conclusion from our study was replicated when using the difference score: youth with higher self-reported threat sensitivity had consistently higher neural activation to threats than to nonthreats. However, we did not replicate our findings regarding impulsivity and sex. There may be several reasons for the latter findings. In terms of impulsivity, for example, if impulsive individuals are less reflective during the task, they may have been less sensitive to both threatening events and nonthreatening events, providing them with a difference score that thus is similar to others who have high neural activation to both threats and nonthreats. Again, this may be one disadvantage of the difference score—individuals with different patterns of neural activation can end up with the same value on a difference score measure. Additionally, in two out of three of our tasks the nonthreatening event was a positive event (receiving win feedback and making a correct response). Previous research has found that some individuals have high sensitivity to both negative and positive stimuli (Coplan et al., 2006). Thus, individuals who find threatening events aversive (but also find positive events exciting) may not be well represented by a difference score. Future research is needed to replicate these results using multiple threat-related situations in comparison to neutral events.

Our study has important strengths, including a large sample, inclusion of three different ERP tasks, and the use of multiple methods (e.g., self-report and EEG). At the same time, our study is not without limitations. First, we did not include the full scale for either self-report sensitivity to threat or self-reported impulsivity as the data were part of a larger study assessing a wide range of constructs, and it was not feasible to include every item from each scale due to time constraints. Future research would benefit from investigating group differences in threat-sensitivity and impulsivity using the full scales. Second, our study is concurrent; thus, casual inferences cannot be concluded (e.g., we are unable to ascertain whether more consistent neural activation across tasks leads to greater self-reported sensitivity to threat and/or whether greater self-reported sensitivity to threat leads to more consistent neural sensitivity). Longitudinal studies investigating both self-report measures and neural activation at each time point are necessary before drawing these conclusions. Finally, we did not assess whether there were individual differences in specific combinations of tasks (e.g., individuals who have high neural activation to negative feedback, low neural activation to making mistakes, but high neural activation to angry faces). Given that our main interest was in consistent threat sensitivity, this breakdown was not necessary to answer our research question.

Despite these limitations, our study has important implications. A small (~12% of the sample), but important percentage of the sample was identified as having consistently high neural response to threats. Furthermore, consistently higher neural activation to threats (and higher neural activation to threats than to nonthreats) was associated with higher self-reported threat sensitivity. Our results suggest that although it is common for youth to have high neural activation in response to one threat-related tasks, it is far less common for youth to have consistently high neural activation to threats. Threat sensitivity is thought to be associated with the development of anxiety; however, not all youth who are sensitive to threats develop anxiety (Pérez-Edgar et al., 2010). Given that anxiety affects roughly 7–15% of youth (Beesdo et al., 2009; Ghandour et al., 2019), consistency of neural activation to threats may be an important group to investigate in order to identify nonnormative levels of threat sensitivity.