Introduction

The ability to process relevant emotional information and ignore irrelevant emotional information is central for emotional self-regulation (Berg et al., 2016; Marusak, Martin, Etkin, & Thomason, 2015; Ochsner & Gross, 2005). These processes have extensively been examined using emotional interference (Stroop) tasks (Egner, Etkin, Gale, & Hirsch, 2008; Etkin, Egner, & Kalisch, 2011; Kunde & Mauer, 2008; McKenna & Sharma, 2004; Pan, Lu, Chen, Wu, & Li, 2016; Saunders, Milyavskaya, & Inzlicht, 2015; Schreiter, Chmielewski, & Beste, 2018b; Zinchenko, Kanske, Obermeier, Schröger, & Kotz, 2015). In the face-word Stroop task, for example, emotional faces are presented together with an emotional word that is either congruent or incongruent with the presented emotional face (Egner et al., 2008, 2008; Etkin et al., 2011; Kerns et al., 2004; Marusak et al., 2015). Typically, response accuracy and reaction times (RTs) decline when classifying the emotional target dimension (face) on the Stroop stimulus, in trials where the face and the word are incongruent (“emotional Stroop effect”) (Algom, Chajut, & Lev, 2004; MacLeod, 1991; Weissman & Carp, 2013). Importantly, the emotional Stroop effect, like non-emotional Stroop effects, represents a “stimulus-stimulus conflict” (S-S conflict) (Hommel, 2015). This conflict refers to competition and interference between two stimulus dimensions that are simultaneously presented and are semantically similar (Hommel, 2015). Therefore, the magnitude of emotional (Stroop) effects is likely dependent on how deeply the task-irrelevant and interfering emotional stimulus dimension is processed. Only when both stimulus dimensions are processed deeply, are strong emotional interference effects likely to occur. While this seems obvious, an exact characterization of the cognitive-neurophysiological processes being modulated by the depth of processing emotional interfering information is still indeterminate.

In the current study, we examine those aspects integrating neurophysiological and EEG-source localization data with vegetative nervous system data (i.e., pupil diameter data). We experimentally vary the depth of processing (DOP) of the task-irrelevant stimulus dimension by systematically adjusting the intentional relevance of the emotional distractor dimension. Systematically manipulating the DOP of the irrelevant/distracting stimulus dimension should modulate the extent to which emotional conflict is reflected in behavior. Deeper processing of the task-irrelevant stimulus dimensions should result in stronger behavioral Stroop effects, i.e., increased response times and decreased accuracy on incongruent trials. In can therefore be supposed that DOP modulates the difficulty in processing conflicting information.

A natural consequence of a deeper processing, and more difficult processing of the interfering information, is that conflict-monitoring processes and the decision about which response to select become complicated. Considering these processes, it has been suggested that the norepinephrine (NE) system strongly modulates these processes. The adaptive gain theory suggests that the locus coeruleus-norepinephrine (LC-NE) system modulates neural processing during task-relevant decisions (Aston-Jones & Cohen, 2005). It has been suggested that the phasic release of NE facilitates decision processes under more control-demanding circumstances and optimizes behavioral performance according to the relevant task at hand (Aston-Jones & Cohen, 2005). Therefore, it is likely that variations in the DOP of emotional interfering information affect the modulation of task-relevant decision points by the NE system. Importantly, it has been shown that the NE system activity modulates human pupil diameter in a process- and time point-specific manner ( Costa & Rudebeck, 2016; Joshi, Li, Kalwani, & Gold, 2016; Varazzani, San-Galli, Gilardeau, & Bouret, 2015). Therefore, the pupil diameter can be used as an indirect index of NE system activity (Dippel, Mückschel, Ziemssen, & Beste, 2017; Helton et al., 2005; Hong, Walz, & Sajda, 2014; Hou, Freeman, Langley, Szabadi, & Bradshaw, 2005; Jepma & Nieuwenhuis, 2011; Mückschel, Chmielewski, Ziemssen, & Beste, 2017a; Murphy, Robertson, Balsters, & O’connell, 2011). The pupil diameter is modulated by tonic and phasic NE activity (Hong et al., 2014; Hou et al., 2005; Jepma & Nieuwenhuis, 2011; Murphy et al., 2011). Larger pupil diameters have been shown to reflect higher NE concentrations (Hou, Freeman, Langley, Szabadi, & Bradshaw, 2005b; Phillips, Szabadi, & Bradshaw, 2000). Recent data show that neurophysiological indices of response selection and cognitive control are modulated by NE system activity, as indexed using pupil diameter (Chmielewski, Mückschel, Ziemssen, & Beste, 2017; Dippel et al., 2017; Mückschel, Chmielewski, et al., 2017; Wolff, Mückschel, Ziemssen, & Beste, 2018); i.e., there are correlations between the pupil diameter data and the neurophysiological (EEG) data at a specific time point in the processing cascade. With regard to the DOP manipulation in the current study, we hypothesize that there should be strong correlations between pupil diameter data and neurophysiological data when the DOP of the irrelevant stimulus dimension is increased. This is because task-relevant decision processes become more demanding when irrelevant, distracting emotional information in processed in depth. No or much smaller correlations should be evident when the irrelevant stimulus dimension is not processed in depth. As mentioned, modulations of the NE system during cognitive control tasks occur at specific time points. It has been shown that processes in the N2 ERP, and even more so in the P3 ERP time window, are modulated by the NE system as indexed by the pupil diameter (Chmielewski et al., 2017; Dippel et al., 2017; Mückschel, Chmielewski, et al., 2017; Wolff et al., 2018). This is in line with other studies (Murphy et al., 2011; Nieuwenhuis et al., 2005; Warren & Holroyd, 2012). In particular, neural activity in the P3 time window has long been argued to be strongly modulated by the NE system and may even reflect NE activity (Nieuwenhuis et al., 2005). This is also plausible because the P3 component reflects decision making and stimulus-response association processes (Gray, Ambady, Lowenthal, & Deldin, 2004; Twomey, Murphy, Kelly, & O’Connell, 2015), which are known to be modulated by the NE system (Aston-Jones & Cohen, 2005). These processes are enhanced when interfering information has a strong impact on information processing (Mennes, Wouters, Bergh, Lagae, & Stiers, 2008). Thus, it is reasonable to expect that in the DOP-condition the increased complexity of decision-making processes should evoke greater involvement of the NE system activity reflected in pupil diameter. We hypothesize that the pupil diameter is correlated with neurophysiological data at the time point of the P3 ERP in the DOP group, but not in the group in which the interfering emotional information is not processed in depth. We also hypothesize that modulations of the DOP will affect the amplitude of the P3 ERP-component. Notably, it is well established that pupil dilation occurs with a 200- to 500-ms delay relative to LC-NE activity (Joshi et al., 2016; Nieuwenhuis, De Geus, & Aston-Jones, 2011). So, correlations between pupil reaction and neurophysiological signals corresponding to the P3 time window should be visible within a time frame of around 700 ms onwards. During emotional Stroop conflicts, it is well known that the P3 amplitudes are increased in emotionally congruent compared to incongruent trials (Clayson & Larson, 2013; Delplanque, Lavoie, Hot, Silvert, & Sequeira, 2004; Larson, Clayson, & Clawson, 2014; Schreiter et al., 2018b; Yang et al., 2013). Since an increased DOP is hypothesized to increase emotional Stroop effects, we also hypothesize that the P3 modulations between congruent and incongruent trials are also stronger when DOP is increased. From a functional neuroanatomical perspective, these effects are likely to be associated with activation differences in parietal regions, but in addition, the insular cortex likely plays an important role. Parietal regions have frequently been associated with modulations in the P3 ERP-component (Bledowski et al., 2004; Gray et al., 2004) and also the NE system modulates P3-related processes via parietal regions (Wolff et al., 2018). The insular cortex may be relevant because this region plays a major role in the interpretation of emotional information and has been shown to be involved in emotional and non-emotional interference processing (Gogolla, 2017; Simmons, Matthews, Paulus, & Stein, 2008; Xue et al., 2016). Importantly, it has been shown to modulate emotional decision-making processes by orchestrating external emotional with internal physiological signals (such as NE release) to predict future decision outcomes – particularly in situations of affective ambiguity (Gogolla, 2017). Therefore, we hypothesize that modulations of the P3 ERP-component that are modulated by the NE system are associated with the insula. Specifically, we hypothesize that there is stronger insular and parietal activity within the DOP group.

Since previous findings suggest that emotional stimuli have identical effects on early attentional selection processes (Brosch, Sander, Pourtois, & Scherer, 2008; Clayson & Larson, 2013; Pool, Brosch, Delplanque, & Sander, 2016) we do not expect effects of DOP manipulation on P1 and N1 ERPs. Considering the resolution of interferences, the N2 ERP-component is more negative during conflicting trials than on non-conflicting/non-interfering trials (Chen et al., 2016; Chmielewski & Beste, 2017; Larson et al., 2014; Shang, Fu, Qiu, & Ma, 2016). However, due to the complexity of stimuli in word-face emotional Stroop tasks, no modulations of the N2 by interference effects have been observed (Schreiter, Chmielewski, & Beste, 2018a; Schreiter et al., 2018b). Therefore, we do not expect to observe amplitude modulations and differential correlations with the pupil diameter in the N2 time window. The CSP (conflict-slow potential) has been shown to be modulated by emotional Stroop conflict (Chen et al., 2016; Larson et al., 2014; Ma, Liu, Zhong, Wang, & Chen, 2014; Schreiter et al., 2018b), but seems to reflect pro-active control processes rather than conflict control in response to complex task-demands (Larson et al., 2014; Pan et al., 2016; Shen, Xue, Wang, & Qiu, 2013). Therefore, it is unlikely that there are modulations of the processes by the NE system (i.e., correlations with pupil diameter data) and modulations in amplitude due to a manipulation of DOP.

Materials and methods

Participants

We tested two groups of participants. Each group included N=20 healthy participants between 18 and 30 years of age (mean age 23.73 ± 0.72 years; n=12 females in each group). In one group, the depth of processing of the interfering emotional stimulus was increased (refer to the section Manipulation of depth of processing (DOP)). The sample sizes used for each group have been shown to yield reliable correlations between pupil diameter and ERP-data (Dippel et al., 2017; Mückschel, Chmielewski, et al., 2017; Wolff et al., 2018). The reason for choosing a between-subject and not a within-subject design is that “order effects” can never be ruled out using within-subject designs. A crossover design (i.e., where the order of DOP vs. non-DOP is counterbalanced across participants) requires testing, whether “test order” could have affected the results. The consequence is that a between-subject factor is still needed for this kind of investigation. Relating to this aspect, learning/neural plasticity processes of stimulus features and gain-control mechanisms (refer to actions of the NE system and the recording of the pupil diameter data) are closely related, as learning also enhances the neural signal-to-noise ratio in cortical circuits (Dosher & Lu, 1998). This is particularly important when assessing processes related to stimulus-stimulus conflicts.

All participants had normal or corrected-to-normal vision and were screened on personal health background to ensure our sample was free of individuals previously diagnosed with any psychiatric or neurological disorders, or taking regular medication. All participants provided written informed consent prior to the experiment, and received payment (15 €) or course credits (TU Dresden students) after completion of the study. The study was approved by the institutional review board of the Medical Faculty of the TU Dresden and conducted in accordance with the Declaration of Helsinki.

Task

We employed an emotional Stroop paradigm in which emotional interference is induced by the incongruency between two emotional dimensions of one presented stimulus (emotional faces and words). Pictures of emotional facial expressions represented the task-relevant dimension, while emotional words printed across the picture constituted the task-irrelevant distractor dimension. We used pictures from four different Caucasian males depicting either a happy or an angry facial expression. The pictures were selected from the NimStim, MacBrain Face Stimulus Set (Nim Tottenham supported by the John D. and Catherine T. MacArthur Foundation Research Network on Early Experience and Brain Development, open source available on http://www.macbrain.org/resources.htm). The faces were presented in gray-scale using XNView (for windows) and congruent or incongruent emotional words, i.e., “FREUDE” or “ÄRGER” (German for “HAPPINESS” and “ANGER”) were written centrally across the faces in red capital letters (refer to Fig. 1 for details).

Fig. 1
figure 1

Example stimuli and experimental paradigm of the emotional Stroop task and depiction of the conceptual background of the current investigation. (a) Participants were instructed to classify the task-relevant emotional expression of the face (i.e., angry or happy) that had a task-irrelevant emotional word (either angry or happy) written across. Stimuli were either congruent or incongruent with respect to facial expression (emotional target: stimulus dimension 1) and word (emotional distractor: stimulus dimension 2), which created emotional interference. Emotional interference effects were greater in the DOP group, where the distracting stimulus dimension 2 (b) required deeper levels of processing due to intermittent queries. In the DOP condition, both aspects of the composite stimuli are processes, whereas it is possible to ignore the written word (i.e., dimension 2) of the stimuli

There were equal proportions of congruent and incongruent stimuli for each emotion, adding up to 32 different stimuli. These stimuli were pseudo-randomly presented in four blocks of 216 trails each (i.e., 864 trials in total). In each block, the same number of congruent and incongruent stimuli were presented and it was ensured that the same proportion of the different stimuli was presented. Both experimental groups were asked to respond via the arrow keys of a conventional computer keyboard. The stimuli (face-target/task-relevant dimension and emotional-word/task-irrelevant dimension) were presented for 450 ms and then followed by the presentation of the fixation cross for the rest of the trial. All participants were instructed to identify the correct facial emotional expression as quickly as possible for each trial, and encouraged to make use of the allocated breaks if needed. For stimuli depicting happy target emotions participants were required to press the left ctrl-key, and for angry emotions the right ctrl-key. If participants did not respond within 1,000 ms of target onset, the trial was treated as a miss. After the response was executed, a variable response stimulus interval (RSI), jittered between 850 and 1,250 ms, was included, in which a central fixation cross was presented. Likewise, responses to intermittent queries within the “depth of processing” group were given with the left and right ctrl-keys (left- happy/right- angry). All participants were instructed not to double-press a button and to avoid missing responses as much as possible. Before commencing with the experiment, participants were able to become familiar with the experimental design by conducting a 40-trial exercise. In total, the task lasted for approximately 40 min.

Manipulation of depth of processing (DOP)

We systematically enhanced the depth in which the task-irrelevant emotional stimulus dimensions had to be processed by adding intermittent queries about the content of the emotional distractor dimension in one group of participants (DOP group). One group of participants was intermittently presented with infrequent queries about the emotional valence of the distractor (word) stimulus in the trial immediately preceding that query (i.e., “Please indicate which word was presented to you in the previous trial? Press the left key for ‘ANGER’ or right key for ‘HAPPY’.”). The queries were pseudo-randomized and equally distributed across all experimental blocks. This was done to direct participants’ attention to the distractor dimension (i.e., enhance the intentional relevance of the task-irrelevant emotional information). The participants were thus forced to process the emotional face and the emotional word in depth. Queries were treated as hits if the correct answer was obtained within 300–1,300 ms.

EEG recordings and analyses

EEG data were recorded with a sampling rate of 500 Hz using a 60-channel system (BrainAmp, Brain Products Inc.). Passive Ag/AgCl-electrodes (60 recording electrodes) were mounted in an elastic cap (EasyCap Inc.) and arranged in equidistant positions approximating the positions of the 10/20 system. The ground and reference electrode were placed at coordinates theta=58, phi=78, and theta=90, phi=90, respectively. Electrode impedances were kept below 5 kΩ. Subsequent data analysis was performed in the Brain Vision Analyzer 2 software package (BrainProducts, Inc.): Offline, a band pass filter of 0.5–20 Hz (with a slope of 48 dB/oct each) and a notch filter of 50 Hz were applied. In the software package used to analyze the ERP data, even the sharp filter setting with a slope of 48 dB/oct cannot exclude residual 50Hz activity still being evident in the data. This can be seen when running an FFT of the data. Therefore, the 50-Hz notch filter is applied to eliminate even the residual 50-Hz contamination. It needs to be noted that the results reported below remained the same when a less steep filter slope (e.g., 16 db/oct) was used. Following that, raw data inspection was conducted manually to reject technical artifacts from the EEG. Then, an independent component analysis (ICA; Infomax algorithm) was conducted to remove recurrent artifacts. ICA components revealing horizontal and vertical eye movements, blinks, and pulse artifacts were manually discarded (number of discarded ICs: 4 ± 1.5). The data were then segmented for each emotional condition (angry, happy) for congruent as well as incongruent trials. Trials were only taken into account when the correct response was given within 1,000 ms of target onset. The segmentations were locked to target stimuli starting 250 ms prior to target onset and finishing 1,000 ms after its onset. Afterwards, an automated artifact rejection was applied for all the segments. Activity below 0.5 μV in a 100-ms period and a maximal value difference of 200 μV in 200 ms within the epoch were used as rejection criteria. If an artifact was detected in a trial, the trial was discarded. Overall, ~1.6% of trials were discarded due to these issues. To eliminate the reference potential from the data and to re-reference the data, we applied a current source density (CSD) transformation (Kayser & Tenke, 2015; Nunez & Pilgreen, 1991), which also serves as a spatial filter resulting in values for amplitudes in μV/m2. A baseline correction from -200 ms to 0 prior to target onset was applied and the relevant ERP components, P1 (at P7 and P8: after target presentation onset), N1 (at P7 and P8: 140–180 ms), N2 (at Cz: 220–240 ms), P3 (at C4: 550–660 ms), and CSP (at FCz: 430–470 ms) were identified by means of scalp topography. Within these search intervals, a semi-automatic peak detection was conducted for the ERP components P1, N1, and N2. For the P3 and the CSP an area export was conducted for each corresponding search interval. The choice of these search intervals for ERP quantification was validated using a statistical approach outlined in Mückschel et al. (2014). In so doing, the above time intervals were taken and the mean amplitude within the defined search intervals was determined for each of the 60 electrode positions. Then, to compare each electrode against an average of all other electrodes, Bonferroni correction for multiple comparisons (critical threshold, p = .0007) was used. Only electrodes that displayed significantly larger mean amplitudes (i.e., negative for the N- potentials and positive for the P-potentials) when compared to other electrodes were chosen. This procedure revealed the same electrodes as those chosen by visual inspection.

For the source localization analyses, we used sLORETA (standardized low resolution brain electromagnetic tomography) (Pascual-Marqui, 2002), which provides a single solution to the inverse problem (Marco-Pallarés, Grau, & Ruffini, 2005; Pascual-Marqui, 2002). The intracerebral volume is partitioned into 6,239 voxels at 5-mm spatial resolution. Then, the standardized current density at each voxel is calculated in a realistic head based on the MNI152 template. There is mathematical evidence that sLORETA provides reliable results without a localization bias (Sekihara, Sahani, & Nagarajan, 2005). Moreover, there is evidence from EEG/fMRI and neuro-navigated EEG/TMS studies underlining the validity of the sources estimated using sLORETA (Dippel & Beste, 2015; Hoffmann, Labrenz, Themann, Wascher, & Beste, 2014). The voxel-based sLORETA images were compared between groups and experimental conditions using the sLORETA-built-in voxel-wise randomization tests with 2,000 permutations, based on statistical nonparametric mapping (SnPM). Voxels with significant differences (p < .01, corrected for multiple comparisons) between contrasted conditions and groups were located in the MNI-brain.

Pupil diameter recording and analysis

Pupil diameter recordings and analysis were conducted using established protocols (Dippel et al., 2017; Mückschel, Chmielewski, et al., 2017; Mückschel, Gohil, Ziemssen, & Beste, 2017b; Wolff et al., 2018). The pupil diameter was recorded using a RED 500 eye-tracking device and the software iView X (RED 500 eye-tracking device and the software iView X (SensoMotoric Instruments GmbH) at a 500-Hz sampling rate. Before the start of the experiment, the device was calibrated using a 9-point calibration. Eye blinks were automatically interpolated by the eye-tracking recording software. For all subjects, the mean pupil diameter of both eyes was used for data analysis. The pupil diameter did not differ between the left and right eye (p > .95) and the diameter of the left eye was strongly correlated with that of the right eye (r >.96; p<.001). The pupil diameter data and EEG data were synchronized using the EYE-EEG extension (Dimigen, Sommer, Hohlfeld, Jacobs, & Kliegl, 2011) for EEGLab (Delorme & Makeig, 2004), (http://www2.hu-berlin.de/eyetracking-eeg) run on MATLAB (Mathworks, Inc., Natick, MA, USA). For data analysis, the pupil diameter data were low-pass filtered (IIR filter with 20 Hz at a slope of 48 dB/oct each). Remaining artifacts (e.g., blink artifacts) were manually removed by means of raw data inspection. Segmentation and baseline correction were conducted the same way as the EEG data. To integrate the pupil diameter and EEG data we used Pearson correlations between the pupil diameter data and the ERP data. For this, the segments were split into time bins of approximately 2 ms (i.e., the length of each sampling point for a sampling rate of 500 Hz). For the integration of EEG and pupil diameter data, every EEG data time bin was correlated with every pupil diameter data time bin across all participants. Since there is a low signal-to-noise ratio in the single-trial ERP data, no within-subject correlations were calculated according to recent protocols (Chmielewski et al., 2017; Mückschel, Gohil, et al., 2017).

Statistical analysis

Correct responses (%) and RTs were analyzed using mixed-effects ANOVAs using the within-subject factors “congruency” (congruent vs. incongruent) and “emotion” (happy and angry) and the between-subject factor “group” (DOP/controls). For the neurophysiological data, the factor “electrode” (for P1 and N1) was added in the model. Greenhouse-Geisser correction was applied where appropriate and post hoc tests were Bonferroni-corrected.

Results

Behavioral data

Accuracy data

The descriptive behavioral data for both groups, i.e. mean RTs (ms) and accuracy (%) for the congruency conditions is shown in Fig. 2, including the standard error of the mean (SEM).

Fig. 2
figure 2

Summary of the behavioral results. (a) Accuracy (%) and (b) reaction times (RTs) (ms) for each group (DOP group and controls), data (mean ± SEM) are for congruent and incongruent emotional trials. Incongruent trials elicited slower RTs and less accurate responses than congruent trials. Emotional interference effects were greater within the DOP group, where the distracting stimulus dimension 2 required deeper levels of processing due to intermittent queries

With regard to accuracy, the mixed-effects ANOVA revealed a main effect of “congruency,” (F(1,38) = 47.06;p < .001; ηp2 = .553). Overall, participants’ performance was significantly more accurate on congruent trials (88.56 % ± 1.17) than on incongruent trials (81.50 % ±1.46). Furthermore, there was a main effect of “emotion” (F(1,38) = 11.35; p < .002; ηp2 = .230), showing that performance on happy target emotions (86.70 % ± 1.14) was more accurate compared to angry target emotions (83.35 % ± 1.49) (refer to Fig. 1 in the Electronic Supplemental Material). Most importantly, there was a significant interaction “congruency × group” (F(1,38) = 15.69; p < .001; ηp2 = .292). To further analyze this interaction, we calculated the average hit rate values for each group for congruent as well as incongruent trials (pooled across emotion conditions): Independent-samples t-tests revealed that hit rates in the DOP group on congruent trials (91.19 % ± 1.35) were significantly higher compared to hit rates on congruent trials (85.92 % ± 1.92) in the control group, (t(38) = -2.24, p<.05). Hit rates in the DOP group on incongruent trials (82.94 % ± 1.67) compared to hit rates on incongruent trials (80.05 % ± 2.21) in the control group did not significantly differ (t(38) = .99, p =.33). However, since we were focused on the effects of depth of processing on conflict effects (i.e., performance differences between congruent and incongruent trials), we compared hit-rate differences within each group.

Post hoc paired t-tests for each group revealed that hit rates within the DOP group on congruent trials (91.19 % ± 1.35) were significantly higher than on incongruent trials (80.05 % ± 2.21); (t(19) = -6.86, p <.001). Likewise, for the control group, hit rates on congruent trials (85.92 % ± 1.92) were significantly more accurate than on incongruent trials (82.94 % ± 1.67); (t(19) = -2.36, p<.05), but the size of this effect was smaller in the control than in the DOP group (refer to the t-values). This shows that the interaction is driven by the DOP group. There were no other main or interaction effects (all F ≤ 1.89; p ≥ .378).

Reaction times

With regard to the RTs, the mixed-effects ANOVA revealed a main effect of “congruency”, (F(1,38) = 22.48; p < .001; ηp2 = .372). Overall, participant’s reactions was significantly faster on congruent trails (485 ms ± 10) than on incongruent trials (503 ms ±12). Furthermore, there was a main effect of “emotion” (F(1,38) = 12.89; p = .001; ηp2 = .253) showing that responses to happy target emotions (490 ms ± 11) were faster than to angry target emotions (499. ms ± 11) (refer to Fig. 2 in the Electronic Supplemental Material). There was a significant interaction “congruency × group” (F(1,38) = 12.60; p < .001; ηp2 = .253).

To further analyze the congruency × group interaction, we calculated the average RTs for each group for congruent as well as incongruent trials (pooled across emotion condition). Independent-samples t-tests revealed that RTs in the DOP group on congruent trials (527 ms ± 15) were significantly slower compared to RTs on congruent trials (444 ms ± 11) in the control group, (t(38) = -4.23, p<.001). Likewise, RTs in the DOP group on incongruent trials (559 ms ± 20) were significantly slower compared to RTs on incongruent trials (449 ms ± 12) in the control group (t(38) = -4.71, p <.001). Since the main aim was to investigate the effects of manipulating depth of processing on emotional Stroop effects (i.e., performance differences between congruent and incongruent trials), we compared RT differences within each group. Post hoc paired t-test comparison for each group revealed that responses in the DOP group on congruent trials (527 ms ± 15) were significantly faster than on incongruent trials (559 ms ± 20); t(19) = -4.37, p <.001). For the control group, RTs on congruent trials (444 ms ± 11) and incongruent trials (449 ms ± 12) did not significantly differ, t(19) = -1.88, p = .074). There were no other main or interaction effects, (all F ≤ 1.29; p ≥ .417).

Neurophysiological data

As our behavioral data showed that the important interaction of “congruency × group” was not further modulated by emotion and there was no interaction of “emotion × group”, the factor emotion (i.e., positive or negative) does not modulate the pattern of results. Hence, we did not analyze this factor for the EEG data. Any effects that may have occurred in the EEG data are not possible to interpret and reflect epiphenomena due to the lack of behavioral importance.

The P1 and N1 ERP-components are shown in Fig. 3. The N2 ERP-component is shown in Fig. 4a, the CSP ERP-component in Fig. 4b.

Fig. 3
figure 3

P1 and N1 ERP components shown for electrode P7/P8 and representative topography maps of the scalp electrical potential for P1 and N1 at the peak of each ERP-component for each group (DOP group and controls). In the topographies, red denotes positivity, and blue negativity. The lines represent congruent emotional trials. Time point zero denotes onset of the target stimulus

Fig. 4
figure 4

(a) N2 ERP-component at electrode Cz and (b) conflict-slow potential at electrode FCz for each group (DOP group and controls). The corresponding topography maps of the scalp electrical potential are shown for the peak of N2- ERP component. In the topographies, red denotes positivity, and blue negativity. Time point zero denotes onset of trial stimulus. The lines represent congruent/incongruent emotion trials

With regard to the P1, we found a significant main effect of “electrode” (P8/P7), (F(1,38) = 13.55; p<.000; ηp2 = .263) in the mixed-effects ANOVA, showing the P1 amplitude to be increased at the electrode P8 (49.52 μV/m2 ± 3.41) in comparison to P7 (37.21 μV/m2 ± .3.25). No other significant main effects or interactions were observed (all F ≤ 2.83; p ≥ .075). With regard to the repeated-measures ANOVA for N1 amplitudes, there were no significant main or interaction effects (all F ≤ 1.50; p ≥ .229). Similarly, in the repeated-measures ANOVA for the N2 amplitudes (at electrode Cz), there were neither main nor interaction effects (all F ≤ 1.31; p ≥ .259). For the CSP component (at electrode FCz), mixed-effects ANOVA did not reveal any significant main or interaction effects (all F ≤ 0.82; p ≥ 507).

The P3 ERP-component is shown in Fig. 5.

Fig. 5
figure 5

P3 ERP components shown for electrode C4 and corresponding topography maps of the scalp electrical potential for P3 at the relevant time-window (550–650 ms) for each group (DOP group and controls). In the topographies, red denotes positivity, and blue negativity. The lines represent congruent emotional trials. Time point zero denotes onset of the target stimulus. There were significant differences between P3 amplitudes on congruent and incongruent trial in the DOP group, where the distracting stimulus dimension 2 required deeper levels of processing but not for the controls

In the mixed-effects ANOVA for the P3 ERP-component (at electrode C4), a main effect of congruency was observed (F(1,38)= 11.77; p = .001; ηp2 = .241), showing P3 amplitudes to be increased on congruent trials (16.73 μV/m2 ± 1.49) in comparison to incongruent trials (15.51 μV/m2 ± 1.41). Moreover a “congruency × group” interaction was observed (F(1,38)= 8.86; p =.005; ηp2 = .193). Independent-samples t-tests revealed that on congruent trials there was no significant difference between average P3 in the DOP group (13.56 μV/m2 ± 2.95) compared to congruent trials (17.54 μV/m2 ± 2.31) in the control group. In incongruent trials P3 amplitude in the DOP group (11.00 μV/m2 ± 3.21) also did not significantly differ from P3 amplitude within the control group (17.68 μV/m2 ± 2.01), (all t ≥ 1.67; p ≥ .103).

For the DOP group, post hoc paired t-tests showed that average P3 amplitudes significantly differed between congruent (13.56 μV/m2 ± 2.95) and incongruent trials (11.00 μV/m2 ± 3.21) (t(19) = 4.13, p = .001). For the control group, there was no significant difference in P3 amplitude between congruent (17.54 μV/m2 ± 2.31) and incongruent trials (17.68 μV/m2 ± 2.01), (t(19)=.409, p=.687).

Pupil diameter integration with EEG

As can be seen in Fig. 6a, pupil dilation differed between the groups for a time range around 500–700 ms after stimulus onset. Mixed-effects ANOVA for the pupil dilation data revealed no main or interaction effects (all F ≤ 3.00; p ≥ .091). The results from the correlation analysis of the pupil diameter data with the ERP data are shown in Fig. 6b.

Fig. 6
figure 6

Pupil diameter and EEG data integration and sLORETA results. Part a shows changes in pupil dilation (in mm) as a function of time, for each group (DOP group/warm colors and controls/cool colors) for congruent and incongruent trials. Part b shows the correlation between time-specific changes within pupil diameter (x-axis) and the ERP time course at electrode C4 (y-axis) for each experimental group. The sLORETA plots show significant activation in the insular cortex (BA13) and the precuneus (BA7) for the time period in which correlations between the pupil diameter and the ERP data were evident. Part c shows correlation between time-specific changes within pupil diameter (x-axis) and P1/N1 electrodes P7/P8 (y-axis) for each experimental group. Significant correlations (p<.05) are highlighted by black outline, positive correlations are shown in yellow, and negative correlations are shown in blue

Positive correlations are shown in hot colors, negative correlations are shown in cold colors. Only significant correlations are shown. As can be seen in Fig. 6b, there was a significant positive correlation of the pupil diameter data within the time range between 250 and 300 ms after target stimulus presentation and the ERP time-series data with the time range between 700 and 1,000 ms after target presentation in the DOP group (r > .57; R2 = .32; p < .01) at electrode C4, which was shown to reveal modulations in the P3 amplitudes. Notably, no significant correlations were obtained for the control group (all r < -.2; R2 = .04; p > .4), in which the depth of processing was not experimentally increased. Using sLORETA, we examined which brain structures were activated in the time period showing the correlations between pupil diameter and ERP activity in the DOP group. Thus, we took the time range from 700–1,000 ms after target stimulus and compared the mean activity in this time window against zero. The results reveal that areas in the superior parietal cortex (BA7) and the insular cortex (BA13) were associated with modulations in neurophysiological activity in this time window (see Fig. 6b).

The specificity of effects is further underlined by the correlation analyses using electrodes P7/P8 (see Fig. 6c), which also revealed no significant correlations between the pupil diameter data and the ERP data (all r < -.2; R2 = .04; p > .4) in the above time window where correlations between pupil diameter data and the EEG data were evident and whereo an interaction “congruency × group” was also obtained. As can be seen in Fig. 6c, there are some significant correlations between pupil diameter data and EEG data at electrodes P7/P8. Considering that pupil reactions occur with an approximate delay of about 200 ms relative to LC-NE activity (Dippel et al., 2017; Jepma & Nieuwenhuis, 2011; Joshi et al., 2016), it may be argued that these occur due to attentional processing of the P1 and N1 time windows. However, there was no interaction “congruency × group” in P1 and N1. Therefore, these correlations do not seem to reflect meaningful effects induced by the experimental manipulation and, therefore, cannot be interpreted.

As shown above, the correlations between the pupil diameter data and the ERP data in the DOP group were substantial. A correlation coefficient of r > .5 can be considered a “medium to strong” correlation (Evans, 1996). The obtained effect size for differential P3 amplitude effects was η2p = .24. With a sample size of N=20 subjects and an alpha level of α = .05, the achieved power, as calculated using the G*Power software package, in this study is greater than 95%. This shows that the study is sufficiently powered and that the obtained dissociation, i.e., lack of correlations between pupil diameter data and ERP data in the control group and the electrode sites reflecting attentional selection processes is reliable.

Discussion

The aim of the current study was to investigate the specific mechanisms underlying the effect of depth of processing (DOP) of emotional stimuli on the processing of emotional interference effects. We did so by integrating neurophysiological, source localization, and pupil diameter data obtained from two groups of participants performing an emotional Stroop task. Overcoming emotional Stroop interference effects requires accurate and efficient processing of the task-relevant emotional stimulus dimension, while ignoring the distracting effects of a simultaneously presented task-irrelevant but still conflicting/interfering emotional stimulus dimension. Importantly, the emotional Stroop effect, similar to non-emotional Stroop effects, represents a “stimulus-stimulus conflict” (S-S conflict) (Hommel, 2015). Therefore, we detailed how DOP affects neural mechanisms underlying this sort of conflict monitoring. We hypothesized that manipulating the DOP of the distractor stimulus dimension should increase emotional conflicts and should complicate decision-making processes regarding which response to select. These decision-making processes were expected to be modulated by the NE system in the DOP group, as indexed by correlations of the pupil diameter data and the EEG data in the P3 time window.

The behavioral results showed that emotional Stroop effects were stronger in the DOP group, in which the depth of processing of the irrelevant emotional word dimension of the Stroop stimulus was increased. Specifically, we found that within the DOP group, response accuracy was lower and response times slower compared to the control group. The emotional Stroop effect represents a “stimulus-stimulus conflict” (S-S conflict), which refers to competition and interference between two semantically similar stimulus dimensions that are simultaneously presented (Hommel, 2015). The intermittent queries within the DOP condition prompted participants in this group to allocate processing resources to the task-irrelevant stimulus dimension, which increases demands to resolve the emotional conflict compared to the control group. Thus, the behavioral data clearly shows that a manipulation of the DOP of irrelevant yet interfering emotional stimulus dimension constituting the S-S conflict modulates the strength of emotional conflicts. The neurophysiological data provide insights into what kind of cognitive mechanisms are affected by that, and revealed very specific effects.

We found no differential modulation explaining the behavioral data for N1 or P1 ERP-components. Thus, perceptual and early attentional processes reflected by these components are unlikely to underlie the differences observed on the behavioral level (Herrmann & Knight, 2001). Interestingly, neither did the N2 ERP-component reflect the interactive effects on the behavioral level. With regard to the lack of N2-modulation it is important to consider that studies reporting N2 differences in response to conflict usually employ quite simple conflicting stimuli (e.g., Flanker or Simon paradigms). This allows faster conflict detection and resolution processes than complex emotional stimuli (Chen et al., 2016; Chmielewski & Beste, 2017; Larson et al., 2014; Shang et al., 2016). This was to be expected, and is in line with previous investigations where we found no effects on N2 amplitude using this specific set of stimuli employed in other versions of the paradigm (Schreiter et al., 2018a, 2018b). Neither did we find interactive effects within the CSP data, which suggests that conflict monitoring or resolution processes in general are not modulated by varying the processing depth of emotional interfering stimuli. However, since results suggest that the CSP is mostly affected by pro-active control processes (Larson et al., 2014; Pan et al., 2016; Shen et al., 2013), this is an expected finding.

Importantly, interactive effects of congruency and group were observed for the P3 ERP-component. This is in line with our a priori hypothesis. The P3 component has consistently been found to be modulated in emotional Stroop conflict and tends to be stronger on emotionally congruent than incongruent trials, and more positive following negative compared to non-aversive/threatening emotion trials (Clayson & Larson, 2013; Delplanque et al., 2004; Larson et al., 2014; Schreiter et al., 2018b; Yang et al., 2013). Our findings show that P3 modulations between congruent and incongruent emotional Stroop trials were stronger in the DOP group compared to the control group. Thus, P3 amplitude is reflective of the level of processing allocated to particular stimulus dimensions (Hong et al., 2014; Kok, 2001; Murphy et al., 2011). The P3 has been shown to reflect decision-making processes (Gray et al., 2004; Twomey et al., 2015) and has been found to be more enhanced with increasing demand for conflict control (Mennes et al., 2008). Likely, the P3 is modulated by the quantity of evidence required to trigger a decision (Twomey et al., 2015). Obviously, such a decision process becomes complicated by forcing participants in the DOP group to fully evaluate the task-relevant and the task-irrelevant dimension of the emotional Stroop stimulus. Notably, the results from the analysis of the pupil diameter data as an indirect index of the NE system activity are in line with the interpretation that decision processes are modulated by the manipulation of the DOP of emotionally conflicting stimuli. According to the adaptive-gain theory, the NE system plays a major role for more complex decision processes, and it has been suggested that the NE system facilitates neural processes at task-relevant decision points (Aston-Jones & Cohen, 2005). Importantly, in the DOP group, the pupil diameter was selectively correlated with the neurophysiological data in the P3 time window. This strongly suggests that the NE system selectively modulates neural mechanisms during decision processes, which is in line with theoretical conceptions (Nieuwenhuis et al., 2005). It is well established that pupil reactions occur with an approximate delay of about 200 ms relative to LC-NE activity (Dippel et al., 2017; Jepma & Nieuwenhuis, 2011; Joshi et al., 2016). Thus, the obtained positive correlations suggest that greater pupil diameter at a time point of around 700–1,000 ms post-stimulus onset is related to a larger/stronger P3 (at about 500–660 ms). However, even when assuming that this is not the case and the correlations fall after the P3 time window, this is still a reasonable finding because the late positive potentials are known to reflect affective picture processing (Hajcak & Olvet, 2008), for which covariation with autonomic arousal has also been reported (Cuthbert, Schupp, Bradley, Birbaumer, & Lang, 2000).

Larger pupil diameters have been shown to reflect higher NE concentrations (Hou et al., 2005b; Phillips et al., 2000). Therefore, a higher NE activity is associated with more demanding decision-making processes when the DOP of the irrelevant emotional distractor dimension is elevated. Crucially, modulations of neural processes by the NE system have been referred to as modulators of “gain control” (Aston-Jones & Cohen, 2005). Elevated gain control facilitates information processing (Salinas & Thier, 2000) and is evident at sensory and cognitive processing levels (Salinas & Thier, 2000; Servan-Schreiber, Printz, & Cohen, 1990). The obtained results are well in line with such gain-modulation mechanisms. Interestingly, several lines of evidence suggest that parietal areas in particular are subject to gain-control processes (Adelhöfer et al., 2018; Salinas & Thier, 2000). In line with this, the sLORETA analysis showed that areas in the parietal cortex (BA7) and the insular cortex (BA13) were activated during the time period in which correlations between the pupil diameter data and the neurophysiological data were evident. This suggests that the NE system modulates complicated decision-making processes during emotional conflicts via a parietal-insular network. Interestingly, BA7 has recently been found to be modulated by gain-control processes under conditions of complex incoming sensory information (Adelhöfer et al., 2018). Several lines of evidence suggest an involvement of BA7 in cognitive control decision processes whenever information is complex and probably difficult to categorize but essential for behavioral control (Bodmer & Beste, 2017; Bodmer, Mückschel, Roessner, & Beste, 2018; Fokin et al., 2008; Ocklenburg, Güntürkün, & Beste, 2011; Takeichi et al., 2010). Moreover, the precuneus (BA7) has been implicated in the processing of facial (emotional) information (Cheng et al., 2017; Kim et al., 2015; Zhang, Song, Liu, & Liu, 2016; Zhao, Zhao, Zhang, Cui, & Fu, 2017). In the current experiment, the processing of the facial emotional dimension was complicated in the DOP group, because participants in this group were forced to evaluate the distracting semantic word information. It is therefore reasonable that functional neuroanatomical structures are involved, which are known to be activated whenever information is complex and probably difficult to categorize but essential for behavioral control. The insular cortex likely integrates sensory, cognitive, and emotional processes (Gogolla, 2017), plays a role in the interpretation of emotional information, and is involved in emotional and non-emotional interference processing (Gogolla, 2017; Simmons et al., 2008; Xue et al., 2016). It thus seems that the NE system modulates decision processes during emotional conflicts that are associated with functional neuroanatomical structures known to be important when information is complex and probably difficult to categorize but essential for informing behavioral control.

A limitation of the study is that possible interindividual differences regarding the processing of positive and negative emotions were not controlled for and examined. However, since the obtained effects revealed a reasonable effect size, it is unlikely that this has strongly biased effects. We would also like to acknowledge limitations concerning the sample characteristics that may have impacted on the results but were not explicitly controlled for in the current study. We did not collect data on individual different aspects of emotional wellbeing, intelligence, and socio-economic status of the participants. While it is possible that these factors influence the ability to overcome emotional conflict, it must be emphasized that both groups (DOP and control group) were randomly selected from a healthy and culturally homogenous student sample (German mother tongue, no psychiatric background, no regular medication) and, hence, any variations in the above-mentioned factors would have likely been there in either group. Thus, the group differences are most likely due to the existence of the query enhancing the relative importance of the distractor word and deeper processing of the stimuli in the DOP group. In this regard, a possible study limitation is that the control group was not confronted with infrequent queries unrelated to the emotional content of the distractor dimension. It is arguable, however, whether the addition of another unrelated question for the control group, would not induce more confounding effects (such as, switch costs), rather than control for the mere existence of a query. It would be interesting to include a control query in future studies.

An important future direction for the study of processing depth of emotional conflicting stimuli is related to the study design. In the current study, we employed a between-subject design in order to avoid effects or task order (learning effects) on the results. Though task-order effects could be ruled in data analyses, this would be interesting to investigate in future studies to further explore individual differences in DOP processing.

In summary, the study examined what cognitive-neurophysiological mechanisms are modulated when the depth of processing of emotional conflicting stimuli is varied. The study shows that when processing depth of conflicting stimuli is increased, emotional conflict effects become stronger. The EEG data show that this was associated with modulations of decision-making processes, as reflected by the P3 ERP-component. Notably, the integration with pupil diameter data suggests that these decision processes were modulated by the NE system, especially when the depth of processing of interfering emotional stimulus dimensions was increased. This likely reflects gain modulation processes in facilitating processing of complex interfering, emotional information. The source localization results suggest that regions in the parietal and insular cortex are associated with these modulatory effects.