Introduction

Major depressive disorder (MDD) is associated with abnormalities in patterns of information processing and emotional reactivity (Beck et al., 1979; Foti et al., 2010). Negative potentiation theories posit that negative mood states activate specific cognitive structures and increase the processing of negative, schema-congruent stimuli (Scher et al., 2005). This distinctive information processing bias, which may be called “negativity bias,” seems to be most pronounced and detrimental in the context of processing of interpersonal information (Gilboa-Schechtman et al., 2002). The prototype stimulus carrying interpersonal information is the human facial expression. Accordingly, there is considerable evidence that MDD involves specific abnormalities in the cognitive and neural processing of emotional faces (Chen et al., 2014; Gilboa-Schechtman et al., 2002; Gotlib et al., 2004; Leppänen, 2006; Shestyuk et al., 2005; Suslow et al., 2004).

Context information influences face processing

Because a face is never presented out-of-context, contextual features of a person in a specific situation provide information that may either complement or contradict the actual facial expression (Wieser & Brosch, 2012). Hence, the effective processing of social information requires the integration of both types of information. Studies with nonclinical populations have shown that affective context information, such as affective visual scenes in the background of presented facial expressions, have an impact at early, supposedly automatic and unconscious stages of processing (e.g., N170: Righart & de Gelder, 2006) as well as at later, more conscious processing stages (e.g., Early Posterior Negativity (EPN) or Late Positive Potential (LPP): Diéguez-Risco et al., 2013; Wieser et al., 2014). Additionally, context information is capable of modifying the extent of self-reference of a stimulus or a situation. Self-reference refers to the extent that the information of a stimulus is addressed to the subject. High self-reference may be provided by a descriptive sentence, such as “this person likes you,” whereas a third-person statement, such as “this person likes someone” provides no self-reference. Regardless of the valence of the stimulus and the context information, the LPP (Wieser et al., 2014) as well as the EPN (Klein et al., 2015) were modulated by the extent of self-reference of the context. Moreover, an interaction between self-reference and valence, with self-relevant neutral context information leading to most pronounced LPP amplitudes was found in nonclinical individuals (Klein et al., 2015). In line with previous research (Fields & Kuperberg, 2012; Hirsh & Inzlicht, 2008), this finding has been interpreted as being indicative of a higher motivation to evaluate the valence of neutral faces because of their ambiguous manner.

Emotion and face processing in depressive individuals

Neuroimaging studies found abnormalities in areas associated with self-referential processing in depressive patients (e.g., default mode network structures or rostral anterior cingulate cortex (Sheline et al., 2009; Wagner et al., 2015). The authors suggested that depressed patients are less capable of shifting their attention away from negative self-related stimuli. However, electroencephalographic (EEG) studies that examine potential abnormalities in context integration in the processing of facial stimuli in depressed patients are still scarce. Therefore, we planned to extend previous findings regarding cortical processing of facial stimuli presented in a context containing information about the affect as well as self-reference to a sample of depressed individuals (Klein et al., 2015; Wieser et al., 2014).

With respect to previous studies, the present study focused on N170, EPN, and LPP components (Klein et al., 2015; Righart & de Gelder, 2006; Wieser et al., 2014). The N170 is a prominent component that is reliably triggered by face stimuli. This process seems to be altered by depression (Dai & Feng, 2012), albeit some studies could not find any differences at this stage (Foti et al., 2010; Jaworska et al., 2012). However, activation of the N170 in depressive patients seems to be dependent on the affective expression of faces that patients are confronted with. In a study by Chen et al. (2014), subjects with major depression showed a decreased N170 for the processing of neutral and happy faces, but an increased N170 for sad faces. The authors argued that these results may show the altered emotional processing and the specific cognitive bias for negative emotions.

The LPP has been associated with emotional processing of faces and other stimuli in a motivated attention framework (Frenkel & Bar-Haim, 2011; Hajcak et al., 2009; Schupp et al., 2004), particularly with the decoding and appraisal of affective meaning (Schupp et al., 2006; Wessing et al., 2013). Compared with healthy controls, LPP responses to unpleasant emotional stimuli have been shown to be reduced in depressed individuals (Foti et al., 2010; Kayser et al., 2000; MacNamara et al., 2016). Within depressive individuals, however, responses to emotional faces and to self-referential emotional words involved enhanced LPPs (Auerbach et al., 2015; Burkhouse et al., 2017). Particularly, in a self-referential encoding paradigm, depressed subjects showed greater early LPP positivity to negative versus positive words (Shestyuk & Deldin, 2010). Similarly, Jaworska et al. (2012) reported augmented LPPs in reaction to sad faces in depressive subjects suggesting an attentional bias towards mood-congruent stimuli.

The EPN is a mid-latency component usually starting at around 200 ms post-stimulus that has been connected to enhanced emotional processing (Hajcak et al., 2010; Wieser et al., 2014). To date, only poor evidence links depression to EPN activity. Consistent with the negative potentiation hypothesis, though, depressed individuals have recently been reported to exhibit enhanced EPN amplitudes in reaction to both negative, neutral, and positive stimuli compared with nondepressed subjects (Connell et al., 2015). The authors proposed that depression was associated with greater early attentional engagement for emotionally valenced stimuli.

In addition to the N170, the P100, a positive deflection occurring at occipital sites at around 100 ms after stimulus onset, has been shown to be associated with face processing (Herrmann et al., 2005; Righart & de Gelder, 2008). While the N170 is reported to reflect a stage of configural processing, the P100 was suggested to reflect a stage of face detection (Itier & Taylor, 2004). Contrasting with the N170, however, information on modulations of the P100 by depression in the processing of facial stimuli has been scarce. With respect to findings indicating that depressed individuals show greater P100 amplitudes (Auerbach et al., 2015) and decreased P100 latency (Yang et al., 2011) in response to negative self-referential words, it was suggested that the P100 component reflects a promising means of assessing self-referential biases in depression (Auerbach et al., 2015). Therefore, as a novelty, analyses of the P100 component in the processing of facial stimuli were included in the present study.

Emotion processing in depression related to childhood maltreatment

Recently, different clinical subtypes of depression have been proposed (Sharpley & Bitsika, 2014). Most prominently, a subtype of depression that is related to experiences of childhood maltreatment was suggested which is characterized by an earlier onset, a more chronic course, and more suicide attempts (Andersen & Teicher, 2008; Zisook et al., 2007). However, research examining its underlying physiological characteristics is still at its beginnings. Therefore, for exploratory purposes, the present study sought to examine the influence of childhood maltreatment on information processing in depressive patients. In doing so, we planned to investigate the influence of various types of physical, emotional, and sexual childhood maltreatment as well as relational peer victimization differentially. Accordingly, differential effects of various forms of childhood maltreatment on psychopathology, emotional functioning as well as facial emotion processing were previously reported (Dong et al., 2004; Humphreys & Zeanah, 2015; Iffland & Neuner, 2020; McLaughlin et al., 2014; Roth et al., 2018; Teicher & Samson, 2013; Zeanah & Sonuga-Barke, 2016). Addressing previous studies reporting enhanced ERPs in maltreated children at P100 and P3b components (Pollak et al., 2001; Pollak & Tolley-Schell, 2003; Shackman et al., 2007), the present study sought to particularly examine associations of the P100 and P3b components and various types of childhood maltreatment in patients with depression.

Study objective and hypotheses

As an extension of prior work in healthy individuals (Klein et al., 2015), we examined whether depressive individuals show abnormal cortical processing of stimuli presented in a context containing information about the affect as well as self-reference. In healthy participants, we expected to find similar modulations of the N170, EPN, and LPP as presented by Klein et al. (2015). Also, we predicted a general enhancement in a) socially threatening context information in comparison with physically threatening and neutral context information, and in b) self-relevant stimuli compared to other-relevant stimuli. In depressive individuals, we assumed that depressive individuals present with a lack of adaptive context integration, which may lead to a generally heightened impact of negative, self-related context information (Wagner et al., 2015). More specifically, based on previous findings (Chen et al., 2014; Wagner et al., 2015), we hypothesized an enhanced activity on the N170 in depressed individuals in response to faces presented in a negative, self-related context, while healthy controls show more pronounced N170 amplitudes to neutral and other-related context information (Dai & Feng, 2012). With respect to the LPP, we expected that depressive subjects exhibit weaker responses to faces than healthy controls (Foti et al., 2010; Kayser et al., 2000; MacNamara et al., 2016). Within the group of depressive individuals, we assumed most pronounced LPP responses in reaction to faces with negative, self-related context information (Auerbach et al., 2015; Burkhouse et al., 2017; Jaworska et al., 2012; Shestyuk & Deldin, 2010). Based on previous research regarding the EPN component (Connell et al., 2015), we expected larger EPN responses in depressed individuals, irrespective of context information. Finally, we hypothesized that P100 amplitudes were largest in reaction to negative, self-related stimuli in the depressive sample, while healthy individuals’ P100 responses were not different in response to different emotional contexts (Auerbach et al., 2015). For exploratory purposes, associations of the P100 and P3b components and childhood maltreatment in patients with depression were examined.

Methods

Participants

Participants were recruited through the Outpatient Psychotherapy Clinic of Bielefeld University and through newspapers and magazines as well as bulletins at the campus of Bielefeld University. Depressive patients recruited from the Outpatient Clinic were currently on the waiting list for therapy and had shown signs of depression in an initial clinical interview. In order to validate a current diagnosis of depression, depressed patients underwent an extensive structured clinical interview (German version of the SCID-I: Wittchen et al., 1997), administered by trained clinical psychologists. Exclusion criteria for depressive patients included (a) any current DSM-IV Axis I psychiatric disorder other than major depression without a co-morbid depressive disorder, (b) evidence of a current substance abuse or dependence, (c) evidence of current or past psychosis, and (d) evidence of acute suicidal intentions or ideation. Participants recruited from newspapers and bulletin boards were used as a healthy control group and underwent a brief structured clinical interview (German version of the Mini-International Neuropsychiatric Interview (M.I.N.I.); Sheehan et al., 1998) before the experiment to make sure that they had no current or known lifetime history of axis I DSM-IV psychiatric disorders (American Psychiatric Association, 2000). Depressive patients (n = 21) had a mean age of 39.48 years (standard deviation [SD] = 14.43, range 18–58 years, 9 females), whereas participants of the healthy control group (n = 20) had an average age of 39.20 years (SD = 16.02, range 20–66, 8 females). Of the 21 patients with depression, 14 were diagnosed with a recurrent MDD and 7 were diagnosed with a first episode MDD. Comorbid DSM-IV diagnoses in the group of depressed patients, based on the SCID-I-assessment, were specific phobia (n = 3), social phobia (n = 3), generalized anxiety disorder (n = 1), posttraumatic stress disorder (n = 2), panic disorder without agoraphobia (n = 1), hypochondriasis (n = 1), and anorexia (n = 1). Current use of psychotropic medication was reported by 47.6% (n = 10) of the depressive patients (antidepressant: n = 10; anxiolytic: n = 3). Participants of the healthy control group received either course credits (when studying at Bielefeld University) or financial compensation for participation. In the depressive sample, participants did not receive financial compensation. Instead, participants were offered prompt and detailed psychological diagnostic sessions within the present study. Afterwards, depressive patients received detailed written and oral diagnostic feedback. Participants in both groups had normal or corrected-to-normal vision. All subjects received and signed an informed consent before the experiment. The experimental procedure was approved by the Ethics Committee of Bielefeld University. The study was in accordance with the declaration of Helsinki. Table 1 presents participants’ means on the assessments.

Table 1 Participants’ characteristics and mean values on the assessments (N = 41)

Instruments

Symptoms of depression

Symptoms of depression were measured using the German version of the Beck Depression Inventory (BDI-II; Hautzinger et al., 2006). The self-report measure consists of 21 items. The items are rated on a 4-point scale indicating the severity of symptoms. Higher scores indicate more severe depressive symptoms. Standardized cutoff scores indicate sumscores >13 as mild depression. The BDI-II has shown good psychometric properties in nonclinical and clinical samples (Kühner et al., 2007). In the current sample, internal consistency of the BDI-II was excellent (Cronbach’s α = 0.95).

General psychopathology

As a measurement of psychopathology and psychological distress in general, we used the German version of the Brief Symptom Inventory (BSI; Franke, 2000). The BSI is a short form of the Symptom Checklist 90 (SCL-90) and consists of 53 items, producing the same nine primary symptom dimensions (somatization, obsessive-compulsive symptoms, interpersonal sensitivity, depression, anxiety, hostility, phobic anxiety, paranoid ideation, and psychoticism). Additionally, three global indices measure general psychological distress: the Global Severity Index (GSI); the Positive Symptom Total (PST); and the Positive Symptom Distress Index (PSDI). The items are rated on a 5-point Likert scale, ranging from 0 (not at all) to 4 (extremely) and relates to the experience of the past 7 days, including the day of assessment. In the present study, the GSI was used to indicate participants’ psychological distress. The BSI showed high internal consistency in the current sample (Cronbach’s α = 0.97).

Childhood maltreatment

The German version of the Childhood Trauma Questionnaire (CTQ; Wingenfeld et al., 2010) was used to assess different types of childhood maltreatment (sexual abuse, emotional neglect, emotional abuse, physical neglect, and physical abuse). The items are rated from 1 (never true) to 5 (very often true) with a possible range of subscale scores from 5 to 25. The psychometric properties of the German version are similar to the original version, and it has been shown to be a reliable and valid screen for childhood maltreatment. In the current sample, internal consistency was excellent for all items (Cronbach’s α = 0.94). Similarly, internal consistency of the subscales sexual abuse, emotional neglect, emotional abuse, and physical abuse was good to excellent (all Cronbach’s α’s > 0.83). The subscale physical neglect, however, showed only a questionable internal consistency (Cronbach’s α = 0.69), which is in line with prior reports of weak psychometric properties of this subscale of the German version (Klinitzke et al., 2012).

Relational peer victimization

Relational peer victimization was assessed using the Fragebogen zu belastenden Sozialerfahrungen [Adverse Social Experiences Questionnaire] (FBS; Sansen et al., 2013). The FBS consists of 22 items describing aversive social situations, such as rejection, exclusion, being laughed at, insulted, and teased by peers. For each situation, respondents were asked whether or not they have experienced this situation during childhood (age 6-12 years) or adolescence (age 13-18 years). The total score is calculated as a sum of “Yes” responses across both age periods and ranges from 0 to 44. The total score of the FBS presented with a satisfying stability over a 20-month period (r = 0.89) (Sansen et al., 2013). In the current sample, internal consistency of the FBS was excellent (Cronbach’s α = 0.91).

Stimulus Material

Photographs of faces of 22 different Caucasian individuals (11 females, 11 males) were taken from the Radboud Faces Database (Langner et al., 2010). The RFDB offers a variety of different facial expressions, representing different emotions, as well as neutral facial expressions—all of them presented at different camera angles. For the present study, only neutral faces with frontal orientation were included. The pictures were resized to a resolution of 371 × 556 pixels. There was no further standardization in terms of contrast or luminance, because the randomized presentation of all faces would rule out all possible systematic influences.

The context stimuli consisted of 18 German sentences from three different valence categories (physically threatening, socially threatening, and neutral) and were couched in a self-related and other-related fashion as well (e.g., “He wants to hit you.” vs. “He wants to hit someone.”). Physically threatening sentences described situations where an aggressor intends to conduct actual physical violence or uttering violent threats (e.g., “He wants to smash your face in.). Socially threatening sentences were focused on intimidation or the impending loss of social belonging or rank (e.g., “She finds you abhorrent.”). Neutral sentences were characterized by descriptions of nonjudgmental or nonthreatening behaviors or situations (e.g., “He is sitting next to you.”). The stimulus set has been developed by Klein et al. and has been evaluated regarding differences in arousal and valence (Klein et al., 2015).

Procedure

To test our assumptions, we presented three different classes of emotions (physically threatening, socially threatening, and neutral) as written statements that preceded faces with inherently neutral expressions. As a second factor, the context sentences were either directed to the participant or to other persons to modulate the self-referential aspect of the context sentences. Each context sentence and face stimulus was randomly paired for each trial to be able to examine a short-term, one-time influence instead of learning effects.

For the presentation of the experiment, the software package Inquisit 4.0.3 (Millisecond Software, Seattle, WA) was used. The experiment was shown on a 19-inch-TFT-monitor (60-Hz refresh rate), which was located approximately 60 cm in front of the participant. The participants were asked to focus their attention on the center of the screen and passively view the displayed pairs of faces and sentences. The paradigm has been established in a previous study (Klein et al., 2015). Each sentence (self-related/physically threatening, self-related/socially threatening, self-related/neutral, other-related/physically threatening, other-related/socially threatening, other-related/neutral) was pseudo-randomly paired with one of the 22 faces. Each sentence was automatically worded in an appropriate gender-specific fashion (e.g., female face – female personal pronoun). Because the experimental setting itself did not provoke any distractions that would divert focus from the screen, we did not include a fixation cross in our design. In order to ensure that differences in the ERPs are only caused by the influence of the sentences, there was no fixed combination of any face-sentence pair. The experiment consisted of six blocks, each block contained 48 randomly chosen trials, leading to a total number of 288 trials. Each of the six conditions (self-related/physically threatening, self-related/socially threatening, self-related/neutral, other-related/physically threatening, other-related/socially threatening, other-related/neutral) was presented 48 times. Each trial started with the presentation of a sentence for 2,900 ms (interstimulus-interval randomized between 900-1,500 ms), followed by a face, which was presented for 500 ms. The intertrial-interval consisted of an empty gray background and varied randomly between 1,900 and 2,600 ms. Between each block, there was a short break for approximately 2 minutes.

EEG preprocessing and statistical analyses

EEG data was continuously recorded from 128 active electrodes (BioSemi Active Two System; www.biosemi.com), online referenced to the Common Mode Sense (CMS) - Driven Right Leg (DRL) ground. Four additional electrodes were placed above and below the right eye as well as on the outer canthi of both eyes to record horizontal and vertical eye-movements. Online data were recorded with 2,048 Hz.

Preprocessing was performed using SPM8 for EEG data (http://www.fil.ion.ucl.ac.uk/spm). Offline data was re-referenced to the average reference, downsampled to 250 Hz and bandpass-filtered between 0.166 and 30 Hz with a fifth order zero-phase. Filtered data were epoched from 100 ms before to 600 ms after presentation of the neutral faces and baseline corrected (100 ms to 0 ms prestimulus). In a first step, artifact trials exceeding a high threshold (>150 μV) were rejected. Finally, data were averaged using the robust averaging algorithm (Litvak et al., 2011). This averaging method is a unique feature of SPM, down weighting outliers for each channel and time point, which preserves a higher number of trials while controlling for a variety of possible artifacts. We used the recommended weighting function, which preserves approximately 95% of the data points drawn from a random Gaussian distribution. After averaging, visual inspection of the data was performed and bad channels were identified. On average 1.3% of all sensors were interpolated (M = 1.68; SD = 2.02). On average, 4.33 trials were rejected keeping 43 trials for each single condition. A mixed repeated measure ANOVA showed there were no differences in the number of rejected trials between controls and patients (F(1, 39) = 2.20, p = 0.146, η2 = 0.053), nor between self- and other-related context (F(1, 39) < 0.01, p = 0.957, η2 < 0.001), nor between the three emotional valences (F(2, 78) = 2.19, p = 0.119, η2 = 0.053). Similarly, no significant interactions between self-reference and emotion (F(2, 78) = 0.76, p = 0.471, η2 = 0.019), group and self-reference (F(1, 39) = 0.39, p = 0.537, η2 = 0.010), group and emotion (F(2, 78) = 2.65, p = 0.077, η2 = 0.064), and between group, self-reference, and emotion (F(2, 78) = 0.73, p = 0.485, η2 = 0.018) were found.

Components and electrode clusters based on previous studies of contextual influences on face perception (Klein et al., 2015; Wieser et al., 2014). To validate expected time windows in the current sample, we used a collapsed localizer for the P1 and N170 (Luck & Gaspelin, 2017) and collapsed negative-neutral difference waves for the EPN and LPP. However, because difference waves for the EPN did not show a profound negativity, the EPN component was captured around the N2 wave amplitude using the same electrode set as for the N170 (Bruchmann et al., 2020; Schindler et al., 2019, 2020). We investigated differences on early (P100, N170), mid-latency (EPN), and late stages of face processing (LPP). The P100 was measured between 90 and 120 ms, using a parieto-occipital cluster (12 electrodes: PPOz, POz, POOz, Oz, P1, PPO3, PO3, POO3, O1, P2, POO4, PO4, O2). The N170 was detected between 120 and 170 ms and the EPN between 250 and 450 ms after stimulus onset. For both components, two bilateral symmetric occipital sensor clusters of eight electrodes each were examined (left: I1, OI1, O1, PO9, PO9h, PO7, P9, P7; right: I2, OI2, O2, PO10, PO10h, PO8, P10, P8). Finally, LPP effects were investigated between 450 and 600 ms over centroparietal locations (20 electrodes: CCP5, CCP3h, CCP3, CCP4, CCP4h, CCP6, CPz, CPP3, CPPz, CPP4, P5, P3, P1, Pz, P2, P4, P6, PO3, POz, PO4). For the exploratory analyses, the P3b was detected between 300 and 450 ms. The P3b showed a maximum over centroparietal electrodes (13 electrodes: FCC1, FCC2, Cz, CCP1, CCPz, CCP2, CP1, CPz, CP2, CPPz, P1, Pz, P2). In all statistical analyses, mean voltage activity in the given time intervals for the mentioned electrodes was used.

To ensure that ERN score averages were reliable, the number of trials needed to achieve a reliability threshold of 0.80 was calculated for each diagnostic group (Table 2). A reliability threshold of 0.80 was deemed acceptable based on established guidelines for ERP score reliability (Clayson & Miller, 2017a). ERP score reliability as a function of the number of trials needed for a stable average and diagnostic group was examined using the ERP Reliability Analysis (ERA) Toolbox v 0.3.2 (Clayson & Miller, 2017b). The ERA Toolbox calculated ERN score dependability based on algorithms from generalizability theory and used CmdStan v 2.10.0 (Stan Development Team, 2016) to implement the analyses in Stan (Carpenter et al., 2017). Furthermore, Spearman-Brown split-half reliabilities were calculated for all conditions on all components (Table 2). All components showed acceptable to excellent reliabilities. However, because reliability of the LPP component was badly affected, data of one participant of the depressive group was excluded from LPP analyses.

Table 2 ERP reliability scores for each ERP component

Statistical tests on EEG data and data visualization were performed with EMEGS (http://www.emegs.org/, Peyk et al., 2011). For statistical analyses, 2 (group: depressive patients vs. healthy controls) x 2 (self-reference: self-related vs. other-related) x 3 (emotional valence: socially threatening, neutral, physically threatening sentences) repeated measures ANOVAs were conducted to investigate differences between conditions in all time windows of interest. Because stronger effects over right sensors are found in a number of studies on emotion processing or contextual threat in the N170 and EPN time window (Hinojosa et al., 2015; Wieser et al., 2010), laterality was included as an additional factor (laterality: left vs. right). Effect sizes were calculated for all statistical tests (Cohen, 1988). When necessary, additional post-hoc ANOVAs as well as paired t-tests were conducted separately for different emotional valences, differently referenced conditions, and depressive patients versus healthy controls. Post-hoc t-tests were adjusted for multiple comparisons using False discovery rate (FDR) correction (Benjamini & Hochberg, 1995). When Mauchly’s test indicated violation of the sphericity assumption, Greenhouse–Geisser corrections were applied and original degrees of freedom together with Greenhouse–Geisser ε are reported. Additionally, when bivariate Spearman rank correlations indicated significant associations of mean ERP amplitudes and measures of childhood maltreatment, all ANOVAs were performed as analyses of covariance (ANCOVAs) with childhood maltreatment scores serving as covariates. As the pattern of results did not change, only ANOVAs are reported. For exploratory purposes, correlation analyses were conducted to examine the associations of childhood maltreatment, measured by scores on the CTQ and FBS, with mean ERP amplitudes of the P100 and P3b components in depressive patients. To control for severity of depressive symptoms and because all maltreatment scales were not normally distributed (all W’s < 0.89, all p’s < 0.05), partial Spearman’s rank order correlations were calculated with the BDI score serving as control variable. The results are reported as rho coefficients with their respective p values. Statistical significance was set at p < 0.05.

Results

Preliminary analyses

As expected, depressive patients showed significantly higher scores on psychopathology as measured by the BDI and BSI (Table 1). Additionally, depressive patients reported higher levels of emotional neglect, while reports of other forms of childhood maltreatment as well as relational peer victimization did not differ significantly between the depressive and the control group. Moreover, groups differed with respect to their family status, with healthy controls reporting more frequently being single. Furthermore, preliminary analyses showed significant bivariate associations between measures of psychopathology and childhood maltreatment and ERP amplitudes across conditions, particularly on the P100 component. See Supplement 1 for more details on bivariate Spearman rank correlations between measures of childhood maltreatment and psychopathology and ERP mean amplitudes.

In addition, in a preliminary analyses step ERP amplitudes of the healthy control group were analyzed to examine whether findings of Klein et al. (2015) could be replicated in the present sample (Supplement 2). Here, findings on the N170 component fully replicated the findings by Klein et al. (2015), whereas results regarding EPN and centroparietal LPP slightly differed. As presented by Klein et al. (2015), a main effect of self-reference was found on the EPN. However, laterality did not have a significant effect on EPN amplitudes in the present sample. In line with findings regarding the centroparietal LPP indicated by Klein et al. (2015), the interaction between self-reference and valence also was significant in the present analysis.

Event-related brain potentials

Means and standard deviations of the mean amplitudes of the event-related brain potentials (ERPs) components are presented in Table 3.

Table 3 Means and standard deviations of mean amplitudes of ERP components

P100 (90-120 ms)

For the P100, a significant main effect of group was found (F(1, 39) = 5.10, p = 0.030, partial η2 = 0.116). Depressive patients showed larger mean P100 amplitudes than healthy controls (Fig. 1). Further main or interaction effects were not significant (all F’s < 2.21; all p’s > 0.145; Table 4).

Fig. 1
figure 1

P1 main effect of group. The difference topographies display differences between depressive patients and controls, highlighting the P1 cluster. The electrode curve displays the mean activity over the occipital P1 cluster for both groups separately. The difference plot contains 95 % bootstrap confidence intervals of average group differences

Table 4 F, p, and η2 values for the ANOVA analyzing mean P100 amplitudes (90-120 ms)

N170 (120-170 ms)

Regarding the face-specific N170 component, the repeated measures ANOVA did not show any significant effect related to depression (all F’s < 2.36; all p’s > 0.101; Table 5). In all participants, however, a significant main effect of self-reference was found (F(1, 39) = 6.92, p = 0.012, partial η2 = 0.151; Fig. 2). Larger mean N170 amplitudes for faces in self-related compared to other-related conditions were found. Additionally, a significant main effect of laterality was found (F(1, 39) = 5.52, p = 0.024, partial η2 = 0.124). Here, mean N170 amplitudes were more negative over the right compared to the left electrode cluster. Further main or interaction effects did not reach significance (all F’s < 2.11; all p’s > 0.155).

Table 5 F, p, and η2 values for ANOVAs analyzing mean amplitudes of N170 and EPN components
Fig. 2
figure 2

N170 and EPN main effects of reference. The difference topographies display differences between self- and other-related trials across patients and controls, highlighting the N170/EPN cluster. The electrode curve displays the mean activity over the occipital N170/EPN cluster. The difference plot contains 95 % bootstrap confidence intervals of intra-individual differences

EPN (250-450 ms)

For the EPN, depression did not show any significant main or interaction effects (all F’s < 1.83; all p’s > 0.184; Table 5), although a significant main effect of self-reference was found in the whole sample (F(1, 39) = 15.64, p < 0.001, partial η2 = 0.286; Fig. 2). Larger mean EPN amplitudes for faces in self-related conditions than in other-related conditions were found. In addition, the repeated measures ANOVA showed a significant main effect of laterality (F(1, 39) = 6.56, p = 0.014, partial η2 = 0.144). In contrast to the N170 time window, mean amplitudes were more negative over the left compared with the right electrode cluster. Furthermore, a significant interaction of self-reference and laterality was found (F(1, 39) = 5.34, p = 0.026, partial η2 = 0.120). The main effect of self-reference over the left electrode cluster (F(1, 39) = 7.28, p = 0.010, partial η2 = 0.157) was substantially smaller than over the right electrode cluster (F(1, 39) = 25.03, p < 0.001, partial η2 = 0.391). There were no further significant main or interaction effects (all F’s < 1.16; all p’s > 0.307).

LPP (450-600 ms)

For the LPP over the centroparietal cluster, there was no significant main or two-way interaction effect related to depression (all F’s < 1.46; all p’s > 235; Table 6). Additionally, the ANOVA showed no significant main effect of emotional valence (F(2, 76) = 3.16, p = 0.059, partial η2 = 0.077). A significant three-way interaction of group, self-reference, and emotional valence was found (F(2, 76) = 4.30, p = 0.017, partial η2 = 0.102) (Fig. 3). Resolving this interaction, additional post-hoc 2 (self-reference) x 3 (emotional valence) repeated measures ANOVAs were conducted for groups separately. In the depressive group, the post-hoc ANOVA revealed a significant main effect of self-reference (F(1, 19) = 5.07, p = 0.036, partial η2 = 0.211). Depressive patients showed larger LPP responses to faces in self-related than in other related context. Furthermore, a significant main effect of emotional valence was found (F(2, 38) = 4.87, p = 0.013, partial η2 = 0.204). Post-hoc analyses showed that LPP responses to faces after socially threatening sentences were larger than to faces after neutral sentences (t(19) = 3.56, p = 0.006). There were no significant differences in the reactions to faces in socially threatening versus physically threatening contexts or physically threatening versus neutral contexts (all t’s < 1.99, all ps > 0.093). The interaction of self-reference and emotional valence did not reach significance (F(2, 38) = 1.36, p = 0.268, partial η2 = 0.067). Explorative FDR corrected post-hoc analyses revealed that depressive patients showed a tendency toward larger LPP responses to faces after other-related socially threatening sentences than after other-related neutral sentences (t(19) = 2.72, p = 0.070). Moreover, depressive patients showed more pronounced mean LPP amplitudes in reaction to faces in a self-related socially threatening and neutral context compared with the faces presented in an other-related neutral context (all t’s > 3.00; all p’s < 0.050). In addition, a tendency toward larger mean LPP amplitudes was found in reaction to faces after self-related physically threatening faces than in reaction to faces after other-related neutral sentences (t(19) = 2.39, p = 0.096). Additional significant t-tests were not found in depressive patients (all t’s < 2.31, all ps > 0.100). In healthy controls, the post-hoc repeated measures ANOVA showed no significant main effects of self-reference (F(2, 38) = 0.07, p = 0.799, partial η2 = 0.003) and emotional valence (F(2, 38) = 1.07, p = 0.338, partial η2 = 0.053, ε = 0.739). However, a significant interaction of self-reference and emotional valence was found (F(2, 38) = 3.27, p = 0.049, partial η2 = 0.147). FDR corrected post-hoc t-tests did not show any significant differences (all t’s < 2.28, all p’s > 0.510). However, without FDR correction, separate post-hoc analyses for self-reference conditions showed larger LPP responses for faces in self-related socially threatening contexts than for faces in self-related neutral contexts (t(19) = 2.28, p = 0.034), whereas no differences were detected in other-related contexts (all t’s < 0.80, all p’s > 0.433). Moreover, no significant main or interaction effects were found in the initial repeated measures ANOVA (all F’s < 1.17; all p’s > 0.317).

Table 6 F, p, and η2values for ANOVAs analyzing mean amplitudes of the LPP component
Fig. 3
figure 3

LPP interaction effect between group, reference, and emotional valence. The difference topographies displays differences between emotion categories for self- and other-related trials, highlighting the LPP cluster. a) Differences within depressive patients. Mean activity over the centro-parietal LPP cluster for all conditions. b) Differences within healthy controls. Mean activity over the centroparietal LPP cluster for all conditions. All difference plots contain 95% bootstrap confidence intervals of intra-individual differences

Exploratory analyses on childhood maltreatment and neural activity in patients with depression

In depressive patients, exploratory analyses showed significant correlations between the severity of maltreatment and the mean amplitudes at P100 and P3b stages. While there were significant positive associations of the CTQ sumscore, physical abuse, emotional abuse, and relational peer victimization with the mean amplitude of the P100 component, a significant negative correlation between physical abuse and the mean P3b amplitude was found (Table 7).

Table 7 Partial Spearman rank correlations between childhood maltreatment and mean electrocortical activity at P100 and P3b stages

Discussion

In a study of contextual face processing, we found evidence for altered information processing in individuals with depression. By randomly pairing sentences and faces in each trial of the experiment, the present study was designed to investigate one-time and short-term affective context modulation effects (instead of contextual learning processes), which have been demonstrated before for healthy individuals (Klein et al., 2015). Across conditions, depressive patients showed an augmented mean P100 amplitude when confronted with neutral faces that were preceded by emotional context information. At the same time, mean LPP amplitudes of depressive patients were larger in response to faces presented in a self-related context than to faces presented in an other-related context. In addition, LPP amplitudes of depressive patients were more pronounced in response to faces presented after socially threatening sentences than in response to faces presented after neutral sentences. Furthermore, exploratory post-hoc analyses revealed that LPP amplitudes of depressive patients were modulated by emotional valence only when presented in an other-related context, while healthy controls rather showed differentiated mean LPP amplitudes in reaction to emotionally valenced faces when these were presented in a self-related context. Moreover, self-related conditions led to larger mean amplitudes than other-related conditions in both samples. However, contrary to our expectations, we could not find specific differences between depressive patients and healthy controls with regard to the manipulations of self-reference or emotional valence on N170 and EPN stages. Furthermore, exploratory analyses showed that among depressive patients a history of childhood maltreatment was associated with mean amplitudes at P100 and P3b stages.

P100

In line with previous research (Auerbach et al., 2015; Yang et al., 2011), the P100 component was modulated by depression. However, against our hypotheses, P100 amplitudes were not specifically enhanced in reaction to self-related negative stimuli, but depressed patients presented with a generally heightened early responsiveness, leading to higher mean P100 amplitudes regardless of valence and self-relevance. Hence, instead of a generally attenuated responsiveness to affective stimuli (Foti et al., 2010) or a specifically amplified responsiveness for negative, self-referential stimuli (Auerbach et al., 2015; Scher et al., 2005), findings suggest a generalized early potentiation of encoding interpersonal information in depressive individuals. A possible explanation for the observed heightened responsiveness on the P100 may arise from findings of an impaired recognition ability of facial expressions in depressive patients in that they attributed emotional valence to affectively neutral faces (Leppänen et al., 2004). Similarly, meta-analyses on facial emotion expression processing in major depression revealed that individuals with depression are prone to evaluate positive, neutral, or ambiguous facial expressions more negatively than healthy controls and have impaired emotion recognition for every facial expression except sadness (Bourke et al., 2010; Dalili et al., 2015). With respect to the present study, especially when being confronted with negative and neutral context information, depressed patients could have suffered from a sustained negatively biased interpretation of the inherently neutral facial expressions presented in the experiment. This could have led to the observed intensified processing on the ERP level, according to the proposed negativity bias. This also would be in line with previous reports of depressive subjects responding faster to negative face expressions and selective attention toward threatening stimuli (Delle-Vigne et al., 2014; Mathews et al., 1996). Although this is a very first trial to examine the P100 component in the processing of facial stimuli in depressed subjects and replication is warranted, findings suggest that depressive individuals engage in early, automated processing of affective and referential information, which may be negatively biased and may serve to reinforce and intensify depressive symptoms.

N170 and EPN

Contrasting with prior research and our hypotheses (Chen et al., 2014; Connell et al., 2015), depressive and healthy individuals did not show divergent N170 and EPN amplitudes dependent on differential context information. The functional meaning of the N170 and EPN as well as the paradigm that was used in the present study may play a crucial role in explaining missing effects on these components. As stated earlier, the P100 is suggested to reflect early categorization processes at a stage of face detection (Herrmann et al., 2005; Itier & Taylor, 2004; Liu et al., 2002), whereas the N170 component reflects configural and facial emotion-specific processing (Feuerriegel et al., 2014). The EPN has been connected with more enhanced emotional processing (Hajcak et al., 2010). Hence, for the modulation of N170 and EPN differences on emotional valence are required. With respect to the present paradigm, it may be suggested that the combination of different emotionally valenced sentences with neutral faces was not strong enough to evoke emotion-specific modulations of the N170 and EPN. That is, the faces may still have been received as emotionally neutral. Unfortunately, no affective ratings were conducted during the course of the current study; therefore, we cannot preclude nor confirm this hypothesis. In line with this assumption, however, prior studies pointed out that alterations of the N170 component in depressive patients were dependent on affective expressions of the faces (Foti et al., 2010; Jaworska et al., 2012). Hence, the neutral faces used in the present study, although combined with emotionally valenced sentences, do not seem to be sufficient to evoke differences on N170 and EPN stages between depressive patients and healthy controls. The process of face detection, reflected by the P100, however, may be more sensitive in depressive patients.

LPP

Contrasting with previous studies (Foti et al., 2010; Kayser et al., 2000; MacNamara et al., 2016), depressive individuals did not show weaker LPP amplitudes than healthy controls, although differentiated processing of faces presented after emotionally valenced sentences in healthy controls and depressive patients could be found. In line with previous research (Wagner et al., 2015), depressive patients showed most pronounced LPP responses to faces with self-related context information and to faces with socially threatening information. However, against our hypotheses and previous literature (Auerbach et al., 2015; Burkhouse et al., 2017; Jaworska et al., 2012; Shestyuk & Deldin, 2010), LPP responses did not differ between emotional valence categories within the self-related condition, whereas depressive patients showed a tendency toward increased reactions to socially threatening stimuli when presented in an other-related context. Hence, it may be suggested that LPP reactions may be indicative of a rather generalized enhanced sensitivity in individuals with MDD. Exploratory analyses showed that mean LPP amplitudes in the self-related conditions were more pronounced than in the other-related neutral condition. On the premise that the other-related neutral condition may serve as a baseline condition, results may be considered in terms of a sustained negatively biased interpretation of neutral facial expressions, irrespective of the actual affective valence (Bourke et al., 2010; Dalili et al., 2015; Delle-Vigne et al., 2014; Mathews et al., 1996; Siegle et al., 2002). However, because LPP modulation is more sensitive to differences in self-reference (Klein et al., 2015; Wieser et al., 2014), the sustained negatively biased processing may not apply to other-related threats in general but may be restricted to socially threatening contexts. This, again, is in line with studies reporting a negativity bias in depressive individuals (Delle-Vigne et al., 2014; Mathews et al., 1996; Scher et al., 2005) addressing prior assumptions that processing of faces in depressive patients is particularly sensitive to social context information (Gilboa-Schechtman et al., 2002). Consequently, it may be assumed that depressive patients are more responsive to any negative social information, whether addressed to themselves or to other people. This proneness to process negative incoming social information may be crucial in the etiology and maintenance of depressive symptoms as it has been proposed in cognitive models of depression (Abramson et al., 1978; Beck, 1976; Bower, 1981; Teasdale & Dent, 1987).

In line with previous reports (Klein et al., 2015; Wieser et al., 2014), an interaction of self-reference and emotional valence could be detected in healthy individuals. Here, results were indicative of a stronger LPP modulation in self-relevant contexts with more pronounced mean amplitudes following neutral faces in a socially threatening context in self-related conditions. However, LPP findings in healthy individuals should be interpreted with caution because post-hoc comparisons did not withstand FDR correction for multiple comparisons.

Replication of previous findings in healthy individuals

Because the present study sought to extend previous findings regarding the effect of context information on facial processing (Klein et al., 2015), it is important to note that previous results could be replicated to a high degree in the healthy control sample as well as in the overall sample of the present study. In both samples, self-relevant context cues led to most pronounced activations on the N170 and EPN components, a result that fits well with the results of past studies (Klein et al., 2015; Wieser et al., 2014). Similarly, in line with findings regarding the centroparietal LPP indicated by Klein et al. (2015), the interaction between self-reference and valence showed a significant effect in the healthy control sample. Contrasting with our hypotheses, neutral faces that have been associated with socially threatening information were not preferentially processed compared with physically threatening contexts in the total sample. At this point, using affective second-hand information may have lacked power to build up an observable unique pattern of activation. However, despite the question of a possible distinction between socially and physically threatening negative context information, the results indicate that context information is capable of modulating the processing of neutral target stimuli in depressive patients, most likely as a top-down regulation function of a higher order alerting system.

Maltreatment related depression and its impact on emotion processing

On the basis of explorative analyses, the present study was one of the first to show significant associations of various kinds of childhood maltreatment including relational peer victimization with modulations of the P100 and P3b components in depressive patients. In line with previous studies (Dong et al., 2004; Gibb et al., 2007; Humphreys & Zeanah, 2015; McLaughlin et al., 2014), different kinds of maltreatment were differentially correlated with ERP modulations. With respect to the examined ERP components, findings of enhanced mean P100 amplitudes in maltreated depressive patients were in line with a previous study in physically abused children (Pollak & Tolley-Schell, 2003). This may be indicative of an enhanced selective attention for potentially threatening cues in maltreated depressive patients. Notably, findings indicate that in addition to physical and emotional abuse relational peer victimization may also cause alterations in responsivity to interpersonal stimuli. However, the negative association of mean P3b amplitudes with severity of physical abuse in depressive participants contrasts with previous findings of increased P3b in physically abused children (Pollak et al., 2001; Shackman et al., 2007). The age of the sample, the kind of paradigm, and current psychopathology may account for differences in the outcome of experiences of maltreatment on mean P3b amplitudes. In line with this assumption, Matz et al. (2010) could show that, regardless of the diagnosis, patients with adverse childhood experiences provided an attenuated cortical reaction to affective stimuli. Finally, the explorative analyses of the present study emphasize the importance of examining the differential impact of adverse life experiences on the development of depressive symptoms. Our findings support the idea of a maltreatment-related clinical subtype of depression (Andersen & Teicher, 2008; Zisook et al., 2007). As indicated in prior research (Pollak et al., 2001; Pollak & Tolley-Schell, 2003; Shackman et al., 2007), the P100 and P3b components seem promising candidates in marking this subtype on ERP level.

Limitations

The present study and its findings are limited by the small sample size. Recruitment of increased sample sizes would allow for more reliable analyses. However, replication of previous findings as well as dependability and split-half reliability scores support the reliability of the presented results. Additionally, identification of the EPN component was impaired, because difference waves for the EPN did not show a profound negativity. In line with previous studies (Bruchmann et al., 2020; Schindler et al., 2019, 2020), the EPN was captured around the N2 wave amplitude. Therefore, comparability to previous studies using the same study design may be limited (Klein et al., 2015). However, explorative analyses using the EPN window applied in the prior study did not show differential effects. With respect to reliability scores presented in Table 2, the current identification is suggested to better capture the EPN in the present study. Additionally, due to unbalanced blink artifacts, i.e., many depressive patients consistently blinked after stimulus offset at approximately 800 ms post-stimulus, later LPP effects occurring after 600 ms post-stimulus could not be examined in the present study. To deal with the problem of multiple (implicit) comparisons inherent in ERP research, time windows and electrode clusters for ERP components were based on a priori hypotheses and validated by using a collapsed localizer (Luck & Gaspelin, 2017). While post-hoc analyses within ERP components were FDR corrected, however, we did not correct for multiple comparisons across ERP components in the present study. Hence, findings and conclusions are limited as we cannot preclude that they are based on false-positive results. Moreover, the restricted range of experiences of childhood maltreatment may impair the validity of the correlation analyses. Although correlation analyses were conducted for explorative purposes only. A further limitation is the lack of obtained affective ratings of the supposedly neutral faces during the course of the experiment. By including affective ratings at different time points of the experiment, future studies should be able to determine a possible change in the subjective affective interpretation of the neutral facial expressions and thereby reveal potentially biased information processing. Moreover, the current study lacks an evaluation of the different sentences through valence and arousal ratings. Although the set of sentences was evaluated in a prior study (Klein et al., 2015), it remains open whether this evaluation may be generalized to the current sample. Hence, all conclusions regarding valence effects have to be drawn with caution. Another limitation was that analyses were not controlled for comorbidities of the depressed patients, especially anxiety disorders. There are many studies that point out possible information processing biases, e.g., for PTSD (Javanbakht et al., 2011) or social anxiety disorder (Mueller et al., 2009). However, because there were no systematic comorbidities in our study, the impact of a specific disorder is not assessable in the present study. Nevertheless, exploratory correlation analyses examining associations between ERP components and subscales of the BSI in the depressive sample revealed potential differences in depressive patients in emotion processing with regards to the presence of symptoms of anxiety and psychoticism. Symptoms of anxiety and psychoticism were associated with diminished amplitudes on N170 and EPN components, which may indicate weakened emotional processing in a subgroup of patients with depression. With respect to this exploratory findings and because it is known that the majority of depressive patients suffer from comorbid disorders (Löwe et al., 2008), future studies should try to control for this aspect. It would be of particular interest to examine populations with specific combinations of comorbidities. To our knowledge, there is still a lack of research regarding the possible subtypes of depression and the respective information processing abnormalities. Because it has been shown that first episode MDD and recurrent MDD patients show differential facial processing (Chen et al., 2014), recurrence and number of depressive episodes may play an additional role in differential processing in subtypes of depression. In the present sample, however, recurrent and first episode MDD patients were merged. Because of the rather small sample size, we could not control systematically and adequately for potential effects of symptom recurrence. Although exploratory analyses did not reveal any effects of recurrence on ERPs in the present sample, this limitation has to be kept in mind when considering the present results. Furthermore, inter-rater reliability of structured clinical interviews administered by trained clinical psychologists was not assessed in terms of rescoring the information given in the interviews. However, administration of interviews was supervised by experienced senior therapists, which suggests reliable and valid diagnoses.

To sum up, our study is in line with previous research reporting altered emotional processing in depression. Interestingly, our study showed a distinct processing pattern, particularly with a generalized heightened initial responsiveness to interpersonal information. Further research is needed to clarify whether this is a result of facilitated attention processes, which may play a crucial role in the etiology of depressive disorder. Further studies also should disclose the association of cortical reactivity and childhood maltreatment in depressive patients.