Introduction

Interactive alignment of motor behavior is a prominent phenomenon in our daily social lives. This includes synchrony, such as dancing (Tarr, Launay, & Dunbar, 2016), marching (Hove & Risen, 2009), or singing together (Pearce, Launay, & Dunbar, 2015), and also mimicry. Mimicry is the automatic alignment of postures to one’s interaction partner (Chartrand & Bargh, 1999). These phenomena may rely on a direct matching of the visual representation of a perceived action mapped onto the motor representations of the same action (direct matching hypothesis, Iacoboni et al., 1999; Rizzolatti, Fogassi, & Gallese, 2001). This shared representation between perception and execution of an action may evoke motor resonance in the observer’s motor system. For example, even when we just see someone lifting a finger, the motor representation of such a finger lifting movement is activated in our motor system. This link between the perception and execution of an action (i.e., perception-action link) may be related to mirror neurons originally discovered in monkey’s ventral premotor cortex (F5), firing when observing as well as when executing a specific action (Rizzolatti et al., 2001; Rizzolatti & Craighero, 2004). A human homologue of the mirror neuron system has been suggested (Gallese, Fadiga, Fogassi, & Rizzolatti, 1996; Hilt et al., 2017; Mukamel, Ekstrom, Kaplan, Iacoboni, & Fried, 2010; Rizzolatti et al., 2001; Rizzolatti & Craighero, 2004). Indeed, single-cell recordings in, amongst others, the supplementary motor area (SMA) show human mirror neuron activity (Mukamel et al., 2010).

The observation of an action may interfere with one’s own action agenda: the perception of a movement may facilitate one’s own if both match, yet may require inhibition if they do not. This phenomenon has been termed automatic imitation (Heyes, 2011). It is usually investigated using the automatic imitation task, which is a stimulus-response compatibility (SRC) paradigm, employing the fact that observation of a movement interferes with task execution of the same or a different movement (Brass, Bekkering, Wohlschläger, & Prinz, 2000). In the automatic imitation task, participants are asked to perform a simple action, such as lifting an index or a middle finger, when a number cue is shown. Yet, simultaneously, and irrelevant to the task at hand, a matching or mismatching finger lifting movement is present on the screen. Motor resonance with the concurrent on-screen movement, while irrelevant, requires interference control for successful task execution. A match between the presented movement and target response (e.g., middle finger lifting) facilitates participants’ motor responses in congruent trials (i.e., perception-action match). This speeds up response times. In contrast, a mismatch between concurrent on-screen movement (e.g., an index finger lifting) and task execution (e.g., middle finger lifting) (i.e., perception-action mismatch) requires inhibition of the respective motor resonance with the task-irrelevant movement for successful task execution in incongruent trials. This slows down reaction times. The mismatch is caused by an overlap between externally activated and internally generated motor representation (i.e., motoric self-other overlap). This motoric overlap may rely on shared representations of the perception of another’s action when executing the same action (Chartrand & Bargh, 1999). According to ideomotor theory, motoric behavior may be represented as an image of its produced sensory feedback (Brass, Derrfuss, Matthes-von Cramon, & von Cramon, 2003; Brass, Ruby, & Spengler, 2009; Brass, Derrfuss, & von Cramon, 2005; Prinz, 1990, 1997, 2005). Repeated coupling between the sensory image of an action and the execution of the same, may evoke a sense of motoric self-other overlap between one’s own movements and movement intentions and the other’s movements, via strengthened shared motor representations (Brass et al., 2009; Prinz, 2005). As such, the direct matching hypothesis between perception of an action and the activation of its internal motor representation (Iacoboni et al., 1999) suggests imitation as a prepotent response tendency (Brass et al., 2005). This may interfere with response execution as mentioned above. To counteract this prepotent imitation, we need so-called interference control mechanisms to distinguish between internally generated and externally activated motor representations. As one of those mechanisms, self-other distinction (SOD) has been suggested (Brass et al., 2005; Brass et al., 2009). Supporting evidence comes from a neuropsychological study showing that patients with frontal lesions exhibit problems inhibiting imitative response tendencies (Brass et al., 2003). This finding is corroborated by brain imaging studies, pointing to a role of the temporoparietal junction (TPJ) during interference control of overlapping action perception and execution, specifically the inhibition of imitation. It has been suggested that the TPJ may play a crucial role in SOD processes (Brass et al., 2005, 2009).

More recently, it has been suggested that interference control may involve an interaction between SOD and general cognitive control mechanisms (Cross, Torrisi, Losin, & Iacoboni, 2013). Cognitive control refers to the ability to configure and monitor thoughts and actions for task performance (Botvinick, Cohen, & Carter, 2004). One fundamental aspect of cognitive and motor control is the ability to detect and resolve conflict in the face of competing task-irrelevant stimuli and response execution (Miller & Cohen, 2001). This may require the implementation of regulative and evaluative control processes (Larson, Clayson, & Clawson, 2014). Regulative processes refer to the implementation of top-down control for behavioral adjustment, thus successful task execution. The anterior cingulate cortex (ACC), as one core brain hub for top-down control of cognitive conflict (Botvinick et al., 2004), has been attributed an essential role in evaluative control processes. This reflects cognitive/action monitoring processes, which signal the demand for implementing cognitive or motor control in line with task demands (Larson et al., 2014).

We took advantage of the high temporal resolution of event-related potentials (ERP) to describe the time course of interference control during the automatic imitation task. We assume interference control during automatic imitation to be constituted via an interplay between SOD and cognitive conflict/action monitoring. Specifically, we plan to extend this research further by investigating the modulation of this interplay by social context, which might facilitate disentangling the two. Findings from a behavioral study suggest that interference control is modulated by contextual factors such as group membership and emotional facial expressions (Rauchbauer, Majdandžić, Stieger, & Lamm, 2016).

To this end, we applied the Social-Affective Mimicry Task (SAMT) while measuring brain activation via electroencephalography (EEG). The SAMT is based on the established automatic imitation paradigm by Brass et al. (2000) and adds task-irrelevant social and affective context information by presenting happy and angry ethnically diverse in-group (white) and out-group (black) faces and hand stimuli (see Methods for details) (Rauchbauer et al., 2016; Rauchbauer, Majdandžić, Hummer, Windischberger, & Lamm, 2015). Determining the time course of the social modulation of automatic imitation can give insights into successful interference control. This can furthermore inform potential difficulties in the modulation of motor resonance in social contexts, such as, for example, in individuals with autism spectrum disorder (ASD) (Cook & Bird, 2012).

Cognitive conflict/action monitoring is prominently investigated by examining amplitude variations of the stimulus-locked N2 component and two response-locked components, the error-related negativity (ERN) on erroneous, and the correct-response negativity (CRN) on correct trials. Moreover, we investigated the N450 component concerning the proposed function in an exploratory post-hoc fashion, as suggested by a reviewer, to examine control-related processes in complex task settings. These components have been associated with monitoring and resolution of cognitive conflict and behavioral adaptation (Falkenstein, Hoormann, Christ, & Hohnsbein, 2000; Folstein & Van Petten, 2008; Pailing & Segalowitz, 2004).

Cognitive conflict/action monitoring studies usually assess the N2 component when they apply simple cognitive conflict tasks. The N2 is a negative amplitude deflection within 200–300 ms after the onset of stimulus presentation at frontocentral electrodes. In SRC paradigms, it varies according to stimulus congruency, with more negative amplitudes for incongruent than congruent trials (Folstein & Van Petten, 2008; Larson et al., 2014; Nieuwenhuis, Yeung, & Cohen, 2004; Yeung, Botvinick, & Cohen, 2004). It may have a functional significance representing strategic monitoring and action control (see for review Folstein & Van Petten, 2008). Importantly, N2 amplitudes are also sensitive to social and emotional manipulations, revealing for example an “own-race effect” with more negative N2 amplitudes for in-group compared with out-group faces (Ito & Bartholow, 2009; Ito & Urland, 2003, 2005; Kubota & Ito, 2007; Tanaka, Kiefer, & Bukach, 2004). In addition, more negative N2 amplitudes were reported for happy compared with angry facial expressions (Kubota & Ito, 2007). This has been suggested to reflect deeper processing due to perceptual individuation of faces beneficial to the participants (e.g., happy and in-group faces; Ito & Bartholow, 2009; Kubota & Ito, 2007).

Moreover, in an exploratory approach we also investigated the later occurring N450 component. The N450 may account for the complex set-up by which cognitive conflict is induced in the SAMT compared with the N2, which may account for conflict in less complex task set-ups. The N450 is a negative amplitude deflection within 400–550 ms after the onset of stimulus presentation at central electrodes. In complex SCR tasks, more negative amplitudes are observed for incongruent compared to congruent trials (Liotti, Woldorff, Perez III, & Mayberg, 2000; West & Alain, 1999). While its function is still be under debate (Larson et al., 2014, but see Galashan, Siemann, & Herrmann, 2019), it has been strongly associated with cognitive control and conflict detection processes (Hanslmayr et al., 2008; Larson et al., 2014; West, Bowry, & McConville, 2004; West, Jakubek, Wymbs, Perry, & Moore, 2005).

The response-locked component of choice in this study was the CRN, reflecting successful interference control on correct trials. We focused on the CRN variation because error rates were extremely low in our previous SAMT studies (Rauchbauer et al., 2015, 2016), leaving not enough trials for a reliable investigation of the ERN component (Olvet & Hajcak, 2009). The CRN component is a negative deflection around response execution elicited by correct trials at frontocentral electrodes and is related to the ERN component (Allain, Carbonnell, Falkenstein, Burle, & Vidal, 2004; Bartholow et al., 2005; Coles, Scheffers, & Holroyd, 2001; Vidal, Burle, Bonnet, Grapperon, & Hasbroucq, 2003; Vidal, Hasbroucq, Grapperon, & Bonnet, 2000). More negative CRN amplitudes were reported following correct responses in incongruent than congruent trials and moreover when the current trial type violated expectations (Bartholow et al., 2005). Its amplitude is further inversely related to error rates (Allain et al., 2004). These findings suggest that CRN amplitudes index strategic use of information to guide adaptive responses (Bartholow et al., 2005). Simons (2010) further proposed that CRN enhancement might have a protective function against error commission and indicates the degree of engagement of conflict/action monitoring during correct trials. CRN amplitude enhancement has also been shown in response to successful stereotype control related to ethnic group membership (Amodio, 2008; Amodio, Devine, & Harmon-Jones, 2008).

We further investigated the Pre-Motor Positivity (PMP), which has been related to motor commands for movement initiation (Bortoletto, Sarlo, Poli, & Stegagno, 2006; Deecke, Grözinger, & Kornhuber, 1976; Deecke, Scheid, & Kornhuber, 1969). It can be observed before the onset of motor execution (Deecke et al., 1969) and may be related to the inhibition of imitative actions (Shibasaki & Kato, 1975). The PMP is observed as a positive amplitude shift, which occurs at the end of the readiness potential (RP), a motor preparation ERP for ipsilateral finger movements (Deecke et al., 1969; Kornhuber & Deecke, 1965). The SMA has been suggested as one potential generator of the PMP (Deecke, Kornhuber, Lang, Lang, & Schreiber, 1985) and also has been ascribed mirror neuron functions (Mukamel et al., 2010).

Recently, we observed that the PMP component not only distinguished between movement congruency, with more positive amplitudes in congruent than incongruent trials. We also observed that PMP amplitude variation in incongruent relative to congruent trials predicted the respective response times of incongruent versus congruent trials. We attempted to replicate this finding, which we had interpreted as the PMP serving as a proxy for successfully resolved self-other distinction (Rauchbauer, Pfabigan, & Lamm, 2018). The process of self-other distinction most likely unfolds and operates during the whole time-course of stimulus processing and action selection and is supported by cognitive conflict/action monitoring. Despite our proposal that this might be reflected by the PMP component, no other ERP components have been described to reflect self-other distinction processes yet. This awaits further targeted experiments.

We expected more negative amplitudes on incongruent compared with congruent trials for the N2, the exploratory analysis of the N450, and the CRN, reflecting increased cognitive control/action monitoring demands. According to the above-mentioned literature, we expected the N2 to be most prominent for happy in-group faces. In line with results from a previous behavioral study (Rauchbauer et al., 2016), we specifically expected CRN amplitudes, and in an exploratory manner also N450 amplitudes, to be most negative for angry out-group trials compared with angry in-group trials. The angry out-group condition may be behaviorally most relevant and thus may require enhanced behavioral adaptation. In line with this, we expected the CRN and also the N450 to predict response times especially in an angry out-group context, with more negative amplitudes predicting longer response times on incongruent trials. Based on our previous results (Rauchbauer et al., 2018), we furthermore expected more positive PMP amplitudes on congruent than incongruent trials. Specifically, we expected more positive-going PMP amplitudes for happy in-group compared with happy out-group congruent trials due to increased self-other overlap. We furthermore expected that PMP amplitudes predict response times, which would replicate our previous results (Rauchbauer et al., 2018). We expected this to be most evident for happy in-group congruent trials, due to the hypothesized increased self-other distinction demands, with more positive amplitudes predicting faster response times. For behavioral results, we expected to replicate our previous results, demonstrating enhanced regulation of automatic imitation in response to angry out-group compared with angry in-group stimuli (Rauchbauer et al., 2016).

Methods

Participants

Thirty white participants took part in this EEG experiment. They received a financial compensation of 25 Euro. An initial data check identified two participants as outliers in the Attitudes toward black (ATB) scale with scores on the lower end of the scale, thus showing a very positive attitude toward the out-group compared with the rest of the sample (mean scores lower than the 25th percentile minus 1.5 times the interquartile range). Data from these participants were not included in any further data analysis to ensure a homogenous sample. The final sample consisted of 28 individuals (19 women, mean age: 24.58 years, standard deviation [SD] = 3.55 years). Participants were right-handed as assessed with the questionnaire by Oldfield (1971) and reported no prior medical history of neurological, cardiovascular, or psychiatric disorders and had normal/corrected to normal vision. Participants all signed an informed consent statement before the experimental task.

The present study was approved by the ethics committee of the University of Vienna and conducted in accordance with the Declaration of Helsinki (and its revision, 2013). Based on previous studies with a similar setup, a sample of 30 healthy volunteers was recruited. Please refer to the Supplementary Materials for details on statistical power.

Social-Affective Mimicry Task (SAMT)

Stimuli and experimental design

Participants performed the Social-Affective Mimicry Task (SAMT) (Rauchbauer et al., 2015, 2016), which is a modified version of the imitation-inhibition task by Brass et al. (2000). The simultaneous perception of a task-irrelevant action leads to interference (e.g., facilitation or inhibition) with a target movement (Brass et al., 2000). Matching motor perception and target action (e.g. presented and target movement are the same) constitutes congruent trials, which facilitate action execution and speed up response times. An action-perception mismatch (between presented and target movement) constitutes incongruent trials, which result in slower reaction times for movement execution. This is due to a required inhibition of the automatic action-perception link for the sake of correct response execution (Brass et al., 2000). On baseline trials, the task-irrelevant hand does not move. Regulation of automatic imitation is measured as the mean response time difference between incongruent and congruent trials (Rauchbauer et al., 2015). Baseline trials were not included in EEG analysis, due to the differing visual input of movement on (in-) congruent and no movement in these trials.

In the SAMT, task-irrelevant, female facial stimuli are simultaneously presented above the hand stimulus (with a white or black wrist; Fig. 1), depicting white in-group, or black out-group members, with happy or angry facial expression (face stimuli taken from NimStim Set of Facial Expressions (Tottenham et al., 2009). The factors Group (In-group, Out-group), Emotion (Happy, Angry) and Congruency (Congruent, Incongruent, Baseline) constituted a 2x2x3 factorial within-subjects design.

Fig. 1
figure 1

Experimental setup of SAMT. (A) Timeline of Social-Affective Mimicry Task (SAMT). Example of one trial; Face stimuli taken from the NimStim Set of Facial Expressions (Tottenham et al., 2009). (B) Ethnic (black and white) hand stimuli: upper row shows an example of an in-group congruent trial (task relevant cue indexing index finger movement, simultaneous task-irrelevant index finger lifting movement); lower row shows an example of an out-group incongruent trial (task relevant cue indexing index finger movement, simultaneous task-irrelevant middle finger lifting movement). Hand pictures by BR.

The same in- or out-group face model was used for both happy and angry emotional expressions. The concurrently presented task-irrelevant hand stimulus depicted a frontal shot of a left hand, mirroring participant’s right hand. This hand was either white or black, in accordance with the simultaneously presented face. The hand (wearing a beige glove, ethnicity visible at the wrist) performed a simultaneous finger lifting movements. Gloves were used to ensure that a potential regulation of automatic imitation was due to the task-irrelevant presentation of ethnically diverse emotional stimuli and not confounded by low-level attentional or perceptual influences caused by color differences of the hands and the target cue. The target cues were represented by the numbers “1” and “2” and displayed in black font in a grey square between the index and middle finger of the on-screen hand stimuli. Participants were asked to keep the number “1” on the keyboard pressed with their index and the number “2” with their middle finger. They were asked to lift their index finger whenever a “1” and the middle finger whenever a “2” appeared on the screen and to disregard any other stimuli presented on the screen. This aimed to ensure that potential regulation of automatic imitation by the ethnically and emotionally diverse stimuli occurred implicitly.

Task-irrelevant stimuli (i.e., face and hand stimuli) were matched for luminance across conditions using the SHINE (Spectrum, Histogram, and Intensity Normalization and Equalization) – toolbox (Willenbockel et al., 2010) for Matlab 8.3 (TheMathworks, Inc., MA).

Stimuli were presented centrally (visual angle approx. 2.4° × 5.1°) in two consecutive frames simulating finger lifting movements. Two instead of four consecutive frames were used in this experiment (compared with previous studies (Rauchbauer et al., 2015, 2016) to minimize the influence of overlapping ERP components caused by rapidly changing multiple frames (see also Rauchbauer et al., 2018). Both frames were displayed for 1,534 ms: the first frame presented only the task-irrelevant face and hand stimuli, the second frame presented the target cue and the task-irrelevant finger-lifting movement. Trials started with the presentation of a white fixation cross on black background for 1,000 ms, followed by the image presentation (in total 3,068 ms) and a variable intertrial interval with a random duration of 1,200-1,800 ms, again presenting the fixation cross.

Procedure

We used an EEG cap with 58 equidistantly mounted Ag/AgCl electrodes (model M10, EASYCAP, GmbH, Herrsching, Germany). Two electrodes were placed one cm above and below the right eye to record vertical eye-movements (electrooculogram [EOG]). Participants were seated in a sound-attenuated and electrically shielded chamber for EEG-recording. Distance to the computer screen was held constant at about 70 cm in front of a 19” cathode ray tube (CRT) monitor (Sony GDM-F520; 85 Hz refresh rate). Stimuli were presented using E-Prime 2.0 (Psychology Software Tools, Inc., Sharpsburg, PA); display resolution was 1,280 x 1,024 pixels (300 dpi).

A DC-amplifier setup (NeuroPrax, neuroConn GmbH, Ilmenau, Germany) recorded EEG signals within a frequency range of DC to 500 Hz and sampled it at 1,000 Hz for digital storage. Out of the scope of the present study and thus not presented in the article, four facial electromyographic electrodes also were attached. Electrode impedances were ensured to be below 4 kΩ before recording using a skin-scratching procedure (Picton & Hillyard, 1972).

Detailed instructions were presented at the beginning of the experiment and ten practice trials were performed. The task consisted of 50 trials per condition (i.e., 25 trials per index and middle finger) in a 2×2×3 within-subjects design: Group (In-group, Out-group), Emotion (Happy, Angry), and Congruency (Congruent, Incongruent, Baseline), resulting in a total of 600 randomly presented trials. After every 100 trials, participants were given short self-paced breaks. Duration of EEG data acquisition was approximately 60 minutes.

Post-experimental questionnaire

Attitudes towards blacks scale (ATB)

The Attitudes towards blacks scale (Brigham, 1993) measured explicit ethnic bias of our participants. Participants rated 20 statements on attitudes towards black individuals on a seven-point scale, indicating strong disagreement (“1”) to strong agreement (“7”) with the statement. A low mean score would imply a positive, whereas a high score a negative explicit attitude towards black individuals. This allowed identifying participants exhibiting either extremely favorable or unfavorable explicit attitudes towards black individuals (see “Participants” section).

Threat/security implicit association test (threat IAT)

We administered an adapted version of the Implicit Association Test (IAT) (Greenwald, McGhee, & Schwartz, 1998; see also Rauchbauer et al., 2016) to assess implicit ethnic bias of our participants. The threat-IAT assessed the strength of participants’ implicit associations between the target concepts of white in-group and black out-group with the attributes Security and Threat. The Security and Threat categories each consisted of five nouns matched for word length and valence in German (for Threat: fear, threat, violence, attack, danger; for Security: peace, protection, calmness, shelter, security). For more details on the threat-IAT see Supplementary Material (Section 2.1).

Statistical analysis

Statistical analyses of repeated measures ANOVA were performed in JASP 0.12.1 (JASP Team, 2020); https://jasp-stats.org/), jamovi 1.2.8. (The jamovi project, 2020; https://www.jamovi.org/), and Statistica 6.0 (StatSoft Inc., Tulsa, OK). Effect sizes of significant ANOVA results are described with partial eta squared (partial η2) (Kirk, 1996); the alpha-level was set at p < 0.05 in both behavioral and EEG data analyses. We further performed exploratory Spearman correlations with: IAT and ATB scores; ERP components and ATB, IAT and reaction times; ERP components showing a significant effect of congruency with each other; and ERP components showing a significant effect of congruency and individual RTs per condition – see Supplementary Materials (Section 2.5).

Statistical analysis of behavioral data

SAMT

Only data from correct trials entered data analysis. Participants’ individual mean response times (RT) were winsorized per condition and target cue to account for outliers before further statistical analyses (Wilcox, 2011). Mean RTs higher than the 75th percentile plus 1.5 times the interquartile range, and mean RTs lower than the 25th percentile minus 1.5 times the interquartile range of conditions and target cue were replaced with the corresponding maximum, respectively, the minimum RT.

First, participants’ mean RTs and percentage of error trials were analysed with a three-way repeated measurement ANOVA with factors Group (In-group, Out-group), Emotion (Happy, Angry), and Congruency (Congruent, Incongruent, and Baseline). Furthermore, we calculated the automatic imitation effect (i.e., the difference of participants’ mean RT on congruent minus incongruent trials per condition) to resolve a potential three-way interaction. Paired t-test will be used as post-hoc tests for the automatic imitation effect.

Threat IAT

The IAT score was calculated as the D-measure (Greenwald et al., 1998; Greenwald, Poehlman, Uhlmann, & Banaji, 2009): High D-measure values represent a strong implicit association of black out-group with Threat and white in-group with Security, whereas low D-measure values represent a low association of black out-group with Threat and white in-group with Security.

Obtained reaction times were winsorized to correct for outliers. Participants with more than 30% errors in one of the blocks will be excluded from analysis. The D-measure was calculated by dividing the RT (ms) difference of the fifth and third task by the latency standard deviation (SD) of RT of the combined fifth and the third task.

Preprocessing and statistical analysis of EEG data

Procedure

EEG data analysis was performed offline using EEGLAB 13.3.2b (Delorme & Makeig, 2004) implemented in Matlab 8.3 (The Mathworks, Inc., MA). First, data were down-sampled to 500 Hz, high (0.1 Hz) and low-pass filtered (30 Hz), and re-referenced to linked mastoids. Linked mastoid were chosen as offline reference sites as these electrodes have a similar noise level as the electrodes of interest, and their symmetric set-up guarantees a comparable weighing of left- and right-hemispheric activation (Luck, 2005; Nunez & Srinivasan, 2006). Additionally, we wanted to guarantee comparability with our previous study (Rauchbauer et al., 2018). Eye movement-related artefacts were detected applying extended infomax independent components analysis (ICA) (Bell & Sejnowski, 1995; Lee, Girolami, & Sejnowski, 1999) and subsequently removed. Data were segmented based on the factors Group (In-group, Out-group), Emotion (Happy, Angry), and Congruency (Congruent, Incongruent) during the presentation of the second frame. This resulted in eight conditions: In-group Happy Congruent, In-group Happy Incongruent, In-group Angry Congruent, In-group Angry Incongruent, Out-group Happy Congruent, Out-group Happy Incongruent, Out-group Angry Congruent, and Out-group Angry Incongruent.Footnote 1 For stimulus-locked data analysis, data segments were extracted 200 ms before target cue onset until 1,534 ms post-target cue onset, the first 200 ms served as baseline interval. For response-locked data analysis, data segments were extracted from −400 before to 600 ms after response execution. The mean voltage of the whole epoch served as baseline per trial, following Luck’s (2005) suggestion for response-locked data. Semiautomatic artefact correction eliminated trials that exceeded voltage values of ±75 μV or showed voltage drifts of >50 μV. Additional visual inspection was applied to exclude marked trials by the EEGLAB algorithms if artefact affliction was confirmed. Artefact-free segments were averaged per participant and condition: stimulus-locked on average 40.37 trials (SD = 5.007) and response-locked on average 33.88 trials (SD = 6.026).

Time windows and electrode locations were determined by literature and visual data inspection, which yielded overlapping results. In detail, we calculated global field power (Hamburger & vd Burgt, 1991; Murray, Brunet, & Michel, 2008; Skrandies, 1990) to specify respective time windows suggested by previous studies via assessing peak latency over all conditions per ERP. Subsequently, we inspected grand means and topographical maps of these time windows to define electrode clusters of interest. To keep the statistical model simple and to increase ERP signal-to-noise ratio, we assessed all ERPs at clusters of several merged electrodes applying a region of interest approach (Luck & Gaspelin, 2017). The N2 was analyzed by extracting mean amplitudes 200–300 ms after stimulus onset at an electrode cluster centred on FCz (Clayson & Larson, 2013); the same cluster as for the CRN was used here. In an exploratory approach, the N450 was analyzed by extracting mean amplitudes 400-550 ms after stimulus onset at an electrode cluster comprising Cz, CPz, and two electrodes in between (Tillman & Wiens, 2011). CRN amplitudes were extracted as peak-to-peak measure (CRN minus the preceding positive peak) to account for visible differences in the positive ERP peak before the CRN (which were, however, not significantly affected by the experimental manipulation, all p-values > 0.141). Negative-going CRN peaks and preceding positive peaks were assessed in a time-window of [−100; 100] ms around response execution at the frontocentral cluster centered on FCz (Pailing & Segalowitz, 2004). Mean PMP amplitudes were extracted in a time-window of 80 to 20 ms before response execution at a parietal cluster comprising three electrodes, which showed reliable activation across all conditions; see Figure S1 in Supplementary Materials for a visual depiction of the current electrode clusters.

First, to assess the modulation of interference control during automatic imitation by social context, three-way repeated measures ANOVAs with the within-subject factors Group (In-group, Out-group), Emotion (Happy, Angry), and Congruency (Congruent, Incongruent) were conducted for all ERP components. In case of significant three-way interactions, a congruency difference wave of the respective ERP (i.e., incongruent minus congruent trials per condition) will be calculated according to the logic of the difference measure of automatic imitation in the behavioral data analysis to resolve the interaction effect. Subsequently, these values will be subjected to a two-way repeated measures ANOVA with Group and Emotion. In line with our previous study (Rauchbauer et al., 2018), paired t-tests will be used as post-hoc tests. For all other interactions, Tukey post-hoc tests will be conducted.

Second, to investigate the prediction of reaction times by ERPs sensitive to congruency (i.e., showing an automatic imitation effect), we conducted linear mixed models in R (Version 1.3.1093, RStudio Team, 2020) with the lme4 package (Bates, Maechler, & Bolker, 2015). For this, we extracted single-trial reaction times for correctly answered trials participant- and condition-wise. Trials with reaction times faster than 3 standard deviations (SD) from the condition mean and slower than 3 SD from the condition mean were excluded from further analyses. Concerning ERPs, we extracted single-trial mean amplitudes in the respective time windows for N450 and PMP components (N450 [400; 550]; PMP [−80; −20]) participant- and condition-wise for correctly answered trials. For the CRN component, we extracted single-trial mean amplitudes ±20 ms around individual peaks (averaged across all conditions) of the CRN and the preceding positivity, which were then subtracted from each other on a single-trial basis (analogue to the peak-to-peak approach used for the main analysis). These single-trial difference mean amplitudes were extracted participant- and condition-wise. Single-trial reaction time data were modelled as a function of Group, Emotion, and Congruency (effect-coded: Group: Black: 1; White: −1; Emotion: angry: 1; happy: −1; Congruency: congruent: 1; incongruent: −1), and the continuous trial-wise mean-centered N450, PMP, and CRN amplitudes as fixed effects, as well as their individual interactions with the three-way full factorial model Group x Emotion x Congruency. The random effects structure included the three-way interaction of Group x Emotion x Congruency, estimating a random intercept and a random slope for each participant. We applied a restricted maximum likelihood estimation. Test statistics (from Wald chi-square tests) were calculated using the car package (Fox & Weisberg, 2019) with orthogonal sum contrasts and Type 3 sum of squares. To resolve significant interactions, we used the emmeans package (Searle, Speed, & Milliken, 1980) for the factor combinations of interest with the continuous ERPs of interest and pairwise comparisons among them. Figures were created using ggplot2 (Wickham, 2016). Asymptotic approximation of degrees of freedom was used.

Results

Social context modulation during automatic imitation in the SAMT: Behavior

On average across participants and conditions, 96% of trials were correct and entered subsequent analyses. The ANOVA on accuracy rates showed, by trend, an interaction of Group x Congruency with slightly less errors for Congruent out- than in-group trials (F(2, 27) = 2.969, p = 0.060, partial η2 = 0.10; Congruent: out-group: M = 1.16; SE = 0.11, in-group: M = 1.45, SE = 0.09). The ANOVA on reaction times (correct trials only) revealed a significant main effect of Group with faster reaction times on out- than in-group trials (F(1, 27) = 4.201, p = 0.050, partial η2 = 0.135; out-group: M = 512.35, SE = 15.69, in-group: M = 517.12, SE = 16.46). We observed a main effect of Congruency in the expected direction of longer RTs for incongruent than baseline and congruent trials (F(2,27) = 66.426, p < 0.001, partial η2 = 0.77; Congruent: M = 481.69, SE = 14.50; Baseline: M = 519.99, SE = 16.30, Incongruent: M = 542.52, SD = 17.99). Furthermore, a trend was observed for the three-way interaction (F(2,27) = 3.076, p = 0.054, partial η2 = 0.10). No significant difference between conditions was found for baseline trials (all p-values ≥ 0.154). Consequently, we calculated the automatic imitation effect as the mean difference of incongruent minus congruent trials. A repeated measures ANOVA showed again a trend result for the interaction GROUP x EMOTION (F(1,27) = 3.911, p = 0.058, partial η2 = 0.127), in the absence of significant main effects (both p-values 0.231, partial η2’s ≤ 0.053; main effect of Group). Post-hoc comparisons revealed a larger automatic imitation effect for Happy in-group versus Happy out-group stimuli, (t(27) = 1.918, p = 0.023; Happy: in-group: M = 68.98, SE = 9.07; out-group: M = 57.91, SE = 5.43). Furthermore, another trend was found for a larger automatic imitation effect for Angry versus Happy out-group faces (t(27) = 2.418, p = 0.066; Out-group: Angry: M = 62.26, SE = 7.54; Happy: M = 54.14, SE = 6.69). No further significant differences for pairwise comparisons were found (all p-values ≥ 0.166; Table 1; Fig. 2).

Table 1 Mean reaction times per condition (in ms) in the Social-Affective Mimicry Task (SAMT)
Fig. 2
figure 2

Automatic imitation effect in social context. Mean reaction time (RT) difference of incongruent minus congruent trials (in ms) of the four social context conditions is presented in box plots depicting the median and quartiles (whiskers demonstrating 1.5 times the interquartile range). The violin plots around illustrate kernel probability density (i.e., its width demonstrates the proportion of data located there); individual values are plotted within the violin plots. *p = 0.058 (for detail of interaction see text).

Attitudes towards Black Scale (ATB) and Threat-Implicit Association Test (IAT)

Data of two participants was missing for the ATB and the IAT due to the same technical error. Two participants were identified as outliers in the ATB because of their extremely favorable attitude towards Blacks and were thus excluded from all subsequent analysis. Participants exhibited a neutral to favorable explicit attitude towards the out-group in the ATB (N = 26; M = 4.10, SD = 0.31). In the IAT only, data from two additional participants were excluded from analysis due to an error percentage > 30%. The Threat-IAT revealed a large effect of the out-group being more associated with threat, whereas the in-group being more associated with security (t(23) = 3.713, p = 0.001, d = 0.76; Block 3 compatible: M = 840.91, SE = 47.97; Block 5 incompatible: M = 958.78, SE = 36.61).

EEG

Social context modulation during automatic imitation in the SAMT: ERPs

N2

The N2 mean amplitudes ANOVA revealed a significant interaction of Group x Emotion (F(1,27) = 7.233, p = 0.012, partial η2 = 0.211). No further significant effects were found (all p-values ≥ 0.113, partial η2 = 0.091). Tukey post-hoc tests showed a trend for more pronounced N2 amplitudes for Angry out-group stimuli compared with Angry in-group stimuli (p = 0.077; Angry out-group: M = 0.158, SE = 0.585; Angry in-group: M = 0.734, SE = 0.480; all other p-values > 0.127; Figure 3; Figure S2 in Supplementary Materials).

Fig. 3
figure 3

Grand average of stimulus-locked N2. Right panel depicts N2 amplitude courses for happy; left panel for angry faces at a frontocentral electrode cluster centered on FCz. SRC = stimulus-response compatibility. Negative amplitudes are drawn upwards by convention. Stimulus presentation started at 0, indicated by a ticked vertical line. For demonstrational purposes, only the first 800 ms of the frames are depicted.

N450

The N450 mean amplitudes ANOVA revealed a significant interaction effect of GROUP x CONGRUENCY (F(1,27) = 8.882, p = 0.006, partial η2 = 0.25). No other significant effects were observed (all p-values ≥ 0.322, partial η2 = 0.04). More negative N450 amplitudes for incongruent compared to congruent trials were observed for out-group stimuli (p = 0.005) but not for in-group stimuli (p = 0.967). Moreover, congruent in-group stimuli elicited more negative N450 amplitudes than congruent out-group stimuli (p = 0.018; Fig. 4).

Fig. 4
figure 4

Grand average of stimulus-locked N450. Right panel depicts N450 amplitude courses for happy; left panel for angry faces at a central electrode cluster. SRC = stimulus-response compatibility. Negative amplitudes are drawn upwards by convention. Stimulus presentation started at 0, indicated by a ticked vertical line. For demonstrational purposes, only the first 800 ms of the frames are depicted.

CRN

The CRN peak-to-peak amplitudes ANOVA revealed a significant main effect of Congruency, with more negative CRN amplitudes for incongruent than congruent trials (F(1,27) = 7.250, p = 0.012, partial η2 = 0.21; Congruent: M = −5.033, SE = 0.461; Incongruent: M = −5.660, SE = 0.548), and a significant three-way interaction (F(1,27) = 4.865, p = 0.036, partial η2 = 0.15), which was resolved in the subsequent difference wave analysis (Figure 5).

Fig. 5
figure 5

Grand average of response-locked CRN. Right panel depicts CRN amplitude courses for happy; left panel for angry faces at a frontocentral electrode cluster centered on FCz. SRC = stimulus-response compatibility. Negative amplitudes are drawn upwards by convention. Response execution occurred at time point 0, indicated by a ticked vertical line.

CRN congruency difference waves

The CRN difference wave ANOVA revealed a significant interaction effect (F(1,27) = 4.865, p = 0.036, partial η2 = 0.15). Post-hoc comparisons showed by trend larger congruency differences for Happy out-group compared with Angry out-group stimuli (t(27) = −1.994, p = 0.056; Out-group: Happy: M = −1.264, SE = 0.534; Angry: M = 0.086, SE = 0.431).

PMP

The PMP mean amplitudes ANOVA revealed a significant main effect of Congruency with more positive PMP amplitudes for congruent than incongruent trials (F(1,27) = 6.176, p = 0.019, partial η2 = 0.19; Congruent: M = 7.048, SE = 0.607; Incongruent: M = 6.289, SE = 0.639). No other significant effects were found (all p-values > 0.108, partial η2 = 0.09; Fig. 6), Table 2.

Fig. 6
figure 6

Grand average of response-locked PMP. Right panel depicts PMP amplitude courses for happy; left panel for angry faces at a parietal electrode cluster. Negative amplitudes are drawn upwards by convention. Response execution occurred at time point 0, indicated by a ticked vertical line.

Table 2 Means and standard deviations (SD) of the quantified ERP components in μV

Prediction of reaction times by ERPs sensitive to congruency (linear mixed model)

The linear mixed model predicting reaction times showed main effects of Congruency and the N450 component (Table 3). Furthermore, a significant interaction effect of Group x Emotion x Congruency x mean N450 amplitudes predicted reaction times (Table 3; Fig. 7; Figures S6 and S7 in Supplementary Material) with more negative N450 amplitudes predicting longer RTs.

Table 3 Linear mixed model prediction of reaction times by the factors Emotion x Group x Congruency and the continuous predictors (N450, CRN, and PMP), modelling the three-factorial interaction of Emotion x Group x Congruency as random effects per participant. Only significant effects are depicted
Fig. 7
figure 7

Linear mixed model prediction of single-trial reaction time data of the four-way interaction (Congruency x Emotion x Group x N450), plotted for congruent and incongruent trials separately. Congruency: Con = congruent (in red); Incon = incongruent (in turquoise). The shaded area around the slopes denotes the standard error of the mean.

To resolve the significant interaction effect, we conducted pairwise post-hoc tests. The comparisons between Emotion x Congruency per Group (all Emotion x Congruency comparisons calculated separately for in-group and out-group trials, Table S3) and Group x Congruency per Emotion (all Group x Congruency comparisons calculated separately for Angry and Happy trials; Table S4) showed significant differences for the N450 predicting reaction times only in comparisons between incongruent and congruent trials, with more negative N450 amplitudes predicting longer RTs (all b > −60.85 (SE = 6.30), all z’s > −9.67, all p < 0.001; Fig. 7). Pairwise comparisons between Emotion x Group by Congruency (all Emotion x Group comparisons calculated separately for Incongruent and Congruent trials) did not reveal any significant differences. Thus, the post-hoc pairwise comparisons were not able to explain the origin of the observed interaction effect. However, when further plotting the interaction effect, we observed that the slopes in Figures S6 and S7 point towards potential interactions; on the one hand for incongruent Happy in-group and Happy out-group trials (Figure S6), and on the other hand for incongruent Happy and Angry out-group trials (Figure S7). This suggests that the four-way interaction may be driven by social context effects in incongruent trials; yet this remains entirely speculative, because these did not reach statistical significance.

Discussion

The present study investigated interference control during the automatic imitation task as an interplay between self-other distinction (SOD) and cognitive conflict/action monitoring, based on the assumption that the social context manipulation may further aid in disentangling these two. We hypothesized that this interplay would be reflected in amplitude variation of the N2, the CRN, as well as the PMP, and in an exploratory manner, the N450 components. These ERPs were furthermore hypothesized to jointly predict individual behavioral responses given they are sensitive to congruency. PMP, N450, and CRN amplitudes were sensitive to congruency, albeit this applied only to out-group trials for the N450 component. The social context manipulation (a variation of emotional expression and group membership) affected cognitive control/action monitoring ERPs only (N2, CRN, N450). Reaction times were predicted by N450 amplitudes for incongruent relative to congruent trials across all conditions.

On the behavioral level, as expected, we observed faster reaction times on congruent than on baseline and incongruent trials, replicating the established automatic imitation effect. By trend, the automatic imitation effect was affected by emotional expressions and group membership. Results of post-hoc comparisons showed a larger automatic imitation effect for happy in-group than out-group stimuli. Also, we found, by trend, a larger automatic imitation effect for angry compared with happy out-group stimuli. We did not replicate previous behavioral results of enhanced regulation of automatic imitation for angry out-group compared with angry in-group faces (Rauchbauer et al., 2016). The post-hoc comparisons should be taken with caution, because the interaction was only a trend result. Not replicating our previous behavioral results might be due to the smaller sample size compared with our previous studies. In general, neural correlates seemed to be better suited to pick up the influence of social context effects than our behavioral measures. Behavioral correlates may capture a compound of several cognitive processes, explaining why its effect sizes are smaller than those of ERPs (see also Pfabigan, Holzner, & Lamm, 2016). Behavioral results furthermore showed that black individuals were implicitly more associated with threat and white individuals more with security, whereas participants concurrently exhibited a neutral explicit attitude towards black individuals.

On the neural level, N2 amplitudes on angry out-group trials were by trend more pronounced than on angry in-group trials. That is, N2 amplitude variation was by trend sensitive to social context information, which could be due to a modulation of top-down processes. Our N2 trend result of more negative amplitudes for angry out-group than angry in-group trials mirrored previous behavioral results in the SAMT (Rauchbauer et al., 2016). Our N2 results may seem in contrast to the “own-race effect,” which constitutes of more negative N2 amplitudes in response to in-group compared with out-group faces (Ito & Bartholow, 2009; Ito & Urland, 2003, 2005; Kubota & Ito, 2007; Tanaka et al., 2004), and more negative N2 amplitudes for happy than angry facial expressions (Kubota & Ito, 2007). Yet, recent accounts suggest that the “own race effect” (Ito & Bartholow, 2009; Kubota & Ito, 2007) and the allocation of attentional resources according to group membership are context sensitive (Derks, Stedehouder, & Ito, 2014; Willadsen-Jensen & Ito, 2008). Asian participants, for example, attend to the most prominent ethnic group (e.g., white) and not necessarily their own (Willadsen-Jensen & Ito, 2008). Also, high identity threat in response to the in-group resulted in enhanced N2 amplitudes to out-group stimuli, potentially through enhanced attentional resource allocation (Derks et al., 2014). Relating the current N2 findings to the IAT results, N2 variation may reflect enhanced attentional resource allocation to the potentially as most salient experienced out-group face before subsequent movements are initiated. Potentially, top-down motivational effects of the social context influence early neural information processing for adaptive response execution or affect regulation (Amodio, 2013; Amodio, Harmon-Jones, & Devine, 2003; Derks et al., 2014; Ratner & Amodio, 2013). In the case of the present study, the categorization of the most salient social context stimuli (i.e., angry out-group faces) may specifically highlight the need to adapt behavior accordingly.

In the present experiment, N2 amplitudes were not sensitive to congruency information, which is in line with a recent study administering an automatic imitation task (Rauchbauer et al., 2018; see Section 2.4 in Supplementary Materials). Nevertheless, N2 amplitude variation according to stimulus congruency has been found in other SRC tasks (for review, see Folstein & Van Petten, 2008). Given the simultaneous processing of social context information in the SAMT, the processing of congruency might be temporally shifted.

Our exploratory N450 results support the suggestion that congruency between the motor command and the task-irrelevant movement is processed after the N2 time window. Previous studies reported more negative N450 amplitudes for incongruent than congruent trials as a sign for enhanced monitoring (Liotti et al., 2000; West & Alain, 1999). This established finding was observed for out-group trials in the current study, which is in line with the literature, suggesting that the N450 reflects cognitive control/action monitoring. However, no such congruency effect was found for in-group trials. Surprisingly, congruent in-group trials elicited N450 amplitudes on the same level as incongruent in-group and out-group trials, suggesting that our participants deployed enhanced conflict monitoring during all in-group trials. However, this monitoring was not adaptive but rather detrimental, as task performance did not benefit (by trend more errors in congruent in-group than out-group trials despite longer reaction times in in-group trials).

Importantly, the N450 is not only tied to cognitive conflict/action monitoring, but has also been associated with congruency processing during action observation (e.g., whether gestures and a given context match or not, Wu & Coulson, 2005), and unexpected socially inappropriate actions (Proverbio & Riva, 2009). More negative N450 amplitudes were reported for incongruent set-ups and unexpected actions (Proverbio & Riva, 2009; Wu & Coulson, 2005; see also Amoruso et al., 2013). Interpreting the current findings in light of these studies, we could suggest that the in-group setting interfered with the differentiation as to whether the observed action and the respective context matched. It might be possible that implicit in-group biases were at play here (see Supplementary Materials 1.5, for a potential pointer, as a more positive attitude towards blacks was associated with more negative N450 amplitudes in in-group angry congruent trials). Avoiding any further speculation, the current result awaits future investigation with larger sample sizes and more diverse samples.

Replicating previous results, CRN amplitudes were enhanced in incongruent than congruent trials (Bartholow et al., 2005), suggesting enhanced task engagement in these trials to successfully avoid erroneous responses. Moreover, CRN difference waves as an indicator of task engagement were by trend more pronounced for happy than angry out-group stimuli. This finding suggests that enhanced cognitive conflict/action monitoring was required for happy outgroup trials to avoid errors. Supporting this interpretation, also previous studies have related CRN amplitude variation to emotional processes (Luu, Collins, & Tucker, 2000; Vidal et al., 2000; Vidal et al., 2003). The current findings suggest that the CRN component is sensitive to socioemotional information and informs required behavioral adaptation of the current social context. The present results suggest that, early on, attention may be preferably allocated to the most salient contextual cue at the time of the N2 component (angry out-group condition). Later, the CRN component may subsequently inform strategic behavioral adaptation (Bartholow et al., 2005) according to contextual cues that might require a change of contextual priorities (happy out-group condition). This social context condition may require enhanced behavioral adaptation.

PMP amplitude variation distinguished between congruency regardless of social context, which corroborates the results of our previous study (Rauchbauer et al., 2018). As such, it could reflect the degree of motoric overlap between self- and other-generated movements at the moment of response execution. Yet in the current study, the PMP component did not predict response times. Our results rather suggest that preparation of the behavioral response may already be under way during stimulus processing, potentially reflecting cognitive conflict/action monitoring processes. The PMP component has been suggested to reflect a command for movement initiation (Bortoletto et al., 2006; Deecke et al., 1969; Deecke et al., 1976) and may depend on strategies for movement initiation (Bortoletto et al., 2006). As a kind of “go-signal” for the motor system (Bortoletto et al., 2006; Burwell, Makeig, Iacono, & Malone, 2019; Deecke et al., 1985), the PMP showed more positive amplitudes on correct than incorrect trials, suggesting that it precedes appropriate motor actions (Burwell et al., 2019). Our data could be in line with this, as more positive PMP amplitudes on congruent trials may reflect a potential go-signal on easy-to-resolve congruent trials compared to incongruent ones. It could be suggested that the PMP as a go-signal on congruent trials may reflect response facilitation, due to the motoric overlap of one’s own executed action and the movement of the task-irrelevant hand. The generator of the PMP has been suggested to be located in the SMA (Deecke et al., 1985), which has also been ascribed human mirror neuron functions (Mukamel et al., 2010). Future studies should try to further disentangle response execution processes from those related to the comparison of internal and external motor representations and their neural manifestations.

Linking neural and behavioral data, we observed that the N450 component predicted reaction times for incongruent versus congruent trials in all conditions, with more negative amplitudes for longer RTs, as expected. However, while the LMM revealed a significant interaction of the N450 for the factor combinations of Emotion, Group and Congruency, pairwise comparisons did not show any significant differences between the different social context conditions (all Emotion and Group comparisons calculated separately for congruent and for incongruent trials). They only showed significant differences between congruent/incongruent factor combinations (Figure 7). Visual inspection of the data (Figures S6 and S7 in the Supplementary Material) may suggest that the interaction was driven by Emotion x Group x N450 effects in incongruent trials, specifically for the N450 on Happy in-group versus out-group trials (Figure S6) and on Happy and Angry out-group trials (Figure S7). Yet this is only a descriptive interpretation and should be taken with caution. Nevertheless, the results seem to further underline the role of the N450 for cognitive conflict/action monitoring to resolve motoric (in)congruency.

The present study thus partly extends our previous results (Rauchbauer et al., 2018) as it suggests that primarily processes of cognitive conflict/action monitoring are guiding our behavioral responses. The present results also point towards the importance of social context in regulating automatic imitation at the stimulus processing level before response execution. Thus, interference control may primarily be guided by processes of cognitive control/action monitoring, whilst being modulated by social context demands.

We acknowledge a limitation in terms of statistical power in the present article. While the current sample size had enough power to detect the classic automatic imitation effect on the behavioral level as well as congruency effects on the neural level, we have to acknowledge that it was underpowered in terms of interaction effects that are based on smaller effect sizes. Our interaction results should therefore be considered exploratory.

Nonetheless, our results could inform processes underlying atypical modulation of automatic imitation in, for example, individuals with autism spectrum disorder (ASD). While high-functioning ASD individuals do not show modulation of automatic imitation (Cook & Bird, 2012), hyper-imitation can be observed with increased ASD symptom severity (Bird, Leighton, Press, & Heyes, 2007; Sowden, Koehne, Catmur, Dziobek, & Bird, 2016; Spengler, Bird, & Brass, 2010). Our results could be used to investigate impaired neurocognitive processes underlying the absent or hyper-modulation of automatic imitation in ASD individuals. Supporting this suggestion, PMP amplitude variation has recently been proposed to reflect clinical implications (Burwell et al., 2019). Thus, this might be an interesting avenue to pursue.

Conclusions

Our findings suggest that interference control during automatic imitation is predominantly driven by cognitive control/action monitoring. These processes are malleable by social context. Our results can be useful in identifying potentially maladaptive processes during motor resonance. Interactive motor alignment in everyday life predominantly engages cognitive conflict/action monitoring processes to inform orchestration of behavior during perception-action mismatch.