When faced with reminders of disturbing past events, memories of the event and associated negative emotions can intrude into awareness and lead to an unpleasant state of being. Fortunately, people can recruit cognitive control processes to suppress retrieval of unwanted memories and consequently feel better (Anderson & Hanslmayr, 2014). This ability to exert memory control is an important aspect of maintaining good mental health (Engen & Anderson, 2018; Mary et al., 2020), and a large body of research over the past 20 years has delineated, with increasing detail, the neurocognitive mechanisms that enable memory suppression. However, most research studies the suppression of memories of simple stimuli, such as words or pictures encoded in a laboratory environment (Anderson & Hanslmayr, 2014). Research on the neural basis of memory control of real-world, emotionally charged autobiographical memories remains limited (except one study by Noreen et al., 2016), despite the theoretical view that memory suppression is a motivated process that people use to regulate emotions. We used EEG to investigate the neurocognitive mechanisms involved in suppressing autobiographical memories of events that people should be particularly motivated to avoid—specifically, memories of behaving in a morally wrong manner (Stanley & De Brigard, 2019).

Memory control is typically studied in the laboratory using the Think/No-Think (TNT) paradigm (Anderson et al., 2004; Anderson & Green, 2001). Participants initially learn pairs of stimuli, such as two weakly related words (e.g., ordeal–roach). Next, one stimulus in each pair is presented as a reminder for the other. Participants are instructed to actively remember and keep in mind the associated stimulus if the reminder is displayed in green (Think items), but if it is shown in red (No-Think items), they are instructed to prevent the memory of the associated stimulus from intruding into consciousness. The same suppression or retrieval task is repeated many times for each reminder. Participant ratings of whether or not the memory came to mind for each Think/No-Think trial (Levy & Anderson, 2012) are used to assess how often No-Think memories intrude into conscious awareness despite participants’ attempts at stopping retrieval. A reliable finding is that such intrusions are less frequent over repeated suppressions (Benoit et al., 2015; Davidson et al., 2020; Gagnepain et al., 2017; Harrington et al., 2021; Hellerstedt et al., 2016; Levy & Anderson, 2012; Mary et al., 2020; van Schie et al., 2013), providing real-time evidence that retrieval suppression is increasingly effective the more times it is applied to a memory. Following the TNT task, participants complete a surprise memory test of all cue-target pairs, wherein the reminders are presented again and participants are asked to recall and report all associated words (or pictures), regardless of previous TNT instructions. Memories in the No-Think condition are typically more poorly recalled on the final test compared to both memories in the Think condition and memories in a Baseline condition that were neither repeatedly suppressed nor retrieved, suggesting that intentional suppression can induce subsequent forgetting (Anderson & Hanslmayr, 2014; Stramaccia et al., 2021).

EEG measurements during the Think/No-Think phase have been used to elucidate the neurocognitive mechanisms involved in memory suppression with high temporal resolution. Previous ERP findings indicate that attempting to suppress retrieval in response to a reminder involve early attentional and cognitive control processes between around 200-500 ms (Bergström et al., 2009b; Crespo-García et al., 2022; Mecklinger et al., 2009; Streb et al., 2016; Waldhauser et al., 2012). When suppression is successful, cognitive control reduces the ERP marker of conscious recollection, which is evidenced by a reduced parietal positivity for supressed (No-Think) memories compared to retrieved (Think) memories that is typically maximal between 500-800 ms (Bergström et al., 2007, 2009a, 2009b; Depue et al., 2013). From that time onward, sustained control needs to be maintained to ensure that the memory does not enter into awareness for as long as the person is exposed to the reminder (Hanslmayr et al., 2009). Previous EEG oscillation studies indicate that sustained memory control is associated with a decrease in oscillatory power for suppressed compared with retrieved memories across theta (~4-7 Hz), alpha (~8-12 Hz), and beta (~13-30 Hz) frequency bands. It has been suggested that parietal theta power reductions reflect reduced recollection of the associated memory,Footnote 1 whereas sustained maintenance of cognitive control to ensure that the memory does not enter awareness is reflected by alpha/beta band reductions (Depue et al., 2013; Legrand et al., 2020; Quaedflieg et al., 2020; Waldhauser et al., 2015).

One prior study has investigated the ERP correlates of self-reported intrusions (Hellerstedt et al., 2016). In that study, intrusions were associated with an early frontocentral ERP positivity around 400 ms after the cue, potentially indicating initial reactivation of a memory trace (Hellerstedt et al., 2016; see also Hellerstedt & Johansson, 2014), followed by a negative-slow-wave that was maximal between 550-900 ms. The negative slow-wave was suggested to reflect the intruding memories being active in working memory, or alternatively, processes related to error detection. Contrary to predictions, the study did not show enhanced parietal positive ERP markers of conscious recollection during intrusions. Measuring EEG oscillatory correlates of memory intrusions, (Castiglione et al., 2019) found that successful stopping of both memory retrieval and motor actions was associated with early (200-300 ms) increased right-frontal beta oscillation power (~13-30 Hz), which decreased when participants experienced memory intrusions. These results therefore suggested that intrusions occurred when participants failed to engage inhibitory control mechanisms that were reflected by beta band oscillations.

Prior research, however, has mostly used simple stimuli, such as pairs of words or images to investigate the neurocognitive basis of memory suppression. Although such stimuli are useful for achieving experimental control and for understanding the core processes involved in the task, they do not address how people control retrieval of more complex, self-relevant, and emotional memories in everyday life. There is extant behavioural and neuroimaging literature on suppression of emotional words/picture memories (Chen et al., 2012; Depue et al., 2007; Gagnepain et al., 2017; Joormann et al., 2009; Legrand et al., 2020), but because of the lack of self-relevance of those materials, they arguably do not elicit strong motivation for memory control. Research using more complex stimuli suggests that people can suppress unpleasant self-relevant information (Benoit et al., 2016; Noreen et al., 2014) and autobiographical memories (Noreen et al., 2016; Noreen & Macleod, 2013; Stephens et al., 2013). However, most of this research focusses on the after-effects of suppression, showing enhanced subsequent forgetting of suppressed memories even when those memories contain emotional and self-relevant content. Evidence on how emotion affects the intrusiveness of memories during suppression attempts is inconclusive (Davidson et al., 2020; van Schie et al., 2013), and there is, to our knowledge, no previous research measuring intrusiveness of self-relevant and complex autobiographical memories in this paradigm. Furthermore, although some studies have investigated the fMRI correlates of self-relevant materials (Benoit et al., 2016; Noreen et al., 2016), ours is the first EEG investigation of whether emotional autobiographical memories can be suppressed (although see Hu et al., 2015). This approach is important to understand how we engage memory control in everyday life.

In this experiment, we investigated whether and how people can supress autobiographical memories of their own morally wrong actions. Associated with guilt and shame, such memories threaten people’s view of themselves as morally righteous and should be particularly relevant targets for motivated memory control (Anderson & Hanslmayr, 2014; Stanley & De Brigard, 2019). Indeed, some findings suggest that memories of one’s own immoral actions become obfuscated over time (Kappes & Crockett, 2016; Kouchaki & Gino, 2016), as such memories are more likely to be forgotten over a delay and if remembered are rated as less vivid and more distant in time (Escobedo & Adolphs, 2010) than memories of moral actions. However, more recent investigations were unable to replicate some of these findings, and these researchers (Stanley et al., 2018) argued that one’s own immoral actions are likely to be more, rather than less, memorable than other types of events. We propose that these seemingly contradictory ideas are compatible with theoretical accounts of motivated forgetting (Anderson & Hanslmayr, 2014; Stanley & De Brigard, 2019). This theory posits that people are highly motivated to suppress retrieval of intrusive, negative memories to maintain good mental health and that intrusive memories are likely to recruit inhibitory control to keep those memories out of mind. Such efforts should lead to increased suppression-induced forgetting (Anderson & Hanslmayr, 2014; Benoit et al., 2015; Levy & Anderson, 2012). Therefore, if memories for immoral actions are highly memorable and intrusive (Stanley et al., 2018), then this may indeed make them likely targets of motivated forgetting over the longer term (Kouchaki & Gino, 2016).

To test these hypotheses, we adapted the Think/No-Think paradigm to study autobiographical memories, inspired by previous studies (Noreen et al., 2016; Noreen & Macleod, 2013; Stephens et al., 2013). Participants first remembered and described several autobiographical memories involving morally wrong and morally right actions that they committed. They created a title for each memory to be used in a later Think/No-Think task as a reminder of the memory. The Think/No-Think task was conducted 24 hours later. In this task, participants performed trials in which they received a reminder for one of their autobiographical memories. When a given reminder appeared in green, participants were asked to retrieve the associated event and keep it in awareness for the duration of the trial. However, when a reminder appeared in red, they were asked to focus their attention on the reminder but to exclude the unwelcome memory from awareness, as in the typical paradigm (Anderson et al., 2004; Anderson & Green, 2001; Sheldon et al., 2019). The reminder to a given morally right or morally wrong memory was repeated multiple times throughout the Think/No-Think phase and, across repetitions, was consistently suppressed or retrieved. Critically, we measured memory intrusions on a trial-by-trial basis by asking participants to report, immediately after the trial had ended, whether the associated memory had entered awareness at all during the preceding trial (Benoit et al., 2015; Hellerstedt et al., 2016; Levy & Anderson, 2012).Footnote 2

We measured ERPs and oscillatory EEG power in the theta, alpha, and beta bands during the Think/No-Think task to delineate the neurocognitive mechanisms associated with autobiographical retrieval and suppression. We sought to investigate how those neural markers might be modulated by the moral nature of the memories and by intrusions. Rich autobiographical memories are likely to be retrieved over a more protracted timeframe and may involve partially different/additional cognitive processes (Sheldon et al., 2019) compared with remembering simple item associations encoded in the same lab session. Of the few studies that have investigated the neural mechanisms of autobiographical memory retrieval with EEG, most have found that autobiographical retrieval is associated with later EEG effects, including late parietal positive ERPs that may reflect conscious recollection (Renoult et al., 2015) but also other late slow-drift ERP effects (Conway et al., 2001; see also discussion in Staresina & Wimber, 2019). No prior research has described the EEG correlates of autobiographical memory suppression and intrusions. It was unclear whether such EEG effects would have the same topography and timing as suppression and intrusion effects found for simpler stimuli. We therefore used a data-driven approach involving cluster-based permutation tests (Maris & Oostenveld, 2007) to assess differences in ERPs and EEG oscillations between conditions across the whole scalp and time-period after Think and No-Think reminders were shown.

Morally wrong memories have been theorised to be memorable and vivid (Stanley et al., 2018; Stanley & De Brigard, 2019). Therefore, if intrusiveness of a memory is related to its vividness, such memories could trigger more intrusions than morally right memories during suppression attempts. Based on evidence that autobiographical retrieval involves at least partially overlapping neurocognitive processes with simpler forms of episodic recollection (Hebscher et al., 2020; Renoult et al., 2015; Tanguay et al., 2018), we predicted that suppression would reduce recollection-related EEG activity, which would be manifest as a reduced late parietal ERP positivity (Bergström et al., 2007) and reduced oscillatory power in the theta band (Crespo-García et al., 2022; Waldhauser et al., 2015) for No-Think compared with Think reminders. When autobiographical memories intrude into participants’ awareness despite suppression attempts, such trials may be associated with increased parietal ERP positivities and theta-band oscillatory power if autobiographical memory intrusions recruit recollection-related neurocognitive processes. If morally wrong memories intrude more often than morally right memories, Think vs. No-Think differences in recollection-related activity therefore could be smaller for the former. However, because Hellerstedt et al. (2016) did not find enhanced parietal ERP positivities when simple word-pair memories intruded into awareness, the retrieval processes that are engaged during memory intrusions may be qualitatively different from those engaged during voluntary episodic recollection. Therefore, it was not clear how EEG during intrusion trials would compare to retrieval-related activity during Think trials.

Because the inhibitory control mechanism involved in retrieval suppression is considered domain general (Anderson & Floresco, 2021; Anderson & Hulbert, 2021; Apšvalka et al., 2022; Depue et al., 2016; Gagnepain et al., 2014, 2017; Hu et al., 2017), EEG correlates of autobiographical memory control may be similar to those involved in suppressing simpler memories. Therefore, EEG correlates of autobiographical memory control may be reflected in early ERP negativities (Bergström et al., 2009b; Crespo-García et al., 2022; Mecklinger et al., 2009; Streb et al., 2016) and later sustained alpha/beta power reductions as in prior research (Legrand et al., 2020; Lin et al., 2021; Quaedflieg et al., 2020; Waldhauser et al., 2015). Theoretically, recollecting an autobiographical memory involves a slow, gradually unfolding retrieval process (Hebscher et al., 2020; Staresina & Wimber, 2019). Given this, EEG effects associated with both recollection and control processes were predicted to emerge later and be more sustained than effects found in prior research investigating suppression of recently learned item associations.

Method

Participants

Thirty-four students aged 18 to 21 years (24 females; Mage = 19.15; SDage = 0.78) at the University of Kent completed the study in exchange for a combination of course credits and £5. This sample size was chosen to be either similar to or larger than samples used in the most relevant previous EEG research (e.g., N = 32 in Hellerstedt et al., 2016; N = 24 in Waldhauser et al., 2015) and provided >80% power to detect a medium effect size of Cohen’s d = 0.5 (EEG effect sizes associated with suppression are typically in the range of medium to large). All participants self-reported that they were right-handed, had normal or corrected-to-normal vision, were psychologically and neurologically healthy, and were not taking any psychoactive medication. They were advised not to take part if they were feeling low or depressed due to the upsetting nature of the memories. Participants provided informed consent before taking part. The study was approved by the University of Kent’s School of Psychology ethics committee.

Design materials and procedure

We ran this study in two sessions. In the first session, participants generated autobiographical memories and associated titles for each memory. We used these titles as retrieval cues in the TNT task in the second session, conducted 24 hours later (Fig. 1).

Fig. 1
figure 1

Illustration of the procedure and trial structure of key phases in the experiment. Participants attempted to suppress (No-Think) memories associated to red cues but consciously recollect the associated memories in response to green cues. Participants indicated how often the associated memory came to mind (retrieval) using the “1 2 3” rating scale

Session 1

The first author conducted this session in a lab using the online Qualtrics survey software. We instructed participants to think of 22 different autobiographical memories, one at a time (10 morally wrong and 10 morally right, plus 2 filler memories involving 1 birthday and 1 holiday) and to type a description in a text box provided to them on a computer screen. To aid recollection of morally wrong actions, we instructed participants to think of memories in which they lied or cheated, physically or emotionally harmed someone, or in which they committed any other act that they considered morally wrong. We also provided similar examples for remembering morally right actions, such as memories in which they were truthful, in which they helped someone physically or emotionally, or in which they committed other morally right actions. Participants had 3 minutes to think of and write about a specific memory in two to three sentences by describing their actions, the persons involved, the location, and how they felt. We instructed them to avoid writing about events that easily blend with other memories and to describe each event in as much detail as they could provide within those limits.

Next, we instructed participants to think of a unique and specific personal title (henceforth referred to as the “cue”) that would help them to recollect the same memory in the next session. We also advised participants to avoid titles that could evoke multiple memories.

After participants generated the cue, they rated each memory on several emotional and phenomenological characteristics. They rated the memory for age (in years), vividness (How well do you remember the event? 1 = not well at all, 5 = extremely well), intentionality (How intentional were your actions? 1 = not intentional at all, 5 = extremely intentional), and morality (How morally right or morally wrong were the actions you performed during the event? 1 = very morally wrong, 7 = very morally right). These phenomenological ratings have been used previously in moral autobiographical memory research (Stanley et al., 2017). The participants were then asked to report emotional affect and valence using two scales: 1) a modified version of the I-PANAS-SF (Thompson, 2007), we substituted the question measuring “hostile” with “guilty,” because guilt was most relevant to our moral memory manipulation: “Indicate the extent to which you feel this way at the present moment of the event” (guilty, 1 = very slightly or not at all, 5 = extremely); and 2) SAM (Bradley & Lang, 1994), which is a 5-point pictorial scale that measures emotional pleasure and arousal.

Session 2

The second session began with control and practice tasks to ensure the validity of the methods.

Cue-test and practice phases

An initial test was conducted to ensure that participants could remember the autobiographical memories associated to the cues that would be used in the subsequent Think/No-Think phase. All 22 cues were presented in a randomised order. For each trial, a fixation cross was first presented on a computer screen for 1,000 ms, followed by a cue, displayed for 4,000 ms. Then, a scale (1 2 3) was shown on the screen until participants responded. Participants pressed the corresponding number keys to indicate how well they could remember the memories related to the cues (1 = not well at all, 2 = somewhat well, 3 = extremely well). The cues for memories that participants failed to remember (as indicated by “1” responses) were evenly and randomly distributed to Think and No-Think conditions and were also excluded from further analysis.

The percentage of memories recalled during the cue-test phase did not differ across morally right (M = 96.65%, standard deviation SD = 4.3%) and morally wrong (M = 96.25%, SD = 4.4%) memories (BF10 = 0.21).Footnote 3 There also was no difference between morally wrong or right memories in either reaction times (right: M = 1.94 s, SD = 2.35; wrong M = 2.28 s, SD = 3.12), nor mean ratings of how well they could be remembered (right: M = 2.54, SD = 0.25; wrong: M = 2.48, SD = 0.25) in the cue-test phase (BF10 = 0.66 for reaction times, and BF10 = 0.41 for ratings). Therefore, the cue test indicated that both morally wrong and right memories were typically well remembered and similar in terms of accessibility before the memory suppression task began. Next, participants extensively practiced the Think/No-Think task and how to make intrusion ratings over three separate stages using filler memories, until they understood and were following all instructions (see Supplemental Materials for details).

Think/no-think phase

In the next phase, the 10 morally wrong and 10 morally right cues were pseudo-randomly assigned to the Think and No-Think conditions in equal proportions by the software, resulting in five cues in each of the four conditions (Morally Right Think; Morally Right No-Think; Morally Wrong Think; and Morally Wrong No-Think). EEG was recorded during this phase.

On each trial, a white fixation cross appeared on a black background for 1,000 ms, followed by a green or red cue for 4,000 ms. We instructed participants to think of the associated memory for the entire time that the cue was on the screen, whereas if it appeared in red (No-Think) they were instructed to prevent any thoughts of the associated memory from coming to mind. Participants were asked to use a direct suppression strategy to stop retrieval, which involved two requirements: 1) if the memory happened to intrude, participants were asked to push it out of awareness, while paying full attention to the cue the whole time it was on the screen; and 2) participants were asked to refrain from generating substitute memories, images, or words to distract themselves (Benoit & Anderson, 2012; Bergström et al., 2009b).

A 200-ms black screen followed each cue, before an intrusion rating scale appeared for 1,500 ms. The scale presented the options “1 2 3”; participants selected their option by using a keyboard button press. For both Think and No-Think trials, participants were asked to rate “1” if the associated item never entered awareness during the trial and “3” if the memory was in awareness for the entire trial or repeatedly came to mind. They were instructed to rate “2” only if they thought the associated memory came to mind briefly. Note that during Think trials, participants ideally would select “3,” given their instructions to think of the item and retain it. For the No-Think trials, they would ideally select “1,” given their instructions to suppress the item. An intrusion occurred on No-Think trials if the participant rated either “2” or “3,” despite efforts to suppress retrieval, in line with how intrusions were operationalised in previous research (Benoit et al., 2015; Hellerstedt et al., 2016; Levy & Anderson, 2012).

Cues appeared in a pseudo-random order, one at a time, ensuring that cues from the same Think/No-Think condition were not presented more than three times in a row, in line with previous research (Hellerstedt et al., 2016). After all 20 cues had been presented, a new pseudo-randomised order was generated and all cues were presented one at a time again. This process was repeated 16 times (leading to a total of 320 trials). Participants were given a short break after each set of 40 trials.

Surprise memory test phase

Participants also completed a surprise final recall test using the Qualtrics software where they again described all autobiographical memories after the Think/No-Think phase had been completed and rated the memories again on the same characteristics as in the first session. Final recall and self-reports were analysed but showed no significant effectFootnote 4 of the Think/No-Think manipulation on memories. These methods and results are therefore presented in a supplementary file.

At the end of the second session, participants also completed a compliance questionnaire adapted from questionnaires typically used in earlier studies, such as Hu et al. (2015), to ensure that they had completed the task as instructed (Liu et al., 2021). All participants were compliant with the instructions and therefore were included in further analyses.

EEG recording and pre-processing

The EEG data were recorded with a sampling rate of 500 Hz and bandpass of 0.05-70 Hz using 30 Ag/AgCl electrodes fitted in an EasyCAP, amplified with a BrainAmp DC amplifier. Data were recorded with an average reference, and AFz was used as the ground electrode. Electrodes were placed below and above the right eye to measure vertical eye movements and on both left and right outer canthi to measure horizontal eye movements. Impedances were reduced below 5 kΩ before starting the experiment by gentle abrasion of the scalp using cotton buds and saline gel.

EEG data were pre-processed and analysed using the EEGLAB toolbox (Delorme & Makeig, 2004) for Matlab version R2018b and self-written code. For each participant, data from all electrodes were first re-referenced offline to the average readings of the left and right mastoid electrodes. A 0.1-Hz (two-way least-squares finite impulse response) high-pass filter was applied to the re-referenced data. Then, the continuous EEG data were divided into epochs beginning 1000ms pre-stimulus-onset and ending 4,000 ms poststimulus onset. These data were visually inspected to delete epochs with extremely noisy data or channels. Concatenated epochs were submitted to independent component analysis using Runica from the EEGLAB toolbox, with default extended-mode training parameters (Delorme & Makeig, 2004). Independent components reflecting eye movements and other sources of noise were identified by visual inspection of component scalp topographies, time courses, and activation spectra and were discarded from the data by back-projecting all but these components to the data space. Corrected data were subsequently low-pass filtered digitally at 30 Hz (two-way least-squares finite impulse response filter). Finally, any remaining epochs with large noise were removed based on visual inspection, and epochs were baseline corrected against the −200-0 ms prestimulus period for ERP analysis.

EEG data extraction

We derived two complementary measures from the EEG epochs. First, we computed ERPs by averaging over all trials in each condition for each participant. Second, the single-trial epochs were submitted to a time-frequency decomposition in FieldTrip (Oostenveld et al., 2011), using a complex Morlet wavelet transform to decompose the EEG into estimates of oscillation power across different frequencies and timepoints. Twenty-seven wavelets with centre frequencies ranging between 4-30 Hz (in steps of 1 Hz) were convolved with the EEG data to produce power estimates across time-steps of 5 ms. Each wavelet had a width of three cycles to prioritise temporal resolution over frequency resolution. To remove edge artefacts, epochs were truncated to −625 to 3,500 ms. A prestimulus baseline period of −625 to −375 ms was used to normalise the oscillatory power to decibels (dB), because using a baseline period closer to stimulus presentation may lead to artificial “bleeding” of poststimulus activity into the baseline period.

EEG statistical analysis

The timing and scalp distribution of EEG effects associated with autobiographical retrieval suppression is largely unknown (Hebscher et al., 2020), so statistical analyses investigated possible condition differences across locations and time without focusing on predetermined timepoints or electrodes of interest, while also applying a threshold to control for false positives.Footnote 5 Data from all 28 scalp electrodes and all poststimulus timepoints were split in two separate time windows (0 ms to 1,750 ms, and 1,750 ms to 3,500 ms), which were submitted to nonparametric cluster-based permutation tests by using the FieldTrip toolbox (Oostenveld et al., 2011). This approach was used for both ERP amplitudes and oscillatory power derived from time-frequency decomposition, wherein frequency (ranging from 4-30 Hz) was added as a third dimension along with electrode locations and time-points.

In the first stage, key experimental conditions (see below) were compared by conducting t-tests at every time (and frequency) point at each electrode site, and adjacent data points in the 2-D (ERP) or 3-D (Oscillation) space that showed significant differences at an uncorrected threshold (p < 0.05) between conditions were identified and grouped into clusters. A minimum of two neighbouring electrodes needed to have significant t-test results to be considered a cluster (default parameter). A cluster-statistic was calculated by summing all the t-values in each cluster. In the second stage, a permutation test was applied to the data to determine which of the observed clusters were statistically significant at a corrected threshold. This test involved randomly reshuffling the condition labels using the Monte-Carlo resampling method (5,000 permutations) and calculating a null distribution of cluster-statistics. The observed cluster-statistics were compared against the null distribution to calculate a p-value for each cluster. The distribution of significant clusters in time, space, and frequency dimensions was then used to determine when in time, where in electrode space, and in which EEG frequencies the conditions differed. However, it is important to note that the precise edges and peaks of effects in the spatiotemporal and frequency dimensions cannot be determined with this method (Hellerstedt et al., 2021; Maris & Oostenveld, 2007; Sassenhagen & Draschkow, 2019; for a detailed explanation of the cluster-based permutation test analysis).

For both ERPs and oscillations, the above cluster-based permutation method was first conducted to test for differences in neural activity when retrieving versus suppressing autobiographical memories and to investigate whether such Think/No-Think neural effects differed for morally wrong versus right autobiographical memories. For this analysis, we divided epochs into separate conditions based on a 2 (Instruction Type) x 2 (Memory Type) design in each half of the Think/No-Think task (i.e., the first 8 repetitions of cues corresponded to the first half of trials, and the subsequent 8 repetitions of cues corresponded to the second half of trials) (Depue et al., 2013; Hanslmayr et al., 2009; Hellerstedt et al., 2016). Of a total of 40 trials per condition, when split into halves, 36 trials (on average) remained in each condition after pre-processing (with individuals contributing between 17-40 trials).

We applied this main 2x2 factorial analysis separately in the first half and second half of Think/No-Think trials to investigate if ERP and EEG oscillation effects related to retrieval suppression (main effect of instruction type; the test for differences between the mean of Think vs. mean of No-Think conditions) and the moral nature of memories (main effect of memory type; the test for differences between the mean of Morally Right vs. mean of Morally Wrong conditions) were present or absent in the early versus late parts of the task and whether these two factors interacted in influencing EEG brain activity (i.e., if the Think vs. No-Think differences for Morally Right memories were different from the Think vs. No-Think differences for Morally Wrong memories). If the interaction between factors was significant, we tested the effects of the Think/No-Think manipulation separately for each memory type.

We also conducted a complementary analysis of ERPs and EEG oscillations to investigate the neural correlates of autobiographical memory intrusions, again using the cluster-based permutation method to correct for multiple comparisons. We categorised epochs into three conditions based on Think vs. No-Think condition and participants’ introspective reports of retrieval using the intrusion rating scale. The conditions were: Successful Retrieval, if participants indicated that memories came to mind briefly or often in the Think condition; Intrusions, if participants indicated the memories came to mind briefly or often in the No-Think condition; and Non-Intrusions, if participants reported that the memories did not come to mind in the No-Think condition. This analysis was collapsed across morally right and wrong memory types and TNT halves to ensure sufficient trial numbers for adequate signal-to-noise ratio. Participants with low trial numbers in the intrusions condition were excluded. Thus, only 26 (out of 34) participants qualified for the trial number cutoff (more than 13 intrusion trials, which ensured clean ERPs) in this analysis. There were on average 130 (range 79-158) trials in the successful retrieval condition, 101 trials on average (range 69-131) in the Non-Intrusions condition, and 31 trials on average in the intrusions condition (range 13-64) after pre-processing the data. We conducted three pairwise comparisons for both ERPs and oscillations: 1) Successful Retrieval vs. Non-Intrusions; 2) Intrusions vs. Successful Retrieval; and 3) Intrusions vs. Non-Intrusions.

Results

Behavioural results

Phenomenological and emotional characteristics of autobiographical memories reported during day 1

We first analysed participants’ memory ratings from the generation phase to 1) verify that the memory manipulation was successful and 2) gain an understanding of the nature of morally wrong and right autobiographical memories, and how those memories differed from each other. The following measures were used for further analyses because of a priori predictions that these characteristics may be different across moral memory types: a) memory age (measured in years), vividness (range, 1 to 5), intentionality (range, 1 to 5), and morality (range, 1, morally wrong to 7, morally right); b) from the I-PANAS-SF, two morally relevant emotions were chosen (guilt, shame; range, 1 to 5); and c) both pleasure and arousal were chosen from the SAM measures (range, 1 to 5).

As expected (Table 1), participants felt more guilty, ashamed, less pleasure, and rated their actions as more immoral for morally wrong compared with morally right memories. Additionally, we replicated previous findings that morally wrong memories were rated to have occurred in the more distant past (Escobedo & Adolphs, 2010) and were less vivid (Kouchaki & Gino, 2016) than morally right memories. Moreover, to extend on prior findings, participants reported their actions as less intentional for morally wrong than morally right memories. There was no significant difference between moral memory types in arousal.

Table 1 Mean ratings of memory characteristics for morally right and wrong memories as reported by participants during the generation phase, and tests for differences in these characteristics between the memory types

Memory retrieval and intrusions

Participants’ trial-by-trial ratings during the Think/No-Think task were used to calculate the frequency of trials accompanied by awareness of the associated memory during Think trials and No-Think trials, separately for morally right and morally wrong memories within both the first half of the TNT phase and the second half. Specifically, we computed the percentage of trials where retrieval occurred. Because ratings of “briefly” or “often” involved retrieval, we combined trials with those ratings together, following prior work with this method (Hellerstedt et al., 2016; Levy & Anderson, 2012).

Given the differing instructions across the Think and No-Think conditions, we take self-reported retrieval judgments in the Think condition to indicate voluntary retrieval, whereas we interpret self-reported retrieval in the No-Think condition as intrusions. Because these conditions may measure distinct cognitive processes, a 2 (Memory Type) x 2 (TNT half) repeated measures ANOVA was conducted separately for the ratings in the Think vs. No-Think conditions. We used these percentages to test (a) overall differences in intrusion frequency across morally right and morally wrong memories, (b) whether repeated suppression trials reduced intrusion frequency, as is typically found, and any interactions of these factors. See Fig. 2 for an illustration of these results.

Fig. 2
figure 2

Percentage of trials eliciting memory retrieval compared across memory type and halves of TNT task (first half and second half) in the a) Think condition, reflecting retrieval success and b) No-Think condition, reflecting intrusion frequency. Dots show the percentage of trials with retrieval for each individual. Thick lines show the group means and the boxes depict the 95% confidence interval of the group means

Retrieval success increased over repeated retrieval attempts

Retrieval reports indicated that recall success was very high overall, but nevertheless improved between the first half (M = 90.26%, SD = 6.86%) of the Think/No-Think phase to the second half (M = 92.56%, SD = 7.20%), F(1,33) = 4.52, p = 0.041, partial η2 = 0.12 (Fig. 2a). No overall differences in retrieval success arose for morally right and morally wrong memories, F(1,33) = 0.03, p = 0.87, partial η2 = 0.001), nor was there any interaction between memory type and TNT half, F(1,33) = 0.02, p = 0.89, partial η2 = 0.001). Thus, intentional retrieval was similarly successful for morally right and wrong memories.

Repeated suppression attempts reduced intrusions and morally wrong memories were more intrusive than morally right memories

Morally wrong memories intruded more frequently (M = 20.28%, SD = 11.43%) than morally right memories (M = 17.22%, SD = 11.47%), as indicated by the main effect of memory type: F(1,33) = 4.62, p = 0.039, partial η2 = 0.123. More memory intrusions occurred during the first (M = 25.06%, SD = 12.48%) than the second half (M = 12.44%, SD = 10.34%) of the TNT task, as reflected in the main effect of TNT half: F(1,33) = 78.51, p < 0.001, partial η2 = 0.704. There was no significant interaction between the memory type and T/NT half F(1,33) = 1.56, p = 0.22, partial η2 = 0.045.

ERP results

Figure 3 illustrates the grand average ERPs for the Think/No-Think conditions in the first and second half of trials, separated by morality of the memories, together with topographic maps depicting ERP amplitude differences and t values for key factorial and pairwise comparisons.

Fig. 3
figure 3

Grand average ERPs and results from the cluster permutation tests. ERP graphs show the experimental conditions at midfrontal (Fz, top) and left parietal (P7, bottom) electrode sites during first half and second half of the TNT task. Topographic maps on the top rows illustrate mean ERP amplitude differences between conditions (blue/white/red colourmap, in μV), and topographic maps on the bottom rows illustrates mean t-values (blue/black/red colourmap, in t) for the condition differences as generated through the cluster-based permutation tests. The colour scale represents magnitude and direction of the effect. T-maps have been thresholded to only show significant clusters

First half ERPs related to retrieval and suppression of morally wrong and right memories

Overall, we did not find significant clusters for the morally right vs. wrong memory type comparison, but importantly, there were significant clusters showing differences in ERPs between Think vs. No-Think conditions and the interaction between Instruction Type and Memory Type revealed significant clusters, as described below.

Overall, suppression attenuates retrieval-related ERPs

We found four significant positive clusters from 1,080 to 1,360 ms (p < 0.001, Nelectrodes = 28), 1,500 to 1,670 ms (p = 0.005, Nelectrodes = 26), 2,990 to 3,160 ms (p = 0.005, Nelectrodes = 27), and 3,210 to 3,390 ms (p = 0.015, Nelectrodes = 19), indicating more positive sustained ERP amplitudes during retrieval compared with suppression. These effects were present across left parietal (in the 1,080-1,360-ms cluster), frontal, and central regions (in all clusters) as can be seen from the t-value distributions in Fig. 3. Thus, consistent with our predictions, suppression attempts reduced parietal ERP positivities, and such effects were later in time and more prolonged than previously found with simpler stimuli.

Think vs. no-think ERP differences are present for morally right but not morally wrong memories

The Think/No-Think x memory type interaction analysis showed that the Think vs. No-Think ERP differences was significantly greater for morally right than morally wrong memories between 550-700 ms (p = 0.023, Nelectrodes = 26). Follow-up analyses confirmed that this interaction was driven by a Think > No-Think ERP effect for morally right memories around the same time as the interaction cluster (550-700 ms, p = 0.009, Nelectrodes = 25), whereas no significant clusters were found for morally wrong memories.

Similar positive Think vs. No-Think clusters were also present for morally right memories from around 1,080 to 1,580 ms (p < 0.001, Nelectrodes = 28), 2,150 to 2,340 ms (p = 0.017, Nelectrodes = 28), and 2,940 to 3,400 ms (p < 0.001, Nelectrodes = 19) in frontal and central regions, whereas there were no such significant effects for morally wrong memories (Fig. 3), indicating that the overall Think vs. No-Think differences were primarily driven by effects for morally right memories.

Second half ERPs related to retrieval and suppression of morally wrong and right memories

In the second half of trials, we only found only one significant cluster, revealing more positive ERPs for Think compared with No-Think items, from around 1,000-1,230 ms (p = 0.013, Nelectrodes = 21) across left-posterior and central regions (Fig. 3). This analysis therefore showed that suppression of autobiographical memories reduced late left parietal positivities also in the second half of trials, in line with our predictions. There were however no significant ERP differences between right vs. wrong memory types, nor any significant clusters for the memory type x Think/No-Think instruction type interaction, suggesting that the neurocognitive processes engaged during retrieval and suppression did not reliably differ for morally right and wrong memories in the second half.

We also tested whether the Think vs. No-Think positivity between morally right and morally wrong memories found in the first half was significantly greater than the second half, but the cluster did not reach the threshold for significance (p = 0.04, which is nonsignificant at the two-tailed alpha = 0.025 threshold).

Intrusion-related ERPs

Successful retrieval elicited more positive ERPs than nonintrusions from around 950 ms, lasting until the end of the epoch (3,500 ms; Fig. 4). This was reflected by significant positive clusters from 950 to 1,340 ms (p = 0.001, Nelectrodes = 27), 1,350 to 1,580 ms (p = 0.001, Nelectrodes = 25), and 1,750 to 3,500 ms (p < 0.001, Nelectrodes = 28). This effect was spread across left posterior regions in the early clusters and in the central and anterior regions across all clusters. When comparing successful retrieval vs. intrusions, one significant negative cluster arose from 50 to 200 ms (p = 0.01, Nelectrodes = 23), caused by more positive ERPs for intrusions than retrieval trials across anterior regions. However, there were no significant clusters when comparing intrusions and nonintrusions. The lack of significant differences between intrusions and the other ERP conditions may be explained by the low number of trials for intrusions compared with nonintrusions and successful retrieval trials (see Methods section), leading to noisy intrusion-related ERPs, which may have resulted in low statistical power.

Fig. 4
figure 4

Grand average ERPs related to autobiographical memory intrusions and results from the cluster permutation tests comparing the intrusion ERP conditions. ERP graphs show the experimental conditions at mid frontal (Fz, top) and left parietal (P7, bottom) electrode sites. Topographic maps in the top row show the mean ERP amplitude differences between conditions (blue/white/red colourmap, in μV). Topographic maps in the bottom row illustrate t-values (blue/black/red colourmap, in t) for the differences. The colour scale represents magnitude and direction of the effect. T-maps have been thresholded to only show significant clusters

EEG oscillation results

Figure 5 reports time-frequency plots from a right parietal (P4) electrode for the Think/No-Think conditions in the first and second halves of the TNT phase, separated by morality of the memories, together with topographic maps depicting oscillatory power (dB) differences and t-values for key factorial and pairwise comparisons.

Fig. 5
figure 5

Results of the EEG oscillation analysis comparing retrieval and suppression of morally wrong and right autobiographical memories. (A) Time-frequency plots from a right parietal (P4) electrode showing the main effect of the Think vs. No-Think manipulation (top row) and the TNT effect separately for morally wrong and right memories (middle and bottom rows respectively) in first half (left panel) and second half (right panel) of the TNT task. Mean power differences are illustrated in the blue/white/red colourmap and t-values for the differences are represented in the cold/black/hot colour map. (B) Topographic maps of power (dB) differences (top rows, blue/white/red colourmap) and t-values for the differences (bottom rows, cold/black/hot colour map) between Think – No-Think conditions at theta (top) and alpha/beta (bottom) frequency bands. T-maps have been thresholded to only show significant clusters

First-half EEG oscillations related to retrieval and suppression of morally wrong and right memories

We did not find significant clusters when comparing morally right and wrong memories, either collapsed or when separated between the Think and No-Think conditions. The interaction contrast comparing Think vs. No-Think oscillation differences between right and wrong memories also did not reveal any significant clusters.Footnote 6

We did find a general Think vs. No-Think synchronisation effect across the whole frequency band (4–30 Hz), with enhanced oscillatory power for Think compared with No-Think trials starting from around 330 ms after the reminders were shown, lasting until the end of the epoch 3,500 ms (p < 0.001, Nelectrodes = 28). This effect was maximal across parietal scalp regions (Fig. 5). To assess whether this effect was present for both morally right and wrong memories, we compared Think vs. No-Think trials separately for each memory type. For morally right memories, we found a large Think vs. No-Think synchronisation effect across the whole frequency band (4–30 Hz) from 300 ms to 3,500 ms (ps < 0.001, Nelectrodes = 28), maximal across parietal regions. A similar synchronisation effect also was present for morally wrong memories, but the significant cluster was found later in time, from around 950 to 1,750 ms and included frequencies from 6 to 30 Hz (p < 0.001, Nelectrodes = 28), whereas from 1,750 to 3,500 ms, the synchronisation effect was significant across the whole frequency range (4–30 Hz). This effect also was maximal across parietal regions.

Second-half EEG oscillations related to retrieval and suppression of morally wrong and right memories

As in first half, the interaction contrast comparing Think vs. No-Think oscillation differences between morally right vs. wrong memories was not significant.Footnote 7 There were no main effect differences between morally right vs. wrong memories. A Think vs. No-Think synchronisation effect was again found in the second half, which was similar to the effect in the first half (Fig. 5). The Think vs No-Think main effect cluster showed significantly higher power in the Think than No-Think condition across the whole frequency range (4–30 Hz) from around 440 to 3,500 ms post-stimulus onset (p < 0.001, Nelectrodes = 28). For morally right memories, the effect was present across the whole frequency band (4–30 Hz) from around 90 to 3,500 ms (ps < 0.001, Nelectrodes = 28). Similarly, for wrong memories, the significant clusters incorporated the whole frequency band (4–30 Hz), but the effect started later lasting from 600 ms until 3,500 ms (both ps < 0.001, Nelectrodes = 28).

Thus, EEG oscillations in both halves were similar and did not vary across memory types but highly significantly differed depending on whether participants were trying to retrieve or suppress autobiographical memories, with reduced power across theta, alpha, and beta frequencies during suppression compared with retrieval, consistent with predictions.

Intrusion-related EEG oscillations

We next compared EEG oscillations across trials based on whether participants successfully retrieved autobiographical memories in the Think condition, successfully suppressed memories in the No-Think condition, or failed at suppression and experienced intrusions of autobiographical memories during No-Think trials. When comparing successful retrieval with non-intrusion suppression trials, a synchronisation effect was present across the whole frequency range (4-30 Hz) starting around 400 ms to 3,500 ms (both ps < 0.001, Nelectrodes = 28), maximal across parietal regions (Fig. 6). Successful retrieval was also associated with a similar synchronisation effect compared to intrusions, again spanning across the whole frequency range (4-30Hz) and starting around 650 ms to 3,500 ms (both ps < 0.001, Nelectrodes = 28). These differences therefore corresponded closely to the effects of Think/No-Think instruction observed in the earlier factorial analysis of EEG oscillations, with enhanced theta, alpha, and beta power during retrieval compared with suppression. Interestingly, when comparing intrusions with nonintrusions, a significant synchronisation effect was found in the alpha and beta bands (7-30 Hz), late in the epoch, around 2,800 ms to 3,500 ms (p = 0.008, Nelectrodes = 28). As shown in Fig. 6, this difference between intrusions and non-intrusions was caused by a more sustained alpha/beta desynchronisation effect during nonintrusion trials. In contrast, during intrusion trials, although there was an initial alpha/beta desynchronisation, this effect weakened during the latter part of the epoch, where the power in this condition returned to levels similar to the baseline period.Footnote 8

Fig. 6
figure 6

Results of the EEG oscillation analysis comparing intrusions with successful retrieval and successful suppression trials. (A) Top row shows time-frequency plots for each condition separately from a right parietal electrode (P4), and the second and third rows show pairwise differences between conditions. Mean power (in dB) is illustrated in the blue/white/red colourmap and t-values for the differences are represented in the cold/black/hot colour map. (B) Topographic maps below show power (dB) differences (top rows, blue/white/red colourmap) and t-values for the differences (bottom rows, cold/black/hot colour map) between conditions at theta (top) and alpha/beta (bottom) frequency bands. T-maps have been thresholded to only show significant clusters

Discussion

In this experiment, we examined the neurocognitive mechanisms underlying the suppression of autobiographical memory retrieval and investigated how the moral nature of memories affects our ability to stop them from intruding into awareness. The results suggest that retrieval of autobiographical memories can be prevented by intentional suppression attempts and that repeated suppression attempts reduce intrusions. Participants’ memories of their morally wrong actions were particularly intrusive compared with their memories of morally right actions, indicating that controlling memories that threaten our self-concept may be especially challenging. ERP and EEG oscillation markers of autobiographical suppression and intrusions were found to be mostly in-line with findings from prior research investigating suppression of simple laboratory materials. However, the time-course of these ERP and EEG effects revealed interesting new evidence of how autobiographical memory retrieval and control processes change over time. Our findings thus provide novel behavioural and neural evidence concerning the control of multifaceted and emotionally charged real life memories.

Behaviourally, we found that the frequency of autobiographical memory intrusions reduced over repeated suppression attempts, a replication of a reliable effect in previous TNT research with simpler stimuli (Benoit et al., 2015; Davidson et al., 2020; Gagnepain et al., 2017; Harrington et al., 2021; Hellerstedt et al., 2016; Mary et al., 2020; van Schie & Anderson, 2017). Furthermore, morally wrong memories intruded more frequently than morally right memories, in general. Research with the TNT paradigm on how the emotional nature of memories affects their intrusiveness is scarce. In the three studies that have investigated this issue, negative memories were found to be more intrusive (Davidson et al., 2020), numerically less intrusive (Gagnepain et al., 2017), or not differently intrusive (Harrington et al., 2021) than neutral memories. Participants in this study self-reported that they felt more guilty, ashamed, and overall more negative while retrieving morally wrong compared with morally right memories, suggesting that emotionally negative memories are more intrusive than positive memories. Importantly, this finding provides new insights by investigating this issue with complex autobiographical memories that reflects the types of unwanted memories we often confront in everyday life.

The findings are in line with suggestions that memories of our own morally wrong behaviour may be particularly memorable (Stanley et al., 2018). Intrusive memories are likely to require effortful cognitive control processes to prevent them from coming to mind (Anderson & Hanslmayr, 2014; Benoit et al., 2015), which may make such memories more vunerable to motivated forgetting in the longer term (Kouchaki & Gino, 2015, 2016; Shu & Gino, 2012). However, it is important to note that we did not observe subsequent changes in how the morally wrong (or right) memories were described on a final test as a result of the Think/No-Think manipulation in this study (see supplementary analyses). Most prior evidence for morally motivated forgetting comes from testing changes in phenomenology of episodes encoded in the lab, or reduced memory accuracy for imagined events (Kouchaki & Gino, 2016; Shu et al., 2011) rather than memory for events that people have personally experienced in their everyday life (Stanley et al., 2018). Some evidence suggests that personal autobiographical memories may not necessarily be completely forgotten after suppression but may instead change in more subtle ways to become less detailed (Noreen & Macleod, 2013; Stephens et al., 2013). Core details of important events in people’s past however may be more resistant to forgetting, especially those details that they have repeatedly dwelled on over long time periods (Noreen & Macleod, 2013). Our method for assessing memory changes may not have been sufficiently sensitive to such subtle and complex effects. Nevertheless, participants’ intrusion ratings showed that autobiographical memories did become less intrusive over repeated attempts at suppression, suggesting that some changes to those memories occurred within the session. Future research should investigate whether these suppression-induced changes to memory intrusiveness are long-lasting and also should use more sensitive measures of autobiographical forgetting.

ERP correlates of autobiographical memory suppression and intrusions

Successful retrieval suppression led to a sustained reduction in positivity compared with retrieval, which began around 750 ms after cue onset and lasted until the end of the epoch at 3,500 ms. This reduced positivity was initially strongest across left parietal scalp regions but later was maximal across frontocentral regions. Similar but weaker effects arose when contrasting ERPs by Think/No-Think condition regardless of self-reported success, suggesting that these differences were primarily driven by trials where both suppression and retrieval attempts succeeded (approximately 90-95% of trials for retrieval attempts, and 75-85% of trials for suppression attempts).

The earlier parietal modulation resembles the left-parietal ERP positivity thought to index conscious recollection (Rugg & Curran, 2007; Wilding, 2000), including during autobiographical retrieval (Renoult et al., 2015; Tanguay et al., 2018). A suppression-induced reduction in left-parietal ERP positivity has been widely found in the EEG literature on memory control with simpler stimuli (Bergström et al., 2007, 2009a, 2009b; Chen et al., 2012; Depue et al., 2013; Hanslmayr et al., 2009; Hellerstedt et al., 2016). These results therefore replicate this suppression effect with autobiographical memories (Bergström et al., 2013; Hu et al., 2015). The timing of the parietal ERP reduction was somewhat later than is usually found; it was maximal around 750-1,000 ms poststimulus onset, compared with peaks occurring around 500-800 ms poststimulus in some previous TNT research with simpler stimuli (Bergström et al., 2007, 2009a, 2009b). However, cued recall success has been associated with slow and sustained positive slow-drifts that peak later than 500-800 ms (Allan & Rugg, 1997). Some prior research found that retrieval suppression reduced sustained ERP positivities between 1.5-5 s, with the reduction of positivity between 1.5-2.5 s suggested to index successfully prevented recollection (Hanslmayr et al., 2009). Our relatively late and sustained Think>No-Think ERP effect therefore is consistent with recollection of autobiographical memories unfolding slowly and gradually over time (Conway & Pleydell-Pearce, 2000, p. 200; Daselaar et al., 2008; Hebscher et al., 2020; Inman et al., 2018; McCormick et al., 2015). Importantly, this finding suggests that suppression can interrupt brain processes that normally contribute to successful autobiographical recollection.

In the later part of the epoch, successful suppression was associated with more negative frontocentral slow waves compared with successful retrieval. This effect was sustained between 1,500 to 3,500 ms. Similar frontal slow waves have been found in previous Think/No-Think research with simpler stimuli, and a reduction in frontal positivity during suppression has been suggested to reflect cognitive control processes that are engaged to keep the memory from entering awareness (Depue et al., 2013; Hanslmayr et al., 2009). Therefore, our results provide neurophysiological and behavioural evidence that retrieval of autobiographical memories can be successfully suppressed and may recruit similar neurocognitive processes as the suppression of simpler memories.

In contrast to the large ERP effects related to successful suppression, there was less clear evidence for ERP markers of intrusions. There was a very early difference of 40-200 ms across anterior regions with more negative ERP amplitudes for successful retrieval compared with intrusions, and numerically also for nonintrusions vs. intrusions (although the latter comparison was not significant). This time region is usually reflected by P1 and P2 peaks, that are thought to reflect early selective attention processes (Bergström et al., 2009b; Hellerstedt et al., 2016). Therefore, this effect could indicate enhanced attentional allocation towards the No-Think cue on trials when an intrusion occurred, or conversely, reduced allocation of attention during nonintrusions, consistent with an explanation whereby early attentional control to cues is important for preventing intrusions.

During the latter part of the epoch, whereas nonintrusion ERPs were more negative than retrieval ERPs, ERPs associated with intrusions were not significantly different from either retrieval or nonintrusion ERPs. Numerically, the left parietal ERP effect was in line with Hellerstedt et al.’s (2016) findings. Intrusions elicited similar ERP amplitudes to nonintrusions between ~500-1,000 ms post-stimulus-onset. However, from ~1,000 ms onwards across central and frontal scalp regions, intrusion ERP amplitudes fell in between nonintrusions and intentional retrieval, indicating that intrusions made ERPs more similar to intentional retrieval. However, because these effects did not survive statistical thresholding, we do not interpret them further. The inconclusive statistical results likely occurred due to the overall low number of intrusions, which impaired the signal-to-noise ratio of those ERPs thereby decreasing statistical power for this analysis (which is especially low with conservative cluster-based thresholding). Future research is needed with increased numbers of intrusion trials to detect reliable ERP markers of autobiographical intrusions.

When comparing the effects of suppression on ERPs for morally right vs. wrong memories, we found that during the first half of the TNT trials, an ERP positivity during retrieval compared to suppression attempts was found for morally right memories but not morally wrong memories. This Think > No-Think positive difference for morally right memories was most reliable around 500-600 ms poststimulus onset and peaked across central and frontal regions. Although ERP modulations indexing the suppression of recollection often peak over the left parietal scalp (Bergström et al., 2009a, 2009b), recall-related ERPs can be spread across the whole scalp (Hellerstedt et al., 2021) and are known to emerge from around 500 ms after a reminder is encountered (Staresina & Wimber, 2019). Therefore, the ERP positivity for retrieving compared to suppressing morally right memories likely indexes differences in recollection-related activity for these memories, whereas control over recollection may not have been as successful for morally wrong memories during these first attempts at suppression. Therefore, in line with behavioural self-reports, ERPs indexed the difficulty in avoiding retrieval of morally wrong but not morally right memories.

During the second half of trials, Think/No-Think ERP effects were no longer significantly different between morally right and wrong memories, and instead the results only showed general Think > No-Think ERP positivities, similar to the first half of the trials, as described above. We did not find, however, that this difference between Think and No-Think ERPs across memory types differed reliably across the two halves of the task. Similarly, we did not find significant evidence that the reduction in intrusion rates across halves was larger for morally wrong than for morally right memories, although there was a numerical pattern in that direction. Although these patterns were not statistically significant, previous research has found that cognitive processing during early stages of the TNT task can differ from processing in the latter stages (Depue et al., 2007, 2013; Gagnepain et al., 2017; Hanslmayr et al., 2009; Hellerstedt et al., 2016; Hulbert et al., 2016). It is possible that our design was underpowered to detect these changes with our relatively conservative statistical analysis.

Oscillatory correlates of autobiographical memory suppression and intrusions

The EEG oscillation results further indicated that retrieval of personal autobiographical memories can be suppressed. Overall, we found a strong decrease in oscillatory power during successful suppression compared to successful retrieval across all analysed frequencies (4-30 Hz), maximal across parietal regions. This effect began around 500 to 1,000 ms after the reminder appeared and was sustained for as long as the reminder was presented on the screen (3,500 ms). This broadband and sustained desynchronisation effect resembles the oscillatory correlates of successful memory suppression described in previous research with simpler episodic memories (Crespo-García et al., 2022; Ketz et al., 2014; Legrand et al., 2020;Quaedflieg et al., 2020 ; Waldhauser et al., 2015).

Previous research indicates that memory-related power differences in the theta band (4-7 Hz) versus alpha/beta bands (8-30 Hz) reflect different cognitive processes. Theta band increases often are related to retrieval success (Hanslmayr et al., 2016; Nyhus & Curran, 2010; Osipova et al., 2006), consistent with our findings of enhanced theta power during retrieval attempts. In contrast, No-Think trials were associated with a significant theta power desynchronisation effect, likely reflecting successful avoidance of retrieval. The results also showed sustained alpha/beta power decreases for No-Think vs. Think reminders. Alpha/beta desynchronization often is found during memory retrieval (Hanslmayr et al., 2016). In our results, both retrieval and suppression trials were associated with an initial alpha/beta desynchronisation effect within the first second after the cue was shown. Critically, however, alpha/beta desynchronization was sustained throughout the epoch—from 1 s to the end of the epoch—during memory suppression only, suggesting that the initial versus latter effects were functionally dissociable. Late, long-lasting alpha/beta power decreases for No-Think vs. Think reminders have been argued to reflect sustained control over the memory for as long as the participant is exposed to the reminder (Lin et al., 2021; Waldhauser et al., 2015). Therefore, the pattern of alpha/beta changes in our study indicates that suppression recruited sustained control processes to prevent autobiographical memories from intruding into awareness.

The theta and alpha/beta reductions described above were found consistently for both morally right and wrong memories, and both during first and second halves of the TNT task, showing these effects to be highly reliable as an index of autobiographical memory suppression. The lack of EEG oscillation differences between memory types indicates that the oscillations reflected at least partially different neurocognitive processes than the ERPs, which did show differences by memory type. Alternatively, our statistical approach for analysing EEG oscillations may not have been powerful enough to detect subtle differences in neurocognitive processes between morally wrong and right memories that was detectable in the ERP analysis.

Interestingly, correlates of memory intrusions were more apparent in the EEG oscillation analysis, contrasting with the weaker ERP findings for intrusions. Autobiographical memory intrusions were associated with a significant increase in parietal alpha/beta (8-30 Hz) power compared with nonintrusion trials, around 2,500-3,500 ms post-stimulus-onset. This result contrasts with some previous findings that intrusions decreased rather than increased oscillatory power compared with successful suppression (Castiglione et al., 2019; Legrand et al., 2020). However, those prior effects were generally observed earlier in time and tended to have more frontal topographies than the intrusion effect found in the present study. Earlier frontal increases in alpha/beta oscillatory power are thought to reflect the engagement of top-down inhibitory processes that are recruited to stop unwanted memories from coming to mind (Castiglione et al., 2019), similar to motor-action stopping.

The much later, more posterior intrusion-related oscillation changes in the current study could reflect a functionally different mechanism. As described earlier, sustained decreases in alpha/beta oscillatory power has been suggested to index maintenance of control over retrieval (Quaedflieg et al., 2020; Waldhauser et al., 2015). In the current study, both intrusion and nonintrusion trials were associated with an initial decrease in alpha/beta power, but towards the last second of the epoch (around 2.5 s after the reminder was shown), alpha/beta power for intrusion trials returned to near baseline levels. In contrast, decreases in alpha/beta oscillatory power were sustained until the end of the nonintrusion trials. Hence, the late intrusion-related increase in alpha/beta oscillatory power could reflect a failure to maintain control throughout the time the reminder was shown (Lin et al., 2021). This interpretation is consistent with behavioural findings that intrusions are more likely if the No-Think reminder is shown for longer to participants, suggesting that people’s memory control ability dissipates over time (van Schie & Anderson, 2017). Therefore, the present results provide novel oscillatory evidence that autobiographical memory intrusions could occur due to a lapse in sustained control.

Even though autobiographical memory intrusions clearly did occur as evident by both self-reports and a unique pattern of EEG effects, participants reported experiencing intrusions less frequently in this study (on average 20% of trials) than in previous studies with simpler memories of paired word or picture associates (around 35% on average; Levy & Anderson, 2012). This disparity could be due to relatively weak cue-memory associations in our autobiographical memory version of the paradigm reducing the likelihood that cues would elicit recall, which is a common issue in autobiographical memory research in general (St Jacques & De Brigard, 2015). In contrast, in memory-suppression paradigms using overlearned paired associates, cues may be more likely to reactive the associated memory automatically (Hellerstedt et al., 2016). As often found in the memory suppression literature, individual levels of self-reported intrusions also varied widely, which may relate to relatively stable individual differences in executive functions that determine how well people can suppress unwanted retrieval (Levy & Anderson, 2008) and/or individual differences in mental health (Stramaccia et al., 2021). Future research should investigate how people’s ability to suppress morally relevant autobiographical memories relate to such individual differences.

One issue that is not clear from the present study is why morally wrong autobiographical memories were more difficult to suppress than were morally right memories. Our findings are in line with predictions from Stanley and De Brigard (2019), who suggested that the guilt, shame, and self-threatening aspects of these memories would make them particularly intrusive. However, further research is required to determine which specific features of morally wrong memories render them more intrusive than morally right memories. Morally wrong memories could be particularly memorable, because they threaten our belief that we are inherently good (Stanley & De Brigard, 2019) and are associated with negative emotions of guilt and/or shame. Alternatively, the more intrusive nature of morally wrong memories found in this study could simply be due to the general negative valence of morally wrong memories without being specifically related to morality-relevant memory characteristics. Therefore, further research that separates contributions to intrusiveness from emotions, morality and how strongly a memory is threatens the belief of a good-self will be imperative to understand why these memories could be more difficult to control in everyday life.

A major limitation of autobiographical memory research is the difficulty in verifying the accuracy of memories that participants report. Indeed, in the present study, it is possible that participants did not report their worst morally wrong actions because of demand characteristics, desire for self-preservation, or because they may have already suppressed such memories before participating in the study. We accepted this limitation as we prioritised the high ecological validity of the autobiographical memory procedure. Future research could investigate suppression of lab-induced memories of morally wrong actions to provide stronger evidence on this issue.

Conclusions

The results of this experiment provide both behavioural and electrophysiological evidence that people can intentionally suppress unwanted autobiographical memory retrieval as early as half a second after a reminder is encountered. Autobiographical retrieval suppression can fail however and result in unwanted memories intruding into awareness, which may be caused by a lapse in sustained control over the memory. Such intrusions reduce over repeated attempts however, indicating better memory control with practice and repetition. This pattern of results both converges with and extends previous findings, because it is the first demonstration of control over intrusions of real-life, self-relevant autobiographical memories (but see also Noreen & MacLeod, 2013; Noreen et al., 2016; Stephens et al., 2013). Both behavioural and EEG evidence also suggests that intentional retrieval suppression may be more difficult for memories of our morally wrong actions than for memories of our morally right actions. Importantly however, the results indicate that even such personal and emotionally negative memories can be controlled, which may be helpful for mental health and wellbeing in our everyday lives.