Introduction

Depression affects more than 264 million adults annually (James et al., 2018), and it is associated with a broad range of cognitive and emotional problems (Gotlib & Joormann, 2010). In particular, depressed adults often present with impaired episodic memory (Rock et al., 2014; Zakzanis et al., 1998). For instance, relative to nondepressed individuals, depressed adults have displayed overgeneral autobiographical memory retrieval (Dalgleish et al., 2007; Hallford et al., 2021) and poorer recollection (MacQueen et al., 2002; Ramponi, Barnard, and Nimmo-Smith, 2004). Most importantly for the current study, meta-analyses indicate that depressed adults often show relatively poor memory for positive material, whereas memory for negative material is unaffected or even enhanced (Burt et al., 1995; Matt et al., 1992).

Difficulty retrieving positive memories is clinically relevant, because it may weaken an individual’s ability to regulate their emotions and interrupt ongoing depressive states (Dalgleish & Werner-Seidler, 2014; Joormann & Siemer, 2004). If the mechanisms underlying the positive memory deficit were well-characterized, they could be targeted for treatment. Unfortunately, however, the relevant psychological and neural processes remain poorly understood.

We proposed that poor memory for positive events in depression reflects suboptimal encoding (Dillon, 2015; Dillon & Pizzagalli, 2018). This hypothesis rests on several observations, the first of which is of growing behavioral evidence, which shows that depressed adults perform poorly on reward-processing tasks compared with their nondepressed counterparts (Halahakoon et al., 2020 review and a meta-analysis). Recent studies also show that this reward task performance deficit is driven by dysfunctional dopaminergic circuits (Pizzagalli, 2014; Proudfit, 2015; Treadway and Zald, 2011). Furthermore, there is strong evidence that dopamine (DA) neurons within dopaminergic circuits signal positive prediction errors (PPEs)—that is, they fire strongly when an event is better than expected (Schultz, 2016). Findings from animal (Bethus et al., 2010; Lisman and Grace, 2005) and healthy adult (Jang et al., 2019) studies suggest that PPEs strengthen memory formation for positive events. Taken together, these observations form the cornerstone of our hypothesis; namely, that depressed adults would generate weak dopaminergic PPEs in response to positive events, which would result in poor encoding of those events into long-term memory.

Supporting this hypothesis, we used functional magnetic resonance imaging (fMRI) to show that responses within the ventral tegmental area/substantia nigra (VTA/SN)—where DA neurons are most densely concentrated (Haber & Knutson, 2010)—positively predicted memory for rewarded stimuli in nondepressed controls but not in adults with Major Depressive Disorder (MDD) (Dillon et al., 2014). The fact that VTA/SN activation was weaker in participants with MDD, and that this effect helped explain the group difference (controls > MDD) in memory for rewarded stimuli, supports the hypothesis that weak PPEs, signaled by DA neurons, play a causal role in the positive memory deficit seen in MDD.

However, this prior study (Dillon et al., 2014) had two key limitations. First, a reward or nonreward outcome was delivered on every trial, and the two kinds of feedback were equiprobable (i.e., half of the encoding trials were rewarded). This is not ideal because PPE magnitude depends partly on surprise; it would be better to present rewarding stimuli less frequently, as rare deviations from a predictable sequence of stimuli. Such an arrangement should elicit robust PPEs and enhance memory in nondepressed controls, facilitating detection of the negative effects of blunted PPEs on memory in depressed adults. Second, participants in our prior study knew that their memories would be tested, raising the possibility of group differences in encoding strategies. Administering a surprise memory test would avoid this problem.

To address these concerns and provide additional insight into mechanisms underlying poor memory for positive material in depression, we designed an oddball task followed by a recognition memory test and administered it to adults with high versus low depressive symptoms. At encoding, participants viewed an equal number of positive, negative, and neutral pictures presented as rare targets (“oddballs”) amidst a stream of rectangles (“standards”). Because the pictures appeared infrequently and unpredictably, we expected them to elicit prediction errors (PEs) whose signs should depend on the picture type: positive pictures should elicit PPEs, and negative pictures should elicit negative PEs (because of their neutral content, neutral pictures should elicit smaller and more weakly signed PEs than valenced pictures). Because depression weakens PPEs, we expected that positive oddballs—but not negative or neutral oddballs—would be poorly encoded in depressed adults. To test this prediction, we administered a recognition memory test for the oddballs one day later. Participants were not forewarned about the test, which was delayed by 24 hours, because the effects of DA release on memory may be consolidation-dependent (Bethus et al., 2010; but see Jang et al., 2019). Our primary hypothesis was that, relative to participants with low depressive symptoms, the more severely depressed adults would show poorer memory for positive pictures.

Our intention was to acquire electroencephalography (EEG) data at encoding so that we could use the P300 event-related potential as a marker of PE strength (Polich, 2007; Donchin and Coles, 1988). Unfortunately, however, the COVID19 pandemic precluded this plan. We thus conducted the study online, which presented a problem: the oddball task is very simple, and without EEG recordings there was no way to quantify PPEs at encoding. Therefore, we focused on analysis of the recognition memory data. Our rationale was that if PPEs elicited in the oddball task enhanced the encoding of positive pictures more strongly in nondepressed versus depressed adults, then it should be possible to detect the downstream consequences of these effects in participants’ recognition memory performance.

Specifically, we applied the Drift Diffusion Model (DDM) (Ratcliff and McKoon, 2008) to the recognition data. The DDM is a useful tool for breaking retrieval down into distinct cognitive processes (Ratcliff, 1978), and it has successfully detected negative effects of depression on cognition in prior studies (Lawlor et al., 2020; White et al., 2009, 2010). As illustrated in Fig. 1, the DDM quantifies cognitive processes that support memory retrieval and that cannot be detected in standard analysis of mean accuracy and response times (RTs) (Ratcliff, 1978; Wiecki et al., 2013). The model is based on the assumption that the retrieval process consists of accumulating evidence (e.g., from memory networks in the brain) that is necessary to decide whether an item was seen before (is “old,” from encoding) or not (is “new,” a lure). Individuals arrive at their decision by accumulating evidence over time from an initial starting point, z, towards one of two response boundaries that represent possible responses (“old” or “new”). The starting point, z, ranges from 0-1; values <0.5 indicate a bias towards the lower boundary (“new” in Fig. 1) and values >0.5 reflect a bias towards the upper boundary (“old” in Fig. 1). The distance between the two boundaries is represented by the parameter a, which indicates the amount of evidence that needs to accumulate before reaching a decision. Although evidence is assumed to accumulate noisily on a trial-by-trial basis (e.g., the green or blue lines in Fig. 1), the average rate of accumulation across trials (the “drift rate,” e.g., the solid black line in Fig. 1) is represented by the parameter v. The magnitude of the drift rate indicates the speed with which evidence is accumulated, such that higher absolute values reflect faster and more efficient evidence accumulation. The sign of v (negative or positive) indicates the direction towards which evidence is being accumulated (e.g., either negatively towards “new” or positively towards “old” in Fig. 1). Finally, the t0 parameter represents the time allotted to all “nondecision” processes in which participants engage outside of accumulating evidence for a given choice (e.g., stimulus perception and motor functions). By modeling the decision process dynamically, the DDM jointly estimates both choice proportions (represented by the proportion of trials in which the upper vs. lower boundary was reached) and response times (represented by the amount of time taken to reach a boundary), thus providing a well-constrained model of decision-making as it unfolds over time.

Fig. 1
figure 1

Drift diffusion model. Note. Drift Diffusion Model (DDM) used to model recognition data. The model includes two response boundaries: an upper boundary corresponding to “old” responses and a lower boundary for “new” responses. Starting bias (z) quantifies whether a participant is more likely to respond “old” or “new” before the beginning of each trial. Drift-rate (v) captures the speed of evidence accumulation: the average time it takes to reach a boundary on each trial. This example shows relatively fast evidence accumulation towards the new boundary (dark blue line), and relatively slow evidence accumulation towards the old boundary (green line). The decision threshold (a) quantifies the distance between the two boundaries

The DDM is helpful in the current context because one model parameter is especially sensitive to differences in encoding quality. Specifically, manipulations that influence encoding strength primarily affect the drift rate at recognition while leaving the other DDM parameters unaffected (Criss, 2010). Consequently, if depression blunts the PPEs elicited by positive oddballs at encoding, then that should result in a lower drift rate when those stimuli are presented again at recognition. Consistent with this argument, a prior study reported that nondysphoric students showed higher drift rates for positive vs. negative words presented in a recognition test, but this effect was not present in dysphoric students (White et al., 2009). This prior study (White et al., 2009) is thus encouraging, and we sought to build upon it by testing our hypothesis about prediction errors in samples drawn from the adult population.

To this end, we used the Beck Depressive Inventory-II (BDI-II; Beck et al., 1996) to identify 49 adults with high BDI-II scores (HBDI group) and 46 with low BDI-II scores (LBDI group). These two groups then completed the oddball task and the memory test using an online platform. We predicted that memory for positive oddballs would be worse in the HBDI versus LBDI group. We used the Hierarchical Drift Diffusion Model (HDDM; Wiecki et al., 2013), a Bayesian implementation of the DDM, to test the prediction that that this deficit would be driven by slow accumulation of positive memory evidence (i.e., lower drift-rates for “old” positive images) in the HBDI group at retrieval. As described above, we hypothesized that this pattern would emerge as a downstream consequence of weak PPEs in the HBDI group during encoding.

Materials and methods

Participants

Participants were recruited through the Harvard SONA system (n = 95; Table 1). Prospective participants were eligible if they were fluent English speakers between ages 18 and 55 years, with BDI-II scores less than 6 (LBDI) or greater than 20 (HBDI). We chose 20 as the high cutoff score, because this is the recommended threshold for identifying adults with at least moderate depression (Beck et al., 1996). We chose 6 as the low cutoff, because to meet this criterion, adults would have to have minimal depressive symptoms such that the two groups would be distinct (i.e., no participants with mild symptoms). While it would be useful to study behavior across the full range of depressive symptoms, we used an “extreme groups” design (HBDI vs. LBDI) to maximize our chances of detecting the expected effect with relatively few participants (Preacher, 2015). All participants were compensated with one study pool credit or $15 per hour and assented to a protocol approved by the Mass General Brigham Institutional Review Board. No formal power analysis was conducted for this study. Instead, sample sizes were selected to be similar to or larger than those used in prior work (e.g., 21 subjects per group in Dillon et al., 2014; 17-24 subjects per group in White et al., 2009).

Table 1 Demographic and self-report data

Self-report measures

Participants provided basic demographic information and completed the BDI-II questionnaire using Research Electronic Data Capture (REDCap) software (Harris et al., 2019). Consistent with the IRB protocol, participants were not presented with BDI-II question 9, which assesses suicidality.

Encoding and retrieval tasks

Stimuli

The pictures used in the oddball task were selected from a separate picture norming study, which was used to generate pairs of similar images such that every picture shown in the oddball task would have a closely matched lure at recognition; commonly used image sets, such as the IAPS system (Lang et al., 1997), are not ideal for this purpose. See the Supplemental Methods for details of the picture norming study. Once that study was complete, 240 pictures were selected for the oddball task. This final set consisted of 120 picture pairs (40 positive, 40 negative, and 40 neutral). As described in the Supplemental Methods, the positive and negative images differed on valence but not arousal, and the neutral images differed from emotional images on both valence and arousal. One image from each pair was randomly selected to serve as a target for each participant, with the other image serving as a lure. Four hundred rectangles of the same color for each participant and the same size as the pictures, served as standards (Goldstein et al., 2002); these rectangles constituted 77% of the stimuli. Each picture type—positive, negative, neutral—comprised 7.7% of the stimuli.

Procedure

The experimental tasks included encoding and retrieval sessions, spaced 24 hours apart. The encoding session consisted of instructions, nine practice trials, and the oddball task (~25 minutes). Participants were instructed to attend to the rapidly changing stimuli and to press the left arrow key in response to rectangles and the right arrow key in response to pictures. Some participants also were asked to make predictions about upcoming stimuli, but this manipulation did not affect the results; see Supplemental Methods for details. Stimuli were centrally presented (1,800 ms) and separated by a jittered interstimulus interval (250-1,000 ms). The task included 40 blocks with 10 trials per block and 1 stimulus per trial. Stimuli were presented in a semirandom order with the following constraints: (1) each block included 7 rectangles and 3 pictures (one per valence), (2) all pictures were separated by 1-7 rectangles, and (3) each block ended with a rectangle so two pictures never appeared sequentially across blocks. See Fig. 2A for a depiction of the oddball task.

Fig. 2
figure 2

Experimental tasks. Note. A Encoding trial and (B) graphical overview of the retrieval task. The response arrows as well as the response scales were presented below the stimuli

Participants were not informed that their memory would be tested until they began the recognition test (Fig. 2B) during the retrieval session. The test was self-paced (~20 minutes; up to 10 seconds per response) and included 120 old pictures from encoding plus 120 closely matched lures. All pictures were presented in a unique, randomized order for each participant. Participants were instructed to identify each image as either old or new, and to rate their confidence (high, medium, low) in each response by using a scale shown below each picture.

Analyses

Data were analyzed using Python 3.8 (van Rossum, 1995) and R software version 3.6.1 (R Core Team, 2019). Computational modeling was conducted using HDDM version 0.6.0 (Wiecki et al., 2013) in Jupyter Notebooks (Kluyver et al., 2016).

Continuous self-report measures (BDI-II scores, age, and years of education) were analyzed using two-group t-tests. Gender differences between groups were analyzed using a chi-square test. All other measures were analyzed using linear models. To account for idiosyncratic stimulus effects and individual differences in memory ability, all linear models contained Picture Name (unique for each image) and Subject ID as random intercepts (Bates et al., 2015).

For analysis of response accuracy at encoding and retrieval, trial-level accuracy (0 or 1) served as the dependent variable with Picture Type, Group, and Memory Status (old or new, for retrieval only), as independent variables. Covariates included Age, Trial Number, and Years of Education. A sensitivity analysis was conducted to test the efficacy of this model (i.e., R2 deviation from zero) using the G*Power 3.1 module for the “Linear multiple regression: Fixed model, R2 deviation from zero” (Faul et al., 2007). The alpha was set at 0.05 with 0.8 power, 95 sample size, and 7 predictors, to match the details of our data. Results of this sensitivity analysis indicate that the minimum effect size that can be detected with these parameters is f2 = 0.16 (F = 2.117), which corresponds to a medium effect size (Cohen, 1988). To further characterize group differences at retrieval, we compared the estimated marginal means for recognition accuracy sorted by picture type. Finally, response times (RTs) at encoding and recognition were (natural) log-transformed and filtered: we removed raw RTs that exceeded 10,000 ms at retrieval, as well as (log-transformed) RTs that deviated from the participant’s mean ± 3 SD. This resulted in the removal of 3.3% of encoding trials and 0.22% of retrieval trials. Filtered RTs were entered as dependent measures in models in which Picture Type, Memory Status (old vs. new; for retrieval only), Pair Number (for encoding only) and Group were independent variables. Covariates were Age and Years of Education.

HDDM

The HDDM is a Bayesian implementation of the DDM that has been validated relative to other software packages (Ratcliff and Childers, 2015) and that has uncovered deficits in depressed adults in prior work (Lawlor et al., 2020). It was fit to trial-level choice and untransformed RT data at retrieval following recommendations by model developers (Wiecki et al., 2013). All parameters [v, a, z, t0] were allowed to vary by Group and Picture Type. A total of 10,000 samples, thinned by a factor of 5, were drawn from the posterior distribution, with the first 2,000 discarded as “burn-in” samples. Convergence was confirmed via diagnostic plotting and by running the model five times to obtain the Gelman–Rubin statistic (\(\hat{R}\)< 1.1 for all parameters). To assess group differences, the between-group overlap of the posterior distributions for each parameter was plotted for each picture type (presented as q-values; Lawlor et al., 2020).

Results

Self-report

See Table 1 for demographic and self-report data. The subject pool consisted of a broad community sample of adults. By design, the HBDI and LBDI groups differed on BDI-II scores, t(56.37) = 28.3, p < 0.001, Cohen’s d = 5.66. The groups also differed on age (t(80.33) = −2.19, p = 0.032; d = 0.45) and years of education (t(3.48) = 79.41, p < 0.001; d = 0.71), as the HBDI group was younger and had completed fewer years of education than the LBDI group. Because of these group differences, Age and Years of Education were included as covariates in all other linear models. The groups did not differ on gender, χ2(1) = 0.17, p = 0.681; Φ = 0.07.

Encoding responses

We assessed attentiveness to the oddball task by analyzing response accuracy and RT at encoding (Table 2). The linear model on response accuracy did not return any significant effects involving Group or Picture Type (ps > 0.05). Analysis of RTs, however, revealed a main effect of Group, t(94.14) = 2.77, p = 0.007, due to slower responses in the HBDI group.

Table 2 Mean (± SD) encoding behavior in the oddball task

Recognition memory

As displayed in Fig. 3, relative to the LBDI group, individuals with HBDI showed lower recognition accuracy for (old) positive pictures but not neutral or negative pictures; there were no apparent group differences for correct rejections of (new) lures of any type. To statistically test this visual impression, we ran a generalized linear model on recognition accuracy. To highlight the most critical test, we set responses to Old Positive pictures in the LBDI group as the reference level for the generalized linear model. Consequently, a significant main effect of Group (with a negative sign) would indicate that memory for old positive pictures was worse in the HBDI group, as predicted. As shown in Table 3, the model did indeed return a main effect of Group (specifically, HBDI) with a negative sign. This result confirms that recognition accuracy for old positive pictures was lower in the HBDI versus LBDI group. Although not strictly necessary, post-hoc pairwise analyses confirmed that group differences were not present for old neutral (Cohen’s d = 0.059, z = 0.6, p = 0.55) or old negative pictures (Cohen’s d = 0.11, z = −1.06, p = 0.29).

Fig. 3
figure 3

Recognition accuracy. NoteHBDI: High BDI-II group; LBDI: Low BDI-II group. The p value corresponds to the group difference in recognition memory accuracy for old positive pictures, based on the linear modeling results given in Table 3. The y-axes differ across panels A and B

Table 3 Generalized linear mixed model results for trial-level recognition accuracy

The model returned three additional interactions involving Group. First, there was a Negative x HBDI interaction. This indicates that the HBDI group showed a greater increase in memory accuracy for old negative versus old positive pictures than the LBDI group. This effect is driven primarily by the aforementioned group difference for old positive pictures, as accuracy in response to old negative pictures was similar across groups. Second, a New x HBDI interaction indicates that although all participants were better at correctly rejecting new, positive lures than they were at identifying old, positive pictures, this difference was larger in the HBDI group versus the LBDI group. Note that this effect is also driven by poorer memory for old positive pictures in the HBDI group, as the groups correctly rejected new positive pictures at a similar rate. Third, a significant Negative x New x HBDI interaction emerged, because the ability to correctly reject negative lures was slightly better in the HBDI versus LBDI group. Precise quantitative interpretations for every term in the model, including ones not detailed here, can be made by examining the regression coefficients presented in Table 3.

HDDM

As shown in the top row of Fig. 4, there was substantial between-group overlap for drift-rates in response to old negative pictures (q = 0.44), little overlap for old neutral pictures (q = 0.1), and markedly less overlap for old positive pictures (q = 0.01). The last two results were driven by lower drift-rates in the HBDI group, and they indicate that memory evidence for old positive pictures (and, to a lesser extent, old neutral pictures) accrued more slowly in depressed adults. The bottom row shows that group differences were not pronounced for new pictures.

Fig. 4
figure 4

Posterior distributions for drift-rate. Note. Between-group overlap of posterior distributions for drift-rates, by Picture Types (columns) and Old/New status (rows). HBDI: High BDI-II group; LBDI: Low BDI-II group

The other model parameters revealed limited evidence of group differences. As depicted in Figure S1, decision boundaries were generally wider for the HBDI versus LBDI group but this effect was weakest for old positive pictures (q = 0.46), indicating that it did not drive the group difference in recognition accuracy for these stimuli. Figure S2 shows that nondecision times were longer in the HBDI group, and Figure S3 shows that there was also a weaker bias to respond “new” in the HBDI group. In both cases, the between-group overlap was substantial; thus, the group differences are modest. Although we caution that the degree of overlap of posterior probability distributions does not constitute a statistical significance test, no parameter other than the drift rate for old positive pictures showed less than 5% between-group overlap (i.e., no other q-value was below 0.05). Thus, the strongest result from the HDDM was the observation of slower drift-rates in response to old positive pictures in the HBDI group.

Discussion

This study found that, relative to adults with minimal depressive symptoms, adults with elevated symptoms had greater difficulty recognizing positive pictures encoded a day earlier. The HDDM linked this deficit to slow evidence accumulation during memory retrieval. There were no group differences in recognition accuracy for old negative or old neutral pictures, or for correct rejections of new pictures of any valence. Furthermore, analysis of the other HDDM parameters revealed minimal evidence of group differences. In short, this study shows a selective association between elevated depressive symptoms and difficulty retrieving positive memories.

We interpret the retrieval results as reflecting poor encoding of positive oddballs in the HBDI group. Although this interpretation is speculative given the lack of EEG data, we believe it is sensible for two reasons. First, the oddball task has successfully elicited prediction errors in hundreds, if not thousands, of EEG and fMRI studies over the past six decades. Consequently, it is reasonable to assume that it elicited PEs in the current study. Given that the pictures used as oddballs were selected based on their ability to elicit valenced responses in the picture norming study, it is also reasonable to assume that the positive pictures likely elicited positive PEs—PPEs—in the LBDI group. Given that PPEs elicit DA bursts (Schultz, 1998), strengthen memory formation in healthy adults (Jang et al., 2019), and that depressed adults perform poorly on reward-processing tasks (Halahakoon et al., 2020) due to diminished activity within brain-reward networks (Proudfit, 2015; Treadway & Zald, 2011), a parsimonious conclusion is that PPEs at encoding were weaker in the HBDI group and that this contributed directly to the group difference in memory for positive oddballs.

The second point in favor of this interpretation concerns the overall pattern of the recognition memory data. It is noteworthy that although the HBDI group performed more poorly than the LBDI group in response to old positive pictures, there were no group differences in response to new positive pictures. These results cannot be easily interpreted in terms of either a general insensitivity to positive valence (Levens and Gotlib, 2009) or diminished attention to positive events (Peckham et al., 2010), because the HBDI group did not show lower accuracy in response to old and new positive pictures, just old ones. The fact that the deficit was selective to previously studied pictures implies that encoding of positive material was less effective in the HBDI group. We hypothesize that this reflects blunted PPEs due to abnormalities in the DA system (Dillon, 2015; Dillon and Pizzagalli, 2018).

How would such abnormalities lead to slow evidence accumulation at retrieval? The speed of evidence accumulation in recognition memory tests is thought to depend on how well the stimulus presented on a given trial matches its representation in long-term memory (Ratcliff, 1978). For correctly rejected new items, a parallel search process yields minimal evidence for a match and the drift process rapidly reaches the “new” boundary. For well-encoded old items, the match with the existing representation is good and thus the drift process efficiently reaches the “old” boundary. This match is weaker for poorly encoded items and evidence accumulation is slower and more meandering, sometimes terminating (incorrectly) at the “new” boundary and thus leading to lower recognition accuracy. If our interpretation is correct, then weaker PPEs in response to positive oddballs at encoding in the HBDI group ultimately led to poorer quality matches during the recognition test, causing slower drift-rates and lower recognition accuracy.

These findings suggest an interesting hypothesis for future EEG/ERP and fMRI studies. A common result in the control literature is that when successful recogniton of old items (hits) is contrasted with correct rejection of lures, robust activation of parietal cortex emerges (Rugg and Curran, 2007; Spaniol et al., 2009). The specific mechanism underlying this activation remains controversial, but one possibility is that it reflects the accumulation of memory evidence supplied by the hippocampus and other cortical areas (Wagner et al., 2005). This idea builds on a large nonhuman primate literature indicating that lateral parietal regions support the accumulation of perceptual evidence (Gold and Shadlen, 2007; Shadlen and Newsome, 2001). The current data indicate that using EEG/ERP and fMRI to study emotional memory retrieval in depressed adults would be worthwhile, because in addition to any effects related to PEs at encoding, depression may disrupt the accumulation of positive—but not negative—memory evidence, and this disruption may be detectable by monitoring parietal activation. Indeed, an earlier study from our lab found lower amplitude, left parietal ERPs during recollection of neutral memories in adults with MDD versus healthy controls (Barrick and Dillon, 2018), but extending this approach to emotional memories and linking EEG/ERP signals to model parameters remains a goal.

Although the current findings are promising, several limitations should be noted. First, participants only completed the BDI-II, so we have no information about other conditions, such as anxiety, that are frequently comorbid with depression and that may have influenced the results. Second, the use of an extreme groups design comes at the cost of generalizability, as we do not know how adults with moderate depressive symptoms would behave. Third, the lack of a neurophysiological measurement at encoding means that while we could make reasonable inferences about PPEs at encoding, we could not observe them directly. Thus, future studies should acquire neurophysiological data from larger, well-characterized samples whose depressive symptoms span the range from mild to severe.

An important focus of future research will be to translate these findings to concrete clinical applications. Recent work in memory therapeutics (Dalgleish and Werner-Seidler, 2014) is promising. For instance, Memory Specificity Training seeks to improve memory by training participants to increase the number of specific details they recall from their pasts. Our study suggests that depressed adults may benefit from focusing specifically on retrieving detailed positive memories, and indeed there is already evidence that this approach is therapeautic (Dalgleish et al., 2013; Hitchcock et al., 2017). It would be valuable to know whether therapy-induced changes in the retrieval of positive, autobiographical memories are related to changes in drift rate.

Given the hypothesis that the positive memory deficit in depression is related to abnormalities in the DA system, it also would be useful to study the memory effects of interventions that stimulate DA production, either behaviorally (i.e., via mirthful laughter induction; Manninen et al., 2017) or pharmacologically (i.e., via bupropion administration; Stahl et al., 2004). Another clinical extension of this work might use noninvasive brain stimulation, which successfully enhanced cortical-hippocampal connectivity and improved memory performance in healthy adults (Wang et al., 2014). Given that evidence accumulation is linked to prefrontal and parietal function (Hanks et al., 2015), using noninvasive methods to stimulate these brain regions during positive memory retrieval might yield benefits for depressed adults.

Conclusions

The current study adds further support to the claim that depression can impair memory for positive material, and points to two relevant underlying mechanisms: blunted PPEs at encoding and slow evidence accumulation at retrieval. This study offers new insights into the harmful effects of depressive symptoms on memory and improves our understanding of why depressed individuals have difficulty remembering positive events. These findings also suggest new therapeutic interventions that might help alleviate the positive memory deficit in depressed individuals.