Positive affect modulates memory by regulating the influence of reward prediction errors

Qasim, Salman E.; Deswal, Aarushi; Saez, Ignacio; Gu, Xiaosi

doi:10.1038/s44271-024-00106-4

Positive affect modulates memory by regulating the influence of reward prediction errors

Article
Open access
Published: 05 June 2024

Volume 2, article number 52, (2024)
Cite this article

Download PDF

You have full access to this open access article

Communications Psychology

Positive affect modulates memory by regulating the influence of reward prediction errors

Download PDF

1153 Accesses
31 Altmetric
Explore all metrics

Abstract

How our decisions impact our memories is not well understood. Reward prediction errors (RPEs), the difference between expected and obtained reward, help us learn to make optimal decisions-providing a signal that may influence subsequent memory. To measure this influence and how it might go awry in mood disorders, we recruited a large cohort of human participants to perform a decision-making task in which perceptually memorable stimuli were associated with probabilistic rewards, followed by a recognition test for those stimuli. Computational modeling revealed that positive RPEs enhanced both the accuracy of memory and the temporal efficiency of memory search, beyond the contribution of perceptual information. Critically, positive affect upregulated the beneficial effect of RPEs on memory. These findings demonstrate how affect selectively regulates the impact of RPEs on memory, providing a computational mechanism for biased memory in mood disorders.

Depressive symptoms bias the prediction-error enhancement of memory towards negative events in reinforcement learning

Article 26 July 2019

Positive reward prediction errors during decision-making strengthen memory encoding

Article 06 May 2019

The reward positivity is sensitive to affective liking

Article 01 October 2021

Introduction

While making decisions, we often try to predict the consequences of our choices so that we can pick the best option. Very often, the outcomes of our choices do not match with our predictions, generating signals called reward prediction errors (RPEs). Reinforcement learning (RL) models capture this framework of trial-by-error learning, which is critical for understanding decision-making in animals and humans and for building artificial agents^1,2. However, it is not well understood whether or how, RPEs that are central to RL models may have a more lasting effect on cognition. For example, if you choose a restaurant for dinner and unexpectedly find $100 on the ground while eating there, does that dining experience stand out more in your memory than other restaurants? Here, we examine how reinforcement learning, driven by RPEs, influences the encoding and retrieval of memories related to prior decisions.

Neurally, RPEs have been associated with phasic dopamine release in the brain³, which are then used to guide how we evaluate and optimize subsequent choices. Understanding how RPEs impact memory is important because RL algorithms provide biologically plausible models of dopamine-driven learning^1,4 facilitated by mesocortical and cortico-striatal circuits for habits, action selection, and decision-making. As a result, one prominent theory of RPE-mediated memory is that midbrain dopaminergic release strengthens memory encoding and consolidation⁵. Indeed, recent behavioral evidence suggests that rewards^6,7,8 and RPEs^{9,10,11,12,13} imbue stimuli with subjective salience that modulates memory¹⁴. However, the specific direction, timing, and magnitude of this effect vary across studies¹⁴ suggesting the involvement of alternative, unaccounted-for sources of prioritized memory. Most prominent among these is the contribution that perceptual information makes to mnemonic salience, independent from the reward outcomes associated with the stimuli^15,16. Perceptual memorability is not explained by low-level visual salience, cognitive control, or priming¹⁵, and represents a parallel path to enhanced memory that has distinct neural circuits and behavioral implications. Therefore, in order to understand how decision-making processes shape our memories for those decisions, it is critical to understand the interplay between stimulus-specific perceptual memorability and reward prediction during memory processes^17,18,19.

Untangling the distinct contributions of perceptual and reward information to memory is also important to understanding how disruptions to affective states—such as depressive and anxious mood—affect the relationship between RPEs and memory. This is important because RPEs and mood state bidirectionally affect each other²⁰, and aberrant reward circuitry and altered RL are features of a range of mood and anxiety disorders^21,22. These disorders can also result in impaired or biased memory processes^23,24. While one prior study suggests that depression scores alter the relationship between RPE and memory²⁵, it is not clear whether this effect is specific to RPE-mediated memory, or which specific depressive symptoms might underlie this effect. These studies, in sum, raise the possibility that altered mood states might alter the RPEs-mediated memory. Therefore, understanding the computational mechanisms that link RPEs to memory could aid in assessing the biasing effect mood disorders have on memory, and developing therapeutic approaches to these symptoms and disorders.

Methods

Data collection and participants

The study was approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai. Participants were recruited from Prolific (http://prolific.co), an online survey platform. A total of 246 adults (126 female, age = 40.1 ± 14.3 years) provided informed consent and completed this study. We excluded 40 participants whose overall behavioral performance in the decision-making task was no different from chance. The final sample that completed the behavioral task had 206 adults (109 female, age = 40.3 ± 14.2 years). Participants were asked to return to complete psychometric surveys. The sample that completed the psychometric surveys had 173 adults (94 female, age = 42.4 ± 14 years). Participants were paid a fixed rate with a bonus computed as a function of the reward accumulated in the first task in the experiment. The target sample size was determined based on the results of similar studies investigating the effect of RPE on memory^11,12. The study was not pre-registered.

Task

Participants performed an experiment with two distinct stages: a decision-making task, followed by a recognition memory task. The decision-making task was a two-arm bandit task in which participants attempted to maximize their rewards by learning one of two possible options (decks of cards) on each trial, for a total of 60 trials. Each draw could result in winning either 100 points or 0 points. Each option was associated with a specific win probability (either 0.8 or 0.2), which was reversed four times every 12 ± 1 trial. As a result, each option’s win probability was always negatively correlated. However, in contrast to traditional bandit/reversal tasks, participants were shown a unique image stimulus after making each choice, associating each reward outcome with a specific image stimulus. These image stimuli were memorable faces drawn from a database on perceptual memorability (10k US Adult Faces database: https://wilmabainbridge.com/facememorability2.html)^26,27, where each face image was associated with a normed d’ score²⁸ that measured how well these images were recognized in a large population sample. Face stimuli were chosen in part to be incidental to the decision-making task in order to maximize dissociation between stimulus features and decision-making behavior^12,29; in contrast, participants were informed that they may need to remember the faces for a subsequent task. Participants were only paid for their performance in the decision-making task, however, to ensure there was no direct, instructed link between memory performance and reward attainment. Upon completing the decision-making task, participants immediately began a recognition memory task in which these 60 image stimuli were shown, in addition to 60 novel lure images drawn from the same memorability database with matched d’ scores in random order. During this task, participants were instructed to indicate whether the image was “old" or “new", and then asked to indicate their confidence in their selection. We computed d’, a signal-detection metric, for each subject by subtracting the z-score corresponding to the false-alarm rate from the z-score corresponding to the hit rate²⁸. The task was constructed using the PsychoPy toolbox³⁰.

Computational modeling

We utilized a Rescorla–Wagner model to fit behavior in the decision-making task, in which RPEs modulate a learning rate (α) parameter, and RPE-based decisions are determined by an inverse-temperature parameter (β) modulating a softmax choice function. The learning and decision rules for this model are described by the following equations³¹:

$${Q}_{t+1}^{c}={Q}_{t}^{c}+\alpha ({r}_{t}-{Q}_{t}^{c})$$

(1)

$${p}_{t}^{c}=\frac{{e}^{\beta {Q}_{t}^{c}}}{\mathop{\sum }\nolimits_{i = 1}^{C}{e}^{\beta {Q}_{i}^{c}}}$$

(2)

where ${Q}_{t}^{c}$ is the value of the chosen option on trial t, updated according to r_t, the model-estimated continuous RPE values on every trial. In addition to the Rescorla–Wagner model, which caches values for trial-by-error decision-making, we also constructed alternative models to capture heuristic switching behavior and Bayesian estimation of task reward state. In the heuristic model, agents keep selecting a choice until they lose, at which point they shift to the other choice, with one free parameter (ϵ) capturing choice bias (Table 1). The Bayesian filter model is based on two hidden states: one in which the purple deck is the correct choice, and the other in which the orange deck is the correct choice, with some probability that states have reversed on each trial. The model computes the likelihood that a choice is correct or incorrect as a function of the inferred probability of reward for the current state. Action probabilities are computed from this likelihood, taking into account the inferred probability that a state switch (e.g., a reward reversal) has occurred³². The free parameters for this model are the probability of reward, and the probability of reversal (Table 1).

Table 1 Model details

Full size table

To model the influences of RPE and perceptual memorability on memory search during recognition, we utilized drift-diffusion models (DDMs), which fit a noisy sequential sampling process to choice data such that relative evidence is accumulated over time until reaching a decision boundary (e.g., a recognition choice)³³. We first excluded reaction times beyond 3 standard deviations away from the subject-level mean, and/or those shorter than 300 ms or exceeding 10 seconds. Then, in two separate hierarchical DDMs, we modeled drift rate (v), the rate of evidence accumulation prior to making a recognition choice, as a function of RPE or PM, with subject as a random effect:

$$v \sim {{{{{\rm{RPE}}}}}}+(1| {{{{{\rm{subject}}}}}})+({{{{{\rm{RPE}}}}}}| {{{{{\rm{subject}}}}}})$$

(3)

$$v \sim {{{{{\rm{PM}}}}}}+(1| {{{{{\rm{subject}}}}}})+({{{{{\rm{PM}}}}}}| {{{{{\rm{subject}}}}}})$$

(4)

The remaining free model parameters, including non-decision time (t), starting point (z), and boundary separation (a) were fit with complete pooling across participants. We utilized a hierarchical approach due to the relatively low number of trials contributed by each participant³⁴. In addition, we also constructed alternative models to capture non-linear influences of RPE on drift rate, including polynomial and logarithmic relationships between RPE and drift rate:

$$v \sim {{{{{\rm{RPE}}}}}}^{2}+(1| {{{{{\rm{subject}}}}}})+({{{{{\rm{RPE}}}}}}^{2}| {{{{{\rm{subject}}}}}})$$

(5)

$$v \sim {{{{{\rm{log}}}}}}({{{{{\rm{RPE}}}}}})+(1| {{{{{\rm{subject}}}}}})+({{{{{\rm{log}}}}}}({{{{{\rm{RPE}}}}}})| {{{{{\rm{subject}}}}}})$$

(6)

Bayesian mixed-effects regression

To determine the features predicting successful memory retrieval, we used a Bayesian mixed-effects logistic regression modeling framework³⁵. Within this framework, we coded hits and correct rejections as correct memory choices and misses and false alarms as incorrect memory choices. We then constructed models of the form:

$$p({{{{{\rm{correct}}}}}}=1) \sim X+(1| {{{{{\rm{subject}}}}}})+({{{{{\rm{RPE}}}}}}| {{{{{\rm{subject}}}}}})+({{{{{\rm{PM}}}}}}| {{{{{\rm{subject}}}}}})$$

(7)

where the probability of correct memory choices is modeled using a logit-link function of fixed effects (X) and random effects. The fixed effects include the following trial-level predictors: RPE, PM, and the within-block trial number (coded such that trials following a reversal restart at 1). The fixed effect also includes subject-level traits, including age, sex, RL parameters (α, β), total gambling reward, and factor scores. The random effects allow the influence of RPE and perceptual memorability to vary across subjects, as well as allow for a random intercept such that one intercept is fit per subject. All numerical predictors were standardized by subtracting the mean and dividing by two times the standard deviation³⁶, while sex was coded as a categorical variable. We generated weakly informative (broad) priors for all regression variables³⁷ which are scaled to regularize the model rather than integrate domain knowledge.

Model fitting and assessment

Behavioral models and Bayesian mixed-effects regression models were fit to individual subject data using Bayesian inference over the free parameters, using the Python library pymc³⁸ and bambi³⁹. To fit models, we used four Markov chain Monte Carlo No-U-Turn (NUTS) samplers, drawing 4000 samples from the posterior for each chain, after a minimum of 4000 burn-in samples. All posteriors for independent variables were checked for convergence using the Gelman–Rubin statistic, which was less than 1.01 in all cases, indicating good convergence. We computed the 95% high-density interval (HDI) for each model parameter to quantify the uncertainty around the true value of the parameter⁴⁰. We considered there to be substantial evidence for the influence of a parameter if the 95% HDI did not include zero⁴¹. Model comparison was performed using the Waikake-information criterion⁴². When assessing parameter recoverability for the decision-making task, we used each model to simulate behavior for 206 agents utilizing the true parameters sampled from our cohort. Because the DDM model was hierarchical, we simulated 50 cohorts of 25 participants (a total of 1250 simulations) and fit each simulated cohort hierarchically. We fit this simulated behavior and computed the correlation between the original parameters used to simulate the behavior and those recovered by the fitting procedure. To test model identifiability, we used the fit parameters for each model to simulate behavior for 206 agents, fit this behavior using every model, and performed model comparison to determine which model fit the simulated behavior best.

Factor analysis

We utilized factor analysis to identify latent transdiagnostic structure across three surveys: the state-trait anxiety index (STAI-T), the Zung depression scale (SDS), and the obsessive-compulsive inventory (OCI-R). These were selected to match factor analyses in prior literature⁴³. We first needed to increase the sample size to ensure a robust estimation of factor loadings and scores. To do so, we utilized survey data from an additional 143 online participants who had completed the same set of surveys as our task participants, bringing the total number of participants utilized for the factor analysis, specifically, to n = 320. First, we computed the Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy to assess whether it was plausible to conduct a factor analysis and found that the degree of overlapping information among the survey responses was appropriate for a factor analysis (KMO = 0.94). We also computed Bartlett’s sphericity test and found that the correlation matrix of the survey responses was not an identity matrix, and thus appropriate for a factor analysis (χ² = 13034). We next performed factor analysis using an oblique promax rotation, using maximum likelihood estimation. We used the Cattell–Nelson–Gorsuch (CNG) test⁴⁴ to determine the appropriate number of factors for this data, verified by the resulting scree plot showing the first three factors captured the most variance in eigenvalues (Supplementary Fig. 6). The factor loadings for each survey question are depicted in Table 2. The items with high factor loadings were used to categorize the factors into the following categories: positive affect, intrusive thoughts and rumination, and obsessive–compulsive behavior. We computed the factor score using the ten Berge method⁴⁵.

Table 2 Transdiagnostic factors

Full size table

Statistical analysis and software

Statistical analysis was conducted in Python, using publicly available libraries. Bayesian model-fitting was conducted using pymc, a Python library for Bayesian inference. Drift-diffusion modeling, specifically, was conducted using hssm, a Python library built on top of pymc for constructing sequential sampling models³⁴. Bayesian mixed-effects modeling was conducted using bambi, a Python library built on top of pymc for constructing Bayesian regression models. Data distribution was assumed to be normal when using parametric statistical tests, but this was not formally tested. All null regression findings are accompanied by equivalence tests with equivalence bounds of [−0.1, 0.1].

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Results

Behavioral analysis and computational modeling of decision-making behavior

In order to investigate how the RPEs that become associated with a stimuli influence our memories for that stimuli, we designed an experiment consisting of two consecutive tasks: a decision-making task followed by a memory task, similar to prior studies^{9,10,11,12,13}. For the decision-making task, participants performed a two-arm bandit task (Fig. 1a), in which they drew cards from two separate decks with oppositely yoked reward probabilities that reversed four times throughout the task without warning (80% chance of reward for one deck, 20% chance of reward for the other)⁴⁶. After every decision, participants were shown reward feedback (either 0 or 100 points) for their choice, along with unique image stimuli. These image stimuli were then utilized in the subsequent memory task, where participants were asked to indicate if a cue image had been seen before, or was a novel lure. We selected the image stimuli from a database of face images with normed perceptual memorability ratings (PM)²⁶ (see “Methods”). Memorability is an intrinsic, perceptual property of images that is predictive of how easily remembered an image is²⁷, and thought to potentially reflect how perceptual information is prioritized for memory⁴⁷. As such, the memorability ratings of these images provided a metric for measuring how successful memory might also fluctuate as a function of the perceptual information associated with each image, independent of the extrinsic RPE that participants encoded along with each image.

**Fig. 1: Task schematic and participant performance.**

We recruited 246 online participants (126 female) from Prolific (http://prolific.com), an online survey platform, to perform this experiment. After excluding participants with below chance-level accuracy for the decision task, we demonstrated that the remaining participants (n = 206) learned to choose the more rewarding option during the decision-making task, even after reward probabilities reversed (Fig. 1b). These participants subsequently completed the recognition memory task (Fig. 1c), in which they have to assess whether stimuli in a set had been presented before, or are novel. Accuracy and reaction times (RTs) in the two tasks were correlated (ρ = 0.16, p = 0.02, ρ = 0.49, p < 0.001, respectively; Supplementary Fig. 1a, b), though RTs for the decision-making task were significantly faster than during the memory task (t(410) = − 29.5, p < 0.001, Cohen’s d = 2.9, 95% CI = [−1.1, −0.9]). In addition to successful recognition choices, called hits (Fig. 1d), memory responses were additionally categorized as correct rejections, misses, or false alarms (Supplementary Fig. 1C). Memory performance computed from a combination of these response categories (d’) indicated that participants performed above chance in the memory task (Supplementary Fig. 1D), though memory performance tended to decrease towards the end of the recognition period and was asymmetric between old and novel lure images (z = 31.8, p < 0.001, Cohen’s h = 1.9, 95% = [29.5, 34]; Supplementary Fig. 1a, b).

Having demonstrated that participants exhibited sufficient learning in the first task and memory in the second (Supplementary Fig. 1D), we constructed computational models of participants’ choices during the decision-making task to investigate how RPEs might impact their learning and, subsequently, their memory. Learning and decision-making in similar tasks are well captured by reinforcement learning models driven by RPEs^48,49. As such, we utilized a Rescorla-Wagner model, an RL model driven by trial-and-error learning from incoming RPEs, to fit behavior in the decision-making task (see “Methods” for model details). In addition, we tested this model against alternative models that do not rely on cached value or RPEs, including a heuristic win-stay, lose-shift model, and a Bayesian filter model estimating the probability of reward for correct choices as well as the probability of reward reversal^46,50 (Table 1). We performed a model comparison (see “Methods”) to select the model that provided the best and most parsimonious fit for the majority of participants’ data. The winning model was the Rescorla-Wagner (RW) model (χ² = 18.5, p < 0.001, Cramer’s v = 0.02, chi-square test of proportions, Supplementary Fig. 3A–C) with two free parameters: a learning rate, α, which dictates how strongly RPEs influence value assignment, and inverse temperature, β, which dictates how deterministically value assignments influence choice.

The best-fit parameters across the sample included a learning rate of 0.73 ± 0.19, suggesting that RPEs were weighed heavily in estimating the value of each option and an inverse temperature of 4.9 ± 2.9 (Fig. 2a). We performed a grid search over the joint parameter space, simulating actions, and outcomes. The combination of parameters that maximized reward in these simulations was a combination of a high learning rate and high inverse temperature⁵¹. Accordingly, a comparison between the subset of participants with the highest combination of these parameters vs. the lowest combination (categorized by quantile split) illustrated a dissociation in optimal behavior during the first and second half of each block (Fig. 2b), resulting in higher reward for the participants with the higher combination of learning rate and inverse temperature after learning had stabilized within each block (z = 2.0, p = 0.047, Cohen’s h = 1.9, 95% CI = [−0.26, 4.2]; Fig. 2c).

Reward-prediction errors and perceptual information separably enhance recognition memory

Next, we investigated how the RPE and perceptual memorability ratings associated with each stimulus affected the memory of the stimulus on a trial-by-trial basis. Previous studies that have focused on the trial-level enhancement of memory for stimuli focused on either the contribution of intrinsic perceptual information²⁶ or extrinsic reward information^{9,10,11,12,13,29}. In contrast, our task design and stimuli choice enabled us to associate each stimulus with both a model-estimated RPE as well as a rating based on each stimulus’ normed intrinsic memorability in the absence of rewards²⁶ (Fig. 2c). We selected only stimuli considered highly memorable on average (those with high perceptual memorability ratings, see “Methods”) to ensure that participants could achieve high recognition success even if ignoring RPEs entirely. We first confirmed that perceptual memorability ratings and model-estimated RPEs were orthogonal (ρ = − 0.02, Fig. 2d) and plotted their joint contribution to the probability of correct recognition (Fig. 2e) to visualize the relative contribution of both streams of information. The group-level and subject-level relationships between RPE and memory and perceptual memorability and memory replicated prior studies investigating the effects of these individual features on hit probability^9,10,12,13 (Supplementary Fig. 4). We next sought to understand the parallel contributions of RPE and perceptual memorability to memory beyond the probability of hits alone, while simultaneously accounting for subject-level RL parameters, demographics, and random-effects. To this end, we utilized a Bayesian mixed-effects logistic regression model to measure the importance of extrinsic RPE information and intrinsic perceptual information to correct vs. incorrect memory performance accounting for all four types of memory responses (hits, correct rejections, misses, and false alarms; see “Methods”).

Across participants, the RPE and perceptual memorability associated with each stimulus meaningfully contributed to memory (fixed effects posterior mean = 0.17, 0.16, 95% HDI = [0.11, 0.23], [0.07, 0.25], respectively; Fig. 3a), such that surprisingly rewarding stimuli and more perceptually memorably stimuli were remembered better than other stimuli. Furthermore, stimuli that appeared sooner after the reward-probability reversal were remembered better than those that appeared later after the reversal (fixed effects posterior mean = −0.12, 95% HDI = [−0.19, −0.01]), suggesting that proximity to state changes also induced increased memorability. Participant demographics and RL parameters (learning rate and inverse temperature) did not meaningfully predict memory performance in this full model, nor did the total reward earned during the decision-making task. To examine how RPE and perceptual memorability influence a different index of memory behavior, we utilized drift-diffusion models (DDM) fit to participants reaction time and choices during the recognition memory task (Fig. 3b, Supplementary Fig. 5A). Specifically, we assessed whether RPE or perceptual memorability more strongly modulated drift rate (Fig. 3B, Supplementary Fig. 5b)—if either RPE or perceptual memorability upregulated drift rate, it would suggest that this feature contributes positively to evidence accumulation in support of the recognition of the target image in opposition to evidence accumulating against it. The model integrating RPE was preferred to the model integrating perceptual memorability (Fig. 3b; see Supplementary Fig. 5C, D for all posterior estimates and parameter recovery). This suggests that RPE explained more variance in participant recognition responses and reaction time than PM, though both RPE (posterior mean = 0.046, 95% HDI = [0.025, 0.067]) and perceptual memorability (posterior mean = 0.038, 95% HDI = [0.021, 0.056]) contributed positively to drift rate (Fig. 3c). We computed several alternative models testing non-linear (e.g., logarithmic and polynomial) relationships between RPE, perceptual memorability and drift-rate; however the linear model fit the behavioral data best (Supplementary Fig. 5E).

**Fig. 3: Reward prediction errors and perceptual information make separable contributions to memory.**

Positive affect upregulates the beneficial effects of reward-prediction error on memory

Next, we were interested in determining whether individual affective phenotype modulated the link between reward prediction, perceptual information, and memory to ascertain the features that might contribute to altered memory processes in mood disorders. We thus next examined whether individual differences in self-reported affective symptoms might modulate subject-level reliance on perceptual and reward information during memory. Specifically, we collected the following psychometric surveys from a subset of task participants (n = 173): the state-trait anxiety index (STAI-T), the Zung depression scale (SDS), and the obsessive-compulsive inventory (OCI-R) (Fig. 4A). Similar to prior studies^43,52, we utilized factor analysis to identify latent, transdiagnostic constructs and to derive synthesized affective symptom scores given the considerable overlap between depression and anxiety symptoms and effects on cognition (see “Methods”). Factor analysis identified three prominent factors (Supplementary Figs. 6 and 7A): positive affect (factor 1; e.g., “I am content"), intrusive thoughts and rumination (factor 2; e.g., “I lack self-confidence"), and obsessive-compulsive behaviors (factor 3; e.g., “I feel I have to repeat certain numbers") (Fig. 4B, Table 2). Factor loadings for the factors were allowed to correlate (ρ_F1:F2 = − 0.62, p < 0.001, ρ_F1:F3 = − 0.22, p = 0.1, ρ_F2:F3 = − 0.38, p = 0.003). We next sought to understand how subject-level factor scores (f1, f2, f3) predicted trial-level memory performance. Because these three factors were correlated, we utilized three separate Bayesian mixed-effects models and performed model comparisons to identify which of these factors explained the most variance in memory performance (see “Methods”). The model including factor 1 (positive affect) performed marginally better than the models including factor 2 (intrusive thoughts and rumination) or factor 3 (obsessive–compulsive behaviors; Fig. 5a). Subjects’ factor 1 score did not modulate trial-level memory performance, or interact with trial-level RPE (main effect 95% HDI include 0; Fig. 5a). Because the model including only factor 1 performed best, we used the regression coefficients from this mixed-effects model for subsequent analyses.

**Fig. 4: Factor analysis reveals latent transdiagnostic constructs for mood.**

**Fig. 5: Mood regulates the memory-enhancing effect of RPEs.**

While the mixed-effects model explored how factor scores influenced trial-level memory, we next examined whether subject-level individual differences in memory were explained by individual differences in factor scores and subject-level reliance on RPE for memory (β_RPE; the slope of the relationship between RPE and memory estimated for each subject in the mixed-effects model). Following the trial-level results, we thus performed a regression analysis of subject-level memory performance as a function of participants’ reliance on RPE for memory (β_RPE) and continuous factor scores and found that subjects’ factor 1 score exhibited a significant interaction with subjects’ overall reliance on RPE for memory (B = 1.4, SE = 0.5, p = 0.01, 95% CI = [0.3, 2.5]). This subject-level result indicated that participants who relied more on RPEs for memory also exhibited better memory overall—if they were also individuals with greater positive affect (higher f1 scores). We did not detect a significant interaction between positive affect and participants’ reliance on perceptual memorability, but could not reject the presence of small effects based on an equivalence test (B = − 4.5, SE = 4.5, p = 0.32, t_eq(166) = 1.03, p_eq = 0.15, 95% CI = [−13.5, 4.4]; Fig. 5b). Similarly, we did not detect significant relationships between positive affect and reward (B = − 0.02, SE = 0.29, p = 0.95, t_eq(168) = 0.41, p_eq = 0.34, 95% CI = [−0.6, 0.6]), learning rate (B = − 0.001, SE = 0.01, p = 0.94, t_eq(168) = 7, p_eq < 0.001, 95% CI = [−0.03, 0.03], or inverse temperature (B = − 0.13, SE = 0.21, p = 0.52, t_eq(168) = 1.1, p_eq = 0.13, 95% CI = [−0.6, 0.3]; Supplementary Fig. 7B), though we could not reject the presence of small effects based on equivalence tests. However, equivalence tests did confirm the absence of a significant relationship between positive affect and memory (B = 0.05, SE = 0.04, p = 0.29, t_eq(168) = 7.2, p_eq < 0.001, 95% CI = [−0.04, 0.13]), as well as mnemonic reaction time (B = − 0.02, SE = 0.02, p = 0.29, t_eq(168) = 6, p_eq < 0.001, 95% CI = [−0.06, 0.02]; Supplementary Fig. 7B), suggesting that positive affect did not alter the relationship between RPE and memory by modifying overall memory performance or speed. We did observe that subjects’ factor 2 score exhibited a significant interaction with subjects’ overall reliance on perceptual memorability for memory (B = 12.8, SE = 5.2, p = 0.015, 95% CI = [2.5, 23]), suggesting that participants who relied more on perceptual information for memory exhibited better memory overall—if they were more anxious and ruminative (higher f2 scores). We did not observe a significant relationship between RPE reliance and memory for f2 (B = − 0.52, SE = 0.56, p = 0.36, t_eq(166) = 1.1, p_eq = 0.14, 95% CI = [−1.6, 0.6] or f3 (B = − 0.3, SE = 0.6, p = 0.6, t_eq(166) = 0.7, p_eq = 0.24, 95% CI = [−1.5, 0.9]; Supplementary Fig. 8A, B), but could not reject the presence of a small effect based on equivalence tests.

Discussion

Identifying the specific relevance of RPEs to memory is crucial for understanding why we remember rewarding events better than others and how this process can go awry in psychiatric states featuring altered mood. To that end, we designed and tested an experimental paradigm in which participants performed a decision-making task followed by a memory task. Using a reinforcement learning model, we found that participants remembered a stimulus better if it was associated with a model-estimated RPE, or if it was a more perceptually memorable stimulus. By disentangling the relationship between RPE-driven memory and perceptually-driven memory, we were able to observe differences in their behavioral consequences. Specifically, we showed that RPE improves the efficiency of successful recognition over perceptual information and that transdiagnostic measures of positive affect regulated the relationship between RPE-driven memory and enhanced memory performance, which was not true for perceptually-driven memory. Furthermore, this regulation was specific to affective phenotype and was not present using alternative mental health factors such as intrusive thoughts and rumination or obsessive-compulsive traits. Together, these findings illuminate the computational mechanisms mediating the important relationship between decision-making, memory, and affect.

While several prior studies have investigated whether model-estimated RPEs influence subsequent memories^{9,10,11,12,13,53}, these studies provide conflicting evidence for whether positive, negative, or unsigned RPEs enhance memory, whether this effect is most pronounced only after a delay period, or even whether these effects are age-dependent. One possible reason for these conflicting accounts is that prior studies do not account for how inherently memorable stimuli are on the basis of reward-agnostic perceptual features. The current task design specifically utilized stimuli with known perceptual memorability ratings²⁷ that index how intrinsically memorable a stimulus is in the absence of rewards. These memorability ratings are not explained by low-level visual properties, esthetic attractiveness, or interest level^19,26,27,54. Furthermore, perceptual memorability ratings are not explained by purely attentional processes^15,27 and correlate with memorability scores determined by neural networks⁴⁷, suggesting that perceptual memorability captures intrinsic stimulus properties at the junction between high-level visual processing and memory. This information, in combination with computationally estimated RPEs, enabled us to dissociate between RPE-driven and perceptually-driven memory processes in the subsequent recognition memory task using these same stimuli. While mixed-effects modeling demonstrated that both positive RPE and high perceptual memorability contributed to successful memory, drift-diffusion modeling of reaction times during memory revealed that positive RPEs more meaningfully up-regulated drift rate during memory search. While the strength of visual information is implicated in evidence accumulation in perceptual decision-making⁵⁵, our results suggest that the reward computations associated with surprising rewards provide more important evidence per unit of time for matching the recognition cue stimulus to the image stored in memory. This finding provides support for a functional dissociation between RPE- and perceptually-mediated memory enhancement.

Because perception and subjective valuation involve distinct neural circuits and cognitive processes, understanding their contributions to memory has distinct implications for the downstream effects that psychiatric disorders impairing perception or learning might have on memory. By showing that the transdiagnostic affective state modulates the link between reward information and memory but not perceptual information, our findings provide evidence that RPEs may bring a degree of salience to a stimulus that strengthens memory encoding. Identifying the specific influence that RPEs have on memory is critical to understanding the neural mechanisms that may underlie memory enhancement. In addition to providing computationally parsimonious models of decision making, reinforcement learning algorithms play a critical role in biological psychology because RPEs have been tightly correlated with the activity of dopaminergic neurons³ in the substantia nigra and ventral tegmental area (VTA). This suggests that these models capture dopamine-driven learning^1,4 facilitated by mesocortical and cortico-striatal circuits for habits, action–selection, and decision-making. One prominent theory of RPE-mediated memory is that midbrain dopaminergic neurons innervate the hippocampus, and that dopamine release strengthens hippocampal plasticity involved in memory encoding and consolidation⁵. In support of this theory, neurophysiological evidence from rodents has demonstrated that mesolimbic dopamine modulates memory-related neuronal activity during memory encoding^56,57. Research has also shown that neuronal activity in the VTA, a critical region in the brain’s reward circuitry, correlates with memory-related theta oscillations⁵⁸. In parallel, neuroimaging in humans suggests that RPEs correlate with increased memory-related BOLD activity⁹. In contrast, perceptual memorability is thought to engage neural activity at the junction of perception and memory in the ventral visual stream²⁷, independent of reward¹⁶. By demonstrating the positive affect regulates the RPE-memory link, and not the perception-memory link, our findings suggest that RPEs and their associated dopaminergic activity could provide a plausible neural mechanism for enhancing memory in addition to driving RL processes⁵⁹. Linking value-based decision making to value-based memory enhancement is essential to understanding the role that dopaminergic circuits might play in memory overall.

Critically, abnormal mnemonic processes are of particular importance to mood disorders that are also linked to abnormal learning and decision-making. For example, depression is known to feature impaired processing of RL-related computations such as RPEs²², as well as disruption of explicit memory capacity^23,60. Acute and post-traumatic stress disorders feature pathologically strong associations with traumatic events²³ that may rely on midbrain dopaminergic modulation of synaptic plasticity^61,62. Drug-associated cues may also develop enhanced salience in memory, contributing to substance abuse at the cost of other cues and natural rewards⁶³. By taking a transdiagnostic approach to identifying a latent construct associated with mood, we demonstrate how affective state might further regulate the relationship between encoded RPEs and memory, consistent with findings from perceptual matching⁶⁴ and reward anticipation⁶⁵ studies. As such, disrupted affect could disrupt the interaction between RL and memory, distinct from impairments to RL and memory separately. These findings could have far-reaching implications for not only uncovering deeper neurocomputational mechanisms of disorders like depression and anxiety, but also suggesting that treatment and intervention strategies need to consider learning and memory deficits in an integrated fashion.

Limitations

Our study has several limitations. First, we specifically selected images with high memorability scores to ensure that participants could perform the recognition task without needing to use reward information at all. However, more variance in the memorability scores could be helpful in establishing whether RPEs play an even larger compensatory role in recognition when perceptual information is only weakly predictive of memory. Second, we found that memory performance is more accurate when a reversal is more recent. This suggests that, while our RL model fit participant’s data best, participants may also be performing hidden state inference⁶⁶ that informs their subsequent memory that might be captured by a different Bayesian model formulation than the one we utilized. Also, our analysis of psychiatric self-report and memory were exploratory and data-driven; future, pre-registered studies will be able to further investigate the influence of psychiatric symptoms on RPE-mediated memory. Finally, while memorability is thought to engage processes separate from attention^15,27 it is possible that RPE modulated memory by modulating attention. Future studies utilizing eye-tracking in concert with behavioral modeling will be best able to address this possibility.

Conclusions

Here, we have demonstrated the interplay of perceptual information and reward information enhance memory, and how affective symptoms selectively regulate the influence of RPEs on memory. In addition to reinforcing recent work investigating the memory-enhancing effects of RPEs, these findings provide the first evidence for how value computations during learning directly interact with perceptual memorability, and how mood disorders may diminish the the beneficial effect of RPEs on memory while sparing perceptual processes. These results will thus enable future computational work investigating how models of memory may jointly and dynamically incorporate intrinsic perceptual information and extrinsic associations, and future physiological work investigating how dopaminergic circuits in the brain modulate neural activity in regions typically associated with memory.

Data availability

The behavioral data used in this study are available at https://osf.io/awu3m/. The database used to source image stimuli is available at https://wilmabainbridge.com/facememorability2.html.

Code availability

The analysis code and Jupyter notebooks used to generate manuscript figures are available at: https://osf.io/awu3m/.

References

Rangel, A., Camerer, C. & Montague, P. R. A framework for studying the neurobiology of value-based decision making. Nat. Rev. Neurosci. 9, 545–56 (2008).
Article PubMed PubMed Central Google Scholar
Sutton, R. S. & Barto, A. G. Reinforcement learning: an introduction. Adaptive Computation And Machine Learning Series. Second edition.
Schultz, W., Dayan, P. & Montague, P. R. A neural substrate of prediction and reward. Science 275, 1593–1599 (1997).
Article PubMed Google Scholar
Shepherd, G. M. G. Corticostriatal connectivity and its role in disease. Nat. Rev. Neurosci. 14, 278–91 (2013).
Article PubMed PubMed Central Google Scholar
Shohamy, D. & Adcock, R. A. Dopamine and adaptive memory. Trends Cogn. Sci. 14, 464–72 (2010).
Article PubMed Google Scholar
Adcock, R., Thangavel, A., Whitfield-Gabrieli, S., Knutson, B. & Gabrieli, J. D. E. Reward-motivated learning: mesolimbic activation precedes memory formation. Neuron 50, 507–517 (2006).
Article PubMed Google Scholar
Madan, C. R., Fujiwara, E., Gerson, B. C. & Caplan, J. B. High reward makes items easier to remember, but harder to bind to a new temporal context. Front. Integr. Neurosci. 6, 61 (2012).
Article PubMed PubMed Central Google Scholar
Miendlarzewska, E. A., Bavelier, D. & Schwartz, S. Influence of reward motivation on human declarative memory. Neurosci. Biobehav. Rev. 61, 156–76 (2016).
Article PubMed Google Scholar
Davidow, J. Y., Foerde, K., Galván, A. & Shohamy, D. An upside to reward sensitivity: the hippocampus supports enhanced reinforcement learning in adolescence. Neuron 92, 93–99 (2016).
Article PubMed Google Scholar
Rouhani, N., Norman, K. A. & Niv, Y. Dissociable effects of surprising rewards on learning and memory. J. Exp. Psychol. Learn. Mem. Cogn. 44, 1430–1443 (2018).
Article PubMed PubMed Central Google Scholar
Rouhani, N. & Niv, Y. Signed and unsigned reward prediction errors dynamically enhance learning and memory. Elife 10, e61077 (2021).
Jang, A. I., Nassar, M. R., Dillon, D. G. & Frank, M. J. Positive reward prediction errors during decision-making strengthen memory encoding. Nat. Hum. Behav. 3, 719–732 (2019).
Article PubMed PubMed Central Google Scholar
Calderon, C. B. et al. Signed reward prediction errors in the ventral striatum drive episodic memory. J. Neurosci. 41, 1716–1726 (2021).
Article PubMed PubMed Central Google Scholar
Rouhani, N., Niv, Y., Frank, M. J. & Schwabe, L. Multiple routes to enhanced memory for emotionally relevant events. Trends Cogn. Sci. 27, 867–882 (2023).
Bainbridge, W. A. The resiliency of image memorability: a predictor of memory separate from attention and priming. Neuropsychologia 141, 107408 (2020).
Article PubMed Google Scholar
Li, X., Bainbridge, W. & Bakkour, A. Memorable but not chosen: no effect of memorability on value-based decisions. Sci. Rep. 12, 22056 (2022).
Bylinskii, Z., Isola, P., Bainbridge, C., Torralba, A. & Oliva, A. Intrinsic and extrinsic effects on image memorability. Vis. Res. 116, 165–78 (2015).
Article PubMed Google Scholar
Kramer, M. A., Hebart, M. N., Baker, C. I. & Bainbridge, W. A. The features underlying the memorability of objects. bioRxiv https://www.biorxiv.org/content/early/2022/04/30/2022.04.29.490104.full.pdf (2022).
Wakeland-Hart, C. D., Cao, S. A., deBettencourt, M. T., Bainbridge, W. A. & Rosenberg, M. D. Predicting visual memory across images and within individuals. Cognition 227, 105201 (2022).
Article PubMed Google Scholar
Eldar, E., Roth, C., Dayan, P. & Dolan, R. J. Decodability of reward learning signals predicts mood fluctuations. Curr. Biol. 28, 1433–1439.e7 (2018).
Article PubMed PubMed Central Google Scholar
Russo, S. J. & Nestler, E. J. The brain reward circuitry in mood disorders. Nat. Rev. Neurosci. 14, 609–25 (2013).
Article PubMed Google Scholar
Chen, C., Takahashi, T., Nakagawa, S., Inoue, T. & Kusumi, I. Reinforcement learning in depression: a review of computational research. Neurosci. Biobehav. Rev. 55, 247–67 (2015).
Article PubMed Google Scholar
Pittenger, C. Disorders of memory and plasticity in psychiatric disease. Dialog. Clin. Neurosci. 15, 455–63 (2013).
Article Google Scholar
Park, G., Marsh, B. U. & Johnson, E. J. Enhanced memory for fair-related faces and the role of trait anxiety. Front. Psychol. 10, 760 (2019).
Article PubMed PubMed Central Google Scholar
Rouhani, N. & Niv, Y. Depressive symptoms bias the prediction-error enhancement of memory towards negative events in reinforcement learning. Psychopharmacology 236, 2425–2435 (2019).
Article PubMed PubMed Central Google Scholar
Bainbridge, W. A., Isola, P. & Oliva, A. The intrinsic memorability of face photographs. J. Exp. Psychol. Gen. 142, 1323–34 (2013).
Article PubMed Google Scholar
Bainbridge, W. A., Dilks, D. D. & Oliva, A. Memorability: a stimulus-driven perceptual neural signature distinctive from memory. Neuroimage 149, 141–152 (2017).
Article PubMed Google Scholar
Wickelgren, W. A. & Norman, D. A. Strength models and serial position in short-term recognition memory. J. Math. Psychobiol. 3, 316–347 (1966).
Article Google Scholar
Wimmer, G. E., Braun, E. K., Daw, N. D. & Shohamy, D. Episodic memory encoding interferes with reward learning and decreases striatal prediction errors. J. Neurosci. 34, 14901–12 (2014).
Article PubMed PubMed Central Google Scholar
Peirce, J. et al. Psychopy2: experiments in behavior made easy. Behav. Res. Methods 51, 195–203 (2019).
Article PubMed PubMed Central Google Scholar
Hampton, A. N., Adolphs, R., Tyszka, M. J. & O’Doherty, J. P. Contributions of the amygdala to reward expectancy and choice signals in human prefrontal cortex. Neuron 55, 545–55 (2007).
Article PubMed Google Scholar
Eckstein, M. K., Master, S. L., Dahl, R. E., Wilbrecht, L. & Collins, A. G. E. Reinforcement learning and bayesian inference provide complementary models for the unique advantage of adolescents in stochastic reversal. Dev. Cogn. Neurosci. 55, 101106 (2022).
Article PubMed PubMed Central Google Scholar
Ratcliff, R. A theory of memory retrieval. Psychol. Rev. 85, 59–108 (1978).
Article Google Scholar
Wiecki, T. V., Sofer, I. & Frank, M. J. HDDM: hierarchical Bayesian estimation of the drift-diffusion model in Python. Front. Neuroinform. 7, 14 (2013).
Article PubMed PubMed Central Google Scholar
Popov, V., Marevic, I., Rummel, J. & Reder, L. M. Forgetting is a feature, not a bug: Intentionally forgetting some things helps us remember others by freeing up working memory resources. Psychol. Sci. 30, 1303–1317 (2019).
Article PubMed Google Scholar
Gelman, A. Scaling regression inputs by dividing by two standard deviations. Stat. Med. 27, 2865–73 (2008).
Article PubMed Google Scholar
Gelman, A., Jakulin, A., Pittau, M. G. & Su, Y.-S. A weakly informative default prior distribution for logistic and other regression models. Ann. Appl. Stat. 2, 1360 – 1383 (2008).
Article Google Scholar
Salvatier, J., Wiecki, T. & Fonnesbeck, C. Probabilistic programming in python using pymc. https://arxiv.org/pdf/1507.08050.pdf (2015).
Capretto, T. et al. Bambi: A simple interface for fitting bayesian linear models in python. Preprint at arXiv https://doi.org/10.48550/arXiv.2012.10754 (2020).
Morey, R. D., Hoekstra, R., Rouder, J. N., Lee, M. D. & Wagenmakers, E.-J. The fallacy of placing confidence in confidence intervals. Psychon. Bull. Rev. 23, 103–23 (2016).
Article PubMed Google Scholar
Bartsch, L. M. & Oberauer, K. The effects of elaboration on working memory and long-term memory across age. J. Mem. Lang. 118, 104215 (2021).
Article Google Scholar
Gelman, A., Hwang, J. & Vehtari, A. Understanding predictive information criteria for bayesian models. https://arxiv.org/pdf/1307.5928.pdf (2013).
Gillan, C. M., Kosinski, M., Whelan, R., Phelps, E. A. & Daw, N. D. Characterizing a psychiatric symptom dimension related to deficits in goal-directed control. Elife 5, e11305 (2016).
Gorsuch, R. & Nelson, J. Cng scree test: an objective procedure for determining the number of factors. In Annual Meeting of the Society for Multivariate Experimental Psychology (1981).
ten Berge, J. M., Krijnen, W. P., Wansbeek, T. & Shapiro, A. Some new results on correlation-preserving factor scores prediction methods. Linear Algebra its Appl. 289, 311–318 (1999).
Article Google Scholar
Hampton, A. N., Bossaerts, P. & O’Doherty, J. P. The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J. Neurosci. 26, 8360–7 (2006).
Article PubMed PubMed Central Google Scholar
Needell, C. D. & Bainbridge, W. A. Embracing new techniques in deep learning for estimating image memorability. Comput. Brain Behav. 5, 168–184 (2022).
Izquierdo, A., Brigman, J. L., Radke, A. K., Rudebeck, P. H. & Holmes, A. The neural basis of reversal learning: an updated perspective. Neuroscience 345, 12–26 (2017).
Article PubMed Google Scholar
Duncan, K., Semmler, A. & Shohamy, D. Modulating the use of multiple memory systems in value-based decisions with contextual novelty. J. Cogn. Neurosci. 31, 1455–1467 (2019).
Article PubMed Google Scholar
Costa, V. D., Tran, V. L., Turchi, J. & Averbeck, B. B. Reversal learning and dopamine: a bayesian perspective. J. Neurosci. 35, 2407–16 (2015).
Article PubMed PubMed Central Google Scholar
Zhang, L., Lengersdorff, L., Mikus, N., Gläscher, J. & Lamm, C. Using reinforcement learning models in social neuroscience: frameworks, pitfalls and suggestions of best practices. Soc. Cogn. Affect Neurosci. 15, 695–707 (2020).
Article PubMed PubMed Central Google Scholar
Banker, S. M. et al. Disrupted computations of social control in individuals with obsessive-compulsive and misophonia symptoms. iScience 25, 104617 (2022).
Article PubMed PubMed Central Google Scholar
Rosenbaum, G. M., Grassie, H. L. & Hartley, C. A. Valence biases in reinforcement learning shift across adolescence and modulate subsequent memory. Elife 11, e64620 (2022).
Bainbridge, W. A. Chapter one—memorability: how what we see influences what we remember. In Federmeier, K. D. & Beck, D. M. (eds.) Knowledge and Vision, vol. 70 of Psychology of Learning and Motivation, 1–27 (Academic Press, 2019). https://www.sciencedirect.com/science/article/pii/S0079742119300015.
Palmer, J., Huk, A. C. & Shadlen, M. N. The effect of stimulus strength on the speed and accuracy of a perceptual decision. J. Vis. 5, 376–404 (2005).
Article PubMed Google Scholar
Kempadoo, K. A., Mosharov, E. V., Choi, S. J., Sulzer, D. & Kandel, E. R. Dopamine release from the locus coeruleus to the dorsal hippocampus promotes spatial learning and memory. Proc. Natl Acad. Sci. USA 113, 14835–14840 (2016).
Article PubMed PubMed Central Google Scholar
Kaufman, A. M., Geiller, T. & Losonczy, A. A role for the locus coeruleus in hippocampal ca1 place cell reorganization during spatial reward learning. Neuron 105, 1018–1026.e4 (2020).
Article PubMed PubMed Central Google Scholar
Gomperts, S. N., Kloosterman, F. & Wilson, M. A. Vta neurons coordinate with the hippocampal reactivation of spatial experience. Elife 4, e05360 (2015).
Sharp, M. E., Duncan, K., Foerde, K. & Shohamy, D. Dopamine is associated with prioritization of reward-associated memories in parkinson’s disease. Brain 143, 2519–2531 (2020).
Article PubMed PubMed Central Google Scholar
Drakeford, J. L. et al. Recollection deficiencies in patients with major depressive disorder. Psychiatry Res. 175, 205–10 (2010).
Article PubMed Google Scholar
Pignatelli, M. et al. Synaptic plasticity onto dopamine neurons shapes fear learning. Neuron 93, 425–440 (2017).
Article PubMed Google Scholar
Seidemann, R., Duek, O., Jia, R., Levy, I. & Harpaz-Rotem, I. The reward system and post-traumatic stress disorder: does trauma affect the way we interact with positive stimuli? Chronic Stress (Thousand Oaks) 5, 2470547021996006 (2021).
PubMed Google Scholar
Torregrossa, M. M., Corlett, P. R. & Taylor, J. R. Aberrant learning and memory in addiction. Neurobiol. Learn Mem. 96, 609–23 (2011).
Article PubMed PubMed Central Google Scholar
Sui, J., Ohrling, E. & Humphreys, G. W. Negative mood disrupts self- and reward-biases in perceptual matching. Q. J. Exp. Psychol. (Hove) 69, 1438–48 (2016).
Article PubMed Google Scholar
Young, C. B. & Nusslock, R. Positive mood enhances reward-related neural activity. Soc. Cogn. Affect. Neurosci. 11, 934–44 (2016).
Article PubMed PubMed Central Google Scholar
Zika, O., Wiech, K., Reinecke, A., Browning, M. & Schuck, N. W. Trait anxiety is associated with hidden state inference during aversive reversal learning. Nat. Commun. 14, 4203 (2023).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Blair Shevlin and Kaustubh Kulkarni for helpful comments and suggestions. X.G. is supported by NIH R01DA043695, R21DA049243, R21MH120789, R01MH122611, R01MH123069, and R01MH124115. I.S. is supported by NIH R01MH124763. S.E.Q. is supported by NIH K99MH132873. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the paper.”

Author information

Authors and Affiliations

Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Salman E. Qasim & Xiaosi Gu
Center for Computational Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Salman E. Qasim, Ignacio Saez & Xiaosi Gu
The Winsor School, Boston, MA, USA
Aarushi Deswal
Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Ignacio Saez & Xiaosi Gu
Department of Neurosurgery, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Ignacio Saez
Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Ignacio Saez

Authors

Salman E. Qasim
View author publications
You can also search for this author in PubMed Google Scholar
Aarushi Deswal
View author publications
You can also search for this author in PubMed Google Scholar
Ignacio Saez
View author publications
You can also search for this author in PubMed Google Scholar
Xiaosi Gu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.E.Q. and X.G. conceived the study; S.E.Q. and A.D. analyzed the data; and S.E.Q., I.S. and X.G. wrote the paper.

Corresponding author

Correspondence to Xiaosi Gu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Psychology thanks Miriam Klein-Flugge and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Patricia Lockwood and Marike Schiffer. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Qasim, S.E., Deswal, A., Saez, I. et al. Positive affect modulates memory by regulating the influence of reward prediction errors. Commun Psychol 2, 52 (2024). https://doi.org/10.1038/s44271-024-00106-4

Download citation

Received: 06 October 2023
Accepted: 28 May 2024
Published: 05 June 2024
DOI: https://doi.org/10.1038/s44271-024-00106-4
Springer Nature Limited

Positive affect modulates memory by regulating the influence of reward prediction errors

Abstract

Similar content being viewed by others

Depressive symptoms bias the prediction-error enhancement of memory towards negative events in reinforcement learning

Positive reward prediction errors during decision-making strengthen memory encoding

The reward positivity is sensitive to affective liking

Introduction

Methods

Data collection and participants

Task

Computational modeling

Bayesian mixed-effects regression

Model fitting and assessment

Factor analysis

Statistical analysis and software

Reporting summary

Results

Behavioral analysis and computational modeling of decision-making behavior

Reward-prediction errors and perceptual information separably enhance recognition memory

Positive affect upregulates the beneficial effects of reward-prediction error on memory

Discussion

Limitations

Conclusions

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Peer Review File

Supplementary Information

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation