Altered monetary loss processing and reinforcement-based learning in individuals with obesity

Kube, Jana; Mathar, David; Horstmann, Annette; Kotz, Sonja A.; Villringer, Arno; Neumann, Jane

doi:10.1007/s11682-017-9786-8

Altered monetary loss processing and reinforcement-based learning in individuals with obesity

Original Research
Open access
Published: 29 December 2017

Volume 12, pages 1431–1449, (2018)
Cite this article

Download PDF

You have full access to this open access article

Brain Imaging and Behavior Aims and scope Submit manuscript

Altered monetary loss processing and reinforcement-based learning in individuals with obesity

Download PDF

Jana Kube ORCID: orcid.org/0000-0001-9801-4387^1,2,3,
David Mathar^1,4,
Annette Horstmann^1,2,
Sonja A. Kotz^1,5,
Arno Villringer^1,2,6,7 &
…
Jane Neumann^1,2,8

3651 Accesses
28 Citations
2 Altmetric
Explore all metrics

Abstract

Individuals with obesity are often characterized by alterations in reward processing. This may affect how new information is used to update stimulus values during reinforcement-based learning. Here, we investigated obesity-related changes in non-food reinforcement processing, their impact on learning performance as well as the neural underpinnings of reinforcement-based learning in obesity. Nineteen individuals with obesity (BMI > = 30 kg/m², 10 female) and 23 lean control participants (BMI 18.5–24.9 kg/m², 11 female) performed a probabilistic learning task during functional magnetic resonance imaging (fMRI), in which they learned to choose between advantageous and disadvantageous choice options in separate monetary gain, loss, and neutral conditions. During learning individuals with obesity made a significantly lower number of correct choices and accumulated a significantly lower overall monetary outcome than lean control participants. FMRI analyses revealed aberrant medial prefrontal cortex responses to monetary losses in individuals with obesity. There were no significant group differences in the regional representation of prediction errors. However, we found evidence for increased functional connectivity between the ventral striatum and insula in individuals with obesity. The present results suggest that obesity is associated with aberrant value representations for monetary losses, alterations in functional connectivity during the processing of learning outcomes, as well as a decresased reinforcement-based learning performance. This may affect how new information is incorporated to adjust dysfunctional behavior and could be a factor contributing to the maintenance of dysfunctional eating behavior in obesity.

Higher body weight-dependent neural activation during reward processing

Article Open access 04 April 2023

Dorsolateral and medial prefrontal cortex mediate the influence of incidental priming on economic decision making in obesity

Article Open access 04 December 2018

Excessive body fat linked to blunted somatosensory cortex response to general reward in adolescents

Article 18 August 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Previous studies have reported obesity-related alterations in the neural representation of rewarding food stimuli (Feldstein Ewing et al. 2016; Stice et al. 2009; García-García et al. 2014). However, while the processing of food reward has been studied extensively in obesity, non-food reward likewise provides a powerful source of information to monitor and successfully adapt behavior to changing environments. In this vein, Saunders and Robinson (2013) hypothesized that humans generally differ in their reward cue reactivity, a trait that is likely to be stable across different domains of primary and secondary reinforcers. Indeed, obesity-related alterations in reward processing have recently been shown to exist outside of the food context. For instance, Opel et al. (2015) reported increased neural responses of individuals with obesity in areas of the brain’s reward circuit following the presentation of monetary gains. While these authors found no significant group differences during the processing of monetary losses, Balodis et al. (2013) described that individuals with obesity exhibited greater neural activation in subcortical as well as prefrontal brain areas during the anticipation of monetary gains and losses, suggesting that obesity-related alterations in reinforcement processing may also exist for aversive stimuli.

Interestingly, Balodis et al. (2013) additionally found a dissociation between neural responses during the anticipation and receipt of monetary reinforcement, a phenomenon that has similarly been observed in the food context. Specifically, individuals with obesity tend to show increased neural responses during the anticipation (Rothemund et al. 2007), but blunted responses during the actual receipt of rewarding food stimuli (Stice et al. 2008, 2010). Prominent theories argue that this discrepancy results from an initially high trait reward responsiveness facilitating overeating and subsequent neuroadaptive processes, leading to a heightened motivational value of anticipated food, but blunted hedonic signals when actually consuming it (Kenny 2011). Others argue that both a high and low reward responsiveness may be associated with obesity (Val-Laillet et al. 2015). Importantly, Kroemer and Small (2016) suggest that the apparent dissociation between responses during the anticipation and receipt of rewarding (food) stimuli may instead be explained in terms of altered reinforcement-based learning in individuals with obesity. Specifically, individuals with obesity displayed heightened reward sensitivity, but lower learning rates leading to increased neural responses during the anticipation, but blunted striatal responding during the receipt of rewarding (food) stimuli.

While animal studies have, indeed, shown that obesity is also associated with alterations in learning and behavioral adaptation (Reichelt et al. 2014; Johnson and Kenny 2010; Kanoski and Davidson 2011), few studies have investigated reinforcement-based learning in humans with obesity. Using the Iowa Gambling Task, Horstmann et al. (2011) demonstrated that women with obesity in contrast to lean women preferred choice options associated with high immediate monetary rewards even in light of high potential losses, and failed to adjust their behavior over time despite an overall negative outcome. Recently, Coppin et al. (2014) reported evidence suggesting that these deficits may be driven by impaired reinforcement-based learning. Using two different tasks, they found that individuals with obesity failed to develop a preference for the most rewarded patterns in a cue conditioning paradigm, and also showed less avoidance for negative stimuli in a probabilistic learning task. Interestingly, performance was partly affected by working memory differences between lean participants and participants with obesity. Together, these studies suggest that obesity may be associated with alterations in neural reinforcement processing beyond the food context that may also affect decision making and reinforcement-based learning.

Electrophysiological and neuroimaging studies in normal-weight populations highlight the role of a dopaminergic prediction error (PE) signal for learning and updating stimulus and action values when a presented outcome is better or worse than expected (Schultz et al. 1997; Garrison et al. 2013; Chase et al. 2015). Alterations in the coding of dopaminergic PEs in the striatum as well as the transfer of feedback signals to higher cortical areas have been found to be associated with a reduced learning performance. For instance, successful learners exhibit more robust PE signals in the dorsal and ventral striatum (VS) than less successful ones (Schönberg et al. 2007), while a decline in learning performance with age seems to be related to a reduction in PE-related blood oxygenation level dependent (BOLD) activity in the VS (Eppinger et al. 2013). Moreover, Park et al. (2010) reported that individuals with alcohol dependence show a reduced learning performance despite intact ventral striatal PE-responses, which was, however, associated to alterations in the functional connectivity between the VS and dorsolateral prefrontal cortex. Accordingly, it seems that both PE coding in the VS and its functional utilization in other brain areas may be potential mechanisms that evoke impaired decision making and learning. Indeed, individuals with obesity have been shown to have an altered dopaminergic circuitry, such as a lower striatal D2-receptor binding potential (Wang et al. 2001). This further highlights the possibility that alterations in neural PE signaling may affect feedback utilization in reinforcement-based learning in individuals with obesity.

In the current study we used functional magnetic resonance imaging (fMRI) to investigate the neural mechanisms of monetary gain and loss processing and the neural underpinnings of feedback utilization in reinforcement-based learning in individuals with obesity. We aimed to (1) further examine whether individuals with obesity are characterized by alterations in reinforcement processing beyond the food context; (2) test whether these alterations affect the neural representation of both monetary gains and losses as well as their omission and avoidance; (3) replicate previous findings regarding obesity-related alterations in learning performance and examine whether learning deficits are present for both performance in learning from reward and performance in learning from punishment, and (4) investigate the neural correlates of reinforcement-based learning, specifically the representation and utilization of PE signals in the brain.

We hypothesized that individuals with obesity show altered neural representations of both positive and negative monetary outcomes in areas of the brain’s reward system, such as the striatum, medial orbitofrontal cortex (OFC), insula, midbrain and thalamus. Further, we hypothesized that individuals with obesity would exhibit a lower reinforcement-based learning performance, which potentially is mediated by alterations in ventral striatal PE processing.

Materials and methods

Participants

Fifty-five participants were recruited for the current study via online advertisements, and from the participant database of the Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany. Inclusion criteria encompassed MR-eligibility, right-handedness, an age-range of 20 to 45 years, as well as a BMI between 30.0 and 50.0 kg/m² for participants with obesity, or a BMI between 18.5 and 24.9 kg/m² for lean control participants. Participants were excluded from the study if they reported current smoking, the use of drugs or psychoactive medication, a history of neuropsychiatric diseases, current depressive symptoms (Beck’s Depression Inventory, BDI-SF, > = 10, Beck and Steer 1987), or a thyroid disease. We restricted our sample by these criteria to avoid confounding influences of age (e.g. Samanez-Larkin et al. 2012), smoking status (e.g. Martin et al. 2014), as well as neuropsychiatric symptoms and medication (e.g. Etkin and Wager 2007; Philip et al. 2012; Zhang et al. 2013; Wittmann and D’Esposito 2015; Yan et al. 2015) on reinforcement processing. Furthermore, participants reporting thyroid diseases were excluded from the current sample as these conditions may affect body-weight status (Tzotzas et al. 2000).

Upon participation we had to exclude two participants due to lack of task compliance (one lean, one obese), and three participants due to lack of task comprehension (one lean, two obese). Four participants were excluded who reported obesity at the time of recruitment, but fell below our predefined BMI criteria for obesity at the time of measurement, and three participants were excluded due to current depressive symptomatology (BDI-SF > = 10; one lean, one obese) or medication use (one obese). Finally, one participant experienced a panic attack inside the scanner and aborted the scanning session.

The final sample thus consisted of 19 individuals with obesity and 23 individuals without obesity who were comparable with respect to gender, age, education, and working memory performance (Table 1).

Table 1 Sample characteristics

Full size table

All participants gave written informed consent prior to their participation and received 8 €/hour for reimbursement (mean study duration 2 h). Additionally, participants received a monetary bonus dependent on their performance in the reinforcement learning task (final score/1000, on average 3.10 €). The study was carried out in accordance with the Declaration of Helsinki and was approved by the ethics committee of the University of Leipzig.

Procedure and probabilistic reinforcement learning task

Participants performed a probabilistic reinforcement learning task comprising 240 trials, which we adapted from Kim et al. (2006). In each trial participants were presented with a pair of symbols and had to choose one of them by button press. Three types of pairs were included in the experiment: (1) one pair signaled the possibility of winning 50 points or receiving no outcome (gain condition, 80 trials), (2) one pair signaled the possibility of losing 50 points or receiving no outcome (loss condition, 80 trials), and (3) one pair was associated with a neutral outcome signaling neither gain nor loss (neutral condition, 80 trials). In each pair of stimuli one symbol had a higher probability of receiving the respective outcome: In the gain condition, the advantageous symbol was associated with a 70% probability of winning 50 points and led to no outcome in only 30% of the trials in which it was chosen. The disadvantageous symbol was associated with a 30% probability of winning 50 points and led to no outcome in 70% of the trials in which it was chosen. Similarly, in the loss condition the advantageous symbol was associated with a 30% probability of losing 50 points, while the disadvantageous symbol had a loss probability of 70%. Additionally, we included a financially neutral control condition, which primarily served as a control condition for fMRI data analysis. Here, the two symbols likewise had a 70 and 30% probability of seeing neutral feedback, and no outcome otherwise. Symbols were assigned to the given conditions pseudorandomly. Trial order was randomized in 8 consecutive bins each comprising 30 trials (10 gain, 10 loss, 10 neutral). Within each bin these 30 trials were freely randomized. This was done to ensure a roughly equal number of trials per condition in each stage of the experiment.

Trial timing and conditions are displayed in Fig. 1. In short, the pair of symbols was presented for a maximum of 1500 ms and participants were asked to select one symbol. Once they had made their selection, the chosen option was highlighted for 1000 ms and a blank screen appeared during a 1000 ms delay period. Thereafter, the feedback occurred on the screen for 2000 ms. If the participants received no outcome a fixation cross was shown instead. If the participants did not press the button within 1500 ms after stimulus onset, the trial was aborted and the text “Too slow!” appeared on the screen. No response trials were omitted from all analyses (on average 2.55% of all trials).

Prior to the experiment, participants completed the Figurative Memory subtest from the Wechsler-Memory-Scale-R in order to evaluate the influence of (visual) working memory on learning performance that was previously reported in other studies (Collins and Frank 2012; Coppin et al. 2014). Participants were then instructed about the task and performed a practice run of 12 trials (four trials in each condition) outside the scanner. In the instructions, they were told that two stimuli would be presented in each trial and their task was to select one of them. Dependent on their choice they would win 50 points, lose 50 points, receive financially neutral feedback or no outcome. Participants were informed that the task comprised three conditions and that in each condition one cue had a higher probability of leading to an advantageous outcome. However, they did not know which cue was associated with a particular outcome. In addition, participants were informed that their net gain would be transformed into a monetary bonus at the end of the experiment. Upon completion of the experiment participants were interviewed about their retrospective task comprehension and knowledge about the cue-outcome contingencies. Finally, all participants were debriefed about the aim of the study.

Ratings

Immediately before and after the learning experiment, we obtained subjective valence and arousal ratings for each symbol to determine changes in affective responses towards the stimuli. Here, each symbol was presented individually and rated according to valence and arousal on 9-point Self-Assessment Manikin visual analog scales (Bradley and Lang 1994). The sequence of stimulus presentations was pseudo-randomized.

FMRI data acquisition

Functional and structural images were obtained using a 3T Siemens Trio MRI scanner. Functional images were acquired in a T2*-weighted blood oxygen level dependent sequence with a TR of 2000 ms, TE of 22 ms, flip angle of 90°, 64 × 64 in-plane matrix, field of view of 192 mm. Thirty-eight 2.5 mm slices with a 0.5 mm gap were measured in ascending order and 1098 volumes were acquired for the current study.

Additionally, a T1-weighted structural scan was recorded using a three dimensional MPRAGE sequence (matrix 256 × 240; 176 slices, FoV = 256 × 240 mm, voxel size = 1.0 × 1.0 × 1.0 mm, TR = 2300 ms, TE = 2.96 ms, flip angle = 9°) for participants that had not previously received a structural scan in our institute. For all other participants an existing T1-weighted structural scan showing similar imaging parameters was employed for co-registration.

A standard 12-channel head coil was used for the experiment. Visual stimuli were presented on a screen behind the scanner that was visible to the participants via a mirror mounted on the head coil.

Behavioral data analyses

Statistical analyses of the behavioral performance data were carried out using IBM SPSS Statistics 20 (Armonk, NY, USA) with a level of significance being set at p < .05. For repeated-measures ANOVAs degrees of freedom were adjusted using Greenhouse-Geisser correction (Greenhouse and Geisser 1959) if the assumption of sphericity was violated. In this case, we report uncorrected degrees of freedom, corrected p-values and epsilon. For significant effects, generalized eta squared (η_G²) as determined by R’s afex function is reported as a measure of effect size. Bonferroni-corrected t-tests were utilized as post-hoc tests where the ANOVA indicated a significant main or interaction effect. Cohen’s d is reported as a measure of effect size for independent samples t-tests.

Computational learning model

Trial-wise PEs and participant- and condition-specific learning rates were derived from a reinforcement model. The model was previously applied in another implicit learning paradigm in healthy and clinical populations and it was shown to adequately capture probabilistic classification tasks (Mathar et al. 2017b). As a slightly modified version of standard Q-learning, our model contains separate learning rates for the experimental conditions that are fitted independently of other model parameters. The latter ensures that learning rates are statistically independent of the choice consistency parameter, which is not the case in standard Q-learning (Mathar et al. 2017b). More specifically, the reinforcement learning model consists of six input nodes ${I}_{i=1,...,6}$ with weighted connections to two output nodes (Q-values) ${Q}_{j=\text{1,2} }$that represent the presence or absence of the six different symbols (three pairs of symbols) and the two possible outcomes in each condition, respectively. On each trial, activity of the output nodes is computed as ${Q}_{j}= \sum _{i}{q}_{ij}{I}_{i},$ where ${q}_{ij}$is the weight connecting input node ${I}_{i}$ and output node ${ Q}_{j}$. Weights are initialized to 0.25, representing equal distribution of initial weights between the four connections that can be updated within one trial (connections from two input patterns to the two outcomes). Weights are updated in each trial by means of ${q}_{ij}(k+1)={{q}_{ij}\left(k\right)+\alpha }^{}{S}_{j}({R}_{j}-{Q}_{j}){I}_{i}$ where ${R}_{j}$ encodes the correct output in this trial, α constitutes a learning rate, and ${S}_{j}$ represents the subject’s response. The latter is included for allowing the model to simulate the behavior of the individual participant rather than optimal learning.

Since participants were informed about the three separate learning conditions and learning performance in one condition was independent of the other two conditions, we fitted three independent learning rates for the gain, loss, and neutral condition, respectively. This allowed us to differentially assess learning from reward (monetary gains) and punishment (monetary losses). For each participant, individual learning rates were determined that minimized the sum of squared differences between the model’s output and the participant’s response: $\sum _{jk}{\left({S}_{jk}-{Q}_{jk}\right)}^{2}$ → $min$, with $j=1, 2$ and $k$ being the number of trials. In a subsequent step, we modeled each participant’s choices of a particular outcome to follow a softmax distribution:

$$P\left( {choice={S_j}|{Q_1},{Q_2}} \right)=\frac{{{\text{exp}}\left( {\beta {Q_j}} \right)}}{{\exp \left( {\beta {Q_1}} \right)+{\text{exp}}\left( {\beta {Q_2}} \right)}}{\text{with}}\;j=1,2$$

with temperature or choice consistency parameter β. The parameter β was fitted to participants’ choices by minimizing the negative log likelihood of the choice probabilities P by $LL=-{ln}(\prod _{k}{P}_{k}\left({Q}_{j}\right))$, while the learning rates were held constant at the values optimized in the first step. As previously proposed by Mathar et al. (2017b), model fitting and estimation of all parameters was accomplished by nonlinear optimization.

FMRI data analyses

MR images were preprocessed and analyzed using SPM8 (Wellcome Trust Centre for Neuroimaging, UCL, London, UK), implemented in Matlab 7.14 (The MathWorks Inc., Sherborn, MA). Functional images were unwarped and spatially aligned to the first image of the session to correct for movement artifacts. Realignment parameters were subsequently included as regressors of no interest in all individual participant level models described below. Slice timing correction to the anatomical middle slice was performed to correct for different acquisition times. The mean EPI image was co-registered to the high-resolution anatomical image, the T1 reference scan was segmented into different tissue classes, and functional and structural images were normalized to Montreal Neurological Institute (MNI) stereotaxic space. Subsequently, the normalized images were smoothed with an isotropic Gaussian kernel of 8 mm FWHM. The final resampled voxel size after normalization was 3 × 3 × 3 mm.

At the individual participant level, we set up separate models for the analyses of outcome-related BOLD responses, PE-related BOLD responses and functional connectivity: Stimulus- and outcome-related BOLD responses were modeled using three symbol pair regressors (gain condition trial, loss condition trial, neutral condition trial) and six outcome regressors (gain, gain omission, loss, loss avoidance, neutral outcome, no neutral outcome) that were modeled as impulse function and convolved with a hemodynamic response function. To examine outcome-related brain activation, individual contrast images for gain, gain omission, loss, and loss avoidance compared to neutral control trials were computed and submitted to separate one-sample t-tests for the analysis of within-group effects as well as two-sample t-tests for the comparison of outcome-related brain responses between lean participants and participants with obesity. For a detailed analyses of activation and deactivation patterns in contrasts revealing significant group differences, we extracted percent signal change of the BOLD signal using MarsBar 0.42.

PE-related brain activation was modeled at feedback onset and trial-wise PE estimates derived from the reinforcement learning model were used as parametric modulators of the feedback regressor that signaled the onset of any outcome in the gain or loss condition. Trials of the neutral condition were excluded from the analysis of PEs as performance was not reinforced by monetary feedback and participants may thus have been less attentive or motivated to learn the cue-outcome contingencies. Individual contrast images were submitted to one- and two-sample t-tests for within- and between group comparisons, respectively.

To investigate obesity-related changes in functional connectivity of the VS, we followed an approach proposed by Park et al. (2010) to build a psychophysiological interaction (PPI) term. Using this method, we examined the correlation of the observed BOLD time-series, without making assumptions about the neural event contributing to the BOLD signal (Kahnt et al. 2009). We focused on the VS as a seed region as previous studies have highlighted its importance in PE coding. First, we identified activated voxels in the left and right VS that significantly correlated with trialwise-PEs at whole group level. Here, anatomical ROI masks of the nucleus accumbens from the Harvard-Oxford Subcortical Structural Atlas were used to restrict the analysis. We then extracted individual participants time courses within the whole group activation masks, which were then multiplied by condition vectors that contained ones for four TRs after the presentation of positive (PPI regressor for positive PE feedback) and negative feedback (PPI regressor for negative PE feedback) and zeros otherwise. The resulting vectors were then used as regressors in an individual participant level model, which also included condition vectors containing separate feedback onsets for positive and negative feedback as well as realignment parameters as regressors of no interest. Contrast images of the PPI regressors were subsequently submitted to a 2nd level ANOVA comprising the factors PE condition (positive, negative) and group (lean, obese).

All results were corrected for multiple comparisons using a combination of individual voxel probability and cluster-extent based thresholds. Using 3dClustSim with an estimated non-Gaussian autocorrelation function and individual-voxel threshold of p < .001, we determined a cluster-extent based threshold of 53 adjacent voxels to reach a family-wise error rate of 5%.

Association of neural responses and learning behavior

To examine if alterations in VS functional coupling are associated with learning success, we extracted for each participant individual beta weights for the functional connectivity between the VS and areas showing significant group differences (i.e. insula/superior temporal gyrus and vermis). These were then used as predictors of learning in a multivariate ANOVA including objective and subjective measures of learning success, namely the percentage of advantageous choices in gain and loss conditions during the acquisition phase, the average learning rate as well as subjective valence ratings.

Data availability

The datasets analyzed during the current study are available from the corresponding author on request.

Results

Behavioral performance

The net monetary outcome at the end of the experiment, percentage of advantageous choices and model-derived learning rates were evaluated as indices of individual learning performance. For the overall monetary score, an independent samples t-test revealed that individuals with obesity accumulated a significantly lower outcome than lean control participants over the course of the experiment [t(40) = 2.206, p = .037, d = 0.703].

For the analyses of choice behavior, we calculated the percentage of advantageous choices in the gain and loss condition in 4 time bins (each comprising ~ 20 trials per condition). To evaluate learning performance, we then focused on choice behavior during the early phases of the experiment, when cue-outcome contingencies are predominantly acquired (e.g. Pessiglione et al. 2006; Lin et al. 2012; den Ouden et al. 2013). Specifically, we evaluated the percentage of advantageous choices during the first two blocks of the experiment. The neutral condition was excluded from this analysis since there was no financial incentive to develop a choice preference and participants may thus have used diverse behavioral strategies to complete the task (e.g. random choices or fixed choices of one symbol). Due to violations of normality, choice data were rank transformed and subjected to a repeated measures ANOVA including the within subject factors condition (gain, loss), block (1–2) as well as the between subject factor group (lean, obese). The results corroborate the previous finding: we found a significant main effect of group [F(1, 40) = 4.622, p = .038, η_G² = 0.049], a main effect of block showing an increase in correct responses from block 1 to block 2 [F(1, 40) = 50.560, p < .001, η_G² = 0.129], as well as a Group × Block interaction [F(1, 40) = 6.617, p = .014, η_G² = 0.019], indicating that individuals with obesity achieved a lower number of advantageous choices than lean controls particularly during the later acquisition phase (Block 2, p = .031; Fig. 2a). Interestingly, we found no significant modulation of learning performance by condition [main effect of Condition: F(1, 40) = 2.371, p = .131] and no significant interaction of group and condition, suggesting that group differences are comparable when learning from gain and loss feedback [interaction of Condition × Group: F(1, 40) = 1.671, p = .204].

Additionally, we examined choice behavior during the later phase (last two blocks) of the experiment, where the learning process should have resulted in stable cue-outcome associations. Here, we found no significant increase of performance across blocks [F(1, 40) = 3.984, p = .053] and no significant group differences across the gain and loss condition [main effect of group: F(1,40) = 1.259, p = .269; interaction of Condition × Group: F(1, 40) = 1.168, p = .286, Fig. 2a].

For the analysis of model-derived learning parameters, we extracted learning rates for the gain and loss condition separately and submitted them to a repeated-measures ANOVA including the within-subject factor condition (gain, loss) as well as the between-subject factor group (lean, obese). In line with the behavioral performance results, a significant main effect of group [F(1, 40) = 5.713, p = .022, η_G² = 0.076] indicates that lean participants exhibited significantly higher learning rates than individuals with obesity (Fig. 2b). As with the observed choice behavior, this effect was not modulated by the factor condition [interaction of Condition × Group: F(1, 40) = 0.839, p = .365].

In order to disseminate the influence of working memory on learning performance, we repeated the above-mentioned analyses including the Figurative Memory score as a covariate of interest. However, there was no evidence for a significant modulation of learning performance by individual working memory differences (Online Resource 1). Further, lean and obese participants did not significantly differ in their working memory performance (U = 167.500, p = .172).

Ratings

To investigate differential changes in the evaluation of advantageous and disadvantageous symbols, we obtained individual valence and arousal ratings of all symbols before and after the experiment. Both were submitted to a repeated-measures MANOVA including the within-subject factors condition (gain, loss, neutral), time (before, after) and reinforcement probability (advantageous, disadvantageous) as well as the between-subject factor group (lean, obese). Using Pillai’s trace, we found a significant multivariate interaction effect of Time × Group × Reinforcement Probability [V = 0.157, F(2, 39) = 3.644, p = .035]. Univariate follow-up analysis showed that this effect was strongly driven by group differences in valence ratings [interaction of Time × Group × Reinforcement Probability: F(1, 40) = 4.635, p = .037, η_G² = 0.006]. Specifically, while participants with obesity exhibited similar ratings of advantageous and disadvantageous choice options before and after the experiment (all p > .05), lean participants showed a decrease in positive valence ratings for disadvantageous choice options from before to after the experiment (p = .045) as well as more positive valence ratings of advantageous compared to disadvantageous choice options after the experiment (p = .036; Fig. 2c).

Similar to objective markers of learning performance, we found no evidence for an association between the subjective evaluation of advantageous and disadvantageous symbols and figurative working memory (Online Resource 1).

FMRI results

Gain receipt and loss avoidance

For the analysis of neural responses towards positive monetary outcomes, we first examined neural responses to monetary gains as well as responses to the successful avoidance of monetary losses, in each group individually. Subsequently, we compared lean and obese participants in a subtraction analysis.

Whole brain within-group analysis revealed that the receipt of a monetary gain was associated with significant activation in clusters encompassing the striatum, insula, anterior cingulate (ACC), middle frontal gyrus and midcingulate cortex (MCC) in lean and obese participants. Further, lean participants exhibited significantly higher activation to monetary gains than to neutral feedback in the right middle OFC, cerebellum and occipital cortex, while individuals with obesity showed increased activation in the inferior parietal lobule (Table 2).

Table 2 Within- and between-group comparison of whole-brain outcome processing results

Full size table

In both groups, the successful avoidance of monetary losses was similarly associated with higher activation in clusters encompassing the insula, middle frontal gyrus, cerebellum, and inferior parietal lobule. Additionally, lean participants demonstrated significant activation in the MCC, superior frontal, and superior medial frontal gyrus, whereas individuals with obesity showed increased activation in the middle OFC (Table 2).

The between-group analysis revealed that individuals with obesity and lean control participants did not significantly differ in their neural responses towards monetary gains or the successful avoidance of monetary losses.

Loss receipt and gain omission

In a second step, we examined neural responses following a negative monetary outcome. Specifically, we first analyzed the processing of monetary losses as well as the omission of monetary gains in each group individually. Subsequently, we compared responses of individuals with obesity and lean control participants in a between-group subtraction analysis.

In both groups monetary loss processing was associated with increased activation in clusters encompassing the insula, superior medial frontal gyrus, ACC and MCC, as well as cerebellum and inferior partial lobule. Additionally, lean participants displayed higher activation to monetary losses than neutral feedback in the thalamus, midbrain, and middle frontal gyrus (Table 2).

The omission of monetary gains elicited significant activation in the inferior parietal lobule in both groups. In lean control participants we further found activation in the insula, as well as MCC (Table 2).

The between-group analysis of monetary losses compared to neutral feedback revealed a region of significant differences in the medial prefrontal cortex (mPFC). Extracted percent signal change of the BOLD signal indicated that the effect was driven by significantly different neural responses to monetary losses [t(40) = 2.666, p = .013], such that lean participants demonstrated a pronounced deactivation in response to monetary losses, while individuals with obesity showed a small increase in activation (Fig. 3a).

PE representation

The within-group analysis of neural responses associated with PEs revealed that lean participants showed significant PE-related activity in the VS, and medial OFC, as well as superior temporal gyrus, occipital gyrus, MCC, and posterior cingulate gyrus. In individuals with obesity, PE-related activity occurred in the precentral gyrus, occipital gyrus and inferior parietal lobule (Table 3). Additionally, using a less conservative individual-voxel threshold (p < .005, 128 voxels) we likewise found evidence for significant PE-related activity in the VS bilaterally (x = 12, y = 8, z = − 11, T = 4.39; x = − 9, y = 14, z = − 11, T = 4.17).

Table 3 Within- and between-group comparisons of whole-brain prediction error processing results

Full size table

The between-group comparison revealed that individuals with obesity and lean control participants did not significantly differ in PE-related activity (Table 3). To investigate the possibility that PE-related group differences occurred mainly during the acquisition phase, we additionally examined PE-related responses during the first two blocks of the experiment only. Again, we found no evidence for obesity-related alterations in neural PE representation (Online Resource 2).

VS functional connectivity

For the analysis of VS functional connectivity, the group-by-condition ANOVA indicated regions of significant group differences in clusters encompassing the left insula and superior temporal gyrus as well as between the VS and vermis/cerebellum (Table 4; Fig. 3b). Individuals with obesity compared to lean participants showed increased functional connectivity between the VS and these regions, while no modulation of group differences by condition (i.e. no significant Group × Condition interaction) was observed.

Table 4 Between-group comparison of ventral striatal functional connectivity during prediction error processing – main effect of group

Full size table

Association of neural responses and learning behavior

Finally, we investigated the association of alterations in functional connectivity and learning behavior. Here, we used the strength of functional connectivity between the VS and the clusters showing significant group differences during outcome processing (insula/ superior temporal gyrus and vermis) to predict learning behavior of lean participants and participants with obesity. Surprisingly, we found no evidence for an association of learning success and connectivity strength. Using Pillai’s trace, there was no significant multivariate effect of VS connectivity with the insula/superior temporal gyrus [V = 0.077, F(3, 37) = 1.036, p = .388] or vermis [V = 0.166, F(3, 37) = 2.450, p = .079] on indices of learning (Fig. 3c). To rule out specific influences of connectivity on objective (learning rate, percentage of advantageous choices) compared to subjective measures of learning success (ratings), we further examined the univariate effects, but similarly found no evidence for any significant relationship.

Discussion

In the current study, we aimed to investigate obesity-related alterations in non-food reinforcement processing, learning performance and the neural underpinnings of reinforcement-based learning in individuals with obesity. The results partly confirmed our hypotheses: (1) individuals with obesity compared to lean control participants showed alterations in the processing of monetary reinforcement stimuli. Specifically, we found differences during the processing of monetary losses, where lean participants responded with a strong deactivation, while individuals with obesity exhibited a small increase in activation of the mPFC. Contrary to our hypothesis, we found comparable activation patterns in reward-related areas in both groups for the processing of monetary gains. (2) In line with previous studies, individuals with obesity exhibited a compromised learning performance. This was evidenced by a lower number of advantageous choices as well as lower learning rates in individuals with obesity. In the same vein, subjective indices of reinforcement-based learning suggested that lean, but not obese, participants’ evaluation of the task stimuli was modulated by learning experience. (3) Lastly, both groups showed similar neural PE representations in the VS, but individuals with obesity exhibited higher functional connectivity following feedback between the VS and a cluster encompassing the insula and superior temporal gyrus. This was, however, not predictive of a compromised learning performance in individuals with obesity.

Outcome processing

In the current study, we provide further evidence for generalized obesity-related alterations in reinforcement processing beyond the food context. We found evidence for aberrant neural responses of the mPFC after actual monetary losses in individuals with obesity, which indicate that the processing of negative reinforcement may be associated with altered value representations in obesity.

To date little evidence exists on the processing of negative events in individuals with obesity. Opel et al. (2015) found obesity-related differences in the coding of monetary rewards and no differences in the coding of punishment, but used relatively higher gains than losses. Employing comparable gains and losses, Balodis et al. (2013) reported obesity-related differences in the neural representation of anticipated and received monetary losses. They found that the presentation of an early predictive cue indicating an upcoming monetary loss was associated with relatively higher neural responses to anticipated losses than neutral monetary outcomes in areas of the brain’s reward circuit, while actual monetary losses compared to financially neutral feedback lead to relatively decreased medial frontal activation in participants with obesity. In our study, we found an obesity-related modulation of mPFC activation for monetary losses compared to neutral feedback. Importantly, however, a separate inspection of activation and deactivation patterns towards monetary losses and financially neutral outcomes revealed that this was driven by a slight increase in response to monetary losses in individuals with obesity, which stood in contrast to a pronounced deactivation in lean control participants.

The mPFC, in particular its ventral subdivision, has been hypothesized to provide a common valuation system for different reinforcers, showing greater BOLD responses to more rewarding or less aversive stimuli (Bartra et al. 2013). This has been reported for money and food (Levy and Glimcher 2011; Sescousse et al. 2013) as well as the encoding of the emotional value of pictures (Winecoff et al. 2013). These neural responses are often characterized by opposing patterns of activity with higher activation to the presentation of more rewarding and deactivation to more negative (Winecoff et al. 2013) or less valuable stimuli (Mullett and Tunney 2013). Canessa et al. (2013) reported that alterations in the activation patterns of the brain’s reward circuit may be associated with behavioral responses towards potential losses, such that larger loss-related deactivation than gain-related activation predict higher loss aversion during decision making. Indeed, Tom et al. (2007) found that greater neural sensitivity to increasing losses in the medial OFC, insula, and striatum were associated with greater behavioral loss aversion in a gambling paradigm, supporting the notion that individual differences in cortical sensitivity to aversive stimuli affect cognitive performance and decision making.

It has been suggested that obesity is characterized by a two-fold pattern of reward responses encompassing heightened anticipatory, but blunted consummatory neural responses to rewarding stimuli (Kenny 2011). Previous studies in the context of monetary reward have already shown mixed results with increased anticipatory (Balodis et al. 2013), but both increased (Opel et al. 2015) and decreased (Balodis et al. 2013) consummatory responses to monetary gains. Though the design of the current study was focused on outcome processing and did not allow for a thorough investigation of anticipatory processes, we find evidence for a decreased responsiveness to the receipt of negative stimuli in obesity. Surprisingly, we do not find differences in the neural processing of monetary gains, suggesting that reward processing may not be universally altered in individuals with obesity and differences in task design need to be considered.

In conclusion, our results indicate that individuals with obesity exhibit aberrant value representations of monetary losses in the mPFC. A decreased motivational significance of negative action consequences could be an integral mechanism contributing to alterations in decision making, such as a preference for immediate rewards in the light of long-term negative consequences (Horstmann et al. 2011) or a higher valuation of temporally close, but objectively worse decision outcomes (Simmank et al. 2015). Similarly, whether individuals with obesity will change or maintain their eating behavior can be strongly determined by their perception of its consequences. As evidence suggests that these mechanisms may be generalized across different domains of reinforcement, a lower motivational significance of negative (health) consequences of overeating may thus potentially decrease their regulatory effect on eating behavior, facilitating maintained dysfunctional eating patterns even in the light of negative long-term consequences.

Learning performance

In addition to non-food incentive representation, we also evaluated group differences in reinforcement-based learning performance. Similar to previous studies (Coppin et al. 2014; Horstmann et al. 2011), we found evidence for a lowered reinforcement-based learning performance in individuals with obesity. Interestingly, data on the subjective evaluation of the presented stimuli, as indicated by valence ratings, suggested that this effect was driven by alterations in differential conditioning, such that differences were particularly pronounced for the evaluation of the disadvantageous stimuli across conditions. While lean participants evaluated the disadvantageous stimuli as less pleasant after the experiment and showed a clear differentiation in valence ratings between advantageous and disadvantageous symbols, individuals with obesity demonstrated no modulation of their ratings by learning experience. This is in line with previous studies that similarly showed obesity-related impairments particularly when learning the meaning of cues that have a low probability for subsequent rewards. Specifically, Zhang et al. (2014) reported that women with and without obesity responded comparably towards the cues that were associated with a food reward, but women with obesity showed higher reward expectancies towards the other cue that was in fact never followed by a food reward. In the same vein, Coppin et al. (2014) found that individuals with obesity were particularly impaired in avoiding disadvantageous options in a probabilistic learning task. In an earlier study from our group, we used the Weather Prediction Task to investigate PE coding in individuals with obesity in a complex implicit learning task. Adding to previous results, we found selective impairments on the neural level, namely in the utilization of negative feedback and PEs for learning in individuals with obesity (Mathar et al. 2017a). Interestingly, rodents studies point in a similar direction showing that rats fed on highly palatable cafeteria diets are insensitive to aversive stimuli, i.e. they do not decrease food consumption in the light of a conditioned stimulus that is predictive of a aversive foot shock (Velazquez-Sanchez et al. 2015), an effect that may be mediated by alterations in the striatal D2 receptor system (Johnson and Kenny 2010). This deficit seems to be selective to negative stimuli, as rats fed on Western diets fail to solve tasks, in which a (negative) feature stimulus signals that a subsequent conditioned stimulus will not be paired with an expected reward, while they are not impaired in similar tasks using positive feature stimuli (Kanoski and Davidson 2011). Together, previous studies in humans and animals point at obesity-related alterations in negative outcome learning.

Here, we extended this work by applying a task that explicitly separates effects of learning from monetary gains (and their omission) versus learning from monetary losses (and their successful avoidance). Interestingly, none of the learning indices displayed condition effects, suggesting that learning performance is not primarily related to the actual monetary value of the presented outcomes. Rather it depends on their relative meaning discriminating disadvantageous from advantageous choice options.

PE processing and functional connectivity

A lower reinforcement-based learning performance has been shown to relate to alterations in the neural representation of dopaminergic learning signals in the striatum (Schönberg et al. 2007; Park et al. 2010; Eppinger et al. 2013). In the current study, individuals with obesity showed no alterations in the regional PE coding per se, but exhibited significantly higher functional connectivity between the VS and a cluster encompassing the left insula, and superior temporal gyrus during the processing of monetary outcomes. However, as opposed to other studies, this was in fact not directly related to decreases in learning performance, suggesting that alterations in VS-insula connectivity may rather reflect more general changes in the processing of (unexpected) feedback than differences in the utilization of striatal signals for learning.

The insula is a key area for the processing of interoceptive sensations and a node for the integration of external and interoceptive inputs (Craig 2002, 2009, 2011; Critchley et al. 2004). Predominantly the (ventral) anterior insula seems to be related to affective processing and autonomic function (Kelly et al. 2012; Chang et al. 2013). Interestingly, VS and insula are anatomically connected (Leong et al. 2016) and commonly co-activate in task-based and resting state fMRI studies (Postuma and Dagher 2006; Cauda et al. 2011; Chang et al. 2013). Evidence suggests bidirectional connectivity patterns between (anterior) insula and VS during incentive processing. More precisely, the insula has been hypothesized to code somatic changes in response to appetitive and aversive stimuli and project to the VS to facilitate motivated behavior (Clithero et al. 2011; Cho et al. 2013). Furthermore, a higher tract coherence between the anterior insula and NAcc has been shown to be negatively related to risk preferences (Leong et al. 2016). Likewise, the VS has been found to project to the insula particularly during high attention allocation to appetitive cues (Rothkirch et al. 2014).

Combined, these results highlight the possibility that an increased connectivity of insula and VS in individuals with obesity may reflect a stronger engagement of the reinforcement processing circuitry and increased attention allocation in response to the presentation of monetary feedback. However, this does not directly translate to learning performance, suggesting that potential differences in affective coding do not impact per se on reinforcement-based learning performance in individuals with obesity.

Other mechanisms in reinforcement-based learning

Other mechanisms may contribute to obesity-related alterations in reinforcement-based learning, instead. Indeed, learning and complex choice behavior have been discussed to rely on a combination of mechanisms beyond simple model-free learning based on striatal PEs only (Collins and Frank 2012; Doll et al. 2016). For instance, working memory capacity may play a distinct role in associative learning, particularly for so-called model-based learning processes that rely on building mental representations of the task environment. In complex 2-step learning tasks, designed to investigate such model-based compared to model-free processes, Parkinson patients with higher working memory capacity have been found to exhibit more model-based decisions (Sharp et al. 2016). Moreover, individuals were shown to be more resilient against the disruption of performance by external factors (Otto et al. 2013; Smittenaar et al. 2013). Similarly, Collins and Frank (2012) found that the combination of simple reinforcement-based learning models with working memory capacity best explained participants’ behavior in a putatively simpler instrumental learning task. For individuals with obesity, Coppin et al. (2014) reported working memory impairments and suggest that this may contribute to their failure to form preferences for highly rewarded stimuli. It is thus plausible to assume that working memory capacity contributed to learning deficits in the current study, though, surprisingly, we did not find a significant association in the data. This may be due to methodological issues: Firstly, we employed a simple working memory task in which both groups performed very well and performance variance was relatively small. Secondly, we focused on visual working memory, while other studies have employed different measures. This may suggest that performance in the current task did not depend on the ability to memorize complex visual stimuli, but leaves the possibility that other and more sensitive measures of working memory capacity may help to further elucidate potential mechanisms contributing to obesity-related alterations in reinforcement-based learning.

Strength, future directions and limitations

To our knowledge, the current study is the first fMRI study integrating behavioral as well as neural correlates of monetary reinforcement processing and reinforcement-based learning in individuals with obesity. While previous studies have mostly focused on general correlates of learning and response adaptation, the current paradigm allows for the investigation of two additional aspects: (1) a clear separation of learning from monetary gains compared to losses, and (2) the examination of both objective markers of learning performance and the subjective evaluation of the conditioned stimuli.

However, we could not conclusively resolve which underlying mechanisms contributed to obesity-related learning alterations in the current study. Thus some further aspects should be considered in future studies. Firstly, a relatively low overall sample size precluded the examination of gender differences in the current task, though previous studies have shown that alterations in executive functioning and behavioral adaptation may be particularly pronounced in women with obesity (Horstmann et al. 2011; Zhang et al. 2014). In the same vein, overweight participants should be included in future studies, as overweight and moderately obese participants seem to be more distinct from lean participants in reward sensitivity, working memory performance and monetary reward processing than individuals with severe obesity (Davis et al. 2004; Coppin et al. 2014; Dietrich et al. 2014; Verdejo-Román et al. 2017).

Additionally, while learning mostly took place during the first half of the experiment, performance in the second half was likely more influenced by fatigue and individual tendencies to exploit the learned associations or explore other options despite existing knowledge of the advantageous choice options. More dynamic paradigms with changing cue-outcome contingencies could reduce these potential biases.

Lastly, the current study was mainly focused on the utilization of feedback for learning. However, in order to understand the influence of altered negative value representations on behavior in individuals with obesity, additional measures of decision making and the processing of negative action outcomes, e.g. in the context of eating behavior and health consequences, should be employed in future studies.

Conclusion

The current study examined the neural representation of non-food reinforcement stimuli and their utilization for reinforcement-based learning in individuals with obesity employing a probabilistic learning paradigm with separate monetary gain and loss learning conditions. Findings of aberrant negative value representations and increased functional connectivity between the VS and insula point at generalized obesity-related differences in neural reinforcement processing that are present outside of the food context. Additionally, a reduction in reinforcement-based learning performance and specific alterations in disadvantageous outcome learning further support the idea of a lower impact of negative choice consequences on behavioral adaptation in individuals with obesity. Surprisingly, neither PE-related processes nor working memory explained obesity-related differences in learning, highlighting the need for further investigations, with potentially different methodological approaches.

References

Balodis, I. M., Kober, H., Worhunsky, P. D., White, M. A., Stevens, M. C., Pearlson, G. D., … Potenza, M. N. (2013). Monetary reward processing in obese individuals with and without binge eating disorder. Biological Psychiatry, 73, 877–886. https://doi.org/10.1016/j.biopsych.2013.01.014.
Article PubMed PubMed Central Google Scholar
Bartra, O., McGuire, J. T., & Kable, J. W. (2013). The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. NeuroImage, 76, 412–427. https://doi.org/10.1016/j.neuroimage.2013.02.063.
Article PubMed Google Scholar
Beck, A. T., & Steer, R. A. (1987). Beck Depression Inventory (BDI). San Antonio: The Psychological Corporation Inc.
Google Scholar
Bradley, M. M., & Lang, P. J. (1994). Measuring emotion: the self-assessment manikin and the semantic differential. Journal of Behavior Therapy and Experimental Psychiatry, 25, 49–59. https://doi.org/10.1016/0005-7916(94)90063-9.
Article CAS PubMed Google Scholar
Canessa, N., Crespi, C., Motterlini, M., Baud-Bovy, G., Chierchia, G., Pantaleo, G., … Cappa, S. F. (2013). The functional and structural neural basis of individual differences in loss aversion. Journal of Neuroscience, 33, 14307–14317. https://doi.org/10.1523/JNEUROSCI.0497-13.2013.
Article CAS PubMed Google Scholar
Cauda, F., Cavanna, A. E., D’agata, F., Sacco, K., Duca, S., & Geminiani, G. C. (2011). Functional connectivity and coactivation of the nucleus accumbens: a combined functional connectivity and structure-based meta-analysis. Journal of Cognitive Neuroscience, 23, 2864–2877. https://doi.org/10.1162/jocn.2011.21624.
Article PubMed Google Scholar
Chang, L. J., Yarkoni, T., Khaw, M. W., & Sanfey, A. G. (2013). Decoding the role of the insula in human cognition: Functional parcellation and large-scale reverse inference. Cerebral Cortex, 23, 739–749. https://doi.org/10.1093/cercor/bhs065.
Article PubMed Google Scholar
Chase, H. W., Kumar, P., Eickhoff, S. B., & Dombrovski, A. Y. (2015). Reinforcement learning models and their neural correlates: An activation likelihood estimation meta-analysis. Cognitive, Affective & Behavioral Neuroscience, 15, 435–459. https://doi.org/10.3758/s13415-015-0338-7.
Article Google Scholar
Cho, Y. T., Fromm, S., Guyer, A. E., Detloff, A., Pine, D. S., Fudge, J. L., & Ernst, M. (2013). Nucleus accumbens, thalamus and insula connectivity during incentive anticipation in typical adults and adolescents. NeuroImage, 66, 508–521. https://doi.org/10.1016/j.neuroimage.2012.10.013.
Article PubMed Google Scholar
Claus, E. D., Blaine, S. K., Filbey, F. M., Mayer, A. R., & Hutchison, K. E. (2013). Association between nicotine dependence severity, BOLD response to smoking cues, and functional connectivity. Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology, 38, 2363–2372. https://doi.org/10.1038/npp.2013.134.
Article CAS Google Scholar
Clithero, J. A., Reeck, C., Carter, R. M., Smith, D. V., & Huettel, S. A. (2011). Nucleus accumbens mediates relative motivation for rewards in the absence of choice. Frontiers in Human Neuroscience, 5, 87. https://doi.org/10.3389/fnhum.2011.00087.
Article PubMed PubMed Central Google Scholar
Collins, A. G., & Frank, M. J. (2012). How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis. European Journal of Neuroscience, 35, 1024–1035. https://doi.org/10.1111/j.1460-9568.2011.07980.x.
Article PubMed Google Scholar
Coppin, G., Nolan-Poupart, S., Jones-Gotman, M., & Small, D. M. (2014). Working memory and reward association learning impairments in obesity. Neuropsychologia, 65, 146–155. https://doi.org/10.1016/j.neuropsychologia.2014.10.004.
Article PubMed PubMed Central Google Scholar
Cousineau, D. (2005). Confidence intervals in within-subject designs: a simpler solution to Loftus and Masson’s method. Tutorial in Quantitative Methods for Psychology, 1, 4–45. Retrieved from http://tqmp.org.
Craig, A. D. (2002). How do you feel? Interoception: the sense of the physiological condition of the body. Nature Reviews Neuroscience, 3, 655–666. https://doi.org/10.1038/nrn894.
Article CAS PubMed Google Scholar
Craig, A. D. (2009). How do you feel — now? The anterior insula and human awareness. Nature Reviews Neuroscience, 10, 59–70. https://doi.org/10.1038/nrn2555.
Article CAS PubMed Google Scholar
Craig, A. D. (2011). Significance of the insula for the evolution of human awareness of feelings from the body. Annals of the New York Academy of Sciences, 1225, 72–82. https://doi.org/10.1111/j.1749-6632.2011.05990.x.
Article PubMed Google Scholar
Critchley, H. D., Wiens, S., Rotshtein, P., Öhman, A., & Dolan, R. J. (2004). Neural systems supporting interoceptive awareness. Nature Neuroscience, 7, 189–195. https://doi.org/10.1038/nn1176.
Article CAS PubMed Google Scholar
Davis, C., Levitan, R. D., Muglia, P., Bewell, C., & Kennedy, J. L. (2004). Decision-making deficits and overeating: a risk model for obesity. Obesity Research, 12, 929–935. https://doi.org/10.1038/oby.2004.113.
Article PubMed Google Scholar
den Ouden, H. E. M., Daw, N. D., Fernandez, G., Elshout, J. A., Rijpkema, M., Hoogman, M.,…, & Cools, R. (2013). Dissociable effects of dopamine and serotonin on reversal learning. Neuron, 80, 1090–1100. https://doi.org/10.1016/j.neuron.2013.08.030.
Article CAS Google Scholar
Dietrich, A., Federbusch, M., Grellmann, C., Villringer, A., & Horstmann, A. (2014). Body weight status, eating behavior, sensitivity to reward/punishment, and gender: relationships and interdependencies. Frontiers in Psychology, 5, 1–13. https://doi.org/10.3389/fpsyg.2014.01073.
Article Google Scholar
Doll, B. B., Bath, K. G., Daw, N. D., & Frank, M. J. (2016). Variability in dopamine genes dissociates model-based and model-free reinforcement learning. Journal of Neuroscience, 36, 1211–1222. https://doi.org/10.1523/JNEUROSCI.1901-15.2016.
Article CAS PubMed Google Scholar
Eppinger, B., Schuck, N. W., Nystrom, L. E., & Cohen, J. D. (2013). Reduced striatal responses to reward prediction errors in older compared with younger adults. The Journal of Neuroscience, 33, 9905–9912. https://doi.org/10.1523/JNEUROSCI.2942-12.2013.
Article CAS PubMed PubMed Central Google Scholar
Etkin, A., & Wager, T. D. (2007). Functional neuroimaging of anxiety: a meta-analysis of emotional processing in PTSD, social anxiety disorder, and specific phobia. American Journal of Psychiatry, 164, 1476–1488. https://doi.org/10.1176/appi.ajp.2007.07030504.
Article PubMed Google Scholar
Feldstein Ewing, S. W., Claus, E. D., Hudson, K. A., Filbey, F. M., Yakes Jimenez, E., Lisdahl, K. M., & Kong, A. S. (2016). Overweight adolescents’ brain response to sweetened beverages mirrors addiction pathways. Brain Imaging and Behavior. Advance online publication. https://doi.org/10.1007/s11682-016-9564-z.
Article Google Scholar
García-García, I., Horstmann, A., Jurado, M. A., Garolera, M., Chaudhry, S. J., Margulies, D. S., … Neumann, J. (2014). Reward processing in obesity, substance addiction and non-substance addiction. Obesity Reviews, 15, 853–869. https://doi.org/10.1111/obr.12221.
Article PubMed Google Scholar
Garrison, J., Erdeniz, B., & Done, J. (2013). Prediction error in reinforcement learning: a meta-analysis of neuroimaging studies. Neuroscience and Biobehavioral Reviews, 37, 1297–1310. https://doi.org/10.1016/j.neubiorev.2013.03.023.
Article PubMed Google Scholar
Greenhouse, S. W., & Geisser, S. (1959). On methods in the analysis of profile data. Psychometrika, 24, 95–112. https://doi.org/10.1007/BF02289823.
Article Google Scholar
Horstmann, A., Busse, F. P., Mathar, D., Müller, K., Lepsien, J., Schlögl, H., … Pleger, B. (2011). Obesity-related differences between women and men in brain structure and goal-directed behavior. Frontiers in Human Neuroscience, 5, 58. https://doi.org/10.3389/fnhum.2011.00058.
Article PubMed PubMed Central Google Scholar
Johnson, P. M., & Kenny, P. J. (2010). Dopamine D2 receptors in addiction-like reward dysfunction and compulsive eating in obese rats. Nature Neuroscience, 13, 635–641. https://doi.org/10.1038/nn.2519.
Article CAS PubMed PubMed Central Google Scholar
Kanoski, S. E., & Davidson, T. L. (2011). Western diet consumption and cogntitive impairment: links to hippocampal dysfunction and obesity. Physiology & Behavior, 103, 59–68.
Article CAS Google Scholar
Kahnt, T., Park, S. Q., Cohen, M. X., Beck, A., Heinz, A., & Wrase, J. (2009). Dorsal striatal-midbrain connectivity in humans predicts how reinforcements are used to guide decisions. Journal of Cognitive Neuroscience, 21, 1332–1345. https://doi.org/10.1162/jocn.2009.21092.
Kelly, C., Toro, R., Di Martino, A., Cox, C. L., Bellec, P., Castellanos, F. X., & Milham, M. P. (2012). A convergent functional architecture of the insula emerges across imaging modalities. NeuroImage, 61, 1129–1142. https://doi.org/10.1016/j.neuroimage.2012.03.021.
Article PubMed Google Scholar
Kenny, P. J. (2011). Reward mechanisms in obesity: new insights and future directions. Neuron, 24, 664–679. https://doi.org/10.1016/j.neuron.2011.02.016.
Article CAS Google Scholar
Kim, H., Shimojo, S., & O’Doherty, J. P. (2006). Is avoiding an aversive outcome rewarding? Neural substrates of avoidance learning in the human brain. PLoS Biology, 4, e233. https://doi.org/10.1371/journal.pbio.0040233.
Article CAS PubMed PubMed Central Google Scholar
Kroemer, N. B., & Small, D. M. (2016). Fuel not fun: reinterpreting attenuated brain responses to reward in obesity. Physiology & Behavior, 162, 37–45. https://doi.org/10.1016/j.physbeh.2016.04.020.
Article CAS Google Scholar
Leong, J. K., Pestilli, F., Wu, C. C., Samanez-Larkin, G. R., & Knutson, B. (2016). White-matter tract connecting anterior insula to nucleus accumbens correlates with reduced preference for positively skewed gambles. Neuron, 89, 63–69. https://doi.org/10.1016/j.neuron.2015.12.015.
Article CAS PubMed PubMed Central Google Scholar
Levy, D. J., & Glimcher, P. W. (2011). Comparing apples and oranges: using reward-specific and reward-general subjective value representation in the brain. Journal of Neuroscience, 31, 14693–14707. https://doi.org/10.1523/JNEUROSCI.2218-11.2011.
Article CAS PubMed Google Scholar
Lin, A., Adolphs, R., & Rangel, A. (2012). Social and monetary reward learning engage overlapping neural substrates. Social Cognitive and Affective Neuroscience, 7, 274–281. https://doi.org/10.1093/scan/nsr006.
Article PubMed Google Scholar
Martin, L. E., Cox, L. S., Brooks, W. M., & Savage, C. R. (2014). Winning and losing: differences in reward and punishment sensitivity between smokers and nonsmokers. Brain and Behavior, 4, 915–924. https://doi.org/10.1002/brb3.285.
Article PubMed PubMed Central Google Scholar
Mathar, D., Neumann, J., Villringer, A. & Horstmann, A. (2017a). Failing to learn from negative prediction errors: Obesity is associated with alterations in a fundamental neural learning mechanism. Cortex, 95, 222–237. https://doi.org/10.1016/j.cortex.2017.08.022.
Mathar, D., Wilkinson, L., Holl, A. K., Neumann, J., Deserno, L., Villringer, A., Jahanshahi, M., & Horstmann, A. (2017b). The role of dopamine in positive and negative prediction error utilization during incidental learning – insights from positron emission tomography, Parkinson’s disease and Huntington’s disease. Cortex; a Journal Devoted to the Study of the Nervous System and Behavior, 90, 149–162. https://doi.org/10.1016/j.cortex.2016.09.004.
Article PubMed Google Scholar
Morey, R. D. (2008). Confidence intervals from normalized data: A correction to Cousineau (2005). Tutorial in Quantitative Methods for Psychology, 4, 61–64. Retrieved from http://tqmp.org.
Mullett, T. L., & Tunney, R. J. (2013). Value representations by rank order in a distributed network of varying context dependency. Brain and Cognition, 82, 76–83. https://doi.org/10.1016/j.bandc.2013.02.010.
Article PubMed Google Scholar
Opel, N., Redlich, R., Grotegerd, D., Dohm, K., Haupenthal, C., Heindel, W., … Dannlowski, U. (2015). Enhanced neural responsiveness to reward associated with obesity in the absence of food-related stimuli. Human Brain Mapping, 36, 2330–2337. https://doi.org/10.1002/hbm.22773.
Article PubMed PubMed Central Google Scholar
Otto, A. R., Raio, C. M., Chiang, A., Phelps, E. A., & Daw, N. D. (2013). Working-memory capacity protects model-based learning from stress. Proceedings of the National Academy of Sciences of the United States of America, 110, 20941–20946. https://doi.org/10.1073/pnas.1312011110.
Article CAS PubMed PubMed Central Google Scholar
Park, S. Q., Kahnt, T., Beck, A., Cohen, M. X., Dolan, R. J., Wrase, J., & Heinz, A. (2010). Prefrontal cortex fails to learn from reward prediction errors in alcohol dependence. The Journal of Neuroscience, 30, 7749–7753. https://doi.org/10.1523/JNEUROSCI.5587-09.2010.
Article CAS PubMed PubMed Central Google Scholar
Pessiglione, M., Seymour, B., Flandin, G., Dolan, R. J., & Frith, C. D. (2006). Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature, 442, 1042–1045. https://doi.org/10.1038/nature05051.
Article CAS PubMed PubMed Central Google Scholar
Philip, R. C., Dauvermann, M. R., Whalley, H. C., Baynham, K., Lawrie, S. M., & Stanfield, A. C. (2012). A systematic review and meta-analysis of the fMRI investigation of autism spectrum disorders. Neuroscience & Biobehavioral Reviews, 36, 901–942. https://doi.org/10.1016/j.neubiorev.2011.10.008.
Article Google Scholar
Postuma, R. B., & Dagher, A. (2006). Basal ganglia functional connectivity based on a meta-analysis of 126 positron emission tomography and functional magnetic resonance imaging publications. Cerebral Cortex, 16, 1508–1521. https://doi.org/10.1093/cercor/bhj088.
Article PubMed Google Scholar
Reichelt, A. C., Morris, M. J., & Westbrook, R. F. (2014). Cafeteria diet impairs expression of sensory-specific satiety and stimulus-outcome learning. Frontiers in Psychology, 5, 852. https://doi.org/10.3389/fpsyg.2014.00852.
Article PubMed PubMed Central Google Scholar
Rothemund, Y., Preuschhof, C., Bohner, G., Bauknecht, H. C., Klingebiel, R., Flor, H., & Klapp, B. F. (2007). Differential activation of the dorsal striatum by high-calorie visual food stimuli in obese individuals. NeuroImage, 37, 410–421. https://doi.org/10.1016/j.neuroimage.2007.05.008.
Article PubMed Google Scholar
Rothkirch, M., Schmack, K., Deserno, L., Darmohray, D., & Sterzer, P. (2014). Attentional modulation of reward processing in the human brain. Human Brain Mapping, 35, 3036–3051. https://doi.org/10.1002/hbm.22383.
Article PubMed Google Scholar
Samanez-Larkin, G. R., Levens, S. M., Perry, L. M., Dougherty, R. F., & Knutson, B. (2012). Frontostriatal white matter integrity mediates adult age differences in probabilistic reward learning. The Journal of Neuroscience, 32, 5333–5337. https://doi.org/10.1523/JNEUROSCI.5756-11.2012.
Article CAS PubMed PubMed Central Google Scholar
Saunders, B. T., & Robinson, T. E. (2013). Individual variation in resisting temptation: implications for addiction. Neuroscience & Biobehavioral Reviews, 37, 1955–1975. https://doi.org/10.1016/j.neubiorev.2013.02.008.
Article Google Scholar
Schönberg, T., Daw, N. D., Joel, D., & O’Doherty, J. P. (2007). Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making. The Journal of Neuroscience, 27, 12860–12867. https://doi.org/10.1523/JNEUROSCI.2496-07.2007.
Article CAS PubMed PubMed Central Google Scholar
Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593–1599. https://doi.org/10.1126/science.275.5306.1593.
Article CAS PubMed Google Scholar
Sescousse, G., Caldú, X., Segura, B., & Dreher, J. C. (2013). Processing of primary and secondary rewards: a quantitative meta-analysis and review of human functional neuroimaging studies. Neuroscience and Biobehavioral Reviews, 37, 681–696. https://doi.org/10.1016/j.neubiorev.2013.02.002.
Article PubMed Google Scholar
Sharp, M. E., Foerde, K., Daw, N. D., & Shohamy, D. (2016). Dopamine selectively remediates “model-based” reward learning: a computational approach. Brain : A Journal of Neurology, 139, 355–364. https://doi.org/10.1093/brain/awv347.
Article Google Scholar
Simmank, J., Murawski, C., Bode, S., & Horstmann, A. (2015). Incidental rewarding cues influence economic decisions in people with obesity. Frontiers in Behavioral Neuroscience, 9, 278. https://doi.org/10.3389/fnbeh.2015.00278.
Article PubMed PubMed Central Google Scholar
Smittenaar, P., FitzGerald, T. H. B., Romei, V., Wright, N. D., & Dolan, R. J. (2013). Disruption of dorsolateral prefrontal cortex decreases model-based in favor of model-free control in humans. Neuron, 80, 914–919. https://doi.org/10.1016/j.neuron.2013.08.009.
Article CAS PubMed PubMed Central Google Scholar
Stice, E., Spoor, S., Bohon, C., & Small, D. M. (2008). Relation between obesity and blunted striatal response to food is moderated by TaqIA A1 allele. Science, 322, 449–452. https://doi.org/10.1126/science.1161550.
Article CAS PubMed Google Scholar
Stice, E., Spoor, S., Ng, J., & Zald, D. H. (2009). Relation of obesity to consummatory and anticipatory food reward. Physiology and Behavior, 97, 551–560. https://doi.org/10.1016/j.physbeh.2009.03.020.
Article CAS PubMed Google Scholar
Stice, E., Yokum, S., Blum, K., & Bohon, C. (2010). Weight gain is associated with reduced striatal response to palatable food. Journal of Neuroscience, 30, 13105–13109. https://doi.org/10.1523/JNEUROSCI.2105-10.2010.
Article CAS PubMed Google Scholar
Tom, S. M., Fox, C. R., Trepel, C., & Poldrack, R. A. (2007). The neural basis of loss aversion in decision-making under risk. Science, 315, 515–518. https://doi.org/10.1126/science.1134239.
Article CAS PubMed Google Scholar
Tzotzas, T., Krassas, G. E., Konstantinidis, T., & Bougoulia, M. (2000). Changes in lipoprotein(a) levels in overt and subclinical hypothyroidism before and during treatment. Thyroid: Official Journal of the American Thyroid Association, 10, 803–808. https://doi.org/10.1089/thy.2000.10.803.
Article CAS Google Scholar
Val-Laillet, D., Aarts, E., Weber, B., Ferrari, M., Quaresima, V., Stoeckel, L. E., … & Stice, E. (2015). Neuroimaging and neuromodulation approaches to study eating behavior and prevent and treat eating disorders and obesity. Neuroimage: Clinical, 8, 1–31. https://doi.org/10.1016/j.nicl.2015.03.016.
Article CAS Google Scholar
Van Holst, R. J., Chase, H. W., & Clark, L. (2014). Striatal connectivity changes following gambling wins and near-misses: associations with gambling severity. NeuroImage: Clinical, 5, 232–239. https://doi.org/10.1016/j.nicl.2014.06.008.
Article Google Scholar
Velázquez-Sánchez, C., Santos, J. W., Smith, K. L., Ferragud, A., Sabino, V., & Cottone, P. (2015). Seeking behavior, place conditioning, and resistance to conditioned suppression of feeding in rats intermittently exposed to palatable food. Behavioral Neuroscience, 129, 219–224. https://doi.org/10.1037/bne0000042.
Article CAS PubMed PubMed Central Google Scholar
Verdejo-Román, J., Vilar-López, R., Navas, J. F., Soriano-Mas, C., & Verdejo-García, A. (2017). Brain reward system’s alterations in response to food and monetary stimuli in overweight and obese individuals. Human Brain Mapping, 38, 666–677. https://doi.org/10.1002/hbm.23407.
Article PubMed Google Scholar
Wang, G. J., Volkow, N. D., Logan, J., Pappas, N. R., Wong, C. T., Zhu, W., … Fowler, J. S. (2001). Brain dopamine and obesity. Lancet, 357, 354–357. https://doi.org/10.1016/S0140-6736(00)03643-6.
Article CAS PubMed Google Scholar
Winecoff, A., Clithero, J. A., Carter, R. M., Bergman, S. R., Wang, L., & Huettel, S. A. (2013). Ventromedial prefrontal cortex encodes emotional value. The Journal of Neuroscience, 33, 11032–11039. https://doi.org/10.1523/JNEUROSCI.4317-12.2013.
Article CAS PubMed PubMed Central Google Scholar
Wittmann, B. C., & D’Esposito, M. (2015). Levodopa administration modulates striatal processing of punishment-associated items in healthy participants. Psychopharmacology, 232, 135–144. https://doi.org/10.1007/s00213-014-3646-7.
Article CAS PubMed Google Scholar
Yan, C., Yang, T., Yu, Q. J., Jin, Z., Cheung, E. F., Liu, X., & Chan, R. C. (2015). Rostral medial prefrontal dysfunctions and consummatory pleasure in schizophrenia: a meta-analysis of functional imaging studies. Psychiatry Research: Neuroimaging, 231, 187–196. https://doi.org/10.1016/j.pscychresns.2015.01.001.
Article PubMed Google Scholar
Zhang, W. N., Chang, S. H., Guo, L. Y., Zhang, K. L., & Wang, J. (2013). The neural correlates of reward-related processing in major depressive disorder: a meta-analysis of functional magnetic resonance imaging studies. Journal of Affective Disorders, 151, 531–539. https://doi.org/10.1016/j.jad.2013.06.039.
Article PubMed Google Scholar
Zhang, Z., Manson, K. F., Schiller, D., & Levy, I. (2014). Impaired associative learning with food rewards in obese women. Current Biology, 24, 1731–1736. https://doi.org/10.1016/j.cub.2014.05.075.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

Open Access Funding provided by Max Planck Society. We thank all participants involved in this study for their cooperation as well as B. Johst for her help in programming the paradigm, and R. Menger, A. Kummer, M. Jochemko, A. Theilemann, P. Schröder, and C. Rüdt von Collenberg for their assistance during participant recruitment and data collection. We especially thank I. García-García for her support and invaluable input during the formation of this manuscript.

Funding

This work was supported by the German Federal Ministry of Education and Research (FKZ: 01EO1001) (JK, AV, JN, AH) and the German Research Foundation (SFB 1052 Obesity mechanisms) (AV, JN, AH).

Author information

Authors and Affiliations

Max Planck Institute for Human Cognitive and Brain Sciences, Stephanstraße 1a, 04103, Leipzig, Germany
Jana Kube, David Mathar, Annette Horstmann, Sonja A. Kotz, Arno Villringer & Jane Neumann
IFB Adiposity Diseases, Leipzig University Medical Center, Leipzig, Germany
Jana Kube, Annette Horstmann, Arno Villringer & Jane Neumann
Faculty 5 - Business, Law and Social Sciences, Brandenburg University of Technology Cottbus-Senftenberg, Cottbus, Germany
Jana Kube
Department of Psychology, University of Cologne, Cologne, Germany
David Mathar
Department of Neuropsychology and Psychopharmacology, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, Netherlands
Sonja A. Kotz
Clinic of Cognitive Neurology, University Hospital Leipzig, Leipzig, Germany
Arno Villringer
Mind & Brain Institute, Berlin School of Mind and Brain, Humboldt-University, Berlin, Germany
Arno Villringer
Department of Medical Engineering and Biotechnology, University of Applied Sciences, Jena, Germany
Jane Neumann

Authors

Jana Kube
View author publications
You can also search for this author in PubMed Google Scholar
David Mathar
View author publications
You can also search for this author in PubMed Google Scholar
Annette Horstmann
View author publications
You can also search for this author in PubMed Google Scholar
Sonja A. Kotz
View author publications
You can also search for this author in PubMed Google Scholar
Arno Villringer
View author publications
You can also search for this author in PubMed Google Scholar
Jane Neumann
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jana Kube.

Ethics declarations

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Conflict of interest

The authors declare that they have no conflict of interest.

Electronic supplementary material

Below is the link to the electronic supplementary material.

11682_2017_9786_MOESM1_ESM.pdf

Statistical results showing no significant influence of working memory on subjective and objective markers of learning performance. (PDF 136 KB)

11682_2017_9786_MOESM2_ESM.pdf

Within-group and between-group fMRI results on PE processing in individuals with obesity and control participant during the first two blocks of the experiment (acquisition phase). (PDF 75 KB)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Kube, J., Mathar, D., Horstmann, A. et al. Altered monetary loss processing and reinforcement-based learning in individuals with obesity. Brain Imaging and Behavior 12, 1431–1449 (2018). https://doi.org/10.1007/s11682-017-9786-8

Download citation

Published: 29 December 2017
Issue Date: October 2018
DOI: https://doi.org/10.1007/s11682-017-9786-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Altered monetary loss processing and reinforcement-based learning in individuals with obesity

Abstract

Similar content being viewed by others

Higher body weight-dependent neural activation during reward processing

Dorsolateral and medial prefrontal cortex mediate the influence of incidental priming on economic decision making in obesity

Excessive body fat linked to blunted somatosensory cortex response to general reward in adolescents

Materials and methods

Participants

Procedure and probabilistic reinforcement learning task

Ratings

FMRI data acquisition

Behavioral data analyses

Computational learning model

FMRI data analyses

Association of neural responses and learning behavior

Data availability

Results

Behavioral performance

Ratings

FMRI results

Gain receipt and loss avoidance

Loss receipt and gain omission

PE representation

VS functional connectivity

Association of neural responses and learning behavior

Discussion

Outcome processing

Learning performance

PE processing and functional connectivity

Other mechanisms in reinforcement-based learning

Strength, future directions and limitations

Conclusion

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical approval

Conflict of interest

Electronic supplementary material

11682_2017_9786_MOESM1_ESM.pdf

11682_2017_9786_MOESM2_ESM.pdf

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation