Abnormal modulation of reward versus punishment learning by a dopamine D2-receptor antagonist in pathological gamblers

Janssen, Lieneke Katharina; Sescousse, Guillaume; Hashemi, Mahur Melina; Timmer, Monique Harmina Maria; ter Huurne, Niels Peter; Geurts, Dirk Everdina Maria; Cools, Roshan

doi:10.1007/s00213-015-3986-y

Abnormal modulation of reward versus punishment learning by a dopamine D2-receptor antagonist in pathological gamblers

Original Investigation
Open access
Published: 20 June 2015

Volume 232, pages 3345–3353, (2015)
Cite this article

Download PDF

You have full access to this open access article

Psychopharmacology Aims and scope Submit manuscript

Abnormal modulation of reward versus punishment learning by a dopamine D2-receptor antagonist in pathological gamblers

Download PDF

Lieneke Katharina Janssen^1,2,
Guillaume Sescousse^1,2,
Mahur Melina Hashemi¹,
Monique Harmina Maria Timmer^1,3,
Niels Peter ter Huurne^1,2,
Dirk Everdina Maria Geurts^1,2 &
…
Roshan Cools^1,2

4606 Accesses
26 Citations
13 Altmetric
1 Mention
Explore all metrics

Abstract

Rationale

Pathological gambling has been associated with dopamine transmission abnormalities, in particular dopamine D2-receptor deficiency, and reversal learning deficits. Moreover, pervasive theoretical accounts suggest a key role for dopamine in reversal learning. However, there is no empirical evidence for a direct link between dopamine, reversal learning and pathological gambling.

Objective

The aim of the present study is to triangulate dopamine, reversal learning, and pathological gambling.

Methods

Here, we assess the hypothesis that pathological gambling is accompanied by dopamine-related problems with learning from reward and punishment by investigating effects of the dopamine D2-receptor antagonist sulpiride (400 mg) on reward- and punishment-based reversal learning in 18 pathological gamblers and 22 healthy controls, using a placebo-controlled, double-blind, counter-balanced design.

Results

In line with previous studies, blockade of D2 receptors with sulpiride impaired reward versus punishment reversal learning in controls. By contrast, sulpiride did not have any outcome-specific effects in gamblers.

Conclusion

These data demonstrate that pathological gambling is associated with a dopamine-related anomaly in reversal learning from reward and punishment.

Striatum-related functional activation during reward- versus punishment-based learning in psychosis risk

Article 04 July 2019

Associations between Electrophysiological Evidence of Reward and Punishment-Based Learning and Psychotic Experiences and Social Anhedonia in At-Risk Groups

Article 15 September 2016

Neuroimaging of reward mechanisms in Gambling disorder: an integrative review

Article 13 September 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Pathological gambling is a psychiatric disorder characterized by elevated risk seeking and compulsive gambling behaviour. It can have dramatic consequences including bankruptcy, unemployment, relationship problems and even attempted suicide in up to 24 % of individuals (DeCaria et al. 1996), and its prevalence is estimated between 1 and 2 % in Western countries (Wardle et al. 2010; Welte et al. 2014). In the DSM-5, pathological gambling (renamed gambling disorder) is recognized as a behavioural addiction based on similarities with substance addiction in terms of personality traits (impulsivity and compulsivity), clinical symptoms (tolerance, withdrawal, and craving), and associated neurobiological mechanisms (Petry 2007; Potenza 2008, 2013). For example, both substance addiction and pathological gambling have been associated with dopamine transmission abnormalities, in particular dopamine D2-receptor deficiency (Boileau et al. 2013; Clark et al. 2012; Cocker et al. 2012; Comings et al. 1996; Dalley et al. 2007; but also see Joutsa et al. 2012; Linnet et al. 2010). Moreover, pervasive theoretical accounts of addiction suggest a key role for dopamine-dependent abnormalities in reinforcement learning in both substance addiction (Everitt and Robbins 2005; Redish 2004) and pathological gambling (Redish et al. 2007). In these accounts, aberrant reward prediction error signals lead to compulsive over-selection of actions directed at targets of addiction.

Empirical evidence supports the link between dopamine, learning and substance addiction. For example, D2-receptor stimulation has been shown to remediate cognitive impairments in human drug addicts in the context of reversal learning, which reflects the ability to flexibly adapt one’s behavior in response to contingency changes in the environment (Ersche et al. 2011). This concurs with the suggestion that low levels of dopamine D2-receptor availability might predispose to compulsive drug taking (Belin et al. 2008; Dalley et al. 2007) and the claim that reversal learning is a valuable tool for investigating D2-dependent compulsive aspects of pathological reward-seeking behaviours (Izquierdo and Jentsch 2012).

By contrast to substance addiction, there appears to be no empirical evidence for a direct link between dopamine, reversal learning and pathological gambling. Indeed, while research supports links between dopamine and gambling, and between gambling and learning, there is no evidence for dopamine-dependent learning abnormalities in gamblers. Thus, several indices of gambling severity have been associated with lower density of striatal dopamine D2-receptors (Boileau et al. 2013; see also Clark et al. 2012), even though overall group differences have not been reported so far (Linnet et al. 2010). In rodents, D2-receptor agents have been found to influence a behavioural analogue of loss-chasing (Rogers et al. 2013) as well as risk-taking behaviour (St. Onge et al. 2011; Winstanley et al. 2011). Further, gamblers were shown to exhibit diminished ability to update previously learned reward contingencies, as measured with classic instrumental reversal learning tasks (Boog et al. 2014; de Ruiter et al. 2009; Vanes et al. 2014). However, a direct link between dopamine D2-receptor dysfunction, abnormal learning, and pathological gambling is still missing.

The aim of the present study was to triangulate dopamine, gambling and reversal learning by investigating the effects of the dopamine D2-receptor antagonist sulpiride (400 mg) on reversal learning in pathological gamblers. Performance was assessed using a deterministic reversal learning paradigm that enables separate investigation of reward- and punishment prediction learning. This feature of the task is particularly pertinent here, because gamblers have been suggested to be preoccupied with rewards rather than with punishments (Kreussel et al. 2013; Romanczuk-Seiferth et al. 2014). Moreover this paradigm was previously shown to be particularly sensitive to manipulation of dopamine (Cools et al. 2006, 2009; van der Schaaf et al. 2014). Specifically, we have shown that administration of 400 mg of sulpiride to young healthy volunteers altered reward- versus punishment-based reversal learning (van der Schaaf et al. 2014). This finding is in line with a series of other studies (Eisenegger et al. 2014; Frank and O’Reilly 2006; Jocham et al. 2011, 2014) showing effects of D2-receptor blockade in healthy volunteers on reward- versus punishment-based learning and reinforcement-based decisions. As such, this paradigm is a valuable tool to assess whether pathological gambling is accompanied by a dopamine D2-dependent imbalance in learning from reward versus punishment.

This question is particularly relevant in the light of current inconsistencies in the literature regarding the effects of dopaminergic drugs, and specifically dopamine D2-receptor antagonists, in human gamblers. Whereas administration of the D2-receptor antagonist haloperidol has been reported to increase the self-reported desire to gamble in pathological gamblers (Zack and Poulos 2007) and to enhance the impact of reward on betting behaviour (Tremblay et al. 2011), the same drug (although at a lower dose) did not alter subjective, physiological or motivation-to-gamble responses in recreational gamblers in another study (Porchet et al. 2013). So far no study has investigated the effects of D2-receptor antagonism on reward- versus punishment-based learning in human gamblers. Based on previous work (Frank et al. 2004; van der Schaaf et al. 2014), we expected sulpiride to impair reward versus punishment-based learning in healthy controls. In gamblers, we expected impaired punishment versus reward-based learning under placebo. Moreover, we expected that this impairment would be remediated by sulpiride.

Methods

Subjects

Twenty-two male pathological gamblers and twenty-two healthy men were included following an in depth structured psychiatric interview administered by a medical doctor (MINI Plus (Sheehan et al. 1998) and the gambling section of the DSM-IV Diagnostic Interview Schedule (Robins et al. 1998)). Two gamblers were excluded from the analyses because of their difficulty understanding the task. An additional two gamblers were excluded from the analyses because of comorbid cannabis dependence within the past 6 months (for details see Supplementary Materials ). Therefore, the reported results are based on data from 18 gamblers and 22 controls. Supplementary analyses including the two cannabis addicts are reported in the Supplementary Materials and confirmed the effects of primary interest reported in the main text. All subjects provided written informed consent, which was approved by the regional research ethics committee (Commissie Mensgebonden Onderzoek, regio Arnhem-Nijmegen, Registration Number: 2011/204, Date: 14 November 2011), and received compensation for participation.

Pathological gamblers were recruited through advertisement (n = 14) and addiction treatment centers (n = 4), and reported not to be medicated or in treatment for their pathological gambling at the time of testing. Controls were recruited through advertisement. All gamblers, with the exception of one, qualified as pathological gambler as they met ≥5 DSM-IV-TR criteria for pathological gambling, and were otherwise healthy. One gambler qualified as problem gambler as he met only 4 DSM-IV criteria. The severity of gambling symptoms was assessed using the South Oaks Gambling Screen (SOGS; Lesieur and Blume 1987). All gamblers had a minimum SOGS score of 6 (range = 6–18), whereas controls, with the exception of two subjects, had a SOGS score of 0 (range = 0–2).

The two groups were matched for age, net income, body mass index, and verbal IQ as estimated by the Dutch version of the National Adult Reading Test (NLV) (Table 1). Subjects were excluded (from both groups) if they were currently following psychiatric treatment (except cognitive behavioural therapy; n = 2); were using more than four alcoholic beverages daily; were using psychotropic medication; had a lifetime history of schizophrenia, bipolar disorder, attention deficit hyperactivity disorder, autism, bulimia or anorexia, anxiety disorder, obsessive compulsive disorder; or had a past 6-month history of major depressive episode. Given the high comorbidity between pathological gambling and other psychiatric disorders (Lorains et al. 2011), gamblers with the following comorbidities were included: lifetime history of dysthymia (n = 1); and remitted posttraumatic stress disorder (n = 1; remitted > 4 years). Excluding these gamblers from the analyses did not change the results. In addition, three gamblers used cannabis weekly in the past 6 months, but did not meet the DSM criteria for abuse/dependence. Control subjects had no relevant psychiatric history.

Table 1 Demographics and self-report measures

Full size table

Self-report questionnaires were administered to further characterize the subjects (Table 1): the Fagerstrom Test for Nicotine Dependence (FTND; Heatherton et al. 1991), the Alcohol Use Disorders Identification Test (AUDIT; Saunders et al. 1993), the Hospital Anxiety and Depression Scale (HADS; Zigmond and Snaith 1983), the Barratt Impulsiveness Scale (BIS-11; Patton et al. 1995), and the Behavioural Inhibition System/Behavioural Activation System scale (BIS/BAS; Carver and White 1994). Frequent forms of gambling were assessed using item 1 of the SOGS and are expressed in terms of the percentage of gamblers who play the following games at least once a week for money: slot machines (61 %), card games (61 %), casino games (33 %), sports betting (28 %), lotteries (22 %), bowling, pool, golf, darts or alike (5,5 %), stock market (5,5 %).

Procedure

Subjects visited the lab on two occasions; they were tested once after receiving an oral dose of sulpiride (Dogmatil®, Sanofi-Aventis; 400 mg), and once after a placebo. The order of administration was randomized according to a double-blind, cross-over design. The test sessions were separated by at least one week. Starting time of test sessions was always between 9 and 10 am. Subjects were asked to abstain from recreational drugs 1 week before testing, from alcohol 24 h before testing, and from caffeine and nicotine the morning before testing. The behavioural task was part of a larger protocol and was performed approximately 3.5 h after drug intake, and thus coincided with high plasma concentrations of sulpiride (Mehta et al. 2003).

Background neuropsychological tests (digit span, verbal fluency, number cancellation, and block completion) were administered at the end of the day, 4.25 h after drug intake. Mood, blood pressure and heart rate were measured immediately prior to drug intake, as well as 1 and 4.5 h following drug intake. Subjective mood was measured using the Bond and Lader visual analogue scales (Bond and Lader 1974) and the Positive and Negative Affect Scales (PANAS; Watson et al. 1988).

Experimental design

We employed a deterministic reversal learning task similar to that described elsewhere (Cools et al. 2006; van der Schaaf et al. 2014). The task was programmed with Presentation software (Version 16, Neurobiobehavioral Systems, Inc.). The layout of the task was adjusted to fit the original instructions of a casino setting (see Cools et al. 2006) and to be more intuitive for gamblers. On each trial, subjects were presented with two gambling cards simultaneously (Fig. 1). One of the two cards was associated with upcoming reward, the other one with upcoming punishment. Unlike classic instrumental reversal learning tasks, subjects did not choose between the two stimuli. Instead one card was highlighted, and subjects had to learn to predict the outcome associated with this preselected card by trial-and-error. Responses were made by pressing one of two buttons—one for reward, the other for punishment—with the right index or middle finger (counterbalanced across subjects), and were self-paced. After a 1,000-ms post-response delay the outcome was presented for 500-ms followed by a 500-ms intertrial interval. Note that the outcomes were not contingent on the subjects’ responses, but on the highlighted stimulus; thus, contingencies were Pavlovian rather than instrumental. The stimulus-outcome contingency reversed after five to nine consecutive correct predictions. Subjects performed two blocks, each consisting of two runs of 120 trials (i.e. a total of 480 trials). In one block, reversals were always signaled by unexpected rewards (“reward block”), and in the other block reversals were always signaled by unexpected punishments (“punishment block”). Reward consisted of a smiling emoticon with a “+€100” sign. Punishment consisted of a sad emoticon with a “−€100” sign. The order of blocks was counterbalanced between sessions and across subjects. Error rate on the trials immediately after reversals (i.e. unexpected reward or punishment) indexes the ability to update predictions of reward and punishment, i.e. how well subjects learned from either unexpected reward or unexpected punishment. On these reversal trials, the same stimulus was highlighted as on the previous unexpected outcome trial such that non-outcome-specific requirements for motor switching and prediction updating were matched between reward and punishment conditions. This enabled direct comparison between reward and punishment reversals. Subjects were instructed according to the original procedure by Cools et al. (2006) and were trained extensively before the experiment so that they understood the structure of the task and the Pavlovian, rather than instrumental, nature of the contingencies (for details see Supplementary Materials ).

Analyses

Error rates on reversal trials (trials immediately after unexpected outcomes) were arcsine transformed as is appropriate when variance is proportional to the mean (Howell 1997). Error rates on reversal trials were analysed using a mixed ANOVA (SPSS 19, Chicago, IL) with drug (placebo vs. sulpiride) and outcome (unexpected reward vs. punishment) as within-subject factors and group (gamblers vs. controls) as a between-subject factor. In addition, we assessed the total number of reversals obtained throughout the task. Because the stimulus-outcome contingency in the task reversed after five to nine consecutive correct predictions, and the total number of trials was fixed, the number of reversals for the reward and punishment block reflects performance also on the non-reversal trials.

Results

Figure 2 shows that sulpiride altered reward versus punishment reversal learning in controls, while not altering reversal learning in gamblers. This observation was substantiated by an ANOVA of the error rates on reversal trials (Table 2), which revealed a significant interaction of group × drug × outcome (F(1,38) = 5.288, p = 0.027). When decomposing the three-way interaction effect into two-way interaction effects for each group, we found that this was driven by a drug × outcome interaction in controls (F(1,21) = 4.768, p = 0.040). By contrast, there was no drug × outcome interaction in gamblers (F(1,17) = 1.183, p = 0.292). The drug × outcome interaction in controls was due to a significant simple main effect of drug on reward learning (F(1,21) = 5.439, p = 0.030), not punishment learning (F(1,21) = 0.523, p = 0.478). Thus, sulpiride induced a shift away from reward learning in controls, while not altering the balance between reward and punishment learning in gamblers. Under placebo there was no group × outcome interaction (F(1,38) = 0.976, p = 0.329). In addition to the outcome-specific effects of sulpiride on reversal learning, there was also an outcome-nonspecific main effect of drug on error rate (F(1,38) = 4.452, p = 0.041). This was due to sulpiride impairing performance across groups and outcomes. This raises the question whether the impairment is specific to reversal trials or extends to non-reversal trials. Supplementary analysis including the within-subjects factor trial type (reversal, non-reversal reward, and non-reversal punishment trials) revealed a significant group × drug × outcome × trial type interaction (F(2,37) = 3.581, p = 0.038). When decomposing this interaction into the simple three-way interaction effect for each trial type, we found that this four-way interaction was driven by a group × drug × outcome interaction for reversal trials only. In line with that, there was no significant effect of group, drug or outcome on performance on non-reversal trials as measured by the total number of reversals (Supplementary Fig. Fig. S1).

Table 2 Mean error rates on reversal trials

Full size table

When excluding gamblers with other comorbidities (PTSD and dysthymia; n = 2), the effects did not change. The interaction of group × drug × outcome remained highly significant (F(1,36) = 7.698, p = 0.009). Decomposing the three-way interaction effect into two-way interaction effects for gamblers revealed a marginally significant outcome × drug interaction effect (F(1,15) = 3.637, p = 0.076) driven by a highly significant impairing effect of drug on punishment learning (F(1,15) = 9.370, p = 0.008), but not on reward learning (F(1,15) = 0.206, p = 0.657). Thus in the comorbidity-free pathological gamblers, sulpiride tended to induce a shift away from punishment learning.

The groups did not differ in terms of alcohol use or in terms of the number of smokers, but they did differ significantly in terms of nicotine dependence (FTND), depression and anxiety (HADS), impulsivity (BIS-11), and reward sensitivity (BAS; Table 1). However, there were no correlations between these measures and the drug × outcome interaction effect of interest, suggesting that these between-group differences did not drive the current observations (all |r| < 0.35, p > 0.155).

Our results are unlikely driven by nonspecific effects of the drug manipulation, because heart rate, blood pressure, mood, and global cognitive function (as measured by background neuropsychological tests) did not significantly differ between the placebo and sulpiride sessions in either group (Supplementary Tables S1-3). Working memory capacity—as a proxy for dopamine synthesis capacity in the striatum (Cools et al. 2008)—was previously shown to predict the effect of sulpiride on learning from reward versus punishment in this paradigm (van der Schaaf et al. 2014). In the current study, working memory capacity did not correlate with the effect of the drug in either group. However, it should be noted that the majority of our sample (independent of group) falls in the low working memory group as reported by van der Schaaf et al. (2014), although the difference in the proportion of high and low working memory capacity subjects between the studies was only marginally significant (cut-off digit span = 17.4; chi(1) = 2.967, p = 0.085).

Discussion

Pathological gambling is thought to implicate dopamine, which is well established to modulate learning from reward versus punishment (Maia and Frank 2011). Like substance addiction, pathological gambling has been hypothesized to be accompanied by dopamine-related impairments in learning (Redish et al. 2007). However, this hypothesis has hitherto never been tested. Here, we establish for the first time a link between pathological gambling, dopamine and learning from reward versus punishment. Specifically, we show that administration of sulpiride, a D2-receptor antagonist, impaired reward- versus punishment-based reversal learning in controls, while not altering reward- versus punishment-based reversal learning in gamblers. However, caution is needed as to the interpretation of the lack of drug effect in gamblers, as supplementary analyses suggest that sulpiride might even have the diametrically opposite effect, i.e. impairing punishment rather than reward-based reversal learning, when only considering gamblers without comorbidities (n = 16).

The effect of sulpiride on outcome-specific reversal learning in controls is generally consistent with previous work using the same drug and task in healthy volunteers (van der Schaaf et al. 2014). In this prior study, we showed that the direction of the effect of sulpiride depended on baseline working memory capacity, so that sulpiride impaired reward relative to punishment learning in low working memory participants, whereas it improved reward relative to punishment learning in high working memory participants. The behaviour of our participants is consistent with the impairments observed in low working memory participants. Examination of working memory capacity in our participants showed that the majority of our sample falls in the low working memory group as reported by van der Schaaf et al. (2014). This might reflect the fact that the current sample is more heterogeneous in terms of age and received lower education than the sample of van der Schaaf et al. (2014).

The present results reveal a striking difference in how pathological gamblers and healthy controls respond to the same antipsychotic drug. The differential effect of sulpiride on punishment learning in controls versus non-comorbid gamblers is intriguing and might be relevant in the context of evidence that antipsychotic drugs can impair conditioned avoidance responding (Smith et al. 2004) (although note that our task was not optimized for measuring actual avoidance of punishment). One possibility is that these results reflect an underlying difference in the endogenous dopamine system. In this context, it is interesting to note that pathological gambling has been argued to be accompanied by reduced availability of D2-receptors (Comings and Blum 2000). Some evidence for this hypothesis comes from PET studies, showing that gambling severity and impulsiveness in pathological gamblers correlate with D2/D3-receptor availability (Boileau et al. 2013; Clark et al. 2012). In addition, there is evidence for enhanced drug- and task-induced dopamine release in individuals exhibiting compulsive gambling behaviour (Boileau et al. 2014; Evans et al. 2006; Linnet et al. 2010; O’Sullivan et al. 2011; Steeves et al. 2009).

According to current modeling work of striatal dopamine, D2-receptor blockade might alter reward- versus punishment-based learning and performance by shifting the balance between processing in the D1-mediated GO-pathway and D2-mediated NOGO-pathway of the basal ganglia (Frank 2005; Maia and Frank 2011). In line with the present observation in controls, Pessiglione et al. (2006) found that the D2-receptor antagonist haloperidol impaired reward-learning and attenuated reward prediction error signals in the striatum, suggesting a shift to processing in the D2-mediated NOGO-pathway favouring learning from punishment over learning from reward. Similarly, in a study by Eisenegger et al. (2014) a high dose (800 mg) of the D2-receptor antagonist sulpiride impaired reward-related performance. However, in apparent contrast to those previous findings as well as our current observation in controls, Frank and O’Reilly (2006) found that the D2-receptor antagonist haloperidol improved reward- versus punishment-based learning. In addition, a low dose (200 mg) of the D2-antagonist amisulpride has been found to improve reward- versus punishment-based learning and enhance reward prediction error signal in the striatum (Jocham et al. 2011), suggesting a shift to processing in the D1-mediated GO-pathway. Note however that at a higher dose (400 mg), amisulpride impaired both reward and punishment learning (Jocham et al. 2014). These seemingly paradoxical findings may be explained by the use of different doses, which may in turn lead to differential action of D2-receptor antagonists on pre- vs. postsynaptic receptors in different studies (Frank and O’Reilly 2006). Reward-related improvements with D2-receptor antagonists are generally attributed to action at self-regulatory presynaptic receptors (enhancing dopamine in the synapse), whereas reward-related impairments with D2-receptor antagonists are generally associated with action at postsynaptic receptors. In our study, sulpiride might have shifted the balance away from reward learning in controls, consistent with postsynaptic action, but not in gamblers, suggesting a reduction in postsynaptic action of sulpiride in gamblers versus controls. In fact, when assessing a comorbidity-free group of gamblers, sulpiride tended to actually impair punishment- rather than reward-based learning, raising the possibility that sulpiride might have acted pre- rather than postsynaptically. Preferential sensitivity of pre- versus postsynaptic D2-receptors might make particular sense when synaptic dopamine levels are supra-optimal, e.g. through enhanced dopamine release (Boileau et al. 2014). We emphasize that this hypothesis about the specific mechanism underlying our effect in pathological gamblers remains speculative. One reason is that dopamine D2-receptor antagonists like sulpiride seem to have dose-dependent effects on pre- vs postsynaptic striatal D2-receptors (Frank and O’Reilly 2006). Our dose of 400 mg has been shown to occupy ∼30 % of striatal postsynaptic D2-receptors (Mehta et al. 2008). However, we have no way of quantifying the exact occupancy of pre- and postsynaptic D2-receptors in this study. The pre- versus postsynaptic nature of these D2-receptor effects might be disentangled in future work, for example by administering a higher dose of sulpiride, or by exploiting common polymorphisms in the dopamine receptor D2 gene that are thought to affect the balance between pre- and postsynaptic action (Frank et al. 2007).

One might have expected a baseline difference in reward- and punishment-based learning between the groups. Indeed previous studies have reported slowed learning from punishment on an instrumental reversal learning task (de Ruiter et al. 2009; Vanes et al. 2014) and possibly increased preoccupation with rewards rather than punishments (Kreussel et al. 2013) in pathological gamblers versus controls. Surprisingly, in our study, gamblers and controls learned equally well from unexpected rewards and unexpected punishments under placebo. We are puzzled about this, and hypothesize that this might reflect compensatory mechanisms in the dopamine system, related, e.g. to upregulation of dopamine synthesis capacity, dopamine release, or postsynaptic dopamine receptor sensitivity. This generally concurs with the view that underlying pathology might not surface as impairment under baseline conditions, but only when probing the system, for example by using a pharmacological challenge (Verdejo-García et al. 2008). This also corresponds with the finding that gambling-related abnormalities in baseline D2-receptor availability (Boileau et al. 2013; Clark et al. 2012; Joutsa et al. 2012) are more subtle than these in drug- or task-induced dopamine release (Boileau et al. 2014; Linnet et al. 2010).

In short, we found that blockade of D2-receptors with sulpiride impaired reward versus punishment learning in controls, but not in gamblers. By contrast, in comorbidity-free gamblers, sulpiride impaired punishment, but not reward learning. This strongly suggests that pathological gambling is associated with a dopamine D2-receptor-related anomaly in learning from reward and punishment. Future neurochemical work, using PET or genetics is required to address the exact neurochemical mechanisms of this anomaly.

References

Belin D, Mar AC, Dalley JW, Robbins TW, Everitt BJ (2008) High impulsivity predicts the switch to compulsive cocaine-taking. Science 320:1352–1355
Article PubMed Central CAS PubMed Google Scholar
Boileau I, Payer D, Chugani B, Lobo D, Behzadi A, Rusjan PM, Houle S, Wilson AA, Warsh J, Kish SJ (2013) The D2/3 dopamine receptor in pathological gambling: a positron emission tomography study with [11C] (+) propyl hexahydro naphtho oxazin and [11C] raclopride. Addiction 108:953–963
Article PubMed Google Scholar
Boileau I, Payer D, Chugani B, Lobo DSS, Houle S, Wilson AA, Warsh J, Kish SJ, Zack M (2014) In vivo evidence for greater amphetamine-induced dopamine release in pathological gambling: a positron emission tomography study with [lsqb]11C[rsqb]-(+)-PHNO. Mol Psychiatry 19:1305–1313
Article CAS PubMed Google Scholar
Bond A, Lader M (1974) The use of analogue scales in rating subjective feelings. Br J Med Psychol 47(3):211–218
Article Google Scholar
Boog M, Höppener P, v. d. Wetering BJM, Goudriaan AE, Boog MC, Franken IHA (2014) Cognitive inflexibility in gamblers is primarily present in reward-related decision making. Front Hum Neurosci 8:569. doi:10.3389/fnhum.2014.00569
Carver CS, White TL (1994) Behavioral inhibition, behavioral activation, and affective responses to impending reward and punishment: the BIS/BAS Scales. J Pers Soc Psychol 67:319
Article Google Scholar
Clark L, Stokes PR, Wu K, Michalczuk R, Benecke A, Watson BJ, Egerton A, Piccini P, Nutt DJ, Bowden-Jones H et al (2012) Striatal dopamine D2/D3 receptor binding in pathological gambling is correlated with mood-related impulsivity. NeuroImage 63:40–46
Article PubMed Central CAS PubMed Google Scholar
Cocker PJ, Dinelle K, Kornelson R, Sossi V, Winstanley CA (2012) Irrational choice under uncertainty correlates with lower striatal D2/3 receptor binding in rats. J Neurosci 32:15450–15457
Article CAS PubMed Google Scholar
Comings DE, Blum K (2000) Reward deficiency syndrome: genetic aspects of behavioral disorders. Prog Brain Res 126:325–341
Comings D, Rosenthal R, Lesieur HR, Rugle L, Muhleman D, Chiu C, Dietz G, Gade R (1996) A study of the dopamine D2 receptor gene in pathological gambling. Pharmacogenetics 6:223–234
Article CAS PubMed Google Scholar
Cools R, Altamirano L, D’Esposito M (2006) Reversal learning in Parkinson’s disease depends on medication status and outcome valence. Neuropsychologia 44:1663–1673
Article PubMed Google Scholar
Cools R, Gibbs SE, Miyakawa A, Jagust W, D’Esposito M (2008) Working memory capacity predicts dopamine synthesis capacity in the human striatum. J Neurosci 28:1208–1212
Article CAS PubMed Google Scholar
Cools R, Frank MJ, Gibbs SE, Miyakawa A, Jagust W, D’Esposito M (2009) Striatal dopamine predicts outcome-specific reversal learning and its sensitivity to dopaminergic drug administration. J Neurosci 29:1538–1543
Article PubMed Central CAS PubMed Google Scholar
Dalley JW, Fryer TD, Brichard L, Robinson ES, Theobald DE, Lääne K, Peña Y, Murphy ER, Shah Y, Probst K (2007) Nucleus accumbens D2/3 receptors predict trait impulsivity and cocaine reinforcement. Science 315:1267–1270
Article PubMed Central CAS PubMed Google Scholar
DeCaria C, Hollander E, Grossman R, Wong C, Mosovich S, Cherkasky S (1996) Diagnosis, neurobiology, and treatment of pathological gambling. J Clin Psychiatry 57
de Ruiter MB, Veltman DJ, Goudriaan AE, Oosterlaan J, Sjoerds Z, van den Brink W (2009) Response perseveration and ventral prefrontal sensitivity to reward and punishment in male problem gamblers and smokers. Neuropsychopharmacology 34:1027–1038
Article PubMed Google Scholar
Eisenegger C, Naef M, Linssen A, Clark L, Gandamaneni PK, Mueller U, Robbins TW (2014) Role of dopamine D2 receptors in human reinforcement learning. Neuropsychopharmacology 39:2366–2375
Article CAS PubMed Google Scholar
Ersche KD, Roiser JP, Abbott S, Craig KJ, Müller U, Suckling J, Ooi C, Shabbir SS, Clark L, Sahakian BJ et al (2011) Response perseveration in stimulant dependence is associated with striatal dysfunction and can be ameliorated by a D2/3 receptor agonist. Biol Psychiatry 70:754–762
Article CAS PubMed Google Scholar
Evans AH, Pavese N, Lawrence AD, Tai YF, Appel S, Doder M, Brooks DJ, Lees AJ, Piccini P (2006) Compulsive drug use linked to sensitized ventral striatal dopamine transmission. Ann Neurol 59:852–858
Article CAS PubMed Google Scholar
Everitt BJ, Robbins TW (2005) Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nat Neurosci 8:1481–1489
Article CAS PubMed Google Scholar
Frank MJ (2005) Dynamic dopamine modulation in the basal ganglia: a neurocomputational account of cognitive deficits in medicated and non-medicated Parkinsonism. J Cogn Neurosci 17:51–57
Article PubMed Google Scholar
Frank MJ, O’Reilly RC (2006) A mechanistic account of striatal dopamine function in human cognition: psychopharmacological studies with cabergoline and haloperidol. Behav Neurosci 120:497–517
Article CAS PubMed Google Scholar
Frank MJ, Seeberger LC, O’Reilly RC (2004) By carrot or by stick: cognitive reinforcement learning in Parkinsonism. Science 306:1940–1943
Article CAS PubMed Google Scholar
Frank M, Moustafa A, Haughey H, Curran T, Hutchison K (2007) Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning. Proc Natl Acad Sci 104:16311–16316
Heatherton TF, Kozlowski LT, Frecker RC (1991) The Fagerstrom test for nicotine dependence: a revision of the Fagerstrom tolerance questionnaire. Br J Addict 86:1119–1127
Article CAS PubMed Google Scholar
Howell DC (1997) Statistical methods for psychology. Belmont, USA: Wadsworth Publishing Company
Izquierdo A, Jentsch JD (2012) Reversal learning as a measure of impulsive and compulsive behavior in addictions. Psychopharmacology 219:607–620
Article PubMed Central CAS PubMed Google Scholar
Jocham G, Klein TA, Ullsperger M (2011) Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices. J Neurosci 31:1606–1613
Article CAS PubMed Google Scholar
Jocham G, Klein TA, Ullsperger M (2014) Differential modulation of reinforcement learning by D2 dopamine and NMDA glutamate receptor antagonism. J Neurosci 34:13151–13162
Joutsa J, Johansson J, Niemelä S, Ollikainen A, Hirvonen M, Piepponen P, Arponen E, Alho H, Voon V, Rinne J et al (2012) Mesolimbic dopamine release is linked to symptom severity in pathological gambling. NeuroImage 60:1992–1999
Article CAS PubMed Google Scholar
Kreussel L, Hewig J, Kretschmer N, Hecht H, Coles MGH, Miltner WHR (2013) How bad was it? Differences in the time course of sensitivity to the magnitude of loss in problem gamblers and controls. Behav Brain Res 247:140–145
Article PubMed Google Scholar
Lesieur HR, Blume SB (1987) The South Oaks Gambling Screen (SOGS): a new instrument for the identification of pathological gamblers. Am J Psychiatry 144:1184–1188
Article CAS PubMed Google Scholar
Linnet J, Peterson E, Doudet D, Gjedde A, Møller A (2010) Dopamine release in ventral striatum of pathological gamblers losing money. Acta Psychiatr Scand 122:326–333
Article PubMed Google Scholar
Lorains FK, Cowlishaw S, Thomas SA (2011) Prevalence of comorbid disorders in problem and pathological gambling: systematic review and meta-analysis of population surveys. Addiction 106:490–498
Maia TV, Frank MJ (2011) From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci 14:154–162
Article PubMed Central CAS PubMed Google Scholar
Mehta M, McGowan S, Lawrence A, Aitken M, Montgomery A, Grasby P (2003) Systemic sulpiride modulates striatal blood flow: relationships to spatial working memory and planning. Neuroimage 20:1982–1994
Article PubMed Google Scholar
Mehta MA, Montgomery AJ, Kitamura Y, Grasby PM (2008) Dopamine D2 receptor occupancy levels of acute sulpiride challenges that produce working memory and learning impairments in healthy volunteers. Psychopharmacology (Berl) 196:157–165
O’Sullivan SS, Wu K, Politis M, Lawrence AD, Evans AH, Bose SK, Djamshidian A, Lees AJ, Piccini P (2011) Cue-induced striatal dopamine release in Parkinson’s disease-associated impulsive-compulsive behaviours. Brain 134(4):969–978. doi:10.1093/brain/awr003
Patton JH, Stanford MS, Barratt ES (1995) Factor structure of the Barratt impulsiveness scale. J Clin Psychol 51:768–774
Article CAS PubMed Google Scholar
Pessiglione M, Seymour B, Flandin G, Dolan RJ, Frith CD (2006) Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature 442:1042–1045
Article PubMed Central CAS PubMed Google Scholar
Petry NM (2007) Gambling and substance use disorders: current status and future directions. Am J Addict 16:1–9
Article PubMed Google Scholar
Porchet RI, Boekhoudt L, Studer B, Gandamaneni PK, Rani N, Binnamangala S, Müller U, Clark L (2013) Opioidergic and dopaminergic manipulation of gambling tendencies: a preliminary study in male recreational gamblers. Front Behav Neurosci 7:138. doi:10.3389/fnbeh.2013.00138
Potenza MN (2008) The neurobiology of pathological gambling and drug addiction: an overview and new findings. Philos Trans Royal Soc B: Biol Sci 363:3181–3189
Article Google Scholar
Potenza MN (2013). How central is dopamine to pathological gambling or gambling disorder?. Front Behav Neurosci 7:206. doi:10.3389/fnbeh.2013.00206
Redish AD (2004) Addiction as a computational process gone awry. Science 306:1944–1947
Redish AD, Jensen S, Johnson A, Kurth-Nelson Z (2007) Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling. Psychol Rev 114:784–805
Robins LN, Cottler L, Bucholz K, Compton W (1998) Diagnostic interview schedule for DSM-IV (DIS-IV-Revision 11 Sep 1998). St. Louis, MO: Washington University, School of Medicine, Department of Psychiatry
Rogers RD, Wong A, McKinnon C, Wistanley CA (2013) Systemic administration of 8-OH-DPAT and eticlopride, but not SCH23390, alters loss-chasing behavior in the rat. Neuropsychopharmacology 38:1094–1104
Article PubMed Central CAS PubMed Google Scholar
Romanczuk-Seiferth N, Koehler S, Dreesen C, Wüstenberg T, Heinz A (2014) Pathological gambling and alcohol dependence: neural disturbances in reward and loss avoidance processing. Addict Biol 20(3):557–569
Article PubMed Google Scholar
Saunders JB, Aasland OG, Babor TF, de la Fuente JR, Grant M (1993) Development of the Alcohol Use Disorders Identification Test (AUDIT): WHO collaborative project on early detection of persons with harmful alcohol consumption–II. Addiction 88:791–804
Article CAS PubMed Google Scholar
Sheehan D, Lecrubier Y, Sheehan K, Amorim P, Janavs J, Weiller E, Hergueta T, Baker R, Dunbar G (1998) The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry 59(Suppl 20):22–33
PubMed Google Scholar
Smith A, Li M, Becker S, Kapur S (2004) A model of antipsychotic action in conditioned avoidance: a computational approach. Neuropsychopharmacology 29:1040–1049
Steeves TDL, Miyasaki J, Zurowski M, Lang AE, Pellecchia G, Van Eimeren T, Rusjan P, Houle S, Strafella AP (2009) Increased striatal dopamine release in Parkinsonian patients with pathological gambling: a [11C] raclopride PET study. Brain 132:1376–1385
Article PubMed Central CAS PubMed Google Scholar
St. Onge JR, Abhari H, Floresco SB (2011) Dissociable contributions by prefrontal D1 and D2 receptors to risk-based decision making. J Neurosci 31:8625–8633
Tremblay AM, Desmond RD, Poulos CX, Zack M (2011) Haloperidol modifies instrumental aspects of slot machine gambling in pathological gamblers and healthy controls. Addict Biol 16:467–484
Article CAS PubMed Google Scholar
van der Schaaf ME, van Schouwenburg MR, Geurts DEM, Schellekens AFA, Buitelaar JK, Verkes RJ, Cools R (2014) Establishing the dopamine dependency of human striatal signals during reward and punishment reversal learning. Cereb Cortex 24:633–642
Article PubMed Google Scholar
Vanes LD, van Holst RJ, Jansen JM, van den Brink W, Oosterlaan J, Goudriaan AE (2014) Contingency learning in alcohol dependenc and pathological gambling: learning and unlearning reward contingencies. Alcohol Clin Exp Res 38:1602–1610
Article PubMed Central PubMed Google Scholar
Verdejo-García A, Lawrence AJ, Clark L (2008) Impulsivity as a vulnerability marker for substance-use disorders: review of findings from high-risk research, problem gamblers and genetic association studies. Neurosci Biobehav Rev 32:777–810
Article PubMed Google Scholar
Wardle H, Moody A, Spence S, Orford J, Volberg R, Jotangia D et al (2010) British gambling prevalence survey. National Centre for Social Research, London
Google Scholar
Watson D, Clark LA, Tellegen A (1988) Development and validation of brief measures of positive and negative affect: the PANAS scales. J Pers Soc Psychol 54:1063
Article CAS PubMed Google Scholar
Welte JW, Barnes GM, Tidwell M-C, Hoffman JH, Wieczorek WF (2014) Gambling and problem gambling in the United States: Changes between 1999 and 2013. J Gambl Stud 1–21. doi:10.1007/s10899-014-9471-4
Winstanley CA, Cocker PJ, Rogers RD (2011) Dopamine modulates reward expectancy during performance of a slot machine task in rats: evidence for a ‘near-miss’ effect. Neuropsychopharmacology 36:913–925
Article PubMed Central CAS PubMed Google Scholar
Zack M, Poulos CX (2007) A D2 antagonist enhances the rewarding and priming effects of a gambling episode in pathological gamblers. Neuropsychopharmacology 32:1678–1686
Article CAS PubMed Google Scholar
Zigmond AS, Snaith RP (1983) The Hospital Anxiety and Depression Scale. Acta Psychiatr Scand 67:361–370
Article CAS PubMed Google Scholar

Download references

Acknowledgments

Guillaume Sescousse was supported by a Rubicon fellowship from the Netherlands Research Organization (NWO). Data will be made available upon request.

Conflict of interest

The authors declare no conflict of interest

Author information

Authors and Affiliations

Donders Institute for Brain, Cognition and Behavior, Radboud University, PO Box 9101, 6500 HB, Nijmegen, The Netherlands
Lieneke Katharina Janssen, Guillaume Sescousse, Mahur Melina Hashemi, Monique Harmina Maria Timmer, Niels Peter ter Huurne, Dirk Everdina Maria Geurts & Roshan Cools
Department of Psychiatry, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands
Lieneke Katharina Janssen, Guillaume Sescousse, Niels Peter ter Huurne, Dirk Everdina Maria Geurts & Roshan Cools
Department of Neurology, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands
Monique Harmina Maria Timmer

Authors

Lieneke Katharina Janssen
View author publications
You can also search for this author in PubMed Google Scholar
Guillaume Sescousse
View author publications
You can also search for this author in PubMed Google Scholar
Mahur Melina Hashemi
View author publications
You can also search for this author in PubMed Google Scholar
Monique Harmina Maria Timmer
View author publications
You can also search for this author in PubMed Google Scholar
Niels Peter ter Huurne
View author publications
You can also search for this author in PubMed Google Scholar
Dirk Everdina Maria Geurts
View author publications
You can also search for this author in PubMed Google Scholar
Roshan Cools
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lieneke Katharina Janssen.

Additional information

Lieneke Katharina Janssen and Guillaume Sescousse contributed equally to this work.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1

(DOCX 42 kb)

Fig. S1

Mean number of reversals per group (controls, gamblers), drug (placebo, sulpiride), and outcome (unexpected reward, unexpected punishment). There were no significant differences in the number of reversals between groups, drugs, or outcomes. (GIF 107 kb)

High-resolution image (TIFF 1379 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Janssen, L.K., Sescousse, G., Hashemi, M.M. et al. Abnormal modulation of reward versus punishment learning by a dopamine D2-receptor antagonist in pathological gamblers. Psychopharmacology 232, 3345–3353 (2015). https://doi.org/10.1007/s00213-015-3986-y

Download citation

Received: 22 April 2015
Accepted: 01 June 2015
Published: 20 June 2015
Issue Date: September 2015
DOI: https://doi.org/10.1007/s00213-015-3986-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Abnormal modulation of reward versus punishment learning by a dopamine D2-receptor antagonist in pathological gamblers