Accumulating evidence from both behavioral and neuroimaging studies consistently suggests that performance-contingent reward enhances cognitive control, as has been evidenced by the results from a variety of cognitive paradigms, including memory, attention, inhibition, and episodic memory (see Braver et al., 2014, for a recent literature review). This enhancing effect of rewards on cognitive control, sometimes referred to as “motivated cognitive control,” is associated with changes in activity in a number of brain regions, including the lateral prefrontal cortex (PFC) and the ventral striatum (see, e.g., Boehler, Schevernels, Hopf, Stoppel, & Krebs, 2014; Hollerman, Tremblay, & Schultz, 2000; Ivanov et al., 2012; Rothkirch, Schmack, Deserno, Darmohray, & Sterzer, 2014). Much of this research has focused on cue-related or transient changes in brain activation associated with the presentation of reward cues. However, motivational incentives can also increase brain activation in a sustained fashion (e.g., Engelmann, Damaraju, Padmala, & Pessoa, 2009; Jimura, Locke, & Braver, 2010), which has received less attention in the literature. As such, the purpose of this study was to examine both the sustained and transient effects of rewards (i.e., monetary incentives) on cognitive control, with a particular focus on the relative roles of the dorsolateral prefrontal cortex (DLPFC) and striatum. In addition, we investigated whether individual differences in a reward-related trait (i.e., anhedonia) would predict behavior or specific neural aspects of motivated cognitive control (i.e., sustained vs. transient effects).

Transient/cue-related brain activity associated with motivational incentives

The majority of the prior work on motivated cognitive control has focused on examining changes in behavioral performance and transient brain activation in response to external cues associated with reward value. A rich body of neuroimaging research has identified both cortical and subcortical brain regions (i.e., the lateral PFC, striatum) that are responsive to representing, predicting, and updating reward value in motivationally salient contexts during cognitive performance (e.g., Boehler et al., 2011; Breiter & Rosen, 1999; Dixon & Christoff, 2012, 2014; Kouneiher, Charron, & Koechlin, 2009; Krebs, Boehler, Roberts, Song, & Woldorff, 2012; O’Doherty, 2004; Stoppel et al., 2011; Vassena et al., 2014; see Dixon & Christoff, 2014, for a review). Single-unit studies in the lateral PFC with nonhuman primates have shown that the majority of neurons in the lateral PFC encode the representation of value-related information (Sakagami & Watanabe, 2007). For example, when monkeys were trained to make go or no-go responses to the physical features of cue stimuli such as colors, most neurons in the lateral PFC showed differential visual responses to rewarding cues, regardless of the physical features of the cue stimulus (Watanabe & Sakagami, 2007). In a similar vein, human neuroimaging and neurophysiological work has provided strong evidence that reward-predicting cues exert enhancing effects on cognitive control functions, thought to be supported by regions in the lateral PFC in a variety of cognitive control paradigms such as conflict processing (Krebs, Boehler, Appelbaum, & Woldorff, 2013; Krebs, Boehler, & Woldorff, 2010), working memory (Jimura et al., 2010; Pochon et al., 2002; Taylor et al., 2004), and context processing (Chiew & Braver, 2013; Locke & Braver, 2008). Many of these regions show transient increases in activations in response to cue information, indicating the potential for reward to be associated with upcoming performance.

These reward-predicting cues may be particularly relevant for modulating control function when there are competing stimulus dimensions, which often results in a high demand on cognitive control (e.g., Aupperle, Melrose, Francisco, Paulus, & Stein, 2015; Padmala & Pessoa, 2011). Behavioral and neuroimaging data in humans has shown that incentive cues can enhance cognitive control by decreasing conflict processing on reward trials (Krebs et al., 2010). For example, in neuroimaging work by Padmala and Pessoa (2011), healthy individuals performed a response conflict processing task, with reward trials cued by “$20” and no-reward trials by “$00.” Participants were instructed that correct and fast performance on “$20” trials would be rewarded. Consistent with prior studies showing enhancing effects of rewards on cognitive control (Engelmann, Damaraju, Padmala, & Pessoa, 2009; Small et al., 2005), these researchers found that cue-related neural responses on reward trials were increased in several fronto-parietal regions, as well as in the ventral striatum and caudate. Also, these researchers found that conflict-related responses in the medial PFC on reward trials were reduced as compared to on no-reward trials.

Sustained brain activations associated with motivational incentives

Transient brain responses to cues indicating the potential for reward may not be the only mechanism that links incentive information to enhanced cognitive control. It is possible that information about potential rewards may change cognitive processing and brain activity during the entire task in a more sustained fashion. A more recent line of research has examined state-dependent reward context effects on cognitive control, providing evidence for the presence of motivation-related “state” effects on cognitive function, as evidenced by increased sustained activations across blocks of trials with incentive information (Engelmann et al., 2009; Jimura et al., 2010; Locke & Braver, 2008). For example, in a study by Engelmann et al. (2009) using a Posner-type task in which cues indicated the location of the face target stimulus, motivation was manipulated in a blocked fashion by varying the valence (e.g., winning, avoiding loss) and magnitude of the rewards associated with task performance (e.g., winning $1 or $4, or avoid losing $2.5 or $0). They found that in several regions in a fronto-parietal attentional network (i.e., the posterior intraparietal sulcus, middle frontal gyrus, the caudate, and putamen), cue-related responses were modulated by the incentive values. Importantly, they also found that several regions thought to be involved in the control of attention (e.g., the intraparietal sulcus and middle frontal gyrus) showed increased sustained activations across the course of blocks with greater incentive values. These results suggest that the enhancing effects of rewards on cognitive control can be evidenced in at least two ways: by (1) increases in cue-related responses, and (2) increases in sustained responses.

In a similar vein, Locke and Braver (2008) had healthy adults perform a goal maintenance task under baseline, reward, and penalty conditions during scanning. These researchers found an increase in sustained activations during reward blocks in a network of cognitive control regions including the right lateral PFC (i.e., DLPFC and ventrolateral PFC) and the parietal cortex, as well as an improvement in task performance on trials in the reward blocks. Importantly, work by Jimura et al. (2010) extended the prior findings by showing that during reward blocks, better performance was seen even on trials on which people could not earn rewards. Specifically, when healthy individuals completed a working memory task under no-reward versus reward contexts, they showed faster performance on trials for which they could earn reward than on trials in a baseline condition (e.g., Locke & Braver, 2008; Padmala & Pessoa, 2011). Interestingly, their performance was better even on trials in the reward condition for which they could not earn reward, as compared to the baseline condition, an effect Locke and Braver referred to as a reward context effect. Importantly, this behavioral incentive context effect was associated with an increase in the right DLPFC (BA 9/46) activation that was sustained across both reward and nonreward trials in the task blocks (Jimura et al., 2010).

Incentives and the dual-mechanisms-of-control framework

To explain these findings, Jimura et al. (2010) invoked a theoretical framework of cognitive control developed by Braver and his colleagues (Braver, 2012; Braver, Gray, & Burgess, 2007), referred to as the dual-mechanisms-of-control theory. This theory postulates that cognitive control can be supported by at least two complementary mechanisms. The first is proactive control, which is the engagement of control mechanisms, including support for task-relevant goals and tasks sets, prior to and in anticipation of the need to implement them. The second is reactive control, which is the triggering of control mechanisms, such as the retrieval of task sets or goals, when conflict or difficulty in processing is encountered. This theory suggests that both the behavioral improvements seen on neutral trials in reward contexts and the increase in sustained activation in the DLPFC might reflect a role of the right lateralized DLPFC in the preparatory, proactive control used to integrate reward-related information with cognitive goal representations. That is, even the knowledge that it is possible to gain rewards may enhance cognitive control, even when the current trial does not contain a specific incentive cue. This may occur by facilitating the representation of task-relevant information (Niv, 2007; reviewed in Braver et al., 2014), which may be reflected in enhanced sustained activity, as well as by increased cue-related responses, in regions such as the DLPFC. Interestingly, Jimura et al. (2010) also found that such increases in cue-related responses (putatively reflecting enhanced proactive control) were accompanied by reduced probe-related responses in the same DLPFC region, potentially reflecting a decreased need for reactive control.

Individual-difference factors relating to motivated cognitive control

Importantly, providing rewards or incentives does not lead to changes in behavioral performance and brain activity in all individuals. A growing body of research suggests that individual differences in reward-related sensitivity may modulate behavioral and neural responses to either primary (e.g., food) or secondary (i.e., monetary incentives) rewards (e.g., Beaver et al., 2006; Cooper, Duke, Pickering, & Smillie, 2014; Jimura et al., 2010; Locke & Braver, 2008). Specifically, several studies have reported associations between individual differences in reward-related personality traits and reward-related neural activations (e.g., Beaver et al., 2006; Cohen, Young, Baek, Kessler, & Ranganath, 2005; Cooper et al., 2014; Jimura et al., 2010; Locke & Braver, 2008). For example, Beaver et al. (2006) found that individual differences in Behavioral Inhibition and Behavioral Approach Systems (BIS/BAS) drive scores (i.e., items asking about the pursuit of goals) were significantly associated with neural responses to appetizing relative to bland foods in the ventral striatum and orbitofrontal cortex. Cohen et al. (2005) found that when participants received immediate monetary rewards during a gambling task, people higher in extraversion showed a greater magnitude of neural response related to reward receipt versus no reward in the right medial orbitofrontal cortex, amygdala, and right nucleus accumbens. More recent work by Jimura et al. (2010) also showed positive associations between sustained DLPFC activations during reward context and reward sensitivity from the BAS. Thus, personality traits related to reward drive and sensitivity may be factors for understanding individual differences in incentive effects on cognitive control.

However, another less explored individual-difference factor related to reward processing is anhedonia. Anhedonia is defined as a reduction in the ability to experience pleasure. Experiencing rewards as positive or pleasurable maybe a critical factor that induces approach behavior toward goals and positive emotional states (e.g., as is reviewed in Gorwood, 2008). Even nonclinical populations show individual differences in anhedonia (see, e.g., Franken, Rassin, & Muris, 2007; Harvey, Pruessner, Czechowska, & Lepage, 2007). Individuals who experience rewards as being less pleasurable may be less “motivated” to modulate their behavior in order to enhance the likelihood of achieving such rewards. As such, individuals who self-report higher levels of anhedonia may show less improvement in cognitive control as a function of reward, and potentially less modulation for incentive-related brain activity, though it is less clear whether anhedonia may influence transient or sustained modulation of brain activity, or both.

Hypotheses of the present study

The purpose of this study was first to replicate prior work examining the neural mechanisms that mediate an enhancing effect of rewards on cognitive control, by examining sustained as well as incentive cue-related effects on cognitive control using a mixed state-item fMRI design. We focused on the DLPFC and the striatum, given prior research suggesting their involvement in mediating the influence of rewards on cognitive control. We modified a response conflict processing task originally developed by Padmala and Pessoa (2011) to fit in a mixed state-item design. The state-item fMRI design enabled us to examine sustained context-dependent effects and transient reward-related cue effects in the same study. Participants first performed baseline conditions without knowledge of the potential for incentives in future blocks. Participants then performed additional reward blocks on which they were told that they could win money on some trials (rewarded trials) by performing quickly and accurately. This variant of the paradigm enabled examinations of (1) reward context effects, by comparing performance and brain activity during the baseline versus the reward context, and (2) reward cue effects, by comparing performance during reward versus no-reward trials within reward blocks, as well as trials during the baseline blocks. We predicted that motivational states induced by reward contexts would produce greater sustained activity in the DLPFC than would those in nonincentive baseline blocks. We also predicted that incentive cues would generate increased transient neural activity in both reward-related cortical and subcortical regions.

Second, we wanted to test the hypotheses that individual differences in self-reported anhedonia would moderate the influence of rewards on cognitive control, with higher levels of self-reported anhedonia being associated with less of an improvement in performance as a function of reward, as well as with less of an increase in the activity of the DLPFC and/or striatum in response to reward information, in terms of either sustained or transient activation (or both). To test these hypotheses, the present study included two measures of trait anhedonia: the Snaith–Hamilton Pleasure Scale (Snaith et al., 1995) and the Social and Physical Anhedonia Scales (Chapman & Chapman, 1978; Eckblad, Chapman, Chapman, & Mishlove, 1982).

Method

Participants

A total of 28 healthy adults participated in the present study. One participant was not included in final analyses because the person failed to pass the fMRI quality control (described below). Thus, 27 healthy adults were included in final analyses (see Table 1 for the participant characteristics). Originally, we had planned to include 30 participants, to achieve 80% power to detect a medium effect size (Cohen’s d = 0.5), in order to replicate previous findings of changes in state-related activity between task blocks (d = 0.5 in a paired t test) and in cue-related activity (effect size f = 0.15; repeated measures analysis of variance [ANOVA] with one within-subjects factor, Trial Type). Due to practical constraints, we were able to include a sample size of 27, which provided 75% power to detect these effect sizes. Also, this sample size provided at least 68%–86% power to detect correlation coefficients ranging from .4 to .5 or greater.

Table 1 Participants characteristics and individual difference measures

All participants had no personal or family history of psychiatric or neurological disease. All participants were recruited through the Conte Center for the Neuroscience of Mental Disorders at Washington University in St. Louis and provided written informed consent. The study protocol was approved by the Washington University Human Research Protection Office. Participants received a maximum of $20 reward money that depended on their correct and fast behavioral performance, in addition to money for completing the experiment ($25/h).

Procedure

The study structure consisted of two sessions, one behavioral and one neuroimaging. In the behavioral session, participants completed several individual-difference questionnaires, described below, and a demographic form. During the neuroimaging session, they performed a modified response conflict processing task inside the scanner. They had two practice sessions, one before baseline blocks and one before the reward blocks, inside the scanner, to make sure that they were familiar with the task. At the end of the neuroimaging session, participants completed a postscan questionnaire to measure self-reported motivation and task difficulty during the task.

Stimuli and task paradigm

A mix of images-plus-words was used for the stimuli. The images were of either a house or a building, and each image was overlaid with a word to create congruent, incongruent, and neutral trials. For example, a congruent trial was one in which an image was presented with a matching word (e.g., a house picture with “HOUSE” below it, or a building picture with “BLDNG”). However, an incongruent trial was one in which an image was presented with a conflicting word (e.g., a house picture with “BLDNG” or a building picture with “HOUSE”). Neutral trials were ones in which an image was presented with “XXXXX.”

The task was a variant of a previously validated reward task paradigm (Padmala & Pessoa, 2011) that has revealed transient cue-related enhancing effects of monetary incentives on conflict processing in healthy individuals. We modified this task to use a mixed state-item fMRI design, allowing us to examine both the sustained context-dependent effects of reward and the transient cue-related effects of rewards. We used the recommendations for mixed state-item fMRI design from Petersen and Dubis (2012; see Fig. 1a). The fMRI task paradigm consisted of two baseline and four reward runs, with 54 trials per each trial type (i.e., baseline context, reward cue, and reward context), resulting in a total of 162 trials. During the task, participants were instructed to focus on the image and ignore the overlaid letters, and to categorize each picture by pressing a “1” for a house image or “2” for a building image.

Fig. 1
figure 1

Mixed state-item fMRI design of our response conflict task

As is presented in Fig. 1c, the baseline blocks consisted of two runs in which three different types of trials (i.e., congruent, incongruent, and neutral) were intermixed, with 18 trials per trial type (total of 54 trials). Each task block started with a start cue, “TASK,” and ended with an end cue, “DONE,” each presented for 2 s. After each start cue was a jittered period ranging from 0 to 4 s. During the baseline blocks, each trial started with an “XX” cue for 1 s, with prior instruction to the participants indicating that these cues were not relevant to the task. Then a jittered fixation period occurred, ranging from 2 to 6 s before the onset of the stimuli, to allow for estimates of event-related responses to the cues. The target stimulus was next presented for 1 s, which was followed by a delay of 0.5 s, during which time the participants responded. Participants were then provided with visual feedback indicating whether their performance was correct or incorrect, for 1 s. Finally, there was an intertrial interval (ITI) that varied between 2, 4, and 6 s.

After the two baseline runs, participants performed four additional runs, for which they were instructed that they could win money on some trials for their correct and fast responses, as is presented in Fig. 1d. The reaction time (RT) threshold to determine “fast” responses was set individually on the basis of the median correct RT for the second baseline run. During reward runs, half of the trials were preceded by a “$20” cue (reward cue: RC), indicating that a fast and correct response would be rewarded by 2,000 points, or by an “XX” cue (reward context: RCXT), indicating that zero points would be possible on the trial. A total of 108 trials were presented, with approximately equal numbers of congruent, incongruent, and neutral trials. After the target stimulus, participants received feedback regarding the reward points they had earned on that trial, as well as their cumulative earning in points. The accumulated points were converted into real money at the end of the experiment (with a maximum of $20).

fMRI data acquisition and processing

Imaging data were acquired on a 3-T Siemens TM TRIO system with a 12-channel head coil. Both structural and functional images were acquired every scanning session. High-resolution MPRAGE T1 images (echo time [TE] = 2.98 ms, repetition time [TR] = 2,300 ms, 160 slices, 1.0 × 1.0 × 1.2 mm voxels) and T2 images (TE = 84 ms, TR = 7,000 ms, 33 slices, 2.0 × 1.0 × 4.0 mm voxels) were acquired, to be registered and transformed to a standardized atlas space (Koch et al., 2010; Talairach & Tournoux, 1988) using a 12-dimensional affine transformation (Gradin et al., 2011; Woods, Cherry, & Mazziotta, 1992). Functional images were collected in six runs of 214 frames each, using an asymmetric spin-echo echo-planar sequence (TR = 2,000 ms, TE = 27 ms, field of view = 256 mm, flip = 90°, 33 slices). The functional images were acquired parallel to the anterior–posterior commissure plane with 4-mm3 isotopic voxels. Visual stimuli were projected using the E-Prime software, running on a Dell Inspiron laptop. Each stimulus was projected to participants with an LCD projector onto a screen located behind the scanner. A fiber-optic, light-sensitive keypress interfaced with the E-Prime button box was used to record participants’ behavioral performance inside the scanner.

All imaging data were preprocessed and analyzed using in-house Washington University software (the FIDL analysis package: www.nil.wustl.edu/~fidl/). The first four images of each run were discarded to allow for signal stabilization. The functional-imaging data preprocessing included (1) correction for slice-dependent time shifts; (2) removal of the first four images of each run; (3) elimination of odd/even slice intensity differences due to interpolated acquisition; (4) realignment of the data to compensate for rigid-body motion; (5) normalization of image intensity to a whole-brain mode value of 1,000; (6) registration of the 3-D structural volume (T1) to the atlas template in the Talairach coordinate system, using a 12-parameter affine transform and resampling to a 1-mm cubic representation (Buckner et al., 2004; Ojemann et al., 1997); (7) coregistration of the 3-D fMRI volume to the T2, and the T2 to the structural image; (8) transformation of the fMRI data to a 3 × 3 × 3 mm voxel atlas space, using a single affine 12-parameter transform; and (9) spatial smoothing using a 6-mm full-width at half-maximum Gaussian filter. We assessed head movement during scanning by using the output of the rigid-body rotation and translation algorithm. The translations and rotations in the x, y, and z planes across frames and the total root-mean-square (RMS) linear and angular precision measures were calculated for each run. If the standard deviation of the RMS movement exceeded 20, the BOLD runs were not included in the analysis [RMS/frame, as mean (SD): 27 participants = 0.15 (0.07)].

Self-reports of anhedonia

Participants completed the Snaith–Hamilton Pleasure Scale (SHPS; Snaith et al., 1995), which consists of 14 items that assess hedonic tone with a 4-point Likert scale (1 = definitely disagree, 4 = definitely agree). Thus, low scores represent the absence of hedonic tone (i.e., anhedonia). The SHPS has demonstrated high internal consistency, good test–retest reliability, and good convergent validity and discriminant validity (Franken et al., 2007). Participants also completed the Revised Social Anhedonia Scale (Eckblad et al., 1982) and the Revised Physical Anhedonia Scale (Chapman & Chapman, 1978). These scales measure the self-reported ability to experience pleasure from either social or physical stimuli using true/false questions. High scores represent high levels of anhedonia. Both the Social and Physical Anhedonia Scales have shown good internal consistency and test–retest reliability (Chapman, Chapman, & Kwapil, 1995).

Data analysis

Behavioral data

Repeated measures ANOVAs were conducted on the median RTs for correct trials and accuracy, with the within-subjects factors Reward (BCXT, RC, RCXT) and Trial Type (congruent, incongruent, and neutral). Post-hoc paired t tests followed, used to determine the source of significant interactions. A behavioral index of reward context effects was estimated by subtracting the RT on RCXT trials, cued by “XX,” from that on BCXT trials, with the same cue, “XX.” An index of reward cue effects was computed by subtracting the RT on RC trials, cued by “$20,” from that on RCXT trials, cued by “XX” within the same reward conditions. These behavioral indices of reward context and cue effects were used in a Pearson correlation analysis to examine the associations between individual differences in anhedonia trait and behavioral enhancements of rewards on cognitive tasks.

fMRI data

We used the recommendations for mixed state-item fMRI design from Petersen and Dubis (2012; see Fig. 1a). According to these recommendations, we simultaneously coded both sustained and transient cues, and target-related brain activity within the same general linear model (GLM). Specifically, in terms of modeling transient events, we used unassumed hemodynamic responses with eight time points, considering evidence that any deviation from the assumed shape using canonical waveforms can be misattributed to the sustained activation component (Miezin, Maccotta, Ollinger, Petersen, & Buckner, 2000; Visscher et al., 2003). The unassumed GLMs enabled us to obtain one parameter for every time point following stimulation in each event type modeled (Manoach, Greve, Lindgren, & Dale, 2003; Ollinger, Corbetta, & Shulman, 2001a; Ollinger, Shulman, & Corbetta, 2001b). Thus, we could test differences between the event types of interest by focusing on regions showing interactions with time point, due to our use of unassumed GLMs. As is reviewed by Petersen and Dubis (2012), the mixed state-item design enables one to dissociate sustained and transient effects with the assumption that event-related trial-by-trial effects should decay back to baseline during the ITI, whereas the state effects should remain sustained during the entire task block. Thus, the sustained effects (i.e., baseline and reward blocks) were modeled by boxcar functions lasting the length of the task block, using an assumption of a fixed-shape response of long duration (Fischl et al., 2002). Event-related cue and target effects for each trial type were analyzed separately by estimating the values of eight time point regressors (starting at trial onset) within the hemodynamic response epoch, leading to a 16-s estimate (TR 2 s, eight scanning frames) using unassumed hemodynamic response shapes. Thus, the model included two regressors for the sustained effects (baseline and reward context), three regressors for the cue types (BCXT “XX,” RC “$20,” and RCXT “XX”), and two regressors for the start and end cues, respectively, during the cue phase. Considering prior research suggesting that the effect of rewards on cognitive control can be reflected in changes to the cues, either reward or no reward in reward contexts (e.g., Jimura et al., 2010), and not only in sustained activity, we used the same cue, “XX,” in the baseline and reward contexts and modeled the regressor for BCXT in order to test this possibility. During the target phase, nine regressors for the combination of each cue type with each trial type (congruent, incongruent, and neutral) were included.

We used voxel-wise paired t tests to analyze the sustained estimates, with Reward (baseline, reward) as the within-subjects factor. For cue-related activity, we conducted a voxel-wise repeated measures ANOVA with Reward (BCXT, RCXT, RC) and Time Point (the eight time frame estimates for the hemodynamic response) as within-subjects factors. We focused on regions showing interactions with time point, due to our use of unassumed GLMs. Post-hoc ANOVAs and t tests were performed within all significant regions identified by the ANOVAs, as we described above. For these post-hoc analyses, we extracted the mean percent signal change across each region for each time point out of the eight estimated time points, to visualize the pattern of activity. For the statistical analyses, we focused on Time Point 4, because this time point encompassed 7–8 s after stimulus onset, which corresponded to the initial peak in a stereotyped hemodynamic responses, unconfounded by sustained activity. This was done for each applicable cue type and trial type effect. We then conducted post-hoc paired t tests to compare the three trial types, to parse the significant cue-related and condition-related effects.

For target-/receipt-related activation, we conducted two separate repeated measures ANOVAs. The first ANOVA ignored trial type (i.e., congruent, neutral, incongruent) and compared activation in the target phase across trials that could or could not earn reward in either the baseline or the reward blocks (i.e., Reward: BCXT, RCXT, or RC). Post-hoc paired t tests at Time Point 4 (i.e., BCXT–RC, RC–RCXT, BCXT–RCXT) were conducted to follow up on any significant effects. To attempt to replicate previous findings from prior work by Padmala and Pessoa (2011), another repeated measures ANOVA also included trial type (i.e., congruent, neutral, incongruent) and compared activation in the target phase across trials with activation in the reward context when participants could or could not earn reward, with Reward Context (i.e., BCXT, RC), Trial Type (i.e., congruent, neutral, incongruent trials), and Time Point as within-subjects factors.

DLPFC and basal ganglia (BG) a priori masks (see Supplemental Fig. 1)

We constrained our analyses to a priori masks within the DLPFC and BG, given the involvement of these regions in reward processing (see, e.g., Jimura et al., 2010; Locke & Braver, 2008). We used anatomically defined masks of voxels within the DLPFC (Rajkowska & Goldman-Rakic, 1995) and BG (Wang et al., 2008). Voxel-by-voxel neuroimaging analyses were conducted within these masks. All statistical activation maps from these masks were corrected for multiple comparisons using combined p-value and cluster thresholds, determined by the AlphaSim program in the AFNI software package. The DLPFC mask (Brodmann’s areas 9 and 46) included both left and right middle and superior frontal gyri according to anatomical landmarks (Rajkowska & Goldman-Rakic, 1995). For the DLPFC mask, we used as thresholds a z value of 2.05 and 13 voxels. The BG mask was based on Wang et al. (2008) and was generated by combining the caudate, nucleus accumbens, putamen, and globus pallidus together. We applied a z value of 2.05 and a 14-voxel threshold for the BG mask. Then we used information about the centroid of activation in these identified subregions to label them as a specific area. For any follow-up analyses to identify the sources of significant effects and further correlations analyses, as described above, we extracted the average of the BOLD response values of the voxels within the identified subregions within the mask regions and imported them into SPSS.

Individual difference analyses

We examined whether individual differences in anhedonia predicted behavior and/or the degree of brain activations as a function of rewards in the regions of interest (ROIs) that showed sustained reward and transient cue effects within each DLPFC and BG mask. We examined whether either of the following individual-difference measures predicted behavior, the magnitude of sustained brain activation during reward versus baseline contexts, or transient activation during RC versus RCXT trials: (1) total SHPS scores and (2) a composite score of the Chapman social and physical anhedonia scores (z-scored and then combined into one composite score). The brain–personality trait correlations were corrected by using the same small-volume procedures described above.

Results

Behavioral data

Reaction time (Fig. 2a)

Fig. 2
figure 2

Behavioral data: Reaction times and error data

The repeated measures ANOVA indicated significant main effects of reward [F(2, 52) = 18.20, p < .001, η p 2 = .41] and trial type [F(2, 52) = 21.00, p < .001, η p 2 = .45], but no significant Reward × Trial Type interaction [F(4, 104) = 1.36, p = .25, η p 2 = .05]. The main effect of reward (BCXT, RCXT, RC) reflected faster performance on RCXT than on BCXT trials [F(1, 26) = 4.09, p = .05, η p 2 = .14], as well as faster performance on RC than on RCXT trials [F(1, 26) = 26.55, p < .001, η p 2 = .50]. The main effect of trial type indicated slower responses on incongruent than on congruent trials [F(1, 26) = 28.92, p < .001, η p 2 = .53] and neutral trials [F(1, 26) = 12.75, p < .002, η p 2 = .33], and slower RTs on neutral than on congruent trials [F(1, 26) = 17.41, p < .001, η p 2 = .40].

Figure 2a seemingly shows that the magnitude of interference effects might be greater in the RC condition, even though the overall RTs were faster there. We conducted a paired t test comparing the interference effect (i.e., incongruent – neutral) on the RC as compared to the BCXT blocks. The interference effects were not significantly different during RC (mean = 38.64, SEM = 12.64) than during BCXT (mean = 17.79, SEM = 8.96) blocks [t(26) = 1.58, p = .13].

Accuracy (see Fig. 2b)

The analogous ANOVA on the error data indicated a significant main effect of trial type [F(2, 52) = 13.91, p < .001, η p 2 = .35], reflecting more errors on incongruent than on congruent trials [F(1, 26) = 16.36, p < .001, η p 2 = .38]. We observed no significant main effect of reward [F(2, 52) = 1.20, p = .31, η p 2 = .04] and no significant interaction of reward and trial type [F(4, 104) = 0.83, p = .51, η p 2 = .03]. Given the lack of a significant main effect of reward on the error data, further analyses were focused on the RT data.

Behavioral measures

Neither the behavioral indices of reward context nor those of the reward cues were significantly correlated with individual differences in the anhedonia measures.

Neuroimaging results

Sustained effects of reward

As is presented in Table 2 and Fig. 3, the analyses revealed a significant main effect of reward in bilateral DLPFC regions as well as in the dorsal striatum, with greater activity during reward as compared to baseline blocks.

Table 2 Sustained effects of reward
Fig. 3
figure 3

Regions displaying sustained effects of reward

Relationship between sustained effects of reward and anhedonia

Individual differences in responses to the Chapman scales and SHPS were not significantly associated with the magnitude of sustained activation in the comparison of reward to baseline in the ROIs described above that showed significant sustained activation (all ps > .10).

Cue-related effects of reward

Regions in the bilateral DLPFC, lateral globus pallidus, and caudate body displayed significant interactions between reward (RC, RCXT, BCXT trials) and time point (see Table 3). Post-hoc paired t tests at Time Point 4 (i.e., RC-RCXT, RC-BCXT, RCXT-BCXT) were conducted to determine the source of the effect of reward. Consistent with prior work showing the involvement of the DLPFC in reward-related effects (e.g., Jimura et al., 2010; Pochon et al., 2002; Watanabe, 2007), bilateral DLPFC showed increased cue-related activation on RC relative to both RCXT and BCXT trials. Similarly, several regions in the left lateral globus pallidus and left caudate body and head also showed increased cue-related activity during RC as compared to RCXT and BCXT trials (right lateral globus pallidus) (see Fig. 4 for example time courses). However, we did not see any regions in DLPFC or striatum that showed greater cue-related activation on RCXT than on BCXT trials (all ps > .05).

Table 3 Transient cue-driven effects
Fig. 4
figure 4

Example time courses from regions of interest displaying significant interactions of reward and time point. Note: L Left, R Right, TP Time Point

Relationship between cue-related effects of reward and anhedonia

Individuals reporting greater anhedonia on the Chapman scales showed less cue-related activation as a function of reward-predicting cues in the lateral globus pallidus (r = –.54, p = .003; see Fig. 5A for a scatterplot), an effect that passed Bonferroni correction (.05/6). We saw a similar relationship in this region to self-report of hedonic tone on the SHPS, with greater hedonic tone being associated with greater cue-related activation as a function of reward-predicting cues in the lateral globus pallidus (r = .45, p = .02; see Fig.5B), though this correlation did not pass Bonferroni correction.

Fig. 5
figure 5

Significant association between transient components of motivated cognitive control and hedonic tone

Effects of reward context on target-related activation

Regions in the DLPFC and the BG that displayed a significant reward context effect showed two patterns of results (see Table 4). In the top panel of Table 4, regions in the DLPFC and the BG show reduced target-related activation on RC relative to RCXT or BCXT trials. For example, in the medial portion of the right DLPFC (BA 9: x = 26, y = 37, z = 29), target-related activation on RC and RCXT trials was significantly reduced relative to BCXT trials. Interestingly, in several subcortical regions, such as the lateral globus pallidus, target-related activity at Time Point 4 did not differ among reward-related types. However, the source of the significance effect was observed at Time Point 7, with a greater degree of deactivation on RC than on BCXT and RCXT trials (see Fig. 6 for time courses from the DLPFC and BG regions displaying Reward Context × Time Point effects in the target phase). In the bottom panel of the same table, a different set of larger, more lateral, and more posterior regions in the DLPFC showed greater activation on RC relative to BCXT trials (see Supplemental Fig. 2 for the time courses of each region in the DLPFC).

Table 4 Effects from Reward Context (BCXT, RCXT, RC) × Time Point ANOVAs in the target phase
Fig. 6
figure 6

Example time courses from regions of interest displaying Reward Context × Time Point interactions in the target phase

Effects of trial type on target-related activation

Three regions showed interactions between trial type (incongruent, neutral, congruent) and time point: one left DLPFC region, one left putamen region, and one left caudate region. According to post-hoc paired t tests at Time Point 4, these regions displayed greater activation in the target phase on incongruent than on either congruent or neutral trials (see Table 5 for the exact coordinates of each region, and Fig. 7 for example time courses for each region).

Table 5 Effects from Trial Type × Time Point ANOVAs in the target phase
Fig. 7
figure 7

Time courses from regions of interest displaying Trial Type × Time Point interactions in the target phase

Interactions of reward context and trial type

We were also interested in whether reward context interacted with trial type (incongruent, neutral, congruent) as an indicator of whether reward context modulated conflict-related brain activity in DLPFC or striatum. No region in the DLPFC or BG mask regions displayed significant Reward Context × Trial Type interaction.

Discussion

The goals of the present study were (1) to replicate previous findings regarding sustained context-dependent and transient cue-driven effects of reward on cognitive control in healthy adults, especially by focusing on the involvement of the DLPFC and the striatum; and (2) to extend this line of research by examining the influence of individual differences in anhedonia. We found sustained increases in brain activations in bilateral DLPFC and in the putamen and caudate as a function of reward context. Furthermore, consistent with prior work, several regions in the bilateral DLPFC and other subcortical regions, including the lateral globus pallidus and the caudate, showed transient increases of brain activations as a function of reward cues. Importantly, individual differences in anhedonia predicted transient neural responses to reward-predicting cues in the lateral globus pallidus. Each of these findings is discussed in more detail below.

Behavioral reward context and cue effects

The accuracy data showed a main effect of trial type, reflecting more errors on incongruent than on congruent and neutral trials. However, we did not find main effect of reward or a Reward × Trial Type interaction in the accuracy data, meaning that we did find that incentives modulated accuracy. However, consistent with previous studies showing enhancement of rewards for cognitive function (Chiew & Braver, 2013; Gilbert & Fiez, 2004; Hughes, Mathan, & Yeung, 2013; Locke & Braver, 2008; Strang & Pollak, 2014; Vuillier, Whitebread, & Szucs, 2015), we did find a main effect of reward in the RT data, with faster responses on reward cue trials than on no-reward cue trials in the reward block (incentive cue effect), as well as faster responses on no-reward cue trials in the reward block than in the baseline block (incentive context effect). These enhancing effects of reward on RT suggest that knowledge of potential rewards enhanced participants’ behavioral performance, potentially through facilitated representations of reward value within the reward contexts. However, none of these main effects interacted further with trial type.

These results are somewhat in contrast with those of Padmala and Pessoa (2011), from whom we modified the same task paradigm for use with a mixed state-item fMRI design. Padmala and Pessoa found an interaction between reward and congruency, suggesting reduced interference effects (incongruent vs. neutral) in the reward as compared to the no-reward condition. The discrepancy in the magnitudes of the Reward × Trial Type interaction effect between the present and the previous study (Padmala & Pessoa, 2011) might reflect the different populations in each study. In the present study, our participants were from the community (mean age: 35.56 ± 8.61 years old), whereas those in Padmala and Pessoa’s work were college students (mean age: 22 ± 5 years old). Although the reward values were similar in the two studies ($18 in Padmala & Pessoa’s work, $20 in the present work), it is possible that the subjective reward value of these amounts might not have generated the same level of motivation among older community populations that it did among college students (Padmala & Pessoa, 2011). Although the participants in the present study clearly sped their responses as a function of reward, it is possible that the reward value among community populations in present study was not as salient enough to actually reduce the magnitude of the conflict effect (Padmala & Pessoa, 2011). Furthermore, it is possible that the task difficulty in the present study was not sufficiently challenging to leave room for modulation of the conflict effect. Specifically, the accuracy level was somewhat higher in the present study (mean error rates ~3%) than in Padmala and Pessoa’s (mean error rates ~6%). Nonetheless, the fact that participants in the present study were able to speed their responses without sacrificing accuracy suggests that both the reward cues and the reward context were able to enhance task processing, though not to the point of reducing the magnitude of the conflict effect.

Transient cue-related effects of reward-modulated cognitive control

In the present study, activation was increased in response to incentive cues versus no-incentive cues in bilateral DLPFC and in several reward-related subcortical regions, such as the lateral globus pallidus and caudate. These results are in line with prior work (e.g., Padmala & Pessoa, 2011; Vassena et al., 2014), which also showed transient increases in a distributed network of several regions, including the lateral PFC and parietal cortex, that are thought to be engaged in cognitive control (Owen, McMillan, Laird, & Bullmore, 2005; Wager & Smith, 2003).

The DLPFC has traditionally been considered a core component of cognitive control, as we described in the introduction. Recently, accumulated evidence has suggested that the DLPFC represents reward-related value information as well as “cold” information about task goals (see Dixon & Christoff, 2014, for a recent review). Furthermore, anatomical evidence has shown that the DLPFC is strongly connected with key regions involved in value representations, such as the orbitofrontal cortex and the anterior cingulate cortex (Pandya, Van Hoesen, & Mesulam, 1981; Petrides & Pandya, 1999, 2007). Also, the DLPFC (BA 9/10) projects to the BG, including the caudate nucleus and the globus pallidus. Projections from these striatal regions terminate in the thalamus, which in turn projects back to the DLPFC and to premotor and motor cortices (Joel & Weiner, 1994, 1997). With regard to this circuit of anatomical connections, the increased DLPFC and striatal activations in response to incentive cues might reflect enhanced neural representations of reward value through a top-down regulation of activations in this circuit.

In addition to the modulation of cue-related responses as a function of incentives, we also found modulation of target-related responses. As we expected on the basis of the prior literature, we found that neural activation on the incongruent trials was greater than the congruent and neutral activations in several regions of the DLPFC, as well as in both a left putamen and a left caudate region. Interestingly, small, bilateral, relatively medial and anterior regions of the DLPFC showed decreased target-related activations in the reward trials as compared to the baseline trials, whereas more lateral portions of the DLPFC showed the opposite pattern of greater target-related activations during the RC trials. The more lateral portions of the DLPFC are those more typically associated with cognitive control. In the Jimura et al. (2010) study, activations in a similar region on the right showed reduced activations at the time of the target, as compared to that of the cue. These researchers interpreted this effect as potentially reflecting decreased use of reactive control, though activations did not differ during the target phase for reward versus nonreward trials. As such, our results might suggest that in middle-aged community controls, reward cues encourage enhanced activations in cognitive-control-related regions, even during task execution. However, we are less clear what the functional significance may be of the moderated activation in the more medial DLPFC regions, and thus further research will be needed to elucidate the implications of these activation differences.

Sustained context-dependent effects of motivated cognitive control

As we discussed in the introduction, information about incentives can modulate brain activations in both a transient cue-related fashion and a more sustained fashion. Consistent with these previous findings, our results showed that a number of regions in bilateral DLPFC as well as in bilateral putamen showed greater sustained activations during reward blocks than during the nonreward, baseline blocks. According to the dual-mechanisms-of-control framework, proposed by Braver and his colleagues (Braver, 2012; Braver, Paxton, Locke, & Barch, 2009), cognitive control is thought to involve two types of control—preparatory, proactive control and reactive control. In prior work, Braver and colleagues argued that incentive cues promote a shift to proactive control, thought to be reflected in both enhanced cue-related activation as well as sustained neural activation in the DLPFC throughout the entire task context. This kind of sustained activation may reflect the active maintenance of task-relevant context information supported by the DLPFC. On the other hand, reactive control is thought to be represented as transient activations supported by a wider brain network, including the DLPFC, parietal cortex, and other reward-related subcortical regions, based on cognitive demand. In line with this hypothesis, the increased neural activation supported by the DLPFC during reward contexts in the present study may have reflected actively maintained reward-related context information, potentially reflecting enhanced proactive control.

Several regions in the BG also showed greater sustained activity during reward contexts. The present results extend prior work by suggesting that regions in a fronto-parietal network can show sustained increases with regard to reward information. For example, both Jimura et al. (2010) and Locke and Braver (2008) found sustained increases in the parietal cortex as well as the lateral PFC during reward contexts, but did not find effects in the BG. However, in the present study we conducted a hypothesis-driven analysis using an a priori ROI approach, whereas the previous studies had used voxel-wise whole-brain analysis. It is possible that this ROI approach offered greater power to detect sustained effects in both prefrontal and striatal regions (Nieto-Castanon, Ghosh, Tourville, & Guenther, 2003). This sustained activity may be reflective of BG’s hypothesized role in reward-based learning and goal-directed behavior (Dasgupta, Worgotter, & Manoonpong, 2014; Schultz, Tremblay, & Hollerman, 1998). Especially, the dorsal striatum is known to receive extensive projections from the DLPFC as well as other frontal regions. Thus, increased sustained activity in the dorsal striatum during the reward context may have represented greater effort to maintain reward-related context information, which may facilitate preparatory responses throughout reward contexts.

Implications for the dual-mechanisms-of-control framework in the reward context

As we described in introduction, according to the dual-mechanisms-of-control model, cognitive control operates via proactive and reactive control modes in different timescales, along with specific task demands. Proactive control refers to sustained and anticipatory processing of control, whereas reactive control refers to stimulus-driven, transient adjustment of control. Therefore, we hypothesized that proactive control would operate during both periods transient updating processing—during the preparatory, anticipatory period prior to stimulus onset, as well as throughout the entire task condition, due to stable task maintenance. On the other hand, reactive control should operate relatively late in the trial and should be more apparent at the time of the response to the probe. In the present study, as we present in the supplementary information, several regions in bilateral DLPFC displayed both sustained activations during reward contexts and transient cue-related activations. We interpret these overlapping regions as a reflection of a proactive control mode for stable and preparatory maintenance of reward-related context information in reward contexts. Also, we found that more lateral portions of the DLPFC showed greater activation on RC than on BCXT trials during the target phase, which may reflect transient reactive control. During the target phase, reactive control may re-access task-related goal information on a trial-by-trial basis.

These results are consistent with previous work suggesting reward-dependent temporal shifts in DLPFC activations from a transient to a tonic mode during reward contexts (Jimura et al., 2010; Soutschek, Stelzel, Paschke, Walter, & Schubert, 2015; Strang & Pollak, 2014; Wilk, Ezekiel, & Morton, 2012). For example, Jimura et al. (2010) found the involvement of the DLPFC (BA46/9) during both sustained effects through the entire reward conditions, and early-trial components of transient responses at cue presentation. Interestingly, increased sustained activations in the DLPFC during reward versus no-reward contexts lead to a reduction of transient activation during the target phase, potentially suggesting a distinction between proactive mechanisms throughout the entire task block and reactive control mechanisms operating transiently during the later target phase. Consistent with these data, in other work Wilk et al. (2012) asked healthy participants to perform a size congruency task in which two digits that differed in physical and numerical magnitude were presented, and the job of this task was to select the numerically larger digit. Consistent with the present findings, Wilk et al. (2012) found transient, moment-to-moment adjustments of reactive control in bilateral DLPFC and anterior cingulate cortex during target responses, and increased sustained, stable task-set maintenance in several regions in the superior frontal gyrus. Taken together, our findings suggest that motivational incentives influence cognitive control function via sustained and anticipatory proactive control, thought to enhance reward value representations, and also via transient adjustments of reactive control, potentially by updating task-relevant goal information on a trial-by-trial basis for successful response selection. These proactive and reactive control modes appear to activate in distinct temporal modes, but at least partially overlap anatomically in the DLPFC and BG (van Belle, Vink, Durston, & Zandbelt, 2014).

Relationships to anhedonia

In addition to group effects of reward-modulated cognitive control, the present study revealed that individual variations in anhedonia were associated with transient reward cue-related activations in the lateral globus pallidus, but not with sustained context-dependent activation in the DLPFC or BG. The lateral globus pallidus is the final output site of the BG (Joel & Weiner, 1994). Studies in nonhuman monkeys have revealed that neural activations in the BG are modulated by expected rewards (Hikosaka, Nakamura, & Nakahara, 2006). For example, in Hong and Hikosaka (2008), monkeys performed a saccade task in which a visual target was presented randomly on the left or the right. One direction was associated with getting rewards, whereas the other predicted no-reward outcomes. When the researchers measured the firing activity in the globus pallidus and lateral habenula, they found reward-dependent modulations of neural activity in the globus pallidus, as evidenced by increased activity in response to the reward-predicting target, and decreased responses to the no-reward-predicting target.

In the present study, we found that people reporting greater trait anhedonia showed less neural activation as a function of reward-predicting cues, but that there was no significant association between anhedonia and sustained context-dependent activation during reward contexts. These results are consistent with the hypothesis that anhedonia, or reductions in the ability to experience pleasure, may influence the degree to which the experience of rewards (or “liking”) influences modulations of brain activation or behavior in response to the explicit presentation of reward cues, as occurred on RC trials. However, these results do not support the idea that anhedonia influences more global effects of incentives that are reflected in either the sustained aspects of brain activation or the behavioral effects of reward context. As we discussed in the introduction, without having positive experiences/expectations of rewards, it may be hard to exert the effort to pursue goal-directed behavior, potentially providing one mechanism by which individual differences in anhedonia may influence behavior.

Limitations and future directions

Although this study provides crucial insight into one potential neural mechanism underlying anhedonia, it has several limitations that may be answered in future studies. One limitation to this study is that we did not find significant reductions in conflict effects in either accuracy or RT, though we did find a speeding of RT as a function of reward, without a sacrifice in accuracy. Consistent with these behavioral findings, we did not find significant regions displaying interaction effects of reward context and trial type in the target phase, though our analyses were focused on the DLPFC and the basal ganglia. As we discussed above, we cannot exclude a possibility that the difficulty level of present task may not have been optimally adjusted to tap cognitive control among community populations. Future research varying the difficulty level of task performance is needed to understand how engagement of the DLPFC to modulate cognitive control in reward contexts is moderated by the challenge level of the tasks. In summary, the present findings show both sustained context-dependent and transient cue-driven effects of reward on cognitive control thought to be supported by the DLPFC and the basal ganglia. Importantly, self-reported anhedonia trait in healthy adults were associated with transient neural activity during reward predictions implicated in the lateral globus pallidus, but not with sustained DLPFC activity.

Relevant to the present findings of no interaction between reward and trial type in the behavioral data, there is a possibility that our smaller sample size (n = 27) than in previous work by Padmala and Pessoa (2011; n = 45), whose task paradigm we modified and used in the present study, may have resulted in a failure to replicate some aspects of their work. However, an N of 27 still provided a reasonable power to detect transient and sustained effects of interest in the present study and the correlation coefficients ranging from .4 to .5 that are used for individual-difference analyses. Importantly, when we computed effect sizes for the enhancement of rewards on RT performance, the present study yielded an effect size for RT data (d = 0.33) similar to that from Padmala and Pessoa (d = 0.37).

Another major limitation is that, due to the fixed-order presentation of the baseline and reward conditions, the sustained context-dependent effect might have reflected practice-related effects. However, our differences between reward conditions are unlikely to have been due to practice effects, given the previous empirical evidence against this possibility. For example, Chiew and Braver (2013) examined the effect of reward incentives on cognitive control, as measured by the AX-continuous performance task (AX-CPT). Like us, they utilized a mixed block/event design. In the AX-CPT, in which participants perform baseline (no incentive offered) and reward blocks, within the reward blocks, nonincentive trials are randomly intermixed with the incentive trials. In Chiew and Braver’s supplementary analysis, they broke down each block into four 50-trial epochs, and found that potential practice effects disappeared after the first epoch, whereas incentive effects remained throughout. Furthermore, the differences between our RCXT and RC trials cannot be due to practice effects, since these trials were interleaved.

The present mixed state-item design allowed us to start to dissociate sustained versus transient effects of rewards. However, additional approaches could be used in future studies to further dissociate these effects. In particular, our design did not provide an estimate of the effects of sustained rewards in the absence of transient trial-by-trial effects. Thus, an alternative future design would be to include conditions in which participants were informed that there would be a bonus at the end of a block of trials for overall improved performance, but no trial-specific reward cues. With such a manipulation, one could create a crossed design in which there either was or was not an overall block-wise manipulation of reward bonus (e.g., context effect: high vs. low sustained reward motivation) versus the presence or absence of trial-by-trial cues about the potentially of additional rewards (transient effect: reward cue vs. no-reward cue trials).

In summary, the present findings replicated previous findings showing both sustained context-dependent and transient cue-driven effect modulations of activation in DLPFC and BG regions as a function of reward, thought to reflect modulation of cognitive control. Furthermore, we extended prior research by showing that individual differences in self-reported anhedonia were associated with individual differences in cue-related functional brain activation on reward trials in the lateral globus pallidus, but not with sustained DLPFC activation. These individual-difference relationships may reflect the degree to which the hedonic experience of reward influences motivated behavior.