It has been well documented that our spatial attention is guided by the physical salience of stimuli and by task goals (Corbetta & Shulman, 2002; Du & Abrams, 2012; Du, Zhang, & Abrams, 2014). However, recent studies have shown that people tend to assign attentional priority to objects associated with rewards, even when they are task-irrelevant and non-salient (Anderson, Laurent, & Yantis, 2011; Della Libera & Chelazzi, 2006; Hickey, Chelazzi, & Theeuwes, 2010). For instance, Anderson et al. (2011) required participants to search for two pre-specified color targets (e.g., red and green), each of which was associated with either a high-value or a low-value monetary reward during training. In the subsequent test, participants were required to search for a shape singleton target without rewards, while reward-related colors were sometimes presented as distractors. Anderson et al. (2011) found that only the high-value reward-related color delayed response time to find the shape singleton target. Additionally, Anderson and Yantis (2013) found that the reward-related color captured attention when presented as an irrelevant distractor over half a year later. These findings indicate that rewards have a long-lasting effect on involuntary capture of attention. Yet it is largely unknown whether and how high-level social manipulations (e.g., social comparison) affect involuntary attentional capture.

Several studies have shed new light on this issue by demonstrating that social comparison modulates reward processing (Fliessbach et al., 2007; Takahashi et al., 2009). For example, when two participants simultaneously performed a task with monetary reward in an fMRI study, Fliessbach et al. (2007) found that the magnitude of the neural responses in the ventral striatum was strongest when participants received more than their counterpart, intermediate for equal payments, and lowest when they received less than their counterpart. Moreover, people are usually dissatisfied with disadvantageous rewards and happy with advantageous rewards (Dvash et al., 2010; Qiu et al., 2010).

Although social comparison has been shown to modulate reward processing, few studies have examined whether social comparison can modulate involuntary attentional capture. Thus social comparison is particularly of interest in the present study. We speculated that social comparison could modulate reward-driven attentional capture. Specifically, an advantageous social context might magnify the subjective value of a reward, resulting in larger reward-driven attentional capture; in contrast, a disadvantageous social context might reduce the subjective value of a reward, leading to smaller reward-driven attentional capture.

Since Anderson et al.’s (2011) study bridges the areas of involuntary capture of attention and reward processing, the present study integrates social context manipulations with the value-driven attentional capture paradigm to examine whether social comparison can modulate attentional capture driven by a color distractor that has been previously associated with rewards. As in Anderson et al.’s (2011) study, the present study included training and test phases, but we added a social context in which participants were informed of not only their own rewards but also the rewards given to an imaginary partner in the training phase (see Fig. 1a). We set up four social comparison conditions (see Table 1) to examine their effects on reward-related capture of attention.

Fig. 1
figure 1

The sequence of events in a typical trial. The graph in (a) illustrates events in the training phase and (b) illustrates events in the test phase

Table 1 Manipulations of four social comparison conditions

Method

Participants

Eighty naïve volunteers were randomly assigned to Experiments 1–4 (20 per experiment, aged 18–30 years; 39 males). There were nine participants for the control experiment (aged 20–28 years; four males), and another 20 for Experiment 5 (aged 19–26 years; 12 males). Each had normal or corrected-to-normal vision and color vision.

Stimuli and procedure

In Experiments 1–4, stimuli were presented on a 17-in CRT with an 85 Hz refresh rate at a distance of 50 cm. Each experiment contained a training phase (Fig. 1a) and a test phase (Fig. 1b).

During training, a central fixation cross was presented for a period randomly chosen from 400, 500, or 600 ms, followed by the search display which was presented until a response was made or 600 ms had elapsed. Participants had another 1,000-ms interval to make a response if they did not respond within 600 ms of presentation of the search display. The search display consisted of six circles (2.3° × 2.3°), which were arranged equally in a circular array (5° radius). Each shape contained a small, white line tilted 45° to either the left or the right; only the line within the target was horizontal or vertical. The colors of the five distractor circles were randomly chosen from blue, cyan, pink, orange, yellow, and white. The target was either a red or green circle with equal probability. The participant received feedback 1,000 ms after a response about the reward value to himself/herself and the imaginary partner for that trial, together with their cumulative reward. The feedback lasted for 2,000 ms. The feedback procedure was designed to provoke certain kinds of social comparison. The inter-trial interval was 1,000 ms. Correct responses to high-value targets were rewarded with a high-value (10 cents) on 80 % of trials and a low value (2 cents) on the remaining 20 %. For low-value targets, the percentages were reversed. High-value targets were red for half of the participants, and green for the other half.

In the test phase, the search display consisted of six different colored shapes, which remained present until a response was made or 1,500 ms had elapsed. The target was defined as a shape singleton, which could be either a circle among five diamonds or a diamond among five circles. There were three distractor conditions: (1) the high-value distractor condition (25 % of trials): one of the five non-target shapes was in the color associated with the high-value reward during training, (2) the low-value distractor condition (25 % of trials): one of the five non-target shapes was in the color associated with the low-value reward; and (3) the neutral condition (50 % of trials): all five non-target shapes were in the color not associated with reward during training. The feedback was presented after response for 1,000 ms. The inter-trial interval was 500 ms.

Both the training and test phases consisted of 480 experimental trials. Participants performed 50 practice trials before the training phase and 20 practice trials before the test phase.

To encourage participants to make social comparisons, participants were told that another participant would perform the experiment simultaneously with them in the other laboratory room. However, they were clearly informed that their reward was independent of their partner’s reward. Moreover, an independent sample of eight participants only took the training phase and showed an average correct response rate of 95 % (89–98 %).Thus the mean accuracy of the imaginary participant was set at 95 % to make the experimental set-up more realistic. At the end of the experiment, all participants asked how much their partner earned, indicating that they all believed in the existence of the imaginary participant.

Design

As shown in Table 1, four different social comparison conditions were tested in Experiments 1–4: (1) The Equivalent reward condition (Self:Other = 1:1) was tested in Experiment 1, in which participants received an equal amount of reward to their imaginary partner; (2) The Disadvantageous reward condition (Self:Other = 1:2) was tested in Experiment 2. Whenever participants received a reward, their imaginary partner received a reward that was twice as valuable as they did; (3) The Advantageous reward condition (Self:Other = 2:1) was tested in Experiment 3. Whenever participants received a reward, their imaginary partner received only half of the reward that they did; and (4) The Inconsistent Ratio reward condition was tested in Experiment 4. Whenever participants received a high-value reward, their imaginary partner received twice as much as they did (Self:Other = 1:2); but when they received a low-value reward, their imaginary partner received only half of the reward that they did (Self:Other = 2:1).

Experiment 5 was a control experiment that had the same parameters as those in Experiment 2 (disadvantageous reward) with two exceptions. First, participants were only informed of their own reward during training so that they could not compare with others. Second, participants took an unrelated dots-estimation task with a higher reward (20 cents) before they performed the training and test phases. Thus, even without social comparison, the high-value reward in training was disadvantageous compared to rewards in the dots-estimation task.

In the unrelated dots-estimation task, a number of dots (11–50) were first presented for 1,500 ms. Participants were required to judge whether the number of dots was less or more than 24 within another 1,500 ms. A 20-cent reward was given for a correct response and no reward for an incorrect response. There were ten practice trials prior to 80 experimental trials. The mean accuracy was 93 %.

Results

As in Anderson et al. (2011), we used response times (RTs) in the test phase to measure the involuntary capture of attention. Mean RTs for correct trials and error rates were submitted to a 4 × 3 ANOVA with social comparison condition (equivalent, disadvantageous, advantageous, and inconsistent ratio) as a between-subjects factor and distractor condition (high-value, low-value, and neutral) as a within-subjects factor. Across all four experiments, error rates were generally low (see Table 2). There was no significant effect on error rates (all Fs < 1.4, all ps > .25). However, the analysis of RTs revealed a significant main effect of distractor condition, F (2, 152) = 39.334, p < .001, η p 2 = .341, but the main effect of social comparison condition was not significant, F (3, 76) = 1.106, p = .352, η p 2 = .042. Most importantly, the interaction was significant, F (6, 152) = 4.234, p = .001, η p 2 = .143. Thus we performed one-way ANOVA with distractor condition as a within-subject factor for each experiment.

Table 2 Mean error rates (in percentages) and standard errors (in parentheses) as a function of distractor condition in the test phase (for experiments 1–4)

Finally, since the equivalent condition provided a baseline for reward-driven attentional capture, we also conducted a 2 (social comparison condition) × 3 (distractor condition) ANOVA to compare the reward-driven attentional capture of the equivalent condition against the other three social comparison conditions (disadvantageous, advantageous, and inconsistent ratio).

Experiment 1: equivalent reward

As shown in Fig. 2a, Mean RTs differed significantly in the three distractor conditions, F (2, 38) = 13.778, p < .001, η p 2 = .420, with RTs for the high-value distractor trials (749 ms) longer than RTs for the low-value distractor (720 ms) and neutral (709 ms) trials, both ps < .01 with Bonferroni correction. The difference between the latter two conditions was not significant, p > .40.

Fig. 2
figure 2

Mean response times (RTs) by distractor condition during the test phase in (a) Experiment 1–Equivalent condition, (b) Experiment 2–Disadvantageous condition, (c) Experiment 3–Advantageous condition, and (d) Experiment 4–Inconsistent Ratio condition. Error bars represent the 95 % confidence intervals for the respective within-subject contrasts (Loftus & Masson, 1994)

In addition, nine new participants took an additional control experiment including only the test phase. The result showed that a red distractor (789 ms), a green distractor (791 ms), or no color distractor (790 ms) produced comparable RTs, F < 1, indicating that red, green, and other colors were equally salient.

Experiment 1 replicated the primary result of Anderson et al.’s (2011) study – an irrelevant color distractor which was previously associated with a high-value reward can involuntarily capture attention, disrupting a subsequent search for a shape target.

Experiment 2: disadvantageous reward

Participants always received a disadvantageous reward compared with their imaginary partner irrespective of reward value. As illustrated in Fig. 2b, there was no significant effect of distractor condition, F (2, 38) = 1.517, p = .232, η p 2 = .074, indicating comparable RTs for the high-value distractor (772 ms), low-value distractor (769 ms), and neutral (762 ms) trials.

To investigate whether a disadvantageous reward modulates the involuntary capture of a color distractor associated with a reward, RTs of Experiments 1 and 2 (equivalent and disadvantageous rewards) were submitted to a mixed-model ANOVA with experiment as a between-subjects factor and distractor condition as a within-subject factor. The difference between experiments was not significant, F (1, 38) = 2.092, p = .156, η p 2 = .052. However, there was an effect of distractor condition, F (2, 76) = 13.236, p < .001, η p 2 = .258. Most importantly, the effect of distractor condition varied across the two experiments, F (2, 76) = 5.641, p = .005, η p 2 = .129. Specifically, the high-value distractor significantly impaired a subsequent search for a form singleton target in Experiment 1 (equivalent reward) but not in Experiment 2 (disadvantageous reward). Moreover, we also compared the capture effect for the high-value distractor of the two experiments (the capture effect was estimated by subtracting the RTs in the neutral trials from the RTs in the high-value condition). The capture effect induced by the high-value distractor (9.7 ms) in Experiment 2 was significantly smaller than that in Experiment 1 (39.8 ms), t (38) = 2.812, p < .01. These results indicate that the disadvantageous reward context significantly reduced the attentional priority of a color distractor that was previously associated with the high-value rewards.

It is also possible that the reduced attentional priority of a high-value-reward-associated distractor was due to its short life-span of attentional priority. We further analyzed the RTs of the first half of the trails (240 trials) of the test phase. However, in this case, RTs for the high-value distractor (794 ms), low-value distractor (792 ms), and neutral (783 ms) trials were still not different, F (2, 38) = 1.247, p = .299, η p 2 = .062. Therefore, a high-reward-associated distractor does not have attentional priority at the first half of the test phase.

Experiment 3: advantageous reward

Participants were always given advantageous rewards compared with their imaginary partner irrespective of reward value. As shown in Fig. 2c, RTs differed significantly in the three distractor conditions, F (2, 38) = 39.029, p < .001, η p 2 = .673. Both the high-value (752 ms) and low-value (725 ms) distractors slowed RT compared to the neutral condition (713 ms), both ps < .05. Furtermore, the high-value distractor delayed RT more than the low-value distractor, p < .001.

Comparison of RTs between Experiments 1 and 3 (equivalent and advantageous rewards) showed a main effect of distractor condition, F (2, 76) = 39.930, p < .001, η p 2 = .512. However, there was neither a difference between experiments nor an interaction, both Fs < 1. Thus the advantageous reward condition mainly replicated the finding that the high-value distractor can capture attention involuntarily.

We also carried out a comparison between Experiments 2 and 3 (disadvantageous and advantageous rewards). Again, there was a main effect of distractor condition, F (2, 76) = 22.267, p < .001, η p 2 = .369. However, there was no difference between experiments, F (1, 38) = 1.872, p = .179, η p 2 = .047. Again, there was an interaction, F (2, 76) = 8.812, p < .001, η p 2 = .188, indicating the effect of distractor condition varied significantly between the two experiments. The capture effect induced by the high-value distractor (9.7 ms) in Experiment 2 was significantly smaller than that in Experiment 3 (38.9 ms), t (38) = 3.996, p < .001. Thus, the disadvantageous condition dramatically reduced the reward-driven attentional capture that was observed in the advantageous condition.

Experiment 4: inconsistent ratio reward

The results of Experiment 2 (disadvantageous) and Experiment 3 (advantageous conditions) showed that the disadvantageous condition substantially reduced the attentional priority of a color distractor that was previously associated with the high-value rewards, while the advantageous social comparison only numerically enhanced the priority of distractors previously associated with low-value rewards. We designed Experiment 4 to further examine these findings, in which participants were disadvantaged compared to their imaginary partner for high-value rewards, but were advantageous for low-value rewards. Thus we expected reduced capture by the high-value distractors and possibly slightly enhanced capture by the low-value distractors.

As shown in Fig. 2d, there was a significant effect of distractor condition, F (2, 38) = 8.095, p < .001, η p 2 = .299. Specifically, RTs in the high-value distractor condition (731 ms) and the low-value distractor condition (731 ms) were significantly longer than those in the neutral condition (710 ms), both ps < .05. The comparison between Experiments 1 and 4 (equivalent and inconsistent ratio rewards) revealed a main effect of distractor condition, F (2, 76) = 18.739, p < .001, η p 2 = .330, and an interaction, F (2, 76) = 4.525, p = .014, η p 2 = .106. Again there was no main effect of experiment, F<1. The capture effect induced by the high-value distractor (20.9 ms) in Experiment 4 was slightly smaller than that in Experiment 1 (39.8 ms), t (38) = 2.244, p < .05, one-tailed. However, there was no significant difference between the magnitude of capture induced by the low-value distractor (21.6 ms) in Experiment 4 and that induced by the similar distractor in Experiment 1 (11 ms), p >.05.

As expected, we again found that the disadvantageous social comparison reduced attentional priority of a high-value-reward-associated distractor, but the advantageous social comparison did not significantly enhance the priority of distractors associated with low-value rewards.

Experiment 5: higher reward context

Though the disadvantageous reward consistently reduced the attentional priority of previously rewarded distractors, the modulation could be driven by reward context instead of social comparison. For example, the fact that participants observe a 20-cent reward being granted in the disadvantageous reward might desensitize participants to the size of their own rewards and make the 10-cent reward no longer a “high” reward.

To test this possibility, we conducted Experiment 5 which provided a higher reward context as the disadvantageous context, but without social comparison. We found a significant effect of distractor condition, F (2, 38) = 13.207, p < .001, η p 2 = .410, with RTs for the high-value distractor trials (772 ms) longer than RTs for the neutral trials (742 ms), p = .001. RTs for the low-value distractor (755 ms) trials were only numerically longer than the neutral trials, p = .134.

The comparison of RTs between Experiments 2 and 5 (disadvantageous and higher rewards) showed a main effect of distractor condition, F (2, 76) = 11.664, p < .001, η p 2 = .235. However, the difference between experiments was not significant, F (1, 38) = .143, p = .708, η p 2 = .004. Crucially, the effect of distractor condition varied across experiments, F (2, 76) = 3.151, p < .05, η p 2 = .077. The capture effect induced by the high-value distractor (9.7 ms) in Experiment 2 was significantly smaller than that in Experiment 5 (30 ms), t (38) = 2.273, p = .029. Thus, the reduced reward-driven capture effect in the disadvantageous social context could not be explained by reward context per se.

Discussion

The present study found that reward-driven attentional capture was dramatically reduced in the disadvantageous social context. Moreover, the absence of value-driven capture in the disadvantageous social context cannot be explained by the higher reward context which only provides comparison between two values without social comparison. Nevertheless, advantageous social comparison seems insufficient to magnify reward-driven attentional capture. The present study is the first study to demonstrate that social comparison can modulate involuntary reward-related attentional capture.

Several underlying mechanisms might contribute to this modulatory effect. First, social comparison can modulate reward processing (Fliessbach et al., 2007; Takahashi et al., 2009), which in turn might either accelerate or hinder perceptual learning of attentional bias to reward-associated stimuli (Anderson et al., 2011). The disadvantageous social context might indirectly impair perceptual learning by reducing the subjective value of rewards, thus resulting in less value-driven capture.

Alternatively, ample evidence shows that different emotional responses might be evoked by different social contexts accompanying the reward processing (Dvash et al., 2010; Takahashi et al., 2009). A disadvantageous context is likely to evoke envy and dissatisfaction, while an advantageous context will induce “schadenfreude.” As emotion also plays an important role in learning and memory (D’Mello, Lehman, Pekrun, & Graesser, 2014), these emotional responses, especially dissatisfaction evoked by social comparisons, might have impeded perceptual learning of reward-related stimuli during the training, resulting in a weaker association between rewards and stimuli.

Finally, it is worth noting that the neural networks for social comparison and selective attention partially overlap. Several functional magnetic resonance imaging studies have shown that various brain regions, such as the ventral striatum (Fliessbach et al., 2007), dorsolateral prefrontal cortex (DLPFC), and anterior cingulate cortex (ACC) (Takahashi et al., 2009) are involved in social comparison. It is well known that both the ACC and the DLPFC are also important nodes of the attentional network (Corbetta & Shulman, 2002). The partial overlap between these two neural networks might also imply that social comparison may modulate allocation of attention in training. The altered attentional priority of those color stimuli were then transferred to a new task. These possibilities require further examination in future research.

A wide variety of rewards including food, nicotine, emotionally provoking pictures, money, and drugs can substantially shape voluntary behavior. By associative learning, the recipient of a rewarding stimulus might have a desire or “wanting” response when reward-associated stimuli are present (Robinson & Berridge, 2008), resulting in involuntary capture of a rewarding stimulus. Such involuntary capture by cues associated with rewarding stimuli has been reported among smokers and substance abusers (Field, Mogg, & Bradley, 2004). The present findings might provide a novel way to reduce the possibility that the attention of addicted individuals would be captured by reward-related cues. It is definitely worth further examination whether a disadvantageous social comparison context can reduce or even completely eliminate the capturing effect of reward-related cues.

In summary, the present results provide evidence that a disadvantageous social comparison context remarkably reduces reward-driven attentional capture.