Distractors associated with reward break through the focus of attention

Munneke, Jaap; Belopolsky, Artem V.; Theeuwes, Jan

doi:10.3758/s13414-016-1075-x

Distractors associated with reward break through the focus of attention

Open access
Published: 01 March 2016

Volume 78, pages 2213–2225, (2016)
Cite this article

Download PDF

You have full access to this open access article

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Distractors associated with reward break through the focus of attention

Download PDF

Jaap Munneke¹,
Artem V. Belopolsky¹ &
Jan Theeuwes¹

2818 Accesses
2 Altmetric
Explore all metrics

Abstract

In the present study, we investigated the conditions in which rewarded distractors have the ability to capture attention, even when attention is directed toward the target location. Experiment 1 showed that when the probability of obtaining reward was high, all salient distractors captured attention, even when they were not associated with reward. This effect may have been caused by participants suboptimally using the 100%-valid endogenous location cue. Experiment 2 confirmed this result by showing that salient distractors did not capture attention in a block in which no reward was expected. In Experiment 3, the probability of the presence of a distractor was high, but it only signaled reward availability on a low number of trials. The results showed that those very infrequent distractors that signaled reward captured attention, whereas the distractors (both frequent and infrequent ones) not associated with reward were simply ignored. The latter experiment indicates that even when attention is directed to a location in space, stimuli associated with reward break through the focus of attention, but equally salient stimuli not associated with reward do not.

Don’t let it distract you: how information about the availability of reward affects attentional selection

Article Open access 21 July 2017

Awareness is necessary for attentional biases by location–reward association

Article 23 March 2021

Spatial task relevance modulates value-driven attentional capture

Article 22 June 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Classic models of attention have long stated that two independent control mechanisms are instrumental for attentional guidance to visual stimuli or events in our immediate surroundings. One control mechanism is assumed to be voluntary and top-down, whereas the other is assumed to be automatic and bottom-up in origin. The interaction between these mechanisms influences the way we perceive the world by selecting stimuli for further processing on the basis of our current goals (top-down) or of the stimulus’s physical saliency in the environment (bottom-up). These models of attention have been studied extensively and have long been considered the only mechanisms responsible for attentional selection (for reviews of the matter, see Corbetta & Shulman, 2002; Theeuwes, 2010; Theeuwes, Olivers, & Belopolsky, 2010).

The strict dichotomy of attentional control mechanisms was called into question when converging evidence was provided for a third category of attentional control, termed “selection history” (Awh, Belopolsky, & Theeuwes, 2012). Selection history represents the attentional bias for stimuli or stimulus features that have been selected as a target in the past. Clear examples of selection history involve a phenomenon known as priming of pop-out (Maljkovic & Nakayama, 1994, 1996; Theeuwes & Van der Burg, 2007), reflecting the observation that visual search is more efficient when the target-defining feature is repeated on consecutive trials, as compared to when the target feature changes. This is an important finding, as these are among the first studies to show that attention can be allocated by factors other than top-down or bottom-up control. Furthermore, selection history has been shown to underlie attentional effects, previously attributed to top-down control (Belopolsky, Schreij, & Theeuwes, 2010; Wolfe, Butcher, Lee, & Hyle, 2003; see Theeuwes, 2013, for a review).

Furthermore, the history of reward associations is a form of attentional selection based on learned associations between a stimulus and a received (monetary) reward. The rationale is that associating a stimulus with a reward boosts its representation on an attentional priority map, biasing attention toward selection of this stimulus (Awh et al., 2012). Similar to the effects of selection history, attentional guidance by reward is driven by neither bottom-up nor top-down mechanisms. Rather, it appears to reflect stimulus history: The mere association of a stimulus with a reward results in attentional capture by the rewarded stimulus, even when a reward is no longer available. The effects of reward on attention can be clearly observed in typical reward tasks (e.g., Anderson, Laurent, & Yantis, 2011) in which stimuli or stimulus features are initially associated with a reward (training phase). In a subsequent testing phase, in which rewards are no longer delivered, the crucial observation is that when the rewarded stimulus is part of the search display (but not the target) it captures attention, even when it is a nonsalient distractor (Anderson et al., 2011; Anderson & Yantis, 2012; Failing & Theeuwes, 2014; Wang, Yu, & Zhou, 2013). This phenomenon is known as value-driven attentional capture, and these effects are taken as evidence that stimuli that have been previously rewarded attract attention because of their learned associated value.

It should be noted that many reward studies have a training phase that is separate from the testing phase (e.g., Anderson et al., 2011; Anderson & Yantis, 2013). Typically, during training, participants repeatedly select the rewarded color (because it defines the target and the participant’s subsequent response), which then during the testing phase has to be ignored (when the previously rewarded target now appears as the distractor). As such, in this setup, selection of the rewarded stimulus in the training phase is pivotal for the outcome of the trial, as it defines the response given by the participant as well as the obtained reward. As a consequence of this setup, it has remained unclear whether and to what extent repeatedly rewarding stimuli that do not need to be selected for correct task completion can lead to value-driven attentional capture. To test whether rewarding the stimulus that defines the response, and hence whether the outcome of the trial was critical for value-driven capture, Le Pelley Pearson, Griffiths, and Beesley (2015) had participants perform a visual search task in which a shape singleton defined the target and a colored distractor singleton signaled the magnitude of the reward that could be earned on that particular trial if observers were to select the target accurately and fast enough. Importantly, the color singleton that signaled the amount of reward was never the target, and its selection was never predictive of the correct response. In fact, selecting the distractor was detrimental to obtaining a reward, due to reaction time limitations. The results of Le Pelley and colleagues showed clear value-driven attentional capture by the distractor stimulus, which cannot be attributed to an association between the rewarded stimulus and the correct response and selection criteria of the task.

These findings suggest that when valued distractors are presented in a search display together with a target, attending the valued distractor is prioritized over the target, resulting in a delayed reaction time to the target. Similar effects of reward on attentional guidance have been observed under varying experimental conditions, consistently showing that previously valued stimuli attract attention, regardless of their experimental status (e.g., whether a stimulus is a target or a distractor; Anderson & Yantis, 2013; Bucker, Belopolsky, & Theeuwes, 2015; Bucker, Silvis, Donk, & Theeuwes, 2015; Failing & Theeuwes, 2014; Wang et al., 2013).

The involuntary nature of value-driven attentional capture parallels the apparent automaticity observed in classic saliency-driven attentional capture. In his classic work, Jonides (1981) defined a number of criteria for the automaticity of attentional allocation, suggesting that capture requires a minimal attentional capacity and is resistant to suppression. Resistance to suppression reflects the mandatory character of attentional capture: Even when observers try to ignore the salient event, they are simply incapable of doing so. Although attentional capture is often described as an automatic process, Jonides showed that increasing the validity of a salient spatial cue that indicated the target location resulted in greater benefits and larger costs for valid and invalid cues, respectively. This finding suggests that observers may have some form of control over the extent to which salient events capture attention.

In line with these observations, Yantis and Jonides (1990) demonstrated that the effects of attentional capture by sudden onsets could be completely annulled by using an effective endogenous location cue. In their studies, participants had to search for and identify a target letter presented among distractor letters in a search display. The target letter could either be an onset letter among no-onset distractor letters or a no-onset target letter with one of the distractors being an onset stimulus. The results showed that reaction times to the target were not slowed by the onset of a distractor stimulus when a symbolic cue indicated the target’s location prior to its onset (Exp. 2), but only when the location cue had a high validity (Exp. 3).

Theeuwes (1991) showed similar results: Salient events outside the attentional window (evoked by an endogenous cue) did not capture attention, whereas events within an attentional window did. These findings are in line with the findings by Yantis and Jonides (1990), clearly showing that attentional capture is not completely automatic and can be influenced by the observer’s top-down attentional set. A reduced attentional window surrounding the target location has been put forward as an explanation for the observed reduced saliency-driven attentional capture (Belopolsky & Theeuwes, 2010; Belopolsky, Zwaan, Theeuwes, & Kramer, 2007; Theeuwes, 1994). Precueing the target location leads to a smaller attentional window surrounding fixation, and salient distractors falling outside of the attentional window no longer capture attention.

Due to the apparent overlap in the involuntary nature of both value-driven and saliency-driven attentional capture, the question arises whether value-associated stimuli can be ignored when observers are strongly focused on a location in space by an endogenous cue. This relation was recently investigated by Munneke, Hoppenbrouwers, and Theeuwes (2015), employing an experimental paradigm in which the location of a target letter was endogenously precued with 80% validity. The target could be presented in one of two onscreen placeholder boxes (left and right of fixation), with a distractor letter presented in the nontarget box. Critically, one of the two placeholder boxes would change to a predefined salient color with the onset of the target, its color reflecting the magnitude of the reward obtainable on that particular trial. One observation of this study showed that, when the distractor box changed to a reward color (turning the box into a valued distractor), overall slowed reaction times to the target letter were observed, with the largest increase in reaction times being for trials in which the color signaled the possibility of high reward.

The findings observed by Munneke et al. (2015) are not fully consistent with the observations by Yantis and Jonides (1990) and Theeuwes (1991), in so far as the study by Munneke et al. showed attentional capture by salient and rewarded stimuli despite endogenously focused attention at the target location. The studies by Theeuwes (1991) and Yantis and Jonides (1990) did not show attentional capture by abrupt onset distractors when a 100%-valid cue informed participants in advance of the location of the upcoming target. This discrepancy between value-driven and saliency-driven capture may indicate that value-associated stimuli occupy a preferred position on an attentional priority map, in comparison to salient stimuli. However, Munneke and colleagues’ study may not have been the most optimal way to study the influence of top-down attention on reward processing during visual search, since only two locations were used, not realistically representing the attentional constraints observed during visual search. Furthermore, an endogenous cue was used that did not predict the location of the target with 100% validity. In the present study, we aimed to investigate the relationship between attentional allocation due to reward history and top-down attentional allocation. More specifically, we addressed the question of whether value-driven attentional capture occurs under conditions in which participants can make use of an effective cue informing them of an upcoming target location. In other words, do distractors associated with reward break through the attentional focus?

In the present work, we used a design based on the study by Le Pelley and colleagues (2015), and added a 100%-valid endogenous spatial cue to the design, presented well before target onset. In this way, we were able to gauge the influence of top-down attention on value-driven attentional allocation. If valued distractors exert a stronger influence on attentional mechanisms compared to nonrewarded salient distractors, then we might expect capture to still occur, despite focused attention.

Experiment 1

In the first experiment, we investigated whether attentional capture by valued distractors occurs when attention is fully focused on the target location by an endogenous cue (a pointer indicating the location of the target). Using an arrow to direct attention in advance to a location in space has been used in many studies as a way to endogenously manipulate spatial attention (see Jonides, 1981; Koelewijn, Bronkhorst, & Theeuwes, 2009; Theeuwes, 1991; Theeuwes & Van der Burg, 2007; Yantis & Jonides, 1990). Note, however, that some studies (e.g., Ristic & Kingstone, 2012) have shown that overlearned symbols such as arrows and possibly also pointers (as we used here) may result in orienting that is at least partly automatic. Regardless of whether orienting is purely endogenous (and/or partly automatic), using a cue before a display onset is an adequate way to manipulate the allocation of attention in the visual field.

Method

Participants

We tested 12 participants (seven females, five males; mean age ± standard deviation: 25.5 ± 4.9 years) with normal or corrected-to-normal vision and no history of mental illness. All participants gave written informed consent prior to the start of the experiment. For their participation, a monetary reward was provided. The experimental procedures of this and all subsequent experiments were approved by the local ethics committee and were in accordance with the Declaration of Helsinki.

Stimuli and procedure

Participants were seated in a dimly lit room 75 cm from a Samsung Syncmaster 2233 monitor with a 22-in. diagonal. All of the stimuli were created and presented using Psychophysics Toolbox 3.0.12 (Brainard, 1997; Pelli, 1997) for MATLAB 2014a (MathWorks, Inc.). Eye movements were monitored using an EyeLink 1000 eyetracker (SR Research, Oakville, Ontario, Canada). A chinrest was used to assure a fixed viewing distance.

The time courses of typical trials in Experiment 1 are illustrated in Fig. 1. Each trial was initiated by presenting a blank screen for 500 ms containing only a fixation dot (0.3°). Subsequently, eight circles (1.6° in diameter) surrounding fixation (radius 5.4°) would appear indicating the possible target locations. Each circle contained a figure-8 placeholder (0.8° × 0.5°) to be substituted with a letter at a later moment in the trial. Next, participants were shown an endogenous cue in the form of a small line-arrow (0.8°), which indicated the location of the upcoming target with 100% validity. After 750 ms, the placeholders turned into letters. The target letter, indicated by the cue, would turn into the letter “P” or “S,” whereas the remaining letters turned into the letters “E” or “H.” The target and distractor letters stayed on the screen until the participant had responded to the identity of the target stimulus by pressing one of two predefined keys on a standard keyboard (“z” key for S, “m” key for P) or until 1.5 s had passed. A reward screen would then appear, informing the participants about the number of points won on the trial. Prior to the start of the experiment, participants were instructed to respond as quickly as possible and not to make eye movements. In addition, they were informed that the cue was 100% valid and that the magnitude of the reward on each trial was dependent on the stimuli on the screen, their accuracy, and the speed of their responses. No further details were given with regard to the reward.

Two different trial types were used throughout the experiment. First, during an onset trial, simultaneously with revealing the target and distractor letters (by removing some of the line segments of the figure-8 premasks), a colored distractor would appear. The colored onset consisted of a circle presented at a random position between two of the original stimulus positions and never contained the target letter. On no-onset trials, an additional gray circle was present throughout the trial (see Theeuwes et al., 1999, who used a similar procedure). Similar to the other nontarget locations, this additional circle contained a placeholder stimulus that subsequently changed into a distractor letter at the moment the other placeholders changed into letters. The target location and the location of the additional stimulus in both trials types were counterbalanced over the experiment and occurred equally often at each location.

Importantly, prior work has shown that two stimuli presented close together will vie for neural representation. This competition is resolved (biased) by attention (Desimone, 1998; Desimone & Duncan, 1995). However, biased competition may entail that stimuli presented close together are processed in a qualitatively different manner than stimuli presented farther apart. Therefore, given the present design, in which the target and distractor could be presented next to each other, any influence of a closely proximate distractor on target processing does not need to reflect attentional capture. Instead, it may reflect the processes involved in resolving competition between the target and distractor. For this reason, all analyses in the Results sections for this and the following experiments have the trials removed in which target and distractor were presented directly next to each other.

Crucially, the distractor in the onset condition could be presented in three different yet equiluminant colors (red, green, and blue: 31.3 cd/m²). The remaining stimuli were presented in a light shade of gray (63 cd/m²). All stimuli were presented on a light gray background with an overall luminance of 26 cd/m². These colors, which were counterbalanced over participants, indicated the magnitude of the reward that could be obtained on that particular trial with 100% certainty. Three reward levels were used: high reward (10 points), low reward (2 points), and no reward (0 points). These points translated to real money paid at the end of the experiment. However, a reward could only be obtained if the participant responded accurately and within 800 ms after target onset. If participants responded incorrectly or slower than 800 ms, no reward was administered. In the no-onset condition, participants would always obtain 0 points, as in the no-reward condition.

The experimental design consisted of six blocks of 128 trials, preceded by 30 practice trials. The four reward conditions occurred equally often and were mixed within blocks (three reward onset conditions and a no-reward, no-onset condition; 25% trials per condition). The entire experiment took approximately 80 min to complete.

Results

Reaction times

Trials in which participants made eye movements larger than 1.25° away from fixation were discarded from the dataset (12.8%). Reaction times included in the analyses were derived from trials in which participants responded correctly (9.4% discarded) and with reaction times between 200 and 1,000 ms (fewer than 1% discarded). To gain a better understanding of how reward influences attentional allocation given full advance knowledge of the target’s location, a repeated measures analysis of variance (ANOVA) on the reaction times obtained in the experiment was performed. Figure 2 shows the mean reaction times per condition (top panel). Trial Type (high-reward, low-reward, no-reward, and no-onset) was used as a within-subjects factor. The results showed a main effect of trial type [F(3, 33) = 4.477, p = .01, η _p ² = .289, power = .838], indicating that participants did not respond equally quickly in all conditions. To investigate the amounts of capture for the different conditions, we used paired-samples t tests to compare the reaction times obtained for the different reward levels with those obtained in the no-onset condition. The results of these planned comparisons showed that all salient distractors captured attention. At each reward level, including the no-reward condition, reaction times were significantly slower than in the no-onset condition [no onset, 519 ms; high reward, 532 ms, t(11) = 2.586, p = .025; low reward, 530 ms, t(11) = 3.107, p = .01; no reward, 531 ms, t(11) = 3.484, p = .005]. An additional one-way ANOVA comparing the conditions in which a rewarded distractor was presented showed no differences in reaction times between the different reward levels (F < 1).

Accuracy

A repeated measures ANOVA with Reward Level as a within-subjects factor on the accuracy data did not yield any significant differences between the reward levels (F < 1). Figure 2 shows the average accuracies per condition (bottom panel).

Discussion

The results of Experiment 1 showed attentional capture for all conditions in which a salient onset distractor appeared, as compared to a no-onset condition. No differences in reaction times between any of the reward levels were observed. A straightforward explanation of these findings suggests that, despite participants being fully aware of the target location in advance, salient distractors still captured attention. However, this finding is not consistent with previous work that has repeatedly shown that salient stimuli falling outside the attentional window do not capture attention (Belopolsky & Theeuwes, 2010; Belopolsky et al., 2007; Theeuwes, 1991; Yantis & Jonides, 1990). This discrepancy may be explained by the role of reward in the present experiment. Since participants could obtain (different levels of) reward on each trial and might have preferred to monitor this information, they may not have optimally used the cue to focus attention on the target location. Reward-seeking behavior, the tendency to find and select stimuli associated with reward, may result in a less focused attention on the target location. Because attention is not fully focused on the target location, all salient stimuli, including distractors that are not associated with reward, will capture attention. In short, the presence of a possible reward evokes a reward-seeking attentional set at the cost of focused attention, which allows any salient distractor to capture attention. Note that this reasoning suggests that the presence of a possible reward-signaling stimulus leads to an attentional set that results in capture. Given that participants always knew the location of the target in advance, to further test the hypothesis that salient distractors only capture attention when they are associated with reward, we blocked the reward and no-reward conditions in Experiment 2.

Experiment 2

The findings of Experiment 1 indicated that when obtaining a reward is possible, top-down cues are not used to their full extent. Despite our use of a 100%-valid cue, attention may not have been fully focused on the target location, leading to saliency-driven attentional capture. This would be consistent with Yantis and Jonides (1990), who showed that only when observers fully focus their attention do abrupt onsets cease to capture attention. Indeed, in their 75%-and-25% validity condition, participants adopted a more diffuse mode of attentional allocation, leading to attentional capture by abrupt onsets.

An alternative explanation is that the results we obtained in Experiment 1 might be attributed to the type of distractors we used. The reward-signaling distractors were singletons varying along two dimensions (onset and color), making them more salient than the stimuli used by Yantis and Jonides (1990). The highly salient nature of the current distractors could have resulted in attentional capture, despite focused attention at the cued target location and independent of any reward-induced top-down set. If a claim can be made that the possibility of reward leads to attentional capture, further results should show differences in reaction times between trials that lead to reward and those that do not.

In Experiment 2, the ability to obtain a reward was blocked. In this way, capture by salient distractors with and without a reward-seeking attentional set could be investigated. If the possibility of reward leads to an attenuated use of the endogenous cue, then no attentional capture would be expected in blocks without reward administration (as in Yantis & Jonides, 1990, and Theeuwes, 1991). Additionally, attentional capture by salient distractors might still occur in blocks in which reward was associated with these stimuli.