Value-modulated oculomotor capture by task-irrelevant stimuli is a consequence of early competition on the saccade map

Pearson, Daniel; Osborn, Raphaella; Whitford, Thomas J.; Failing, Michel; Theeuwes, Jan; Le Pelley, Mike E.

doi:10.3758/s13414-016-1135-2

Value-modulated oculomotor capture by task-irrelevant stimuli is a consequence of early competition on the saccade map

Published: 16 May 2016

Volume 78, pages 2226–2240, (2016)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Value-modulated oculomotor capture by task-irrelevant stimuli is a consequence of early competition on the saccade map

Download PDF

Daniel Pearson¹,
Raphaella Osborn¹,
Thomas J. Whitford¹,
Michel Failing²,
Jan Theeuwes² &
…
Mike E. Le Pelley¹

3017 Accesses
41 Citations
2 Altmetric
Explore all metrics

Abstract

Recent research has shown that reward learning can modulate oculomotor and attentional capture by physically salient and task-irrelevant distractor stimuli, even when directing gaze to those stimuli is directly counterproductive to receiving reward. This value-modulated oculomotor capture effect may reflect biased competition in the oculomotor system, such that the relationship between a stimulus feature and reward enhances that feature’s representation on an internal priority map. However, it is also possible that this effect is a result of reward reducing the threshold for a saccade to be made to salient items. Here, we demonstrate value-modulated oculomotor capture when two reward-associated distractor stimuli are presented simultaneously in the same search display. The influence of reward on oculomotor capture is found to be most prominent at the shortest saccade latencies. We conclude that the value-modulated oculomotor capture effect is a consequence of biased competition on the saccade priority map and cannot be explained by a general reduction in saccadic threshold.

Vision as oculomotor reward: cognitive contributions to the dynamic control of saccadic eye movements

Article Open access 25 January 2021

The effects of saccade-contingent changes on oculomotor capture: salience is important even beyond the first oculomotor response

Article 31 May 2014

Humans trade off search costs and accuracy in a combined visual search and perceptual task

Article Open access 30 November 2022

Traditionally, attention has been argued to be subject to two different types of control: one that is volitional and goal directed (top-down control) and another that is automatic and stimulus driven (bottom-up control) (Yantis, 2000). Recently, however, a case has been made for a third category of influences on attentional selection that is neither goal directed nor stimulus driven. Specifically, it has been suggested that our attention is influenced by what we have learned about how stimuli relate to other events in the environment (Anderson, 2013; Awh, Belopolsky, & Theeuwes, 2012; Chelazzi, Perlato, Santandrea, & Della Libera, 2013; Le Pelley, Mitchell, Beesley, George, & Wills, 2016; Le Pelley, Pearson, Griffiths, & Beesley, 2015; Mitchell & Le Pelley, 2010).

The suggestion that attention and learning might interact has a long history in the conditioning literature (Kamin, 1968; Mackintosh, 1975; Trabasso & Bower, 1968; for a review, see Le Pelley, 2004). However, recent work—much of it from the lab of Steven Yantis—has reinvigorated this idea by demonstrating conclusively that learning about the relationships between stimuli and rewards influences the likelihood that those stimuli will automatically capture attention (e.g., Anderson, Laurent, & Yantis, 2011a, 2011b; Anderson & Yantis, 2012; Della Libera & Chelazzi, 2009; Failing & Theeuwes, 2014; Hickey, Chelazzi, & Theeuwes, 2010, 2011; Le Pelley et al., 2015; Pearson, Donkin, Tran, Most, & Le Pelley, 2015; Rutherford, O’Brien, & Raymond, 2010; Theeuwes & Belopolsky, 2012; for a systematic review, see Le Pelley et al., 2016). Specifically, these studies demonstrate that stimuli associated with high-value rewards are more likely to capture attention than equally salient stimuli that are associated with low rewards (or no reward). For example, Anderson et al. (2011b) gave participants extensive pretraining on a visual search task in which red and green circles defined the targets and also signalled the reward magnitude that would be earned for a rapid response to those targets. If the target circle was rendered in (say) red, participants would receive a relatively large reward (5¢) for a rapid correct keypress response, whereas if the target was rendered in green, the participant would receive a relatively small reward (1¢) for the same response. Thus, in the example given here, red acted as the high-value colour and green acted as the low-value colour (these colour–reward contingencies were reversed for half of the participants). In a subsequent test phase, the target was defined by shape (e.g., a diamond in an array of circles), and participants no longer received rewards for their responses. On 50 % of trials in the test phase, one of the nontarget shapes (termed the distractor) was rendered in either the high-value colour or the low-value colour. The critical finding was that the distractor rendered in the high-value colour was more likely to capture attention than a distractor that had never been rewarded. This occurred despite the fact that participants were explicitly informed that colour was no longer relevant to the task during the test phase and that all shapes in the search display were uniquely coloured, such that the reward-associated distractor was not physically salient. This was taken as evidence to suggest that reward learning influences attentional capture separately from stimulus-driven and goal-directed processes.

Similar effects have been reported in studies of eye movements. For example, Theeuwes and Belopolsky (2012) used a gaze-contingent visual search task that was conceptually similar to Anderson et al. (2011b), in which participants were required to make a saccade to a target stimulus (a horizontal or vertical bar) presented in an array of other shapes during an initial pretraining phase. If the participant made an accurate and fast saccade to one of these target stimuli, they would receive a high-value reward, whereas those to the other target stimulus were followed by relatively low-value reward. In a subsequent test phase in which participants were to make a saccade to a colour-defined target, eye gaze was more likely to be captured by a distractor that had the shape that was previously paired with high-value rewards, compared to the shape that had been paired with low-value rewards.

A common feature of both of these studies is that the stimuli that captured attention during the critical test phase were task relevant during the initial training phase. That is, participants needed to quickly orient to the target stimulus in order to earn the reward for that trial. This raises the possibility that the attentional capture by these stimuli in the test phase (where attending to the stimulus was no longer relevant to the participant’s goals) was simply a carryover of the attentional and/or oculomotor orienting response that was initially trained. It is well known from the reward-learning literature that following an action with reward increases the likelihood that the action will occur in the future (Thorndike’s law of effect; Thorndike, 1911). Perhaps, then, it is not surprising that a rapid orienting response that was followed by large reward in the training phase continues to occur in a subsequent test phase, even when the response is no longer relevant to the task goals. That is, it is possible that the attentional and oculomotor orienting observed in these studies simply reflects a learned (conditioned) response that is automatically reenacted whenever the relevant conditioned stimulus appears (see also Beesley, Pearson, & Le Pelley, 2015).

However, a recent series of studies by Le Pelley et al. (2015; see also Pearson et al., 2015) has demonstrated that reward value modulates attentional and oculomotor capture, even when attending to the reward-predicting stimulus has never been task relevant. For example, in Le Pelley et al.’s Experiment 3, participants engaged in a gaze-contingent visual search task in which they were required to make a rapid saccade towards a shape singleton target (a diamond in an array of circles). On most trials, a colour-singleton distractor circle was present in the display, rendered in either red or blue (all other shapes were grey). Critically, the colour of this distractor signalled the size of the reward available on that trial. For example, a red distractor might signal that a rapid saccade to the diamond target would earn a large reward (10¢), whereas a blue distractor signalled that a rapid saccade to the target would earn a small reward (1¢). Note that while the coloured distractor signalled reward magnitude, it was not the target that participants were required to respond to in order to obtain that reward; in this sense, it was task-irrelevant. Indeed the task was arranged such that if ever any gaze was recorded on or near the distractor circle prior to looking at the target, the reward that would otherwise have been delivered on that trial was cancelled: These trials were called omission trials. Therefore, throughout the experiment, participants were never rewarded for shifting their gaze to the colour-singleton distractor. In fact, making an eye movement to the distractor was directly counterproductive because it cancelled the reward and hence resulted in a lower payoff for participants. Nevertheless, participants triggered more omission trials when the high-value distractor was present in the display than when the low-value distractor was present. That is, participants were more likely to make eye movements towards the high-value distractor than the low-value distractor, even though this led to the cancellation of more high-value rewards than low-value rewards. This suggests that people develop an attentional and oculomotor bias towards stimulus features that signal high reward, even when orienting to those features has never been relevant to obtaining the reward. We have termed this effect value-modulated oculomotor capture (VMOC; Pearson et al, 2015).

Given that the distractor stimulus in the search display of Le Pelley et al.’s (2015) procedure was physically salient (since it was a colour singleton), we might expect it to capture attention and eye movements on the basis of this physical salience in a stimulus-driven fashion (Theeuwes, 1992, 1994; Theeuwes, Kramer, Hahn, Irwin, & Zelinsky, 1999). However, the results of this study show that the physical characteristics of the stimuli cannot be the only determinants of attentional capture, because the salience of the high- and low-value distractors was matched across participants by counterbalancing. The implication is that the likelihood of capture is also influenced by learning about the size of the reward signalled by the distractor, independently of its physical salience (hence our description of value-modulated, rather than value-driven, capture). Furthermore, as the VMOC effect is directly counterproductive to the participant’s goal of maximizing reward, it seems that the processes responsible are automatic rather than being under the control of a top-down selection strategy (see also Pearson et al., 2015). In line with this idea, it has been suggested that pairing a particular stimulus feature (e.g., red colour) with reward increases the strength of that feature’s representation on attentional (Awh et al., 2012) and saccadic (Belopolsky, 2015; Theeuwes & Belopolsky, 2012) priority maps.

Several models of oculomotor selection have suggested that the relative priorities of all stimuli in the visual field are represented on a topographical saccade map (e.g., Itti & Koch, 2001; Li, 2002; Wolfe, Cave, & Franzel, 1989), which is generally argued to be located in the superior colliculus (e.g., Godijn & Theeuwes, 2002; Trappenberg, Dorris, Munoz, & Klein, 2001). The activity on this map determines which stimuli are selected by the visual system, with eye movements and attention being directed to the object that generates the largest peak of activity. In many of these models, the strength of a stimulus’ representation on the map is purely a consequence of its stimulus-driven salience, such that the more physically distinct a stimulus is from its surroundings, the greater its associated activity (e.g., Itti & Koch, 2001; Li, 2002). However, several findings have suggested that the priority map also incorporates goal-directed inputs to the oculomotor system (albeit more slowly) through a process of competitive integration (Godijn & Theeuwes, 2002; Meeter, Van der Stigchel, & Theeuwes, 2010; Trappenberg et al., 2001). According to this competitive integration model (Godijn & Theeuwes, 2002), when a search display containing a target and a colour-singleton distractor is presented to an observer, two peaks of activity are produced on the saccade map. Activity at each of these locations is assumed to spread to nearby locations, while inhibiting more distant activity. Once the activity at one point on the map passes a certain threshold, a saccade is made towards that location. This model can be usefully applied to a number of phenomena that arise from placing goal-directed and stimulus-driven activity in competition with one another. For example, when a salient onset distractor is presented in a relatively distant location from a target, the latency of correct saccades to the target is increased (remote distractor effect; Godijn & Theeuwes, 2002). This suggests that the stimulus-driven saccadic activity associated with the salient distractor inhibits the goal-directed saccadic activity associated with the target, such that it takes longer for the target program to reach the threshold for a saccade to be made.

According to the competitive integration account, as it was originally formulated, early saccades are driven purely by the physical salience of a stimulus, whereas the effect of goal-directed, top-down influences can be observed on slower saccades. However, as noted earlier, recent studies have identified an influence of reward prediction as a distinct influence on attention. This raises the question of how reward exerts its effect on the saccade map in the process of competitive integration. In particular, if early competition in the saccade map is purely driven by the physical salience of a stimulus, with other nonsensory signals being integrated at a later stage (as is commonly assumed; e.g., Donk & van Zoest, 2008; Godijn & Theeuwes, 2002; Ludwig & Gilchrist, 2002, 2003a, 2003b; Mulckhuyse, van Zoest, & Theeuwes, 2008; van Zoest, Donk, & Theeuwes, 2004), then we should expect to see no influence of reward on early saccades; instead, as in the case of goal-directed influences, an effect of reward may emerge only for slower eye movements. In contrast, if reward exerts its influence early on in visual processing, either by enhancing the bottom-up salience signal of the reward-associated stimulus in low-level cortex (Hickey et al., 2010) or through independent inputs to the saccade map that engage in direct competition with the bottom-up salience signal (Belopolsky, 2015), we would expect the effect of reward to be largest when saccade latency is short, because the slow, top-down target selection process has yet to be engaged. The experiments described in the current article provide a direct test of this idea.

As noted above, previous studies (Le Pelley et al., 2015; Pearson et al., 2015) have demonstrated that reward exerts an influence on oculomotor capture by task-irrelevant stimuli that is independent of physical salience in speeded search tasks, which might suggest that reward influences early competition on the saccade map. However, in these studies, the critical reward-predicting stimulus was a physically salient colour singleton. As a result, it is possible that the observed influence of reward on eye movements was not a consequence of reward enhancing activity at the locations of reward-related features on the saccade map at all. Instead, it may be the case that the presence of a stimulus feature associated with a high-value reward simply lowers the threshold for a bottom-up saccade to be made to any salient stimulus (Theeuwes & Belopolsky, 2012; see also Anderson, Lauren, & Yantis, 2013). Thus, oculomotor capture by the salient reward-associated distractor is more likely on high-value trials than on low-value trials, because the threshold that the associated saccadic activity has to exceed is reduced relative to low-value trials. On this account, the pairing of a particular stimulus feature with reward does not increase the likelihood for that feature to capture attention and eye movements in the future, per se. Rather, the presence of a reward-associated stimulus lowers the threshold for a saccade to be made to any salient stimulus, regardless of whether it possesses the reward associated feature. Notably, this hypothesis is consistent with findings from primate research, which demonstrate that the expectation of reward disinhibits the activity of neurons in the superior colliculus via feedback connections from reward-processing structures such as the basal ganglia (Ikeda & Hikosaka, 2003). In principle, the increased rate of baseline activity resulting from this disinhibition should make it easier for stimulus-evoked activity from other bottom-up inputs to elicit saccades (Vokoun, Mahamed, & Basso, 2011), effectively lowering the saccade threshold.

A recent study by Failing, Nissens, Pearson, Le Pelley, and Theeuwes (2015) goes some way towards empirically assessing this latter account of VMOC. In this study, participants completed the gaze-contingent visual search task used by Le Pelley et al. (2015), but all stimuli in the search display were rendered in different colours, so that the reward-predicting distractors (red and blue circles) were no longer colour singletons and hence not physically salient. Nevertheless, a VMOC effect occurred; participants were more likely to look at (and hence trigger omission trials for) the high-value distractor than the low-value distractor. Furthermore, the VMOC effect was found to be largest at short saccade latencies—a pattern that is very similar to what has previously been observed in oculomotor capture by physically salient stimuli (e.g., Donk & van Zoest, 2008). This is consistent with the idea that reward exerts an influence on low-level oculomotor competition in the saccade map. However, because all of the stimuli in the display were nonsalient, we would not expect the reward-associated distractors to generate any more bottom-up saccadic activity than any other stimulus in the display. Thus, while Failing et al.’s findings demonstrate that reward exerts its influence at an earlier stage of processing than goal-directed target selection does, it remains unclear whether reward value and physical salience engage in direct competition on the saccade map such that reward influences the very fastest saccades. That is, rapidly initiated saccades may be driven purely by the physical salience of stimuli in the visual field, whereas slow saccades are influenced by top-down target selection processes, and reward exerts its influence on saccades with an intermediate latency.

The current experiments aimed to establish more clearly whether reward influences early competition on the saccade map, using a variant of Le Pelley et al.’s (2015) VMOC procedure with task-irrelevant distractors. On the majority of trials, the reward-predicting distractors were salient colour singletons, as in our previous work. However, on a subset of trials, participants were presented with both reward-predicting distractors (i.e., the high-value and low-value distractors) in the same search display. Hence, on these “both-distractor” trials, the display contained two task-irrelevant, coloured items of equivalent physical salience. If reward value influences oculomotor capture by reducing the threshold for a bottom-up saccade to a physically salient stimulus, we would expect oculomotor capture to be equally distributed between the two equally salient distractors. If, however, reward value influences competition on the saccade map, gaze should be preferentially captured by the high-value distractor rather than the low-value distractor, on both-distractor trials. In particular, if this pattern of greater oculomotor capture by the high-value distractor were apparent for the most rapidly initiated saccades, then this would suggest that early competition on the saccade map is modulated by reward prediction and is not merely a function of physical salience.

Experiment 1

Method

Participants

Previous studies (Failing et al., 2015; Le Pelley et al., 2015; Pearson et al., 2015) have found medium to very large effect sizes for the single-distractor VMOC effect (Cohen’s d ≈ 0.54–2.2). Thus, we ran the experiment for as many days as required to test 40 participants, which would give us power of ~0.87 to detect an effect size of d = 0.5. In total, 42 UNSW Australia students (mean age = 21.4, SEM = 0.72, 10 females) participated for course credit. They also received a monetary bonus dependent upon their performance (M = 13.16 AUD, SEM = 0.43 AUD). All research reported in this article was approved by the Human Research Ethics Advisory Panel (Psychology) of UNSW Australia.

Apparatus

Participants were tested individually using a Tobii TX300 eye tracker, with 300 Hz temporal and 0.15° spatial resolution, mounted on a 23-inch monitor (1920 × 1280 resolution, 60 Hz refresh rate). Participants’ heads were positioned in a chin rest 60 cm from the screen. For gaze-contingent calculations, the experiment script sampled from the eye tracker every 10 ms, with current gaze location defined as the average gaze location during the preceding 10 ms sample. The eye tracker was calibrated prior to the practice phase, prior to the main experiment, and twice more during the experiment (after seven blocks, and after 14 blocks). Stimulus presentation was controlled by MATLAB using Psychophysics Toolbox extensions (Brainard, 1997; Kleiner et al., 2007; Pelli, 1997).

Stimuli

Each trial consisted of a fixation display, a search display, and a feedback display (see Fig. 1). All stimuli were presented on a black background. The fixation display consisted of a white cross (0.5 degrees of visual angle; dva) presented in the centre of the screen, inside a white circle (3.0 dva). The search display consisted of the fixation cross surrounded by six filled shapes (2.3 × 2.3 dva) equally distributed on an imaginary ring with diameter 10.1 dva. The first stimulus was positioned directly above the fixation cross. Five of the shapes were circles (nontargets), and one was a diamond (target). The diamond and three of the circles were always rendered in grey. The remaining two circles were rendered in either red, green, or the same shade of grey as the other shapes (CIE x, y, chromaticity coordinates of .602/.371 for red, .260/.588 for green, and .326/.388 for grey) depending on the trial type (see Design section). The values of red and green had similar luminance (~40 cd/m²), which was higher than that of grey (14.2 cd/m²). The feedback screen displayed the points earned on the previous trial as well as the total points accumulated in the experiment. If response time (RT) was greater than the soft-timeout threshold (see below), the message “Too slow” appeared below the feedback that the reward was 0 points for the trial.

Design

For half of the participants, red was the high-value colour and green was the low-value colour; these colour–reward relationships were reversed for the other half of participants. There were four different types of trial: (1) trials in which one of the nontarget circles was rendered in the high-value colour (henceforth referred to as high-single distractor trials), (2) trials in which one of the nontarget circles was rendered in the low-value colour (low-single distractor trials), (3) trials in which all nontarget circles were rendered in grey (distractor-absent trials), and (4) trials in which one of the nontarget circles (the high-value distractor) was rendered in the high-value colour and another non-target circle (the low-value distractor) was rendered in the low-value colour (both-distractor trials). The experiment comprised 21 blocks of 34 trials each, for a total of 714 experimental trials. Each block consisted of 12 high-single distractor trials, 12 low-single distractor trials, six both-distractor trials, and four distractor-absent trials, in random order.

On each trial, the location of the target was randomly determined. On high-single and low-single distractor trials, the location of the distractor was random with the constraint that it was never positioned directly opposite the target but was either one or two positions away (i.e., the polar angle between the target and the distractor was either 60° or 120°). On both-distractor trials, the high-value distractor was positioned with the same constraints as above, and the low-value distractor was positioned so as to be the same distance from the target as the high-value distractor (i.e., if the high-value distractor was 60° from the target in the clockwise direction, the low-value distractor would be 60° from the target in the anticlockwise direction).

A small circular region of interest (ROI) with diameter 3.5 dva was defined around the centre of the diamond target; a larger ROI (diameter 5.1 dva) was defined around each of the distractors. A response was registered after the participant accumulated 100 ms of gaze dwell time within the target ROI. Responses with RTs that were slower than the soft-timeout threshold were not rewarded; this threshold was 800 ms for the first training block and 600 ms for the subsequent blocks. If ever any gaze was detected within one of the distractor ROIs, the trial was recorded as an omission trial and the reward was not delivered. On distractor-absent trials, one of the nontarget circles (that was either one or two positions away from the target) was selected at random to act as the omission-triggering location; gaze falling within an ROI surrounding the selected grey circle triggered an omission in exactly the same way as if it were a distractor.

On each trial, reward was delivered if RT was faster than the soft-timeout threshold and an omission trial had not been triggered: 500 points on high-single distractor trials, 10 points on low-single distractor trials, and an equal likelihood of 500 points or 10 points on both-distractor trials^{Footnote 1} and distractor-absent trials.

Procedure

Participants were told that their task was to move their eyes to the diamond shape on each trial and that they could earn either 0 points, 10 points, or 500 points “depending on how fast and accurate” their response was. They were informed that the points earned during the task would determine the monetary reward they received at the end of the experiment and that most participants could earn between 7 and 15 AUD for good performance (no specific information was given about the conversion rate from points to AUD). The session began with eight unrewarded practice trials that contained a yellow distractor, followed by the experimental trials. Participants took a short rest break every 68 trials.

Each trial began with the presentation of the fixation display. Participants’ gaze location was superimposed on the display as a small yellow dot. Once 700 ms of gaze dwell time had been recorded within the circle surrounding the fixation cross, or after 5 s, the cross and the circle turned yellow, and the dot marking the participant’s gaze location disappeared. After 300 ms the screen went blank, and after a random period of 600, 700, or 800 ms, the search display appeared. The trial terminated after a response was recorded (see Design), or after 2 s (hard timeout) had passed. The feedback display then appeared and remained onscreen for 2,500 ms in the first experimental block, and 1,500 ms in all subsequent blocks. The intertrial interval was 700 ms.

Data analysis

Following previous protocols (Le Pelley et al., 2015; Pearson et al., 2015), data from the first two experimental trials and the first two trials after each break were discarded. Hard timeouts (1.8 % of all trials) were also discarded. For the remaining trials, averaging across all participants, valid gaze location data were registered in 97.0 % (SEM 0.6 %) of all samples. This suggests very high fidelity of gaze data on these trials.

For the analysis of saccade latencies (i.e., the time between the presentation of the search display and the initiation of the first saccade), a velocity-threshold identification (I-VT) algorithm (Salvucci & Goldberg, 2000) with a velocity criterion of 30 dva/s was used to detect saccades in the raw data from the eye tracker (sampled at 300 Hz rather than 100 Hz used for the gaze-contingent calculations). For these analyses (and again following Le Pelley et al., 2015), in addition to the exclusions described previously, we further excluded all trials in which the latency of the first saccade after the presentation of the search display was less than 80 ms (anticipatory saccades; 12.3 % of all trials) or no gaze was recorded within 5.1 dva (100 pixels) of the fixation point within the first 80 ms (7.5 % of all trials).

Results

Omission trials

Figure 2A shows the proportion of omission trials averaged across all blocks. These data were analysed using a one-way (trial type: high-single, low-single, both, absent) analysis of variance (ANOVA), which found a significant main effect of trial type, F(3, 123) = 50.4, p < .001, \( {\upeta}_{\mathrm{p}}^2 \) = .55. Planned pairwise t tests were used to further explore this effect. More omissions were triggered on each of the trial types that contained a salient distractor than on distractor-absent trials—high-single versus absent: t(41) = 7.39, p < .001, d = 1.14; low-single versus absent: t(41) = 6.53, p < .001, d = 1.01; both versus absent: t(41) = 9.74, p < .001, d = 1.50. Furthermore, more omissions were triggered on both-distractor trials (where gaze on either distractor could trigger an omission) than on trials that contained a single distractor—both versus high-single: t(41) = 4.42, p < .001, d = .68; both versus low-single: t(41) = 6.40, p < .001, d = .99. Finally, and most importantly, omissions were more likely on trials that contained the high-value distractor alone than trials that contained the low-value distractor alone, t(41) = 2.82, p = .007, d = .44, demonstrating a significant VMOC effect.

Distribution of gaze on both distractor trials

The difference in proportion of omissions on high-single distractor versus low-single distractor trials replicates the findings of Le Pelley et al. (2015). Having verified that this procedure produced a VMOC effect, we turned to the analysis of the both-distractor trials in order to determine whether reward-value information biases competition on the saccade map. Crucially, Fig. 2B shows that participants looked at the high-value distractor more often than the low-value distractor on both-distractor trials, and a paired t test revealed that this difference was significant, t(41) = 2.22, p = .032, d = .34.

Correlation of single-distractor VMOC effect with both-distractor VMOC effect

The difference in proportion on omission trials on high-single versus low-single distractor trials provides one measure of the influence of value on oculomotor capture (we term this the single-distractor VMOC effect). The difference in proportion of both-distractor trials with gaze on the high-value distractor versus the low-value distractor provides a second, independent measure (both-distractor VMOC effect). The scatterplot in Fig. 3 shows a strong positive correlation between these two measures, Pearson’s r(40) = .836, p < .001. That is, participants who showed a larger difference between proportion of omissions on high-single and low-single distractor trials (i.e., those who displayed a larger single-distractor VMOC effect) tended to show more oculomotor capture by the high-value distractor than the low-value distractor when both stimuli were presented simultaneously.

Latency of first saccades

Preliminary analysis showed that latency of first saccades did not change significantly over the course of the experiment, with no evidence of a linear trend across blocks, F(1, 39) < 1. Subsequent analyses therefore collapsed across blocks. Figure 2C shows mean latency of the first saccade for each trial type as a function of saccade direction. Saccades were defined as going in the direction of the target or the distractor if the endpoint of the saccade had an angular deviation of less than 30° to the left or right (i.e., half the distance between the stimuli in the display) of the critical stimulus. The data were analysed using a 3 (trial type: high-single, low-single, both) × 2 (saccade direction: target, distractor) ANOVA. This showed a significant main effect of direction, F(1, 41) = 194.3, p < .001, \( {\upeta}_{\mathrm{p}}^2 \) = .83, with shorter latency for saccades going to the distractor than those going to the target. The main effect of trial type, F(2, 82) = 2.22, p = .115, \( {\upeta}_{\mathrm{p}}^2 \) = .05, and trial type × direction interaction, F(2, 82) = .82, p = .444, \( {\upeta}_{\mathrm{p}}^2 \) = .02, were both nonsignificant. Paired t tests found no evidence that the presence of at least one physically salient distractor in the display had an effect on the latency of saccades to the target—absent versus high-single, t(41) = .577, p = .567, d = .11; absent versus low-single, t(41) = .876, p = .386, d = .14; absent versus both, t(41) = .718, p = .477, d = .11. Figure 2D shows a breakdown of first saccade latencies on both-distractor trials according to whether they were towards the target, the high-value distractor, or the low-value distractor. Two participants did not make any initial saccades towards one of the distractors and so were removed from subsequent analyses (remaining n = 40). As above, latencies of saccades going towards the target were significantly longer than those towards either of the distractors on these trials—target versus high-value distractor, t(39) = 5.11, p < .001, d = 0.81; target versus low-value distractor, t(39) = 8.18, p < .001, d = 1.29. There was no significant difference in the latency of saccades going to the high-value and low-value distractors, t(39) = .49, p = .63, d = .08.

Time course of the both-distractor VMOC effect

In order to investigate the time course of the VMOC effect on both-distractor trials, we analysed the proportion of first saccades going towards the high-value and low-value distractors on both-distractor trials as a function of saccade latency using the Vincentizing procedure (Ratcliff, 1979). We calculated mean first saccade latencies and the proportion of first saccades going towards each distractor separately for each decile of the individual saccade latency distributions (see Fig. 4). The data were initially analysed using a 2 (distractor: high-value, low-value) × 10 (decile) ANOVA. This found a significant main effect of distractor, F(1, 41) = 4.54, p = .039, \( {\upeta}_{\mathrm{p}}^2 \) = .10, such that more first saccades went to the high-value distractor than the low-value distractor, averaged across saccade latency. There was also a significant main effect of decile, F(9, 369) = 19.83, p < .001, \( {\upeta}_{\mathrm{p}}^2 \) = .33, indicating that the proportion of saccades going towards either distractor decreased as a function of saccade latency. The distractor × decile interaction was nonsignificant, F(9, 369) = .66, p = .748, \( {\upeta}_{\mathrm{p}}^2 \) = .02. To determine whether the both-distractor VMOC effect was evident for particularly rapid saccades, a planned, paired samples t test was used to compare the proportion of first saccades going towards the high-value distractor and low-value distractor for the fastest decile of saccades. This revealed a marginally significant difference, t(41) = 1.72, p = .093, d = .27.

Discussion

In Experiment 1, participants were more likely to have their gaze captured by the high-value distractor than the low-value distractor, even though looking at the distractor stimuli was directly counterproductive to the participant’s goals, because it resulted in the omission of the reward that would otherwise have been delivered on that trial. This is a replication of the VMOC effect that we have previously reported (Failing et al., 2015; Le Pelley et al., 2015; Pearson et al., 2015). The important novel finding of this experiment is that the VMOC effect was also evident when both the high- and low-value distractors were present in the same search display. We take this as evidence that reward value engages in competition with stimulus-driven and goal-directed inputs to the oculomotor system on a common saccadic priority map. In a time-course analysis, there was a trend towards this both-distractor VMOC effect being present in the shortest decile of saccade latencies, which suggests that reward information may influence competition on the saccade map even at particularly early stages of processing. Furthermore, the both-distractor VMOC effect was highly correlated with the VMOC effect evident on single-distractor trials, which suggests that the same processes are likely to be responsible for both effects.

However, some aspects of the data from Experiment 1 bear further consideration. First, although the mean latency of saccades directed towards either of the distractors was shorter than that of saccades directed towards the target (suggesting that the stimulus-driven activity associated with the physically salient distractor reached the saccade threshold faster than the goal-directed activity associated with selection of the less salient target; Godijn & Theeuwes, 2002; cf. Walker, Walker, Husain, & Kennard, 2000), there was no effect of reward value on saccade latencies to either the target or to the distractor (see Fig. 2). We return to this null finding in the General Discussion.

Second, the results of the time-course analysis of the VMOC effect on both-distractor trials were not clear-cut. Averaging across all deciles, significantly more first saccades was directed towards the high-value distractor than the low-value distractor. However, when considering just the shortest decile of saccade latencies, this trend reached only marginal significance. Furthermore, there was no significant interaction between the effect of distractor type and decile (i.e., the numerical trend suggesting a reduction in the influence of reward as decile increased did not reach significance, see Fig. 4). It is possible that the somewhat equivocal findings relating to time course reflect a lack of power in Experiment 1; participants experienced 126 both-distractor trials over the course of the whole experiment, meaning that the data for each decile of the saccade latency distribution correspond to only around 10–11 trials for each participant (given the data exclusions detailed above). Moreover, many of these trials would have occurred early in the experiment, before participants had much experience of the colour–reward relationships that drive the VMOC effect, thus reducing the power of the time-course analysis of this effect still further. We therefore ran a second experiment in which we increased the number of both-distractor trials, in order to enhance the sensitivity of the time-course analysis.

Experiment 2

Experiment 2 was effectively an extended version of Experiment 1, in which participants completed 1,260 trials—including 294 both-distractor trials—over the course of two sessions run on consecutive days (as compared to 714 trials—with 126 both-distractor trials—in a single session in Experiment 1). As the primary aim of Experiment 2 was to collect more data relating to saccades towards reward-related distractors, distractor-absent trials were removed. In an attempt to reduce the number of trials in which an anticipatory saccade was made, or in which no valid gaze samples were recorded in the centre of the screen at the start of the trial, the time between the offset of the fixation cross and the presentation of the search display was reduced to 150 ms, in keeping with previous studies that have used similar saccade latency analyses (e.g., Failing et al., 2015).