The pioneering work of Eriksen and Eriksen (1974) on visual attention introduced the flanker task, a procedure for investigating the limited capacity for controlling the spatial scope of visual attention. Studies continue to use the flanker task to study a wide range of phenomena, including perceptual grouping (Kramer & Jacobson, 1991), cognitive control (Nieuwenhuis et al., 2006), and the effect of arousal on performance (Schneider, 2018). In the present study, we use the flanker task to examine how action–outcome learning might influence attention to task-irrelevant distractors. To do so, we compare the effect of (anticipated) self-caused distractors with (unanticipated) externally caused distractors. We ask whether the influence of task-irrelevant distractors (“flankers”) can change once we learn the association between a distractor feature and the observer’s own action.

The reason the flanker task is suited for our purpose is that it provides a setting where targets and distractors are distinguished based on their locations. Examining the effectiveness of distractors that consistently appear at irrelevant locations, helps narrow down the interpretation of findings with respect to changes in stimulus salience, rather than changes in top-down strategy. Additionally, attributes of the flanker task, including possible variations in flanker color and target–flanker distance, could be made selectively predictable.

The present study is grounded in the insight that perception is not passive or reactive, but involves actively changing the sensory input. Recent studies have emphasized the difference between passively received stimuli and stimuli that have their antecedents in the observer’s own action (self-caused stimuli). These studies have demonstrated that human action, which is tied to anticipating sensory action–outcomes, influences perception (Bäß, Widmann, Roye, Schröger, & Jacobsen, 2009; Blakemore, Wolpert, & Frith, 2000; Bompas & O’Regan, 2006a, 2006b; Cardoso-Leite, Mamassian, Schütz-Bosbach, & Waszak, 2010; Gozli & Ansorge, 2016; Hommel, Müsseler, Aschersleben, & Prinz, 2001; Hughes & Waszak, 2011; Roussel, Hughes, & Waszak, 2013, 2014; Wykowska, Schubö, & Hommel, 2009). In explaining action–perception interface, forward models are frequently discussed. While preparing and performing an action involve generating a copy of the predicted (sensory) consequences of the motor command during action selection. The copy is referred to as an efference copy, which is a simulation or prediction of the consequence of the selected action (Miall & Wolpert, 1996; Wolpert, Ghahramani, & Jordan, 1995; Wolpert & Kawato, 1998). Other general mechanisms include ideomotor theory and predictive coding (Clark, 2013; James, 1890; Lotze, 1852; Rao & Ballard, 1999). These accounts characterize the close and necessary connection between perception and action, which in turn explains the reciprocal relation between the two.

A major phenomenon related to the action–perception interface is the sensory attenuation effect (Blakemore et al., 2000), referring to the observation that sensory events that match the prediction of self-induced actions are perceived as weaker, compared with unpredicted sensory events. This phenomenon has been described both as a neurophysiological effect and as a psychophysical effect (e.g., Aliu, Houde, & Nagarajan, 2009; Bäß , Jacobsen, & Schroger, 2008; Cardoso-Leite et al., 2010; Hughes & Waszak, 2011; Schafer & Marcus, 1973). The former describes a reduced neural response to self-caused events, whereas the latter refers to a reduced sensitivity in the subjective experience of self-caused events. An example from everyday life is that even very ticklish people can hardly tickle themselves (Blakemore, Wolpert, & Frith, 1998, 2000). Similarly, it has been reported that one’s own speech evokes a weaker electrophysiological response compared with the speech of another person (Numminen & Curio, 1999; Numminen, Salmelin, & Hari, 1999; see also Dignath, Kiesel, Frings, & Pastötter, 2019; Ford, Gray, Faustman, Heinks, & Mathalon, 2005; Ford, Mathalon, Heinks, et al., 2001a; Ford, Mathalon, Kalba, et al., 2001b; Heinks-Maldonado, Mathalon, Gray, & Ford, 2005; Heinks-Maldonado, Nagarajan, & Houde, 2006; Houde, Nagarajan, Sekihara, & Merzenich, 2002; Pfister, Heinemann, Kiesel, Thomaschke, & Janczyk, 2012; Schmidts, Foerster, & Kunde, 2018).

In the present study, we explore the phenomenon of sensory attenuation with the use of the flanker task, investigating whether attention is drawn less strongly toward self-caused and predictable distractors (in this case, “flankers”) compared with unpredictable distractors. Our purpose is to investigate the sensory attenuation phenomenon under different circumstances, while participants learn to associate different outcomes with their own actions. The typical finding of the flanker task is better performance with target–flanker compatible trials (faster and more accurate responses), relative to target–flanker incompatible trials, which indicates attention to the irrelevant flankers. Important for our purpose, a change in the flankers’ sensory salience should result in a change in the size of the flanker effect. That is, the difference between compatible and incompatible flankers should increase with more salient flankers (unanticipated), compared with less salient (anticipated, self-caused) flankers.

In each of the following Experiments 13, one feature of the flankers was associated with the action performed by participants. With an initial (“free-choice”) response that initiated the target–flanker stimuli, participants’ action determined a simple or complex feature of the flankers on that trial. By “simple” features, we refer to perceptual features that could be identified independently of the target on a particular trial. These include flanker color (Experiment 1) and target–flanker distance (Experiment 2). Of course, color and distance could be defined relative to the color or location of the target (Becker, 2013; Becker, Folk, & Remington, 2013), but since target color and target location were both fixed throughout the experiment, flanker color and target–flanker distance could be identified by looking only at the flankers. Thus, we could treat these features as relatively simple and nonrelational.

In contrast to the relatively simple features, a complex or relational feature requires identifying both the target and the flankers on each trial. In the present study, the complex feature was target–flanker compatibility. In Experiment 3, participants’ action determined target–flanker compatibility, which perhaps should not be considered as a sensory feature at all. Without sensory anticipation, there would be no sensory attenuation. As such, failing to find a learning effect despite the predictability of target–flanker compatibility would provide a boundary condition for the sensory attenuation phenomena.

Finally, in Experiment 4, we replaced participants’ free-choice response with the presentation of a visual cue (“X” vs. “O”). We reasoned that participants might be able to associate the letters with the subsequent flanker color (Itti & Baldi, 2009; Pearce & Hall, 1980). With Experiment 4, we aimed to see whether sensory attenuation would still occur in the absence of action, given that there is some evidence against such a possibility (Cardoso-Leite et al., 2010; Logan & Zbrodoff, 1982).

All four experiments included an acquisition phase (Blocks 1–3; 600 trials) and a test phase (Block 4; 200 trials). In each experiment, we linked only one flanker feature to the participants’ actions, while other features varied randomly. The action–outcome contingency was constant in the acquisition phase, and it was reversed in the test phase. In line with previous observations, we predicted that participants would learn the particular contingency, which would be reflected in (1) a reduced flanker effect across Blocks 1–3 (sensory attenuation) and (2) an increased flanker effect in Block 4, when the action–outcome contingencies are reversed, due to violating the acquired expectations (sensory amplification).

Given the use of free-choice responses (which were associated with a distractor feature), we should consider the possible role of choice probabilities. In particular, in a setup where the free-choice response determines the color of flankers (Experiment 1), favoring of one response over the alternative (e.g., 60%) might produce a general predictability of the corresponding flanker feature (e.g., flankers are 60% likely to be green), which is then reversed in the test phase. In Experiment 3, where the free-choice response determines target–flanker compatibility, some participants might favor the response that results in target–flanker compatibility, over incompatibility. If so, then target–flanker compatible trials become more frequent than incompatible trials in the acquisition phase, which is then reversed in the test phase. This leads to an alternative interpretation based on relatively long-term predictability of events, which differs from our interpretation based on learning specific response–outcome associations. Unfortunately, we could not include choice probabilities in the current analyses because we did not have access to the necessary software.Footnote 1 But given the availability of the data on Open Science Framework (https://osf.io/x28hy), follow-up analyses of choice probabilities and their impact is possible.

General method

Participants

We recruited a total of 157 University of Macau undergraduate students for the experiments in exchange for course credit (0.5 per experiment) or coupons usable at a local café. Based on the effect sizes observed in previous studies (Cardoso-Leite et al., 2010; Gozli & Ansorge, 2016; Gozli, Aslam, & Pratt, 2016a; Gozli, Huffman, & Pratt, 2016b), we estimated that recruiting 20 or more participants in each experiment would be sufficient (α = .05, 1- β = .8), for detecting a simple contrast between the condition that confirms a learned association (e.g., flanker effect in Block 3 in the present study) and the condition that violates the learned association (e.g., flanker effect in Block 4 in the present study). Previous studies with comparable procedures had recruited 16-26 participants per experiment (Cardoso-Leite et al., 2010; Gozli & Ansorge, 2016; Gozli et al., 2016a, b). Compared with previous studies, we extended our acquisition phase, and consequently, we expected our experiments to offer a robust test of the hypotheses. In addition, we decided to recruit at least 36 participants per experiment. We did not dismiss any participants who had already registered for an experiment, so we continued admitting participants who had signed up, even after reaching our minimum requirement. Moreover, we had to later exclude five participants from analysis due to their failure to follow task instructions (i.e., either selecting one of the “free-choice” responses too frequently or following an obvious pattern of responding; see Naefgen & Janczyk, 2018). The final analyses included 36, 34, 44, and 38 participants, respectively, for Experiments 14. All participants reported normal or corrected-to-normal vision and were unaware of the purpose of the study. All experimental protocols were approved by the local ethics committee at the Faculty of Social Sciences of the University of Macau. All data are available on the Open Science Framework (https://osf.io/x28hy).

Apparatus and stimuli

Participants performed the task in a well-lit room. Stimuli were presented on a 21-in LCD monitor, set at a 1,920 × 1,080 resolution and a refresh rate of 60 Hz. Participants sat at the viewing distance of about 55–75 cm from the monitor, and we did not control their viewing distance with a head/chin rest. For this reason, we have described the properties of the stimuli in centimeters rather than degrees of visual angle. The display structure and the sequence of events in a sample trial are shown in Fig. 1. A trial began with the presentation of a fixation cross (RGB: 250, 250, 250; 0.8 × 0.8 cm). Targets and flankers consisted of arrowheads (“>” and “<”) that could each fit in a 0.5 × 0.5 cm rectangle. On each trial, a target and two flankers were presented together, fitting in a 3.0 × 0.5 (for “near” flankers) or 5.5 × 0.5 cm (“far” flankers) rectangle. The target’s color appeared dark green (RGB: 50, 50, 10). The flankers’ colors were red (RGB: 255, 0, 0) or green (RGB: 0, 130, 0). All stimuli were presented against a black background (RGB: 0, 0, 0). The target was consistently presented at the display center, and flankers appeared to the left and right of the target. The two flankers on a given trial always had the same color.

Fig. 1
figure 1

Sequence of events on a sample trial of the acquisition phase

Participants performed two responses per trial. First, was the “free-choice” response that initiated the onset of the target–flanker stimuli and was linked to a particular visual effect. This initial response was performed using the index and the middle fingers of the left hand with the “A” and “Q” buttons on a QWERTY keyboard. We associated left-hand responses with action outcomes based on prior research suggesting that action–outcome learning may be stronger with the nondominant hand (Melcher, Weidema, Eenshuistra, Hommel, & Gruber, 2008; Melcher et al., 2013). The second response was performed with the right hand, using the left and right arrow keys (in response to the “<” and “>” targets, respectively).

Each experiment consisted of 10 practice trials and 800 experimental trials. Experimental trials were divided into four blocks (200 trials each). The first three blocks (600 trials) comprised the “acquisition” phase, and the final 200 comprised the “test” phase. After every 100 trials, participants had an opportunity to take a self-paced break.

Procedure: I. Acquisition phase

Each trial of the acquisition phase began with the presentation of a fixation cross at the display center. We instructed participants to press either the “Q” or the “A” upon noticing the cross. We asked participants to choose their left-hand responses spontaneously and try to avoid predictable patterns (e.g., “AQAQA . . .”; Naefgen & Janczyk, 2018). There is evidence suggesting that action–outcome associations might be acquired and/or recruited more robustly with this type of response (Gozli et al., 2016b; Pfister, Kiesel, & Hoffmann, 2011; Pfister, Kiesel, & Melcher 2010; see also, Gozli, 2019; Huffman, Gozli, Hommel, & Pratt, 2019; Wolfensteller & Ruge, 2014). If participants pressed a key other than “Q” or “A,” or if they pressed more than one key, they would receive visual feedback (“MISTAKE”) and no target–flanker display followed. Immediately after a correct first response was recorded (“Q” or “A”), the flankers appeared at the left and right of the center. The target appeared after a 100-ms delay following the presentation of the flankers. We introduced the 100-ms flanker–target onset asynchrony to increase the chance of the flankers being attended and influencing performance. We reasoned that with a robust baseline flanker effect we would more easily obtain modulations of the flanker effect.

For the second response during the trial (i.e., the target-directed response), we instructed participants to identify the target orientation (“>” = right, “<” = left) using the left/right arrow keys on the keyboard. In all the following experiments, trials differed with respect to target–flanker compatibility, flanker color (red vs. green), and target–flanker distance (near vs. far). The flankers and target remained on the screen until a response was recorded. Upon pressing an incorrect key, or more than one key, participants received a visual feedback (“MISTAKE”). The next trial began after the response for orientation was recorded or a visual feedback was presented.

Experiments 13 differed with respect to the outcome of the first (left-hand) response. Depending on the experiments, the left-hand response determined the flanker color (Experiment 1), target–flanker distance (Experiment 2), or the target–flanker compatibility (Experiment 3). The final Experiment 4, which served as a control experiment to Experiment 1, the first (left-hand) response was removed, and flanker colors were associated with visual cues (“X” and “O”). Participants were not informed at the beginning about the action–outcome or stimulus–outcome contingencies. During informal debriefing conversations with the participants, we found that they were either unaware of the contingencies or had a vague idea about it, which they did not further explore while performing the task.

Procedure: II. Test (reversal) phase

In the test phase, the action–outcome contingencies were reversed. For example, if the action determined the flanker color (“Q” ➔ red, “A” ➔ green), the mapping between the response and the color were reversed in the test phase (“Q” ➔ green, “A” ➔ red). The stimuli and the order of events were otherwise the same as the acquisition phase. Participants were not told that they were entering a different phase of the experiment. Indeed, we found during debriefing conversations that the participants were either unaware of the change occurring in the test (reversal) phase or had a vague idea, to which they did not pay much attention.

Experiment 1

Method

Thirty-six participants performed in the experiment (26 females; mean age = 21 years, SD = 2.71 years). In Experiment 1, the first response of each trial determined flanker colors (during the acquisition phase: red after “Q” and green after “A”). Action–outcome associations were reversed in the test phase. Target–flanker distance (near vs. far) and target–flanker compatibility (compatible vs. incompatible) were pseudorandomized and equiprobable.

Results

Response time

RT data were submitted to a 4 × 2 × 2 repeated-measures analysis of variance (ANOVA), with the factors block, target–flanker distance, and target–flanker compatibility (see Table 1). The analysis revealed main effects of block, distance, and compatibility. The main effect of block indicates faster responses on the final block (M ± SE for Blocks 1–4, respectively, 463 ± 13, 461 ± 12, 460 ± 10, and 448 ± 11 ms), showing a benefit of practice. The main effect of distance indicates faster responses with far flankers (M ± SE = 456 ± 11 ms) than with near flankers (460 ± 11 ms). The main effect of compatibility shows faster responses on target–flanker compatible trials (415 ± 12 ms) compared with incompatible trials (502 ± 11 ms).

Table 1. Results of ANOVAs on response time (RT) and percentage error (PE) in Experiment 1

In addition to the main effects, we found an interaction between distance and compatibility, which shows a larger flanker effect with near flankers (flanker effect = 100 ± 4 ms, dz = 4.14) compared with far flankers (74 ± 4 ms, dz = 3.12). Most importantly for our purpose, we found a Block × Compatibility interaction (see Fig. 2). This interaction shows that the flanker effect decreased from the first block to the third block, which might have partly been due to acquiring the consistent action–outcome contingencies (sensory attenuation of predicted colors). The flanker effect then increased on the final block, which could suggest the cost of violating the previously learned action–outcome contingencies (sensory amplification of unpredicted colors). We should note that the increased flanker effect in Block 4 happened despite the general practice effect on improving performance. These results are consistent with the effect of action–outcome learning on attention to the flankers.

Fig. 2
figure 2

Response time (RT) and percentage error (PE) data from Experiment 1, graphed as a function of target–flanker compatibility (compatible vs. incompatible), block (Blocks 1–3: acquisition of a consistent action–outcome association; Block 4: reversal of the acquired association). Error bars represent 95% within-subjects confidence intervals (Cousineau, 2005; Morey, 2008)

Moreover, we found a three-way interaction, which can be described with reference to our hypothesis. The increase in the flanker effect from Block 3 to Block 4 was slightly more pronounced with far flankers (flanker effect changing from 67 to 77ms) than with near flankers (from 90 to 98 ms). This might mean that violating predictions about action–outcome contingencies had a weaker effect when the flankers were physically more salient (near flankers) than when they were physically less salient (far flankers).

Percentage error

PE data were submitted to the same repeated-measures ANOVA (see Table 1), which revealed main effects of block, distance, and compatibility. The main effect of block indicates a trend of decreasing errors over Blocks 1–3 (mean PEs, respectively, 5.27%, 3.04%, and 2.70%,), and an increase in PE on Block 4 (M = 3.21%). The main effect of distance shows smaller PE with far flankers (M = 2.70%) than with near flankers (M = 4.41%). The main effect of compatibility indicates smaller PE on target–flanker compatible trials (M = 0.69%) than incompatible trials (M = 6.42%).

Additionally, we found a Block × Distance interaction, which shows an increase in the effect of distance over Blocks 1–4 (respectively, 0.36%, 2.09%, 1.81%, and 2.59%), and a Distance × Compatibility interaction, due to a large flanker effect with near flankers (flanker effect = 7.51%, dz = 1.12) compared with far flankers (3.95%, dz = 0.86). Most important for our purpose, we found an interaction between block and compatibility (see Fig. 2), which shows a decrease in flanker effect across Blocks 1–3 (respectively, 8.52%, 5.36%, and 3.51%) and an increase in the flanker effect on Block 4 (M = 5.53%). These are consistent with the RT results.

We also found a three-way interaction, which we could describe with reference to our hypothesis. The increase in the flanker effect from Block 3 to Block 4 was more pronounced with near flankers (from 5.55% to 8.01%) than with far flankers (from 1.47% to 3.05%). This might mean that the violating predictions about action–outcome contingencies had a stronger effect when the flankers were physically more salient (near flankers) than when they were physically less salient (far flankers).

Experiment 2

Method

Thirty-six participants performed in this experiment (26 females; mean age = 19 years, SD = 1.22). Two participants were excluded due to the failure to follow task instructions. In Experiment 2, the first response of the trial determined target–flanker distance (during the acquisition phase: near flankers after “Q” and far flankers after “A”). Action–outcome associations were then reversed in the test phase. Target–flanker color (red vs. green) and target–flanker compatibility were pseudorandomized and equiprobable.

Results

Response time

RT data were submitted to a 4 × 2 × 2 repeated-measures ANOVA, with factors block, target–flanker distance, and target–flanker compatibility (see Table 2).

Table 2. Results of ANOVAs on response time (RT) and percentage error (PE) in Experiment 2

The analysis revealed main effects of block, distance, and compatibility. The main effect of block indicates a benefit of practice, with relatively faster responses on Block 4 (M ± SE for Blocks 1–4, respectively, 444 ± 7, 445 ± 6, 437 ± 5, and 425 ± 5 ms). The main effect of distance indicates faster responses with far flankers (M ± SE = 436 ± 5 ms) than with near flankers (440 ± 5 ms). The main effect of compatibility shows faster responses on target–flanker compatible trials (398 ± 6 ms) than incompatible trials (478 ± 5 ms).

In addition to the main effects, we found a Distance × Compatibility interaction, due to a larger flanker effect with near flankers (flanker effect = 90 ± 26 ms, dz = 3.46) compared with far flankers (69 ± 22 ms, dz = 3.17). Most important for our purpose, we found an interaction between block and compatibility (see Fig. 3). From Block 1 to Block 3, the flanker effect showed a decreasing trend, which could suggest the benefit of consistent action–outcome contingencies. On the final block, however, the flanker effect increased, presumably showing a cost of violating the learned action–outcome contingencies.

Fig. 3
figure 3

Response time (RT) and percentage error (PE) data from Experiment 2, graphed as a function of target–flanker compatibility (compatible vs. incompatible), block (Blocks 1–3: acquisition of a consistent action–outcome association; Block 4: reversal of the association). Error bars represent 95% within-subjects confidence intervals (Cousineau, 2005; Morey, 2008)

Percentage error

The error data were submitted to the same repeated-measures ANOVA (see Table 2), which revealed main effects of block, distance, and compatibility. The main effect of block indicates a trend of decreasing errors over Blocks 1–3 (respectively, 4.25%, 2.90%, and 2.85%), and an increase in errors on Block 4 (M = 3.94%). The main effect of distance shows smaller PE with far flankers (2.67%) than with near flankers (4.31%). The main effect of compatibility indicates smaller PE on target–flanker compatible trials (1.12%) than on incompatible trials (5.85%).

Additionally, we found a Block × Distance interaction, which indicates an increase in the effect of distance over Blocks 1–4 (respectively, 0.01%, 1.09%, 2.30%, and 3.14%), and a Distance × Compatibility interaction, due to a larger flanker effect with near flankers (flanker effect = 6.53%, dz = 1.19) compared with far flankers (2.94%, dz = 0.84). Most important for our purpose, we found an interaction between block and compatibility (see Fig. 3), which shows reduction of flanker effect across Blocks 1–3 (respectively, 6.43%, 3.81%, and 2.36%) and an increase in the flanker effect on Block 4 (6.34%).

We also found a three-way interaction, which we could describe in terms of our hypothesis. The increase in the flanker effect from Block 3 to 4 was more pronounced with near flankers (from 3.3% to 9.6%) than with far flankers (from 1.4% to 3.1%). This could mean that violating the previous associations between actions and outcomes had a stronger effect when the flankers were physically more salient (near flankers) than when they were physically less salient (far flankers).

Experiment 3

Method

Forty-six participants performed in this experiment (33 females; mean age = 19 years, SD = 1.18 years). Two participants were excluded due to the failure to follow task instructions. The first response of the trial determined target–flanker compatibility (during acquisition, “Q” ➔ compatible, “A” ➔ incompatible). Action–outcome contingencies were reversed in Block 4. Flanker color (red vs. green) and target–flanker distance (near vs. far) were pseudorandomized and equiprobable.

Results

Response time

RT data were submitted to a 4 × 2 × 2 repeated-measures ANOVA, with factors block, target–flanker distance, and target–flanker compatibility (see Table 3). The analysis revealed main effects of block, distance, and compatibility. The main effect of block indicates increasingly faster responses with performance (M ± SE for Blocks 1–4, respectively, 457 ± 7, 456 ± 8, 456 ± 6, and 445 ± 5 ms). The main effect of distance indicates faster responses with far flankers (M ± SE = 451 ± 6 ms) than with near flankers (456 ± 6 ms). The main effect of compatibility indicates faster responses on target–flanker compatible trials (408 ± 6 ms) than incompatible trials (499 ± 7 ms).

Table 3. Results of ANOVAs on response time (RT) and percentage error (PE) in Experiment 3

In addition to the main effects, we found an interaction between distance and compatibility, due to a larger flanker effect with near flankers (flanker effect = 104 ± 4 ms, dz = 3.73) compared with far flankers (78 ± 4 ms, dz = 3.20). Most important for our purpose, we found no interaction between block and compatibility (see Fig. 4). In fact, the trend shown in Fig. 3 suggests a reduction in the flanker effect on Block 4, inconsistent with action–outcome learning.

Fig. 4
figure 4

Response time (RT) and percentage error (PE) data from Experiment 3, graphed as a function of target–flanker compatibility (compatible vs. incompatible), block (Blocks 1–3: acquisition of a consistent action–outcome association; Block 4: reversal of the acquired association). Error bars represent 95% within-subjects confidence intervals (Cousineau, 2005; Morey, 2008)

Percentage errors

PE data were submitted to the same 4 × 2 × 2 repeated-measures ANOVA (see Table 3). The analysis revealed main effects of distance and compatibility. The main effect of distance shows larger PE with near flankers (M = 4.61%) than far flankers (M = 3.12%). The main effect of compatibility shows smaller PE on target–flanker compatible trials (M = 1.38%) than on incompatible trials (M = 6.35%). Additionally, we found an interaction between distance and compatibility, which indicates a larger flanker effect (flanker effect = 6.46%, dz = 1.33) on near flanker trials than on far flanker trials (3.48%, dz =1.06). Matching the RT results, we did not find an interaction between block and compatibility (see Fig. 4).

Experiment 4

The aim of this experiment was to test whether action is unique in serving as a source of perceptual prediction, at least in our flanker task, or whether action–outcome associative learning represents one possible way for perceptual prediction (e.g., Clark, 2013; Press, Kok, & Yon, 2019, 2020). We replaced the first (left-hand) response with a perceptual cue (“X” vs. “O”). Otherwise, Experiment 4 was similar to Experiment 1, particularly since the flanker color was the predictable feature. Although not providing any instructions regarding the visual cues might render them ineffective (Logan & Zbrodoff, 1982), this is consistent with how free-choice responses were introduced to the participants in the other experiments (i.e., without drawing attention to their predictive value).

Method

Thirty-nine participants performed in this experiment (17 females, mean age = 19 years, SD = 0.99 years). One participant was excluded due to the failure to follow task instructions. Participants perform only one response per trial. The experiment started with a gray-green visual cue (“X” or “O”) presented at the display center for 1,000 ms. Upon the automatic offset of the cue, colored flankers appeared followed by, after a 100-ms delay, the target onset. Similar to Experiments 13, participants were instructed to respond to the target with left/right arrow key. Visual cues (“X”/“O”) were pseudorandomized and equiprobable. The cue predicted flanker color on each trial with 100% reliability (during the acquisition phase: “X” ➔ red flankers, “O” ➔ green flankers). The cue–color associations were reversed in the test phase (i.e., Block 4). Target–flanker distance (near vs. far) and target–flanker compatibility were pseudorandomized and equiprobable.

Results

Response time

RT data were submitted to a 4 × 2 × 2 repeated-measure ANOVA, with factors block, target–flanker distance, and target–flanker compatibility (see Table 4). The analysis revealed main effects of block and compatibility. The main effect of block indicates relatively faster responses on Block 4 (M ± SE for Blocks 1–4, respectively, 486 ±10, 488 ± 11, 478 ± 9, and 465 ± 7 ms). The main effect of compatibility shows faster responses with target–flanker compatible trials (442 ± 8 ms) than target–flanker incompatible trials (517 ± 8 ms).

Table 4. Results of ANOVAs on response time (RT) and percentage error (PE) in Experiment 4

Besides the main effects, we found an interaction between distance and compatibility, with larger flanker effect with near flankers (flanker effect = 85 ± 4 ms, dz = 3.20) compared with far flankers (65 ± 4 ms, dz = 2.95). Consistent with our hypothesis, we found Block × Compatibility interaction (see Fig. 5), which shows decreasing flanker effect over Blocks 1–3 (flanker effect for Blocks 1–3, respectively, 87 ± 5, 74 ± 6, and 66 ± 6 ms), and an increase in the flanker effect on Block 4 (74 ± 6 ms).

Fig. 5
figure 5

Flanker effect in response time (RT) data and percentage error (PE) data from Experiment 1, graphed as a function of block (Blocks 1–3: acquisition of a consistent cue–outcome association; Block 3: reversal of the acquired association). Error bars represent 95% within-subjects confidence intervals

Percentage error

PE data were submitted to the same repeated-measure ANOVA (see Table 4) that revealed main effects of distance and compatibility. The main effect of distance shows larger PE with near flankers (M = 2.5%) compared with far flankers (M = 2.1%). The main effect of compatibility indicates larger PE with target–flanker incompatible trials (M = 3.8%) than target–flanker compatible trials (M = 0.8%). In addition to the main effects, we found a Block × Distance interaction, indicating that the effect of distance increased over time (respectively, for Blocks 1–4, −0.10%, 0.19%, 0.62%, and 0.96%). We also found an interaction between distance and compatibility, indicating larger flanker effect with near flankers (flanker effect = 3.71%, dz = 0.88) compared with far flankers (2.44%, dz = 0.65). We did not find a Block × Compatibility interaction. But as the trend in Fig. 5 shows, the flanker effect tended to decrease over Blocks 1–3, and it tended to increase on Block 4, consistent with our hypothesis, although this trend did not reach statistical significance. Nonetheless, the PE trend suggests that the RT results cannot be due to speed–accuracy trade-off.

General discussion

We investigated the phenomenon of sensory attenuation under different conditions using the flanker task. Participants’ action (Experiments 13) or a visual cue (Experiment 4) was associated with one feature of the flankers. We found that both action–outcome contingency and cue–stimuli association can influence the effect of distractors, reflected in a reduced flanker effect with predicted flankers, during the acquisition phase, and an increased flanker effect with unpredicted flankers during the test phase.

In Experiments 1 and 2, color and distance of the flankers were associated with participants’ free-choice key press, respectively. The results showed that when actions were associated with relatively simple features of the distractors (i.e., color or distance), the flankers became less distracting as the associations were acquired (Blocks 1–3) and became more distracting when the association were violated (Block 4). In Experiment 3, participants’ initial key press determined target–flanker compatibility. Unlike Experiments 1 and 2, we did not find an increase of flanker effect from Block 3 to Block 4. Instead, RTs continued to decrease over the four blocks. These results indicate that action–outcome learning and its effect on distractor processing depends on the type of outcome. However, we could not, based on the null results of Experiment 3, rule out the possibility of learning with relatively complex feature, especially given that the accuracy data in Experiment 3 showed a statistically nonsignificant trend similar to the other experiments. Perhaps with a larger sample size or a more extensive learning phase, or perhaps with explicit instruction given to the participants, the same results could be observed in a variant of Experiment 3. Nonetheless, it is likely that we found a boundary condition for the sensory attenuation phenomenon. Since the predictable events in Experiment 3 could not be described as sensory events, it is rather unsurprising that an attenuation (or amplification in Block 4) was not observed. Finally, in Experiment 4, visual cues (“X” vs. “O”) were associated with the color of flankers. We found a pattern of decrease and increase in the flanker effect similar to Experiments 1 and 2, which suggests that learning to anticipate distractor features is possible without action–outcome learning (Press et al., 2019, 2020).

In Experiments 13, where a stimulus feature was associated with the free-choice response, questions regarding choice probabilities arise. Were participants more likely to choose the response that led to a distant flanker? Did they favor the response that led to target–flanker compatible trials? Assuming that participants acquired, to some degree, the response–outcome contingencies, we should also consider whether they used those contingencies strategically. In general, it is worth investigating whether choice probabilities, causing relatively long-term predictability (e.g., flankers more frequently being green, distant, or compatible with the target), have influenced the results in an unforeseen manner. Being unable to include choice probabilities in the current analyses reflects an important weakness in the present report, and an examination of possible biases in follow-up analyses of the present data—and in future studies—choice probabilities and their influence would be worthwhile and informative.

Experiments 34 have an implication regarding the possibility that the learning effect might have resulted from a strategic use of the free-choice responses. Such a strategy would involve frequent selection of one choice and producing predictability of flankers in the long term (e.g., selecting one of the two free-choice response on 70% of trials would mean that the flankers become predictable, though in a way that differs from our trial-by-trial descriptions; see Logan & Zbrodoff, 1979). Since this strategy was available in Experiment 3, where we did not observe an effect, and unavailable in Experiment 4, where we did find an effect, such a strategic use of the free-choice responses seems to be an unlikely alternative to the explanation based on associative learning. Nevertheless, this question merits a closer examination in future studies.

Our findings are consistent with various theoretical accounts of sensory attenuation (e.g., Cardoso-Leite et al., 2010; van Moorselaar & Slagter, 2019; Waszak, Cardoso-Leite, & Hughes, 2012; Yon, Gilbert, de Lange, & Press, 2018; Yon, Zainzinger, de Lange, Eimer, & Press, 2020). We could interpret the findings of Experiments 13 as an indication that acquiring action–outcome association is difficult in the case of relatively complex, relational features (perhaps requiring a more extensive learning phase) if at all possible. Alternatively, such a learning might require explicit instruction related to the contingencies, such that participants look for, and keep track of, the action–outcome associations. This possibility requires further research.

Theoretical accounts

Our findings cannot discriminate between major competing accounts of sensory attenuation, though we can still connect the present observations to more general theoretical issues. First, there is the possible link between sensory attenuation and stimulus intensity. While theoretical accounts of sensory attenuation predict reduced perceived intensity for action outcomes, their predictions depend on stimulus intensity (Roussel et al., 2013). That is because high-intensity stimuli are more likely to saturate sensory neurons, eliminating the difference made by sensory preactivation. By asking participants to discriminate stimuli with relatively low and high contrast values, Roussel et al. (2013) found larger sensory attenuation effect for low-contrast stimuli compared with high-contrast stimuli. In this regard, our findings seem ambiguous. The decrease and increase in the size of the flanker effect, as a function of learned associations, was more pronounced for far flankers (RT data in our Experiment 1, consistent with Roussel et al., 2013), though this pattern was not replicated in Experiments 2 and 4.

A preactivation account of sensory attenuation—assuming that action selection activates the anticipated sensory outcomes of the action—predicts a selective advantage for action-congruent stimuli in terms of speed of selection, which is thought to results in phenomena such as intentional binding (Waszak et al., 2012) and faster selection of action-congruent stimuli when they are relevant to task (i.e., serve as the target of visual search; Gozli & Ansorge, 2016; Gozli et al., 2016a; see also, Yon et al., 2018; Yon & Press, 2017).

A recent example of how attentional selection of action-congruent stimuli can be facilitated was reported by Nakashima (2019), who associated participants’ actions with stimulus movement. Nakashima found a bias toward the location that was associated with the participants’ action. Not only were participants more likely to attend to the location of their expected action–outcome, but there was also a link between the strength of bias toward the expected outcome and the participant’s sense of agency (more sense of agency over action–outcome was associated with more bias toward it). It should be noted that the location of the moving stimulus in Nakashima’s study was always, at least potentially, a target location. That is different from the present study, where participants were always instructed to ignore the flankers’ location. Important for the theoretical accounts, the present study and Nakashima’s study can both be reconciled with the preactivation account, which allows for both advantage and disadvantage of action–outcomes, depending on their task relevance (Gozli et al., 2016a).

There is also evidence that reflects the existence of two complementary processes, including both inhibition and preactivation, affecting our perception of action outcomes. Such evidence was provided by Yon and Press (2017), who asked participants to judge the brightness of the action-congruent and action-incongruent stimuli. These stimuli were presented at different delays after actions. The authors found that the intensity of the action-congruent stimuli, compared with the action-incongruent stimuli, increased at the early stage and decreased after a delay. Similarly, Gozli and Ansorge (2016) found bias both for and against a predicted outcome, when they varied the delay between the action and the outcome. These findings suggest that the processing advantages and disadvantages of action–outcomes could be reconciled within a unified framework that includes aspects of the preactivation and the inhibition accounts (e.g., Press et al., 2019, 2020; van Moorselaar & Slagter, 2019; Waszak et al., 2012; Yon & Press, 2017, 2018; Yon et al., 2018; Yon et al., 2020).

Relational features

Different from our Experiments 12, in Experiment 3 a relation between features was associated with participants’ action. The term “feature” might seem inappropriate here (Wolfe & Horowitz, 2017), given that participants’ action determined the relationship between target and flankers, while simple features in Experiments 1 and 2 (color and distance) were independent of the target. In Experiment 3, the flanker effect was not affected by the reversal of association in Block 4. Not observing an increase in the flanker effect in Experiment 3 is evidence against action–outcome contingency learning.

According to the relational theory of attention, visual attention is guided by relative attributes, but not absolute feature value, and targets’ features are not processed independently, but how they are different from the background features (Becker, 2013). According to the logic of relational features, all features can be regarded as in relation with the context: relative color difference, relative distance, and the relative orientation. Does a relational theory of perceptual features predict, in the case of Experiment 3, similar result to those of Experiments 1 and 2?

It is worth pointing out that, in Experiments 12, the varying flanker features were in relation to a constant reference (i.e., target color, target location), while in Experiment 3, the “reference” feature (i.e., target orientation) could itself vary on each trial. Thus, we can describe all three types of “features” as relationships, though with references that are themselves constant or changing. The target–flanker compatibility, if it is to be considered a feature at all, has to do with the relationship between target orientation and flanker orientation, whose identification occurs at a later point in time (relative to simpler features). Target orientation and flanker orientation, accordingly, cannot join and be detected, along a single feature map similar to color or distance (Treisman & Gormican, 1988).

When features are detected against a constant background, we could think of them as “simple” features, which can preattentively guide selective attention, while complex features may not operate the preattentive visual attention (Treisman & Gelade, 1980; Treisman & Gormican, 1988; Wolfe, 2005). In other words, detecting simple features rely on an early-stage process that is susceptible to preactivation, attenuation (for predicted features), or amplification (for unpredicted features). This is consistent with the idea that sensory attenuation reflects early, low-level mechanisms (Roussel et al., 2014). Also relevant are the findings that the attenuated brain response started as early as 27–33 ms after stimulus onset, which reveal early low-level auditory processing of predicted stimuli and confirm early-stage operation of the forward model and the efference copy (Bäß et al., 2009; Wolpert & Flanagan, 2001).

Action–outcome learning versus associative learning

In our control Experiment 4, we found a similar pattern of results to Experiments 12—namely, decreased flanker effect over Blocks 1–3 and an increase when the contingencies were reversed in the last block. These results suggest that perceptual associative learning can induce a similar bias that arises from action–outcome learning. This seems inconsistent with previous finding that attentional bias to action-related stimuli is different from expectation in general, and is linked to a sense of agency over predicted stimuli (Nakashima, 2019). The findings are also inconsistent with the findings of Weiss and Schütz-Bosbach (2012), who designed an experiment with three conditions: (1) self-generated action condition, (2) other-generated-unanticipated condition, and (3) other-generated-anticipated condition. They found that attenuation only in self-generated action condition. Moreover, Cardoso-Leite et al. (2010) conducted an experiment similar to the present control experiment. Rather than associating two visual stimuli, as in the present study, they associated Gabor orientation with high-pitch and low-pitch tones. They found no evidence of learned association between tones and Gabor stimuli, which they interpreted as a disadvantage for perceptual learning compared with action–outcome learning.

Conclusion

The present study shows that learning action–outcome contingencies can change distractor processing, though this learning does not apply equally to all types of outcome. While action–outcome contingencies with relatively simple features (the color & distance of the flankers) resulted in modulation of the flanker effect (indicating learned associations), the same was not observed with a relatively complex, relational feature (target–flanker compatibility). Finally, we found a similar pattern of associative learning with cue–outcome contingencies, which suggests, at least in the case of the present task, actions might not have a special status in associative learning.