Introduction

Searching for a visual target in a dense forest or a busy city is very challenging. A large number of distractors often put our limited attentional capability to the test. Nevertheless, searching difficulty could be greatly reduced if the critical features of the target are foreknown and stored in working memory (WM) as search templates (Bundesen, 1990; Desimone & Duncan, 1995; Wolfe, 1994). Sometimes, apart from information about the target, we could also obtain information about the to-be-ignored distractors. This raises the question that whether we could avoid paying attention to these distractors during the search. Previous studies have demonstrated that WM-matched distractors attract attention even when the participants were informed that those distractors would never be the target (Downing, 2000; Hollingworth & Beck, 2016; Mannan, Kennard, Potter, Pan & Soto, 2010; Olivers, Meijer, & Theeuwes, 2006; Soto, Heinke, Humphreys, & Blanco, 2005; Soto & Humphreys, 2007, 2009; for a review, see Soto, Hodsoll, Rotshtein, & Humphreys, 2008). According to these findings, top-down control may not be able to override the involuntary capture by WM-matched distractors.

Nonetheless, another line of research has provided evidence that was inconsistent with the uncontrollable nature of attentional capture by WM-matched distractors. These results have demonstrated that simply holding an item in WM was not sufficient for biasing attention toward the perceptual inputs that matched its critical feature (Dalvit & Eimer, 2011; Carlisle & Woodman, 2011; Houtkamp & Roelfsema, 2006; Peters, Goebel, & Roelfsema, 2009; Olivers, 2009; Lu et al., 2017; for a review, see Olivers, Peters, Houtkamp, & Roelfsema, 2011). Furthermore, several studies have shown that WM representations could be used to inversely guide attention away from the WM-matched distractors (Arita, Carlisle, & Woodman, 2012; Cunningham & Egeth, 2016; Downing & Dodds, 2004; Han & Kim, 2009; Woodman & Luck, 2007) and this effect was facilitated by the rewarding history of the WM representations (Gong, Yang, & Li, 2016). These studies argued that the deployment of attention to the WM-matched items could be strategically controlled to meet task requirements (Geng, 2014; Kiyonaga, Egner, & Soto, 2012; Woodman, Carlisle, & Reinhart, 2013). Therefore, two distinct phenomena concerning the WM guidance on attention have been suggested. Attention could either be captured by or guided away from the WM-matched distractors, resulting in opposite behavioral consequences. Identifying the critical factors that could explain the controversy about the effects of WM guidance in attention control is a necessary step towards further understanding of WM-attention interaction.

In most studies that observed successful suppression of WM-matched distractors, the cue that indicated the to-be-ignored features in visual search remained unchanged within a block of trials (i.e., the fixed cue condition) (Cunningham & Egeth, 2016; Gaspelin, Leonard, & Luck, 2015, 2017; Moher, Lakshmanan, Egeth, & Ewen, 2014; Noonan et al., 2016; Vatterott & Vecera, 2012). Of particular relevance to the present study, Cunningham and Egeth (2016) have found that participants could use the cued to-be-ignored features to facilitate their search only in the fixed cue condition as compared with the trials with a neutral cue. However, there was no reaction time (RT) benefit when the cued to-be-ignored features varied from trial to trial (i.e., the varied cue condition). They attributed the absence of facilitation effect in the varied cue condition to the failure of forming a stable representation of the to-be-ignored feature in WM named as rejection template (Arita et al., 2012; Cunningham & Egeth, 2016; Woodman & Luck, 2007). Meanwhile, utilizing such rejection template requires the involvement of cognitive control (Han & Kim, 2009). Hence, cognitive control could be a critical factor that accounts for the discrepancy between the fixed cue and varied cue conditions, because of its involvement in both the creation and execution of the rejection template. In the varied cue condition where WM representations are frequently switched between trials, suppression benefits might occur when the actual response time is long enough for the completion of effective cognitive control. Thus, in the present study, our first aim was to elucidate the influence of the available cognitive control on distractor suppression in the varied cue condition.

Meanwhile, recent studies have suggested that multiple memory systems that include WM and LTM contribute to attentional control (Hutchinson & Turk-Browne, 2012; Woodman & Chun, 2006). This proposal was supported by experimental investigations in which the content of LTM has been shown to optimize perceptual sensitivity (Stokes, Atherton, Patai, & Nobre, 2012) and induce attentional capture (Fan & Turk-Browne, 2016; Olivers, 2011). Nevertheless, the interaction between the predictive information in LTM and distractor suppression remained less explored. Recently, Gong and colleagues have shown that previously learned reward-feature associations could facilitate distractor suppression when the reward-associated feature always served as physically salient distractor (Gong, Jia, & Li, 2017). Despite not being designed to directly test the modulation effect of the LTM representations in distractor suppression, this study has suggested the potential benefits of the previously learned distractor information in visual search. Therefore, the second issue we aimed to address was whether and under what specific circumstances the LTM information could facilitate distractor suppression. In the context of cued visual search in the present study, LTM information referred to information that was provided at the beginning of the experiment and was not actively cued on every trial.

We conducted three behavioral experiments to address these two issues. In Experiment 1, we replicated the findings of previous literature by demonstrating the RT benefits of fixed cue in distractor suppression. We also showed that the LTM representation of a constant color that predicted the occasional appearance of its associated distractors induced similar RT benefits in the fixed cue condition. In Experiments 2 and 3, we examined the RT benefits of the WM and LTM representations for distractor suppression in the varied cue condition. These two experiments used the same and a different set of visual stimuli as in Experiment 1, respectively. The results from the two experiments agreed with each other and showed that the RT benefits in the varied cue condition could only be observed in the trials with the longest RTs, whereas there were no RT benefits of the LTM-held constant color in the varied cue condition. Taken together, these results suggested that the availability of cognitive control was critical for memory-guided distractor suppression in both the fixed cue and varied cue conditions.

Experiment 1

Experiment 1 was designed to examine the RT benefits of distractor suppression from the WM and LTM representations in the fixed cue condition. We adopted a dual-task paradigm where a to-be-memorized color cue appeared before search array and indicated that the target would not share the same color with the cue. If participants could strategically develop a rejection template using the color cue, they should be able to avoid unnecessary capture by the distractors in the cued color. We also designated a constant color that occasionally appeared in the search display but was never associated with the target. We aimed to test whether prior information of distractors’ color in LTM could be used to facilitate visual search in the context of fixed cue.

Method

Participants

Twenty-five naïve participants (11 males; age range: 19–26 years; mean age: 22.4 years; all right-handed) participated in the experiment and were paid for their participation. All of the participants had normal or corrected-to-normal visual acuity and normal color vision, none of them had known neurological or visual disorders. We used G-power (Faul, Erdfelder, Lang, & Buchner, 2007), given an f of 0.25 (which represents medium effect size) and power of 80%, to estimate how many participants should be collected for the experiment (the same procedure was applied to Experiments 2 and 3). Although the expected sample size was 28, we stopped data collection after 25 participants. This was mainly due to the similar sample size in previous related studies (Cunningham & Egeth, 2016; Han & Kim, 2009) and the consistent behavioral pattern across participants. However, we strictly followed the suggested sample size (i.e., 28) in Experiments 2 and 3. Participants provided written informed consent before the experiment. The study was approved by the local ethics committee.

Stimuli and apparatus

The experiments were programmed with Psychtoolbox 3 (Brainard, 1997; Kleiner et al., 2007) in a MATLAB (Mathworks, Natick, MA, USA) programming environment. The stimuli were displayed on a CRT monitor with a refresh rate of 100 Hz and a resolution of 1,024 × 768 pixels. Participants sat at a distance of 70 cm to the screen with a chin-rest to stabilize the head position. Stimuli were selected from a set of seven equiluminous CIELAB colors (physical luminance of the seven colors ranged from 28.5 cd/m2 to 32.2cd/m2, white background was 78.4 cd/m2) and there were three shapes in the search array (circle: 1.4°×1.4°; diamond: 1.6°×1.6°; hexagon: 1.4°×1.4°).

Procedure

We modified the paradigm of Gaspelin et al. (2015, 2017) by adding an initial color cue and a retention period. As shown in Fig. 1a, each trial began with the presentation of a central black fixation cross for 500 ms. Before the onset of the search array, a color cue (0.4°×0.4°) was presented for 500 ms, indicating that the target was not in the cued color in the following search array. Color cue remained the same within one block but changed across blocks. After a 1-s blank, the search array appeared for 150 ms before being masked by six grayscale checkerboard squares (1.6°×1.6°). The mask remained visible until the participants’ response or 3 s elapsed. The search array was composed of six shapes forming an imaginary circle with a radius of 2°. Search target was a randomly chosen unique shape that was explicitly informed to each participant and was the same across blocks. The shape of the target was counterbalanced across participants. The six shapes were in two different colors: the target and two of the distractors were in the target’s color and the remaining three distractors were in the distractor color. Specifically, the distractor color was defined as the color that was different from the target’s color (note that two out of five distractors were in the same color as the target). Inside of each shape, there was a black dot (0.2°×0.2°) located either at the left or right side (0.2° to the edge).

Fig. 1
figure 1

Schematic illustrations of the experimental procedure and the cueing contingencies of the distractor color. (a) An example of working memory (WM)-matched trial from Experiments 1 or 2. The color cue at the beginning of a trial indicated that items in this color would never be the target in the search array. The presentation time of search array in Experiment 1 was 150 ms while it was titrated for each participant in Experiments 2 and 3. In search trials, participants were instructed to discriminate the dot’s location inside the target shape (circle in this example) in Experiments 1 and 2 or the gap’s location of the target Landolt-square (with an upward or downward gap) in Experiment 3. Participants performed change detection in both the search and probe trials by indicating whether the presented color was the same as the cued color. The color cue, which remained unchanged within a block in Experiment 1 (the fixed cue condition), changed every trial in Experiments 2 and 3 (the varied cue condition). There were no probe trials in Experiment 3. (b) Search arrays of three cueing contingencies. (c) Three types of distractors position. DT distractors in target color, DD distractors in distractor color

There were two types of trials. In 90% of the trials (the search trials), participants were instructed to indicate the location of the dot inside the target shape by pressing left or right arrow key using their right hands. In 10% of the trials (the probe trials), after a 150 ms presentation of the search array, six randomly chosen capital letters (0.8° in height) superimposed on each shape for 100 ms and then were masked by six checkerboard squares for 500 ms. During the recall phase, participants were given unlimited amount of time to choose all the remembered letters from an onscreen alphabet using the mouse. At the end of the search and probe trials, the participants were required to complete a change detection task by judging whether the presented color square (0.4°×0.4°) was in the same color as the cued color in the same trial by pressing Y or N using their left hand within 3 s. The change detection task was the same for all three conditions. The inter-trial interval was set to 1.5 s.

The trials in each type were classified into three (WM-matched, LTM-matched, and neutral) conditions based on the cueing contingency of the distractor color (Fig. 1B). In the WM-matched condition (60% of the trials), the distractor color was the cued color. We deliberately assigned a larger proportion of the trials to this condition to encourage the participants to use the color cue for later visual search. In the LTM-matched condition (20% of the trials), the distractor color was a constant color (green). The participants were informed at the beginning of the experiment that the target would never be in this constant color. In the neutral condition (20% of the trials), the distractor color was a randomly selected color that was neither the cued color nor the constant color. For all three conditions, the target’s color was a randomly selected color from color set by excluding the cued color and the constant color.

There were six blocks and each contained 100 trials with three conditions (WM-matched, LTM-matched, and neutral) and two trial types (search and probe) randomly mixed according to the pre-defined proportions. Each block began with a 10 trials practice which was excluded from the analysis. Participants were encouraged to respond as quickly as possible without risking accuracy.

Data analysis

We only analyzed the search trials with correct responses in both visual search task and change detection task. This resulted in 18.06% of the search trials being dropped. For the rest of the trials, for each condition in each block, we excluded responses faster than 200 ms and recursively removed the trials with RTs less than and beyond 2.5 standard deviation from the mean (Van Selst & Jolicoeur, 1994). This resulted in 5.90% and 4.33% of all search trials removed based on search RT and change detection RT, respectively. All probe trials were involved for letter recall analysis. For all statistical analyses, we adopted Greenhouse-Geisser correction in case of sphericity violations and Boferroni correction for multiple comparison. The comparisons of interest were WM-matched and LTM-matched conditions versus neutral condition.

Results

Search trials

As shown in Fig. 2a, one-way repeated measures ANOVA on search RTs yielded a significant main effect among the three cueing contingencies (F (2, 48) = 10.572, p < .001, η2 = 0.306). Search RTs in the WM-matched (p = .008) and LTM-matched (p < .001) conditions were significantly faster than those in the neutral condition. These results had two implications. First, the results agreed with the previous findings that fixed to-be-ignored cues held in WM can accelerate distractor suppression (Cunningham & Egeth, 2016; Noonan et al., 2016). Second, the to-be-ignored feature held in LTM can be effectively applied in distractor suppression when its associated distractors occasionally appeared in the context of the fixed cue condition.

Fig. 2
figure 2

Results of Experiment 1. (a) Search RTs of the three cueing contingency conditions in the search trials. (b) RT benefits in five RT percentiles as measured by subtracting the RTs in the neutral condition from the WM-matched and LTM-matched conditions in the search trials. (c) Number of correctly recalled letters per location of each type (target, DT, and DD) in the probe trials. For each type of position, the value is the total recalled letters from all locations of this type divided by its location number (1 for target, 2 for DT, and 3 for DD). Error bars represent within-subject 95% confidence interval. Asterisks represent significances (* p<.05, ** p<.01, *** p<.001). DT distractors in target color, DD distractors in distractor color

In order to investigate the role of cognitive control in distractor suppression, we examined the possible relationship between the time available for cognitive control to take place and the observed suppression effect. For each participant, we merged the data of all six blocks and then, for each cueing contingency, grouped trials into five discrete bins based on quintiles of search RT distribution (i.e., 20, 40, 60, 80, and 100 percentiles) (Fig. 2b). Two-way repeated measures ANOVA on RT quintile and cueing contingency revealed significant main effects on RT quintile (F (4, 96) = 192.805, p < .001, η2 = 0.889), cueing contingency (F (2, 48) = 10.415, p <.001, η2 = 0.303), and their interaction (F (8, 192) = 4.023, p =.016, η2 = 0.144). Simple effect analysis revealed that three conditions differed under all quintiles (all ps < .05). The RT benefits in the WM-matched and LTM-matched conditions were observed even in those trials with the shortest search RTs.

Change detection RTs also differed among the three cueing contingencies (F (2, 48) = 7.587, p =.001, η2 = .240). Change detection RTs in the WM-matched condition was faster than that in the neutral (p <.001) and LTM-matched (p = .049) conditions while no significant difference was observed between the latter two (p = .869). These results could be resulted from the appearance of another color (the LTM-held constant color or the neutral color) in the search array that impaired the WM representation of the WM cue and therefore led to a slowdown in detection performance (Kiyonaga et al., 2012). As for search accuracy and change detection accuracy, there were no significant differences among the three conditions (search accuracy: F (2, 48) = 1.649, p = .203; change detection accuracy: F (2, 48) = 2.116, p = .132, descriptive statistics were provided in Table 1).

Table 1 Mean RTs and accuracies (with standard deviation) for the search and probe trials in the three experiments

Probe trials

Participants correctly recalled 1.98 letters per trial on average and there was no difference among three cueing contingencies (F (2, 48) = 1.549, p = .223, Table 2). The six to-be-recalled letters were grouped into three types of position, according to the color and shape of the search items where they were embedded (Fig. 1c): the target, the two distractors in target color (abbr. DT), and the three distractors in distractor color (abbr. DD). The number of the correctly recalled letters at each letter position for each cueing contingency was shown in Fig. 2c. Letter position (target, DT, and DD) and cueing contingency (WM-matched, LTM-matched, and neutral) were submitted to a repeated measures ANOVA. There was a significant main effect of letter position (F (2, 48) =190.802, p < .001, η2 = .888) but not of cueing contingency (F (2, 48) =0.569, p = .570). Crucially, there was a significant interaction effect (F (4, 96) = 4.103, p = .011, η2 =.146). The three cueing contingencies had equal number of recalled letters at the position of the target (F (2, 48) = .04, p = .963), but not at the position of DT (F (2, 48) = 7.85, p < .001, η2 = .258) or distractor color (F (2, 48) = 9.63, p < .001, η2 = .300). At the position of DT, participants recalled more letters in the neutral condition as compared with the WM-matched (p = .010) and LTM-matched (p = .009) conditions. At the position of DD, the result pattern was reversed: participants recalled more letters in the WM-matched (p < .001) and LTM-matched (p = .043) conditions as compared with the neutral condition. There were no differences among three cueing contingencies for the accuracy and RTs of the change detection task (F (2, 48) = 0.264, p = .715, F (2, 48) = 1.687, p =.196, respectively).

Table 2 Mean number (with standard deviation) of correctly recalled letters in the probe trials in Experiments 1 and 2

Discussion

The results of the search trials demonstrated that participants could effectively suppress the distractors that matched the WM representations of the to-be-ignored cues in the fixed cue condition. Meanwhile, the RT benefits obtained in the LTM-matched condition indicated that the LTM information could also guide attention away from the matched distractors under the context of fixed distractor cues. In contrast to Gaspelin et al. (2015), our participants recalled more letters at the positions of DD in WM-matched and LTM-matched than in neutral condition. One plausible explanation for this contradiction was that participants reallocated their attention when the letters appeared signaling the switch to a probe trial. During the reallocation process, stimuli matched memory representation would have a competitive advantage over those neutral stimuli and resulted in the capture effect in the probe trial.

Experiment 2

The goal of Experiment 2 was to investigate whether participants could create a rejection template based on varied WM representations and use it to facilitate distractor suppression.

Method

Participants

Twenty-nine new participants (10 males; age range: 18–28 years; mean age: 22.3 years; all right-handed) participated in the experiment and were paid for their participation. All participants had normal or corrected-to-normal visual acuity and normal color vision. None of them had known neurological or visual disorders. One subject was excluded from the analysis because of extremely poor performance in one block (no search trials that both the search and change detection tasks were correct).

Stimuli and apparatus

The stimuli and apparatus were the same as in Experiment 1.

Procedure

The main procedure in Experiment 2 was the same as in Experiment 1 with the following five exceptions. First, the cued color changed randomly from trial to trial. Second, an articulatory suppression task was added to prevent verbal rehearsal during WM maintenance (Downing & Dodds, 2004; Houtkamp & Roelfsema, 2006; Woodman & Luck, 2007). Three randomly selected digits were shown on the screen for 500 ms at the beginning of each trial and the participants were required to verbally repeat them throughout the whole trial. Third, the constant color that served as LTM representation was counterbalanced across participants to reduce personal color preference. Fourth, a 1-up/2-down staircase session was conducted to obtain an appropriate presentation duration of the search array for each participant. We found a strong practice effect in Experiment 1 as indicated by the increased search accuracy across six blocks (F (5, 120) = 7.716, p < .001, η2 = .243). Participants who had sufficient time to fulfill search task might strategically refresh WM representation by attending to the WM matched distractors to achieve better change detection performance (Woodman & Luck, 2007; Yi, Turk-Browne, Chun, & Johnson, 2008), leading to a prolonged search RT. The staircase procedure was introduced to control for this possibility. Participants came twice on two consecutive days and the staircase session was performed on the first day. Only the participants with a threshold around 100~200 ms continued their participation in the main experiment on the second day. The mean threshold across the participants was 137 ms. Fifth, we emphasized the hit rate in the probe trials because the guess rate (39%) was relatively high in Experiment 1.

Data analysis

The procedures for data exclusion and statistical analysis were the same as in Experiment 1, resulting in 28.4% of the trials being dropped for incorrect performance in either the search or change detection tasks. In addition, 4.81% and 4.72% of all search trials were excluded because of the out of RT range in the search and change detection tasks respectively. No probe trial was excluded from the analysis.

Results

Search trials

We did not observe significant differences across the three cueing contingencies in either search RTs (F (2, 54) = 1.845, p = .168, Fig. 3a) or search accuracy (F (2, 54) = 0.936, p = .398). As in Experiment 1, we compared the three cueing contingencies in five RT quintiles (Fig. 3b). There was a significant main effect on quintiles (F (4, 108) = 149.413, p < .001, η2 = 0.847), but the main effect on cueing contingency was not significant (F (2, 54) = 1.876, p = .163). Furthermore, significant interaction effect was found (F (8, 216) = 6.707, p = .001, η2 = 0.199), with three cueing contingencies differing in the first (F (2, 54) = 5.04, p = .010, η2 = 0.157) and fifth (F (2, 54) = 6.15, p = .004, η2 = 0.186) quintiles. For the trials of the first quintile, the search RTs in the LTM-matched condition was significantly shorter than those in the WM-matched condition (p =.016). However, there were no significant differences when compared the neutral condition with both the LTM-matched and WM-matched conditions in these fastest trials. For the trials of the fifth quintile, the search RTs in the WM-matched condition were significantly shorter than those in the neutral (p = .006) and LTM-matched (p = .027) conditions.

Fig. 3
figure 3

Results of Experiment 2. (a) Search RTs of the three cueing contingencies in the search trials. (b) RT benefits in the WM-matched and LTM-matched conditions in five RT percentiles in the search trials. (c) Number of correctly recalled letters per location of each type (target, DT, and DD) in the probe trials. Error bars represent within-subject 95% confidence interval. Asterisks represent significances (* p<.05, ** p<.01, *** p<.001). DT distractors in target color, DD distractors in distractor color

Consistent with Experiment 1, the analysis of RTs from the change detection task showed a significant main effect of cueing contingency (F (2, 54) = 18.836, p < .001, η2 = 0.411). Participants were faster in the WM-matched condition as compared with the neutral and LTM-matched conditions (both ps < .001). Moreover, change detection accuracy (F (2, 54) = 14.192, p < .001, η2 = 0.345) also demonstrated a similar pattern that the WM-matched condition was more accurate than the neutral and LTM-matched conditions (both ps < .001).

Probe trials

Participants correctly recalled 1.49 letters per trial on average and there was no significant difference among three cueing contingencies (F (2, 54) = 0.332, p = .719). The numbers of the correctly recalled letters per location of each type under different cueing contingencies were shown in Fig. 3c. A two-way repeated measures ANOVA (letter position × cueing contingency) revealed a significant main effect of letter position (F (2, 54) = 91.432, p < .001, η2 = .772), but not of cueing contingency (F (2, 54) = 1.989, p = .147) and their interaction (F (4,108) = 1.627, p = .188). Participants recalled more letters at the position of the target than at the positions of DT and DD (both ps < .001). Although change detection RTs of probe trials were roughly the same for the three cueing contingencies (F (2, 54) = 0.936, p = 0.378), a significant difference in change detection accuracy was observed (F (2, 54) = 11.197, p < .001, η2 = 0.293). The memory accuracy was higher in the WM-matched condition than that in the neutral (p < .001) and LTM-matched (p = .002) conditions.

Discussion

We did not find a significant overall cueing effect in search RTs in the varied cue condition, which was consistent with the findings from previous literature (Cunningham & Egeth, 2016; Kerzel & Barras, 2016; Noonan et al., 2016). However, when we examined the benefits of the cues in different RT quintiles, significant RT benefits in the WM-matched condition emerged in the trials with the longest RTs. These results agreed with the proposal that the suppression of cued distractors required sufficient cognitive control. The suppressive process required frequent updates of the cueing information in WM and only those trials with sufficiently long RTs to accomplish cognitive control in creating and executing rejection templates showed significant RT benefits. In contrast, we did not observe significant RT benefits in the LTM-matched condition, even among the trials with the longest RTs. One possible reason for the lack of RT benefits in the LTM-matched condition could be a weak LTM representation of the constant color, which will be addressed in Experiment 3 by adding a prior consolidation session. An alternative explanation was the failure in reactivating LTM information into WM when the distractors in the constant color appeared. A recent study has suggested that information retrieved from LTM was first represented in WM and therefore faced with the similar resource limitation (Fukuda & Woodman, 2017). Since the switching of WM representations between trials dominated the utilization of cognitive control, the resources that were available for activating the LTM information for suppressive purpose could be greatly reduced in the varied cue condition.

In the probe trials, when the color cue changed every trial, neither a capture nor a suppression effect was found in the positions in the distractor color. Since we adopted a staircase procedure to control for the possibility that participants might resample the stimuli in search array to optimize performance in recall phase, we expected that no capture effect was observed in Experiment 2 at the positions of the distractors in the cued color. On the other hand, the results of the search trials indicated that the significant suppression effect was not evident across all trials but instead only evident in the trials with the longer RTs. This might provide the potential explanation of the absence of the suppression effect in the probe trials. Given the small proportion (10%) of the probe trials, we could not split them into five RT quintiles.

Experiment 3

In Experiment 3, we aimed to investigate the suppression effect while placing the search items in a larger visual space (12°×12° vs. 4°×4° in Experiments 1 and 2) (e.g., Han & Kim, 2009; Soto et al., 2005; Woodman & Luck, 2007). This manipulation led to a significant increase in the spacing between search items, thus reducing the possibility of within receptive field competition among them (Desimone & Duncan, 1995). With this change, we were able to control the possible contributions from the between items competition.

Method

Participants

Twenty-eight new participants (six males; age range: 18–27 years; mean age: 21.3 years; one left-handed) participated in the experiment and were paid for their participation. All participants had normal or corrected-to-normal visual acuity and normal color vision, none of them had known neurological or visual disorders.

Stimuli and apparatus

The stimuli and apparatus were the same as in Experiment 2, except that we replaced the search items with Landolt-squares (0.6°×0.6°) in a larger visual space and used only six colors in this experiment.

Procedure

At the beginning of each trial, after the 1-s blank, a search array containing six Landolt- squares with different gap (0.12°×0.12°) orientations was presented. Distractors in the search array had a gap either on the left or right while the only target had an upward or downward gap. The six search items formed an imaginary circle with a distance of 6° to the center. Participants were instructed to find the target and indicate the location of the gap by pressing number 8 (upward gap) or 0 (downward gap) using their right hands. Similar to Experiment 2, we adopted a staircase procedure to obtain an appropriate presentation duration of the search array for each participants. We relaxed the criterion of threshold to 500 ms since the task was more difficult than that in Experiment 2. Participants’ thresholds ranged from 115 ms to 414 ms with a mean of 280 ms. Considering that the thresholds were long enough for participants to make saccades, we recorded eye movement using the Eyelink 1000 Plus (1,000 Hz, monocular) eye-tracker. Since maintaining varied WM representations increased WM load and therefore hampered reactivation of LTM, a consolidation session with 100 trials was conducted before the main experiment. In the consolidation session, no cue was presented before the onset of search display. There were only two conditions: neutral (the constant color did not appear in search array) and constant (the constant color was the distractor color). There were no probe trials in Experiment 3.

Data analysis

The procedures for data exclusion and statistical analysis were the same as in Experiment 1, resulting in 30.30% of the trials being dropped for incorrect performance in either the search or change detection tasks. Furthermore, 2.20% and 4.06% of all search trials were excluded because of the out of RT range in the search and change detection tasks respectively.

Results

Data analysis was conducted by including and excluding the trials with eye movements and both procedures showed similar patterns of result. Hence, we reported the results including the trials with eye movements. We observed significant main effect of cueing contingency on neither search RTs (F (2, 54) = 0.681, p = .462, Fig. 4a) nor accuracy (F (2, 54) = 1.403, p = .255). As shown in Fig. 4b, there was a significant main effect of RT quintile (F (4, 108) = 220.957, p < .001, η2 = .891), as well as a marginal significant interaction (F (8, 216) = 2.919, p = .051, η2 = 0.098), but not of cueing contingency (F (2, 54) = 0.680, p = .463). The search RTs in the WM-matched condition were significantly shorter than in the neutral condition (p = .040) in the trials with the longest RTs. Additionally, there were significant main effects on change detection RTs (F (2, 54) = 8.797, p < .001, η2 = 0.246) and accuracy (F (2, 54) = 13.835, p < .001, η2 = 0.339). Change detection RTs were slowest in the LTM-matched condition as compared with those in the WM-matched (p = .001) and neutral (p = .033) conditions. Change detection accuracy in the WM-matched condition was higher than those in the neutral and LTM-matched conditions (all ps <= .001).

Fig. 4
figure 4

Results of Experiment 3. (a) Search RTs of the three cueing contingencies. (b) RT benefits in the WM-matched and LTM-matched conditions in five RT percentiles. Error bars represent within-subject 95% confidence interval. Asterisks represent significances (* p<.05, ** p<.01, *** p<.001)

Discussion

We replicated the findings in Experiment 2 by showing that the facilitation effect of the WM-matched condition was evident in the trials with the longest RTs. Additionally, we added a consolidation session to exclude the possibility that the null effect of the LTM-matched condition in Experiment 2 was due to participants’ weak memory representation of the constant color. Lack of the RT benefits in the LTM-matched condition suggests that strong LTM representation might not be the critical factor that influences the LTM-based distractor suppression.

General discussion

WM plays an important role in maintaining and manipulating information for ongoing tasks (Baddeley, 1992; D’Esposito & Postle, 2015). The present study focused on the top-down guidance from WM and LTM in distractor suppression by adopting a dual task paradigm with a to-be-ignored color cue. The results of Experiment 1 showed that both the WM and LTM representations under the fixed cue condition could effectively guide attention away from matching distractors. These facilitation effects were evident across the different quintiles of RT distribution. In Experiments 2 and 3, when the color cue varied from trial to trial, RT benefits of the WM-matched distractors occurred only in the trials with the longest RTs where sufficient cognitive control could be established. Nevertheless, participants could not take the advantage of the LTM representation in the varied cue condition, even among the trials with the longest RTs. These results suggest that the availability of cognitive control is a critical factor for effective WM-guided distractor suppression, whereas the LTM-guided suppression occurs only if sufficient resource of WM is accessible by LTM reactivation.

As a result of the brain’s evolutionary development, stimulus signaling physical or valence-based significance could distract ongoing process of attentional allocation and may interfere with the task at hand (Anderson, Laurent, & Yantis, 2011; Schmidt, Belopolsky, & Theeuwes, 2015; Theeuwes, 1992). Recent studies have further suggested that task-irrelevant distractors that matched with the internally held WM representations could also automatically capture attention and impair task performance (Olivers et al., 2006; Soto et al., 2005; Mannan et al., 2010). However, other studies have pointed out that the influence of WM on attention deployment is rather flexible and could be strategically controlled (Dalvit & Eimer, 2011; Arita et al., 2012; Han & Kim, 2009; Woodman & Luck, 2007). Our results agreed with the latter proposal by demonstrating that voluntarily using the to-be-ignored cue could overcome the attraction from WM-matched distractor. In particular, in Experiment 1, we showed similar results as the existing literature that the interference of WM-matched distractors could be reversed into facilitation when to-be-ignored distractor feature was fixed (Cunningham & Egeth, 2016; Gaspelin et al., 2015), indicating the strategic development of a rejection template using the color cue. One might argue that the significant suppression effect found in Experiment 1 was caused by the absence of articulatory suppression task. However, Cunningham and Egeth (2016) demonstrated similar RT benefits of inhibiting non-targets in the fixed cue condition where no articulatory suppression task was used. Moreover, previous study has suggested that articulatory suppression task was not necessary to induce memory-driven attentional capture (Olivers, 2009) and even reduced the WM-driven capture effect if it was applied (Soto & Humphreys, 2008). Therefore, RT benefits we observed in Experiment 1 could not be attributed to the lack of articulatory suppression task.

Flexible control of attentional allocation is resource consuming in front of a rapidly changing environment. Indeed, previous literature showed that distractor suppression effect disappeared in the varied cue condition (Cunningham & Egeth, 2016; Kerzel & Barras, 2016), suggesting a failure of forming a rejection template under such situation. However, in Experiments 2 and 3, we found RT benefits in the trials with the longest RTs, indicating that participants could form a rejection template based on the varied cues. This result was consistent with the findings from Han and Kim (2009) who compared the suppression effect between participants with different search speeds and showed RT benefits only in the slow group. Importantly, our results also demonstrated that the RT benefits in distractor suppression were evident across the participants with different levels of search speed.

The sustained maintenance of goal-relevant information before the onset of an event is considered an important component of proactive control (Braver, Gray & Burgess, 2007; Braver, 2012). In the present study, the WM-matched conditions in the three experiments were specific instances of such proactive control. Under the fixed cue condition in Experiment 1, repetitively suppressing the same WM-matched distractor required less effort and thus did not rely on large consumption of cognitive resources. Hence, the facilitation effect in the WM-matched condition occurred even in the trials with shortest RTs. However, under the varied cue condition in Experiments 2 and 3, participants needed to update the WM representations of the cues in every trial, leading to a greater occupation of the cognitive resources and longer RTs for the effective cue-induced cognitive control. Thus, we could observe a significant facilitation effect in the WM-matched condition only in the trials with the longest RTs.

In contrast to the WM-matched condition, we observed facilitation effect in the LTM-matched condition only in Experiment 1, where the WM-held cues were fixed within a block. Interestingly, under the fixed cue condition, the facilitation effect in the LTM-matched condition was comparable with that in WM-matched condition. There are two possible interpretations for observing the comparable facilitation effects between the two conditions. First, the representations of the fixed cues were transferred to LTM after a few repetitions in the block, and it was the LTM-held cues that facilitated the search performance. Second, the LTM information of the constant color was loaded into WM throughout the experiment given that human can remember approximately four colors in their WM (Luck & Vogel, 1997). While both interpretations may account for the observed effects under the fixed cue condition, we consider the latter one as more plausible for the following three reasons. First, as suggested by both the classical theories (Atkinson & Shiffrin, 1968; Baddeley & Hitch, 1974) and the recent neurophysiological evidence (Fukuda & Woodman, 2017), the information retrieved from LTM is first represented in WM. It was trivial for the participants to retrieve the LTM information routinely in every trial. Therefore, holding the color cue and the constant color in WM rather than in LTM could reduce the retrieval cost significantly. Second, the facilitation effect in both conditions occurred even in the trials with the shortest RTs, suggesting that participants were ready to suppress those memory-matched distractors before the onset of the search array. In particular, the search array was presented for 150 ms in Experiment 1, making the LTM-based reactive rejection mechanism unlikely to be fulfilled as this engage-disengage process requires at least 150–300 ms to complete in the presence of the search display (Geng, 2014; Moher & Egeth, 2012; Theeuwes, 2010). Third, recent studies have shown that multiple WM representations could control attention simultaneously (Chen & Du, 2017; Hollingworth & Beck, 2016). Therefore, it was functionally possible for the participants to maintain two colors in WM (i.e., the cueing color and the constant color) when rapid switching was not required in Experiment 1. In Experiments 2 and 3, reactive control with late correction mechanism was more likely to play a role in the LTM-matched condition when the to-be-ignore cues varied on a trial-by-trial basis (Geng, 2014). In the present study, the appearance of the LTM-matched distractors was less prevalent and unpredictable. Rapid switch of the WM representations taxed a significant amount of cognitive capacity, thus making holding the constant color in WM a less efficient approach for performing the task. Taken together, our results suggested that facilitation of distractor suppression by LTM representations also required sufficient availability of cognitive control. However, the WM-LTM interaction in visual search is a complex process. Future investigations, particularly combined with neuroimaging and neurophysiological recordings, are required to elucidate its underlying mechanisms.

To conclude, our study showed that WM representation could be voluntarily used to suppress its associated distractors in the fixed cue condition. When varying WM-held distractor cues from trial to trial, WM-guided suppression occurred only in the trials with sufficient processing time that allowed successful creation and utilization of a rejection template. On the other hand, the LTM representation of the to-be-ignored feature could facilitate search performance only in the fixed cue condition in which sufficient WM resource is accessible by LTM reactivation. Future investigations with neuroimaging techniques are required to explore the neural mechanisms underlying these behavioral phenomena.