Attentional selection has been a fundamental issue in the study of attention and information processing, which has been regarded as having limited capacity. A great deal of scholarly concern has focused on when and how task-relevant information is selected. Various theories, such as early and late selection theories, have provided different views regarding the degree to which unselected items are processed. According to the early selection theory, physical characteristics of all incoming stimuli are processed in a preattentive stage in a parallel way, and only the selected item is identified one at a time at a focused attention stage and unselected items are filtered out before identification (Broadbent, 1958). However, the late selection theory claims that task-irrelevant stimuli are not excluded at the identification stage, and that attention is required to select a response after all incoming stimuli are identified in parallel (Deutsch & Deutsch, 1963; Norman, 1968). Many lines of evidence have been provided for and against both views, with the locus of selection the subject of much debate.

Lavie and Tsal (1994) proposed the perceptual load theory to explain the discrepancies in the evidence for the two different views to the question of selective attention. The perceptual load theory assumes that perceptual information is automatically processed until a limited amount of attentional resources is exhausted because the perceptual process is subject to attentional limited capacity but is not controlled voluntarily. Thus, when perceptual load is low, the processing of task-relevant stimuli leaves spare capacity, and leftover attentional resource is automatically allocated to task-irrelevant stimuli. However, when perceptual load is high, attentional resource is unavailable for task-irrelevant stimuli after this resource has been allocated to the task-relevant stimuli (Lavie, 1995). In her Experiment 1, in which perceptual load was manipulated by changing task-relevant set sizes in the search display, Lavie examined the amounts of distractor interference with high and low task-relevant set sizes. When task-relevant set size was low, the target was presented in one of six possible positions with a distractor that either matched or did not match the target, appearing at a task-irrelevant peripheral position. When it was high, the target was presented in one of six possible positions with five neutral stimuli presented in the other five possible positions and a distractor appearing in a peripheral position. Lavie found that distractor interference was obtained when task-relevant set size was low but not when it was high—the so-called the perceptual load effect. According to the perceptual load theory, the processing of a task-irrelevant distractor is modulated by the amount of the perceptual load for task-relevant stimuli, such as task-relevant set size. Since the perceptual load theory was first introduced, many subsequent studies have provided evidence for the role of perceptual load in attentional selection (e.g., Beck & Lavie, 2005; Forster & Lavie, 2008a, b, 2009; Lavie, 1995; Lavie & Cox, 1997; Wei, Kang, & Zhou, 2013; for review, see Murphy, Groeger, & Greene, 2016).

However, numerous other studies have also reported evidence against the perceptual load theory. For example, Johnson, McGrath, and McNeil (2002) found that early selection occurred in a low perceptual load context when participants knew the target location in advance. Eltiti, Wallace, and Fox (2005) also demonstrated that distractor interference disappeared even in a low task-relevant set size display when the properties of the distractor were in discord with attentional control settings for the target. Importantly, Tsal and Benoni (2010a) argued that the modulation of the distractor interference effect by task-relevant set size was actually due to dilution, which is a reduced distractor interference effect with increased display set size (Brown, Roos-Gilbert, & Carr, 1995; Cho, Lien, & Proctor, 2006; Kahneman & Chajczyk, 1983), rather than being a result of the perceptual load of the task-relevant stimuli. That is, according to the latter authors, the perceptual load effect obtained in experiments manipulating task-relevant set size was confounded with the effect of display set size. To dissociate the effect of display set size from the effect of task-relevant set size on distractor interference, Tsal and Benoni added another display called as the dilution display, in which a target and several neutral stimuli were presented in task-relevant positions with a distractor presented in a task-irrelevant peripheral position, as in the high task-relevant set size display. However, as the color of the target was different from all other stimuli, the task-relevant set size was low but the display set size was high.

Tsal and Benoni (2010a) found that the magnitude of the congruency effect was significantly greater when the display set size was low than when it was high, but its magnitude was significantly greater when the task-relevant set size was high than when it was low. In other words, the processing of the task-irrelevant distractor was modulated by display set size when task-relevant set size was controlled—the so-called the dilution effect. Moreover, when display set size was controlled, the amount of the congruency effect increased as a function of task-relevant set size, a finding that is inconsistent with Lavie’s (1995) original perceptual load theory. They suggested that as display set size increases the amount of dilution increases, resulting in decreased distractor interference. These display set size effect (e.g., Benoni & Tsal, 2010; Benoni, Zivony, & Tsal, 2014; Biggs & Gibson, 2014; Wilson, Muroi, & MacLeod, 2011) and reversed task-relevant set size effect on distractor interference (Benoni & Tsal, 2012; Wilson et al., 2011) were replicated in numerous other studies. Although Lavie and Torralbo (2010) proposed a spillover hypothesis to explain the results obtained by Tsal and Benoni (2010a), according to which spare capacity was allocated to neutral stimuli instead of the distractor when the display set size was high due to salience or proximity, Tsal and Benoni (2010b) criticized the spillover hypothesis with evidence that those factors did not impact on the processing of the distractor—an outcome that was inconsistent with the hypothesis.

To explain the display set-size effect on distractor processing, Tsal and Benoni (2010a) used Brown et al.’s (1995) early visual interference account, which was proposed to explain the Stroop dilution effect. The Stroop dilution effect is a reduced Stroop effect that occurs when a neutral word is presented with a target color carrier and a color word compared to the effect when no neutral word is presented. According to the visual interference account, even though letter (word) recognition occurs automatically, feature representations of stimuli other than the target interfere with each other at an early visual processing stage when multiple stimuli are presented simultaneously in a display. Thus, because of these degraded feature representations, the identification processing of the distractor is degraded, resulting in a reduced Stroop effect when a neutral word is presented with a color carrier and a color word. Wilson et al. (2011) also suggested that the modulation of distractor interference was due to dilution. However, they proposed a two-stage dilution hypothesis, suggesting that dilution operates not only in an early visual processing stage but also in the focused attention stage. According to Wilson et al., because the target is selected in the focused attention stage, all other items in the display are subject to dilution, so that dilution at this stage also increases with display set size. Similarly, Cho et al. (2006) suggested that the Stroop dilution occurs because of the attentional competition between the distractor and neutral stimuli at the focused attention stage. However, because Tsal and Benoni (2010a, b), and Wilson et al. (2011) did not directly examine the nature of dilution, it is unclear whether dilution was due to perceptual crosstalk in the preattentive stage or limited attentional capacity in the focused attention stage.

Meanwhile, Tsal and Benoni (2010a) assumed that the reversed task-relevant set-size effect on distractor interference was a result of task difficulty. They proposed that, because the uncertainty of the target location increases with the number of task-relevant stimuli during information processing in the preattentive stage (Wilson et al., 2011), the time window for distractor intrusions also increases, resulting in an increase in the amount of distractor interference. Unlike the prediction based on the perceptual load theory, Tsal and Benoni (2010a) and Wilson et al. (2011) suggested that task-relevant set-size increases the probability of the misallocation of attention to the distractor location before the target is selected.

In the present study, the nature of the display set-size effect on distractor interference was examined while controlling the effect of task-relevant set-size and the nature of the task-relevant set-size effect on distractor interference while controlling the effect of display set-size. Specifically, to see whether the display set-size and the task-relevant set-size effects on distractor interference were due to factors affecting information processing at the preattentive stage or other factors affecting information processing at the focused attention stage, the dual-task paradigm was used. The dual-task paradigm has been used widely to examine the capacity-free automaticity or whether a factor affects information processing in the preattentive stage or the focused attention stage (Davis, 1957; Welford, 1952; for review, see Lien & Proctor, 2002). In this paradigm, in which participants are instructed to make responses rapidly for two different simple tasks, Task 1 (T1) and Task 2 (T2), the stimulus-onset asynchrony (SOA) between the onset of the target stimulus (S1) for T1 and the onset of the target stimulus (S2) for T2 is manipulated. The typical finding in the dual-task paradigm is that the response (R2) for T2 is substantially slowed at short SOAs compared to R2 at long SOAs. This phenomenon is called the psychological refractory period (PRP) effect.

It has been suggested that this PRP effect is due to a central bottleneck in human information processing systems. In the processing stages before the bottleneck, information processing occurs for T1 and T2 simultaneously. However, in the processing stage, at which the bottleneck is located, information processing occurs for only one task at a time. Thus, at short SOAs, information processing for T2 at this stage must wait until processing for T1 is completed, resulting in delayed R2. The amount of time during which no information processing occurs is called “slack” (Schweickert, 1983; see Fig. 1a). Because the amount of slack decreases as SOA increases, R2 becomes faster as SOA increases. Using this logic of the central bottleneck, it has been investigated whether information processing of T2 at a specific stage occurs before or after the bottleneck (e.g., Lien, Ruthruff, & Johnston, 2006; McCann, Remington, & Van Selst, 2000). Specifically, when manipulating a variable that is supposed to affect information processing at a processing stage before the bottleneck, the effect of this variable on R2 decreases at short SOAs (underadditive effect) because the effect is absorbed into the slack (Pashler, 1994; Schweickert, 1978). However, if the variable affects information processing at a post-bottleneck processing stage, the effect of this variable on R2 should be additive with SOA (see Fig. 1b).

Fig. 1
figure 1

a The central bottleneck model in the psychological refractory period (PRP) paradigm. The central bottleneck stage of Task 2 cannot begin until the central processing of Task 1 is completed, resulting in cognitive slack at short SOA. S1 Stimulus for Task 1, S2 stimulus for Task 2, R1 response for Task 1, R2 response for Task 2, SOA stimulus onset asynchrony. b The expected results from the central bottleneck model. Left panel Additive effect with SOA when any factors affecting the stage after the bottleneck stage. Right panel Underadditive effect with SOA when any factors affecting the stage before the bottleneck stage

In the present study, participants were asked to perform a tone discrimination task as T1 and a letter discrimination task as T2. The critical manipulation was the display type of T2. In Experiment 1, as in Tsal and Benoni’s (2010a) Experiment 1b, display set-size was manipulated to ascertain whether the effect of display set size on distractor interference is attributable to perceptual crosstalk at the preattentive stage or to limited attentional capacity at the focused attention stage. If the display set-size effect on distractor interference is a result of degraded feature representations, the display set size of T2 would affect distractor interference underadditively with SOA. However, if the display set size effect is a result of limited attentional capacity, it would affect distractor interference additively with SOA. In Experiment 2, as in the high load and dilution displays used in Tsal and Benoni’s Experiment 1b, task-relevant set size was manipulated to examine the nature of the task-relevant set size effect on distractor interference. If task-relevant set size increases the probability of distractor intrusions at the preattentive stage because of an impaired target selection efficiency, task-relevant set size would affect distractor interference underadditively with SOA.

Experiment 1: the effect of display set size

To examine whether the display set size effect on distractor interference occurs at an early visual processing stage before the central bottleneck stage, as Tsal and Benoni (2010a) claimed, or whether it occurs at a post-bottleneck stage, as Wilson et al. (2011) suggested, the present experiment adopted the dual-task paradigm with a manipulation of display set size, as in the low load and dilution displays used in Tsal and Benoni’s Experiment 1b. In the present experiment, participants were asked to perform a tone-discrimination task as T1 and a letter discrimination task as T2 serially. SOA between the onsets of S1 and S2 was manipulated. The participants were instructed to make a response (R1) to the pitch of S1 first with one hand, and a response (R2) to the identity of the target letter with the other hand. In the letter discrimination task, there were two levels of display set size: low and high. When the display set size was low, the target display contained a target letter, which was positioned at one of four possible central locations, and a distractor, which was positioned at a left or right peripheral location. When the display set size was high, a colored target letter and three white neutral letters were presented at central positions and a distractor was presented at a peripheral location. Because the target was distinguished from the neutral letters by color in the high set size display, the task-relevant set size was same as the low display set size display. The display set size effect on distractor interference could therefore be examined through a comparison between the congruency effects with the low and high display set sizes.

If display set size affects the degree to which the distractor is processed before the target letter is selected for processing at the focused attention stage, as Tsal and Benoni (2010a) claimed, the display set size effect on distractor interference would increase with SOA because longer perceptual and letter recognition processes for the distractor are possible at short SOAs than at long SOAs (e.g., Levy & Pashler, 1995), resulting in slack absorbing the effect of perceptual crosstalk. However, if the display set size effect on distractor interference is due to limited central attentional capacity as Wilson et al. (2011) claimed, the amounts of the modulation of distractor interference by display set size would be constant across SOAs because the display set size effect on distractor interference is due to attentional competition among all visual items for central attentional resources except the target letter after the target is selected for further processing at the focused attention stage.

Method

Participants

Forty students (21 females) from Korea University with an age range between 18 and 29 years old participated and provided informed consent, as approved by the Institutional Review Board of Korea University (IRB Number: KU-IRB-15-47-A-1). All had normal or corrected-to-normal visual acuity and color vision by self-report.

Apparatus

The experiment was conducted in a dimly lit room. Visual stimuli were presented on a 15.9-in. CRT monitor (LG IBM FLATRON 1771FT) with a refresh rate of 60 Hz, and auditory stimuli were presented on speakers (PILLAR CS-3000 PLUS). Responses were collected via a standard computer keyboard. Stimuli presentation, timing, and data collection were controlled by E-Prime software (E-Prime 2.0, Psychology Software Tools, Inc., Pittsburgh, PA). The viewing distance was approximately 60 cm.

Stimuli

All stimuli were presented on a black background. The fixation mark was a white cross (0.46° × 0.46°). The interval between the onset of the target stimuli for T1 and T2 (stimulus onset asynchrony, SOA) was manipulated as 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms. T1 was a tone discrimination task. When a tone was presented through speakers, participants were to respond to the pitch of the tone by pressing a left or right key. There was no reference tone, and only one low tone (498 Hz) and one high tone (661 Hz) were presented for 50 ms in T1. T2 was a visual letter discrimination task, similar to Tsal and Benoni’s (2010a) Experiment 1b. In T2, participants were to respond to the identity of the target letter in the square array. C, S, H, or K was presented as a target letter (0.58° × 0.7°). Two letters (C and S) were assigned to one response and the other two letters (H and K) to the other response. Two types of display were distinguished based on the number of letters in the array. The first display type was a low set size display with a target letter (C, S, H, or K) in the central square array (0.58° × 0.7°) and a distractor letter (C, S, H, or K) to the left or right of the central array (0.8° × 1°). The target and distractor letters differed from one another in every case. A distractor was located at 3.08° from fixation. The target and distractor were presented in white. The second display type was a high set size display, which had a target letter (0.58° × .7°) and three neutral letters (0.58° × 0.7°) from a set of D, N, U, and Z in the central array and a distractor letter (0.8° × 1°) to the left or right of the central array. The distance between the letters in the central array was 0.9°. The target was colored red and the other letters were white. In addition, the congruency of T2 was manipulated. When a letter which was assigned to the same response as the target letter was presented as a distractor, it was a congruent trial. However, when a letter which was assigned to a different response from the target was presented, it was an incongruent trial.

Procedure

There were six blocks of trials. The first and fourth blocks were practice blocks, composed of 36 trials each. The low set size display was presented in the second and third blocks of 160 trials and the high set size display in the fifth and sixth blocks of 160 trials, or vice versa. The total number of trials was 712. The order of display type was counter-balanced across participants.

Figure 2 depicts an example trial sequence. In each trial, the fixation display was presented for 500 ms, followed by 100-ms interval with a blank display. Participants were asked to stare at the fixation mark. A low or high tone was presented as S1. Participants were instructed to press the ‘z’ key of a standard computer keyboard to the low tone with their left middle finger and the ‘x’ key to the high tone with their left index finger. After 50 ms, 100 ms, 300 ms, 700 ms, or 1100 ms, the target display for T2 was presented. Participants were instructed to press the ‘n’ key with their right index finger when the target letter was ‘C’ or ‘S’ and the ‘m’ key with their right middle finger when it was ‘H’ or ‘K’ while ignoring the peripheral distractor. Participants were instructed to respond to the target of T1 earlier than that of T2 and as fast and accurately as possible to both targets. In the case of a wrong response, a 50-ms auditory tone and a 1200-ms visual feedback display (“Incorrect” in Korean) was shown for the R1 and R2 (600 ms each).

Fig. 2
figure 2

An example of a trial sequence in Experiments 1 and 2. The two left displays were used in Experiment 1 and the two right displays in Experiment 2

Results

Trials were excluded if the RT1 was shorter than 125 ms or longer than 2000 ms (1% of all trials), or if the RT2 was shorter than 125 ms or longer than 2500 ms (1% of all trials). Also, trials with incorrect responses on either task were excluded in RT analyses (3% for PE1 and 5% for PE2 of all trials). The mean RTs and proportion of errors (PEs) were calculated for each participant as a function of display set size (high and low), congruency (congruent and incongruent), and SOA (50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms). Analyses of variance (ANOVAs) were conducted on the mean RTs and PEs data of T1 and T2, with these variables as within-subject factors. RT and PE data for Task 1 and Task 2 are shown in Tables 1 and 2, respectively.

Table 1 Mean response times (RT) (in miliseconds), percentages of errors (PE), and standard deviation (SD) (in parentheses) on T1 as a function of display set size, congruency, and stimulus onset asynchrony (SOA) in Experiment 1
Table 2 Mean RTs (in milliseconds), PE, and SD (in parentheses) on T2 as a function of display set size, congruency, and SOA in Experiment 1

RT and PE of Task 1

The main effect of display set size was significant F(1, 39) = 5.03, P = .0307, MSe = 24,511, \( {\eta}_p^2 \) = .1142. Mean RT1 was shorter when display set size was high (M = 573 ms) than when it was low (M = 598 ms). The main effect of SOA was also significant, F(4, 156) = 7.34, P = <.0001, MSe = 5329, \( {\eta}_p^2 \) = .1583. Mean RT1 was 608 ms, 595 ms, 573 ms, 574 ms, and 576 ms for 50-ms, 100-ms 300-ms, 700-ms, and 1100-ms SOAs, respectively. No other effect was significant.

The PE1 analysis revealed no significant main effect or interaction.

RT and PE of Task 2

Even though the mean RT2 tended to be shorter when display set size was high (M = 697 ms) than when it was low (M = 721 ms), the main effect of display set size was not significant, F(1, 39) = 2.48, P = .1233, MSe = 48,017. The main effect of congruency was significant, F(1, 39) = 13.04, P = .0009, MSe = 2246, \( {\eta}_p^2 \) = .2506. The mean RT2 was shorter in congruent trials (M = 703 ms) than incongruent trials (M = 715 ms). The main effect of SOA was significant, F(4, 156) = 297.38, P < .0001, MSe = 9080,\( {\eta}_p^2 \) = .8841, indicating a PRP effect. The mean RT decreased as SOA increased (Ms = 876 ms, 811 ms, 676 ms, 599 ms, and 583 ms for SOAs of 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms, respectively). The interaction between display set size and congruency was significant, F(1, 39) = 10.49, P = .0025, MSe = 48,017, \( {\eta}_p^2 \) = .2120. As in Fig. 3, a significant congruency effect was found when display set size was low (20 ms), F(1, 39) = 22.88, P < .0001, MSe = 1744, \( {\eta}_p^2 \) = .3697, but not when it was high (4 ms), F(1, 39) = 1.06, P = .3095, MSe = 1683. Importantly, the interaction of SOA and display set size was not significant, F(4, 156) < 1. The mean RTs were 894 ms, 821 ms, 687 ms, 609 ms, and 596 ms for SOA 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms, respectively, when display set size was low, and 858 ms, 802 ms, 666 ms, 589 ms, and 570 ms, respectively, when it was high (see Fig. 4). Therefore, the display set size effect did not significantly change across SOAs (35 ms, 20 ms, 21 ms, 20 ms, and 26 ms for SOAs of 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms, respectively). When another ANOVA was conducted with display set size, congruency, and pooled SOA (the two shortest SOAs and the two longest SOAs) for more precise examination of this additivity, the interaction of pooled SOA and display set size was not significant, F(1, 39) < 1. The amount of the display set size effect was similar at the short SOAs (28 ms) and at the long SOAs (23 ms).

Fig. 3
figure 3

Mean response time (RT) for Task 2 in Experiment 1 as a function of display set size and congruency

Fig. 4
figure 4

Mean RT for Task 1 and Task 2 in Experiment 1 as a function of display set size and stimulus onset asynchrony (SOA: 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms)

Also, the interaction of congruency and SOA was not significant, F(4, 156) = 1.28, P = .2785, MSe = 1816. The magnitude of the congruency effect was 2 ms, 18 ms, 7 ms, 21 ms, and 12 ms for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively. The 3-way interaction of display set size, congruency, and SOA was not significant, either, F(4, 156) < 1. That is, the amount of the effect of display set size on the congruency effect was relatively constant across SOAs (–1 ms, 21 ms, 9 ms, 36 ms, and 14 ms for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively, see Fig. 5). When another ANOVA was conducted with pooled SOA to examine this additivity more precisely, the display set size effect on distractor interference did not significantly differ between the two shortest SOAs (10 ms) and the two longest SOAs (25 ms), F(1, 39) = 1.12, P = .2959, MSe = 1978. There were no other significant main or interaction effects.

Fig. 5
figure 5

Mean RT for Task 2 for congruent and incongruent trials in Experiment 1 as a function of display set size and SOA

For PE2 data, the main effect of congruency was marginally significant, F(1, 39) = 3.85, P = .0568, MSe = 9, \( {\eta}_p^2 \) = .0899. PE2 was slightly higher in incongruent trials (4.78%) than congruent trials (4.36%). The main effect of SOA was significant, F(4, 156) = 6.72, P = <.0001, MSe = 18, \( {\eta}_p^2 \) = .1470. PEs decreased as SOA increased (5.7%, 4.78%, 4.97%, 3.88%, and 3.52% for SOA 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms, respectively). The interaction of display set size and SOA was marginally significant, F(4, 156) = 2.32, p = .0594, MSe = 12, \( {\eta}_p^2 \) = .0561. The differences of PE2 between the two display set sizes were 0.35%, 1.45%, 0.62%, 0.63%, and 0.47% for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively. No other effect was significant.

Discussion

As in previous PRP studies, a significant PRP effect was obtained in the present experiment, showing that RT2 decreased as SOA increased. Although participants were instructed to respond as fast as possible to S1, a display set size effect was obtained in RT1. These results indicate that participants might have used a strategy to respond to S1 after the response for T2 was selected. That is, they possibly selected R1 and hold it until R2 was ready to be executed (Pashler, 1984; Ulrich & Miller, 2008). However, this was not the case in Experiment 1 because RT1 did not decrease linearly as the SOA increased and other S2 factors were not obtained in RT1.

As in Tsal and Benoni’s (2010a) Experiment 1b, a significant display set size effect on distractor interference was obtained in the present experiment even though task-relevant set size was low in both displays. That is, the congruency effect was significantly smaller when display set size was high [4 ms, 95% CI (–4.1 12.5)] than when it was low [20 ms, 95% CI (11.5, 28.4)]. This result is inconsistent with the perceptual load theory, which assumes that the amount of interference is primarily determined by the number of task-relevant visual items (Lavie & Tsal, 1994). Regardless of their task relevance, the number of visual items in a display determined the degree to which the distractor was processed. More importantly, no interaction was found between display set size and SOA and the magnitude of the display set size effect on distractor interference was relatively constant across SOAs (1 ms, 21 ms, 9 ms, 36 ms, and 14 ms for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively). These results indicate that display set size affected task performance at the focused attention stage so that the display set size effect on distractor interference occurred because of the limited capacity of attentional resources. If this effect was due to perceptual crosstalk at the preattentive stage, as Tsal and Benoni (2010a) suggested, the effects of display set size and the modulation of distractor interference by display set size should have increased with SOA because the perceptual processing of the distractor could continue during cognitive slack at short SOAs (Levy & Pashler, 1995) even though the quality of the input to the letter recognition stage decreased with display set size. It has been suggested that perceptual processing for a stimulus is strengthened by recurrent processes until the offset of the stimulus (e.g., Francis, 1997). Thus, the results imply that the magnitude of distractor interference was modulated because of attentional competitions among visual items except the target in the search display at the focused attention stage, as Wilson et al. (2011) suggested.

Experiment 2: the effect of task-relevant set size

A larger congruency effect has been found when the task-relevant set size was high than when it was low (Tsal & Benoni, 2010a), even though the original perceptual load theory suggest that the amount of distractor interference varies as an inverse function of task-relevant set size (e.g., Lavie, 1995). According to Tsal and Benoni, the distractor was more likely processed when the task-relevant set size was high than when it was low because of a longer time window for target processing at the preattentive stage in the former than the latter, resulting in a higher probability of distractor intrusion. Wilson et al. (2011) also suggested that this reversed task-relevant set size effect on distractor interference was due to a higher decision noise when the task-relevant set size was high than when it was low. However, although many studies have obtained this reversed task-relevant set size effect on distractor interference (Tsal & Benoni, 2010a; Wilson et al., 2011), it has not been reliably obtained in other studies (see Benoni & Tsal, 2010).

In Experiment 2, the congruency effects were examined with a manipulation of task-relevant set size, as in the high load and dilution displays used in Tsal and Benoni’s (2010a) Experiment 1b, in the dual-task context to see the nature of the task-relevant set size effect on distractor interference when the effect of display set size was controlled. As in Experiment 1, participants were asked to perform a tone discrimination task as T1 and a letter discrimination task as T2. When the relevant set size was high, a white target and three white neutral letters were presented at four central positions and a distractor at a peripheral position and, when it was low, the display was identical to the high display set size display used in Experiment 1.

If the amount of distractor interference is determined solely by perceptual load (Lavie, 1995; Lavie & Tsal, 1994), a larger congruency effect would be obtained when the task-relevant set size was low than when it was high. However, if task-relevant set size modulates the target selection efficiency, as Tsal and Benoni (2010a) and Wilson et al. (2011) suggested, task-relevant set size would have an underadditive effect with SOA. Moreover, if the probability of distractor intrusion is determined by the target search efficiency, the congruency effect would be larger when the task-relevant set size was high than when it was low. Also, this modulation would increase with SOA, because the target selection efficiency affects task performance at a processing stage before the target is processed at the focused attention stage. However, if the effects of task-relevant set size occur after the target letter is selected at the focused attention stage, they would have an additive effect with SOA.

Method

Participants

Forty new students from Korea University (19 females) with an age range between 19 and 35 years old participated and provided informed consent, as approved by the Institutional Review Board of Korea University. All had normal and corrected-to-normal visual acuity and color vision by self-report. None of the students had participated in Experiment 1.

Apparatus, stimuli, design, and procedure

The apparatus, stimuli, design, and procedure of this experiment were almost identical to those used in Experiment 1, with the exception of T2’s display type (Fig. 2). While high and low display set size displays were used for T2 in Experiment 1, high and low task-relevant set size displays, which were identical to the high load and dilution displays of Tsal and Benoni’s (2010a) Experiment 1b, were used in Experiment 2. The low task-relevant set size display was identical to the high display set size display used in Experiment 1. On the high task-relevant set size display, a white-colored target was presented with three white neutral letters in the central square array and one distractor letter in the target display.

Results

With the same exclusion criteria as those in Experiment 1, 3% of all trials were excluded based on the RT1 criteria and 2% based on the RT2 criteria were excluded in both PE and RT analyses. Incorrect trials were also excluded (3% for PE1 and 4% for PE2) in RT analyses. The mean RTs and PEs were calculated for each participant as a function of task-relevant set size (high and low), congruency (congruent and incongruent), and SOA (50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms). ANOVAs were conducted on the mean RTs and PEs data of T1 (see Table 3) and T2 (see Table 4), with these variables as within-subject factors.

Table 3 Mean RT (in milliseconds), PE, and SD (in parentheses) on T1 as a function of task-relevant set size, congruency, and SOA in Experiment 2
Table 4 Mean RTs (in milliseconds), PEs, and SD (in parentheses) on T2 as a function of task-relevant set size, congruency, and SOA in Experiment 2

RT and PE of Task 1

The main effect of SOA was not significant, F(4, 156) = 1.35, P = .2539, MSe = 23,836. The interaction of SOA and task-relevant set size was significant, F(4, 156) = 7.09, P < .0001, MSe = 3793, \( {\eta}_p^2 \) = .1539. The RT differences of the two task-relevant set sizes were 18, 23, –10, –37, and –22 ms for each SOA 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms, respectively.

For PE1 data, the main effect of SOA was significant, F(4, 156) = 4.98, P = .0008, MSe = 9, \( {\eta}_p^2 \) = .1132. The mean PE decreased as SOA increased (3.12%, 2.49%, 2.2%, 2.23%, and 1.71% for 50 ms, 100 ms, 300, ms 700 ms, and 1100 ms SOAs). The interaction of congruency and SOA was significant, F(4, 156) = 2.96, P = .0217, MSe = 6, \( {\eta}_p^2 \) = .0705. However, the congruency effect did not linearly decrease as SOA increased (–1.14%, 0.17%, 0.4%, 0.59%, and 0.05% for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs). There was no significant main effect or interaction in the PE1 analysis.

RT and PE of Task 2

The main effect of task-relevant set size was significant, F(1, 39) = 63.87, P < .0001, MSe = 69,643, \( {\eta}_p^2 \) = .6209. The mean RT2 was greater when task-relevant set size was high (M = 907 ms) than when it was low (M = 758 ms). The main effect of congruency was significant, F(1, 39) = 26.92, P = < .0001, MSe = 2506, \( {\eta}_p^2 \) = .4084. The mean RT2 was greater for incongruent trials (M = 842 ms) than congruent trials (M = 823 ms). The main effect of SOA was significant, F(4, 156) = 230.87, P < .0001, MSe = 14,222, \( {\eta}_p^2 \) = .8555, reflecting a PRP effect. The mean RT2s of SOA, 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms were 995 ms, 960 ms, 821 ms, 711 ms, and 675 ms, respectively. The interaction of task-relevant set size and congruency was not significant, F(1, 39) = 2.63, P = .1130, MSe = 3227, indicating no significant reversed task-relevant set size effect on distractor interference (see Fig. 6). A significant congruency effect was observed when task-relevant set size was high (24 ms), F(1, 39) = 22.55, P < .0001, MSe = 2746, \( {\eta}_p^2 \) = .3663, and when it was low (12 ms), F(1, 39) = 4.71, P = .0362, MSe = 2988, \( {\eta}_p^2 \) = .1077. The interaction of task-relevant set size and SOA was significant, F(4, 156) = 2.66, p = .0349, MSe = 4133, \( {\eta}_p^2 \) = .0638. The difference of the mean RT2 between the high and low task-relevant set size displays tended to increase as SOA increased (127 ms, 144 ms, 144 ms, 161 ms, and 169 ms for SOA 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms, respectively; see Fig. 7). However, the interaction of congruency and SOA was not significant, F(4, 156) = 1.38, P = .2424, MSe = 2487. The magnitude of the congruency effect was 16 ms, 32 ms, 22 ms, 13 ms, and 8 ms for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively. The three-way interaction of task-relevant set size, congruency, and SOA was not significant, F(4, 156) < 1. Although the amount of the effect of task-relevant set size on distractor interference was 9 ms, 6 ms, –10 ms, 22 ms, and 37 ms for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively (see Fig. 8), the effect of task-relevant set size on distractor interference was not significant even when the two shortest SOAs (8 ms) and the two longest SOAs (30 ms) were pooled, F(1, 39) = 1.68, P = .2026, MSe = 2867.

Fig. 6
figure 6

Mean RT for Task 2 in Experiment 2 as a function of task-relevant set size and congruency

Fig. 7
figure 7

Mean RT for Task 1 and Task 2 in Experiment 2 as a function of task-relevant set size and SOA

Fig. 8
figure 8

Mean RT for Task 2 for congruent and incongruent trials in Experiment 2 as a function of task-relevant set size and SOA

For PE2 data, only the main effect of congruency was significant, F(1, 39) = 13.2, P = .0008, MSe = 10, \( {\eta}_p^2 \) = .2529. PE2 was lower in congruent trials (3.7%) than incongruent trials (4.52%). No other effect was significant.

Discussion

As in Experiment 1, a significant PRP effect was obtained in Experiment 2. The mean RT2 increased as SOA decreased. Although RT1 showed the interaction of SOA and task-relevant set size, the main effect of SOA was not significant and other S2 factors had no influence on RT1. Therefore, it is highly unlikely that this interaction might have been due to the result of capacity allocation as mentioned in the Discussion of Experiment 1.

To ascertain the effect of task-relevant set size without the influence of display set size, the congruency effects with the low and high task-relevant set sizes were compared in the dual-task context. The mean RT2 was greater when task-relevant set size was high (M = 907 ms) than when it was low (M = 758 ms); it took longer to search for the target at the high task-relevant set size than the low task-relevant set size. In addition, importantly, task-relevant set size had an underadditive effect with SOA. That is, as shown in Fig. 7, the amount of the task-relevant set size effect on task performance increased with SOA (127 ms, 144 ms, 144 ms, 161 ms, and 169 ms for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively). This result implies that, as Tsal and Benoni (2010a) suggested, the target selection efficiency at the preattentive stage was modulated by task-relevant set size. The target letter was efficient to find because it was easily distinguished from the other items by its unique color when task-relevant set size was low, while it took longer to find the target letter when it was high because the target letter did not pop out among non-target letters and the distractor when it was high. Therefore, distractor intrusion could have increased as a function of task-relevant set size at the preattentive stage.

Importantly, although the amount of distractor interference tended to be larger when task-relevant set size was high [24 ms, 95% CI (14.3, 35.5)] than when it was low [12 ms, 95% CI (0.8, 22.9)], unlike the results of Tsal and Benoni’s (2010a) Experiment 1b, this reversed task-relevant set size effect on distractor interference, which has not been reliably obtained in previous studies (e.g., Benoni & Tsal, 2010), was not significant. This lack of the modulation of distractor interference by task-relevant set size was possibly due to infrequent misallocations of attention to the distractor regardless of task-relevant set size because of top-down attentional control sets or other bottom-up factors at the preattentive stage. It is important, however, to note that the lack of the interaction of task-relevant set size, congruency, and SOA was possibly due to no task-relevant set size effect on distractor interference.

As in Experiment 1, the magnitude of the congruency effect was constant across SOAs. This result is consistent with the well accepted idea that the congruency effect is caused by a conflict between the responses activated by the target and the distractor stimulus features at the response selection processing stage, which is a limited capacity processing stage (Lien & Proctor, 2002; McCann & Johnston, 1992; Piai, Roelofs, & Schriefers, 2014).

General discussion

To test whether display set size and task-relevant set size effects on distractor interference occur at a preattentive stage or at a focused attention stage, the present study examined distractor interferences of the second task in a dual-task context by manipulating display set size or task-relevant set size, as well as SOA. The basic rationale was that if an independent variable affected information processing at a stage for T2, which could occur in parallel with information processing at the focused attention stage for T1, the effect of the independent variable should increase with SOA. However, if this independent variable affected information processing, which cannot occur in parallel with the focused attention stage for T1 the variable should have an additive effect with SOA.

In Experiment 1, display set size was manipulated to examine the natures of the display set size effect on distractor interference while controlling task-relevant set size. As in Tsal and Benoni’s (2010a) experiments, the amount of distractor interference was smaller when display set size was high than when it was low. However, importantly, this modulation of distractor interference by display set size, as well as display set size, had an additive effect with SOA. These results, which are inconsistent with Tsal and Benoni’s idea that the display set size effect on distractor interference is due to perceptual crosstalk among visual items in a display at the preattentive stage, suggest that display set size affected information processing at the focused attention stage, as Wilson et al. (2011) claimed.

In Experiment 2, in which task-relevant set size was manipulated to examine the nature of the task-relevant set size effect on distractor interference while controlling display set size, the amount of distractor interference was not significantly modulated by task-relevant set size, indicating no reversed perceptual load effect. However, task-relevant set size itself had an underadditive effect with SOA, indicating that task-relevant set size affected the efficiency of target selection at a processing stage before the target letter was processed at the focused attention stage, as Tsal and Benoni (2010a) and Wilson et al. (2011) suggested. That is, as task-relevant set size increases, the uncertainty regarding the target location increases and, consequently, the probability of distractor intrusion could increase. However, it is important to note that although the congruency effect was larger when task-relevant set size was high (24 ms) than when it was low (12 ms) and this effect tended to increase with SOA (9 ms, 6 ms, –10 ms, 22 ms, and 37 ms for 50 ms, 100 ms, 300 ms, 700 ms, and 1100 ms SOAs, respectively) in Experiment 2, these effects were not statistically significant. The lack of the underadditivity between the modulation of distractor interference by task-relevant set size and SOA indicates that the misallocation of attention to the distractor before the target letter is processed at the focused attention stage is determined not only by task-relevant set size but also by other top-down and bottom-up factors, such as attentional control settings (e.g., Benoni et al., 2014), and perceptual similarity between the target and distractor (e.g., Yeh & Lin, 2013). For example, distractor intrusion has been found to occur infrequently when the target and the distractor were perceptually distinguishable (Eltiti et al., 2005; Yeh & Lin, 2013).

Dilution in the focused attention stage

To explain the effect of the display set size on distractor interference, Tsal and Benoni (2010a) proposed the dilution account, which is based on Brown et al.’s (Brown et al., 1995; Brown, Gore, & Carr, 2002) early visual interference account. According to this account, all visual items in a display can be automatically recognized, so that responses are delayed when an incongruent distractor is presented. However, when multiple items are presented simultaneously in a display, feature representations of these visual items interfere with each other before lexical encoding, resulting in degraded inputs for word (or letter) recognition. Thus, the degree to which a distractor is processed varies as an inverse function of the number of visual items or the visual complexity in a display. However, the findings of the present study’s Experiment 1 provide evidence counteracting the idea that dilution occurs because of this perceptual crosstalk at the preattentive stage. In fact, it has been reported that Stroop dilution is not modulated by perceptual factors, such as visual complexity (Mitterer, La Heij, & Van der Heijden, 2003; Roberts & Besner, 2005). For example, Roberts and Besner (2005) found no Stroop dilution when a string of complex symbols was presented as a color carrier. Moreover, Mitterer et al. (2003) showed that strings of perceptually complex symbol induced smaller Stroop dilution than strings of letters, pseudowords, and words when these were presented as a dilutor.

In contrast, Wilson et al.’s (2011) dilution account suggests that the focused attention stage is, in part, responsible for the display set size effect on distractor interference. In their experiments, distractor interference decreased as the number of items in a display increased regardless of their relevancy. According to the latter authors, after all items are processed in parallel during the preattentive stage, one item most likely to be the target is selected for processing at the focused attention stage. In this latter stage, only the selected item is relevant, and all the other items are responsible for dilution. As Wilson et al. (2011) noted, “crosstalk increases, which in turn degrades the distractor representation and reduces distractor interference” (p. 330) at the focused attention stage. The amount of dilution therefore increases with display set size.

It has been suggested that one critical reason for distractor intrusion at the focused attention stage is attentional competition between visual items in a display (Cho et al., 2006; Choi, Cho, & Proctor, 2009; Gaspelin, Ruthruff, & Jung, 2014; Kahneman & Chajczyk, 1983; Mitterer et al., 2003). For example, Kahneman and Chajczyk (1983) suggested that dilution occurs because of attentional competition between the distractor and neutral stimuli. According to them, distractor interference is induced only when the distractor captures attention but not when the neutral stimulus does. In addition, Cho et al. (2006) claimed that the color carrier has attentional priority in the Stroop display in a top-down way because only the carrier has target-defining features specified in the attentional control settings and in a bottom-up way because only it has chromatic color. Once the color carrier has been processed, attentional competition is likely to occur among visual items other than the target in the display for some reasons (Kim, Cho, Yamaguchi, & Proctor, 2008; Lachter, Forster, & Ruthruff, 2004). Similarly, Gaspelin et al. (2014) argued that the distractor interference effect is caused by involuntary misallocations of attention to the distractor, which is called slippage. According to the latter authors, the probability of the misallocation of attention to the distractor decreases as display set size increases because non-target letters which have target-defining features compete with the distractor for slippage of spatial attention. These slippage-based accounts are applicable to the results of the present study’s Experiment 1. That is, the target was easy to select on the basis of its location when display set size was low and its color when display set size was high. Once the target had been selected to be processed at the focused attention stage, attentional competition among visual items other than the target occurred, resulting in a congruency effect varying as an inverse function of the number of the visual items, because the probability of the misallocation of attention to the distractor decreased with the number of the visual items. Specifically, the findings imply that the congruency effect can be modulated by the probability that the distractor captured attention after the target has been selected to process at the focused attention stage. In contrast, the response selection stage is most related to the central bottleneck stage because attentional resources are limited at the both stages.

The task-relevant set size effect in the preattentive stage

Since Lavie and Tsal (1994) proposed the perceptual load theory, this concept has played an important role as a “hybrid resolution of the early-late selection debate” (Benoni & Tsal, 2013, p. 1). The original perceptual load theory assumes that perception has limited capacity but is not controlled voluntarily so that the perceptual load of the task-relevant items determines the locus of selection (Lavie, 1995; Lavie & Cox, 1997; Lavie & de Fockert, 2003; Lavie & Fox, 2000; Lavie & Tsal, 1994). In other words, the locus of selection is determined by “the perceptual load involved in relevant processing” (Lavie & Cox, 1997, p. 395). Specifically, processing of the task-irrelevant distractor is weakened early because attention is highly selective when perceptual load is high, whereas attention resources spill over to the processing of the task-irrelevant distractor when perceptual load is low, resulting in distractor interference. However, it has been found that task-relevant set size increased the degree to which distractors were processed when display set size was controlled (Benoni & Tsal, 2012; Benoni et al., 2014; Chen & Cave, 2013; Tsal & Benoni, 2010a; Wilson et al., 2011).

According to Tsal and Benoni (2010a), because of increased processing time when task-relevant set size is high the probability that distractor intrusion occurs increases. More specifically, Wilson et al. (2011) suggested that because the uncertainty regarding the target location increases at the preattentive stage as task-relevant set size increases, distractor intrusion is more likely to occur when task-relevant set size is high than when it is low. In other words, the degree to which the distractor is processed is not directly determined by the perceptual load of the target display, but by the efficiency in the target selection based on the outcomes of the information processing at the preattentive stage. Studies have consistently found that distractor interference increases as the target selection efficiency decreases, regardless of perceptual load (Chen, 2003; Cosman & Vecera, 2012; Eltiti et al., 2005; Paquet & Craig, 1997). The efficiency of the target selection is modulated by both bottom-up and top-down factors at the preattentive stage (e.g., Wolfe, 1994).

It is important to note that the task-relevant set size effect on distractor interference was modulated by top-down attentional set (Benoni et al., 2014; Theeuwes, Kramer, & Belopolsky, 2004), but the display set size effect on distractor interference was not (Benoni et al., 2014). In Benoni et al.’s experiment, a marginally significant reversed task-relevant set size effect on distractor interference was found when the high and low set size displays were randomly presented, but no effect when the two types of display were presented in different experiment blocks. These results indicate that as the efficiency in selecting a target from a task-irrelevant distractor, which is determined by top-down attentional control settings, decreases in the preattentive stage, the processing of the distractor is more likely to occur before the target is selected to process. Thus, the reversed task-relevant set size effect on the congruency will be found only when the distractor is selected before the target in the high task-relevant set size condition because of a low selection efficiency. Specifically, as the uncertainty of the target location increases with task-relevant set size, the probability that distractor intrusion occurs before the target is selected for processing at the focused attention stage can increase, resulting in the congruency effect varying as a function of task-relevant set size.

Yeh and Lin (2013) found that efficiency of target selection also undergoes bottom-up modulation from the perceptual similarity between the target and distractor. In their experiments, in which the target was determined by its location when display set size was high but task-relevant set size was low, a significant congruency effect was obtained only when the target and distractor were presented in the same color and other non-targets were in a different color, but not when the target and distractor were presented in different colors or all stimuli were the same color. Gaspelin et al. (2014) also found increased distractor interference when the target color cue was presented at the distractor location even with a high display set size and decreased interference when the target color cue was presented at the target location with a low display set size. These results indicate that the congruency effect can be modulated by top-down and bottom-up factors affecting the probability that the distractor captures attention (distractor intrusion) before the target is processed at the focused attention stage.

Conclusion

When a target and a distractor are simultaneously presented with other visual items, the processing of the distractor could occur before or after the target is selected to process. The present study demonstrates that two important factors modulating the degree to which the distractor is processed—display set size and task-relevant set size—influenced information processing at different stages. Display set size modulates the congruency effect after the target has started to be processed at the focused attention stage because of attentional competition among the visual items other than the target. However, task-relevant set size seems to modulate the congruency effect before the target is selected for processing because of attentional competition between the target and other visual items, including the distractor. More generally, the size of the congruency effect is determined by the probability of the misallocation of attention to the distractor after the target is selected for processing, such as the number of visual items in a display and factors affecting the target selection efficiency, such as task-relevant set size and perceptual similarity, respectively. Even though the question of whether the congruency effect occurs only when it captures attention remains unanswered, converging evidence implies that the effect of the distractor is modulated by the misallocation of attention to the distractor.