Turning the attentional blink on and off: Opposing effects of spatial and temporal noise
- First Online:
- Cite this article as:
- Martin, E.W., Enns, J.T. & Shapiro, K.L. Psychon Bull Rev (2011) 18: 295. doi:10.3758/s13423-011-0056-2
- 443 Downloads
Brain oscillations in various frequency bands have been shown to be an important means of enabling interarea communication for high-level cognitive performance. Interestingly, perturbation to such oscillations in the form of weak noise has been shown to benefit perception in tasks such as the attentional blink (AB). Here, we investigated perturbation intrinsic to the AB task in two conditions in which noise arose from either temporal or spatial discontinuity. Consistent with theoretical predictions, temporal discontinuity resulted in a reduced AB, whereas spatial discontinuity resulted in an increased AB. The results are discussed in the framework of both stochastic resonance theory and pretarget alpha oscillations, two neurally based accounts of perceptual processing.
KeywordsAttentional Blink (AB)Spatial discontinuityTemporal discontinuityStochastic resonanceAlpha oscillationsNoise
The attentional blink (AB; Raymond, Shapiro, & Arnell, 1992) refers to a reduction in the accuracy of reporting the second of two targets (T1, T2) when they are presented by rapid serial visual presentation (RSVP). The reduction is maximal when the targets are separated in time by approximately 100–500 ms, although accuracy for T2 can be equivalent to that for T1 when participants are instructed to ignore T1, when nontarget items immediately following the targets are omitted, and/or when the temporal separation between the targets exceeds ~500 ms (cf. Raymond et al., 1992; Seiffert & Di Lollo, 1997). The present study explored two experimental factors that have diametrically opposite effects on the AB: one reducing the AB to levels usually observed only when participants are instructed to ignore T1, and the other increasing the AB by an equivalent amount.
A wide variety of accounts have been advanced to explain the AB (cf. Dux & Marois, 2009), but the point of departure for the present study was inspired by a growing body of evidence regarding the role of neural oscillations in perception (e.g., Busch, Dubois, & VanRullen, 2009; Buzsáki, 2004). For example, stochastic resonance theory (Gammaitoni, Hanggi, Jung, & Marchesoni, 1998; Wiesenfeld & Moss, 1995) is a putative mechanism whereby neural noise arising from external sources, normally a negative factor, can, under the right circumstances, increase signal detection. Recent behavioral studies have reported that detection of a weak visual target is enhanced by the presence of randomly flickering gray-level noise adjacent to two possible target locations (e.g., Kitajo, Yamanaka, Ward, & Yamamoto, 2006).
Although the application of stochastic resonance to behavioral outcomes in cognitive psychology is in its infancy, a recent study of the AB suggests that doing so can be profitable. Kawahara (2009) found that “noise” in the form of nontarget distractors surrounding the rapid serial visual presentation (RSVP) stream and continuously present could both increase and decrease AB magnitude. When the noise arose from a dimension different from that of the target detection requirement, T2 detection was enhanced. In contrast, when the noise arose from the same dimension as targets, a larger AB resulted. Stochastic resonance received support as a viable account, in that the type of distractor influenced the direction of the effect and the effect occurred only for targets in the temporal window defining the AB (see the Discussion section for a further treatment of these issues).
The noise in the experiments described above arose from sources in the periphery of the RSVP stream.1 However, noise generated by experimental stimuli themselves may also exert an influence but have not yet been investigated in the context of the AB. Importantly, precise experimental control over such noise allows one to determine whether the attentional modulation observed in the AB is related directly to events preceding each of the targets, or whether the modulation is a consequence of a more continuous state, as described above. To accomplish this goal, intrinsic distraction was presented in the RSVP stream either (1) prior to T1, (2) between T1 and T2, or (3) in both parts of the stream. A comparison of the second with the first and third of these conditions enabled us to examine whether the dramatically different outcomes of such intrinsic noise arise solely from effects on T1, which, by some accounts of the AB, can predict T2 performance or from effects on both targets. The goal of the present report is to gain a better understanding of the role of noise in high-level perception by carefully controlling the source in the specific context of the AB paradigm.
For our beneficial noise, we chose to manipulate the temporal rhythm of the RSVP stream. As has been suggested by Kawahara (2009), a dimension irrelevant to the target task should enhance perception. For our detrimental noise, again following Kawahara, we chose to introduce a degree of variability similar to that induced by manipulating the temporal rhythm, but from a dimension directly relevant to the performance of the imperative task (i.e., identifying a spatially defined letter target in the RSVP stream). Variability in item size should increase the amount of attention that participants devote to T1, because size variation makes shape identification less efficient and more effortful (Schultz & Eriksen, 1978).
Twenty-one undergraduate psychology students (15 females; mean age, 19.7 years; normal or corrected-to-normal visual acuity) from Bangor University volunteered to participate.
Stimuli were presented on a 1,024 × 768 pixel, 32-bit color quality, 17-in. cathode ray tube (CRT) monitor using E-Prime version 1.1 experimental software (Psychology Software Tools, Pittsburgh, PA). All the stimuli were presented in intervals of the 17-ms refresh rate of the monitor.
A stream of 24 or 252 letters was presented in the center of a gray screen, and participants attempted to identify two targets. All items in the stream were black, with the exception of the two white targets (T1 and T2). Nontargets were drawn from a randomly alternating sequence of the entire alphabet, with the exceptions of B, G, S, X, K, and Y. The letters B, G, and S were used as randomly selected T1 items; X, K, and Y were used as randomly selected T2 items. T1 always appeared as the 12th item in the stream in order to equate the amount of discontinuity prior to T1.
Temporal noise was introduced by varying the interstimulus-interval (ISI) between specific nontarget items as described above and shown in Fig. 1a, leaving a constant display duration (17 ms) for every item. Here, the letter font size was identical to that in the standard condition (20-point font), as was the display duration. The mean stimulus onset asynchrony (SOA) was identical to that in the standard condition (102 ms; composed of a mean ISI of 85 ms plus the 17-ms display duration). Temporal noise was created in each stream by selecting randomly from a predetermined uniform distribution of ISIs (17–51–85–119–153 ms). The two exceptions to the random selection of ISIs in the temporal noise conditions were (1) the ISI between each target and its respective mask, to maintain target/mask equality across all conditions, and (2) the ISI occurring between items following T2’s mask; these were presented with an 85-ms ISI, under the assumption that these items have no influence on target performance.
Spatial noise was introduced by varying the font size of specific nontarget letters as described below and as illustrated in Fig. 1b. Here, the average font size was nearly identical to that used in the standard condition (i.e., a 20-point font), with the varying font sizes selected randomly in each stream from 16, 18, 20, or 22 points. Two exceptions to the random selection of font size in the spatial noise condition were (1) T1, T2, and their respective masks, since these were kept the same as those in the canonical condition to maintain equality across targets and masks in all conditions, and (2) the items occurring subsequent to the T2 + 1 item, since these were all in 20-point font, under the assumption that items following the T2 mask have no influence on target performance. In all three spatial noise conditions, as in the standard, all stimuli were presented at ~10 items per second (SOA = 102 ms).
T1 and T2 accuracy were examined separately for each of the temporal and spatial noise conditions with a mixed-design ANOVA involving four conditions (baseline, pre-T1, pre-T2, pre-T1 + pre-T2) × 3 lags (204, 306, 714 ms) × 7 orders. All statistical tests were deemed significant at the .05 level, and all post hoc tests used a Bonferroni correction.
Temporal noise also led to increases in T2 accuracy, as reflected by significant main effects of condition F(3, 42) = 43.45, MSE = 36.33, p < .001, and lag, F(2, 28) = 122.53, MSE = 44.99, p < .001, and an interaction between condition and lag F(6, 84) = 10.69, MSE = 34.19, p < .001. There was no effect of order, F(6, 14) = 1.22, MSE = 38.65, p = .35, and no interaction of condition and order, F(18, 42) = 0.97, MSE = 36.33, p = .50, or of lag and order F(12, 28) = 0.30, MSE = 44.99, p = .98. Post hoc comparisons revealed that relative to the standard condition, T2 accuracy was significantly improved by temporal noise at lag 3 in all nonbaseline conditions (see Fig. 2a).
T1 and T2 correlations
Correlations between first- and second-target performance
Pearson correlation two-tailed
Temporal discontinuity Pre-T2 only
r = .39, p = .80
r = −.082, p = .72
r = .27, p = .23
Temporal discontinuity Pre-T1 only
r = −.03, p = .90
r = .09, p = .67
r = .47, p = .03*
Spatial discontinuity Pre-T2 only
r = −22, p = .32
r = −.49, p = .02*
r = −.15, p = .51
Spatial discontinuity Pre-T1 only
r = .43, p = .04*
r = .56, p < .01*
r = .07, p = .74
Spatial noise generally led to decreases in T1 accuracy, as reflected by a significant main effect of condition, F(3, 42) = 62.80, MSE = 53.24, p < .001. There was no effect of lag, F(2, 28) = 2.42, MSE = 63.39, p = .10, nor was an interaction between condition and lag, F(6, 84) = 1.05, MSE = 59.29, p = .39, an effect of order, F(6, 14) = 2.69, MSE = 35.11, p = .05, or an interaction of order and condition, F(18, 42) = 1.56, MSE = 53.24, p = .15, or order and lag, F(12, 28) = 1.02, MSE = 63.39, p = .52, found. Post hoc comparisons revealed that, relative to the standard condition, T1 accuracy was reduced by pre-T1 noise (see Fig. 2b) but not by noise that occurred between T1 and T2.
Spatial noise also led to decreases in T2 accuracy, as reflected by significant main effects of condition, F(3, 42) = 71.30, MSE = 45.29, p < .001, and lag, F(2, 28) = 387.92, MSE = 52.44, p < .001, and by a significant interaction between lag and condition, F(6, 84) = 4.06, MSE = 32.52, p < .001. There was no effect of order F(6, 14) = 0.519, MSE = 24.95, p = .78, and no interaction of condition and order, F(18, 42) = 0.76, MSE = 45.29, p = .72, or of lag and order, F(12, 28) = 0.71, MSE = 52.44, p = .73. Post hoc comparisons revealed that relative to the standard condition, T2 accuracy was significantly decreased at lag 3 in all but the pre-T2 condition involving spatial noise (see Fig. 2b).
T1 and T2 correlations
Table 1 shows the correlations between T1 and T2 accuracy at all lags. There were significant positive correlations at lags 2 and 3 in the pre-T1 condition, indicating T1 and T2 accuracy rising and falling together. Moreover, there was a significant negative correlation at lag 3 when the spatial noise occurred pre-T2.
The present study demonstrates that noise in a rapid visual stream of stimuli can have dramatically opposite effects during an AB task, depending on the type of intrinsic discontinuity. As is shown in Fig. 2a, temporal noise generally increased T2 accuracy, with the benefit arising when the noise occurred prior to T1 and when it occurred prior to the T2 and with a combined benefit when it occurred in both stream locations.
The opposite consequence was observed when the noise arose from spatial, rather than temporal, discontinuity. Variation in the size of nontarget letters in the stream produced a decrease in T2 accuracy, one that was comparable in magnitude to the increase caused by temporal noise. Fig. 2b shows that larger impairments were observed when the noise occurred prior to T1 than when it occurred prior to T2 and that, in a mirror reflection of the temporal condition, these impairments were additive when the noise occurred in both stream locations.
The observed T1–T2 correlations enable interesting speculation, although we note that the AB literature is replete with opposing views on this matter. The absence of correlations in the AB interval in the temporal noise conditions is likely due to a ceiling effect. The positive correlations in the pre-T1 spatial noise condition are consistent with accounts (e.g., Chun & Potter, 1995; Di Lollo, Kawahara, Ghorashi, & Enns, 2005) in which processes affecting T1 exert an effect on T2. The negative correlation in the pre-T2 spatial noise condition is likely due to a combination of the effects of T1 on T2 as discussed and the spatial noise manipulation. The effect of noise, both temporal and spatial, is further evident in the tight coupling between T1 and T2 performance differences, as clearly can be seen in Fig. 2a,b. The implication is that noise facilitates or impedes target selection, respectively, over and above the direct effects of T1 on T2.
The hypothesis motivating the present experiment is that oscillatory brain patterns can play an important role in modulating cognitive processes such as those involved in the AB. A theory exemplifying the importance of oscillatory patterns is stochastic resonance, which suggests that a weak signal becomes more perceptible when accompanied by a small amount of noise. Empirical support for this notion has come from Kitajo et al. (2006), who showed a performance improvement in detecting a visual target adjacent to a spatial location where 20-Hz gray-level noise was presented. Directly relevant to the hypothesis under consideration, Kawahara (2009) discovered improved performance in an AB task when T2 was accompanied by certain types of noise in the RSVP stream periphery. Although the noise arose from different sources, it was continuously present throughout the RSVP stream. Importantly, performance improvements were observed only when the noise arose from a category different from that of the target—for example, letter noise in a digit detection task. Noise arising from the same category—for example, digit noise accompanying a digit detection task—was found, instead, to reveal decreased performance.
As can be readily seen, the results of the present experiment conform to, yet extend, the findings from Kawahara’s (2009) lab. Like Kawahara, we observed improved performance (i.e., less AB) when the noise, temporal discontinuity, was from a dimension different from that of the spatial task required for letter target identification. Also consistent with Kawahara’s findings, we saw a larger AB when the noise—in this case, spatial discontinuity—arose from a source similar to that required by the same target tasks.
Stochastic resonance theory specifies two conditions that must be met if the theory is to hold to its predictions. First, the stimulus to be enhanced must be subthreshold, a requirement met by the (T2) target occurring in the AB interval. Second, not all types of noise will augment performance—again, a condition met by our findings, as revealed by the opposite effects of temporal and spatial noise. Our results can thus be accommodated within the framework of stochastic resonance theory, since they meet both of these conditions. Their main novelty lies in extending the previous findings from Kawahara’s (2009) lab, obtained with noise external to the RSVP stream, to variations in the noise associated with the stream itself. At the same time, we note one discrepancy with Kawahara’s results, in which there were no effects of noise on the T1 task. In contrast, we found T1 rising and falling in concert with T2, for the most part, suggesting the possibility that some other mechanism may be operating with stream-intrinsic noise.
Another account of the present results arises from electrophysiological evidence revealing that prestimulus alpha (8–12 Hz) oscillations predict behavioral performance. Long-distance (between brain regions) phase coupling in the alpha range has been correlated with high-level processing during cognitive tasks in humans, such as manipulation or maintenance of information in working memory (Palva, Monto, Kulashekhar, & Palva, 2010; Sauseng et al., 2005). In seeming contradiction, Hanslmayr et al. (2007) found the optimal conditions for target detection to arise under conditions of low prestimulus alpha phase coupling. This contradiction is resolved, however, if the conditions for external perception and those for internal processing are viewed as opposing states. In other words, it makes sense for the brain to cease inputting new information in order to process information previously received. Interruption masking, where a succeeding stimulus interrupts processing of a preceding stimulus, has the same effect but operates at a lower level in the visual-processing hierarchy (cf. Brehaut, Enns, & Di Lollo, 1999).
The 10-Hz RSVP rate used in the present and in virtually all AB experiments creates the antecedent conditions for variations in alpha oscillatory power to have the stated effect. The AB, by the argument above, arises when alpha phase coupling is effected in the process of T1’s being identified, in turn putting the brain in an internal state where it is no longer sensitive to external input and T2 identification suffers. Low phase coupling prior to T1 explains the accuracy associated with this target. To place the present results in this framework, we suggest that temporal noise interrupts phase coupling—in turn, facilitating T2, as well as T1, target perception. Spatial noise, on the other hand, and for reasons not yet clear, does not interrupt alpha phase coupling, leaving T2 in the state associated with poor detection. It is important to note that this account does not specify a relationship between the type of noise and the particular target task requirement, as underlies the application of stochastic resonance theory to the present findings.
The present results could also be viewed as related to the findings of Olivers and Nieuwenhuis (2005, 2006) that listening to a repetitive tune, thinking about one’s holiday, performing an additional memory task, previewing affectively positive pictures, and focusing less deliberately on the AB task all result in dramatic reductions in the AB. The paradoxical finding that various forms of distraction extrinsic to the stimulus stream can attenuate the AB (see also Arend, Johnston, & Shapiro, 2006; Ho, Mason, & Spence, 2007) is explained by assuming that the AB arises from overinvestment of attentional resources into filtering for T1 and that distraction attenuates the overinvestment (cf. Olivers & Meeter, 2008) and the subsequent blink. Although this hypothesis was not formulated to account for intrinsic forms of distraction—that is, noise arising from variations in the RSVP stream—the present finding from the temporal noise discontinuity condition is consistent with the account above. On the other hand, noise arising from the spatial discontinuity condition is inconsistent with the overinvestment account.
To summarize, the present data provide an important impetus for those seeking neurally inspired accounts of the AB based on oscillatory patterns, as in the mechanisms of stochastic resonance and alpha phase coupling. To our knowledge, no current account of the AB can accommodate the present results that intrinsic discontinuities of the RSVP stream from temporal and spatial sources can produce increases and decreases in AB performance, respectively.
Kawahara’s (2009) manipulation arose from stimuli spatially proximal to the AB task but not from the actual stimuli involved in the task per se.
Temporal noise lag 2 trials required two items during the intertarget interval to manipulate temporal noise while holding the T1–mask ISI constant. This could not be implemented in the spatial condition, since it would interrupt the required temporal regularity.
This research was funded by Wellcome Trust Project Grant GR071944 to K. Shapiro and was supported by a Discovery Grant (NSERC) to J. Enns.