A delay in sampling information from temporally autocorrelated visual stimuli

Callahan-Flintoft, Chloe; Holcombe, Alex O.; Wyble, Brad

doi:10.1038/s41467-020-15675-1

A delay in sampling information from temporally autocorrelated visual stimuli

Article
Open access
Published: 15 April 2020

Volume 11, article number 1852, (2020)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

A delay in sampling information from temporally autocorrelated visual stimuli

Download PDF

3127 Accesses
8 Citations
5 Altmetric
Explore all metrics

Abstract

Much of our world changes smoothly in time, yet the allocation of attention is typically studied with sudden changes – transients. A sizeable lag in selecting feature information is seen when stimuli change smoothly. Yet this lag is not seen with temporally uncorrelated rapid serial visual presentation (RSVP) stimuli. This suggests that temporal autocorrelation of a feature paradoxically increases the latency at which information is sampled. To test this, participants are asked to report the color of a disk when a cue was presented. There is an increase in selection latency when the disk’s color changed smoothly compared to randomly. This increase is due to the smooth color change presented after the cue rather than extrapolated predictions based on the color changes presented before. These results support an attentional drag theory, whereby attentional engagement is prolonged when features change smoothly. A computational model provides insights into the potential underlying neural mechanisms.

Adaptation to average duration

Article Open access 08 October 2020

The whole is faster than its parts: evidence for temporally independent attention to distinct spatial locations

Article Open access 24 November 2015

Does visual attention drive the dynamics of bistable perception?

Article 26 May 2016

Introduction

The visual system allows for the selection and prioritization of certain pieces of information in our visual field over others. As our visual input is constantly changing, due to scene changes as well as head and eye movements, the visual system needs the ability to make this selection not only in space but also in time before the information is gone.

Classically, the time course of attention has been studied through the presentation of discrete stimuli either with rapid serial visual presentation (RSVP)¹ or by varying the stimulus onset asynchrony (SOA) between the presentation of the cue and a static target. In the latter studies, the target was immediately masked after it was presented, and the logic was that the minimum SOA needed for accurate performance provides an estimate of the time for attention to arrive at and sample the cued location^2,3. These studies found that performance rapidly increased as SOA increased to reach a peak ~80–120 ms after the cue. This work has yielded the conventional wisdom for the time needed to shift attention and sample a stimulus. However, it is not clear how straightforwardly such results apply to a continuous stream of potential target stimuli, as is often encountered in real-world scenes.

Extraction of a target at a particular time from a continuous stream of input is a nontrivial problem because the representation of a given stimulus is distributed across cortical areas with distinct processing latencies. Thus, there may be uncertainty regarding the relative time of a cue and the potential target stimuli. Moreover, the visual system may tend to group together stimuli over time, creating a potential segmentation problem, particularly when the features of those stimuli are temporally autocorrelated. For the remainder of this paper the term smooth will be used as a term for temporal autocorrelation of stimulus features.

One line of research using smoothly changing features displayed a set of clock faces whose hand smoothly changed throughout the trial. Participants were asked to report the cued clock’s hand position at the time of a cue. On average, participants reported the clock hand position presented ~130 ms after the cue⁴. Using a similar clock paradigm, this magnitude of lag has also been seen in tasks exploring divided attention⁵ and attentional shifts⁶. It is not limited to the sampling of position. Displaying a disk that changed smoothly in color, Sheth et al.⁷ flashed another colored disk on the opposite side of fixation. Participants on average reported the color of the changing disk as it was ~400 ms after the flashed cue disk. Delays were also found for objects changing smoothly in luminance (37 ms), spatial frequency (83 ms), and entropy (95 ms).

Substantial lags for sampling from a stream of stimuli are not always found. Vul et al.⁸ and Goodbourn and Holcombe⁹ used one or two simultaneous streams of letters and flashed a cuing circle around one of the streams. Participants attempted to report the letter that was presented in that stream at the time of the cue. The average delay estimated was very small, 25 ms or less in both cases. Similarly, Weichselgartner and Sperling¹⁰ found that participants were able to select a letter or digit presented simultaneously with a luminance cue in an RSVP stream. A notable difference with these studies is that those with longer lags used stimuli that changed smoothly over time, while Weichselgartner & Sperling¹⁰, Vul et al.⁸, and Goodbourn and Holcombe⁹ presented a sequence of unrelated letters or digits that changed abruptly from one to the next. Thus, it may be the case that sharp visual transients influence the latency of attention sampling. Previous studies have found that presenting a transient visual signal at the location of a changing stimulus improves temporal sampling whereas endogenous shifts of attention were not as effective¹¹.

Based on this difference in findings between these paradigms we propose a theory of attentional drag, which posits that temporal autocorrelation in a visual feature extends the duration of attentional engagement elicited by a cue in order to extract useful information from an object. Thus, attention gets dragged along in time by a temporally autocorrelated feature dimension, which increases the latency to disengage, and results in a delayed feature selection. Conversely, when features change abruptly, as in the case of a conventional RSVP stream of letters, the salient featural change after the cue leads to an earlier disengagement of attention and thereby an earlier selection. Thus, the attentional drag theory proposes that transients can decrease the latency to sample information in part by making it easier for attention to disengage.

By comparing two conditions, one where colors change randomly at a regular interval and another where they change smoothly, Experiment 1 shows empirically that selection can happen immediately in response to a cue in the random condition, but is delayed by ~100 ms in the smooth condition. Critically, in both conditions the presentation rate of the stimuli is the same, the only difference is the temporal autocorrelation of colors. In Experiment 2 these findings are extended to show that increasing the similarity between successive colors further extends the attentional window. Finally, Experiment 3 tested whether these results were the product of feature extrapolation based on the trajectory prior to the cue and find that this explanation is unable to account for the results. Together, the experimental and modeling work presented here demonstrate an influence of featural change in time on information extraction. The neurocomputational model provided in the discussion demonstrates how a simple attentional system can exhibit prolonged selection latencies. In a nutshell, the prolonged latencies reflects a feedback loop between attention-related neurons and sensory neurons. The sensory neurons have overlapping tuning curves that result in greater activation in the case of a smoothly changing stimulus, which in combination with the recurrence with attention neurons causes long-lasting activation.

Results

Experiment 1

In Experiment 1, 25 participants monitored two changing color disks, one on either side of fixation (Fig. 1). Between 2700 and 5400 ms into the trial a cue (a white or black ring, counterbalanced across participants) appeared for 27 ms around one of disks while a distractor ring (white or black, depending) appeared around the other disk. In the Smooth condition, the disks’ color changed 16° around a circular color trajectory with an SOA of 108 ms. Smooth trials were intermixed within block with Random condition trials where the color changed pseudo-randomly (each color was at least 30° away from the previously presented color), also with an SOA of 108 ms. In both conditions, the disks continued to change for 810 ms after the cue. At the end of each trial, participants were asked to report the color of the disk at the time of the cue by moving their mouse horizontally to update a test disk along the presented color trajectory and clicking when the color best matched memory. The minimum distance between the reported color and the stream of presented colors was used to calculate the serial position error (SPE) and then multiplied by the SOA for a selection latency in time. For all experiments, a permutation analysis was conducted in order to test whether the selection latency of two conditions was significantly different and 95% confidence intervals were generated through bootstrap sampling. A within-subjects Cohen’s d is reported for all comparisons¹².

**Fig. 1: Experimental paradigm for Experiments 1 and 2.**

When colors changed smoothly, the mean selection latency was 113 ms, 95% CI = [95, 131] (Fig. 2). However when the colors changed randomly, the mean selection latency was 2 ms, 95% CI = [−22, 28]. This reduction in selection latency across conditions was significantly different from zero, p < 0.001, d = 1.21. This experiment was replicated in a second, independent sample of 25 participants. In this second sample, the Smooth condition had a mean selection latency of 150 ms, 95% CI = [128, 163] and the Random condition had a mean selection latency of 34 ms, 95% CI = [8, 59] (Fig. 3). Again, this difference was significant, p < 0.001, d = 0.94. There is an obvious difference in the variance between the Smooth and Random distributions, but because color errors and time are conflated in the Smooth condition but not in the Random condition, this difference is hard to interpret. For example, Gaussian noise added to the color perceived or reported in the Smooth condition would result in a Gaussian distribution of serial position errors, whereas in the Random condition it would contribute to the uniform distribution. Future work could test whether another effect of the visual transients presented in the Random condition is to encourage more precise temporal binding.

**Fig. 2: Serial position error (also translated into time) histograms aggregated across 25 participants in Experiment 1’s Smooth and Random conditions.**

**Fig. 3: Serial position error histograms aggregated across 25 participants Experiment 1’s replication study.**

In Experiment 1 and its replication the Random condition yielded a reduced selection latency compared to the Smooth condition. These results suggest that something about the smooth changes leads to a later selected feature value than when stimuli are jumping randomly through feature space (i.e. the Random condition).

One possible explanation of the longer latency is that the smoothness of the stimulus changes extended attentional engagement. If attention is indeed dragged along by a stimulus’ temporal autocorrelation, then increasing the correlation between colors from one time point to the next should further increase the duration of attentional drag. To test this, in Experiment 2 we compared the Smooth condition of Experiment 1 to a condition where the colors on successive frames are more similar to one another.

Experiment 2

Experiment 2 included both the Smooth condition of Experiment 1 and a condition where the color trajectory was even smoother. In this experiment the disks changed 16° along the color trajectory every 108 ms in the Smooth_coarse condition (identical to the Smooth condition of Experiment 1) or 4° every 27 ms in the Smooth_fine condition. Importantly, both conditions presented the same rate of change along the circular color trajectory, only the granularity of change was increased from the Smooth_coarse to the Smooth_fine condition. The attentional drag theory predicts the Smooth_fine condition will result in a longer latency of color selection.

The mean selection latency was 166 ms, 95% CI = [147, 185] for the Smooth_fine condition and 133 ms, 95% CI = [116, 150] selection latency for the Smooth_coarse condition (Fig. 4). The mean selection latencies were significantly different across conditions, p = 0.03, d = 1.45. A second, independent sample of 25 participants was run on the same paradigm to replicate these results and found a selection latency of 166 ms, 95% CI = [149, 182] in the Smooth_fine condition and a selection latency of 109 ms, 95% CI = [94, 123] in the Smooth_coarse condition (Fig. 5). Again, there was a significant difference between conditions, p < 0.001, d = 1.72.

**Fig. 4: Serial position error histograms aggregated across 25 participants from Experiment 2’s Smooth_fine and Smooth_coarse conditions.**

**Fig. 5: Serial position error histograms aggregated across 25 participants from Experiment 2’s replication study.**

The results of Experiment 2 showed a significant reduction in the average selection latency from the Smooth_coarse to Smooth_fine presentation and this finding was replicated. Together, these results support the hypothesis that smooth feature changes leads to prolonged attentional engagement and causes a later selection of feature information.

However an alternative explanation for the combined results of Experiments 1 and 2 is that instead of the increased disruption of smoothness from Smooth_fine to Smooth_coarse to Random presentation, it is actually the decrease in predictability across conditions that is producing the selection latency reduction. The increased difference in color space from one update to the next in the Smooth_coarse condition compared to the Smooth_fine condition may make it more difficult for the visual system to extrapolate the next color. If the visual system is sampling from a generated internal representation instead of the current stimulus input, this increased difficulty might reduce how far into the future the system can predict and result in what appears to be an earlier selection latency. This effect would manifest similarly in comparing Smooth to Random selection in Experiment 1, as the Random condition does not offer a predictable change in color through time.

It would be seem counterintuitive for prediction effects to be the cause of these latency differences, since predictability is typically assumed to reduce errors rather than increase them. Regardless, to test the influence of predictability in causing these latency effects, Experiment 3 showed disks that changed in either a Smooth or Random pattern prior to the cue and switched to other pattern after the cue to test whether smoothness before or after the cue influenced selection latency.

Experiment 3

Experiment 3 tested whether the selection latency differences seen in the previous two experiments were a result of the visual system generating predictions of the color trajectory. The theory would be that in generating these predictions, when the cue triggers attention, the color sampled is from the internal representation (a future value) which leads to an appearance of selecting information later in time. To test this, 20 undergraduates participated in an experiment with a similar paradigm to the previous two used. However this time, in the Smooth-to-Random condition, the colors of the disks changed smoothly prior to the cue and randomly afterwards (in accordance with the smooth and pseudo-random color change of Experiment 1). These trials were intermixed within in block with Random-to Smooth trials where the disks changed randomly before the cue and smoothly afterwards. The internal model theory would predict that when the color changes are smooth before the cue, error distributions will be shifted forward along the before-cue trajectory as color reports are based on predictions. Note, that in this scenario, the color trajectory that would have followed after the cue was never actually presented to the participant. Conversely, when participants are first presented with randomly changing colors and then a smooth change after the cue, this theory predicts less of a shift in the error distribution as the system is unable to generate predictions prior to the onset of attention.

To test this, we measured the distance between the reported value and smooth trajectory, as it would have appeared if the presentation style was constant throughout the trial. The mean selection latency was 3 ms, 95% CI = [−16, 23] in the Smooth-to-Random condition and 91 ms, 95% CI = [75, 106] in the Random-to-Smooth condition (Fig. 6). This difference was significant, p < 0.001, d = 0.93. Note that this pattern of results is opposite to that expected from the prediction theory. A replication study was also performed with a second sample of 30 participants, yielding a mean selection latency in the Smooth-to-Random condition was −6 ms, 95% CI = [−24, 11] and 47 ms, 95% CI = [33, 61] in the Random-to-Smooth condition (Fig. 7), significantly different, p < 0.001, d = 0.68. The data from both of these samples is presented in its raw form in the Supplementary information (Supplementary Figs. 1, 2).

**Fig. 6: Serial position error histograms aggregated across 20 participants from Experiment 3.**

**Fig. 7: Serial position error histograms aggregated across 30 participants from Experiment 3’s replication study.**

The results from Experiment 3 and its replication support the idea that the later selection latency seen in the Smooth condition compared to the Random condition in Experiments 1 is not caused by the smoothness presented prior to the cue (as is assumed by prediction-based theories) but rather by the smoothness presented after the cue. However these results do not rule out the role of prediction entirely but rather one way it might have been used. This will be discussed further in the Discussion.

Experiment 4

As a further test of the effect of predictability of the color trajectory, in Experiment 4, the color trajectory was identical in every trial, with the exact same color trajectory for each disk. The only parameters that changed between trials then was the cue latency and the disk it was presented around. This is in contrast to Experiments 1, 2, and 3, in which the disk color trajectories were chosen at random from four different circles and direction of change was also randomly selected on each trial. If predictability leads to a later selection latency, this more predictable trajectory should produce an even later selection in this experiment than was observed in the Smooth_fine condition of Experiment 2.

The selection latency was 140 ms, 95% CI = [125, 154], 26 ms earlier than that of the Smooth_fine condition in Experiment 2 and its replication study (Fig. 8). As the hypothesis was that the predictable trajectory would lead to a later selection latency, which was not the case given that, if anything, it is earlier, no analysis was performed on this data. In this experiment, it was much easier to learn the color trajectory and yet the delay was not increased relative to the Smooth_fine condition of Experiments 2, reinforcing the conclusion from Experiment 3 that predictability of the sequence is not the cause of the latency to report a color.

**Fig. 8: Serial position error distribution (translated into time as well) aggregated across 17 participants in Experiment 4.**

Discussion

The experiments presented here found that the more smoothly the stimulus color changed, the later the reported color value was. This was only true, however, if the smooth change occurred after the cue onset. Like most empirical results, these findings have several potential explanations. We highlight our attentional drag theory, in which the smooth change of a feature keeps attention engaged for longer than a condition containing random transitions in feature space. While there are other possible mechanisms that could create such an effect, this explanation has the virtue of simplicity, and has been formally defined with a neurocomputational model. With this model, the attentional drag theory provides an explanation for why visual transients are such an effective visual stimulus, not just for capturing attention, but also for disengaging it.

The luminance transients that accompany sudden visual changes have long been suggested to capture or engage attention¹³, and sometimes to disengage attention or control binding^11,14,15. Such findings have typically been explained by proposals that transients directly affect attention – for example, that temporally high-pass cells have a particularly strong connection to neurons that elicit orienting¹⁶. We do not contest such theories here. However, our model does explain how the disengagement of attention may be a result not of transients per se but rather because transients are highly associated with the autocorrelation of features that results in lasting featural activity in our model. Future work may be able to tease these concepts apart, perhaps by superposing luminance transients on an otherwise smoothly changing stimulus.

Our core finding was shown by Experiments 1 and 2, revealing a relationship between selection latency and feature change, while Experiments 3 and 4 ruled out the possibility that this later selection was the product of future prediction based on a predictable trajectory presented prior to the cue. A possible alternative explanation for Experiment 3’s results is that color in the smooth presentation are easier to perceive and so reports are biased toward colors presented in the smooth portion of the trial, rather than attention being dragged along in time, as proposed here. However such an explanation would not account for the increased latency in Experiment 1’s Smooth condition where colors are presented smoothly throughout the trial and therefore should produce no bias. The attentional drag hypothesis provides a single explanation for the entire set of observed results.

To explain how the temporal engagement of attention could be extended by presenting a sequence of smooth feature changes, a computational model was developed. The model is based on the idea that attention is a recurrent process, as described in a previous model of attentional fluctuations over time (i.e. eSTST¹⁷). The eSTST model posits that attention is part of a positive feedback loop in which incoming sense data (e.g. the cue) triggers the deployment of attention at a given spatial location. This attention amplifies feedforward processing at that location to facilitate the encoding of information into memory and, consequently, amplifies its own input from that spatial location. As described in previous studies¹⁷, with typical RSVP stimuli this feedback loop leads to brief attentional episodes on the order of 150 ms in duration, which explains the prevalence of effects like lag-1 sparing in the attentional blink. The new model proposed here elaborates on the eSTST model’s basic architecture by incorporating biologically plausible tuning curves in the input neurons responsive to a given feature (e.g. color, orientation). For the sake of simplicity, this model is a simplification of eSTST in that it does not include memory encoding processes and runs at a finer resolution in time. This simplification is necessary because the eSTST model simulates the encoding of discrete stimuli as used in standard RSVP experiments. Future work will incorporate innovations, such as a distributed binding pool¹⁸ that can encode stimulus values selected from a continuum.

The attentional drag model here has three components for each spatial location: an array of color-sensitive neurons, a cue sensitive neuron, and an attention neuron (Fig. 9). To simulate neuronal membrane potential, a set of differential equations¹⁹ is used as a simple abstraction of excitation and leak current. To understand the drag phenomenon in more detail, consider that because each color-sensitive responds to an array of colors according to its tuning curve²⁰, the presentation of a single color will activate a distributed population of neurons. Thus, in the model color is represented as a vector representing the activation of 360 stimulus-responsive neurons with Gaussian-shaped tuning curves, which very roughly approximate V1 cells²⁰. Therefore, any one color evokes a gradient of activation across the population of 360 neurons. A consequence is that when similar colors are presented sequentially (as in the Smooth condition) overlapping neural representations are activated sequentially, resulting in individual neurons being activated for longer and reaching higher levels of activation (Fig. 10a). Conversely, when colors are presented in random order, the color presented at one time step is less likely to have overlapping activation with the color presented in the previous and following time steps, resulting in less accumulation over time (Fig. 10b). In summary, smooth feature changes result in greater activation across the population of sensory neurons.

**Fig. 9: A schematic of the model used to explain the attentional drag theory.**

**Fig. 10: Model simulations of color neuron activation.**

This greater activation of sensory neurons as a result of smooth color changes is not enough, according to the model, to explain the longer sampling latency observed in the data. The recurrent feedback in the model between attention and sensory neurons was required. It is widely accepted that reporting a stimulus involves feedback from higher-order neurons to sensory neurons²¹, which in our model is accomplished by the “attention neuron”. The attention neuron is triggered by a cue-responsive neuron. Specifically, when the cue is presented, this cue neuron is excited over threshold and sends excitation to the attention neuron. Once the attention neuron has been excited over threshold, it amplifies the excitation being received by all of the color neurons. For color neurons which are currently not receiving excitation (i.e. the color they are responsive to is not being presented) this attentional amplification does nothing. But for colors that are responsive to the current input, this attentional amplification excites them over threshold, allowing them to excite the attention neuron in turn. Thus, the presentation of the cue triggers the deployment of attention and consequently puts the system into an excitatory feedback loop. The establishment and maintenance of the feedback loop is how the model instantiates the engagement of attention.

In the case of randomly presented colors, when the overall activation of color neurons is lower, this loop has a short duration, decaying quickly after the offset of the cue (Fig. 10d). However when the system is presented with a smoothly changing color stimulus and color neuron activation accumulates from one frame to the next, this greater overall activation of the color neurons provides more excitation to the activated attention neuron. Therefore, when the cue neuron triggers the deployment of attention, this feedback loop is sustained for longer in the Smooth condition compared to the Random one (Fig. 10c). This extended attentional engagement results in a longer epoch of excited colors in the Smooth condition. An additional experiment, included in the Supplement, supports the model by showing this phenomenon is robust to different cue durations (Supplementary Notes).

This model does not make specific claims regarding how the color is selected for memory encoding and subsequent report, but presumes that whatever that process is, the extended attentional engagement that means greater activation for later-occurring colors leads to a higher probability of reporting a later color in the Smooth condition compared to the Random condition. Additionally, it may be that selecting a color from the extended color representation in the Smooth condition requires an endogenous form of attention that has slower response times³ and is poor at selecting information with a short latency from a dynamic stimulus¹¹.

While future work is needed to understand the ultimate selection process, the model does simulate the timing of the attentional window. Taking the midpoint of that window as a proxy for the selected color, the model yields a difference of 107 ms between the Smooth and Random conditions (similar to the 111 ms and 116 ms difference between conditions observed in Experiment 1 and its replication study). In comparing Smooth_fine to Smooth_coarse, the model simulates a 55-ms difference (a 33-ms and 75-ms difference between conditions was observed in Experiment 2 and its replication study). Without specifying a method of color selection, the model provides a formalized account of the attentional drag theory here and simulates the pattern of results seen in the empirical data. This general architecture could easily be incorporated into other models of attention, such as eSTST.

A crucial point to emphasize is that our model does not make any functional distinction between whether smoothness extends attentional engagement at a location or if a transient in color disengages attention, as here smoothness and the presence of a transient were perfectly confounded, as typically they are: lack of smoothness implies a discontinuity. Finally, a basic property of the model is that the increase in selection latency results from the degree of smoothness presented after the cue. Experiment 3 provided evidence that this is indeed the critical variable, thus ruling out an alternative account based on sampling from predictions of an internal model.

This is an abductive model in that it accounts for the empirical results with a simple set of neural mechanisms that are plausible inasmuch as they extend a model that has had success with other attentional phenomena¹⁷. However, previous research has had a different interpretation of similar findings. Namely, Sheth et al.⁷ similarly found a long selection latency when reporting the color of a disk at the time of a cue as the disks smoothly changes between red and green. Sheth and colleagues proposed that, as the color changes smoothly, priming occurs, creating a ramp-up in activation of subsequent colors along the trajectory. When attention is triggered by the cue, attention resets this build-up of activation and so the selection of color is delayed until priming can once again ramp-up a color’s activation sufficiently to be selected. This accounts for their results as well as the selection latency of the Smooth condition presented here. However, this theory, as is, would not explain the pattern of selection seen in the Random condition. Explaining this additional finding requires the priming theory to add some sort of immediate enhancement performed by attention in order to privilege the cued color for selection. Without this additional form of selection, every color presented in the Random condition would have an equal chance of being selected. This attentional enhancement could possibly be incorporated into these alternative theories, in addition to the resetting of priming by attentional shifts. However, since this would require two distinct roles for attention in Sheth and colleagues’ theory, it could be argued that the attentional drag theory offers a simpler account for the experimental results. Another potential alternative theory could be that attention integrates information over a fixed interval in time after the cue. While this theory offers a simpler account than the attentional drag, our simulations suggest that an attentional window of fixed duration applied to both random and smoothly presented color changes is unable to account for the results seen here (Supplementary Fig. 7). Again, the experimental work here does not necessarily eliminate these alternative theories. However the results do suggest that such theories would require additional mechanisms in order to account for the same data points the attentional drag is able to simulate. Further elaboration of such alternatives, ideally with a computational instantiation would be a key step in testing the distinction between these accounts since it is difficult to compare models with several different components without a concrete specification.

The empirical results presented here support an attentional drag theory wherein gradual or smooth changes at a location maintain attentional engagement compared to a salient feature change which disengages attention. There are of course many contributing factors to selection latency, such as whether attention needs to shift from a previously engaged location¹⁰ and the saliency of the cue which triggers deployment. However, what these results demonstrate is how feature dynamics affect sensory information extraction. When features make jumps in feature space, attention is able to select information at the time of the cue. This is consistent with previous findings that have shown a transient signal at a location aids in temporal sampling^11,22. Conversely, when features change smoothly in time, selection is delayed. These results suggest that the visual system samples information differently depending on the nature of change of the input. As features often change smoothly in our visual world, understanding this phenomenon could provide important insights into how attention tracks stimuli. This functionality may provide an inherent form of event segmentation that combines visual information from a sequence of time points only when they are likely to comprise the same visual event, which may be complementary to the brain’s parsing of event information on much larger timescales^23,24. Similarly, these mechanisms may underlie how we construct object representations and help to explain why the visual system updates with new information in some circumstances and initiates a new episode or “object file” in others²⁵.