The time course of salience: not entirely caused by salience

Krüger, Alexander; Scharlau, Ingrid

doi:10.1007/s00426-020-01470-6

The time course of salience: not entirely caused by salience

Original Article
Open access
Published: 18 February 2021

Volume 86, pages 234–251, (2022)
Cite this article

Download PDF

You have full access to this open access article

Psychological Research Aims and scope Submit manuscript

The time course of salience: not entirely caused by salience

Download PDF

Alexander Krüger¹ &
Ingrid Scharlau¹

1956 Accesses
Explore all metrics

Abstract

Visual salience is a key component of attentional selection, the process that guards the scarce resources needed for conscious recognition and perception. In previous works, we proposed a measure of visual salience based on a formal theory of visual selection. However, the strength of visual salience depends on the time course as well as local physical contrasts. Evidence from multiple experimental designs in the literature suggests that the strength of salience rises initially and declines after approximately 150 ms. The present article amends the theory-based salience measure beyond local physical contrasts to the time course of salience. It does so through a first experiment which reveals that—contrary to expectations—salience is not reduced during the first 150 ms after onset. Instead, the overall visual processing capacity is severely reduced, which corresponds to a reduced processing speed of all stimuli in the visual field. A second experiment confirms this conclusion by replicating the result. We argue that the slower stimulus processing may have been overlooked previously because the attentional selection mechanism had not yet been modeled in studies on the time course of salience.

Measuring and modeling salience with the theory of visual attention

Article 23 May 2017

Salience from multiple feature contrast: Evidence from saccade trajectories

Article 11 January 2018

Conditional control in visual selection

Article 06 July 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

“You never get a second chance to make a first impression.” This colloquial phrase aptly describes what happens to local contrasts—like color, orientation, or luminance contrasts—in the visual system. Physical contrasts like these that stand out from their surroundings are referred to as salience (e.g., Treue, 2003). Salience affects attention and hence how limited cognitive resources are distributed. The strength of this influence is contingent on the timing: Once the window of opportunity has closed, even strong contrasts cease to affect attention.

Characteristics of salience

Evidence for a fast time course of attention has been provided several decades ago by studies using a broad variety of designs. Early evidence for such a time course of attention stems from studies using peripheral cues that were present in some conditions and absent in others. In these studies, the peripheral cue’s effect on attention was strong after 50 ms to 150 ms and declined afterward (Shepherd & Müller, 1989; Nakayama & Mackeben, 1989; Müller & Rabbitt, 1989). This corresponds with the more general idea that timing is crucial for attention. Temporal dynamics of attention can severely affect the processing of visual stimuli (for reviews see; Kinchla, 1992; Egeth & Yantis, 1997; Olivers, 2007). Thus, understanding visual attention involves understanding its temporal dynamics including its quick and transient component driven by stimuli.

The time course of salience in the first second after onset was studied by using a variety of experimental paradigms and different operationalizations (Donk & Soesman, (2010; Dombrowe, Olivers & Donk, 2010; Couffe, Mizzi & Michael, 2016; Donk & Soesman, 2011; Donk & van Zoest, 2008; van Zoest, Donk & Van der Stigchel, 2012) Silvis & Donk, 2014; van Zoest & Kerzel, 2015). Dombrowe et al. (2010) cued speeded responses with salience displays. They reported that the response time advantage for the salient stimulus is low after a presentation duration of 30 ms–60 ms, rises to a peak at 240 ms and 480 ms, and is gone after 960 ms. Another cueing study by Donk & Soesman (2010) found a response time advantage for the salient stimulus after just 42 ms. This advantage reached its peak after 158 ms and was weakened after 483 ms. Donk and Soesman (2011) used temporal order judgments (TOJs). They tested two presentation durations of the salience display affecting the subsequent TOJ: After 58 ms, the effect of salience on attention was stronger than after 800 ms. However, the effects of salience did not vanish. Instead, a strongly salient and weakly salient stimulus induced the same attentional advantage after 800 ms. Apparently, the uniqueness of the element in the display maintained an attentional advantage, but the attentional advantage from the strength of contrast disappeared over the first second after presentation. Using a visual search task, Couffe et al. (2016) found an increase in attentional advantage over the first 100 ms. The saccadic latency study by Donk and van Zoest (2008) reports an advantage for salient stimuli for short latencies in the 175 ms and 200 ms bins that declines afterward. Van Zoest et al. (2012) report that the curvature of saccadic trajectories is strongly affected by the salience of a distractor after 180 ms but that this effect vanishes after 300 ms. To sum up the different findings, the general expectation is that the attentional advantage caused by salience rises quickly. Afterward, the influence of salience declines. A unique contrast can retain a weakened attentional advantage even after 800 ms.

In general, salience directs attention based on local physical contrast. This way of orienting of attention has been widely recognized as a central component of visual attention although its independence of other influences is still a matter of debate (e.g., Wolfe, Cave & Franzel, 1989; Müller & Krummenacher, 2006; Wolfe & Horowitz, 2017; Theeuwes, 2019). There are many types of physical contrast that attract attention (Wolfe & Horowitz, 2004). The effect of visual salience on attention is not merely present or absent. The higher a contrast between stimulus and its surroundings, the stronger its attentional advantage (Duncan & Humphreys, 1989).

How much salience is caused by a particular local contrast has been studied theoretically and empirically. Computational models (e.g., Itti & Koch, 2001; Li, 2002) provide an explanation of how a salience value may arise from a cognitive process. However, these models do not render empirical measures of salience irrelevant because their predictive power varies with different operationalizations (Koehler, Guo, Zhang, & Eckstein, 2014) and they are sometimes even conflicting with empirical findings (Einhäuser & König, 2003; Onat, Açık, Schumann & König, 2014).

The need for model evaluation motivates the development of empirical salience measures. Different empirical measures of salience have been proposed (Huang & Pashler, 2005; Nothdurft, 2000; Koene & Zhaoping, 2007). In the case of several local contrasts at the same location, it is not obvious how these contrasts interact to produce overall salience. Whereas these empirical studies largely agree on qualitative aspects of salience (e.g. that the more types of contrast, the stronger the salience), they differ in their quantitative estimation of the strength of salience (e.g., on the strength of interactions of contrasts; Koene & Zhaoping, 2007; Nothdurft, 2000). A possible cause for diverging results is that each operationalization for testing quantitative hypotheses about the strength of salience is justified in a verbal argument. A formal model linking the salience measure and the operationalization may provide a better explanation as to why a physical contrast is associated with a numerical salience value (Krüger, Tünnermann, Rohlfing, & Scharlau, 2018).

Why should researchers interested in attention care about—possibly minute—quantitative differences in salience caused by physical contrast and when it is presented? The reason why even small salience differences matter is that attention is a selective process (Carrasco, 2011). In this selective process, attended stimuli have an advantage in competing for limited resources (Desimone & Duncan, 1995; Beck & Kastner, 2009; Reynolds & Chelazzi 2004). Therefore, even small differences can determine whether a stimulus is processed further to be represented consciously or whether it passes unbeknownst to the observer (Luck & Vogel, 1997; Walker, Stafford & Davis, 2008).

Modeling salience-based selection

Understanding visual attention as a selective process that is—among other factors—driven by local contrasts and their timing results in a complex process. Formal models are particularly apt for dealing with complex cognitive phenomena (Marewski & Olsson, 2009; Rodgers, 2010). They provide good quantitative explanations (Krüger et al., 2018), have been successful in accumulating progress in attentional research in the past (Logan, 2004), and, importantly, force high specificity and quantitative precision (Luce, 1999; Taagepera, 2008).

The merits of modeling in psychology have been described by different authors (e.g., Taagepera, 2008; Rodgers, 2010; Marewski & Olsson, 2009). Models are particularly valuable in combination with Bayesian inference for understanding nonlinear cognition processes (e.g., Rouder & Lu, 2005; Lee, 2011; Van de Schoot, Winter, Ryan, Zondervan-Zwijnenburg & Depaoli, 2017). Also, models enable parameter estimation, which has arguably been undervalued in classical hypothesis tests (Cumming, 2014). Because models are more explicit in what is expected to happen than a prediction of a directed effect, they provide a more severe test of hypotheses (Rouder, Morey, Verhagen, Province & Wagenmakers, 2016) which is well in line with the hypothetico-deductive method applied in psychology (Gelman & Shalizi, 2013). Providing such a more severe test means that it is easier to potentially falsify a claim.

For attention research, cumulative progress in formal models of attention has been reviewed by Logan (2004). One of the frameworks reviewed by Logan is Bundesen’s theory of visual attention (TVA; Bundesen, 1990). TVA formally models visual selection and recognition as a parallel biased competition. The mechanism can be imagined as a race: Each stimulus in the visual field is associated with a processing speed. Only the stimuli finishing the race first are represented in visual short-term memory until its capacity is exhausted. Stimuli arriving thereafter cannot be represented for later recall. The sum of the stimuli’s processing speed is the overall visual processing capacity that represents the available processing resources for the current task. Each stimulus’ individual processing speed is affected by the relative attentional advantage of the stimulus, its attentional weights. These weights, in turn, are affected by the task relevance and the sensory evidence of the stimulus’ features (for a more detailed explanation, see Bundesen, Vangkilde & Petersen, 2015). The overall visual processing capacity and attentional weights are most important for the present paper. As will be detailed below, the TVA functions and parameters describe observed data, but also have very precise theoretical meanings.

Although salience was originally not explicitly modeled in TVA, a link between TVA’s attentional weights and visual salience had been suspected when TVA was interpreted in terms of neuronal processes (Bundesen, Habekost & Kyllingsbæk, 2005, Bundesen, Habekost & Kyllingsbæk, 2011). More recently, a salience measure, formally denoted as $\kappa$, has been added to TVA to include the influence of salience on the attentional weights and hence the quantitative contribution of salience to selection (Nordfang, Dyrholm, & Bundesen, 2013). This value functions as a common measure for salience. Such a single measure of salience has been sought before in experimental studies (e.g., Nothdurft, 2000, 1993; Huang & Pashler, 2005). Although pursuing the same goal, these studies used very different stimulus material (especially large sets of homogeneous background items) that cannot be interpreted in terms of TVA so that there is yet no direct connection between these two research strands. In a previous work, we made this connection between the stimulus material and a TVA-based formal salience measure (Krüger, Tünnermann, & Scharlau, 2016, 2017; Tünnermann, Krüger & Scharlau, 2017).

We combined TVA’s cognitive model of visual attention with a simple task that allows for uncomplicated salience-related stimulus manipulation. During the task, the temporal order of two visual events has to be judged in a so-called temporal-order judgment (TOJ). Both events are separated by a brief interval, the stimulus onset asynchrony (SOA). This accuracy-based task requires a binary decision (“A before B” or “B before A”). The task is related to attention because of the phenomenon that an attended stimulus is perceived earlier than an otherwise similar but unattended stimulus. Attention thus leads to a systematic deviation of the reported from the objectively correct order (for a review, see Spence & Parise, 2010). The TOJ allows us to investigate the time course of visual salience by manipulating the interval between the onset of a salience display and the subsequent two visual events that have to be judged. The resulting judgment can be modeled as the outcome of the general attentional cognitive processes assumed by TVA (Tünnermann, Petersen & Scharlau, 2015). Thus, while in line with previous empirical and modeling works, the TVA-based model additionally provides a formal link to a general theory of attention and its notion of salience (Nordfang et al., 2013) so that TOJ data can be explained in terms of the overall visual processing capacity and a theoretically meaningful salience measure (Krüger et al., 2016, 2017).

Originally, we merely aimed to show that the previous modeling and empirical work (Krüger et al., 2016, 2017) can be extended to measure the time course of salience in TVA’s salience measure $\kappa$. We planned Experiment 1 with five time intervals between salience onset and salience measurement with full randomization. Contrary to our expectations, only the overall visual processing capacity—the other free model parameter—was severely reduced before 150 ms. Because the result was an unexpected discovery, we conducted a replication. The only change from the original experiment was the use of a blocked design instead of a fully randomized design because this blocked design had been used in a previous TOJ time-course of salience article (Donk& Soesman 2011) and thus helps to make results comparable. Experiment 2 confirmed the discovery from Experiment 1, and both experiments suggest that the time course observed in the TOJ is affected by a difficulty to solve the task and by a change in the effect of salience on attention.

General method

Two TVA parameters are crucial for the following experiments. These parameters are the overall processing speed or capacity, C, and the stimulus-driven component of attention, $\kappa$, which measures salience. This section explains why and how data from TOJs can be mathematically linked to these two parameters. The section can be skipped without loss of continuity, if the reader is either familiar with the modeling or prefers to take both parameters at face value.

In a TOJ, two stimuli are task-relevant. To distinguish them, we call them probe, p, and reference, r. These names originate from the fact that the probe, the experimental stimulus, is salient while the reference, always non-salient, serves as the control stimulus. The TOJ can be understood as the outcome of a race between these two stimuli. Whichever finishes its race first is perceived to be the first stimulus. For winning the race, processing speed matters. According to TVA, each stimulus has an associated processing rate, $v_p$ and $v_r$, that determines the speed of processing. These rates are given in stimuli per second (Hz). Their sum yields the overall processing speed, C, also given as a rate.

According to the extended weight equation shown in Eq. 1 (Nordfang et al., 2013), salience and goal-directed influences interact multiplicatively to produce attentional weights. The variable $w_x$ determines the attentional weight for a specific stimulus x. The factor $\kappa _x$ describes the effect of salience of object x, R is the set of all relevant semantic categories, $\pi _j$ determines the pertinence of the category j, and $\eta (x,j)$ the sensory evidence that the stimulus in question x belongs to the semantic category j. Details about these attentional parameters are described by Bundesen (1990, 1998). In the current experimental context, the goal-directed influences on the attentional weight, $\pi$ and $\eta$, are kept constant by designing a task where both stimuli are equally task-relevant and provide the same sensory evidence for the event to be judged. If both stimuli are equally relevant, only $\kappa$ makes a difference for their attentional weight. Nordfang et al. (2013) defined $\kappa _r$ to be 1 for a stimulus without any specific bottom-up salience. The two stimuli’s attentional weights can be normalized by dividing by the sum of weights to distribute the overall processing speed, C, to either one as shown in Eqs. 2 and 3. So, it is possible to express the processing speeds of two stimuli as a function of the overall processing speed, C, and the salience value of the salience stimulus, $\kappa _p$.

$$\begin{aligned} w_x = \kappa _x \sum _{j \in R} \eta (x,j)\pi _j \end{aligned}$$

(1)

The two processing speeds, $v_p$ and $v_r$, are connected to the data stochastically in such a way that the variables $v_p$ and $v_r$ are used as parameters for a psychometric function. Together with the stimulus onset asynchrony (SOA), i.e. the temporal delay between the two temporal events of which the order has to be judged, the TOJ can be described formally by a psychometric function. This is explained in detail by Tünnermann et al. (2015) and Krüger et al. (2016). For the present article, it is important that this psychometric function describes the observed TOJ data.

$$\begin{aligned} v_p&= \frac{\kappa _p}{1+\kappa _p} \cdot C \end{aligned}$$

(2)

$$\begin{aligned} v_r&= \frac{1}{1+\kappa _p} \cdot C \end{aligned}$$

(3)

In terms of the parameters, $v_p$, $v_r$ and SOA denoted as $\Delta t$, the probability of encoding the probe stimulus p first, $P_{\mathrm {p}}$, can be expressed as two equations. The Eqs. 4 and 5 define a sigmoid-looking psychometric function (for a more detailed derivation of this function from the basic TVA, see Tünnermann et al., 2015). These functions together describe the order judgment over relevant SOA delays in terms of two processing speeds. As introduced in Eqs. 2 and 3, the processing speeds can be expressed as overall processing speed, C, and salience, $\kappa _p$. These two parameters will be estimated in the empirical part of the present article.

$$\begin{aligned} P_p(v_p, v_r, \Delta t) = 1-e^{v_p |\Delta t|}+e^{v_p |\Delta t|}\left( \frac{v_p}{v_p+v_r}\right) \quad {\text {for }} \,\, \Delta t < 0 \end{aligned}$$

(4)

for negative SOAs and

$$\begin{aligned} P_p(v_p, v_r, \Delta t) = e^{v_r |\Delta t|}\left( \frac{v_p}{v_p+v_r}\right) \quad {\text {for }} \,\,\Delta t \ge 0 \end{aligned}$$

(5)

From the formalism of TVA, it is difficult to imagine how the data patterns in the TOJ depend on the salience parameter, $\kappa$, and the overall processing capacity, C. In Fig. 1 we provide a visualization of different $\kappa$ and C values within previously observed ranges. The visualized function is the psychometric function defined by Eqs. 4 and 5. The parameters C and $\kappa$ are converted to processing rates $v_p$ and $v_r$ by the Eqs. 2 and 3, respectively. Specifically, Eq. 4 describing how $P_p$, the probability of reporting probe first, depends on the processing rates of both stimuli and SOAs smaller than 0, whereas Eq. 5 shows this for the positive SOAs. So, roughly speaking, Eq. 4 defines the left half of the psychometric function and Eq. 5 the right half. These illustrations show that a change in salience and hence attention brings about a distinctly different change in the TOJ data when compared to the second parameter, overall processing capacity. If the overall processing capacity is low, more mistakes are made in the temporal discrimination leading to a shallow slope of the function. If salience and thus attention is changed, the same amount of processing resources is distributed differently. This distribution leads to a characteristic increase in correct discrimination of the attended stimulus, if it is indeed first, but increases the number of mistakes, if the attended stimulus is second. Independently of TVA, these patterns are in line with the basic mechanisms of visual attention. TVA, however, provides a quantitative model of the relationship between overall performance in temporal discrimination and attentional advantage.

We chose a hierarchical Bayesian model for the parameter estimation. Although there are different pros and cons to consider when using Bayesian methods (Dienes, 2011), Bayesian hierarchical models provide a range of advantages for cognitive modeling in general (Lee, 2011) and are particularly suitable for parameter estimation under a given model (Little, 2006). The relevant details of the Bayesian analysis as well as the original data and posterior predictive for model checking are given in the Appendix. Also, the analysis script is published online together with additional results and graphics (Krüger, 2020).

When evaluating the results of the following experiments, the skeptical reader may ask whether the TVA-TOJ model aptly describes the observed data. Therefore, we compare the results of the introduced model to a model using a logistic psychometric function that is commonly used in the analysis of TOJ data. Like the TVA-TOJ model, the logistic function has two parameters to describe the TOJ data through two relevant properties: the point of subjective simultaneity (PSS) and the just noticeable difference (JND) (Spence & Parise, 2010). The PSS describes the intersection with the .5 level (judging A before B as often as B before A), the latter describes a threshold indicating the precision with which the events can be judged. This function may be known to the reader as logit response function from generalized linear models, but can also be implemented in Bayesian statistical analysis (Kuss, Jäkel & Wichmann, 2005). For example, in Fig. 1, the PSS parameter can be read off at the intersection of the dotted horizontal line and the graph. Note however that—in contrast to the TVA-TOJ model—these parameters are not derived from a general theory of attention but merely descriptive.

Experiment 1

Experiment 1 tests empirically whether salience parameter $\kappa$ depends on the display duration of a salient contrast. To this end, a salience display was presented for five durations ranging from 50 to 800 ms.

TVA’s $\kappa$ parameter is estimated from a model of the TOJ data that comprises the overall visual processing capacity C as a second free parameter. Whereas $\kappa$ describes the relative processing advantage of the salient stimulus, C describes the overall available processing resources for processing both of the stimuli.

The expectation we formulated in the introduction was that the attentional advantage caused by salience should rise quickly. Afterward, the influence of salience on attention should decline, although a unique contrast may retain a weakened attentional advantage independent of its quantitative salience value.

Building on previous studies using TVA or a TVA-based analysis of TOJ, we can formulate expectations for the parameters precisely: For healthy adults, a processing capacity of around 60 Hz is normal (Finke et al., 2005). An overall processing capacity around 60 Hz is also reported in the TVA-based analysis of TOJ (Krüger et al., 2016; Tünnermann et al., 2017). For the salience parameter, the same orientation contrast as in the present study has been measured in multiple experiments to be 2 to 2.5 (Krüger et al., 2017).

The previous results taken together lead to our hypothesis for Experiment 1: The salience parameter $\kappa$ should initially rise to around 2.5 and decrease for longer presentation durations. The overall processing capacity C is expected to be 60 Hz and is not expected to vary across the experimental conditions.

Method

Participants

Thirty persons (15 male and 15 female; $M_\mathrm{age}= 23.27$, range 19–35) participated in Experiment 1. The size of the sample was fixed in advance and was based on earlier studies (Krüger et al., 2017). The participants were students or members of Paderborn University. Each participant gave informed written consent and reported normal or corrected-to-normal visual acuity. Participants received course credit or a payment of 8 euros per hour.

Apparatus

Experiment 1 was conducted using two Microsoft Windows 7 PCs and two Iiyama Vision Master Pro512 22 in. ($40.4~{\mathrm {cm}} \times 30.3~{\mathrm {cm}}$) CRT monitors (resolution $1024 \times 768$ pixels, 32-bit colors, refresh rate 100 Hz). The experimental procedure was implemented with OpenSesame (Mathôt, Schreij & Theeuwes, 2012) and PsychoPy (Peirce, 2007). The monitors were luminance calibrated by an x-rite colormunki display colorimeter. The viewing distance was 50 cm. Responses were given using the left ctrl key and the right enter key (number pad). Left and right responses were given with the left and right hand, respectively. The experiment was conducted in an experimental booth that was dimly lit.

Stimuli

In the beginning of each trial, a fixation cross appeared for 900 ms in the center of the screen. Afterward, a bar array of $17 \times 16$ items was shown. The size of the array comprised $34.99^\circ \times 32.93^\circ$ of visual angle. Bar length was $1.07^\circ$ and bar width $0.18^\circ$. The fixation cross replaced a bar at the horizontal center with 8 rows of bars above and 7 below. The array was drawn on a gray background, RGB (96, 96, 96) and luminance $6.98 \frac{{\rm cd}}{m^2}$ whereas bars and fixation cross were drawn in white, RGB (224, 224, 224) corresponding to $65.2 \frac{cd}{m^2}$.

Probe and reference stimulus were placed at two fixed positions on the left and on the right of the fixation cross (eccentricity $8.24^\circ$ of visual angle). The reference stimulus had the same orientation as the background elements, whereas the probe stimulus was rotated by $90^\circ$ in comparison to the background elements and hence had the maximum orientation contrast of $\Delta o = 90^\circ$. Whether probe or reference occurred on the right was balanced and randomized. The orientation of the background elements was chosen randomly.

After a fixation period of 600 ms, the salience display was presented for a display onset asynchrony (DOA) of 50, 100, 200, 400, or 800 ms before the probe and reference stimuli flickered. The display persisted until the end of the trial—only the length of the visibility prior to the TOJ was varied. The only change to the display was the brief flicker of the probe and reference stimulus. The flicker was implemented by an offset and subsequent onset, that is, the stimulus disappeared for 80 ms. The order in which the two stimuli flickered depended on an SOA of $-80$ ms, $-40$ ms, 0 ms, 40 ms, or 80 ms, negative values meaning that the probe flickered before the reference. As an exception, in the DOA 50 condition, the 80 ms SOA could not be sampled (the reference stimulus would have to be presented before the salience display) and was therefore left out. Each SOA was presented in 34 trials except for the 0 ms SOA which was presented 68 times. The sketch in Fig. 2 shows the procedure.

Procedure

Instructions were presented on the screen and questions were answered by the experimenter verbally.

The participants were instructed to look at the fixation cross as soon as it became visible at the beginning of each trial. Participants then had to judge which of two flicker events was the first and respond by indicating the respective side: left or right. The response was given by pressing the left ctrl or the right enter key with the respective hand. A new trial started automatically, but breaks were offered after 50 trials. The experiment began with a training session of 40 trials. During this training, feedback about the correctness of the participant’s judgments was given. The locations of the relevant events were learned as well as the task in general. The whole session lasted approximately 45 min.

Results and discussion

Applying the model derived from TVA, two parameters are estimated from the TOJ data for each of the five experimental conditions. A Bayesian hierarchical model is used to represent different sources of uncertainty (different repetitions, different participants, and population) according to the logic of the experimental design. The parameter estimations for the population are shown in Figs. 3 and 4—in Fig. 3 for the salience parameter $\kappa$ and in Fig. 4 for the overall processing capacity C.

The similarity of two groups can be assessed by comparing their parameter distributions: The smaller their overlap, the larger the effect. To provide an objective assessment of the difference of two conditions, we compute the standardized effect size (Cohen’s d, 1988) for each consecutive condition and compare it to a region of practical equivalence (ROPE; Kruschke, 2014) between $-0.3$ and 0.3 which is equivalent to a small effect. This constitutes a parameter-based Bayesian test (Kruschke, 2011), which checks, if at least 95% of the effect sizes that are probable after observing the data fall outside this interval so that a medium (or larger) effect is likely. If the interval of the 95% highest probability density falls completely within the ROPE, then no or a small effect is highly likely. Conversely, if this interval is completely outside the ROPE, than a medium or large effect is highly likely. This interval is reported in the square brackets. If the interval somewhat overlaps with the rope, no clear decision can be. Still, the result may be interpreted as a tendency because the maximum a posteriori estimator of the effect size (most likely effect size estimate), the number in front of the square brackets. So, roughly speaking, this procedure tests whether increasing the DOA leads to at least a medium effect between two consecutive levels. Because the effect sizes are standardized, they can be compared between different parameters, although they vary in different ranges.

The figures for salience $\kappa$ and overall processing capacity C depict different patterns. The salience parameter $\kappa$ is in the expected range, but does not exhibit a clear trend: The posterior distributions are largely overlapping. For the consecutive conditions, this is further substantiated by effect sizes that always overlap with the ROPE. The estimated effect sizes are $0.03 [-0.09, 0.15]$ for Condition DOA 100 ms and DOA 50 ms, $-0.065 [-0.18, 0.05]$ for Condition DOA 200 ms and DOA 100 ms, 0.28 [0.15, 0.41] for Condition DOA 400 ms and DOA 200 ms, $-0.16 [-0.33, 0.03]$ for Condition DOA 800 ms and DOA 400 ms. Thus, if there is a difference, it is rather small.

Although hypothesized to be constant, the processing capacity C rises steeply during the first 200 ms. The non-overlapping posterior distributions suggest that the 50 ms and 100 ms conditions are profoundly different from each other. This is again substantiated by the first two effect sizes that fall completely outside the ROPE. The estimated effect sizes are 0.86 [0.65, 1.1] for Condition DOA 100 ms and DOA 50 ms, 1 [0.83, 1.2] for Condition DOA 200 ms and DOA 100 ms, $0.14 [-0.07, 0.35]$ for Condition DOA 400 ms and DOA 200 ms, $-0.01 [-0.23, 0.2]$ for Condition DOA 800 ms and DOA 400 ms. Hence, a medium or larger effect is highly likely to occur between the first three levels of DOA. The value of the processing capacity parameter C reaches the expected value only after 200 ms but appears to be constant from there on.

According to the hypothesis, the $\kappa$ value should explain a difference between conditions. To check this, we formulate and test alternative models. First, we check whether there is a time course in salience parameter $\kappa$. We compare the model that yielded the parameter estimates to a model that assumes the same $\kappa$ value for all conditions i.e. that $\kappa$ does not change with the progression of time. This model is called the fixed-$\kappa$ model. Also, we test whether or not overall processing capacity stays constant (although given the parameter estimates, it seems extremely likely that this is false). The respective model is called the fixed-C model. We use the leave-one-out Information Criterion (looIC; Vehtari, Gelman & Gabry, 2017) that accounts for model complexity and for which small numbers indicate a better model. This comparison yields an looIC of 3896.31 for the original model, an looIC of 4019.54 for the fixed-$\kappa$ model, and an looIC of 4781.0 for the fixed-C model. For the hypothesis, this ranking means that $\kappa$ does vary over time because fixing $\kappa$ results in a worse model. Also, C exhibits a time course. Fixing the C parameter however results in a worse model than fixing k. Thus, we can conclude that a time course of salience exists, although the time course of C is more important for the observed data pattern.

Next, we compare the present model to an alternative that replaces the TVA-based psychometric function with the logistic function while retaining the same hierarchical structure. Again, we use looIC for comparison. This procedure yields an looIC of 3896.31 for the original model and an looIC of 3924.89 for the logistic function model. Priors for the logistic function model are taken from a study by Krüger et al. (2016) who provided a logistic-function-based analysis along with the TVA-based TOJ analysis. For the sake of transparency, note that the comparison of models with different parameters can depend heavily on the priors used. Therefore, we tested the robustness against different priors. It was, for example, possible to revert the order by extending the $\kappa$ prior of the TOJ-TVA model to implausible values of smaller than 1. So, a more elaborate comparison would be needed to argue for a general superiority of the fit of the TVA-TOJ model. However, we can reject the hypothesis that the analysis merely reflects an inability of the TVA-TOJ model to adequately describe the data because it is the better model for the current analysis.

Although the analysis shows that assuming no change in the salience parameter $\kappa$ is inappropriate, its change over the first second after presentation is small. Non-salient elements have a $\kappa$ value of 1 (Nordfang et al., 2013) and the estimated differences vary between 0.08 and 0.8, see Fig. 3. In comparison to the salience difference of many orientation contrasts, these differences are rather small (Krüger et al., 2017). How does such a small difference fit with previous research? The study by Donk& Soesman (2011) is closely related to the present research because they use the same experimental paradigm, the TOJ. Unfortunately, they do not report the typical descriptions of a TOJ curve which is the PSS and the JND. However, when looking at the data of Donk & Soesman (2011), the slope corresponding to JND is visually twice as shallow for the 58 ms condition in comparison to their 800 ms condition. These patterns are also visible in the present TOJ data. Donk and Soesman interpret all variance in terms of attention which is justified by their experimental design. Although mathematically not equivalent, in the present model, C determines how steep the TOJ curve is, whereas $\kappa$ is similar to PSS in the regard that both describe shifts of the function (although the shifts themselves are different from each other). We found a similarly striking difference between our 50 ms and 800 ms conditions as Donk and Soesman found between their 58 ms and 800 ms conditions. Although the data patterns are similar, interpretation is different: If these data are linked to the TVA model of visual selection, then it is rather overall visual processing capacity than attention that changes. Nevertheless, the aforementioned cueing studies and studies based on saccades suggest much more variability due to salience, which does not fit with the present TVA-based analysis.

Previous studies on the time course of salience used a blocked design (e.g. , Dombrowe et al., 2010; Donk & Soesman 2011). An explanation for the small variance in the salience parameter may be the fully randomized design because it may cause an equal temporal expectation for each trial, independent of its actual display duration. In the design of Experiment 1, the expectancy value for the display duration was 310 ms. Temporal expectations can severely affect processing speed (Vangkilde, Coull & Bundesen, 2012), and thus may have distorted the parameter estimation.

To sum up, we applied a model derived from a theory of visual selection, the TVA, to an experimental design, the TOJ, that is used to investigate the time course of visual salience. However, the overall ability to perform the TOJ, measured by the visual processing capacity, exhibits a stronger time course than the salience parameter. This result contradicts the hypothesis that only the distribution of attention caused by salience is affected by a time course. This finding does not originate from applying an inappropriate model to the data. This cause has been ruled out by a model comparison between the TVA-based model and a model that uses the common logistic function instead of the function derived from TVA. Thus, from the TVA perspective, there is rather a time course in the accuracy of the TOJ than in the relative advantage of the salient over the non-salient stimulus. To preclude the possibility of a chance finding or a biased parameter estimation because of temporal expectancy, we conducted a blocked replication study in Experiment 2.

Experiment 2

Experiment 2 was conducted to replicate the findings on processing capacity C in Experiment 2 with blocked conditions. A blocked design makes the experiment more comparable to previous studies on the time course of salience that used blocked designs (e.g. Dombrowe et al., 2010; Donk & Soesman, 2011) and prevents temporal expectations corresponding to the mean of the DOAs that can severely influence processing speed (Vangkilde et al., 2012). The blocked design should lead to a clearer trend in the salience parameter $\kappa$. Based on this change of design and the previous parameter estimation, the hypothesis is that $\kappa$ declines over time and processing capacity rises up to 200 ms and stays constant afterward.