Initial training: 1-2 sessions
Each participant first received general training in the centroid task. This was done to (1) minimize differences in the centroid computations used by different participants and (2) decrease the noise in the responses of individual participants. Subjects without any prior centroid task experience received 800 training trials in a basic centroid task (Sun et al., 2015). Each display in this training task comprised 8, square black dots, each subtending 0.3 deg. in width.
In addition, each participant received 800 trials of training with the specific stimuli used for this experiment, performing the task for 50 trials in each attention condition in each of the 1-each, 2-each, 3-each, and 4-each displays to become familiar with the experimental set up.
Testing: 4 sessions
Upon completing his/her training, each participant ran 16 blocks per experimental session. The attention condition was fixed across all 16 blocks in a given session. The display type (1-each, 2-each, 3-each, or 4-each) was fixed within a given block. Participants did these n-each blocks in order n = 1, 2, 3, 4, 4, 3, 2, 1, 1, 2, 3, 4, 4, 3, 2, 1. Each block consisted of 50 trials (for participants 1, 2, 3, and 4) or 53 trials (for participants 5, 6, 7, and 8). Forty-five of these trials were “full-set” trials of the sort described above; five other “target-only” trials contained only the target bars that would occur in a full-set trial without any of the distractor bars; and (for participants 5, 6, 7, and 8 only) each block contained three additional “singleton” trials in which the display contained only a single bar which was randomly selected to be one of the two target types (e.g., either a horizontal white or a horizontal black bar in the Attend-to-horizontal condition) whose location was distributed identically to the location of the correct response on a full-set trial.
Subjects completed four such sessions—one for each of the four attention tasks. The order of these attention tasks was counterbalanced across subjects. Prior to each block, the participant was shown a gray screen containing each of the two types of target bars as well as each of the two types of distractor bars. Trials typically took approximately 2.5 sec; there was rest between blocks, and a session consisting of 800 trials took approximately 45 min.
Participant 1 ran an additional session, in which each stimulus cloud contained 4 target bars and 16 distractor bars. This session included 16 blocks, 4 for each attention condition, and within each block there were 45 full-set trials, and 5 target-only trials.
Data analysis
We will write “vw,” “vb,” “hw,” and “hb” for the vertical-white, vertical-black, horizontal-white, and horizontal-black bar types. In the Attend-to-Property task (where Property is one of “black,” “white,” “vertical,” or “horizontal”), the participant strives to click on the centroid of two target bar types and ignore the other two distractor types. (In the Attend-to-white task, for example, the target types are vw and hw, and the distractor types are vb and hb.) Thus, on a trial in which type
i
is the bar type of the i
th item in the display and x
i
and y
i
are the x- and y-coordinates of its location, the target location (Targx, Targy ) is
$$ Tar{g}_X=\frac{1}{W}{\displaystyle \sum_{i=1}^{4n}{f}_{targ}\left(typ{e}_i\right){x}_i}\kern0.5em \mathrm{and}\kern0.5em Tar{g}_Y=\frac{1}{W}{\displaystyle \sum_{i=1}^{4n}{f}_{targ}\left(typ{e}_i\right){y}_i} $$
(2)
where f
targ
is a real-valued function of bar type that assigns equal weight to two target types and weight 0 to the two distractor types, and W is the sum of f
targ
(type
i
) taken over all 4n items i in the display.
Typically, however, the participant cannot achieve this goal; instead, the x- and y-coordinates of his/her response (R
X
and R
Y
) can be well-approximated by
$$ {R}_X=\frac{D}{W}{\displaystyle \sum_{i=1}^{4n}f\left(typ{e}_i\right){x}_i+\left(1-D\right){x}_{default}+ Nois{e}_X} $$
(3a)
and
$$ {R}_Y=\frac{D}{W}{\displaystyle \sum_{i=1}^{4n}f\left(typ{e}_i\right){y}_i}+\left(1-D\right){y}_{default}+ Nois{e}_Y $$
(3b)
where x
default
and y
default
are the x- and y-coordinates of the location toward which the participant’s response is assumed to revert if he/she extracts only partial information on a given trial, D is a real number between 0 and 1, f is a real-valued function of bar type, W is the sum of f(type
i
) over all items i in the display, and Noise
X
and Noise
Y
are normally distributed random variables with mean 0 and standard deviation σ.
The function f is called the attention filter achieved by the participant in the given condition. The values f(vw), f(vb), f(hw), and f(hb) are called the filter weights of the four bar types. It should be noted that f is only defined up to an arbitrary multiplicative constant. For plotting purposes, we impose the constraint that f(vw) + f(vb) + f(hw) + f(hb) = 1.
The parameter D is called the Data-drivenness of participant’s response. If D = 1, then the participant’s response on a given trial is determined exclusively by the items in the stimulus cloud. At the other extreme, if D = 0, then the participant’s response on a given trial is influenced not at all by the stimulus but instead by the combined effects of random noise plus a tendency to click on the fixed location (x
default
, y
default
).
Estimating model parameters
To estimate the model parameters x
default
, y
default
, f, D and σ, we proceed as follows:
-
1.
Let L
X
(L
Y
) be the N
trials
× 6 matrix whose (i,j)th entry is the sum of the x-locations (y-locations) of all items of type
j
presented on trial i for j < 5, is 1 (0) for j = 5, and is 0 (1) for j = 6.
-
2.
Let R
X
(R
Y
) be the column vector of length N
trials
whose i
th entry is the x-coordinate (y-coordinate) of the participant’s response on trial i.
Then
-
1.
Form the 2N
trials
× 6 matrix M by appending the matrix L
Y
to the bottom of L
X
.
-
2.
Form the vector R of length 2N
trials
by appending R
Y
to R
X
.
-
3.
Perform linear regression to derive the weights W minimizing SS
Residual
= ||MW-R||2
.
Then, writing n
j
, j = 1, 2, 3, 4, for the number of items of type j in each stimulus cloud in a given condition, D (Data-drivenness) is estimated by
$$ D={\displaystyle \sum_{j=1}^4{W}_j{n}_j} $$
(4)
For j = 1, 2, 3, 4, f(j) is estimated by
$$ f(i)=\frac{W_j}{{\displaystyle {\sum}_{i=1}^4{W}_i}} $$
(5)
σ
2 is estimated by taking
$$ {\sigma}^2=\frac{S{S}_{residual}}{df},\kern1em df=2N-6 $$
(6)
(where model parameters x
default
, y
default
, and D absorb 3 degrees of freedom and the attention filter f absorbs only 3 additional degrees of freedom, because it is constrained to sum to 1). Finally, x
default
and y
default
are estimated by taking
$$ {x}_{default}=\frac{W(5)}{1-D}\kern0.5em \mathrm{and}\ {y}_{default}=\frac{W(6)}{1-D} $$
(7)
A formal justification of this modeling method and a description of the methods used to estimate confidence intervals for model parameters is provided in Sun et al. (2015).
Quantifying performance
Note that actual performance (as described by Eq. (2)) can deviate from target performance (Eq. (1)) for several reasons. We quantify these deviations using:
-
1.
Imperfect Data-drivenness. The Data-drivenness D of the participant’s responses can be less than 1.
-
2.
Filter mismatch. The attention filter f achieved by the participant can deviate from the target filter f
targ
. This will cause the responses of the participant to deviate systematically from the correct responses. To quantify the degree to which the responses of the participant are immune from this sort of error, we use two descriptors of the participant’s attention filter f. The first, called the Selectivity, is a ratio—the participant’s attention weight for targets divided by the attention weight for distracters. Selectivity is given by
$$ Selectivity=\frac{f\left({t}_1\right)+f\left({t}_2\right)}{\left|f\left({d}_1\right)\right|+\left|f\left({d}_2\right)\right|} $$
(8)
where t
1
and t
2
(d
1
and d
2
) are the two item-types designated as targets (distractors) in the given attention condition; i.e., f
targ
(t
1
) = f
targ
(t
2
) = 0.5, and f
targ
(d
1
) = f
targ
(d
2
) = 0. When the attention filter achieved by a participant in a given condition closely approximates the target filter, Selectivity becomes very large. For this reason, it is convenient to plot log10 (Selectivity) rather than Selectivity.
The second, which we call Filter-fidelity, is a measure of how far away from the ideal filter (ϕ) the achieved filter (f) is. This is compared to how far away the worst possible filter a participant can get (f
Worst
) is from the ideal filter. Specifically, we obtain Filter-fidelity with the formula
$$ Fidelity=1-\frac{\left\Vert f-\Phi \right\Vert }{\left\Vert {f}_{Worst}-\Phi \right\Vert } $$
(9)
Notice that when the participant perfectly matches this target filter, this will be 1, and when the participant perfectly matches the worst possible filter, this will be 0.
-
3.
Random noise. The standard deviation σ of the random variables Noise
X
and Noise
Y
is nonzero. This will cause the responses of the participant to deviate randomly from the correct responses. Although σ itself could be used to gauge the amount of random error corrupting the participant’s responses, this model parameter is difficult to interpret, because it depends on several factors (such as the size of stimulus display clouds) that are likely to vary across different experiments. To facilitate comparison of results across experiments, we use a descriptor called Efficiency to quantify immunity to random error. Efficiency is the greatest lower bound on the proportion of display items that the participant must be using to compute his/her centroid estimates. If the trial-to-trial random errors Noise
X
and Noise
Y
were due solely to missing (i.e., failing to include) some of the items in the display in computing the centroid, then the participant would need to miss a proportion p = 1 – Efficiency of the items on each trial in order for Noise
X
and Noise
Y
to have standard deviation σ. Thus, if the participant were to attain an Efficiency of 0.75, this would imply that he/she is including, on average, in his/her centroid computation at least three-quarters of the items in the stimulus display. It should be emphasized, however, that if some of the random noise corrupting responses were due to some other source, such as (1) early perceptual noise, or (2) instability in the centroid computation, or (3) motor noise, than the actual proportion of items included in computing the centroid would be higher than Efficiency.