Object substitution masking and its relationship with visual crowding

Camp, Sarah Jayne; Pilling, Michael; Gellatly, Angus

doi:10.3758/s13414-017-1316-7

Published: 06 April 2017

Volume 79, pages 1466–1479, (2017)

Attention, Perception, & Psychophysics


Sarah Jayne Camp¹,
Michael Pilling¹ &
Angus Gellatly¹


Abstract

Object substitution masking (OSM) occurs when the perceptibility of a brief target is reduced by a trailing surround mask typically composed of four dots. Camp et al. (Journal of Experimental Psychology: Human Perception and Performance, 41, 940–957, 2015) found that crowding a target by adding adjacent flankers, in addition to OSM, had a more deleterious effect on performance than expected based on the combined individual effects of crowding and masking alone. The current experiments test why OSM and crowding interact in this way. In three experiments, target-flanker distance is manipulated whilst also varying mask duration in a digit identification task. The OSM effect—as indexed by the performance difference between unmasked and masked conditions—had a quadratic function with respect to target-flanker distance. Results suggest it is OSM affecting crowding rather than the converse: Masking seems to amplify crowding at intermediate target-distractor distances at the edge of the crowding interference zone. These results indicate that OSM and crowding share common mechanisms. The effect of OSM is possibly a consequence of changes to the types of feature detectors which are pooled together for target identification when that target must compete for processing with a trailing mask in addition to competition from adjacent flankers.


In standard models, visual masking is understood as a consequence of inhibition or interference associated with the mask’s spatially overlapping or adjacent edges with the target, or with the transients associated with the mask’s delayed onset (Kahneman, 1968; Breitmeyer, Hoar, Randall, & Conte, 1984). The phenomenon of object substitution masking (OSM), first reported by Enns and Di Lollo (1997), has been argued to pose a challenge to standard models. In OSM, a mask consisting of just four surrounding dots is sufficient to prevent awareness of the target when the mask lingers after the target offset, the duration of the trailing mask being associated with the strength of masking (Di Lollo, Enns, & Rensink, 2000). In OSM, the mask, in being comprised of just four dots, contains no significant overlapping or adjacent edges; the onset of the mask does not seem to play any special role (OSM occurs irrespective of whether the mask onsets simultaneously with the target or with a delayed onset). Instead it has been suggested that the processes in OSM are object based, masking being a reflection of the process by which mask and target compete with each other as separate perceptual objects for conscious representation (Enns & Di Lollo, 1997; Di Lollo et al., 2000).

The original descriptions of OSM strongly emphasised the importance of attention as a factor in masking (Enns & Di Lollo, 1997; Di Lollo et al., 2000; Di Lollo, Enns, & Rensink, 2002; Enns, 2004). The reason for this was that initial empirical studies of OSM seemed to indicate that masking only occurred when the target and mask were presented in the context of multielement displays; with just the target and mask alone, OSM—as indexed by the difference in performance between simultaneous and delayed mask offset conditions—seemed to be absent (Enns & Di Lollo, 1997). Later studies found what seemed to be a systematic relationship between set size (i.e. the number of display items) and the magnitude of OSM (Di Lollo et al., 2000; Kotsoni, Csibra, Mareschal, & Johnson, 2007).

More recently, however, a number of studies have reported results which challenge the status of attention in OSM (Argyropoulos, Gellatly, Pilling, & Carter, 2013; Filmer, Mattingley, & Dux, 2014, 2015; Filmer, Wells-Peris, & Dux, 2017; Goodhew & Edwards, 2016; Pilling, Gellatly, Argyropoulos, & Skarratt, 2014). For instance, both Argyropoulos et al. (2013) and Filmer et al. (2014) failed to observe a Set Size × Mask Duration interaction in their data. Both sets of authors claim that the interactions reported in the original experiments of Di Lollo et al. (2000) were artefactual, the product of ceiling effects in the smaller set-size conditions (particularly when set size = 1). When, as in these later studies, the discrimination task was made more difficult to bring performance in the smaller set-size conditions into a measurable range, a masking effect in these conditions became apparent. Under such conditions set size had a clear main effect on performance; however, the interaction with mask duration was no longer found. More recently, Filmer et al. (2015) showed that OSM can occur even when the target is the sole focus of attention and is presented at fixation.

Together these findings suggest that the original claims regarding the status of attention as a variable in OSM were, at best, overstated. It seems that attention has, if at all, only a small effect on OSM. Certainly, at the very least, the role of attention cannot be considered a signature aspect of the OSM phenomenon as was originally claimed.

Though the role of attention is ostensibly small, and though the presence of distractors has been demonstrated to be unnecessary for OSM to occur, recent research has suggested that distractors, where present, can influence OSM at least under some circumstances. Camp, Pilling, Argyropoulos, and Gellatly (2015), in contrast to the earlier described findings of Argyropoulos et al. (2013) and Filmer et al. (2014), found that set size reliably modulated the effect of mask duration. Although OSM occurred without distractors, adding distractors to the display reliably increased the size of OSM. However, a further experiment showed that this effect was not a consequence of the changes in set size, as Di Lollo et al. (2000) had earlier assumed. Rather, this effect was explained by the position of the distractors in the display relative to the target. Where distractors were positioned to closely flank the target location, OSM was stronger than when the distractors flanked a location opposite the target. This effect was found irrespective of overall set size. Camp et al. attributed the increased OSM which occurred with flanking distractors (hereafter ‘flankers’) to an effect of crowding on OSM.

Crowding is a well-established visual phenomenon (Levi, 2008; Whitney & Levi, 2011). One widely held theory treats crowding as a consequence of neural pooling or signal averaging. On this account the features of a target and those of sufficiently close flankers become mingled together, the result being that the visual system is unable to bind only the appropriate features to the token representation of the target (Parkes, Lund, Angelucci, Solomon, & Morgan, 2001; Levi & Carney, 2009; Greenwood, Bex, & Dakin, 2009). The interaction between OSM and crowding is interesting because it suggests the two phenomena, though distinct, share common mechanisms.

Camp et al. (2015) argued that crowding the target degraded the initial target percept and in doing so rendered it more susceptible to the trailing mask. They argued that the converse possibility, that OSM influenced crowding, was ruled out as an explanation of the interaction. This was argued on the basis of previous empirical findings and theoretical claims which suggest that OSM occurs as a later stage process than crowding within the visual processing hierarchy (Breitmeyer, 2014; Chakravarthi & Cavanagh, 2009).

Aside from OSM, some other forms of masking can influence crowding (Vickery, Shim, Chakravarthi, Jiang, & Luedeman, 2009). Vickery et al. (2009) presented a brief target in a location directly below the observer’s fixation. On unmasked trials no mask was present; on masked trials a surround ring (in a later experiment, a surround square) was presented around the target, onsetting and coterminating with it. Flankers were located at each of the four cardinal positions around the target at one of three increasing distances from the target. This flanker position manipulation was applied on both unmasked and masked trials. On unmasked trials, a classic crowding effect was observed: Accuracy was low when flankers were closest to the target and much higher at the middle and furthest distances. At these outer two distances accuracy was the same as in a baseline unmasked condition in which no flankers were present. On masked trials with flankers at the nearest position, accuracy was as low as on the corresponding unmasked trials. However, unlike for unmasked trials, accuracy remained low at the middle and furthest flanker distances compared against a no-flanker masked baseline. Thus, when the target was masked the flankers continued to have a deleterious effect on performance across a broader spatial range than they did under unmasked conditions. The authors dubbed this spatially extended crowding effect ‘supercrowding’. This effect occurred despite the fact that masking on its own had only a marginal effect on performance.

The current study had two aims. The first was to attempt to replicate the finding of Camp et al. (2015) that crowding and OSM interact. In Camp et al. crowding was only specifically manipulated in one of the four experiments. Given this, it is important to demonstrate that this interaction is a replicable one. The study’s second aim was to more thoroughly explore the nature of the interaction. Specifically, the aim was to determine if the interaction is better understood as an effect of crowding on OSM (Camp et al., 2015) or some other process, such as OSM affecting crowding (Vickery et al., 2009). Camp et al. (2015) manipulated crowding only in a coarse way; the spatial character of crowding under masked and unmasked conditions was not determined in their experiment. These limitations make it difficult to determine what the relationship between crowding and OSM actually is. The current set of experiments aimed to provide a clearer picture on this relationship by presenting a greater number of crowding conditions, ones which allowed the spatial profile of the crowding effect to be determined under masked and unmasked conditions.

Crowding is strongly sensitive to the spatial distance between the flankers and the target; indeed, crowding is typically operationalised in terms of this variable (Bouma, 1970; Whitney & Levi, 2011; Pelli & Tillman, 2008). Crowding is typically maximal when the flankers are nearest to the target, and the effect declines monotonically as the distance is increased. The critical spacing for crowding depends on target eccentricity, increasing proportionally with the distance of the target from fixation. The effective distance for crowding tends to be approximately half the target’s distance from fixation, though the range of the effect does depend on several other factors such as the position of the target and flankers with respect to fixation (Pelli & Tillman, 2008).
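
As a rough worked illustration of the proportionality described above, the following sketch computes an approximate critical spacing from target eccentricity. The function names and the default factor of 0.5 are illustrative assumptions taken from the "approximately half" rule of thumb; as noted, the true extent of crowding varies with other factors, so this should not be read as a precise model.

```python
def critical_spacing(eccentricity_deg, factor=0.5):
    """Approximate target-flanker spacing (deg) below which crowding is expected.

    `factor` is an assumed rule-of-thumb constant (about half the eccentricity);
    it is not a fitted value.
    """
    if eccentricity_deg < 0:
        raise ValueError("eccentricity must be non-negative")
    return factor * eccentricity_deg


def within_crowding_zone(spacing_deg, eccentricity_deg, factor=0.5):
    """True if flankers at `spacing_deg` fall inside the approximate crowding zone."""
    return spacing_deg < critical_spacing(eccentricity_deg, factor)


# Hypothetical example: a target 4 deg from fixation.
example_spacing = critical_spacing(4.0)
```

Under these assumptions, flankers closer than about 2° to a target at 4° eccentricity would be expected to crowd it, while more distant flankers would have little or no effect.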

If crowding interacts with OSM because crowding makes a target more susceptible to OSM then we should find a certain data pattern with respect to manipulations of target-flanker distance. Specifically, OSM should be strongest at the smallest target-flanker distance, where crowding itself is strongest; OSM should then decrease to an asymptote as target-flanker distance is increased and crowding is correspondingly diminished. If this pattern of OSM decline does not occur with respect to target-flanker distance then it would challenge the explanation offered by Camp et al. (2015) regarding the relationship between crowding and OSM. Experiment 1 assessed this possibility.

Experiment 1

In Experiment 1, four target-flanker distance conditions are given, each of which is compared against an uncrowded condition in which the flankers surround a nontarget item at the same distance. A digit identification task was given.^{Footnote 1} The target, surrounded by a four-dot mask (4DM), was presented at a random location on a virtual circle. On unmasked trials the 4DM coterminated with the target; on masked trials it lingered on-screen for a period after the target offset.

In the task on some trials two flanker digits flanked the target on either side (designated flanked-target trials). On other trials the two flankers flanked a nontarget digit located directly opposite the target on the virtual circle (designated unflanked-target trials). The distance between the flankers and the flanked item (i.e. target or nontarget) was also manipulated. Four flanker distance positions were given across both the flanked-target trials and the unflanked-target trials. This flanker distance manipulation, it was assumed, would give us a measure of the spatial profile of the flanker effect on OSM. The inclusion of the unflanked-target trials conditions reflected the same basic design given in Camp et al. (2015). These trials were included for two reasons. First, their inclusion made the experiment design a symmetrical one: For each target-flanker distance there was an equivalent control condition. Second, because of this symmetry, flankers did not potentially serve as a spatial cue to the target location as they would have done had only flanked-target trials been given.

It was predicted that OSM would be greater on the flanked target trials than on the unflanked target trials (i.e. trials where the flankers surround the nontarget), replicating the finding reported by Camp et al. (2015). A further prediction was made based on the claim stated in Camp et al. regarding the relationship between OSM and crowding. If Camp et al. are correct then OSM should be greatest when flankers were positioned closest to the target (where the crowding effect on the target was strongest) and diminish as the distance between the flankers and flanked target was increased. If this pattern is not found, then it would be evidence against their interpretation of the relationship between OSM and crowding.

Method

Participants

Thirty-five first-year Oxford Brookes Psychology students (27 female) took part in the experiment. All gave informed consent and received course credits for completing the experiment. All reported normal or corrected-to-normal visual acuity. This and all other experiments in this study received full approval by the Oxford Brookes University ethics panel.

Design

The experiment had three factors, all repeated measures: mask duration (0 ms, 180 ms), target condition (flanked target, unflanked target), and flanker distance (0.63°; 0.89°; 1.15°; 1.41°). The dependent variable was identification accuracy, measured by the percentage of correct responses.

Stimuli and procedure

The experiment was conducted in a darkened and sound-deadened room with back lighting. Stimuli were presented on a 20-inch Sony Trinitron CRT computer monitor (resolution = 1024 × 768; refresh rate = 100 Hz). The monitor was controlled by an Intel Pentium 4 (2.66 GHz) PC fitted with an NVIDIA GeForce 4 graphics card. The monitor was viewed by the participant from a distance of approximately 110 cm. Bespoke software written in the BlitzMax programming language (BlitzMax V. 1.5; Sibley, 2011) controlled all aspects of stimulus presentation, randomisation and response recording. All stimuli were black (0.03 cd/m²) on a white (97 cd/m²) background. The stimulus array always consisted of four digits (0–9) positioned on the circumference of a virtual circle around a central fixation point. Each digit was in Arial font 32 type size (a subtended visual angle of 0.47° in height). The virtual circle itself had a radius subtending 3.9° from the centre of the fixation cross to the centre of each digit. One of the four digits was designated as the target, one as the nontarget and the other two as flankers. The target was presented at a point, randomly determined on each trial, on the virtual circle. The nontarget was always presented diametrically opposite the target on the virtual circle. The target was identified in the stimulus array by the surrounding 4DM. The 4DM was arranged in a virtual square (subtending 0.89° in height/width) around the target. The dots comprising the mask were each 0.10° of visual angle in width/height.
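
For readers wishing to relate the visual angles above to physical sizes on the screen, the following sketch applies the standard visual-angle relation, size = 2 × distance × tan(angle / 2). The function names are illustrative, and the 110 cm viewing distance is the approximate value reported above, so the resulting sizes are approximate as well.

```python
import math

VIEWING_DISTANCE_CM = 110.0  # approximate viewing distance reported in the text


def size_from_visual_angle(angle_deg, distance_cm=VIEWING_DISTANCE_CM):
    """Physical extent (cm) that subtends `angle_deg` at `distance_cm`."""
    return 2.0 * distance_cm * math.tan(math.radians(angle_deg) / 2.0)


def visual_angle_from_size(size_cm, distance_cm=VIEWING_DISTANCE_CM):
    """Visual angle (deg) subtended by an object of `size_cm` at `distance_cm`."""
    return math.degrees(2.0 * math.atan(size_cm / (2.0 * distance_cm)))


digit_height_cm = size_from_visual_angle(0.47)   # digit height
mask_side_cm = size_from_visual_angle(0.89)      # side of the 4DM virtual square
circle_radius_cm = size_from_visual_angle(3.9)   # radius of the virtual circle
```

On these assumptions a digit of 0.47° corresponds to roughly 0.9 cm on the screen.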

On flanked-target trials the flankers surrounded the target location at one of four distances: 0.63°; 0.89°; 1.15°; or 1.41° (distances are expressed in units of subtended visual angle of the circumferential distances between the midpoints of the surrounded item and the flanker digits on the virtual circle).^{Footnote 2} On unflanked-target trials the flankers surrounded the non-target location, again at one of four distances: 0.63°; 0.89°; 1.15°; or 1.41° (Fig. 1 gives an example of a flanked and unflanked trial for the nearest of the four flanker distances; 0.63°).
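
The layout described above can be sketched as follows. The sketch assumes that the stated distances are arc lengths along the virtual circle (as the phrase "circumferential distances" suggests), so that the angular offset of each flanker is the arc length divided by the 3.9° radius. The coordinate convention (fixation at the origin, units of degrees of visual angle) and all function names are illustrative assumptions, not a description of the original software.

```python
import math

RADIUS_DEG = 3.9                               # radius of the virtual circle
FLANKER_DISTANCES_DEG = (0.63, 0.89, 1.15, 1.41)


def arc_to_polar_offset(arc_deg, radius_deg=RADIUS_DEG):
    """Polar angle (radians, about fixation) spanned by an arc of length `arc_deg`."""
    return arc_deg / radius_deg


def point_on_circle(theta, radius_deg=RADIUS_DEG):
    """Cartesian position (deg of visual angle) at polar angle `theta`."""
    return (radius_deg * math.cos(theta), radius_deg * math.sin(theta))


def layout(target_theta, arc_distance_deg, flank_target=True):
    """Positions of target, nontarget and the two flankers for one trial."""
    nontarget_theta = target_theta + math.pi   # diametrically opposite the target
    flanked_theta = target_theta if flank_target else nontarget_theta
    offset = arc_to_polar_offset(arc_distance_deg)
    return {
        "target": point_on_circle(target_theta),
        "nontarget": point_on_circle(nontarget_theta),
        "flankers": [point_on_circle(flanked_theta - offset),
                     point_on_circle(flanked_theta + offset)],
    }


def chord(p, q):
    """Straight-line distance between two positions."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


nearest = layout(target_theta=0.0, arc_distance_deg=0.63)
```

At these small separations the straight-line (chord) distance between target and flanker is only marginally smaller than the arc distance, so the choice between the two measures makes little practical difference.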

The identity of the target digit was randomly determined on each trial with the constraint that each of the 10 digits appeared with equal frequency for all trial types. The identity of the nontarget and flanker digits on each trial was determined randomly with replacement. A schematic depiction of an example trial sequence is shown in Fig. 1. All trials started with the onset of a blank white screen presented for 500 ms. A frame was then shown in which the fixation cross alone was presented for 250 ms. The onset of this frame was accompanied by a brief alerting tone. The stimulus array was presented with the 4DM surrounding the target digit. The stimulus array frame was shown for 40 ms. Then both the stimulus array and mask disappeared from screen (0-ms trailing mask), or the stimulus array disappeared but the mask remained for a further 180 ms (180-ms trailing mask). The fixation cross was present on-screen throughout these frames and remained visible until the participant responded. The task was to identify the target digit. Participants responded by pressing the corresponding key (0–9) on a standard computer keyboard. Immediate aural error feedback was given following an incorrect response. The participant’s response instigated the start of a new trial.
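
Because the display was a CRT refreshing at 100 Hz, each of the durations above corresponds to a whole number of 10-ms frames. The sketch below makes that conversion explicit; the function and frame-label names are illustrative assumptions and do not reflect the original BlitzMax code.

```python
REFRESH_HZ = 100  # monitor refresh rate reported in the text


def ms_to_frames(duration_ms, refresh_hz=REFRESH_HZ):
    """Convert a duration to a whole number of refresh frames.

    Raises ValueError if the duration is not an exact multiple of the
    frame period, since such a duration cannot be shown exactly.
    """
    frames = duration_ms * refresh_hz / 1000.0
    if abs(frames - round(frames)) > 1e-9:
        raise ValueError(f"{duration_ms} ms is not a whole number of frames")
    return int(round(frames))


def trial_schedule(mask_duration_ms):
    """Frame counts for each display phase of one trial, in order."""
    return [
        ("blank_screen", ms_to_frames(500)),
        ("fixation_only", ms_to_frames(250)),
        ("array_with_mask", ms_to_frames(40)),
        ("trailing_mask", ms_to_frames(mask_duration_ms)),
    ]


unmasked = trial_schedule(0)
masked = trial_schedule(180)
```

On this reading the stimulus array occupies 4 frames and the trailing mask either 0 or 18 frames.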

There were 640 trials in total, 40 trials for each combination of mask duration, target condition, and flanker distance. Trials were presented in 10 blocks of 64 trials. The computer prompted the participant to take a brief break after each 64 trial increment. Five demonstration trials presented at a slowed speed and 30 practice trials given at the real speed of the experiment were undertaken prior to the start of the experiment. Participants were instructed to emphasise accuracy in responding. The total session lasted approximately 30 minutes.
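
The trial structure can be checked with a short sketch: 2 mask durations × 2 target conditions × 4 flanker distances gives 16 cells, and 40 trials per cell gives 640 trials, or 10 blocks of 64. The code below builds such a list, with each target digit appearing 4 times per cell. Shuffling across the whole session before splitting into blocks is an assumption made here for simplicity; the text does not state how trials were ordered within or across blocks, and all names are illustrative.

```python
import itertools
import random
from collections import Counter

MASK_DURATIONS_MS = (0, 180)
TARGET_CONDITIONS = ("flanked-target", "unflanked-target")
FLANKER_DISTANCES_DEG = (0.63, 0.89, 1.15, 1.41)
TRIALS_PER_CELL = 40
BLOCK_SIZE = 64


def build_blocks(seed=None):
    """Return a list of blocks, each a list of trial dictionaries."""
    rng = random.Random(seed)
    trials = []
    for mask_ms, condition, distance in itertools.product(
            MASK_DURATIONS_MS, TARGET_CONDITIONS, FLANKER_DISTANCES_DEG):
        # Each of the 10 digits serves as target equally often in every cell.
        for digit in list(range(10)) * (TRIALS_PER_CELL // 10):
            trials.append({"mask_ms": mask_ms, "condition": condition,
                           "distance_deg": distance, "target_digit": digit})
    rng.shuffle(trials)  # assumed: one shuffle over the whole session
    return [trials[i:i + BLOCK_SIZE] for i in range(0, len(trials), BLOCK_SIZE)]


blocks = build_blocks(seed=1)
cell_counts = Counter((t["mask_ms"], t["condition"], t["distance_deg"])
                      for block in blocks for t in block)
```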

Results

Figure 2a gives the mean percentage correct responses for all conditions; Fig. 2b shows the masking strength in the different target conditions (masking strength is calculated by subtracting performance in the 180-ms mask duration trials from the corresponding 0-ms trials). A three-way repeated-measures ANOVA was performed to analyse the data. The three factors were mask duration (0 ms, 180 ms), target condition (flanked-target, unflanked-target), and flanker distance (0.63°; 0.89°; 1.15°; 1.41°). Significant main effects were found for all three factors: mask duration, F(1, 34) = 212.77, MS_error = 50.15, p < .001, η_p² = .86; target condition, F(1, 34) = 174.56, MS_error = 220.14, p < .001, η_p² = .84; and flanker distance, F(3, 102) = 7.08, MS_error = 44.46, p < .001, η_p² = .17.
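
The masking-strength index defined above is a simple difference score, which can be sketched as follows. The accuracy values in the example are made-up placeholders chosen only to show the calculation; they are not data from this experiment, and the function name is an illustrative assumption.

```python
def masking_strength(acc_0ms, acc_180ms):
    """Masking strength: accuracy with 0-ms mask minus accuracy with 180-ms mask.

    Both arguments are percentages correct for the same target condition
    and flanker distance. Positive values indicate masking.
    """
    return acc_0ms - acc_180ms


# PLACEHOLDER accuracies (percent correct), NOT values from the study:
# keys are (target condition, flanker distance in deg),
# values are (0-ms accuracy, 180-ms accuracy).
placeholder_accuracy = {
    ("flanked-target", 0.63): (50.0, 40.0),
    ("unflanked-target", 0.63): (70.0, 65.0),
}

placeholder_strength = {
    cell: masking_strength(unmasked, masked)
    for cell, (unmasked, masked) in placeholder_accuracy.items()
}
```

A Mask Duration × Target Condition interaction corresponds to this difference score being reliably larger in one target condition than in the other.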

A significant two-way Mask Duration × Target Condition interaction was observed, F(1, 34) = 5.44, MS_error = 50.54, p = .026, η_p² = .14. This reflects the fact that masking was stronger when the flankers surrounded the target than when they surrounded the nontarget. This interaction supports our first prediction; it replicates the finding reported by Camp et al. (2015). The two-way Target Condition × Flanker Distance interaction was also significant, F(3, 102) = 11.72, MS_error = 47.26, p < .001, η_p² = .26. This interaction simply reflects the fact that variation in flanker distance has a greater effect on accuracy on flanked-target trials than on unflanked-target trials. The two-way Mask Duration × Flanker Distance interaction was not significant, F(3, 102) = 1.47, MS_error = 40.63, p = .226. The three-way Mask Duration × Target Condition × Flanker Distance interaction did not approach significance, F(3, 102) = 0.61, MS_error = 50.40, p = .609.
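
As a consistency check on the reported effect sizes, partial eta squared can be recovered from each F ratio and its degrees of freedom using the standard identity η_p² = (F × df_effect) / (F × df_effect + df_error). The sketch below applies this to the significant effects reported for Experiment 1; the function name and the layout of the table are illustrative assumptions.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F ratio and its degrees of freedom."""
    numerator = f_value * df_effect
    return numerator / (numerator + df_error)


# (effect, F, df_effect, df_error, reported partial eta squared)
reported_effects = [
    ("mask duration", 212.77, 1, 34, 0.86),
    ("target condition", 174.56, 1, 34, 0.84),
    ("flanker distance", 7.08, 3, 102, 0.17),
    ("mask duration x target condition", 5.44, 1, 34, 0.14),
    ("target condition x flanker distance", 11.72, 3, 102, 0.26),
]

recomputed = {
    name: round(partial_eta_squared(f, df1, df2), 2)
    for name, f, df1, df2, _ in reported_effects
}
```

Each recomputed value agrees with the reported one to two decimal places.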

Discussion

Our first prediction, of an interaction between target condition (whether the flankers surrounded the target or the nontarget) and mask duration, was supported. The interaction reflects the fact that masking tended to be stronger when flankers surrounded the target location than when they surrounded the nontarget. This finding replicates the findings reported by Camp et al. (2015).

The second prediction was that OSM would be greatest when the flankers were located nearest to the target and would diminish as flanker distance was increased. The data did not support this. First, the trend was, if anything, in the opposite direction: for flanked-target trials slightly more masking was observed at the largest (1.41°) than at the smallest (0.63°) flanker distance. Second, and unexpectedly, flanker distance had at least as much of an effect on masking on unflanked-target trials as it did on flanked-target trials (see Fig. 2b). We refrain from further interpretation of these results at this stage, other than to state that the pattern of data obtained was inconsistent with the hypothesis proposed by Camp et al. (2015) that crowding acts on OSM.

Given the pattern of the data obtained in Experiment 1, Experiment 2 looked at the effect of flanker distance on OSM over a much larger spatial range. This was done to obtain a clearer picture of the relationship between these variables. In Experiment 1 a distinction was made between flanked-target trials and unflanked-target trials. It should be noted that the distinction was somewhat arbitrary given that all stimuli are positioned on the same virtual circle. This arbitrariness becomes more palpable when the distances of the flankers from the target (or nontarget) are larger as they are for Experiment 2. Consequently for Experiment 2 it was deemed more appropriate to consider flanker distance as a single continuous variable.

Experiment 2

The aim of Experiment 2 was to explore the effect of flanker distance on OSM over a larger distance range than in Experiment 1. This range covered the entire arc of the virtual circle on which the stimuli were presented. Methods were the same as in Experiment 1, except for the differences described below. The aim was to obtain a clearer indication of the relationship between flanker position and mask duration than was apparent from Experiment 1.