Contralateral delay activity tracks the influence of Gestalt grouping principles on active visual working memory representations

Peterson, Dwight J.; Gözenman, Filiz; Arciniega, Hector; Berryhill, Marian E.

doi:10.3758/s13414-015-0929-y

Contralateral delay activity tracks the influence of Gestalt grouping principles on active visual working memory representations

Published: 28 May 2015

Volume 77, pages 2270–2283, (2015)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Contralateral delay activity tracks the influence of Gestalt grouping principles on active visual working memory representations

Download PDF

Dwight J. Peterson¹,
Filiz Gözenman²,
Hector Arciniega² &
…
Marian E. Berryhill²

3130 Accesses
33 Citations
1 Altmetric
Explore all metrics

Abstract

Recent studies have demonstrated that factors influencing perception, such as Gestalt grouping cues, can influence the storage of information in visual working memory (VWM). In some cases, stationary cues, such as stimulus similarity, lead to superior VWM performance. However, the neural correlates underlying these benefits to VWM performance remain unclear. One neural index, the contralateral delay activity (CDA), is an event-related potential that shows increased amplitude according to the number of items held in VWM and asymptotes at an individual’s VWM capacity limit. Here, we applied the CDA to determine whether previously reported behavioral benefits supplied by similarity, proximity, and uniform connectedness were reflected as a neural savings such that the CDA amplitude was reduced when these cues were present. We implemented VWM change-detection tasks with arrays including similarity and proximity (Experiment 1); uniform connectedness (Experiments 2a and 2b); and similarity/proximity and uniform connectedness (Experiment 3). The results indicated that when there was a behavioral benefit to VWM, this was echoed by a reduction in CDA amplitude, which suggests more efficient processing. However, not all perceptual grouping cues provided a VWM benefit in the same measure (e.g., accuracy) or of the same magnitude. We also found unexpected interactions between cues. We observed a mixed bag of effects, suggesting that these powerful perceptual grouping benefits are not as predictable in VWM. The current findings indicate that when grouping cues produce behavioral benefits, there is a parallel reduction in the neural resources required to maintain grouped items within VWM.

Electrophysiological correlates of the flexible allocation of visual working memory resources

Article Open access 19 December 2019

Does perceptual grouping improve visuospatial working memory? Optimized processing or encoding bias

Article Open access 08 July 2021

Attentional prioritisation and facilitation for similar stimuli in visual working memory

Article Open access 12 January 2023

Extending VWM capacity limitations by Gestalt principles of grouping

Visual working memory (VWM) underlies the temporary storage and active manipulation of visual information. The amount of information from our visual world that can be stored in VWM, however, is extremely limited (Baddeley, 2010; Hollingworth, 2006; Simons & Levin, 1997; Simons & Rensink, 2005). Behavioral estimates of capacity suggest an upper limit of ~3–4 items, though this number is debated (Bays & Husain, 2008; Cowan, 2001; Fukuda, Awh, & Vogel, 2010; Luck & Vogel, 2013). Converging evidence from electroencephalography (EEG), event-related potentials (ERP), and functional magnetic resonance imaging (fMRI) suggests that VWM capacity is constrained by the availability of a finite amount of neurobiological resources required to store information (Anderson, Vogel, & Awh, 2011; Todd & Marois, 2004; Vogel & Machizawa, 2004; Xu & Chun, 2006).

These VWM capacity limitations have prompted researchers to determine their basis and to identify ways of maximizing VWM. One area of interest has been to test whether performance benefits observed in perception extend to VWM. Specifically, Gestalt principles of grouping (e.g., similarity, proximity, common fate, continuity) and related grouping cues (e.g., uniform connectedness) facilitate perceptual performance (Palmer & Rock, 1994; Wertheimer, 1950; for a recent review, see Wagemans et al., 2012) by preattentively (Duncan, 1984; Duncan & Humphreys, 1989; Kahneman & Treisman, 1984; Moore & Egeth, 1997; Neisser, 1967; but see Mack & Rock, 1998; Mack, Tang, Tuma, Kahn, & Rock, 1992) parsing visual scenes into component objects (Duncan, 1984; Kahneman & Henik, 1977; Neisser, 1967). Given this perceptual benefit, it is logical to anticipate that these cues would preferentially bias grouped items’ entry into VWM (Woodman, Vecera, & Luck, 2003) and improve VWM performance. In describing the current suite of experiments, we borrow terms from the perceptual organization literature when referring to specific “grouping cues” and their impact on VWM processes when referring to “grouping-related benefits.”

Indeed, the VWM literature contains reports of improved VWM performance when grouping cues are available (strong collinearity: Anderson, Vogel, & Awh, 2013; similarity: Gao et al., 2011; Lin & Luck, 2009; Morey, Cong, Zheng, Price, & Morey, 2015; Quinlan & Cohen, 2012; similarity and proximity: Brady & Tenenbaum, 2013; Peterson & Berryhill, 2013; Shen, Yu, Xu, & Gao, 2013; amodal completion: Walker & Davies, 2003; connectedness and proximity: Woodman et al., 2003; Xu, 2006; common region: Xu & Chun, 2007; depth cues: Kristjánsson, 2006; and contextual grouping: Jiang, Chun, & Olson, 2004). Grouping may also add noise to estimates of VWM capacity because some paradigms introduce incidental grouping cues (e.g., similarity, proximity) by choosing stimuli with replacement (e.g., Luck & Vogel, 1997). Incidental grouping provides similarity and proximity cues and improves VWM performance compared to arrays without grouping cues (Brady & Tenenbaum, 2013). Furthermore, grouping cues may not provide equal benefits and may not reflect the same underlying mechanism(s).

The underlying mechanism(s) are unknown. One proposed explanation is that grouping reduces the neural requirements to maintain items in VWM, potentially by providing a redundant signal benefit or by taking advantage of object-based attention. VWM delay-related activity is lower for stimulus arrays containing grouped items in posterior regions measured using ERP (Gao et al., 2011) and fMRI (e.g., intraparietal sulcus, connectedness: Xu, 2008; common region: Xu & Chun, 2007; but see Berryhill, Peterson, Jones, & Stephens, 2014). One way of monitoring VWM contents is by measuring the contralateral delay activity (CDA). The CDA is an ERP waveform derived by subtracting the activity associated with ipsilateral hemifield stimulation from the activity associated with contralateral hemifield stimulation. It is calculated from posterior electrode sites, emerges around 400 milliseconds poststimulus onset, is sustained during the VWM maintenance period, increases in amplitude with VWM load, and reaches asymptote at an individual’s capacity limit (Vogel & Machizawa, 2004). Source localization suggests it arises from the area in and around the superior parietal lobe (Gao et al., 2011). Grouping also reduces CDA amplitudes. For instance, when distinct stimuli (e.g., wrench heads) were oriented to form collinear groups, the CDA amplitude was significantly smaller than when the same stimuli were rotated (Anderson et al., 2013). In addition, the CDA amplitude is equal when storing one item of one color (e.g., one blue square) or four identical items (e.g., four blue squares; Gao et al., 2011).

As noted, the literature contains many reports of VWM benefits from established perceptual grouping cues. However, these grouping benefits do not appear to be of equal robustness in VWM as in perception, as evidenced by failures to replicate VWM benefits (e.g., common region: Berryhill et al., 2014; for a successful replication in similarity, see Peterson & Berryhill, 2013). Second, VWM grouping benefits are likely not produced by a single neural mechanism. Here, we use the CDA to serve as a measure of the neural resources devoted to VWM that can clarify the mechanism(s) associated with any observed grouping benefits. In other words, leveraging the benefits of grouping cues may reveal where cognitive savings can be made to enhance VWM.

Experiment 1 examined the influences of grouping via similarity of color and spatial proximity. One recent study found that similarity alone did not modulate CDA amplitude (Shen et al., 2013). However, spatial proximity was not controlled for, as the locations of similar items were randomized. Previously, we found that proximity aids in developing a VWM benefit from similarity (Peterson & Berryhill, 2013; but see Morey et al., 2015). In Experiments 2a and 2b, we turned to a second well-established grouping cue, uniform connectedness, because it also has been reported to provide VWM benefits (Woodman et al., 2003; Xu, 2006; but see Berryhill et al., 2014), which may rely on regions within the posterior parietal cortex (e.g., Xu, 2008). Finally, in Experiment 3, we measured the consequences of combining similarity and uniform connectedness in the same study to examine the relative and potential superadditive benefits to VWM. We predicted that if behavioral benefits to VWM emerged, they would be accompanied by a reduction in CDA amplitude for grouped compared to ungrouped arrays of the same set size if they reflected the same underlying neural mechanism, and a disconnect between behavioral and CDA patterns if there were different underlying neural mechanisms.

Experiment 1: Similarity and proximity

Method

Participants

Twenty-two undergraduate students from the University of Nevada, Reno, participated in Experiment 1 (14 female, mean age = 22.2 years). To enter group-level analyses, we imposed a minimum average behavioral performance criterion of 70% response accuracy. As such, seven participants were excluded. All participants were right-handed, neurologically intact, and had normal or corrected-to-normal color vision. The Institutional Review Board at the University of Nevada, Reno, approved all experimental protocols.

Apparatus

The experimental task and stimuli were generated and presented with MATLAB (Mathworks, Natick, MA) using the Psychophysics Toolbox 3.0 extension (Brainard, 1997; Pelli, 1997). Stimuli were displayed on a 19-inch NEC MultiSync E1100 CRT monitor (refresh rate of 75 Hz at a resolution of 1024 × 768) via a Mac mini 2.5 GHz dual-core Intel Core i5.

Stimuli and procedure

Colored squares (0.7 × 0.7°) were randomly chosen from a set of seven colors (cyan, white, red, blue, yellow, green, magenta). Stimuli were presented symmetrically at three possible locations on each side of fixation (5.2°); see Fig. 1a. There were four conditions. In the two ungrouped item condition (2-UG), two differently colored squares were presented in two of the three locations. In the three ungrouped item condition (3-UG), three differently colored squares were presented. For the grouped conditions, two of the three squares shared the same color and were considered three items grouped via similarity and strong proximity (3-SSP) when the matching squares were neighbors, and three items grouped via similarity and weak proximity (3-SWP), when the matching squares were separated by a nonmatching square.

Trials began with the presentation of a black fixation cross (0.4° × 0.4°, 300 ms), followed by the presentation of a left or right black arrow (2.1° × 0.4°, 200 ms) above fixation cueing the side of the array to covertly attend during encoding. After a variable delay (300–400 ms) the VWM array was presented (100 ms). Stimuli were presented in two rectangular areas subtending 7.1° × 12.2° of visual angle centered 4.6° to the left or right of the fixation cross on a uniform medium gray background. Participants viewed the stimuli from a distance of 57 cm. After a delay-period (900 ms), a single probe stimulus appeared (3 s). A single probe was used to keep the current task design consistent with previous research examining the impact of grouping cues on the CDA (Gao et al., 2011; Shen et al., 2013) and to prevent participants from making their decision based on the initial spatial configuration between items (Jiang, Olson, & Chun, 2000). Participants indicated whether the color of the probed item matched (“o” key, 50%) or mismatched (“n” key) the original stimulus item. If no response was registered, the trial was considered incorrect and screen instructions appeared, stating to press any key to continue to the next trial. Prior to beginning the experiment, participants completed 24 practice trials. Thirteen blocks (with the opportunity for self-paced breaks between blocks) of 48 trials per block were presented for a total of 624 trials, with 156 trials per condition. Trial types were randomly interleaved within each block. Participants were instructed to maintain fixation and to avoid voluntary eye movements. Performance measures included accuracy, reaction time and VWM capacity estimates: K = Set size*(Hit rate – False alarm rate); (Cowan, 2001; Pashler, 1988).

Electroencephalography recordings

The EEG was recorded at a sampling rate of 1000 Hz with a vertex (Cz) reference from 256 high-impedance electrodes mounted in a HydroCel Geodesic Sensor Net amplified by a Net Amps 300 amplifier and acquired using Net Station 4.5.5 software (Electrical Geodesics Inc., Eugene, OR) running on a 2.7 GHz dual-core Apple Power Mac G5. EEG data were re-referenced off-line to the average of the left and right mastoids. Individual continuous EEG datasets were filtered using finite impulse response (FIR) filters high-passed at 0.01 Hz and low-passed at 30 Hz off-line. The EEG data corresponding to correct trials were segmented by condition using an epoch of 200 ms (baseline period) before stimulus onset and ending 1,000 ms after stimulus onset.

Artifact detection and rejection routines eliminated eye movements, blinks, and channel loss. The electrode sites around the eyes provided data for artifact detection and rejection routines. Segments contaminated by blinks or eye movements (>1°) were rejected prior to averaging (threshold criteria: eye movements: 20 μV, blinks: 150 μV). Trials were excluded if they contained residual artifacts (e.g., ocular artifact, movement artifact, amplifier saturation) exceeding ±75 μV from 600 ms prestimulus to 1,000 ms poststimulus onset. Bad channels (e.g., > 50 KΩ electrode impedance, line noise, drift) were detected and replaced using interpolation algorithms implemented by Net Station 4.5.5. software (Electrical Geodesics Inc., Eugene, OR). Finally, the segmented EEG data for correct trials from each condition were averaged to generate ERPs for each participant and each condition. Baseline correction was performed using the baseline period.

CDA analysis

The CDA is calculated as a difference score between responses to stimuli presented to the contralateral and ipsilateral hemifields (Vogel & Machizawa, 2004). For all experiments, average contralateral and ipsilateral waveforms for each condition were computed at posterior sites typically examined in CDA research (left hemisphere posterior sites: P7, PO7, TP7; right hemisphere posterior sites: P8, PO8, TP8). Difference waveforms for each electrode pair (P7–P8, PO7–PO8, TP7–TP8) were created by subtracting the average activity recorded from ipsilateral sites from the average activity recorded from contralateral sites. These data were then averaged across electrode pairs. The time window used to measure the CDA was 400–1,000 milliseconds poststimulus onset. Because the literature does not have a standard convention, we report all electrode pairs separately to facilitate comparison with other findings.

Results

Behavioral results

To examine whether the presence of similarity and proximity improved VWM performance, we analyzed several behavioral measures (e.g., accuracy, estimated capacity (K), and reaction time). As indicated by a repeated-measures ANOVA including the within-subjects factor of condition (2-UG, 3-UG, 3-SWP, 3-SSP), there was a significant difference in accuracy across conditions, F(3, 42) = 26.67, MSE = 0.022, p < .001, η _p ² = 0.66, β = 0.99. Bonferroni corrected pairwise comparisons indicated that this was driven by significantly lower performance in the 3-UG (M = 0.84) condition compared to all other conditions (2-UG = 0.91; 3-SWP = 0.93; 3-SSP = 0.91; all ps < .001); see Fig. 1b. These data confirmed a significant similarity benefit regardless of proximity, consistent with recent results (Morey et al., 2015).

A subsequent 2 × 2 repeated-measures ANOVA examined accuracy as a function of probe type (probing a grouped or ungrouped item) and strength of proximity between items (i.e., the 3-SSP and 3-SWP conditions). There was a main effect of probe type, F(1, 14) = 18.31, MSE = 0.09, p = .001, η _p ² = 0.57, β = 0.98, corresponding with higher accuracy for trials probing a grouped item (M = 0.95) than an ungrouped item (M = 0.87); see Fig. 1c. Additionally, there was a main effect of proximity, F(1, 14) = 5.44, MSE = 0.008, p = .04, η _p ² = 0.28, β = 0.58, such that accuracy for the 3-SWP condition was higher (M = 0.93) than in the 3-SSP condition (M = 0.91). Finally, there was a significant interaction between probe type and proximity, F(1, 14) = 7.35, MSE = 0.02, p = .02, η _p ² = 0.34, β = 0.71, driven by a significantly greater probe effect on 3-SSP arrays (grouped: M = 0.96, ungrouped: M = 0.85, p < .001) than the 3-SWP arrays (grouped: M = 0.95, ungrouped: M = 0.90, p = .09); see Fig. 1c.

The accuracy analyses indicated that VWM performance benefited from similarity and proximity. To examine whether grouping increased capacity we subjected the Cowan’s K values to a repeated-measures ANOVA revealed a significant difference in K across conditions, F(3, 42) = 97.54, MSE = 2.59 p < .001, η_p ² = 0.87, β = 0.99. The same differences observed in the accuracy data remained significant, for example, the increase from the 2-UG and all of the 3-item conditions (all ps < .001). In addition, there were significant differences between the 3-UG and both grouped conditions, 3-SWP and the 3-SSP, (both ps < .001), but not between the two grouped conditions (3-SWP, 3-SSP: p =.13). Higher K values were associated with grouped arrays, regardless of proximity. Finally, average reaction times for each condition were consistent and showed no grouping benefit, F(3, 42) = 2.23, p = .10; 2-UG = 1,103 ms; 3-UG = 1,159 ms; 3-SWP = 1,113 ms; 3-SSP = 1,142 ms.

Electrophysiological results

The CDA amplitudes followed the behavioral pattern. We focused on the electrode pair TP7/TP8, which produced the largest effect and an increase in CDA amplitude from two to three items; however, the pattern of results is similar at two other electrode pairs of interest (e.g., P7/P8, PO7/PO8; see Table 1). A repeated-measures ANOVA comparing the factor of experimental condition (2-UG, 3-UG, 3-SWP, 3-SSP) indicated a significant main effect in CDA amplitude across experimental conditions, F(3, 42) = 6.30, MSE = 14.31, p = .005, η_p ² = 0.31, β = 0.95, showing the expected increase in amplitude between the 2-UG and 3-UG conditions (2-UG = -0.25 μV, 3-UG = -2.28 μV, p = .006). Importantly, there was a significant increase in CDA amplitude from the 3-SSP to the 3-UG conditions (3-SSP = -0.78 μV, p = .02) and a borderline increase in CDA amplitude from the 3-SWP to the 3-UG conditions (3-SWP = -0.17 μV, p = .07). Importantly, there was no difference in the amplitude of the CDA between the 2-UG condition and either the 3-SSP or 3-SWP conditions (ps = 1); see Fig. 2.

Table 1 Electrophysiological results per electrode pair from Experiment 1

Full size table

Discussion

Experiment 1 replicates and extends previous work confirming that arrays containing similarity of color cues benefit VWM (Brady & Tenenbaum, 2013; Lin & Luck, 2009; Morey et al., 2015; Peterson & Berryhill, 2013). Behaviorally, participants performed better when storing or reporting on grouped VWM arrays than ungrouped arrays of the same set size. Importantly, the behavioral benefit to VWM was echoed in the neural data by a selective reduction in the CDA amplitude during the storage of grouped arrays. Given that the amplitude of the CDA tracks the number of distinct item identities in VWM (Gao et al., 2011; Vogel & Machizawa, 2004), the data suggest that grouped items were integrated into a single representation. Furthermore, the CDA amplitude for each grouped condition was similar to the CDA corresponding to two separate objects (2-UG condition) but not three separate objects (3-UG condition). To test the consistency of these benefits both behaviorally and mechanistically, we next investigated uniform connectedness because of previously reported benefits to VWM.

Experiment 2a: Uniform connectedness

There are several reports of a significant benefit to VWM from uniform connectedness during change detection VWM tasks (Woodman et al., 2003; Xu, 2006). However, in both cases it was clear that the connectedness benefit emerged under carefully titrated difficulty levels (e.g., only with certain set sizes). Using fMRI, Xu and Chun (2007) linked behavioral data with activation in intraparietal regions corresponding to the electrode pairs used in calculating the CDA. As such, we reasoned connectedness was a reasonable grouping mechanism to explore next. In Experiment 2a, we predicted both behavioral benefits and an accompanying reduction in the CDA. If behavioral benefits were not reflected in the CDA, we would infer a different underlying mechanism.