Spatial working memory load affects counting but not subitizing in enumeration

Shimomura, Tomonari; Kumada, Takatsune

doi:10.3758/s13414-011-0135-5

Spatial working memory load affects counting but not subitizing in enumeration

Published: 14 June 2011

Volume 73, pages 1694–1709, (2011)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Spatial working memory load affects counting but not subitizing in enumeration

Download PDF

Tomonari Shimomura^1,2 &
Takatsune Kumada¹

2111 Accesses
7 Citations
1 Altmetric
Explore all metrics

Abstract

The present study investigated whether subitizing reflects capacity limitations associated with two types of working memory tasks. Under a dual-task situation, participants performed an enumeration task in conjunction with either a spatial (Experiment 1) or a nonspatial visual (Experiment 2) working memory task. Experiment 1 showed that spatial working memory load affected the slope of a counting function but did not affect subitizing performance or subitizing range. Experiment 2 showed that nonspatial visual working memory load affected neither enumeration efficiency nor subitizing range. Furthermore, in both spatial and nonspatial memory tasks, neither subitizing efficiency nor subitizing range was affected by amount of imposed memory load. In all the experiments, working memory load failed to influence slope, subitizing range, or overall reaction time. These findings suggest that subitizing is performed without either spatial or nonspatial working memory. A possible mechanism of subitizing with independent capacity of working memory is discussed.

Small numerosity advantage for sequential enumeration on RSVP stimuli: an object individuation-based account

Article 06 November 2019

Effects of strategy on visual working memory capacity

Article 03 July 2015

The effect of working memory load on the SNARC effect: Maybe tasks have a word to say

Article 01 December 2016

When one decides how many objects make up a scene, it is well known that reaction time (RT) for responding does not increase linearly as a function of the number of objects (Jensen, Reese, & Reese, 1950; Jevons, 1871; Oyama, Kikuchi, & Ichihara, 1981; Trick & Pylyshyn, 1993, 1994). Instead, RT for enumeration typically remains roughly constant up to approximately four items in a scene; it then increases linearly with the number of to-be-enumerated items. Several lines of research indicate that this discontinuity in RT reflects the operation of two distinct psychological processes that underlie the broader phenomenon known as enumeration. For enumerating a smaller number of items, the process is rapid and accurate; this is referred to as subitizing (Kaufman, Load, Reese, & Volkmann, 1949). In contrast, in enumerating larger numbers of items (e.g., above four), the process is timeconsuming and errorprone; this has been referred to as counting. In this study, we focus upon certain factors that determine the range of subitizing.

Previous research, which has used a variety of stimulus displays to study enumeration, offers a consensus that the number of items that can be subitized is approximately four. This holds for various displays consisting of different visual features (Frick, 1987; Watson & Maylor, 2006; Watson, Maylor, & Bruce, 2005). In addition, Watson, Maylor, and Bruce (Experiment 3) examined enumeration performance with a display consisting of dots differentiated by colors. They investigated the possibility that color differences might enhance individuation among dots and,thereby, affect subitizing. However, this was not observed. Instead, the range of subitizing remained constant at around four items regardless of their colors. Trick and Pylyshyn (1993) also found a constant range of four items indicative of subitizing in a task where participants were required to enumerate target items amid distractors. They used horizontal bars as targets and vertical bars as distractors; they found that the subitizing limit of four items was pegged to targets regardless of the presence of distractors. These results are consistent with the interpretation that subitizing reflects a distinct process with a limited capacity of four items.

Three main accounts address the process underlying subitizing and its limited range. First, Trick and Pylyshyn (1993, 1994) suggested that a subitizing range is constrained by a fixed number of mental tokens termed FINSTs (fingers of instantiations; Pylyshyn, 1989; Pylyshyn & Storm, 1988). The FINST system provided for indexes to spatial positions up to a maximum of four simultaneously presented items. Once tokens are bound to items, FINSTs can keep track of the items even if the items move to different positions or change their appearance. The index then allows spatial attention to rapidly access items. The FINST hypothesis holds that if the number of items presented does not exceed a fixed number of mental tokens limit, the set of mental tokens can be allocated to each item without attention. In turn, this would result in rapid enumeration of items when four or fewer are present. In contrast, if the number of items exceeds the fixed number of tokens, additional mental processing is required. In this case, the tokens allocated to the initial four items have to be serially reallocated to additional items. This reallocation of tokens requires attention, following which serial enumeration occurs, and also involves attentional shift. In sum, the FINST account proposes that a limited number of tokens are available for use by the visual system at a given time and that this constraint is responsible for the subitizing range.

A second account proposes that the configuration of stimulus items determines a subitizing range (Mandler & Shebo, 1982). According to this account, arrays with fewer items are more likely to match a familiar, canonical pattern (e.g., three dots on a dice) that is stored in long-term memory, together with other canonical configurations. In these long-term memory representations, a distinct number is also correlated with each configuration. A match with a stored configuration is likely to be quicker in displays with fewer visual items than in displays with many items, thus leading to fast retrieval of an associated numerical value. As the number of items increases, pattern matching becomes slower, due to the complexity of both presented and stored arrays; thus, retrieval of a number becomes less efficient. This account implies that the range of subitizing is not necessarily fixed; rather, it can vary depending on whether a presented configuration of stimulus items allows for an easy pattern match. But the range tends to be fairly stable. Furthermore, the assumption that canonical configurations uniquely correspond to certain numerical values is questionable. For example, although four dots are readily mapped onto a square pattern, a configuration of four dots could also represent a triangle pattern. In this case, the presented configuration should disrupt a correct decision. Although the configuration of stimuli may contribute to the subitizing range, it is possible that this is specific to certain stimuli and to regularly presented items.

Finally, the research most relevant to the present study has been reported by Klahr (1973) and Cowan (2001). In these studies, it was proposed that subitizing range reflects limits of short-term, or working, memory. This hypothesis assumes that subitizing employs a tagging mechanism that prevents enumerating the same item more than once; limits on working memory then place constraints on how many items this tagging mechanism can track simultaneously. Therefore, working memory should allow for rapid and correct enumeration when the number of visual items does not exceed the visual working memory capacity. Furthermore, it has been suggested that the capacity of visual working memory is around four objects, which is similar to the limit of subitizing (Luck & Vogel, 1997; Pashler, 1988; Vogel, Woodman, & Luck, 2001). The comparability of these limits suggests that a common process may be involved in both subitizing and working memory. However, to the best of our knowledge, this possibility has not been examined empirically.

Apparently, subitizing range resembles the visual working memory span reported in recent studies. Luck and Vogel (1997) showed that visual working memory could retain up to about four objects, where the objects differed with regard to multiple features, such as shape, color, and so on. In their experiment, a participant had to retain (for 900 ms) the colors of all objects presented in an initial display and then report whether or not they detected a change of color in a single object in a test display. Although the color of only one object could change for test, the number of items presented in the initial or in the test display varied from 1 to 12. Results indicated that accuracy remained nearly perfect for 1–3 items but declined systematically for 4–12 items. Another experiment showed that accuracy of remembering both color and orientation of objects was comparable to accuracy of memory for color or orientation alone. No significant difference in accuracy was found between 2 and 4 items; however, significant accuracy differences were observed in comparing displays with 6 items with those displays containing 2 or 4 items. Taken together, such results suggest that nonspatial visual working memory has the capacity to simultaneously maintain up to about 4 items, but not more.

The present study directly investigated whether the subitizing range involves visual working memory. Given that previous studies showed that the subitizing range was comparable to that of working memory, our working hypothesis in this research was that visual working memory capacity determines the range of subitizing. This hypothesis is reasonable if we assume, as well, that enumeration of a set of discrete items requires remembering the locations of items that already have been enumerated in order to distinguish these items from those not yet enumerated. If remembering already enumerated locations is mediated by visual working memory, the number of items that an individual may keep track of should depend on the capacity of visual working memory—that is, its availability for enumeration. Assuming that the capacity of visual working memory is restricted to about four objects, this hypothesis predicts that subitizing will not occur beyond this limit. In other words, five or more objects cannot be simultaneously held in memory; capacity limits imply that with a large number of items, one or more will be lost, and this leads to inefficient enumeration. Thus, in this study, we examined whether a concurrent working memory load affects the range and efficiency of subitizing. If the load on working memory reduces its available capacity for enumeration, the subitizing range should decrease even when displayed items number four or fewer. This means that subitizing efficiency, which is typically indexed by the slope of an RT function over the number of items in one to three items would change accordingly (e.g., Trick, 2005; Trick & Pylyshyn, 1993; Watson & Maylor, 2006; Watson, Maylor, Allen, & Bruce, 2007; Watson et al., 2005). Thus, in this case, instead of a flat slope of the RT function over one to three items, a greater memory load will elicit a positively inclining slope indicating that more processing time is required for displays even within the conventional subitizing range.

It is possible that visual working memory consists of two different subsystems. One subsystem may involve processing spatial representations of items, and the other would be dedicated to nonspatial visual representations of object features (Baddeley & Logie, 1999; Logie, 1995). In the present study, we refer to the former subsystem as spatial working memory and the latter as nonspatial visual working memory. The present study investigated the relationship between these subsystems and subitizing.

Previous studies have suggested a relationship between enumeration and working memory. Logie and Baddeley, (1987) examined enumeration performance, as a primary task, while participants performed either articulatory suppression or tapping as a concurrent secondary task. The reasoning was that articulatory suppression and tapping should interfere with rehearsal of verbal and spatial working memory, respectively (e.g., Cocchini, Logie, Della Sala, MacPherson, & Baddeley, 2002; Logie, 1995). Logie and Baddeley found that RTs for enumeration were longer with articulatory suppression than with either the tapping task or no secondary task. However, more important for the present purpose, the effect of articulatory suppression on enumeration was found only when the number of displayed items was large, and not when the number of items fell with the conventional range for subitizing. A recent study using either articulatory suppression or tapping as a secondary task reported a similar result, showing that performance efficiency on the primary enumeration task did not suffer from either of these secondary tasks, although RTs increased, overall, for displays with all numbers of items (Trick, 2005). These studies suggest that subitizing occurs without rehearsal of the enumerated stimuli.

A different line of research also showed that “verbal” working memory was not responsible for subitizing range. Tuholski, Engle, and Baylis (2001) and Barrouillet, Lépine, and Camos (2008) investigated whether performance of the enumeration task was affected by individual differences in working memory capacity. Working memory capacity was measured with an operation span test or a reading span test. Tuholski et al. found that a group of participants with high working memory capacity enumerated faster than those with a low capacity when the task included more than four items. By contrast, they found no difference due to capacity within the subitizing range; the subitizing range was approximately four in both capacity groups. This indicated that the range of subitizing was not constrained by verbal working memory capacity.

The goal of the present study was to explore the respective roles of spatial and nonspatial working memory in enumeration. While previous research has shown that neither a rehearsal process of working memory nor the capacity of verbal working memory is responsible for subitizing, it remains unclear whether the capacity of visual working memory plays a role in subitizing. Although previous studies have reported that neither articulatory suppression tasks nor tapping tasks interfered with subitizing, these tasks interfered only with rehearsal process and did not directly affect the capacity of visual working memory. Therefore, in the present study, we focused specifically on the impact of load on visual working memory capacities in the enumeration task. Specifically, we examined participants’ performance on subitizing when visual working memory capacity was filled with concurrent visual information, using a dual-task procedure that consisted of a primary enumeration task and a secondary working memory task. In different experiments, we considered, respectively, two different working memory tasks—namely, spatial versus nonspatial tasks. If subitizing involves a specific capacity limit on working memory—that is, due to the load incurred either from a concurrent spatial or from nonspatial visual working memory—performing a working memory task (spatial or nonspatial) concurrently with an enumeration task should result in interference, evident in poorer performance on the primary task. That is, in a dual-task paradigm, if subitizing requires visual working memory capacity, performing either secondary memory task should produce a decline in the subitizing range. Specifically, not only should the range in subitizing be reduced, but also efficiency should be degraded as a function of the load imposed upon memory by a secondary task. Inference is predicted in these cases, due to a competition for common memory capacity. Alternatively, if subitizing is not associated with the limited capacity of working memory, performance of subitizing should be comparable to that without a working memory task. A similar logic of dual-task interference was used for combining a visual search task with a working memory task (Woodman & Luck, 2004). Woodman and Luck found that efficiency of detecting a target among distractors in a visual search task decreased with a concurrent spatial working memory task, where efficiency was indexed by the slope of the function relating RTs to search display size. They concluded that this was due to the search process requiring participants to maintain locations of searched items. Using a similar rationale, we presented a spatial and nonspatial working memory task concurrently with an enumeration task: Experiments 1A, 1B, and 1C used a “spatial” working memory task in which participants had to remember the information about the spatial locations of objects in a working memory task, whereas Experiments 2A and 2B used a “nonspatial” working memory task in which participants had to remember the shape of objects.

Experiment 1A

Method

Participants

Twenty-four undergraduate or graduate students (17 males and 7 females), 19–36 years of age (M = 22.0 years), participated for payment. All reported normal or corrected-to-normal vision.

Stimuli and apparatus

Stimuli were generated using MATLAB and Psychophysics Toolbox (Brainard, 1997; Pelli, 1997). They were presented on a 17-in. CRT monitor at a viewing distance of 57 cm. Samples of stimuli, as they appear within a typical trial, are given in Fig. 1a. Stimuli in a display were presented on the monitor screen with a black background and a gray fixation cross at the center that remained visible throughout a trial. Enumeration stimuli were green dots (0.8° × 0.8°), and memory items were gray squares (0.2° × 0.2°). Mask items (which followed an enumeration display) were identical to enumeration stimuli. All three types of items were presented on two different spatial scales of invisible matrixes, one embedded within the other. The larger matrix was used for the enumeration array; it was composed of 7 × 7 cells. The central 3 × 3 cell region of the larger matrix was replaced with the smaller matrix scale of 6 × 6 cells, which was used for the memory array. No items were presented in the central 2 × 2 cells of the smaller matrix, in order to prevent overlap with a fixation cross. Thus, the dots for enumeration and squares for memorization were always presented within spatially distinct regions. For the enumeration array, center-to-center distances of adjacent cells were 2.29° horizontally and 2.48° vertically. For the memory array, these were 1.37° and 1.48°, respectively. Enumeration dots were presented randomly on the cells, with jitter up to 0.64° from the center of the cell for the x-axis and y-axis. The mask array was generated using the same matrix as the enumeration array; it consisted of 40 items in all cells in which the dots could appear.

Design and procedure

This experiment included a dual-task condition and two single-task control conditions. Control conditions were enumeration-alone and memory-alone conditions. The structure of a trial sequence of displays, shown in Fig. 1a, remained the same in all three conditions; only task(s) differed as function of condition. Participants performed in all three conditions. In the two control conditions, they performed only one task, whereas in the experimental, dual-task condition, they concurrently performed both a primary (enumeration) and a secondary (memory) task. In addition, the number of enumeration items varied, ranging from one to eight.

In the dual-task condition, each trial began with a 500-ms fixation display, followed by an initial memory display, enumeration display, mask display, test memory display (interleaved blank displays had only a fixation cross), and report duration. The initial memory display comprised two sequential presentations of a square for 500 ms, separated by a 500-ms blank screen. In the dual-task condition, a participant was required to remember the locations of both squares. These items were positioned in the smaller matrix, with the constraint that each item was never presented on cells adjacent to the cell of another item. Another 500-ms blank screen display occurred and was followed by an enumeration display consisting of one to eight dots. In the dual-task condition, a participant was then asked to press the space key as soon as he/she comprehended the number of the items. This display remained on the screen either until a participant made a response or until a total of 3,700 ms had elapsed from onset of the enumeration display if the participant made no response. If no response was made to the enumeration display until the display disappeared, a warning message and beep were presented at the end of the trial. The mask display then appeared for 300 ms, after which the screen remained blank for a variable duration based on the enumeration response time to ensure a retention interval of 5,000 ms. Next, a memory test display, composed of the two squares, was presented. On half the trials, the two squares were presented at the same locations as in the initial display. On the remaining trials, one square was realigned to a new location that adjoined its original location in the matrix, with the constraint that the location was selected from the cells that were never adjacent to the other memory item. A participant made a response to indicate whether or not these test squares were at positions identical to those in the initial memory display by pressing the zero ("0") key or the period (".") key, respectively. If an incorrect response was made, a feedback beep was presented. Then the participant reported the number of enumeration items, using a numeric key. It was emphasized that the participant’s initial numerical judgment of items, given by a space key press, could not be altered. No feedback was given for this enumeration response.

In the memory-alone condition, with the same displays and trial format as in the dual-task condition, participants were instructed to perform only the memory task. They had just to respond to the onset of the enumeration display by pressing the space key; they were told to do this as quickly as possible and to ignore the number of items presented. This procedure was used to ensure that overt responses to displays during the retention interval had no effect on memory performance. This condition then provided baseline performance of the memory task.

In the enumeration-alone condition, participants performed only the enumeration task. They were told to ignore items in initial and memory test displays. This provided enumeration performance without memory load as a baseline.

Participants completed eight blocks, with 30 trials per block, for each of the dual-task and the enumeration-alone conditions and one block of 30 trials for the memory-alone condition. The order of these blocks was counterbalanced across participants.

Results

Enumeration performance

Fig. 2 shows performance on the enumeration task. Mean correct RTs (panel a) and errors (panel b) for enumeration responses are presented as a function of the number of items for both the dual-task condition and the enumeration-alone conditions.

For the dual-task condition, only trials with correct responses on both memory task and enumeration tasks were included in the RT analysis. In addition, numerosity data from trials with eight items were excluded from the following analyses, to avoid the end effect (e.g., Trick, 2005; Trick & Pylyshyn, 1993; Watson et al., 2005). RTs greater than 2.5 standard deviations from the mean were excluded from further analyses (3.2% of correct trials). Correct RTs in the enumeration task were subjected to a two-factor 2 (task condition: dual-task and enumerationalone task) × 7 (numerosity: one to seven) repeated measures ANOVA. The analysis revealed significant main effects of condition, F(1, 23) = 26.62, p < .001, and numerosity, F(6, 138) = 86.76, p < .001, and a significant interaction between them, F(6, 188) = 6.921, p < .001. RTs were shorter, overall, in the enumeration-alone condition than in the dual-task condition. A multiple comparison (Ryan’s method) on the main effect of numerosity revealed that there were no differences among numerosities within four items, but all other numerosity comparisons showed significant differences (ps < .05). The main effect of task condition was found at every point of numerosity. Although a significant interaction was found, no additional effects of numerosity were found in either of the two task conditions. To pursue this interaction, slopes of the RT functions as a function of number of items were calculated using the method of least squares.

Slopes were estimated separately for the subitizing range (one to three) and the counting range (five to seven) in each task condition; these appear in Table 1. Data from trials with four items were not included in the calculation, because the numerosity of four has been considered the boundary between subitizing and counting in previous studies (e.g., Trick & Pylyshyn, 1993; Watson & Maylor, 2006; Watson et al., 2007;Watson et al., 2005). Paired t-tests revealed a significant difference in slope between the dual-task and the enumeration-alone conditions in the counting range, t(23) = 1.86, p < .05, but not in the subitizing range, t(23) < 1, n.s.

Table 1 Subitizing and counting slopes for enumeration-alone and dual-task conditions in Experiments 1A, 1B, 2A, and 2B

Full size table

Subitizing range was determined by fitting a bilinear model to mean correct RTs for numerosities one to seven (Watson et al., 2005). This fitting yielded the slope change point of two linear functions for each participant. One participant was excluded from this analysis because the fitting failed to converge with reliable parameters. For each task condition, the estimated slope change points were averaged across participants, yielding the subitizing range. The subitizing ranges are shown in Table 2. The subitizing range was not different between these conditions, t(22) < 1, n.s.

Table 2 Subitizing ranges for enumeration-alone and dual-task conditions in Experiments 1A, 1B, 2A, and 2B

Full size table

An ANOVA was performed on the error rate transformed by means of an arcsine transformation with the same two factors as the analysis for RTs. This analysis revealed significant main effects of numerosity, F(1, 23) = 36.38, p < .001, and condition, F(6, 138) = 6.78, p < .001, and a significant interaction, F(6, 138) = 3.77, p < .005. A multiple comparison showed significant differences between conditions in the range of one to five items. The effects of numerosity were due to the differences between the largest two numerosities (i.e., six and seven) and the other smaller numerosities (i.e., one to five). This effect was more apparent in the enumeration-alone condition than in the dual-task condition. The interaction also shows that the effect of task condition was significant at each of less than six items.

Memory performance

Fig. 3 depicts accuracy in the memory task as a function of numerosity (zero to seven) in the dual-task condition; also shown is performance in the memory-alone control condition (note that this corresponds to the zero number-of-items level). A one-way ANOVA with numerosity as a main factor yielded a significant main effect, F(7, 161) = 3.09, p < .005. A multiple comparison revealed that there was no interference in memory accuracy from concurrent enumeration for up to five items but that significant differences were found for six items relative to no and two items.

Discussion

Experiment 1A examined whether spatial working memory load interfered with enumeration processes. RTs indicated that this type of memory load had an effect only on enumeration in the counting range. Importantly, it affected neither subitizing efficiency nor subitizing range. If subitizing shared processes with spatial working memory, then, in the dual-task condition, we should have observed a decrease of the subitizing range and/or a decrease of enumeration efficiency (indexed by the slope) within the subitizing range. We observed neither. These results show that subitizing is functionally independent of spatial working memory capacity.

In the counting range, memory load selectively reduced enumeration efficiency as indexed by slope. This suggests that spatial working memory is involved in “counting.” In order to enumerate more than about four items, the locations of items should be maintained in spatial working memory. Indeed, accuracy of the secondary memory task did not decrease within the subitizing range, but it did decrease with six items. It is possible that maintaining memory items competes with enumerating items for spatial working memory capacity in the counting range. This also supports the claim that counting requires some maintenance of spatial locations of visual items, using spatial working memory.

Although both enumeration efficiency and memory accuracy support the notion that spatial working memory has no role in subitizing, the results for enumeration accuracy painted a different picture. More errors of enumeration occurred in the dual-task condition than in the single-task condition, especially within the subitizing range. One may argue that this is evidence for the involvement of a spatial memory component in subitizing. However, there is another possibility, that this result may not be specific to spatial working memory but, instead, may be due to interference with the postsubitizing process that arose from the dual-task situations. We examined this possibility in Experiment 1C.

RTs on the enumeration task were relatively longer in the dual-task condition than in the enumeration-alone condition, showing that the enumeration process overall was impeded by spatial working memory load. We propose that this cost reflects general interference levied by the secondary task. Although the overall increase in RT was not considered, in previous studies, to be a direct effect of interference (e.g., Woodman & Luck, 2004; Woodman, Vogel, & Luck, 2001), it may still be argued that the longer RTs observed in the present task reflect a general interference effect on enumeration due to working memory load.

Accordingly, in Experiment 1B, we further investigated whether enumeration RT varies with load in working memory. In this experiment, the dual-task condition presents memory displays containing two, four, or six to-be-remembered spatial positions. If memory load interferes with enumeration, we would expect to find that RT increases with the amount of the load to a point where this load exceeds the capacity of working memory (Emrich, Al-Aidroos, Pratt, & Ferber, 2010). Given that working memory capacity has been estimated to be around four items, this implies that RT should be elevated as the number of to-be-remembered item positions increases from two to four items, but not for increases from four to six items. On the other hand, if enumeration is not affected by working memory load, RT should remain constant regardless of the amount of the load.