Working memory load predicts visual search efficiency: Evidence from a novel pupillary response paradigm

Attar, Nada; Schneps, Matthew H.; Pomplun, Marc

doi:10.3758/s13421-016-0617-8

Working memory load predicts visual search efficiency: Evidence from a novel pupillary response paradigm

Published: 27 April 2016

Volume 44, pages 1038–1049, (2016)
Cite this article

Download PDF

Memory & Cognition Aims and scope Submit manuscript

Working memory load predicts visual search efficiency: Evidence from a novel pupillary response paradigm

Download PDF

Nada Attar¹,
Matthew H. Schneps¹ &
Marc Pomplun¹

2416 Accesses
14 Citations
1 Altmetric
Explore all metrics

Abstract

An observer’s pupil dilates and constricts in response to variables such as ambient and focal luminance, cognitive effort, the emotional stimulus content, and working memory load. The pupil’s memory load response is of particular interest, as it might be used for estimating observers’ memory load while they are performing a complex task, without adding an interruptive and confounding memory test to the protocol. One important task in which working memory’s involvement is still being debated is visual search, and indeed a previous experiment by Porter, Troscianko, and Gilchrist (Quarterly Journal of Experimental Psychology, 60, 211–229, 2007) analyzed observers’ pupil sizes during search to study this issue. These authors found that pupil size increased over the course of the search, and they attributed this finding to accumulating working memory load. However, since the pupil response is slow and does not depend on memory load alone, this conclusion is rather speculative. In the present study, we estimated working memory load in visual search during the presentation of intermittent fixation screens, thought to induce a low, stable level of arousal and cognitive effort. Using standard visual search and control tasks, we showed that this paradigm reduces the influence of non-memory-related factors on pupil size. Furthermore, we found an early increase in working memory load to be associated with more efficient search, indicating a significant role of working memory in the search process.

Cognitive load theory and educational technology

Article 01 August 2019

Cognitive Architecture and Instructional Design: 20 Years Later

Article Open access 22 January 2019

Extending Cognitive Load Theory to Incorporate Working Memory Resource Depletion: Evidence from the Spacing Effect

Article 30 September 2017

Our conscious visual experience and the performance of everyday visual tasks are strongly determined by the interaction of visual attention and working memory. Both of these cognitive mechanisms have been extensively studied (for recent reviews, see Carrasco, 2011, and Baddeley, 2012). One of the most common tasks that we arguably perform thousands of times every day is visual search, which is also one of the major paradigms for studying visual attention (Eckstein, 2011). Researchers have used various visual search paradigms to gain insight into attentional selection in the visual system. In a typical visual search task, participants are asked to report whether a visually distinctive target object is present among a set of distractors in a given scene. If the target and distractors have similar visual characteristics, observers have to sequentially attend to the search items to find the target or determine its absence. Many models of visual search have been proposed, aimed at explaining the role of visual attention (e.g., Cave & Wolfe, 1990; Treisman & Gelade, 1980; Wolfe, 1994).

Despite this research effort, the role of visual working memory in visual search is still being debated. Obviously, for successful task completion, the information about the search target needs to be kept in memory during search. This working memory content guides visual attention during the search process, both in artificial search displays (e.g., Cave & Wolfe, 1990) and in natural images (e.g., Pomplun, 2006). Interestingly, memory content guides search even if it is irrelevant to the search task (Downing, 2000). It is unclear, however, whether and to what extent working memory is involved in the search process beyond target memorization. Most importantly, working memory could be used to remember those items that have already been inspected, in order to avoid another, redundant inspection of the same item. If working memory can be utilized in this way without introducing significant delay, it could substantially facilitate search.

The results of many studies have suggested a significant role of memory in visual search (e.g., Baddeley, 1986; Logie, 1995; Luck & Vogel, 1997; Peterson, Kramer, Wang, Irwin, & McCarley, 2001; Phillips, 1974), and other studies have shown that memory can assist attention during the search task (e.g., Klein & MacInnes, 1999). However, Horowitz and Wolfe (1998) claimed that there is no memory involvement in visual search. In their experiment, they asked subjects to search for the presence of a letter “T” among letters “L” in either a random or a static condition. In both conditions, a new frame occurred every 111 ms, during the final 28 ms of which the letters were individually masked. In the random condition all letters were randomly relocated in each frame, but the letters remained constant in the static condition. The static condition should have allowed subjects to keep track of the inspected items, in contrast to the dynamic condition, in which the target could have appeared at any previously scanned location. Surprisingly, search efficiency did not differ between the two conditions, as indicated by equally steep slopes for response time (RT) as a function of set size. From these results, Horowitz and Wolfe inferred that visual search has no memory.

However, Peterson, Kramer, Wang, Irwin, and McCarley (2001) questioned the conclusion that the mechanism responsible for guiding attention during search has no memory for the locations that have already been visited. In their study, they found that the pattern of object revisitations did not fit the prediction of a memoryless search model. This finding suggests that observers can keep track of what they have previously inspected during visual search. A series of further studies supported this claim by providing evidence for memory involvement in visual search and showing that working memory can influence search performance on a number of levels (see Gilchrist & Harvey, 2000; Shore & Klein, 2000).

Another approach to investigating the role of working memory in visual search is the study of the relationship between an observer’s working memory capacity and his or her visual search performance. If search performance were directly correlated with working memory capacity, this would support the hypothesis that the use of working memory facilitates visual search. Unfortunately, whereas some researchers have found such a relationship (e.g., Anderson et al., 2013; Emrich et al., 2009), others have concluded that it does not exist (e.g., Kane, Poole, Tuholski, & Engle, 2006).

It is important to note that the direct assessment of working memory load through memory retrieval tasks would interfere with the search task. Consequently, the results of such experiments would be difficult to interpret. An unobtrusive way of estimating working memory capacity and use during an ongoing task has been discovered through electroencephalography (EEG) research (e.g., Emrich et al., 2009; Vogel & Machizawa, 2004). For example, Emrich et al. (2009) investigated an electrophysiological marker of visual working memory encoding and maintenance, termed the contralateral delay activity (CDA), in the context of visual search. They found the variations in CDA amplitude to be correlated with both visual working memory capacity and visual search efficiency, indicating a direct relationship between the latter two variables.

Although the electrophysiological approach allows for visual working memory load estimation without requiring a secondary task, it does require the availability and setup of EEG equipment, which adds considerable cost and effort to any given study. To overcome this problem, a different method of estimating working memory load, which also does not require the introduction of an additional task, can be applied. This technique is based on pupillometry, whose relationship with cognitive load was first explored in the 1960s. For example, Hess and Polt (1964) measured the pupillary responses of people engaged in performing mental arithmetic problems of increasing complexity. They found that the level of difficulty of the problem correlated positively with pupil diameter (see also Ahern & Beatty, 1979; Kahneman, 1973). Extending their research, Kahneman and Beatty (1966) related the variation in pupil dilation to memory load. Their participants were asked to remember series of digits orally presented by the examiner at the rate of one digit per second. After a short pause, participants were asked to retell the verbal digits. The results showed that during the encoding of each digit being presented, there was an increase in pupil size, providing evidence of a gradual increase in memory load. Furthermore, when reproducing the digits aloud, pupil size decreased for each individual digit, reflecting a parallel decrease in memory load. In trials with a critical number of digits involved, the pupil size would remain large during the entire procedure, indicating a somewhat sustained effect of increased memory load. Many subsequent studies have further illustrated this correspondence between pupillary dilation and working memory load (e.g., Ahern & Beatty, 1979; Karatekin, Couperus, & Marcus, 2004; Nuthmann & van der Meer, 2005; Stanners, Coulter, Sweet, & Murphy, 1979).

Porter, Troscianko, and Gilchrist (2007) used pupillometry to study visual working memory load during visual search. Their visual search and counting task experiments manipulated search difficulty by varying the number of distractors as well as the heterogeneity of the distractors, and the dilatory patterns were compared between the two tasks. The results indicated an almost constant, large pupil size during the counting task. In contrast, during search, pupil size increased from the start of the trial onward, which the authors interpreted as showing increasing working memory load as the search progressed. However, although this conclusion is plausible, it was not entirely assured by their data, due to the many factors that influence pupil size. For example, the amount of effort that an observer spent finding the target could have been a contributing factor. Furthermore, the pupil response to changes in working memory load is slow (e.g., Beatty & Lucero-Wagoner, 2000). It is thus possible that the working memory load did not increase during the search trial but was constant, and that the observed gradual increase in pupil size was due only to the delayed pupil response.

To overcome this problem, in the present study we modified the search paradigm used by Porter et al. (2007). At regular time intervals, the search display was replaced with an intermittent blank screen showing only a central fixation marker. Subjects were asked to fixate the marker until the search display returned, and then to resume their search. We hypothesized that measuring pupil size during these fixation intervals should lead to more reliable estimates of working memory load than measuring pupil size during search intervals. Our reasoning was that during the fixation intervals, the only mental effort required of the subjects was the memorization of the target object features and any information about the current search progress that they were keeping track of. While fixating the marker, subjects did not spend any effort inspecting the search items and did not encounter any target-relevant information, and therefore these two factors could not influence pupil size. It is also sensible to assume that, during these intervals, the absence of any cognitive task except the maintenance of working memory would induce no systematic variation in arousal levels that could interfere with memory load assessment. In addition, the fixation intervals were relatively long (3 s), so that the pupil size could approximate its equilibrium value. We also ensured that the average display luminance remained constant throughout all screens used in the experiment. This requirement minimized light-induced changes in pupil size that would otherwise have added noise to our measurements.

In our experiments reported here, we used this paradigm to test whether the use of working memory beyond memorizing the search target features is important for efficient search. Such working memory use would be indicated by increasing memory load during the course of a search trial. Therefore, if this memory use facilitates search, we would expect to find a negative correlation of the increase in working memory load and the time taken to find the target. In other words, in those trials in which an observer used his or her working memory more strongly, leading to a greater increase in working memory load, we would expect that this memory use would facilitate search. This facilitation should, on average, lead to faster target detection, indicated by shorter RTs. Since visual search trials could vary greatly in their duration, it was a sensible approach to measure the increase in working memory load early during the trial, in order to collect a maximum amount of data during an experiment. Consequently, we computed the amount of early memory load increase as the change in load between the first and second fixation screens in a trial. As we noted above, increasing working memory load tends to be accompanied by dilation of the observer’s pupils. Therefore, we used as an indicator of working memory load increase the average pupil size during the second fixation display minus the one measured during the first fixation display. As a control, we also computed this value for the first and second search screens in each trial.

The correlation of the resulting load difference with the subject’s RT in the same trial was then computed to test the hypothesis that a greater memory load increase would lead to more efficient search. Such an effect would be indicated by a negative correlation coefficient. Furthermore, if our new paradigm were effective at reducing the interference of other cognitive processes with the measurement of working memory load, this inverse correlation should be stronger when measuring pupil size during the fixation phases than during the search phases.

To gather baseline data for our paradigm, in Experiment 1 we used a modified visual search task that strictly required working memory encoding for its successful completion. In this task, the search items in each stimulus display were arranged randomly on four (invisible) circles, each of which included a random number of visually distinguishable target items. After subjects had indicated task completion, they were asked to report the number of targets on each circle. Clearly, to perform the task in an efficient manner, subjects needed to scan the display circle by circle and keep in memory the number of targets in each previously inspected circle. Consequently, their memory load was expected to increase steadily over time. According to our memory hypothesis, a faster increase in working memory load early in a trial should be associated with more efficient search, as long as the subjects searched in a systematic fashion. Therefore, besides analyzing the correlation between early changes in pupil size and RT, we also examined the subjects’ search strategies in order to obtain baseline data for the effectiveness of our paradigm at estimating the moment-to-moment working memory load.

Subsequently, in Experiment 2, we employed a “standard” visual search task and collected the same type of behavioral and pupil data. We compared a memory and a no-memory condition to investigate the use of working memory during visual search. In the memory trials, the same search stimulus was displayed after each intermittent screen, whereas in the no-memory trials, a different search display was shown after each intermittent screen. The no-memory condition thus did not allow subjects to accumulate task-relevant information about the search items across search intervals. Comparing the correspondence between pupil size and RT between the tasks thus allowed us to study the impact of working memory use on search task performance. The results indicated that working memory plays an important role in this task and that the new paradigm can provide insight into working memory use during various tasks.

Experiment 1

Method

Subjects

Thirteen students at the University of Massachusetts at Boston were recruited for the experiment. All subjects were between the ages of 21 and 35 years old and right-handed, with normal or corrected-to-normal vision. Each subject was compensated $10 for participating in the 1-h experiment.

Apparatus

On a separate computer, eye movements were recorded using an SR Research EyeLink 1000 desktop system with a sampling frequency of 1000 Hz. After calibration, the average calibration error was 0.5°. Stimuli were presented on a 22-in. ViewSonic LCD monitor with a refresh rate of 75 Hz and a screen resolution of 1,024 × 768 pixels. All viewers sat at a distance of approximately 70 cm from the screen in a room with a dim light setup and used a chin rest to stabilize their head. Only the left eye was tracked.

Materials

Sixty stimulus displays were generated using a MATLAB script. Each display consisted of 32 search items, which were Gabor patches with a diameter of 1°. These items were arranged randomly on four circles with radii of 5°, arranged in a 2 × 2 array. To avoid the overlapping and visual crowding of items, we set a distance of at least 3° between the centers of each item pair. Each stimulus was placed at the center of the screen and subtended a visual angle of 26° × 26°. Three to five of the Gabor patches were the designated search targets, indicated by their vertical or horizontal orientation. In each search display, it was randomly determined whether all targets had the same vertical or horizontal orientation. The distractors were oriented randomly with a minimum angular difference of 12° from both the vertical and horizontal orientations. The three, four, or five targets in each display were distributed randomly across the four circles, so that these circles did not necessarily contain the same numbers of targets. The intermittent fixation screen had a gray background of the same luminance as the search screens. Examples of these stimuli are shown in Fig. 1a.

Procedure

After participants had signed consent forms approved by the University of Massachusetts Boston institutional review board (IRB) and read the instructions, a standard 9-point grid calibration and validation of the gaze recording were completed. Participants were instructed to look at the central fixation marker on the pretrial and intermittent screens and search for the targets on each stimulus screen, counting the number of targets in each circle. For each trial, the search display was shown for 1 s and the intermittent screen for 3 s, followed by the same search display for 1 s and another fixation screen for 3 s, and so on. All trials started and ended with a fixation screen shown for 3 s.

Participants were to press the space bar when they had finished counting the targets in each circle. Once the space bar had been pressed or the maximum number of ten search displays had been reached, a new screen was displayed with four options for the potential numbers of targets in the different circles (see Fig. 1b). Participants were asked to choose the only option that showed the correct number of targets in each circle by pressing the corresponding button on a gamepad. For example, if the four circles contained two, one, zero, and one targets, respectively, then the option they should choose is 2, 1, 0, 1, where the order of the circles is described left-to-right and top-to-bottom. Subsequently, the next trial would start. Each subject performed four practice trials, followed by 60 experimental trials that were grouped into six blocks of ten trials. The same set of displays was used for all participants in an individually randomized order.

Results

In this experiment, we analyzed pupil size to investigate changes in working memory load during visual search. Furthermore, we analyzed eye movements to examine the participants’ shifts of attention while they performed the search task. We specifically investigated the interaction between the pupil size and visual scanning strategies to explain differences in search efficiency across participants.

All correct trials in which the participants found the targets within fewer than three search displays were removed from the analysis. Even though we only used the pupil data from the first two fixation and search displays, potential target detections within these screens could have influenced the subjects’ pupil sizes. Although it was almost impossible to complete the task in Experiment 1 within fewer than three search screens, this often occurred in Experiment 2. To ensure comparability of the results between the two experiments, we set the three-display limit for both of them. Furthermore, all trials that had an incorrect response were removed from the analysis, as well. These criteria limited the analysis to approximately 82 % of the trials in the entire experiment. The RT was measured from the time when the first search display appeared—that is, after the initial fixation screen until the time that the subject pressed the space bar, indicating that all targets had been counted. The time after the space bar being pressed and the next screen with the four options being shown was not included in the RT. The mean RT of those trials that were accepted according to the criteria was 25.57 s, with a standard deviation of 7.25 s.

For each subject, the correlation between (a) the difference in mean pupil size between the entire interval of the second intermittent fixation screen and the entire interval of the first intermittent fixation screen and (b) the RT was computed across all trials. Similarly, the correlation was also calculated between the pupil size difference between the second and first search screens and the RT. Pupil size was measured only during fixations, to minimize artifacts due to eye movements, and was taken as the number of pixels that the pupil covered in the EyeLink camera image. No further normalization of this variable was performed because we only investigated pupil size within individual subjects.

It should be noted that in video-based eyetracking the measured pupil size slightly varies with the gaze position, because of the varying angle between the pupil and the eyetracker camera. Although we had used a pupil calibration and normalization method (Pomplun & Sunkara, 2003) for our previously used EyeLink II system, we found that the system used in this study, the EyeLink 1000, provided significantly more robust pupil size measurements. In the present experiments, we found that the standard deviation in the average measured pupil size for fixation positions across the four screen quadrants during search was approximately 20 pixels, or 1.25 % of the mean pupil size. Given the overall variation in pupil size (cf. Fig. 4 below), even a systematic bias of the pupil size data by eye movements could have only a small effect on the results. However, we should be aware that when comparing the results obtained through pupil size measurement during central fixation (fixation screens) and during free viewing (search screens), the latter may contain some additional noise.

The pupil size increase during intermittent screens showed a slightly negative correlation with RT (r = –.085), t(12) = 1.410, p = .006, whereas measurement during the search displays revealed a weak positive correlation (r = .124), t(12) = 2.286, p = .005. This difference in the correlations between the two methods of measurement was significant, t(12) = 2.609, p = .023.

Although we found the hypothesized negative correlation between pupil size increase and RT for the pupil measurements during the fixation screens, it was very weak. One possible explanation for this finding is that some subjects did not perform the task systematically, and therefore did not load their working memory during the initial screens as expected. To examine this possibility, we divided the participants into two groups based on their r value for the fixation screens, with the cutting point set at the mean r value, which was –.085. Therefore, the first group, termed the “strong-correlation” group, consisting of six subjects, had correlations with r < –.085, with an average of r = –.287. The second group, named the “weak-correlation” group, had correlations with r > –.085, with an average of r = .10. We compared the gaze patterns in the search displays between the two groups in an attempt to understand the reason for the difference in correlations.

In the present task, for efficient performance, subjects had to process each circle in a serial pattern because they were asked to report the numbers of targets in the different individual circles. Examining their eye movements allowed us to determine how systematic a subject’s search strategy was. We analyzed the fixations during the first five search displays of each trial. Each fixation was assigned to the circle of Gabor patches whose center had the shortest Euclidean distance from it. Then, for each 1-s presentation of a search display, we determined the circle that received the most fixations and divided that number of fixations by the total number of fixations recorded during that display presentation. If subjects focused their search on one circle during each display presentation, which we assumed would reflect systematic search behavior, this variable would approach a value of 100 %. If, on the other hand, subjects showed erratic search patterns, the value could in the most extreme case fall to 25 %.

Figure 2 shows the separate results for the strong- and weak-correlation groups for each of the first five search displays in each trial. Participants in the weak-correlation group revealed lower values (mean = .67) than those in the high-correlation group (mean = .8), t(11) = 3.16, p = .022, indicating that they tended to fixate on several circles instead of one circle during each search display. This finding suggests that the weak-correlation group performed less systematic search, with fixations being distributed among more than one circle.

To further study the efficiency of observers’ search strategies, we considered that inefficient search performance can be indicated by a substantial proportion of eye movements revisiting one of the stimulus circles. A fixation was considered a revisit if it landed on a circle that had been visited before, with at least one fixation on a different circle occurring between these visits. These fixations were assumed to be made to recheck an already searched circle, which would be indicative of unsystematic search behavior.

As is shown in Fig. 3, the weak-correlation group had to return to search a circle whose items had already been examined almost once at every second and third search display, averaging at 0.69 revisits per display. The fixations for the strong-correlation group showed only 0.28 refixations, which was significantly lower than the value for the weak-correlation group, t(11) = 2.83, p = .053. Interestingly, the difference between the groups was extremely large for the second search display, but progressively diminished until it disappeared for the fifth display. This finding is most likely due to the necessity of systematic search to revisit circles if some of the target numbers determined earlier could not be memorized. Figure 4 shows the correlations for one of the subjects in the strong-correlation group and illustrates the difference that it makes to measure pupil size during the search or during the fixation screens.

Discussion

We hypothesized a negative correlation between pupil size increases during the early intermittent screens and RTs, indicating that a greater working memory load increase led to faster searches. This hypothesis was supported by our pupil size analysis. Pupil dilation during the intermittent screens had a small but significant negative correlation with RT, whereas there was a small positive correlation between pupil dilation and RT during the search displays.

To determine whether our pupillary results were affected by the subjects’ search strategies, we divided the participants into two different groups based on the mean value of the correlation between pupil size in the first two intermittent screens and RT. We compared their gaze patterns during the search displays. Our results indicated that, in general, the group with a strong negative correlation between pupil dilation and RT tended to search the stimulus circles in a systematic sequence, with fewer revisits of previously inspected circles. These results gave further support to our hypothesis that, using the fixation screens, working memory load during the search task can be estimated with useful accuracy. Although a correlation of r = –.29 is still weak, it is important to note that no strong correlation could be expected for these variables. RTs in visual search tasks generally show great variance, because by chance the target object may be the first one inspected, or in the worst case, it could be the last one. Even when using efficient search strategies, the RT difference between the best and worst cases is immense. Even though the need for finding multiple targets in Experiment 1 reduced this variance, we still could not expect to precisely predict RT in individual trials on the basis of pupil size changes.

When measuring pupil size while subjects were actively searching the display, no such negative correlation was found, suggesting that these data do not seem to allow reliable estimates of working memory load. As we noted above, the additional noise in pupil size measurement during free viewing may have contributed, at least to a small extent, to this difference between the fixation and search screens as the basis of measurement. Having obtained these supporting baseline data for our method, we conducted Experiment 2, in which we applied the method to a more “regular” visual search task that did not explicitly demand memory use, except for memorizing the target features.

Experiment 2

The search task in Experiment 2 can be considered a standard visual search task in which the subject had to report the identity of one target among a set of distractors. We introduced two experimental conditions—a memory task and a no-memory task. Each task was presented in separate blocks of trials. In the memory task, after each intermittent screen, the same search stimulus reappeared. Therefore, subjects could make use of any information that they had memorized from previous screens, such as the set of already-inspected items. In the no-memory task, a new search stimulus appeared after every intermittent fixation screen. Consequently, subjects did not need to memorize any information across search displays within the same trial, because this would be useless in the next search display.

In the memory task, we hypothesized that the loading of working memory was an integral part of performing the search task efficiently, as only a single search display was presented in each trial. As in Experiment 1, we expected this role of working memory to be revealed by a negative correlation between the pupil size increase between the first and second fixation screens and RT. The no-memory task served as a control condition in which no increasing memory load or its correlation with RT were expected. Once again, pupil measurement during the fixation screens was contrasted with pupil measurement during the search screens, to obtain further evidence for the suitability of our working memory load estimation paradigm.