On-item fixations during serial encoding do not affect spatial working memory

Czoschke, Stefan; Henschke, Sebastian; Lange, Elke B.

doi:10.3758/s13414-019-01786-5

On-item fixations during serial encoding do not affect spatial working memory

Open access
Published: 28 June 2019

Volume 81, pages 2766–2787, (2019)
Cite this article

Download PDF

You have full access to this open access article

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

On-item fixations during serial encoding do not affect spatial working memory

Download PDF

3134 Accesses
5 Citations
5 Altmetric
Explore all metrics

Abstract

Ample evidence suggests that there is overlap between the eye-movement system and spatial working memory. Such overlapping structures or capacities may result in interference on the one hand and beneficial support on the other. We investigated eye-movement control during encoding of verbal or spatial information, keeping the display the same between tasks. Saccades to to-be-encoded items were scarce during spatial encoding in comparison with verbal encoding. However, despite replicating this difference across different tasks (serial, free recall) and presentation modalities (simultaneous, sequential presentation), we found no relation between item fixations and memory performance—that is, no costs or benefits. Inducing a change from covert to overt encoding did not affect spatial memory performance as well. In contrast, regressive fixations on prior items, that were no longer on the screen, were associated with increased spatial memory performance. Regressions occurred mainly at the end of the encoding period and were targeted at the first presented item. Our results suggest a dissociation between two types of fixations that accompany serial spatial memory: On-item fixations are epiphenomenal; regressions indicate rehearsal or output preparation.

Eye-movements reveal the serial position of the attended item in verbal working memory

Article 28 September 2021

Refixation patterns reveal memory-encoding strategies in free viewing

Article Open access 01 May 2019

The effects of task-relevant saccadic eye movements performed during the encoding of a serial sequence on visuospatial memory performance

Article 01 March 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Eye-movements have been demonstrated to be highly related to visuospatial attention allocation (Chelazzi et al., 1995; Deubel & Schneider, 1996; Kowler, Anderson, Dosher, & Blaser, 1995; Shepherd, Findlay, & Hockey, 1986, see also the premotor theory of attention, e.g., D. T. Smith & Schenck, 2012). Just before saccadic eye movements are executed, attention is already shifted toward the saccadic goal (e.g., Deubel & Schneider, 1996). Furthermore, the eye-movement system is assumed to play a specific role for maintenance in visuospatial working memory (Baddeley, 1986; Belopolsky & Theeuwes, 2009a, 2009b; Morey, Mareva, Lelonkiewicz, & Chevalier, 2017; Pearson, Ball, & Smith, 2014; Postle, Idzikowski, Della-Sala, Logie, & Baddeley, 2006; Schut, Van der Stoep, Postma, & Van der Stigchel, 2017; Theeuwes, Olivers, & Chizk, 2005; Theeuwes, Van der Stigchel, & Olivers, 2006; Tremblay, Saint-Aubin, & Jalbert, 2006), as well as the other way around (Van der Stigchel & Hollingworth, 2018). That is, the constructs of visuospatial attention and visuospatial working memory are highly related to the eye-movement systems. In our study, we are particularly interested in the question of how eye-movement control is applied during the encoding of serial spatial information for memory recall.

A plethora of studies have been conducted to investigate the relation between eye-movement control and visuospatial working memory, and we will summarize the key results, reporting evidence for saccadic interference as well as rehearsal benefits. Because of the memory component, one important experimental set-up is to use a delayed-recall design. An overlap of structures predicts that saccades to nonmemorized locations result in interference with memory representations, whereas saccades to to-be-remembered locations should benefit performance, as they might qualify as rehearsal. In fact, research on the effect of eye movements to nonmemorized positions in the retention interval has presented evidence for saccadic interference with spatial memory (Hale, Myerson, Rhee, Weiss, & Abrams, 1996; Postle et al., 2006) and memory for shapes (Schut et al., 2017). Interestingly, this effect of interference is over and above the deleterious impact of covert spatial attention shifts (Lawrence, Myerson, & Abrams, 2004; Pearson & Sahraie, 2003). This suggests that the eye-movement system contributes a unique part of interference to spatial memory maintenance. In agreement with this interpretation, activated memory representations in the retention interval (RI) can also alter eye-movement control. Specifically, saccade trajectories deviated away from memorized locations (Theeuwes et al., 2005; Theeuwes et al., 2006) and saccadic latencies into the hemifield, in which a location had to be remembered, increased (Belopolsky & Theeuwes, 2009b). Given these results, it seemed unlikely, that the saccade system is used for rehearsal. Indeed, a study investigating eye movements in the delay period found rather low oculomotor activity (e.g., Pearson & Sahraie, 2003, Experiment 5). However, there is some evidence for eye movements as supportive mechanisms for rehearsal. For example, fixating target positions in the retention interval of a spatial serial-recall task showed a beneficial effect for fixation sequences that matched the serial presentation of items: memory performance increased (Tremblay et al., 2006). By manipulating open rehearsal activity experimentally via instructions, Godijn and Theeuwes (2012) demonstrated a benefit for saccades to the first three to-be-recalled locations in comparison with the control condition. Overtly targeting the last three digit positions, however, impaired performance for most of the accessed as well as not-accessed item positions, but produced no benefit for any. This result points toward a specific connection between eye movements and serial order. Comparing free viewing with a condition, where subjects were allowed to fixate one self-chosen position, revealed no difference between the conditions, even though the number of fixations differed dramatically (free: 14 vs. fixation: 1). That is, saccadic activity in general neither boosted nor impaired memory representations. One solution for the divergent results might be individual differences in preferences of saccadic control (Laeng & Teodorescu, 2002; Ridgeway, 2006). Some participants might choose more, and others less, saccadic activity.

Further evidence for oculomotor support of memory maintenance comes from studies outside the serial recall literature. For example, in the “looking-at-nothing” paradigm, it has been demonstrated that fixations on a blank position on a screen cue information associated with this position (Ferreira, Apel, & Henderson, 2008; Johansson & Johansson, 2014). In addition, fixation pattern in a stimulus-free delay period of a spatial change-detection task showed high similarity with eye movements during encoding (Olsen, Chiew, Buchsbaum, & Ryan, 2014), particularly for increased task difficulty (Wynn, Olsen, Binns, Buchsbaum, & Ryan, 2018). This indicates that eye movements during the retention interval might reanact encoding behavior. A match between reenactment and encoding behavior might support memory recall (Laeng & Teodorescu, 2002). However, there is also evidence against the reenactment/reinstatement hypothesis (Foulsham & Kingstone, 2013; Johansson, Holsanova, Dewhurst, & Holmqvist, 2012).

Important progress in understanding the role of eye-movement control in visuospatial working memory has been derived by studies using the eye abduction paradigm (Ball, Pearson, & Smith, 2013; Pearson et al., 2014), which allows disentangling the effects of oculomotor control, eye movements, and attention in a very controlled way. The abduction paradigm revealed selective impairment for spatial memory maintenance when stimuli are presented outside the oculomotor range (Pearson et al., 2014), but not for other visual features (Ball et al., 2013). Interestingly, using the eye abduction paradigm, it has also been demonstrated that the oculomotor control system plays an important role during the encoding phase of a spatial memory-span task as well (Pearson et al., 2014). However, the role of effective saccadic movements during the presentation phase of serial spatial memory tasks is still under debate (Lange & Engbert, 2013; Morey et al., 2017; Patt et al., 2014; Saint-Aubin, Tremblay, & Jalbert, 2007). Free viewing in a spatial serial recall task showed rather low numbers of fixations on to-be-remembered item locations (Lange & Engbert, 2013; Patt et al., 2014) as well as small saccadic amplitudes and long saccadic reaction times, whereas fixation probabilities on to-be-remembered verbal items were high (Lange & Engbert, 2013). Results are indicative of the active suppression of saccades during spatial and unimpeded execution during verbal memory encoding. Studies using distractor designs converge on those findings. Irrelevant concurrent saccades during encoding of a spatial memory task decreased spatial memory performance (Guérard & Tremblay, 2011; Guérard, Tremblay, & Saint-Aubin, 2009; Lange, Starzynski, & Engbert, 2012; Postle et al., 2006). This was also true when saccades were generated in a reflexive manner (Lange et al., 2012; Lawrence, Myerson, Oonk, & Abrams, 2001), or without visually presented saccadic goals (Postle et al., 2006), but not during postrotational nystagmus (Postle et al., 2006), which causes involuntary eye movements. Results strongly suggests that the loci of distractor interferences are processes involved in eye-movement control (Postle et al., 2006), not movements per se, similar to conclusions from the eye abduction paradigm. However, when manipulating eye-movement control by instructions, results are less clear. When participants had to trace upcoming stimuli, memory for spatial as well as verbal serial recall was impaired (Lange & Engbert, 2013), indicating general dual-task costs by forced-viewing instructions. On the contrary, Saint-Aubin et al. (2007), using a similar procedure, found a beneficial effect for forced item tracing compared with free viewing. This points to a general problem with forced viewing instructions: The affordances of the task might enforce adaptive behavior. In addition, costs based on the dependent variable (e.g., eye-movement control) are difficult to separate from dual-task costs.

To sum up, there appears to be overlap of the oculomotor system and spatial working memory. Investigating this connection has developed in two branches of research: The role of eye movements in spatial encoding and the role of eye movements in spatial memory maintenance. However, both are not independent, as is obvious for sequential encoding paradigms. The sequential encoding over a series of several items is not merely a matter of item encoding. Increasing serial positions requires subjects to encode upcoming items while simultaneously maintain an increasing number of prior items in memory within a common time frame. Therefore, overt fixation behavior in the encoding phase cannot be interpreted exclusively in terms of encoding demands or strategies, but interference might contribute to saccadic control as well as overt rehearsal processes, counteracting forgetting. To our knowledge, there is no study that has investigated these supposedly conflicting processes during the encoding phase of a spatial memory task.

Experiment 1

Evidence on the role of on-item fixations during presentation in a spatial serial-recall task does not converge toward a common conclusion. Fixations might be beneficial (e.g., Saint-Aubin et al., 2007), saccades might be suppressed because of interference (Lange & Engbert, 2013; Patt et al., 2014), or eye behavior might be optimized to fit individual strategies (e.g., Laeng & Teodorescu, 2002). In addition, regressions after stimulus presentation have been investigated in the retention interval only (Godijn & Theeuwes, 2012; Morey et al., 2017; Tremblay et al., 2006). The potentially beneficial effect of rehearsal during the memory encoding phase has not been investigated so far in serial recall paradigms. We study both types of fixations (during and after presentation) separately as well as individual differences in eye-movement control.

We decided on a comparative design, in which features of visually presented stimuli had to be encoded into either the verbal or spatial domain. On each trial, participants saw a series of five spatially distinct bigrams and had to recall either the verbal content, the spatial positions, or both features. Importantly, participants were free to move their eyes, allowing us to measure natural viewing behavior. We chose two approaches to understand the behavioral consequences of eye movements. First, we related fixations toward on-screen items as well as fixations on previous item positions (i.e., regressions^{Footnote 1}) to memory performance in a correlative, observational account. It is currently unknown whether low-fixation tendencies during serial spatial encoding (e.g., Lange & Engbert, 2013) reflect systematic avoidance of item-targeting saccades, and second, whether regressions that are carried out during the encoding episode reflect maintenance processes. Both behaviors can be interpreted as strategic when they clearly improve memory performance. Second, based on our earlier study (Lange & Engbert, 2013), we expected low-fixation probabilities on bigrams in the spatial memory condition, but high-fixation probabilities in the verbal memory condition. To investigate whether these diverging oculomotoric behaviors are based on task-specific affordances, we added a critical third condition, in which subjects memorized both the verbal content as well as the spatial position of the stimulus (combined condition). We reasoned that, having to encode two different materials that, in isolation, elicit different preferred oculomotor behavior, introduces a conflict (i.e., making saccades toward the items for verbal encoding versus suppressing saccades toward the items for spatial encoding/maintenance). Importantly, memory accuracy can be analyzed separately for verbal and spatial performance in this combined condition, which will uncover how a change in fixation probabilities between the single tasks and the combined condition will affect memory accuracy. If participants apply a strategy with high-fixation probability (as in the verbal single task), and fixations are detrimental to spatial encoding, then the performance in the spatial task will dramatically decrease in relation to the spatial single-task condition. Alternatively, if participants choose a low-fixation strategy (as in the spatial single task), and if this strategy benefits spatial encoding but hinders verbal encoding, impairment of memory performance will be particularly strong for verbal recall in comparison with the verbal single task.

Method

Participants

Thirty adults (20 females; ages 17–37 years; M = 24.13 years, SD = 4.35) participated in the experiment after giving written informed consent. All participants had normal or corrected-to-normal visual acuity. They were naïve to the purpose of the experiment and were paid for their participation (€10/hour). The experimental session lasted about 60 min.

Apparatus

Stimuli were presented on a 24-in. monitor (resolution: 1,920 × 1,080 pixels, refresh rate: 144 Hz). The experimental procedure was controlled by Python 2.5 and PsychoPy 1.8. We tracked the right eye with a sampling rate of 1000 Hz (EyeLink 1000, SR Research). A forehead and chin rest reduced head movements and was located 60 cm in front of the monitor. The experiment took place in a sound-attenuated booth, with the experimenter placed outside the booth but connected via an intercom.

Materials

Memory lists were composed of five bigrams. Items were constructed from two distinct letter pools. The first letter of each bigram was randomly drawn from [B, C, G, L, R, V], the second letter from [A, E, I, O, U, T] without replacement. The letter T was included in this second pool, to increase task difficulty, which had been pretested by a few pilot participants. This was important because we aimed at comparable task performances for the verbal and spatial task while keeping the list length equal (as this is required for the combined condition). The font color of the bigrams (letter height: 1° visual angle, bigram length: 1.5°) was white on a gray background (RGB: 128, 128, 128). Stimuli were shown on an isoeccentric, light-gray ring (RGB: 170, 170, 170) with a radius of 8° of visual angle (see Fig. 1). Item positions were randomly sampled on the circle without replacement from 20 equidistant positions (separated by 18 angular degrees on the circle or 2.5° of visual angle, and rotated by 7 angular degrees to avoid cardinal positions).

Design

Conditions (verbal, spatial, combined recall) were blocked, with two blocks per condition (six blocks in total). Serial order of conditions was balanced across participants. The first three blocks and the second three blocks comprised each condition once, respectively. Each block comprised 15 trials, with the first two trials being practice trials and excluded from data analysis, resulting in 26 trials per condition in total.

Procedure

The session started with a standard 9-point calibration of the EyeLink software. Participants initiated each trial by pressing the space bar (see Fig. 1 for a trial sequence). Each trial began with a fixation check, lasting 800 ms, which failed when the fixations deviated more than 1° visual angle from the centrally presented fixation cross. Calibration was repeated, when the fixation check failed twice or at the latest after five trials. Upon successful fixation check, the first item occurred. Each item remained on the screen for 1,000 ms, followed by the onset of the next item (see Fig. 1 for a trial sequence and the different recall procedures). After the fifth item, the recall display occurred without delay. Participants were instructed to report the items in order of presentation and to guess in case they did not remember; correction of a given answer was not possible. In the verbal task, recall was achieved by entering the bigrams via keyboard. In the spatial task, recall was achieved by moving the mouse pointer to the remembered positions and confirming each position by mouse click. In the combined task, recall was achieved by mouse-clicking on the remembered position and entering the respective bigram. When keeping list length equal, spatial recall usually results in lower task performance than verbal recall. We decided on first spatial then verbal recall, to motivate participants to keep track of the spatial task and not to ignore it.

Data treatment

Categorization of saccades

Eye-movement data were categorized into saccades and fixational eye movements, using the velocity-based algorithm from Engbert and Mergenthaler (2006; in our study Lambda = 10). Saccades with amplitudes shorter than 0.7° visual angle, or with a duration less than 10 ms, were ignored. The algorithm detected 24,373 saccades for all participants and trials, with 812 saccades per subject on average (range: 477–1,394).

Fixations

The time interval between the end of one saccade and the start of the next was defined as fixation. Note that this term is a simplification, as during these intervals eyes are still moving on a smaller scale (Engbert, 2006). We computed the position of the eyes during fixations by calculating the median of the x and y coordinates. Visualization of the gaze within a trial convinced us that the median was preferable over the mean, as outlier positions related to blink and noise made the mean measurement noisy. An item was defined as fixated if the median position during the fixational movement was located within a radius of 2° of visual angle from item center. Fixation probabilities express on how many instances item fixations occurred at all. We use the term for fixations on items as long as they are visible on the screen, in comparison with regression probabilities.

Regressions

We defined a fixation as regression when the fixation matched the position of an item that was presented earlier in the trial sequence than the current item. Note that earlier items were no longer visible on the screen; hence, regressions were memory based. Regression probability calculates how often at least one regression was made (instead of calculating how many regressions were made on average).

Performance accuracy

We followed a strict serial recall criteria—that is, items had to be recalled in presentation order. Bigrams were regarded correct if both letters were correctly entered. Spatial positions were regarded correct if the reported position deviated less than 2° of visual angle from the center of the correct item.

Data analysis

All reported analyses of variance (ANOVAs) and t tests were based on a repeated-measures design. An alpha level of .05 (two-tailed) was set for all frequentist statistical tests. However, to evaluate our data in terms of evidence for the null hypothesis, we added Bayes factors (BF₁₀), that quantify the likelihood of the alternative hypothesis (H1) relative to the null hypothesis (H0), given the data. Thus, technically, a BF₁₀ > 1 indicates evidence in favor of the H1, whereas a BF₁₀ < 1 supports the H0. So, for example, a BF₁₀ = 3.50 indicates that the data are 3.5 times more likely under the alternative hypothesis than under the null hypothesis. In accordance with Kass and Raftery (1995), we consider BFs between 1/3 and 3 as inconclusive evidence. Consequently, we treat BF₁₀ > 3 as support for the H1, and BF₁₀ < 0.33 as support for H0. For Bayesian ANOVAs (Rouder, Morey, Speckman, & Province, 2012), we report only the BF₁₀ of the best model (i.e., the factor combination with the strongest evidence against the null model that includes only between-subjects variance), except if the evidence in favor of the best model compared with another predictor combination was weak (BF₁₀ of the model comparison < 3). All data analyses were conducted with the statistics software JASP (Version 0.8.6.0; JASP Team, 2018) and the default settings of the Bayes Factor package (Morey & Rouder, 2015); that is, Bayesian ANOVAs were computed with a multivariate Cauchy prior with a fixed-effects scale factor of r = .5, and a random effects scale factor of r = 1. Bayesian paired t tests (Rouder, Speckman, Sun, Morey, & Iverson, 2009) were computed with a Cauchy prior, with a width of r = .707. Priors were centered on zero.

Results

The Results section is structured by three questions: (1) Do fixations on to-be-remembered items during their presentation affect memory performance—in particular, do they impair spatial memory? (2) Do regressions affect memory performance—in particular, do they improve spatial memory? (3) Do eye-movements strategies reflect general processes or rather individual differences in behavior?

On-item fixations and their relation to spatial memory encoding

Fixation probabilities

Fixation probabilities on memory items during encoding were very high for the verbal-recall task, with only a slight decrease from the first (M = 98.21%, SD = 3.60) to the fifth (M = 93.08%, SD = 10.52) serial position (see Fig. 2a). For the spatial-recall task, fixation probabilities were markedly lower, with a strong, almost linear decline from the first (M = 79.74%, SD = 25.03) to the fifth (M = 47.05%, SD = 29.70) serial position.^{Footnote 2} Crucially, the fixation pattern for the combined condition strongly converged toward the verbal condition (with M = 99.23%, SD = 1.56 for the first, to M = 89.23%, SD = 10.80 at the fifth serial position), and clearly differed from the observed fixation behavior in the spatial condition. Note that if participants chose to switch constantly between the fixation behavior of single verbal and single spatial encoding, serial position function in the combined task would be exactly placed in between the other two task’s functions. Note, too, that in the combined condition, spatial recall always preceded verbal recall, making it unlikely that the procedure itself biased participants toward concentrating on the verbal task.

The visual inspection of Fig. 2a was backed up by a two-factor ANOVA, with a significant main effect of condition (verbal, spatial, combined), F(1.03, 29.78) = 41.04, p < .001, η² = .59 (Greenhouse–Geisser corrected), a main effect of serial position, F(2.51, 72.75) = 40.93, p < .001, η² = .59 (Greenhouse–Geisser corrected), and an interaction, F(2.98, 86.50) = 17.94, p < .001, η² = .38 (Greenhouse–Geisser corrected). Accordingly, the best model contained both factors and the interaction, BF₁₀ = 8.47× 10⁶⁴. Given the clear pattern of results, we want to report here one more detail only: When fitting verbal and combined in the two-factor ANOVA, the main effect of condition was significant, F(1, 29) = 6.35, p = .017, η² = .18, but the interaction not, F(1.91, 55.36) = 2.61, p = .085, η² = .08. Accordingly, the best model contained both factors, but no interaction (BF₁₀ = 3.64 × 10⁹).

To clarify the quality of on-item saccadic suppression, we evaluated saccade numbers post hoc. In agreement with what has been shown by Lange and Engbert (2013), mean number of saccades did not differ for the verbal and spatial single tasks in Experiment 1 (F < 1). As the item fixation probability was lower in the spatial condition, but total saccade number was comparable between spatial and verbal, there was an increased investment of saccades onto nonitem positions and not a general inhibition of saccades.

Performance accuracy

Accuracy (see Fig. 2b) was highest in single verbal recall (M = 74.21%, SD = 20.41) followed closely by single spatial recall (M = 69.44%, SD = 11.36), and decreased performances in the dual-task situation (combined condition, denoted as subscript c, single as subscript s) for verbal_c (M = 63.23%, SD = 16.43), spatial_c recall (M = 52.31%, SD = 12.71). The three-factor ANOVA resulted in main effects of task domain (verbal, spatial), F(1, 29) = 7.47, p = .011, η² = .21, task condition (single, combined), F(1, 29) = 81.44, p < .001, η² = .74, and serial position, F(2.67, 77.54) = 199.21, p < .001, η² = .87 (Greenhouse–Geisser corrected), and significant two-way interactions of task condition × serial position, F(4, 116) = 20.17, p < .001, η² = .41, and task condition × task domain, F(1, 29) = 6.38, p = .017, η² = .18. The two-way interaction of task domain × serial position and the three-way interaction were nonsignificant (both Fs < 1). Accordingly, the best model contained all three factors and the interactions of task condition × serial position and task condition × task domain (BF₁₀ = 9.68 × 10⁹⁹).

Relation between fixation probability and performance accurac

First, we tested whether item fixations harm spatial memory by comparing the accuracy of spatial recall for fixated and nonfixated items in the single task situation only (low number of cases in the other conditions). However, since serial position is correlated with fixation probability and accuracy, we first calculated the mean accuracy score for each subject and serial position separately (including only serial positions that had values for both cases—fixation and nonfixation) and then averaged across all serial positions to ensure that each position had the same weight within the calculation. A paired t test revealed no significant difference of accuracy between fixated (M = 66.63%, SD = 14.84) and nonfixated items (M = 65.56%, SD = 18.54), t(29) = 0.42, p = .676, d = 0.08, BF₁₀ = 0.21.

Second, we reasoned that, if item fixations were detrimental to the maintenance of spatial memoranda, the accuracy for spatial recall in the combined condition should be modulated by the degree of deviation from the ideal encoding strategy for spatial material (applied in the single task). That is, the more participants deviate in the combined task from their fixation behavior under mere spatial encoding demands, the greater the relative performance decrement should be for spatial memoranda in the combined condition. To test this prediction, we correlated the difference in fixation probability between spatial_s and the combined condition for each subject with the difference of accuracy between spatial_s and spatial_c. The correlation, however, did not show any relationship between change in fixation probability and spatial recall accuracy, r = −0.165, p = .383, BF₁₀ = 0.33. In other words, the change in item fixation behavior between the spatial_s and combined condition had no systematic influence on the spatial recall performance in the combined task. When participants were motivated to change their strategy from low-fixation probabilities (spatial_s) to high-fixation probabilities (combined), memory for spatial information did not decrease in the combined condition. To sum, our manipulation to increase fixation probabilities during spatial encoding worked out. Importantly, the change in fixation behavior did not result in a systematic impairment for spatial serial recall.

Regressions and the relation to spatial memory encoding

Regression probabilities

Regression probabilities were similar in the spatial_s and combined conditions and more pronounced than in the verbal_s condition (see Fig. 3a). This interpretation was statistically supported by two related ANOVAs. The two-factor ANOVA resulted in significant main effects for condition (verbal, spatial, combined), F(2, 58) = 31.56, p < .001, η² = .52, serial position, F(1.7, 49.314) = 42.13, p < .001, η² = .59 (Greenhouse–Geisser corrected), and a significant interaction, F(3.49, 101.09) = 15.37, p < .001, η² = .35 (Greenhouse–Geisser corrected). Accordingly, the best model included both factors and the interaction (BF₁₀ = 2.08 × 10⁴¹). The main effect of condition and the interaction, however, were driven by the single verbal condition. When comparing regression probabilities for spatial_s and combined, there was no main effect of condition, F(1, 29) = 1.24, p = .275, η² = .04, and no interaction, F(2, 57.89) = 1.85, p = .166, η² = .06 (Greenhouse–Geisser corrected). Accordingly, the best model included only the factor serial position (BF₁₀ = 8.28 × 10²⁴).

As a complement, Fig. 3b depicts regression targets during presentation of the fifth item, which showed highest regression probabilities in Fig. 3a. Interestingly, there was a marked preference for regressions onto the first serial position. If regressions targeted accidentally on prior item locations, the frequency distribution should be uniform across all prior item positions. Note that such a preference for overtly revisiting the first serial position has been reported before for regressions during retention intervals (Godijn & Theeuwes, 2012) as well as during the encoding sequence (Lange & Engbert, 2013). To back up the interpretation of Fig. 3b, we calculated three one-factorial repeated-measures ANOVAs. The likelihood of being a regression target differed significantly between the serial positions in the spatial, F(1.92,53.66) = 21.32, p < .001, η² = .43, BF₁₀ = 7.34 × 10¹⁰ (Greenhouse–Geisser corrected), and the combined condition, F(1.85,53.54) = 28.56, p < .001, η² = .50, BF₁₀ = 1.58 × 10¹⁴ (Greenhouse–Geisser corrected), but there was no main effect of serial position in the verbal condition, F(3,48) = 1.61, p = .201, η² = .09, BF₁₀ = 0.68. However, evidence in favor of the H0 was very weak for the verbal condition. With a BF₁₀ of 0.68, H0 is only 1.5 times (1/0.68) more likely than the H1, in our sample.

Relation between regressions and performance accuracy

Analogous to the analysis of fixation probabilities, we calculated the mean accuracy score for each subject and serial position separately (including only serial positions that had values for both cases) and then averaged across all serial positions to ensure that each position had the same weight within the calculation. Importantly, paired t tests revealed a significant performance benefit for regression targets (M = 81.63%, SD = 21.11) in comparison with items that did not become regression targets in the progression of a trial (M = 74.71%, SD = 13.43) in spatial_s, t(28) = 2.27, p = .031. However, evidence for this performance benefit was very weak and rather inconclusive (BF₁₀ = 1.78). There was no such performance difference for spatial_c, t(29) = 0.80, p = .432, BF₁₀ = 0.26. In addition, regressing onto the prior item’s position neither affected memory performance in verbal_s, t(16) = 1.03, p = .320, BF₁₀ = 0.39, nor in verbal_c, t(29) = 0.27, p = .790, BF₁₀ = 0.20, as expected.

Results indicate that regressions might be useful for remembering items that were regression targets (mainly the first item in the series). That is, in the single spatial task, there was some weak evidence for improved recall of regression targets in comparison with other items. However, verbal performance did not benefit from regressions at all, and neither did spatial memory in the combined condition, as expressed in the Bayesian analyses that favored the null hypothesis for these conditions.

Individual differences in oculomotor behavior during spatial memory encoding

Figure 4a presents individual data of fixation probabilities for spatial encoding. As can be seen, there was a huge variability of individual encoding strategies, also depicted by the rather large error bars for spatial fixation probabilities in Fig. 2a. Interestingly, a subgroup of participants showed very high, others low, fixation probabilities across all serial positions. Participants used very different encoding strategies: from more overt to rather covert attention allocation. We can now ask further, whether low-fixation probabilities are related to other indicators of suppression, like higher saccadic latencies and smaller saccadic amplitudes (e.g., Ro, Pratt, & Rafal, 2000; Theeuwes et al., 2006). Indeed, Fig. 4b–c depict these correlations for spatial_s, showing very systematic effects (see Supplementary Materials demonstrating no such systematic effects for the verbal and combined conditions). Figure 4d shows that suppressive eye movement behavior was not related to memory accuracy at all (also demonstrated earlier by relating accuracies with eye-movement behavior). That is, on the level of participants, systematic and strong differences in saccadic suppression occurred, but individual oculomotor activity did not result in accuracy differences.

Discussion

We demonstrated saccadic suppression during spatial in comparison with verbal memory encoding, replicating earlier studies (Lange & Engbert, 2013; Patt et al., 2014). In addition, we observed a strong tendency for regressions under spatial encoding conditions at the final list position, which were mostly directed to the first location in the series. Whereas there was no evidence at all for saccadic suppression (to on-screen items) to be functional, we found weak evidence concerning the functional role of regressions (to items presented earlier in the series) in spatial memory maintenance. Importantly, we included also a condition, in which both contents (bigrams and their positions) had to be recalled. This manipulation was expected to result in high-fixation probabilities, and thereby would pose a challenge on spatial encoding, if on-item fixations during presentation were detrimental. In addition, regression probabilities in the combined task should match the spatial single task. Indeed, high-fixation and regression probabilities occurred in the combined task. But whereas regression probabilities related to spatial memory performance as predicted, fixation probabilities did not. Results on individual mean fixation probabilities suggest that participants differed on eye-movement control during encoding. Some used more overt visual attention allocation, and others applied systematic suppression of saccades. These specific behaviors did not relate to memory performance in general and also not to an individual optimization of encoding processes. Saccadic suppression does not indicate interference during spatial encoding, but it reflects differential applications of overt or covert attention allocation.

Regressions were mainly placed onto the first list item. Our results on regression targets converge nicely with existing WM models, suggesting that order information arises from coding the serial position of an item relative to the start of the memory list (e.g., Henson, 1998). The results are also compatible with the assumption, that encoding strength should be highest for the first item in a list, (e.g., Brown, Neath, & Chater, 2007; Lewandowsky & Murdock, 1989; Page & Norris, 1998), and the gaze is either supporting this or driven by this. Even though research on articulatory rehearsal processes indicates that the beginning of a list is rehearsed (e.g., Tan & Ward, 2008), we found high regression probabilities only in spatial but not in verbal recall. This points to a specific association between regressions and spatial memory. Regressions indicate maintenance processes, as the regression target, by definition, is no longer present on the screen and, consequently, new information about the item cannot be sampled. However, there was a sharp increase of regressions during presentation of the last item. This increase additionally indicates output preparation, as after the last item, presentation recall started immediately.

Arguably, the present dissociation for fixations and regression in two task domains (verbal, spatial) calls for the statistical test of an interaction—for example, a 2 (item fixated or not) × 2 (item regressed or not) ANOVA. However, the low number of cases for regressions as well as for nonfixated items (particularly in the verbal task) provide the difficulty to estimate sensible means to fit into an ANOVA. In addition, such an analysis should ideally include serial position as a confounding factor, which potentiates the problem of a low number of cases.

The serial position curves for memory accuracies are interesting for several reasons. First, the shape of the serial position curves matched for verbal_s and spatial_s serial recall, but differed only in intercept. This result is in line with the assumption that serial order memory is based on domain-general processes (Ward, Avons, & Melling, 2005). Second, the combined task situation had a detrimental effect on the slope of the serial position curve, which was third, again similar for verbal_c and spatial_c serial recall. The steepened slope eventually mirrors the fact that, due to the dual-task situation, encoding strength (e.g., Page & Norris, 1998) was diminished in the dual-task situation, or maintenance processes were hampered (e.g., rehearsal; Page & Norris, 1998), or there was less time for maintenance (e.g., refreshing; Barrouillet, Bernardin, & Camos, 2004), or recall for individual items was increasingly delayed (e.g., Brown et al., 2007). Whatever mechanisms were at play, the parallel slopes indicate that those mechanisms were likely domain-general and contributed to both features (verbal and spatial) to the same extend. Interestingly, parallel slopes make it unlikely that the high-fixation probability in the combined task hampered proper spatial memory encoding in a domain-specific way.

It is important to note that the memory decrements that we observed in the combined condition are well explained by the increased effort to encode two features per serial position (Langerock, Vergauwe, & Barrouillet, 2014). And it has been shown previously that spatial memory suffers more than verbal when combining both (Morey & Miron, 2016). Thus, our result of a stronger decrease for spatial in comparison with verbal recall by the combined condition does not reflect a stronger effect of the change in fixation behavior for spatial in comparison with verbal. The crucial analysis has to directly relate the spatial performance decrease to a change in eye-movement control. This is exactly what we did. We demonstrated a huge range of interindividual different eye-movement strategies during encoding. We compared fixation behavior between the spatial single and the spatial combined task. The more suppression of saccadic activity had to be reduced (from single to combined), the stronger memory impairment should be. However, this was not the case. There was no relation between a change in eye-movement behavior (from spatial single to combined) and a change on spatial memory performance.

Whereas fixations on to-be-encoded items during their presentation do not interfere with spatial memory and also do not play a functional role for spatial memory encoding, regressive eye movement acts as a maintenance process to support the beginning of the memory list, particularly.

Experiment 2

The role of eye movements for memory encoding and maintenance has been discussed in a variety of task designs. We now ask whether our findings from Experiment 1 are specific for tasks including serial-order memory. To do so, we compared five different settings: Serial, free, and cued recall for sequential presentation, and serial and free recall for simultaneous presentation of the stimuli. Free and cued recall do not require participants to encode serial order. There is evidence showing that eye-movement behavior might be particularly related to serial-order memory (Tremblay et al., 2006). Whereas free recall with serial presentation usually shows some encoding in presentation order (Bhatarah, Ward, & Tan, 2008; Cortis, Dent, Kennett, & Ward, 2015; Grenfell-Essam, Ward, & Tan, 2017; Howard & Kahana, 1999), in cued (verbal) recall, items are less likely rehearsed in series (e.g., Henson, Hartley, Burgess, Hitch, & Flude, 2003). Serial position curves, indicative for serial order, have been demonstrated to differ for all three tasks (e.g., Murdock, 1968a, 1968b). But we included cued recall for another reason: In our task, the cue of which feature (verbal or spatial) had to be recalled was given after list presentation by the other then the to-be-recalled feature (spatial or verbal). This makes the task similar to our combined condition in Experiment 1, as both features had to be encoded. We expected fixation probabilities to be again very high, because spatial as well as verbal information had to be encoded for later cued recall. This would enable us to repeat the key analyses from Experiment 1, aiming at replication. The planned change in oculomotor behavior from low-fixation (free or serial recall) to high-fixation probabilities (cued spatial recall) should not relate to performance differences, indicating again that fixation behavior has no negative or positive consequences for memory encoding. In addition, regression probabilities should be higher in spatial than verbal tasks, and they should again be functional for serial recall. To the extent that regressions indicate maintenance in serial-order memory, regressions should be low in cued recall. Accordingly, the beneficial effect of regressions should replicate for serial recall and be attenuated in free recall. Low-regression probabilities in cued recall might forestall relating their occurrence to performance.

We included two more conditions with simultaneous presentation: free and cued recall. During sequential presentation, each upcoming item attracts attention (e.g., Yantis & Jonides, 1984) and the gaze due to singleton pop out (e.g., Kramer, Hahn, Irwin, & Theeuwes, 1999; Theeuwes, Kramer, Hahn, Irwin, & Zelinsky, 1999). A suppression effect might then be shadowed because oculomotor control might be driven by gaze capture, or boosted by inhibition of return (IOR), similarly to attentional capture (Fecteau & Munoz, 2006). In these cases, the simultaneous presentation might qualify as a baseline. If suppression is characteristic for spatial encoding in general, this effect should appear and even be more pronounced in the simultaneous presentation. The expected high-fixation probabilities during cued but not free spatial recall will again provide the possibility to replicate the nonexisting relation between a change of fixation behavior and recall accuracy.