The goal of this article is to provide an expertise account of two spatial effects: the spatial–numerical association of response codes (SNARC) effect observed in numerical cognition (Dehaene, Bossini, & Giraux, 1993), and the spatial–positional association of response codes (SPoARC) effect observed in memory research (van Dijck & Fias, 2011). This expertise account builds upon experimental and theoretical knowledge acquired in the field of expert memory (e.g., Campitelli, 2015; Ericsson & Kintsch, 1995; Gobet & Simon, 1996; Guida, Gobet, Tardieu, & Nicolas, 2012). More specifically, we will employ the concept of “knowledge structures,” a generalization of the concepts of “chunks,” “retrieval structures,” and “templates,” which have been extensively used to explain results in memory experts (Chase & Simon, 1973; Ericsson & Kintsch, 1995; Gobet & Simon, 1996; Gobet, Lane, & Lloyd-Kelly, 2015; Williamon & Valentine, 2002).

The article is organized in three main sections: First, we present the SNARC and SPoARC effects; second, we describe the three main theories of expertise; and third, we provide our expertise account of the SNARC and SPoARC effects, indicating how the expertise account contrasts with the previous accounts.

The SNARC and SPoARC effects

SNARC and SPoARC are effects in which the items in memory seem to be organized along a vertical or horizontal dimension following a particular direction. We will refer to this general phenomenon as spatialization (Guida, Carnet, Normandon, & Lavielle-Guida, 2018), in which the cognitive system adds a spatial organization to presented information. Figure 1 illustrates the paradigms used to test SNARC, SPoARC, or both effects together.

Fig. 1
figure 1

Designs for testing only the SNARC effect, only the SPOARC effect, or both effects. Phase 2 in the figure presents one type of association between the stimuli and the responses (e.g., right: odd numbers), which usually represents 50% of the trials; the converse association is not presented (e.g., left: odd numbers) but is executed by the participants (remaining 50% of trials)

The SNARC effect and similar spatializations

In their seminal article, Dehaene et al. (1993) investigated the SNARC effect in nine experiments using odd-even judgments. As is shown in the first row of Fig. 1, participants had to press a key to indicate whether a number presented in the middle of the screen was odd or even. In half of the trials they had to use one hand for odd numbers and the other hand for even numbers and in the other half the hand assignment was reversed. The authors found that small numbers (e.g., 1, 2) were responded faster with the left hand and large numbers (e.g., 8, 9) were responded faster with the right hand.

The SNARC effect was found to be robust, no matter if the participants were left-handed or if they crossed hands (Dehaene et al., 1993; but see Wood, Nuerk, & Willmes, 2006), and it was observed with different tasks and formats, such as number comparison tasks (participants had to indicate whether a displayed number was bigger or smaller than a reference number; e.g., Brysbaert, 1995; Dehaene, Dupoux, & Mehler, 1990), one- or two-digit numbers (e.g., Brysbaert, 1995; Dehaene et al., 1993; Dehaene et al., 1990), or even with words (Dehaene et al., 1993; Fias, 2001), instead of Arabic digits, to represent numbers (for reviews of the SNARC effect, see Abrahamse, van Dijck, & Fias, 2016; Fias & Fischer, 2005; Fias, van Dijck, & Gevers, 2011; Gevers & Lammertyn, 2005; Wood, Willmes, Nuerk, & Fischer, 2008). Interestingly, the SNARC effect was reversed in studies with participants from right-to-left reading cultures (e.g., Dehaene et al., 1993; Shaki, Fischer, & Petrusic, 2009; Zebian, 2005), and shifted to a vertical orientation in Chinese (Hung, Hung, Tzeng, & Wu, 2008) or Japanese (Ito & Hatta, 2004) participants, who read top-down.

The SNARC effect is not limited to numbers: It has been found with days of the week (Gevers, Reynvoet, & Fias, 2004), months of the year, and letters (Gevers, Reynvoet, & Fias, 2003). In all these experiments, participants were shown a stimulus (e.g., “Monday,” “April,” or “B”) and had to indicate using both hands whether the stimulus came before or after a reference stimulus (e.g., “Wednesday,” “July,” or “R,” respectively), or whether or not the stimulus contained a specific feature (e.g., “a vowel”). In all these experiments, the left hand tended to be faster for stimuli at the beginning of the ordinal sequences, whereas the right hand tended to be faster with the last part of the ordinal sequences. More recently, Sellaro, Treccani, Job, and Cubelli (2015) showed a SNARC-like effect for the size of objects, with left responses being faster for small objects and right responses, for large objects.

Interestingly, the SNARC effect can be made disappear if working memory (WM) load is added to a task. Herrera, Macizo, and Semenza (2008) showed that when a number comparison task is administrated during the retention interval of a spatial WM task, SNARC disappears; contrarily, it does not disappear when the number comparison task is embedded within a verbal WM task. Van Dijck, Gevers, and Fias (2009) observed the reverse pattern when the number comparison is replaced by an odd–even judgment task. Overall these results show that numbers are associated with different spatial codes, which can have a visuospatial (i.e., number comparison) or verbally (i.e., odd–even judgment) mediated nature depending on the task.

The SPoARC effect

The SPoARC effect—also known as the ordinal position effect (Ginsburg, van Dijck, Previtali, Fias, & Gevers, 2014; not to be mistaken with Becker’s, 1953, ordinal position effect, which refers to the effect of speaking order on the perceived quality of a speech)—was first brought to light by van Dijk and Fias (2011; but see also Previtali, Hevia, & Girelli, 2009) using a WM task. The authors asked participants to remember in the correct order sequences of five random numbers (ranging from 1 to 10) displayed successively in the center of the screen. After this presentation phase, all numbers ranging from 1 to 10 were randomly and sequentially displayed in the center of the screen (twice each number). Participants had to execute an odd–even judgment but only for the five numbers that appeared in the presentation phase. After these judgments, four sequences of five numbers were presented, and participants had to indicate which of those sequences was the one presented in the presentation phase (also see third row of Fig. 1). The results showed that instead of having a classic SNARC effect, the left-hand responses were faster for the numbers presented in the first positions in the sequence displayed in the presentation phase (instead of the smaller magnitudes), and the right-hand responses were faster for the numbers presented in the last positions (instead of the larger magnitudes).

Since this study, several experiments (see Table 1 for an exhaustive list of experiments testing only the SPoARC or both the SPoARC and SNARC effects, ever since van Dijk & Fias’s, 2011, initial study) have been conducted. The SPoARC effect has been observed (1) with short-term memory (STM) paradigms necessitating only storage (see second row of Fig. 1; e.g., Guida, Leroux, Lavielle-Guida, & Noël, 2016) and with WM paradigms necessitating storage and processing (e.g., van Dijck & Fias, 2011); (2) when order is relevant for the task completion (such as in the van Dijck & Fias, 2011, paradigm, in which participants have to recognize each sequence) or when order is irrelevant (i.e., without sequence recall or recognition; e.g., Guida, Leroux, et al., 2016); (3) with auditory (Guida, Leroux, et al., 2016) or visual input (e.g., van Dijck & Fias, 2011); (4) with verbal material (e.g., Ginsburg et al., 2014; van Dijck & Fias, 2011) or images (Ginsburg, Archambeau, van Dijck, Chetail, & Gevers, 2017); (5) with open (Guida, Carnet, et al., 2018) or closed sets (e.g., van Dijck & Fias, 2011); (6) with an eye-tracking device (Rinaldi, Brugger, Bockisch, Bertolini, & Girelli, 2015); (7) with a line bisection task (Antoine, Ranzini, Gebuis, van Dijck, & Gevers, 2017); (8) when testing noncongenitally blind people, but not with the congenitally blindFootnote 1 (Bottini, Mattioni, & Collignon, 2016); and (9) when testing Arabic literates, for whom the effect is reversed (Guida, Megreya, et al., 2018).

Table 1 Experiments that have tested solely the SPoARC effect or the SPoARC and SNARC effects, starting from van Dijck and Fias (2011)

Umiltà, Bonato, and Rusconi (2018) have recently categorized SPoARC experiments as testing spatiotemporal associations because, in Westerners, the early items of a sequence are linked to left and the late to right. Although a sequential presentation of items has undeniably a temporal dimension, we do not think that time is a crucial dimension because in SPoARC experiments, participants do not have to execute a before/after judgment (e.g., was X item before Y item?), but just press a key if an item X was in the sequence, which does not stress the temporal dimension.

Finally, in several experimental conditions the SPoARC effect does not occur. First, no SPoARC effect was observed with sequences of 15 words (Guida, Carnet, et al., 2018), whereas it has been observed several times with sequences of four (Guida, Megreya, et al., 2018; van Dijck, Abrahamse, Majerus, & Fias, 2013) or five (e.g., Ginsburg et al., 2014; van Dijck & Fias, 2011) items. Second, the typical SPoARC effect does not seem to be observable with spatial sequences. Ginsburg et al. (2017) asked their participants to remember sequences of four black dots in an (invisible) 8 × 8 matrix; a SPoARC effect was subsequently tested, but no preferential association between the first positions of the sequences and the left hand and the last positions and the right hand was found. Third, Ginsburg et al. (2017) did not find a SPoARC effect with pseudowords, concluding that in order for the SPoARC effect to appear, information needed to be processed sufficiently deeply—namely, to a semantic level. We will come back to this conclusion in the section describing our expertise account.

Theoretical accounts that explain both SNARC and SPoARC effects

When the SNARC effect was first discovered, Dehaene et al. (1993) provided an LTM theoretical account of the effect putting forward the mental numerical line (MNL). They proposed that the SNARC effect was due to the representation numbers have in semantic memory that take the form of an MNL, its direction and orientation varying according to cultural reading habits. This LTM conception of the SNARC effect was also coherent with SNARC effects subsequently observed with nonnumerical ordinal sequences (e.g., the days of the week; Gevers, Reynvoet, & Fias, 2004). In both cases (ordinal and numerical sequences), the SNARC effect would be caused by the internal organization that these concepts have in LTM: that of a mental line.

The working memory account of the SNARC and SPoARC effects

The first account that encompassed both effects was proposed by van Dijck and Fias (2011) when they observed the SPoARC effect. To explain this effect, they could not use Dehaene et al.’s (1993) semantic LTM account, as their results showed a spatialization with random sequences, which do not exist in terms of numerical or ordinal sequences in semantic LTM. Their results pointed toward the idea that the participants created a mental line based on the order the items entered in WM, and not on the canonical order numbers have in semantic memory. Therefore, they proposed a WM account of the SPoARC, which was extended to the SNARC. The authors suggested that in both effects, position in WM is the determinant, as opposed to long-term memory (LTM) associations, and that the spatial coding of number does not have its origin in an LTM representation.Footnote 2 Whereas Dehaene et al.’s (1993) semantic LTM account cannot explain the SPoARC effect, van Dijck and Fias (2011) argued that the WM account could explain the SNARC effect. For example, in a classic parity judgment SNARC task, once the participants know the range of the numbers used in the experiment (e.g., from 1 to 9), they will spontaneously start encoding the numbers in WM using the canonical order of numbers. Since the positions in WM are associated with space, small numbers would thus acquire a left value, and large numbers a right value, throughout the task.

Although van Dijck and Fias (2011) pitted a long-term representation account against a WM account, the WM account does not deny the importance of LTM knowledge in the SNARC and SPoARC effects. This is apparent in later versions of the account (i.e., the mental whiteboardFootnote 3; Abrahamse, van Dijck, & Fias, 2016, 2017; Abrahamse, van Dijck, Majerus, & Fias, 2014), in which the role of LTM knowledge is more explicitly stated. Abrahamse et al. (2016) indicated that LTM knowledge intervenes at least at three levels. First, spatialization occurs because the incoming information in WM acquires a spatial value by being linked to a spatial LTM template (the mental whiteboard), such that spatial attention controls later search and selection. Second, numbers trigger an orientation in WM that depends on the direction of one’s reading/writing system. Third, “LTM contains—besides item representations—a representation for ordered item sets that are used frequently and systematically” (i.e., 1–9; Abrahamse et al., 2016, p. 4).

The dual account of the SNARC and SPoARC effects

Following van Dijck and Fias’s (2011) proposal, several studies have tested their WM account. Ginsburg et al. (2014) first showed that if van Dijck and Fias’s paradigm was tweaked, a SNARC effect would replace the SPoARC effect. In the original paradigm (i.e., van Dijck & Fias, 2011) presented above, participants must classify numbers only if they belong to the memoranda (i.e., the five numbers that appear in the presentation phase). The result is generally a SPoARC effect with no SNARC effect. However, in Experiments 3 and 4, Ginsburg et al. (2014) removed the no-go component by asking the participants to classify all the numbers displayed on the screen. And the result was a SNARC effect with no SPoARC effect. Because the two effects do not seem to be sensitive to the same task demands, the authors used their results to counter the idea that the SPoARC and SNARC effects were due to the same underlying processing mechanism, as claimed by the WM account.

However, the result that really laid the ground for the dual account as a counterproposal to the WM account came from Ginsburg and Gevers’s (2015) study. With an astute paradigm that intermingled go/no-go and go trials (for more details, see the Accounting for the Data section describing our expertise model), they showed that SNARC and SPoARC could coexist (for a similar result, also see Huber, Klein, Moeller, & Willmes, 2016) and therefore were not mutually exclusive (and no interaction was detected). Ginsburg and Gevers proposed that the SNARC and SPoARC effects derive from two different processes, the former based on LTM semantic knowledge (with numbers ordered canonically) and the latter based on the order items entered in WM. The dual nature of the effects was the reason why both effects could be detected conjointly. These results are difficult to account for with a pure WM account, because one would expect only one order of numbers to be in WM during the task: either the numbers ordered canonically or the numbers as they entered WM (for more details, see the Accounting for the Data section).

Expert memory

If the idea derived from the SPoARC effect (i.e., verbal items in WM are spatially organized) could seem new in the numerical cognition domain and even for classic theories of WM in which visuospatial information is separated from verbal information (Baddeley, 1986; Baddeley & Hitch, 1974; but see Baddeley, 2000), it is not new to mnemonists. Indeed, the allegedly most ancient memory technique (mnemonic)—the method of loci proposed by Simonides of Ceos more than two millennia ago (Worthen & Hunt, 2011; Yates, 1966)—involves organizing information spatially. Before a speech, Greek orators visualized a well-known route or a sequence of familiar locations (e.g., rooms in one’s own house), and mentally stored important words of the speech in different locations. Then, during the speech, orators would mentally picture their house, taking a mental tour and retrieving each word from familiar locations (e.g., rooms). This method clearly necessitates to spatialize the items to be remembered in various locations. The method of loci is still being used nowadays by expert mnemonists in World Memory Championships.

We believe that the concepts and findings from the domain of memory expertise can shed light on both the SNARC and SPoARC effects, not only because of the spatial connection between memory expertise and these effects but also because reading/writing expertise seems to play a role in both effects. In effect, the SNARC/SPoARC direction varies according to the reading/writing direction (e.g., Guida, Megreya, et al., 2018; Shaki et al., 2009) and illiterates do not seem to exhibit any SNARC or SPoARC effect (e.g., Guida, Megreya, et al., 2018; Shaki, Fischer, & Gӧbel, 2012; Zebian, 2005). Hereafter, we will succinctly introduce the expert memory domain.

Capturing memory expertise in the lab

Chase and Ericsson (1981, 1982, Ericsson, Chase, & Faloon, 1980) trained in their laboratory two university students (S.F. and D.D.) to increase their performance in a digit memory task from a standard score of seven digits to 82 digits in 264 h of training (S.F.) and to 106 digits after 800 h of training (D.D.). In the task, a list of digits was presented at a rate of one digit per second, and the participants had to recall the correct sequence. The theoretical accounts shown below were based on these findings.

Theoretical accounts of expert memory

EPAM IV

EPAM (Elementary Perceiver and Memoriser) IV (Richman, Staszewski, & Simon, 1995) is a computational model with a STM that can hold a limited number of chunks, a semantic LTM accessed from a discrimination net, and production rules. Both LTM and the discrimination net grow due to the functioning of learning processes stored in LTM. One important type of structure within its LTM is the retrieval structure, which is a specific net of nodes with their connecting links forming a treelike structure. New information (e.g., a sequence of digits) can be stored at different nodes (a.k.a. slots) of the retrieval structure. With experience or training the retrieval structure can also grow, allowing more pieces of information to be stored in their slots. Several retrieval structures can be grown in EPAM IV’s LTM.

D.D.’s performance was explained by two components of EPAM IV: the use of retrieval structures to store groups of digits, and the use of semantic knowledge of running times. D.D. was a runner, so he possessed semantic knowledge of running times (e.g., the sequence of digits “9 5 8” can be represented by the label “Usain Bolt’s 100 m world record,” which is 9.58 sFootnote 4). Semantic knowledge enables the chunking of information, thus separate elements can be processed as one unit (“9 5 8” are processed as one element “9.58”).

Long-term working memory

Before the long-term working memory (LTWM) theory (Ericsson & Kintch, 1995), Ericsson and colleagues (Chase & Ericsson, 1981, 1982) proposed the skilled memory theory and put forward three principles to explain how people without exceptional memory capacity could show exceptional performance in memory tasks. The first principle “meaningful encoding” states that for information to be stored effectively in LTM, it needs to be transformed into meaningful units through the knowledge one possesses (i.e., semantic knowledge), like what occurs in chunking for example. The second principle (structured retrieval) proposes that by building and using a retrieval structure one can store and access information more efficiently. The third principle (acceleration) states that, with extensive practice, experts can accelerate the encoding and retrieval of information in LTM. Figure 2 shows the evolution of the spatial structure that S.F. used as a retrieval structure. Like D.D., S.F. used his semantic knowledge to chunk together one-digit numbers as running times and link them to the slots of his retrieval structure.

Fig. 2
figure 2

(a) S.F.’s first retrieval structure. Adapted from “Memory Skill,” by K. A. Ericsson, 1985, Canadian Journal of Psychology, 39, pp. 188–231. Copyright 1985 by the American Psychological Association. (b) S.F.’s retrieval structure to recall 30 one-digit numbers. From “Long-Term Working Memory,” by K. A. Ericsson and W. Kintsch, 1995, Psychological Review, 102, pp. 211–245. Copyright 1995 by the American Psychological Association

Ericsson and Kintsch (1995; see also Ericsson & Delaney, 1999) built upon Chase and Ericsson’s (1981, 1982) skilled memory theory and developed the LTWM theory. One new structure that Ericsson and Kintsch (1995, p. 220) included is an elaborated memory structure “associating items from a given trial or context,” which means, when considering Fig. 2b, that on top of the retrieval structure, experts can create meaningful encodings between chunks that allow for distinguishing a trial from previous ones.Footnote 5

Whereas the skilled memory theory was meant to explain the performances of DD or SF, LTWM was proposed as an explanation for the memory involved in all domains of expertise. LTWM is domain-specific, and it is acquired to improve performance in specific tasks that require large WM demands. Although there are several differences between EPAM IV and LTWM (mainly, EPAM IV is implemented in a computational model, whereas LTWM is not), their main components are the same: retrieval structures and domain-specific semantic knowledge. For example, to explain memory performance in chess players, LTWM theory suggests that chess players use a representation of an empty chessboard as a retrieval structure. Likewise, for comprehending texts, items (words) are remembered better when the readers are able to form a situational model (for a detailed explanation, see van Dijk & Kintsch, 1983). The situational model is a complex structure that combines an idea of the topics and temporal cues in the text (like a retrieval structure) with semantic information (knowledge of words), and it is used by readers to remember critical components of a text to be able to comprehend it. Waiters can memorize orders by generating retrieval structures (e.g., the locations of the tables across the restaurant) and storing information within those structures. Medical experts generate retrieval structures, which include medical knowledge, and they are associated with specific information about patients (symptoms, laboratory test results, etc.).

Template theory

In LTWM and EPAM IV, retrieval structures and semantic knowledge are separated. The template theory developed by Gobet and Simon (1996) to explain memory in chess introduces a different structure—the template (for a detailed discussion of the differences between retrieval structures and templates, see Ericsson & Kintsch, 2000; Gobet, 2000). The templates have characteristics of both retrieval structures and semantic knowledge, and they evolve from more simple structures—chunks. Chunks are a meaningful group of individual items. By their accumulated experience, chess experts’ memory evolves chunks (configurations of three or four chess pieces) into templates (configurations of more than ten chess pieces). The difference between chunks and templates is not only their size but also that templates contain slots in which additional information (including small chunks) can be stored. The core of the template is not an empty structure; rather, it contains domain-specific knowledge (like the semantic knowledge in EPAM IV and LTWM) as well as slots (like retrieval structures). Figure 3 shows an example of a template: The pieces on the board indicate the core, whereas the crosses indicate the positions where chess pieces can be added without changing the identity of the template (i.e., the slots).

Fig. 3
figure 3

Example of a template from Gobet and Simon (1996). On the left are the attributes of the template: the nonvariable part of a template (the template core) and the variable part of a template (slots for pieces, squares, openings, plans, moves, and links to other templates). On the right is a diagrammatic representation of the same template: Pieces on the board indicate the core pieces in the template, and crosses indicate values contained in the piece or square slots. From “Templates in Chess Memory: A Mechanism for Recalling Several Boards,” by F. Gobet and H. A. Simon, 1996, Cognitive Psychology, 31, pp. 1–40. Copyright 1996 by Academic Press. Adapted with permission

Synthetizing and generalizing expert memory to everyday memory

Semantic knowledge and retrieval structures are the backbone of EPAM IV and LTWM, whereas chunks and templates constitute the core of the template theory. Despite their differences, the three theories evolve with regard to common principles, which were identified by Guida and colleagues (Guida, Campitelli, & Gobet, 2016; Guida, Gobet, & Nicolas, 2013; Guida et al., 2012). Building on their work, we propose the general term of “knowledge structure,” which refers to the meaningful blocks of knowledge that one builds while acquiring expertise. These knowledge structures can be of two types: non slotted schemas (e.g., semantic knowledge, chunks) or slotted schemas (e.g., retrieval structures, templates)

In addition, Campitelli (2015) proposed that slotted schemas such as retrieval structures are not only important for experts, but they are at the core of memory mechanisms for everyday life. He used the term “knowledge structures” to emphasize that they are involved in many cognitive processes, not only in memory retrieval. In this approach, humans use the most relevant available knowledge structures to assist them in remembering and processing information. If the task at hand and the information to be remembered belong to a domain in which we have expertise, we can use domain-specific and content-rich slotted schemas that we have developed while being exposed to that domain, similar to templates (e.g., a configuration of chess pieces, a typical sentence, the configuration of our house to be used in the loci method). In typical laboratory memory tasks with meaningless material, we do not have expertise, thus we will use slotted schemas with a generic core, like for example the retrieval structure first used by SF when he started to train (Fig. 2a). This is similar to the slotted structures proposed in traditional STM models (e.g., Luck & Vogel, 1997). Campitelli proposed that the traditional models of STM constitute only one of many possible knowledge structures that we can use for remembering information.

Previous models grounding working memory in long-term memory knowledge

Although we characterize everyday memory borrowing concepts from the domain of memory expertise and using them to account for two spatial effects that occur in WM, we do not want to minimize the importance of other types of LTM knowledge that are known to be activated in WM and to contribute to WM performances through LTM–WM interaction.

Indeed, the most influential theories of WM are built around the link between LTM and WM, postulating that WM is the activated portion of LTM (e.g., Anderson, Reder, & Lebiere, 1996; Barrouillet, Bernardin, & Camos, 2004; Cowan, 1995; Just & Carpenter, 1992; Oberauer, 2002). In these “LTM-based WM models” (Guida et al., 2013)—in contrast with models based on Baddeley and Hitch’s (1974) architecture, in which WM and LTM are envisaged as separated entities (but see Baddeley, 2000)—WM performances are interpreted in terms of LTM activation or through the process of reconstructing LTM activation from LTM knowledge (Schweickert, 1993; or the reconstruction hypothesis of Towse, Cowan, Hitch, & Horton, 2008). Among the advantages of “LTM-based WM models” is the fact that the link between WM and LTM facilitates the accounts of numerous studies that have demonstrated the effects of LTM knowledge on WM (for an exhaustive list, see Majerus, 2008). For example, word lists are recalled better than nonword lists (e.g., Hulme, Maughan, & Brown, 1991), words of high lexical frequency are recalled better than less frequent words (e.g., Roodenrys, Hulme, Alban, Ellis, & Brown, 1994), and lists of concrete words are recalled better than lists of abstract words (e.g., Walker & Hulme, 1999). These results clearly show that verbal WM strongly relies on the activation of LTM linguistic structures, moreover in a very fast and automatic manner (Kowialiewski & Majerus, 2018; Majerus, Van der Linden, Mulders, Meulemans, & Peters, 2004). Hence, it comes as no surprise that some WM models have grounded WM in terms of linguistic knowledge. Martin and Saffran (1992) is one of the most extreme examples, in that they viewed verbal STM as an emergent property of the activation of phonological, lexical, and semantic language representations in the language network (see also Burgess & Hitch, 1999; Gupta, 2003). A more recent proposal, the attention, order, short-term memory activation model (Majerus, 2008; Majerus, Heiligenstein, Gautherot, Poncelet, & Van der Linden, 2009) hybridizes Martin and Saffran’s model with more modern views concerning the importance of serial order (e.g., Burgess & Hitch, 1999), adding the proposal of an attentional modulator that can direct resources toward either language representations or item processing. Acheson and MacDonald’s (2009a; see also Acheson & MacDonald, 2009b) account of WM also heavily relies on linguistic knowledge, but this time in terms of production. They proposed that maintaining serial order in verbal WM may emerge from the language production architecture (i.e., language production hypothesis).

Overall, it should be clear that it is well-known that LTM knowledge aids memory and that the interaction between LTM knowledge and WM is a pervasive feature of modern models of WM. What our proposal suggests, in terms of novelty, is a further instantiation of this interaction, by proposing that knowledge structures as a generalization of concepts from theories of expertise can also explain everyday memory performance such as that implicated in the SPoARC and SNARC effects.

Accounting for both SNARC and SPoARC effects: An expertise account

Nonslotted schemas: The SNARC effect

Taking an example from the chess game, if a player starts playing consistently, he will increase his knowledge of the game, he will start remembering the game positions (or part of the positions) that are often played. And gradually, individual pieces that compose a position will cement into chunks. Once a player has built and stored these nonslotted schemas in LTM—transitioning from the chunk creation to the chunk retrieval phase (Guida et al., 2012)—he will be able to use them to encode information. This explains why STM capacity is a function of our capacity to chunk information (Cowan, 2001, 2012; Mathy & Feldman, 2012).

How do they apply to the SNARC effect?

These nonslotted schemas can be used to explain SNARC-like effects. Like chess players who are exposed to chess piece configurations, laypeople are usually exposed to numbers, process during which they gradually learn that numbers have a precise order. And some orders are highly practiced, such as the sequence 1 2 3 or the sequence 1–10. We suggest that very practiced sequences will behave as chunks and thus can be considered nonslotted schemas that can be directly activated from LTM. The exact identity of these sequences will vary as a function of expertise with numbers (see below for professional mathematicians). But for an average young adult, the canonical 1–9 number set (Abrahamse et al., 2016) will certainly be available as a chunk and will thus be used as a nonslotted schema. We believe that the same kind of nonslotted schemas are in use when one is processing the days of the week, the months of the year, or the alphabet. Hence, when participants are confronted with SNARC-like tasks, they will activate these nonslotted schemas or a portion of them

Like the MNL, these nonslotted schemas also include the spatial nature of these sequences. This kind of spatialization can be easily comprehended, especially in terms of grounded cognition (e.g., Barsalou, 2010; Glenberg, 1997; Shapiro, 2011); if one always experiences months or numbers as sequences written from left to right, it is comprehensible that when one thinks about months or numbers, the left-to-right dimension could still be associated. The idea of nonslotted schemas to explain the SNARC effect is not far from Dehaene’s LTM account of SNARC (MNL), the main difference being that our account stresses expertise. SNARC-like effects start to be detected in four-year-old children (McCrink, Shaki, & Berkowitz, 2014; Opfer, Thompson, & Furlong, 2010), therefore it seems that the kind of expertise—for this spatialization to appear—occurs by means of observational learning (for a review see McCrink & Opfer, 2014; Patro, Nuerk, Cress, & Haman, 2014) through the interaction between infants and caregivers (McCrink, Caldera, & Shaki, 2018) via storybook reading for example. In cultures in which numbers (left to right in Israel) and words (right to left in Israel) are written in opposite directions, nonslotted schemas seem less or not at all spatialized (Shaki et al., 2009), which suggests that for SNARC-like effects to be observed, the spatial biases in a culture must not be opposite to each other. Finally, concerning the relation between expertise and SNARC effect, recent work (Cipora et al., 2016; Kramer, Bressan, & Grassi, 2018) suggests that it could be quadratic, as the link between mathematical knowledge and the SNARC effect is positive in children but seems to be negative in adults. Different processing/representations could be engaged in professional mathematicians: Cipora et al. proposed nonlinear/nonhorizontal numerical representations or more abstract ones, which would correspond to different knowledge structures within our framework.

Slotted schemas: The SPoARC effect

As we presented above, when a player increases his knowledge, he will be able to use nonslotted schemas to encode information. However, if he continues increasing his knowledge, he will start developing slotted schemas, named templates (Gobet & Simon, 1996; also see Gobet, 2013; Gobet et al., 2001)

How do they apply to the SPoARC effect?

Similarly, if an individual increases his expertise by completing his formal training in reading/writing, we suggest that a slotted schema is developed (see also Guida & Lavielle-Guida, 2014), since Guida, Megreya, et al. (2018) did not observe the SPoARC effect with illiterates. This slotted schema takes the form of a mental line. However, unlike the MNL, it is a slotted mental line. As is shown by Fig. 4, the elements entering WM will be associated to it, and therefore their position in WM will be marked/tagged spatially. The slotted mental line can be envisioned as a primitive version of S.F.’s slotted schema pictured in Fig. 2a. However, contrary to a fully grown slotted schema (e.g., Fig. 2b), the slotted mental line does not offer the possibility to transfer information rapidly to LTM.Footnote 6

Fig. 4
figure 4

The slotted mental line used by Westerners to encode three elements through spatial positional tagging. Adapted from “2011 Space Odyssey: Spatialization as a Mechanism to Code Order Allows a Close Encounter Between Memory Expertise and Classic Immediate Memory Studies,” by A. Guida and M. Lavielle-Guida, 2014, in Psychological Perspectives on Expertise (pp. 176–180), Frontiers Media SA. Copyright 2014 by the authors

This idea of a slotted mental line fits well with recent computational models of WM and how they account for order coding. Since the seventies, results (e.g., Bjork & Healy, 1974; Dale, 1987; Engelkamp & Dehn, 2000; Henson, Hartley, Burgess, Hitch, & Flude, 2003; Majerus, Poncelet, Elsen, & van der Linden, 2006; Mulligan, 1999; Sperling & Melchner, 1976) have shown that WM models should account for two distinct kinds of information: memory for the content and memory for the order, which seem to be coded and represented separately (for a review, see Hurlstone, Hitch, & Baddeley, 2014; Majerus & Attout, 2018; Marshuetz, 2005). The type of models that account very well for this dissociation (content/order) and that have become predominant in the last two decades are positional models (e.g., Anderson & Matessa, 1997; Brown, Neath, & Chater, 2007; Brown, Preece, & Hulme, 2000; Oberauer & Lewandowsky, 2011; Portrat, Guida, Phénix, & Lemaire, 2016). In these models, order is coded through positional tags associated to each item. Even if no consensus has emerged concerning the actual nature of the positional tags, recent work on spatialization (e.g., Ginsburg et al., 2017; Guida, Carnet, et al., 2018; Guida, Leroux, et al., 2016; Rinaldi et al., 2015; van Dijck, Fias, & Andres, 2015) has showed that their nature could be spatial (see also Rinaldi, Merabet, Vecchi, & Cattaneo, 2018), like in the mental slotted line we are proposing.

Interestingly, literacy is not the only expert-related condition to be able to use the mental slotted line. As was recently shown by Ginsburg et al. (2017), the encoded information needs to be processed deeply enough at a semantic level for an individual to be able to use his mental slotted line. This result is interesting in terms of expertise as it parallels the first principle of the skilled memory theory (Chase & Ericsson, 1981, 1982), which states that in order to be able to use the skilled memory, a meaningful encoding is necessary.

Difference between the expertise account and previous ones

The central element that the WM account and our account have in common is the LTM cognitive structure that is activated when one must encode a sequence of verbal elements as in SPoARC tasks. The WM account (Abrahamse et al., 2016, p. 4) proposes that “each item is bound to a specific coordinate within an existing, well-coordinated templateFootnote 7 from long-term memory.” Building on work we published previously (Campitelli, 2015; Guida & Lavielle-Guida, 2014), we have also proposed an LTM structure (i.e., a slotted schema). Therefore, as far as the SPoARC effect is concerned our explanation is similar to the WM account even if our LTM structures have features derived from the theories of expertise, which is no the case for the WM account.

However, it is when accounting for the SNARC effect that our views really diverge. The WM account suggests that in the SNARC effect too, spatialization occurs in WM and depends on the same LTM template than in the SPoARC effect. Drawing from the literature on expertise, we have put forward a nonslotted schema to account for the SNARC.

To be more specific, in a SNARC task, in which elements from 1 to 9 are used, and as far as the WM account is concerned, only these numbers and their canonical order will be activated from LTM, no spatial value will be activated from LTM, the spatial value will be added when they interact with the LTM template in WM. For our account, the number set 1–9 will be activated from LTM as a nonslotted schema, which also contains their spatial value, similarly to the MNL account. It seems to us more economical for the cognitive system to use the spatial information already linked to numbers (or in other highly known sequences; e.g., the days of the week) and putatively stored in LTM, than to (re)spatialize the information when it reaches WM as suggested by the WM account. We believe that the cognitive system will spatialize information in WM as a last resort. Hence, we introduce here the three parsomony steps of our account. First, if spatial information is already provided by the task at hand, the system will just use the spatial information in the environment. Recent results show that we do not respatialize information if it is already spatialized. Guida, Abrahamse, and van Dijck (in revision) showed that if the information is presented to Westerners from right to left, they will not respatialize it from left to right, but just use the spatial information that is provided (for other examples, see the Simon effect [Simon & Rudell, 1967] or the visuospatial bootstrapping effect [Darling, Allen, & Havelka, 2017; Darling, Allen, Havelka, Campbell, & Rattray, 2012; Darling & Havelka, 2010]). Second, if the task does not provide spatial information, the cognitive system will use the spatial information in LTM, if any is available (i.e., nonslotted schemas; e.g., the SNARC effect). Third, if this LTM spatial information is not suitable to code the information (e.g., a random sequence of numbers), the cognitive system will use a slotted schema to spatialize the information.

Concerning the dual account as compared to our own, there are many converging points, but also some differences. The main convergence is that the dual account explains the SNARC and SPoARC effects separately, by invoking two mnemonic systems to account for them: LTM for SNARC and WM for SPoARC. In our account, too, the SNARC and SPoARC effects are accounted for separately, but not by invoking two memory systems; instead, we invoke two different LTM structures: nonslotted and slotted schemas. The second crucial point is that in our account, the capacity to use LTM is a function of expertise.

Table 2 summarizes the three accounts and how they explain the SNARC and SPoARC effects.

Table 2 Dual account, working memory account, and expertise account of the SNARC and SPoARC effects

Accounting for the data

When putting forward theories or hypotheses, an important concern is how these theories can account for the data, or inversely, do the data adjudicate between the different theories?

The first interesting pattern of results is that the SPoARC effect does not seem to operate on sets bigger than six (usually four or five; Guida, Carnet, et al., 2018). Data from Huber et al. (2016) suggest that the SPoARC effect decreases gradually when transitioning from sequences of four to five to six. The SPoARC effect could be limited by standard STM/WM limitations, which do not seem to apply to SNARC—it can be observed, for example, with a range of ten numbers (e.g., 0 to 9 in Dehaene et al., 1993).

From a dual-account perspective, this discrepancy could indicate that two different memory systems are differentially involved (LTM vs. STM/WM), which do not have the same limitations. From a WM account this would maybe be more difficult to explain, and to our knowledge Abrahamse and colleagues have not addressed this discrepancy.

From our perspective the explanation resides in the fact that two different structures are operating. Concerning the SNARC, the nonslotted schemas (e.g., the sequence 1 to 9) will behave as chunks and chunks allow to compress (e.g., Mathy & Feldman, 2012) various elements (e.g., F, B, I) into one (e.g., FBI), which allow to overcome the standard limits of STM/WM (Chase & Simon, 1973; de Groot, 1965). Hence, the SNARC effect is not limited by the standard STM/WM limitations.

Concerning the SPoARC effect, in our framework it makes sense that it is linked with standard STM/WM limits, as we believe that STM/WM capacities depend (among various elements) on the capacity to use the mental slotted line. For example, illiterates who putatively do not use the slotted mental line, have lower STM/WM capacities (Demoulin & Kolinsky, 2016; Kosmidis, Zafiri, & Politimou, 2011; Silva, Faísca, Ingvar, Petersson, & Reis, 2012). Since both the SPoARC effect and STM/WM capacities depend (among other factors) on the slotted mental line, it comes as no surprise within our framework that the SPoARC effect and STM/WM capacities are tied together.Footnote 8

The second pattern of results concerns Ginsburg and Gevers’s (2015) and Huber et al.’s (2016) results. Contrary to van Dijck and Fias (2011) and previous experiments (e.g., Ginsburg et al., 2014), Ginsburg and Gevers (2015) and Huber et al. (2016) observed both SNARC and SPoARC effects (but no interaction). Both studies used a paradigm similar to that of van Dijck and Fias: A sequence of numbers was displayed and had to be remembered, and then a classification task was administrated in which participants had to classify (e.g., by a left keypress for odd numbers and a right keypress for even numbers) each number displayed, but only if it belonged to the memoranda (third row of Fig. 1). There were also some differences between their paradigms and that in van Dijck and Fias’s (2011) study. Ginsburg and Gevers intermingled classic trials (as just explained) with inducer trials in which the classification task (i.e., a number comparison tasks) had to be executed for all numbers, not only for those among the memoranda. And Huber et al. varied the number of elements in the memoranda (four, five, or six) and the range of the numbers (1–9 or 1–10) in the classification task (i.e., an odd–even judgment task); moreover, they used more trials (almost 2,000) than in van Dijck and Fias’s study (400 trials).

Ginsburg and Gevers (2015) and Huber et al. (2016) interpreted their result (SNARC and SPoARC, but no interaction) as showing that the random sequences in WM (which produced the SPoARC) and the canonical 1–10 order (which produced the SNARC) were conjointly activated during the experiment but did not influence each other. They interpreted this result within the dual-account framework, as showing that both effects were produced by two distinct memory systems. The idea of a coexistence of both effects is more problematic for van Dijck and Fias’s (2011) WM account, as one would expect only one set of elements to be activated in WM, the random sequence (which produces the SPoARC) or the canonical 1–10 order (which produces the SNARC). However, to accommodate the results, Abrahamse et al. (2016, p. 4) extended the WM account by adding a supplementary assumption: “multiple item sets can be active in working memory simultaneously.” This would explain the results of Ginsburg and Gevers and Huber et al., in that both the random sequence and the canonical 1–10 order were in WM.

SNARC and SPoARC as mutually exclusive effects

From the expertise-account perspective, we suggest that at one moment in time both effects are mutually exclusive, since we believe that the SPoARC effect is due to the spatial values generated by a specific slotted schema, and the SNARC effect by the spatial values generated by another structure, a nonslotted schema. Hence, we put forward the testable assumption that only one set of spatial values (or spatial frame) will be activated at any moment. At the item level, it means that at one moment in time an activated item is only associated with one spatial value, which comes from a slotted schema or a nonslotted schema. We can switch very quickly from a spatial frame to another, but only one is in the focus of attention at one precise moment. We put forward this assumption for two reasons.

The first is due to the laws of physics of the world we live in, which mandate that an element cannot be at two places at the same moment. On the basis of grounded cognition (e.g., Barsalou, 2010; Glenberg, 1997; Shapiro, 2011), we believe that this physical principle observable by our perceptive system is internalized. This principle thus prevents an activated item from having two different spatial values (e.g., left and right) at any one moment.

Second, we believe that the lack of an interaction in Ginsburg and Gevers’s (2015) and Huber et al.’s (2016) results can indicate that the SNARC and SPoARC effects are mutually exclusive. So why did the authors conclude the converse: both effects coexisted, and therefore were not mutually exclusive? This was because both effects can indeed coexist in the same experiment but be mutually exclusive at a trial level,Footnote 9 so that during some trials, participants would activate the random sequence (which generates the SPoARC), and during other trials they would activate the canonical 1–10 order (which generates the SNARC). This could explain why no interaction was detected. This interpretation could be tested by looking at the reaction time distribution for trials in which the SPoARC and SNARC effects pulled in different directions. For example, if the memoranda were “9 5 7 1 6” and the 9 had to be classified, it should be linked to the left according to the SPoARC effect and to the right according to SNARC. Since there is no interaction between the two effects, we suggest that in some trials, 9 would be processed as belonging to the sequence and be linked to the left, but in other trials 9 would be processed as belonging to the canonical 1–10 order, and therefore linked to the right. Therefore, a bimodal distribution should appear for these kind of trials.

Conclusion and future perspectives

The purpose of this article was to show that an alternative to the dual account and the WM account, based on nonslotted (e.g., chunks) and slotted (e.g., templates) schemas was possible to account for the SNARC and SPoARC effects. In this concluding section, we would like to widen the lens by presenting the broader implications of our framework when considering knowledge structures, especially for WM.

We believe that one important purpose of these spatial knowledge structures is to enable individuals to organize and stabilize their thoughts in verbal WM. And we believe that people do so mainly in a horizontal, left-to-right fashion because of their writing/reading expertise. Usually the capacity to stabilize thoughts in face of interference in WM is linked to attentional control (or cognitive control, executive control, central executive, controlled attention, executive functioning, depending on the theory; see Chow & Conway, 2015; Engle, 2002; Kane, Bleckley, Conway, & Engle, 2001; Kane & Engle, 2003). Here we do not want to replace this explanation (for a discussion see Guida, van Dijck, & Abrahamse, 2017) but just stress the fact that knowledge structures, and especially slotted schemas, are important too, because they allow to reduce our need for attentional control. Indeed, we propose, as in Norman and Shallice’s (1986) supervisory attentional system, that learned spatial schemas (or schemata, in Norman and Shallice’s theory) enable people to be less dependent on attentional resources. Recent results by Mäntylä and colleagues (e.g., Mäntylä, 2013; Mäntylä, Coni, Kubik, Todorov, & Del Missier, 2017; Todorov, Del Missier, Konke, & Mäntylä, 2015; Todorov, Kubik, Carelli, Del Missier, & Mäntylä, 2018) seem compatible with this idea. They showed that spatial abilities contribute to multitasking, and that when spatial coding was impeded, multitasking performance dropped. The results suggest that cognitive control demands can be offloaded by relying on spatial cognitive structures and processes.

Therefore, the involvement of attentional control could be an inverse function of expertise and knowledge. If reading/writing has been practiced for a long period of years, it should have generated the capacity to use this horizontal, left-to-right schema with less attentional control than if less practiced. Or said otherwise, if an illiterate individual wants to achieve the same kind of spatialization as an experienced reader, he will have to use a lot of attentional control to stabilize the information in a horizontal, left-to-right fashion.

Obviously (and interestingly), the picture gets more complex if one considers that attentional control is certainly important to first acquiring the knowledge of schemas, and subsequently to increasing expertise. Several studies (e.g., Engle, Carullo, & Collins, 1991; Gathercole & Baddeley, 2014; Gathercole, Brown, & Pickering, 2003) have shown a link between language acquisition and WM (used here as an approximation of attentional control). In other words, attention control could be important to acquire knowledge; however, once acquired, knowledge could enable less attentional control to be used, as occurs with chunking in experts (Guida et al., 2012). As a consequence, high/low WM span differences could partly be due to the capacity to use schemas through knowledge and expertise in a way similar to that proposed by Ericsson and Kintsch (1995). This can be exemplified by mind wandering. As was shown by Kane et al. (2007), individuals with low WM span seem to “suffer” more from mind-wandering than do high-WM-span individuals. This seems to be due to their lack of cognitive control. Our idea would be that part of the capacity of individuals with high WM span to control their thoughts may come from the capacity to use well-practiced schemas (knowledge), such as an horizontal, left-to-right spatial schema, to stabilize their thoughts more easily. Therefore, the disadvantage of individuals with low WM span may come from both their lack of attentional control and lack of cognitive stabilizing structures.

Finally, we conclude this section by stating three principles that derive from our account. The first is the knowledge principle, which states that an average adult individual always uses knowledge to process and remember information; in fact, we cannot not use knowledge. This principle highlights an idea shared with many WM models (e.g., Acheson & MacDonald, 2009a, 2009b; Burgess & Hitch, 1999; Majerus, 2008; Martin & Saffran, 1992) that WM theories should not be built around the idea that we have empty stores that can be filled, but instead on the idea that the capacity to store information is a consequence of knowledge structures we build and use to store. Second is the expertise principle, which states that in the domains in which we have more expertise, we are able to use more appropriate knowledge structures, and therefore to increase our capacity to remember information. Third and last is the development principle: We develop knowledge structures based on our experience with the world, and we use these structures to remember. The SNARC and the SPoARC effects are two examples of how we have developed apt cognitive structures.

Author note

G.C. was granted a Visiting Fellow (Research) Grant by the School of Arts and Humanities at Edith Cowan University in order to invite A.G.. A.G. thanks the section 16 of the Conseil National des Universités for awarding him a 6 month sabbatical (C.R.C.T.). The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflicts of interest.