Working memory (WM) is defined as the ability to simultaneously retain and manipulate information within short time periods (Baddeley, 1992, 2003). The number of items that can be explicitly accessed and manipulated (i.e., WM capacity) is extremely limited and poses a strict bottleneck to human cognition (Cowan, 2001). Indeed, WM capacity is strongly correlated with fluid intelligence (Engle, Laughlin, Tuholski, & Conway, 1999; Süß, Oberauer, Wittmann, Wilhelm, & Schulze, 2002) and with academic achievement (Baddeley, 1992; Bayliss, Jarrold, Baddeley, & Gunn, 2005; Hitch, Towse, & Hutton, 2001; Swanson, 2004). One of the most studied WM tasks is the n-back task (e.g., Jaeggi, Buschkuehl, Jonides, & Perrig, 2008), in which participants are presented with a sequence of stimuli, one at a time, and are asked to respond when a stimulus repeats the one presented exactly n steps earlier. This task requires holding the last n items, plus the new item, in WM. When each stimulus is presented, participants must compare it to their predicted target stimulus (the item presented n intervals earlier), respond if there is a match (target), and then update their WM representation to form a prediction for the next target stimulus. Since performance in this task is highly correlated with general intelligence scores, even compared with other WM tasks (Jaeggi, Buschkuehl, Perrig, & Meier, 2010), it has become a common task for training aimed at generally enhancing WM and fluid intelligence (e.g., Au et al., 2015; Redick, 2019; Schwaighofer, Fischer, & Bühner, 2015).

Training WM unequivocally yields improvement in the trained task, but the generalization of this benefit has been heatedly debated (e.g., Redick, 2019). Some meta-analyses and systematic reviews supported the existence of significant transfer (Au, Gibson, Bunarjo, Buschkuehl, & Jaeggi, 2020; Karbach & Verhaeghen, 2014). Yet others found no transfer to untrained tasks, or, at best, minimal transfer to very similar tasks (Au et al., 2015; Melby-Lervåg & Hulme, 2013; Melby-Lervåg, Redick, & Hulme, 2016; Redick, 2019; Soveri, Antfolk, Karlsson, Salo, & Laine, 2017). Gathercole, Dunning, Holmes, and Norris (2019) concluded that reliable transfer of WM training occurs only when the new task is very similar in structure (“near”) to the trained task and requires similar cognitive routines.

Others (e.g., Jacoby & Ahissar, 2013, 2015; Melby-Lervåg et al., 2016; Redick, 2019; Sala & Gobet, 2017; Simons et al., 2016) noted that “far” transfer is more characteristic of studies without an active-control group. In these studies, a no-contact control group that did not practice any task was included, either as the only control (e.g., Jaeggi et al., 2008) or as an additional control group whose inclusion was crucial for attaining a significant transfer effect (e.g., Anguera et al., 2013). The no-contact group is not given monetary (or equivalent) rewards or stimulating personal attention, both of which positively impact performance. Therefore, differences in transfer may stem from the mere existence of a training protocol rather than from the specific training protocol of the experimental group (Foroughi, Monfort, Paczynski, McKnight, & Greenwood, 2016; Melby-Lervåg & Hulme, 2013; Shipstead, Redick, & Engle, 2012). Studies that used active control groups (trained with a similarly demanding task and a similar reward protocol) typically found either small near-only transfer (Linares et al., 2019) or no transfer at all (Jakoby, Raviv, Jaffe-Dax, Lieder, & Ahissar, 2019).

The magnitude of transfer, when reported, is usually small and difficult to dissociate from a null result, since it does not survive correction for multiple comparisons. Typically, several tasks are assessed before and after training, and performance in most untrained tested tasks does not improve following training (Barnett & Ceci, 2002; Shipstead et al., 2012). Given that testing several tasks increases the probability of a false positive, the significance criterion should be made more stringent (reviewed in Jacoby & Ahissar, 2013, 2015). However, the effect size of transfer to untrained tasks, if any, is small: ~0.3 standard deviations in methodologically weaker studies, ~0.01 in methodologically sound studies (Melby-Lervåg et al., 2016; Redick, 2019). Since the typical size of trained groups is also small (~15 per group; Chooi & Thompson, 2012; De Simoni & von Bastian, 2018; Gibson et al., 2012; Redick et al., 2013; Thompson et al., 2013), applying such a correction would have rendered the reported transfer nonsignificant (e.g., Anguera et al., 2013).
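
To make the multiple-comparisons argument concrete, consider a hypothetical transfer battery of ten outcome tasks (an illustrative number, not drawn from any particular study), each tested at α = .05. The probability of at least one false positive across the battery is then 1 − (1 − .05)^10 ≈ .40, so keeping the family-wise error rate near 5% would require a Bonferroni-corrected criterion of roughly .05/10 = .005, a level that small samples with effect sizes around 0.3 standard deviations have little power to reach.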

The combination of the huge effort required to conduct intensive training studies and the small (if any) generalization to untrained, dissimilar conditions highlights the importance of understanding the cognitive mechanisms underlying training-induced behavioral improvement. Remarkably, these mechanisms have hardly been addressed. Deciphering these processes was the aim of the current study, with a specific focus on n-back training because it is the most commonly trained task. We asked what participants learn that enables their substantial improvement in a challenging updating task designed to require limited-capacity online manipulations. The few recent studies that addressed this question suggest that the use of a task-specific strategy may facilitate training-induced improvement (Fellman et al., 2020; Laine, Fellman, Waris, & Nyman, 2018; Linares et al., 2019; Redick et al., 2013). Indeed, the importance of a strategy that reduces WM requirements has been gradually acknowledged (Redick, 2019). But can the use of an efficient strategy explain the entire learning process?

We began this study by practicing the n-back task ourselves and discussing our accumulated introspections about what facilitated our improved performance. These discussions clarified to us the strategy that we each had independently discovered. We measured whether participants who were explicitly taught this strategy could quickly reach the level of performance attained by those who went through intensive training without strategy instructions. We then assessed, based on the self-reports of participants who had trained extensively with no instructions, whether their improvement was associated with the discovery of an efficient (perhaps the same) strategy.

Method

The naïve strategy of n-updates versus the efficient strategy of 1-update

Naïve participants can typically perform well with n = 1 and n = 2, but find n ≥ 3 extremely challenging. The reason the task becomes difficult with n ≥ 3 is that participants need to update the content of n positions (slots) in WM following each item presentation. Figures 1 and 2 illustrate this naïve n-updates strategy (presented in the central column) for two types of n-back tasks: letters (Fig. 1) and spatial positions (Fig. 2). With this strategy, the last n items are always stored in WM in the order of their presentation. When a new stimulus is presented, it is compared with the oldest item (presented n stimuli earlier). Participants are asked to press a button if they recognize the match, that is, a stimulus repetition with an interval of n (denoted in yellow in Figs. 1 and 2). After each comparison, participants need to update all WM slots: all n items, plus the new one, are “pushed” one position backward (leftward in Fig. 1), so that the most “recent” position holds the just-presented item and the “oldest” position holds the target of the next stimulus presentation. For example, when the items are letters, n = 3, the representation in WM is D, S, R, and the next letter is B (see Fig. 1, center, line 3), this B is compared with the content of the slot holding the oldest item in WM, D (center, enclosed letter), and is then added to WM in the most recent slot, following R. The content of the occupied slots in WM is then updated (shifted backwards), so that D is dropped and the shifted three-letter representation S, R, B is retained. Thus, the naïve strategy requires updating the content of all WM slots: a shift of all n items in memory upon each stimulus presentation.
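
To make this bookkeeping concrete, the sketch below (a minimal illustration in Python; the function and variable names are ours and are not taken from the study materials) implements the naïve n-updates strategy: the last n items are kept in presentation order, each new stimulus is compared with the oldest slot, and every slot is then shifted.

```python
# A minimal sketch of the naive n-updates strategy (illustration only;
# names are ours, not the authors').

def naive_n_back(stimuli, n):
    """Yield True whenever the current stimulus matches the one shown n steps back."""
    memory = []                      # the last n items, stored oldest-first
    for item in stimuli:
        if len(memory) == n:
            yield item == memory[0]  # compare with the oldest slot (the expected target)
            memory = memory[1:]      # shift the content of ALL slots one position back
        else:
            yield False              # fewer than n earlier items: cannot be a target
        memory.append(item)          # place the new item in the most recent slot

# Hypothetical letter sequence, n = 3: responses are required on the last two
# letters, which repeat the letters shown three steps earlier.
print(list(naive_n_back("DSRBNRB", 3)))
```

The essential cost sits in the shift step: every stimulus presentation changes the content of all n slots.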

Fig. 1

An illustration of the two strategies for n-back with letters: naïve n-updates (center) and an efficient 1-update (right), similar to that used by Laine et al. (2018), n = 3. The sequence of letters is shown in the left column. Each horizontal triplet of letters represents the information stored in WM during that trial before the letter on the left is presented (after the letter above was presented). The most recent letter in each trial is denoted in red. The slot storing the content that is being compared with the incoming letter is highlighted with a bold frame. Target stimuli repeated with an interval of 3 are highlighted in yellow. The naïve strategy stores the letters in WM in the order of their presentation, and each new letter is compared with the letter that is stored in the earliest memory slot. After each comparison, all three letters that are stored in WM are shifted one slot back (the earliest letter is discarded), and the new letter is inserted into the latest WM slot. Therefore, each step requires updating the content of three slots, as in a first-in, first-out queue. By contrast, in the efficient 1-update strategy, only the attended slot is updated following each new stimulus, regardless of n. The attended position is shifted every step, but there is no change in the content of unattended positions. (Color figure online)

Fig. 2

An illustration of two strategies for spatial n-back: naïve n-updates (center) and efficient 1-update (right), n = 3. The sequence of stimuli is presented in the left column. Each triplet of circles represents the information stored in WM during a trial, when a new circle is presented. Faded red circles represent the location of the oldest stimulus in WM, soon to be deleted. The arrows represent updates of locations in WM. Repetitions with n = 3 (targets) are highlighted in yellow. In the naïve strategy, locations are stored in WM in the order of their presentation. The “oldest” item (presented n intervals earlier) is compared with the newly presented item, and all three slots in WM are updated, each with the content of a more recent slot, as in a first-in, first-out queue. In the efficient 1-update strategy, only one WM slot is compared and updated. What changes is the attended (and updated) slot in WM. Increasing n (see Fig. 5) lengthens the tracked loop (the number of retained positions), but does not increase the number of updates per stimulus presentation. (Color figure online)

By contrast, the strategy described by Laine et al. (2018) for letters (see Fig. 1, right column) and its parallel for the spatial task, which we derived (see Fig. 2, right column), involves no shifts in the WM representation. Rather than shifting the items across WM slots, it shifts the slot to which attention is allocated within the given WM representation. The shift in the attended slot in WM does not put load on WM (Myers, Chekroud, Stokes, & Nobre, 2018). Crucially, in each trial only the item in the attended slot is updated (if it differs from the expected target). Thus, the strategy requires, at most, updating the content of one WM slot (compared with n slots in the naïve strategy), keeping track of which position is currently relevant, and an attention shift, which does not occupy additional WM resources (Myers et al., 2018). For example, for letters and n = 3 (see Fig. 1, right), when the representation in WM is D, S, R, the new letter is B, and the attended position is the first (line 3), only this position is updated, so the new representation in WM will be B, S, R. When the next stimulus is presented (line 4), attention is shifted to the second position, and the new stimulus is compared with S, which will be updated if there is no match. Thus, if the new item is N, the updated WM sequence will be B, N, R. Next, the third position will be attended, and then attention returns to the first (when n = 4, this loop has four positions, as illustrated in Fig. 5).
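
For comparison, a sketch of the 1-update bookkeeping is given below (again a minimal illustration with names of our own choosing, not the authors' materials): the n items sit in fixed slots, and only an attention index loops over the positions; each step compares, and at most overwrites, a single slot.

```python
# A minimal sketch of the efficient 1-update strategy (illustration only).

def one_update_n_back(stimuli, n):
    """Yield True whenever the current stimulus matches the one shown n steps back."""
    slots = [None] * n                     # fixed slots; their content is never shifted
    attended = 0                           # index of the single attended slot (loops 0..n-1)
    for step, item in enumerate(stimuli):
        if step >= n:
            yield item == slots[attended]  # compare only with the attended slot
        else:
            yield False
        slots[attended] = item             # update at most this one slot (unchanged on a match)
        attended = (attended + 1) % n      # shift attention, not content

# The same hypothetical sequence as above yields the same responses,
# but each step modifies a single slot instead of all n.
print(list(one_update_n_back("DSRBNRB", 3)))
```

Both sketches produce identical responses; the difference lies only in how many WM slots change content per step, which is exactly the load the efficient strategy reduces.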

The difference in the amount of updating required by each strategy can easily be seen by examining the similarity between consecutive WM representations, shown in consecutive lines in Fig. 1. One can also hear the difference by voicing the preupdating and postupdating sequences (the content of consecutive steps): D, S, R is much more similar to B, S, R (efficient 1-update strategy) than to S, R, B (naïve strategy), since the content of only one slot is modified in the former, as opposed to all three slots in the latter.

The above description focuses on letters. We now extend the same conceptual strategy to other stimuli (though the analogy may not be transparent to participants). When the task is spatial (see Fig. 2), the spatial locations of the stimuli need to be retained in WM, and the same n-updates versus 1-update contrast applies. In the naïve strategy, participants consistently compare the new item with the item in the first (oldest) slot, and then update the entire set of slots by pushing them backwards and removing the oldest item from memory (as illustrated in Fig. 2, center). In the efficient strategy, only one slot is updated. This slot, the attended and updated one, changes with the presentation of each stimulus, in a loop of length n (for n items). Here too, efficiency results from solving the task with the 1-update strategy (only one WM slot is updated in each step of Fig. 2, right) and keeping track of which item needs to be attended next. As with letters, switching from updating all slots to updating only the attended slot, with the index looping over the number of items (first-second-third-first), reduces the WM resources required for attaining the same level of success.

In this study, we chose to use a spatial n-back task; in the past, we had trained a group of participants with this task, with no explicit strategy instructions, for 40 sessions (Jakoby et al., 2019). Most of those participants improved significantly in this task, but showed no transfer to other WM tasks. We now asked what these participants had actually learned during this training, and whether a similar degree of improvement could be gained in less time if participants were explicitly taught the efficient 1-update strategy.

Experimental design and participants

In this paper, we compared the data of the following two groups:

  1. The strategy-instruction group (N = 14), who received three training sessions: a naïve session with no strategy instructions, followed by two additional sessions. At the beginning of each of these two sessions, participants watched a detailed 8-minute video clip with strategy instructions in Hebrew (the English version of this video clip can be found here [https://youtu.be/-21tuZQNMMQ]). An experimenter then showed the participant a sequence of six stimuli and asked them to describe the representation in memory on Steps 4–6 with n = 3. The criterion for understanding was strategy-correct answers on all three steps, and it was met by all participants. Participants were then asked to perform the task according to the strategy presented in the video clip. The interval between consecutive sessions was 1–8 days. Participants were told that the aim of the study was to assess how using this specific strategy affects their performance of the task. Data for this group were collected specifically for this study.

  2. The no-instruction group, who trained for 40 sessions with no explicit strategy instructions (five sessions a week for 2 months). The data of this group have been published previously, in a study that assessed transfer to other WM tasks and found none (Jakoby et al., 2019). Participants were told that the aim of the study was to assess how training on a task improves their performance in the trained task and in other memory-challenging tasks.

Both groups answered the questionnaires detailed below, in which they described the strategy they had used to perform the task.

The choice of 14 participants in the strategy-instruction group was intended to match the number of participants who had previously formed the no-instruction group. In the context of this study, the relevant effect size is the magnitude of improvement in the trained task. Since this improvement was larger than three standard deviations (Jakoby et al., 2019), 14 participants per group were sufficient in both the previously studied group (no-instruction) and the newly added one (strategy-instruction). A statistical power analysis based on the results of the first and last sessions of the no-instruction group (Jakoby et al., 2019) shows that at least 10 participants per group are needed to attain a power of 0.9 with α = .01 (Table 1).
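
This kind of calculation can be reproduced with standard tools; the sketch below uses statsmodels, with the effect size left as a placeholder because the paper derives it from the first- versus last-session results of Jakoby et al. (2019). A within-participant (paired) comparison would use TTestPower instead of TTestIndPower.

```python
# A sketch of the sample-size calculation (illustration only).
# placeholder_d is an ASSUMED standardized effect size, not a value from the paper.
import math
from statsmodels.stats.power import TTestIndPower

placeholder_d = 1.8                        # assumed first-vs-last-session effect size
analysis = TTestIndPower()                 # independent two-sample t test
n_per_group = analysis.solve_power(effect_size=placeholder_d,
                                   alpha=0.01,
                                   power=0.9,
                                   alternative='two-sided')
print(math.ceil(n_per_group))              # minimum participants per group for these settings
```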

Table 1 Demographics of both groups, mean ± SD; the data of the no-instruction group have been published previously (Jakoby et al., 2019)

All participants received monetary compensation or course credit for their participation (for a detailed description of the monetary compensation of the no-instruction group, see Jakoby et al., 2019). The data of one participant from the no-instruction group were excluded from the analysis (data of 14 participants are reported) because her performance on the task before training was an extreme outlier (z score above 2.5 in each session). Her first self-report indicated that she had discovered the efficient strategy, presumably early in the first session. Importantly, all the reported results remain statistically significant when this participant is included.

Spatial n-back task

Both groups were administered the same spatial n-back protocol (Jakoby et al., 2019). In this protocol, red circles are presented sequentially, one circle every 2 seconds (stimulus duration 500 ms; interstimulus interval 1,500 ms), in one of eight positions on a virtual rectangle on a computer screen. Participants respond by pressing the space bar with their index finger whenever the location of a newly presented circle matches the location of the circle presented n steps back (target). No response is required for nontargets. Participants are notified of the relevant n at the beginning of each block. Each block comprises n + 20 steps (stimuli) and includes six targets. Particularly confusing stimuli are lures: repetitions with an interval slightly different from n, that is, a circle appears at a previous position (repetition) but with an interval of (n − 1) or (n + 1), as illustrated in Fig. 3. Differentiating lures from targets is difficult; participants tend to press the button upon detecting a repetition, even at a different interval (Duncan, 2003). In our experiment, we included three possible levels of lure difficulty: easiest (no lures), intermediate (four lures per block, two of each type), and most difficult (eight lures per block, four of each type). We included lures because they have previously been shown to increase WM load and the demands on cognitive control (e.g., Redick & Lindsey, 2013; Szmalec, Verbruggen, & Kemps, 2011). Each block’s level of difficulty was determined as follows: If the participant’s performance was 85% correct or above (calculated as hit rate minus false-alarm rate), the difficulty of the following block was increased by adding four more lures. Once a block contained eight lures, reaching the 85% accuracy criterion increased n by one. When performance was 65% correct or below, the number of lures was decreased stepwise from eight to four to zero, and eventually n was decreased by one (and the next block, with the smaller n, included eight lures). Difficulty level was not modified otherwise. Each session lasted ~30 min and consisted of 25 blocks with short breaks between them. The first two sessions began with n = 2 and four lures per block for all participants. Subsequent sessions began for each participant at the difficulty level they had reached during the last block of the previous session. The same protocol was administered to both groups.
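
As a compact summary of this adaptive rule, the sketch below (an illustration only, not the authors' task code) implements the block-to-block difficulty update; the lure level after an increase in n is not specified in the text and is flagged as an assumption.

```python
# A minimal sketch of the adaptive difficulty rule (illustration only).

def update_difficulty(n, lures, score):
    """Return (n, lures) for the next block, given this block's score.

    score = hit rate minus false-alarm rate; lures is 0, 4, or 8 per block.
    """
    if score >= 0.85:            # criterion for increasing difficulty
        if lures < 8:
            lures += 4           # first add lures (0 -> 4 -> 8)
        else:
            n += 1               # at eight lures, increase n by one
            lures = 0            # ASSUMPTION: the lure level after an n increase
                                 # is not specified in the text
    elif score <= 0.65:          # criterion for decreasing difficulty
        if lures > 0:
            lures -= 4           # first remove lures (8 -> 4 -> 0)
        elif n > 1:
            n -= 1               # then decrease n by one ...
            lures = 8            # ... and the next block includes eight lures
    return n, lures              # between 65% and 85%: difficulty unchanged

# Example: a block at n = 2 with four lures, scored at 90% correct.
print(update_difficulty(2, 4, 0.90))   # -> (2, 8)
```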

Fig. 3

An illustration of five consecutive steps in a block of the spatial n-back task, n = 3

Questionnaires

Both groups answered questionnaires regarding the strategies they had used to perform the task. In the strategy-instruction group, participants filled out questionnaires only at the end of the third session. First, they were asked to describe their strategy in their own words (i.e., explain what they had done and evaluate the efficiency of their strategy). Then, they were presented with illustrations of two strategies—the naïve n-updates strategy and the efficient 1-update strategy—and were asked to state which was closer to their own strategy (if any). This questionnaire had two goals: (1) to make sure that the participants in the strategy-instruction group had indeed used the explicitly taught strategy; (2) to see whether other methods were developed and used by the participants.

In the no-instruction group, each participant answered a questionnaire at the end of each week of training (five training sessions). The same questionnaire was administered every week. The questionnaire included two open questions regarding strategy use (“In general, could you describe your strategy for performing the training task?” and “Is it a different strategy from the one that you used in last week’s training?”). The questions were open-ended and nonspecific so that no particular strategy would be implied and no guidance would be inadvertently provided. The answers to all questionnaires were read and analyzed only after the experiment had ended, so that participants would not be affected by the experimenters’ expectations. To decide which strategy had been used and whether it was modified with training, we asked four independent reviewers, who were familiar with the task yet blind to participants’ performance, to evaluate, based on each participant’s weekly reply, whether she or he had used the efficient 1-update strategy and, if so, from which week onward.

Results

Improvement was substantially faster in the strategy-instruction group

Initial performance without instructions (performance during the first session, measured by mean n per session) did not differ between groups (strategy-instruction group: mean n = 2.65, SD = .37, 95% confidence interval (CI) [2.46, 2.84]; no-instruction group: mean n = 2.43, SD = .44, 95% CI [2.2, 2.66]; p = .38, in a two-tailed, two-sample unequal variance t test, Cohen’s d = .4). The second session began with an instructional video clip for the strategy-instruction group, and with no specific instructions in the no-instruction group. Afterwards, both groups performed the same task, with the same adaptive protocol (see Method section).

Mean performance in the second session significantly differed between the two groups, with the n of the strategy-instruction group (mean n = 3.25, SD = .43, 95% CI [3.02, 3.48]) being significantly higher than that of the no-instruction group (mean n = 2.56, SD = .58, 95% CI [2.36, 2.82]; p = .002, Cohen’s d = 1.29), as shown in Fig. 4.

Fig. 4

Performance as a function of session number in both groups. Mean n (~2.5) and error bars (95% CI) were similar for both groups during the first session. The task is adaptive, meaning that mean n increases as participants’ performance improves. Though both groups improved, the improvement rate was much faster in the strategy-instruction group. The mean n in the third session of the strategy-instruction group was similar to that attained by the no-instruction group only after 25–40 sessions. The no-instruction group showed greater cross-participant variability as training progressed, revealing increased variability in learning rate. A detailed analysis of individual learning variability is presented in Fig. 6

In the third session, performance was very different between the groups (strategy-instruction: mean n = 4.3, SD = .66, 95% CI [3.95, 4.65]; no-instruction: mean n = 2.8, SD = .6, 95% CI [2.49, 3.11]; p < .00001, Cohen’s d = 2.36). In fact, within three sessions, performance of the strategy-instruction group reached the level attained by the no-instruction group only after 25–40 sessions, and did not significantly differ from the no-instruction group’s final performance in the 40th session (strategy-instruction group, third session: mean n = 4.3, SD = .66, 95% CI [3.95, 4.65]; no-instruction group, 40th session: mean n = 3.96, SD = 1.13, 95% CI [3.37, 4.55]; p = .46, Cohen’s d = .35).

A repeated-measures two-way analysis of variance (ANOVA) on the first three sessions (2 groups × 3 sessions) showed a significant main effect of session, F(2) = 62.99, p < .0001, ηp² = .83, indicating general improvement as participants completed more sessions; a significant main effect of group, F(1) = 17.47, p < .0001, ηp² = .4, indicating different performance levels in the two groups, with the strategy-instruction group performing significantly better overall; and, crucially, a significant interaction between session and group, F(2) = 26.7, p < .0001, ηp² = .68, indicating faster improvement in the strategy-instruction group.

Figure 4 plots mean performance (mean n per session) as a function of session number in the two groups. The initial and end points are similar, but the strategy-instruction group improved much faster. Cross-participant variability was similar for both groups in the first session (strategy-instruction: SD = .37; no-instruction: SD = .44), and it increased for both groups during training, with a greater increase in the no-instruction group (strategy-instruction: SD = .66; no-instruction: SD = 1.13). This pattern results from the substantially different rates of improvement across participants, particularly when no explicit instructions are given. Large cross-participant variability was also observed in previous (no-instruction) training studies (e.g., Jaeggi, Buschkuehl, Jonides, & Shah, 2011). Previously, this variability was attributed to the extent to which general WM capacity increased. However, Figs. 4 and 6 show that this cross-participant variability results from different success rates in discovering the efficient task-specific 1-update strategy.

Fig. 5

An illustration of the 1-update strategy for n = 4, based on participants’ reports. Arrows represent the updated location; faded red circles represent the attended (oldest) position in WM, soon to be forgotten; a trial with a match (target) is marked in yellow. (Color figure online)

Improvement in the no-instruction group is associated with the discovery of the efficient strategy

Participants in the strategy-instruction group indicated in the questionnaires that they all understood and used the instructed strategy during the two postinstruction sessions.

Analyzing the self-reports of participants in the no-instruction group was complex, because the verbal reports of eight (out of 14) participants were too vague to determine or rule out any specific strategy. However, six (out of 14) participants explicitly indicated that they had used the efficient 1-update strategy starting from a particular training week (see Fig. 6). For example: “As the dots appear on the screen, I number them in my head. I do counts of four [n = 4], meaning that the fifth dot that appears is given the number one. If the dot appears in the same location as the original number one, I click the space bar; if not, I memorize ‘number 1’s’ new location. I do the same thing with the other dots that appear, constantly memorizing new locations and keeping count at the same time.” Figure 5 illustrates how this account maps directly onto the implementation of the efficient 1-update strategy with n = 4.

Fig. 6

Individual learning trends. a Individual learning curves of training participants as a function of training weeks. The first point of each plot represents the number of weeks before any use of the efficient strategy. Week 0 is the week in which the participant explicitly reported using the strategy for the first time. Individuals who begin at −8 are those who did not report the use of the efficient strategy throughout their 8 training weeks (indicated by gray lines). Black lines represent participants who unequivocally reported the use of the strategy at some point. Performance in each week is denoted by the mean level of n achieved by the participant in that week. Explicit reports of the strategy are associated with a sharp rise in the performance curve. b Individual gains in mean-session n during training (the difference between the last and first sessions) for participants in both groups. Reference lines represent group means and 95% CIs. Participants in the no-instruction group are divided into two subgroups: those who unequivocally discovered and used the efficient strategy, and those who did not report the strategy. Changes in n of individuals in the no-instruction group range between −0.376 and 2.72, whereas in the strategy-instruction group all changes are positive. Importantly, within the no-instruction group, all individuals whose reports indicate a discovery of the efficient strategy improved more than all individuals who did not report such a discovery

Figure 6a plots performance as a function of training week for each of the 14 participants of the no-instruction group. Performance is plotted with respect to “Week 0”—the week in which they discovered the efficient strategy, as evident from their self-report questionnaire (submitted at the end of every training week). As described above, the plots of six participants either started at or crossed Week 0. Four of them discovered this strategy within their first training week—that is, within their first five sessions (and hence their plots begin at “0”)—and the other two discovered it later, one in the third week and the other in the fourth week. Their slopes abruptly rise following this discovery. Plots of participants who did not report discovering a strategy during their 8 training weeks begin at “−8”, indicating that they trained 8 weeks without discovering the efficient strategy.

Figure 6b shows the individual gains in n between the first and last sessions in the strategy-instruction group and in two subgroups of the no-instruction group: the six participants who explicitly deciphered the efficient strategy, and the eight who did not. All participants in the strategy-instruction group improved, though not to the same extent. In the no-instruction group, improvement differed significantly between those who explicitly deciphered the efficient strategy and those who did not (p = .00067 in a Mann–Whitney U test), with no overlap between the degrees of improvement of individuals in the two subgroups. Thus, the high cross-participant variability in the no-instruction group (see Fig. 4) is largely explained by the difference in improvement between those who discovered the efficient strategy, who improved substantially, and those who did not, whose improvement ranged from small to none at all.

Discussion and conclusions

Our results indicate that training-induced improvement in the n-back task can be fully explained by the discovery of a task-specific strategy. Hence, improvement does not indicate a general enhancement of WM capacity. This account explains previous reports of no transfer, or of only very near transfer to other variants of the n-back task (e.g., Jakoby et al., 2019; Linares et al., 2019). Based on our results, we predict that performance in untrained tasks will improve only when the discovered efficient strategy applies and its relevance is transparent to the participants. Thus, performance in tasks with the same structure may improve (Fellman et al., 2020; Redick, 2019), as previously reported (Linares et al., 2019). However, performance in most other WM tasks, and even in n-back tasks for which the strategy is difficult to implement (Jakoby et al., 2019), is not expected to benefit from the discovery of this strategy. This account is in line with the proposal of Gathercole et al. (2019), who claim that training-induced transfer occurs only when participants have acquired a new complex cognitive skill during training, and when that skill can be applied to a novel task.

Though, to the best of our understanding, the same efficient strategy recurred across participants, we do not claim that it is the only possible strategy. Yet, theoretically, we expect all efficient strategies across WM tasks to have something in common. In fact, as mentioned above, our strategy is the same as, or very similar to, the strategy described previously for n-back with letters (Laine et al., 2018) and with digits (Fellman et al., 2020) in studies characterizing the course of training with and without an explicit strategy. We assert that all efficient strategies reduce the number of manipulations in WM per trial compared with the naïve strategy. Importantly, there is no need to reduce the total number of operations per trial, only the operations that put load on WM. For example, scanning through items without changing their slot in WM does not add to WM load (Myers et al., 2018).

This training study provides further support for the strategy mediation theory of WM improvement (Dunning & Holmes, 2014; Peng & Fuchs, 2017) over the capacity theory (Engle & Kane, 2003). The strategy mediation theory assumes that WM has a relatively fixed capacity, and therefore claims that WM performance is determined by the efficiency with which this capacity is used (Bailey, Dunlosky, & Kane, 2008; McNamara & Scott, 2001). The capacity theory assumes that capacity can increase with practice, and is often described with a muscle-like metaphor: efficient training strengthens WM by increasing its capacity (described by Peng & Fuchs, 2017).

Yet most of the literature on strategy mediation theory does not specify the strategies that would be efficient for WM tasks. Descriptions of rehearsal or chunking (McNamara & Scott, 2001; Peng & Fuchs, 2017; Turley-Ames & Whitfield, 2003) do not capture the unique structure of WM tasks, which are designed to require online manipulations that cannot be organized into fixed chunks, or at least not in any straightforward manner. Our study adds to recent studies (Fellman et al., 2020; Laine et al., 2018) in specifying an efficient strategy for the case of the spatial n-back task. Unlike n-back tasks that use items that can easily be named, such as letters or digits, spatial tasks cannot benefit from subvocal rehearsal of the items (see Chooi & Logie, 2020). Still, subvocal verbalization can help in keeping track of which of the n items in WM should be updated in each step.

The positive impact of strategy instructions has recently been studied. For example, Fellman et al. (2020) taught participants a specific strategy for the n-back task with digits via the Internet, and found that using this strategy facilitated initial learning. However, the advantage of explicit instruction, observed during the first few sessions, was partial: the strategy group continued to improve throughout the 12 training sessions, and the performance of the tutored and nontutored groups was similar after the first few training sessions. Thus, instruction was beneficial, but its impact was smaller than in our case, perhaps owing to the nature of Internet training or to the difference in task stimuli. Another potential contribution to the enhanced efficiency of instruction in our study, which yielded the equivalent of more than 25 uninstructed training sessions after only three sessions, was the effort we put into strategy clarity (presenting the instructions in a video clip [https://youtu.be/-21tuZQNMMQ]) and into verifying participants’ understanding. Additionally, we showed that the large individual differences in training-induced improvement within a no-strategy group distinguish the participants who discovered an efficient strategy spontaneously from those who did not.

One of the most interesting questions raised by our results is what differentiates people who develop an efficient strategy during training, even without explicit instructions, from those who do not. Studying this question systematically requires testing whether those who developed an efficient strategy for one task also tend to develop efficient strategies for other challenging tasks, which is beyond the scope of this study. There is some evidence that individuals with larger pretraining WM capacity are the ones who benefit more from training (Foster et al., 2017; Redick, 2019; Wiemers, Redick, & Morrison, 2019). There may be a link between initial WM capacity and the ability to quickly and efficiently adapt a strategy to a task, or there may be another cognitive trait underlying both. We did not find evidence for this in our group of participants: performance during the first session was not a good predictor of learning rate, though mean performance during the first session may already include some learning. Another suggestion in the literature is that action video game players are more likely to find efficient strategies, as their ability to learn is enhanced (Bejjanki et al., 2014; Green & Bavelier, 2014), perhaps due to enhanced attention and spatial cognition (Bediou et al., 2018). This claim has been challenged, and the findings regarding the advantages of strategic video games have been questioned (e.g., Roque & Boot, 2018; Sala, Tatlidil, & Gobet, 2018). We should note that even if action video game players do demonstrate better strategies, it is still not clear whether playing action video games is the cause or the result of this enhanced strategic ability (or both).

Finally, perhaps the most important conclusion of this study is its contribution to the growing body of recent research that supports the strategy account of WM training and indicates that the time has come to change the metaphors we use to describe WM training studies. Training WM does not widen a common bottleneck. Successfully trained individuals do not perform the same operations faster or better; they change the set of operations used to solve the task. This type of change is likely to underlie the acquisition of all expertise. When the same operations are applied to the same sequences of stimuli repeatedly, as in word reading, we replace the WM operations with chunking and schemas. But when the crux of the task is applying the operations to untrained stimulus sequences (as in reading nonwords), chunking cannot replace online computations. Hence, training-based improvement probably results from using a set of more efficient task-specific operations. A better understanding of these task-specific strategies may both teach us about the structure of WM and facilitate performance in tasks that heavily load our limited WM resources.