Contextual Adaptation of Cognitive Flexibility is driven by Task- and Item-Level Learning

Siqi-Liu, Audrey; Egner, Tobias

doi:10.3758/s13415-020-00801-9

Published: 03 June 2020

Volume 20, pages 757–782, (2020)

Cognitive, Affective, & Behavioral Neuroscience

Abstract

Adaptive behavior requires finding, and adjusting, an optimal tradeoff between focusing on a current task-set (cognitive stability) and updating that task-set when the environment changes (cognitive flexibility). Such dynamic adjustments of cognitive flexibility are observed in cued task-switching paradigms, where switch costs tend to decrease as the proportion of switch trials over blocks increases. However, the learning mechanisms underlying this phenomenon, here referred to as the list-wide proportion switch effect (LWPSE), are currently unknown. We addressed this question across four behavioral experiments. Experiment 1 replicated the basic LWPSE reported in previous studies. Having participants switch between three instead of two tasks, Experiment 2 demonstrated that the LWPSE is preserved even when the specific alternate task to switch to cannot be anticipated. Experiments 3a and 3b tested for the generalization of list-wide switch-readiness to an unbiased “transfer task,” presented equally often as switch and repeat trials, by intermixing the transfer task with biased tasks. Despite the list-wide bias, the LWPSE was only found for biased tasks, suggesting that the modulations of switch costs are task set and/or task stimulus (item)-specific. To evaluate these two possibilities, Experiment 4 employed biased versus unbiased stimuli within biased task sets and found switch-cost modulations for both stimuli sets. These results establish how people adapt their stability-flexibility tradeoff to different contexts. Specifically, our findings show that people learn to associate context-appropriate levels of switch readiness with switch-predictive cues, provided by task sets as well as specific task stimuli.

Introduction

Life in a changing environment frequently confronts us with cognitive conundrums. Chief among these is the so-called shielding-shifting dilemma (Goschke, 2013), which refers to two antagonistic challenges: 1) to accomplish our goals, we often need to strongly focus on a current task (requiring cognitive stability, the shielding of an ongoing task-set from distraction); and 2) we also need to remain sensitive to possible changes in our environment that might require us to quickly update our goals and cognitive strategies (requiring cognitive flexibility, the shifting of attention from one task set to another). Overly rigid goal-shielding may lead to negligence of crucial cues in the environment that should be prioritized; for example, a novice driver may neglect a light that changed from green to red, because they were fixated on the task of changing into the right turn lane. Conversely, an overly flexible processing mode may render the agent easily distractible when concentration is required, as when a driver takes their eyes off the road to glance at a new notification from their phone.

To meet this challenge, the brain needs to find—and continually adapt—a contextually optimal level of cognitive flexibility. Despite the central importance of this process to adaptive behavior (and survival), relatively little is known about the learning processes that underpin the ability to strategically match flexibility (or “switch readiness”) to changing contexts. The present study therefore sought to elucidate how people adjust their level of cognitive flexibility to suit changing task demands in the form of time-varying frequency (or likelihood) of having to switch tasks.

We begin with a brief literature review and some definitions of our theoretical assumptions and terminology. We investigated the topic of cognitive flexibility through the prism of cued switching between task sets (for reviews, see Kiesel et al., 2010; Monsell, 2003; Vandierendonck, Liefooghe, & Verbruggen, 2010; Koch et al., 2018). We define a task-set as a rule that specifies a set of task-relevant stimuli or stimulus features and their associated responses (Kiesel et al., 2010; Monsell, 2003; Rogers & Monsell, 1995). We assume that implementing a task-set involves the attentional selection of the relevant stimulus features and activation of their respective responses, and the shielding of these stimulus-response translations from potential interference by task-irrelevant information (Dreisbach & Haider, 2008; Dreisbach & Wenke, 2011; Meiran, 2010). In line with a large literature, we further assume that cued switching between task-sets (or task-set updating) requires 1) reconfiguration, that is, the active replacing of the previously active task-set with a new set of stimulus-response rules (Meiran, 1996; Rogers & Monsell, 1995; Monsell, 2003), and 2) the inhibition of, or resolution of interference from, the most recently active task set (overcoming “task set inertia”; Allport et al., 1994) and from other task sets previously associated with the stimulus set (Waszak, Hommel, & Allport, 2003). Together, these processes result in switch costs, that is, slower and less accurate responses when a task has to be switched from the previous trial than when it is repeated (Kiesel et al., 2010; Monsell, 2003; Vandierendonck, Liefooghe, & Verbruggen, 2010; Meiran, Chorev, & Sapir, 2000). The ease with which switch processes are carried out can be modulated by a number of factors, including the frequency with which tasks have to be switched over periods of time, which we investigated in the current study.

Finally, we assume that the size of the switch cost can be considered indicative of someone’s current level of cognitive stability (or flexibility) (Braem & Egner, 2018; Dreisbach & Fröber, 2019). This level can be conceived of as a set-point on a stability-flexibility continuum, which has been conceptualized by Goschke (2003, 2013) as a meta-control parameter termed the “updating threshold”: when this threshold is low, tasks can be switched more easily (flexibility is high), but this necessarily bears the cost of poor task-set shielding against interference (stability is low); when this threshold is high, switching is rendered more difficult (cognitive flexibility is low) but in turn the current task-set is well protected against interference (stability is high). In the present article, we will use the term “switch readiness,” which is inversely related to the updating threshold and task-set shielding. Moreover, we use these terms to denote different set-points on the stability-flexibility continuum (e.g., low vs. high switch-readiness), but we treat them as neutral with respect to the underlying processes that are modulated to produce changes in switch costs (e.g., reconfiguration vs. inhibition/interference resolution processes). We will speculate in the General Discussion on the most likely aspect of switch cost that is modulated by switch frequency manipulations, however.

In summary, successfully navigating the shifting-shielding dilemma can be conceptualized as learning to strategically adjust one’s updating threshold to suit changes in environmental demand for relatively more or less cognitive flexibility (Goschke, 2003). Importantly, behavioral evidence for these types of dynamic adjustments in switch readiness has been obtained in cued task-switching protocols that manipulate the frequency (and thus, likelihood) of switch trials between blocks of trials. Specifically, a number of studies have shown that the magnitude of switch costs tends to scale inversely with the frequency with which task switches occur within a given block of trials (Bonnin, Gaonac’h, & Bouquet, 2011; Dreisbach & Haider, 2006; Dreisbach, Haider, & Kluwe, 2002; Duthoo, De Baene, Wühr, & Notebaert, 2012; Mayr, 2006; Monsell & Mizon, 2006) or at a specific spatial location (Crump & Logan, 2010; Leboe et al., 2008). For instance, Monsell & Mizon’s (2006) Experiment 4 varied switch proportions from 25% to 50% to 75% between blocks of trials and observed the greatest switch costs at a switch frequency of 25% and the smallest switch costs at a switch frequency of 75%. We will refer to these block-based modulations of switch cost as the list-wide proportion switch effect (LWPSE), leaning on a similar nomenclature in the congruency effect literature (Bugg & Chanani, 2011; Bugg & Crump, 2012). While the above demonstrations of an LWPSE provide basic evidence that people can adapt their switch readiness to varying task statistics, the exact scope of this adaptation, as well as the particulars of the underlying learning processes, are presently not known.

In the present study, we ask in particular what kind of learning drives these effects, and we distinguish between three ways in which changes in updating threshold could become associated with features of low- versus high-frequency switch blocks: the list-wide level (producing sustained and generalizable changes in flexibility), the task-set level (where a particular level of switch readiness becomes associated with a specific task-set), and the item level (where a particular level of switch readiness becomes associated with specific task stimuli). To investigate the kinds of learning that drive the LWPSE, we ask several questions that have not been previously addressed in the literature: first, because previous studies that found these context-sensitive switch cost modulations only required that participants switch between two tasks, it is not clear to what degree the LWPSE reflects a generic change in cognitive flexibility or task-specific preparation processes. In other words, reduced switch costs in high proportion switch blocks could reflect participants preparing for the particular alternate task, rather than general preparation for a task switch (to any other task). Intuitively, the latter would stand as stronger evidence for a genuine adjustment of cognitive flexibility, because flexible engagement with a changing environment requires increased aptitude to respond to events that often are unexpected.

Second, it is not yet known to what extent the LWPSE is driven by associating switch readiness with the global switch likelihood of the current block context (list-wide learning) or by using the specific task-sets and/or task stimuli (also referred to as “items”) as cues for adjusting switch readiness. In prior studies, in high-switch frequency blocks, all tasks and all task stimuli were also presented more frequently as switch versus repeat trials (and vice versa for low-switch frequency blocks). Therefore, any reductions in switch costs that were observed could have resulted from participants’ learning of task- and/or item-specific associations with switch frequencies instead of linking the temporal, list-wide context to a greater need for flexibility.

In the current paper, we present a series of four experiments that shed light on these unanswered questions about the scope and mechanisms of meta-control over the stability-flexibility tradeoff, as indexed by the LWPSE. Experiment 1 attempts to replicate the LWPSE using the design of Monsell & Mizon (2006) with a different stimulus set. Using three instead of two tasks, Experiment 2 tests whether the LWPSE is preserved when participants do not know which specific alternate task they will switch to. To tease apart list- and task-level biases, Experiments 3a and 3b probed for the generalization of the LWPSE to an unbiased “transfer task,” which occurred equally often as switch and repeat trials, presented in blocks with overall high or low switch bias. Following a similar logic, Experiment 4 used switch proportion biased versus unbiased stimuli to investigate whether the LWPSE can be observed in the absence of item-level biases. The data and materials for all experiments are available at https://osf.io/5cxam/, and none of the experiments were preregistered.

Experiment 1

The first experiment was a conceptual replication of Monsell & Mizon’s (2006) Experiment 4. We sought to replicate the switch-proportion-dependent modulation of switch costs to validate a basic task protocol with which to assess the determinants of the LWPSE in the subsequent experiments. Specifically, participants performed cued letter and digit categorization tasks under within-subject manipulations of task sequence (task repeat vs. task switch trials), cue-stimulus interval (CSI; short: 190 ms or long: 840 ms), and the proportion of switch trials per block (30%, 50%, or 70%). The CSI factor was included because the pattern of results in prior work suggested that the LWPSE may be CSI-dependent, with maximal effects of switch proportion evident at short CSIs (Monsell & Mizon, 2006). In other words, participants may rely more on context in aiding their task-set updating strategy when they have less time to utilize the trial-by-trial cue for task set reconfiguration. We therefore expected the reduction in switch costs with an increasing proportion of switch trials to be most pronounced in the short CSI condition.

Method

Participants

A power analysis based on the effect size of the smallest switch cost modulation (switch cost difference between the 50% and 75% switch condition) in Monsell & Mizon (2006) Experiment 4 suggested a total sample size of 26 to achieve 0.95 power. To be conservative and to take into account larger participant exclusion rates for online testing, we roughly doubled this estimate and recruited 56 participants from MTurk. The experiment lasted ~60 minutes, and 16 participants were excluded from data analysis for lower than 75% overall accuracy on the task, leaving a final sample size of 40.

Stimuli

Task stimuli consisted of a letter and a digit displayed simultaneously at either side of the center of the screen for each trial. The letter was randomly selected from A, E, I, U, G, K, M, or R, and the digit was randomly selected from 2, 3, 4, 5, 6, 7, 8, or 9. Whether the letter or the digit was presented on the left or right was randomized across trials.

Procedure

Experiment procedures roughly followed Experiment 4 of Monsell & Mizon (2006). Each trial began with a blank interval of 1,010 ms (short CSI condition) or 360 ms (long CSI condition), followed by a 450-ms long fixation display, a cue display lasting 150 ms, and another blank interval of either 40 ms (in the short CSI condition) or 690 ms (in the long CSI condition). Finally, the task stimuli appeared and remained on screen for 1,200 ms. The lengths of the blank intervals were varied so that the RSI, or the sum of the blank intervals, fixation display, and cue display, was a constant 1,650 ms for both short and long CSI trials (Fig. 1).
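The timing arithmetic above can be checked with a minimal Python sketch. The durations are taken from the text; the function and variable names are illustrative, and the sketch assumes that the CSI runs from cue onset to stimulus onset (cue display plus the blank interval that follows it), which reproduces the 190-ms and 840-ms values given for Experiment 1.

```python
# Trial timeline in ms, as described in the Procedure.
FIXATION_MS = 450
CUE_MS = 150
BLANKS_MS = {
    "short": {"pre_fixation": 1010, "post_cue": 40},
    "long": {"pre_fixation": 360, "post_cue": 690},
}


def csi(condition):
    """Cue-to-stimulus interval: cue display plus the blank that follows it
    (assumption: measured from cue onset)."""
    return CUE_MS + BLANKS_MS[condition]["post_cue"]


def rsi(condition):
    """Sum of both blank intervals, the fixation display, and the cue display."""
    blanks = BLANKS_MS[condition]
    return blanks["pre_fixation"] + FIXATION_MS + CUE_MS + blanks["post_cue"]


for condition in ("short", "long"):
    print(condition, csi(condition), rsi(condition))
```

Under these assumptions, both conditions sum to the same 1,650 ms, so only the position of the cue within that interval differs between short and long CSI trials.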

Participants were required to perform a letter classification task (“Is the letter a vowel or consonant?”) if they saw the cues “Letter” or “Alphabet” and to perform a digit classification task (“Is the digit odd or even?”) if they saw the cues “Digit” or “Number.” The 2:1 cue-to-task mapping allowed us to change the cue on every trial, regardless of whether the task was switched or repeated, thus eliminating the contribution of possible response time benefits that come from repeating cues on task repeat trials (Logan & Bundesen, 2003; Mayr & Kliegl, 2003) to our computation of task switch costs. Participants had to press the “d” or “k” key to categorize the stimuli as vowel/consonant or odd/even. Participants were randomly assigned to different response mappings for each task. Correct responses were followed by a 500-ms blank screen, and incorrect responses were followed by the word “Incorrect” displayed for 2,000 ms. Responses made while the task stimulus was not onscreen were considered incorrect.

Each participant completed 18 blocks of 31 trials. All trials except the first in each block were coded as belonging to either the task switch (preceded by a different task) or task repeat (preceded by the same task) condition. The percentage of switch trials per block was 30, 50, or 70; there were 6 blocks of each switch proportion condition. The trial sequence for each block was generated pseudo-randomly according to an algorithm that ensured each task was presented an approximately equal number of times. In the 30% switch block, each task was presented either 4 or 5 times as a switch trial and 11 or 10 times as a repeat trial, creating a 9:21 switch to repeat ratio. In the 70% switch block, the number of switch versus repeat trials per task was reversed, creating a 21:9 switch to repeat ratio; in the 50% switch block, each task was presented either 7 or 8 times as repeat and as switch trials. For a table depicting switch/repeat frequencies for each task in this and subsequent experiments, refer to Appendix 1 (Table 5). All six blocks of the same switch proportion were presented consecutively to increase the saliency of the switch/repeat context, but the presentation order of the chunk of blocks with the same switch proportion was counterbalanced across participants. CSI alternated from block to block beginning with the short CSI. Before starting the main experiment, participants completed two short CSI blocks and one long CSI block for practice. All practice blocks had 50% switch proportion.
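The exact sequence-generation algorithm is not specified here. As one possible sketch, a block with a fixed number of switch trials and balanced task frequencies could be produced by rejection sampling, as in the Python code below. The function names, the rejection-sampling approach, and the strict 15:15 balance criterion for the 30 coded trials are assumptions for illustration, not the procedure actually used.

```python
import random

TASKS = ("letter", "digit")


def make_block(n_switch, n_trials=31, rng=None):
    """Return a task sequence with exactly n_switch switch trials.

    The first trial is uncoded; each of the remaining n_trials - 1 trials
    is a switch or a repeat relative to the trial before it.
    Assumes n_trials - 1 is even so that the two tasks can be balanced.
    """
    rng = rng or random.Random()
    n_coded = n_trials - 1
    labels = [True] * n_switch + [False] * (n_coded - n_switch)
    while True:
        # Randomly order the switch/repeat labels and pick a starting task.
        rng.shuffle(labels)
        task = rng.randrange(2)
        seq = [task]
        for is_switch in labels:
            if is_switch:
                task = 1 - task
            seq.append(task)
        # Accept only sequences in which both tasks occur equally often
        # among the coded trials (hypothetical balance criterion).
        coded = seq[1:]
        if coded.count(0) == coded.count(1):
            return [TASKS[t] for t in seq]


def count_switches(seq):
    """Number of trials whose task differs from the preceding trial."""
    return sum(a != b for a, b in zip(seq, seq[1:]))


block = make_block(n_switch=9, rng=random.Random(1))
print(count_switches(block), block[1:].count("letter"), block[1:].count("digit"))
```

Calling `make_block` with 9, 15, or 21 switch trials corresponds to the 30%, 50%, and 70% switch blocks. Because switches alternate between the two tasks, 9 switch trials necessarily split 4:5 across tasks, which matches the per-task counts described above.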

Design

The experiment followed a 2 (task sequence: switch vs. repeat) × 2 (CSI: long vs. short) × 3 (switch proportion: 30% vs. 50% vs. 70%) repeated-measures factorial design.

Results and Discussion

For assessing performance accuracy, we analyzed data from all trials after excluding practice blocks and the first trial of each block. For RT analyses, we additionally excluded incorrect trials, and trials following incorrect trials. After applying these exclusion criteria, trials with response times (RT) outside 1.5 times the interquartile RT range of the remaining sample were filtered out for the RT analyses. Descriptive statistics are displayed in Table 1. Excluded trial counts and the number of remaining trials per smallest and largest cells are included in Supplementary Materials, Appendix 1, Table 4.
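The RT trimming step could be implemented along the following lines. This Python sketch assumes the conventional reading of the criterion, namely that a trial is dropped when its RT lies more than 1.5 interquartile ranges below the first quartile or above the third quartile; the quartile estimation method and the set of trials over which the quartiles are computed (e.g., per participant or pooled) are not specified in the text and are left to the caller. The RT values are made up for illustration.

```python
from statistics import quantiles


def iqr_filter(rts, k=1.5):
    """Keep RTs within [Q1 - k*IQR, Q3 + k*IQR] (assumed trimming rule)."""
    q1, _, q3 = quantiles(rts, n=4)  # default quartile method of the stdlib
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [rt for rt in rts if lower <= rt <= upper]


# Hypothetical RTs in ms; the last value is an obvious outlier.
rts = [650, 700, 710, 720, 730, 740, 760, 2400]
kept = iqr_filter(rts)
print(kept)
```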

Table 1. Mean response times (ms) and accuracy (%) with standard errors in Experiments 1 and 2 as a function of CSI and switch proportion

We ran a repeated-measures analysis of variance (ANOVA) with the independent variables of task sequence (switch vs. repeat), CSI (long vs. short), and switch proportion (30% vs. 50% vs. 70%). Replicating classic effects in the task switching literature, we observed a main effect of task sequence (i.e., switch costs), as reflected in slower RTs for switch trials (M_switch = 734.59 ms) compared with repeat trials (M_repeat = 712.15 ms), F(1,39) = 84.02, p < 0.0001, η_p² = 0.68; a main effect of CSI, as short CSIs yielded longer RTs (M_short = 784.74 ms) than long CSIs (M_long = 665.79 ms), F(1, 39) = 312.72, p < 0.0001, η_p² = 0.89; and a task sequence × CSI interaction (F(1, 39) = 10.77, p = 0.002, η_p² = 0.22), wherein short CSI trials produced larger switch costs (M_switchcost = 32.33 ms) than long CSI trials (M_switchcost = 18.45 ms).

More crucial to the focus of the current study, there was a significant interaction effect of task sequence × switch proportion, F(2, 78) = 3.87, p = 0.02, η_p² = 0.09, as switch cost was greater in the 30% switch condition (M_switchcost = 32.6 ms) than in the 50% (M_switchcost = 20.84 ms) and 70% (M_switchcost = 22.73 ms) switch conditions. Moreover, as anticipated, the interaction effect of task sequence × switch proportion was driven by a modulation of switch cost by switch proportion in the short but not in the long CSI condition (Fig. 2), as supported by a three-way interaction between task sequence × CSI × switch proportion (F(2, 78) = 3.14, p = 0.05, η_p² = 0.07). Post-hoc tests revealed that, in the short CSI condition, switch cost for the 30% switch condition (M_switchcost = 45.56 ms) was significantly larger than the 50% switch condition (M_switchcost = 24.34 ms, p = 0.004) and the 70% switch condition (M_switchcost = 27.10 ms, p = 0.025). On the other hand, in the long CSI condition, there were no significant switch cost differences between any of the three different switch proportions (p = 1). No other main or interaction effects were significant.
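For readers less familiar with the measure, the switch cost is the mean switch-trial RT minus the mean repeat-trial RT, and the LWPSE is the difference in switch cost between switch-proportion conditions. The Python sketch below applies this arithmetic to the means reported above; the function name is illustrative, and the derived differences are simple subtractions of the reported values rather than additional statistics from the study.

```python
def switch_cost(mean_switch_rt, mean_repeat_rt):
    """Switch cost in ms: switch-trial RT minus repeat-trial RT."""
    return mean_switch_rt - mean_repeat_rt


# Overall switch cost from the reported main-effect means.
overall = switch_cost(734.59, 712.15)

# Reported short-CSI switch costs (ms) by switch proportion.
short_csi_costs = {30: 45.56, 50: 24.34, 70: 27.10}

# Difference in switch cost between the 30% and 70% conditions.
lwpse_30_vs_70 = short_csi_costs[30] - short_csi_costs[70]

print(round(overall, 2), round(lwpse_30_vs_70, 2))
```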

An identical ANOVA was run on participants’ mean accuracies. There was an expected main effect of task sequence (F(1,39) = 28.63, p < 0.0001, η_p² = 0.42), as participants performed with lower accuracy on switch (M_accuracy = 0.83) compared with repeat trials (M_accuracy = 0.88). We also observed a main effect of CSI, as trials with shorter CSI periods (M_accuracy = 0.83) produced significantly lower accuracy rates (F(1,39) = 34.63, p < 0.0001, η_p² = 0.47) compared with long CSI trials (M_accuracy = 0.89). There also was a significant task sequence × CSI interaction (F(1,39) = 6.19, p = 0.02, η_p² = 0.14), wherein short CSI trials were associated with larger accuracy switch costs (repeat minus switch; M_switchcost = 0.05) than long CSI trials (M_switchcost = 0.03). Unlike in the RT data, all other effects were nonsignificant.

For this experiment and all following experiments, congruency effects in RTs and accuracy are reported in Supplementary Analyses (see Appendix 2).

Experiment 1 successfully replicated the key results of Experiment 4 of Monsell & Mizon (2006) in RTs: switch costs were reduced in conditions where switching was more frequent, but only for short CSIs. However, it is noteworthy that, unlike in Experiment 4 of Monsell & Mizon (2006), switch cost reductions in the current experiment seemed to be mainly driven by increases in repeat trial RTs, rather than decreases in switch trial RTs. This pattern of results is observed across all four experiments in this study and is discussed in depth in the General Discussion, where a probable explanation of the lack of switch trial RT improvements is offered.

Another caveat to interpreting our results is that switch frequency may be confounded with run length, i.e., the number of consecutive task repeats (Bonnin et al., 2011). Because run lengths are longer in low-switch frequency blocks, repeated exposure to the same task could promote within-run RT speeding and produce greater task-set inertia that requires more laborious inhibition when participants finally encounter a switch trial, leading to RT slowing. However, finding switch cost adjustments even after restricting their analysis to the first three positions in a run, Bonnin et al. (2011) demonstrated that run length is not the primary contributor to the LWPSE.

Nonetheless, the switch cost reductions we observed suggest that participants employ the statistics of control demands—the incidence of switch trials in the different blocks—to guide their cognitive strategies. Moreover, this context-sensitive adjustment in switch-readiness is only evident under conditions where the cue-to-target interval is too short to engage in substantial task-set reconfiguration prior to target onset on a trial-by-trial basis. In the following experiments, we sought to characterize more closely the scope and sources of this form of learned cognitive flexibility.

Experiment 2

Experiment 1 replicated the basic effect of switch proportion (Monsell & Mizon, 2006), demonstrating that the costs of switching are lower when switches are more frequent. However, because participants were only switching between two tasks, they always knew which particular task they would be switching to when they expected a switch. Therefore, the results of Experiment 1 may reflect specific preparation for the particular alternate task instead of a general adjustment of switch-readiness or cognitive flexibility. To test whether the changes in switch cost reflect a modulation of generalizable switch readiness rather than better preparation for a specific alternate task, we adapted the design of Experiment 1 to involve three tasks instead of two (see Chiu & Egner (2017) for an equivalent approach in the context of an item-specific switch proportion manipulation). If switch cost were still moderated by switch proportion in Experiment 2, this would constitute evidence that participants are capable of using context to facilitate task switching even when they do not know what task they are switching to. That three-task paradigms make actively anticipating the upcoming task more difficult in turn implies that inhibition (or lack thereof) of the previous task (Mayr & Keele, 2000) should have larger influences on the size of switch costs than anticipatory task-set reconfiguration (Rogers & Monsell, 1995; Monsell, 2003).

In Experiment 2, a color classification task was included as the third task, in addition to the letter and digit tasks. Participants were cued from trial to trial as to which of the three tasks to perform. Only a short CSI (200 ms) was used because Experiment 1 demonstrated that a long CSI eliminated the effect of switch proportion on switch costs. Additionally, only 30% and 70% switch blocks were used, because the switch cost difference between 50% and 70% switch proportion blocks was nonsignificant in Experiment 1. The 50% condition also is more difficult to compare to the other two conditions, as that condition has a greater level of overall task uncertainty (0.5 in a two-task design) compared with the 30% and 70% blocks, which are equated in terms of uncertainty (0.3 in a two-task design).