Increasing control improves further control, but it does not enhance memory for the targets in a face–word Stroop task

Jiménez, Luis; Méndez, Cástor; Agra, Oscar; Ortiz-Tudela, Javier

doi:10.3758/s13421-020-01028-2

Open access
Published: 06 March 2020

Volume 48, pages 994–1006, (2020)

Memory & Cognition

Luis Jiménez¹,
Cástor Méndez¹,
Oscar Agra¹ &
…
Javier Ortiz-Tudela ORCID: orcid.org/0000-0003-2844-2110²

Abstract

Recent research on the dynamics between attentional and memory processes has outlined the idea that applying control in a conflicting situation directly leads to enhanced episodic memory of the processed information. However, in spite of a small subset of studies supporting this claim, the majority of the evidence in the field seems to support the opposite pattern. In this study, we used a face–word Stroop task to enforce different control modes either from trial to trial or in an item-specific manner. Both manipulations of congruency proved to be effective in making participants’ responses to conflicting stimuli more efficient over time by applying a trial-specific control mode. However, these manipulations had no impact on memory performance on a surprise recognition memory test. To our knowledge, this is the first attempt at measuring the memory consequences of the application of specific control modes at the trial level. The results reported here call for caution and possibly reconceptualization of the relationship between cognitive control and memory.

Pursuing unusual goals (e.g., throwing a new sequence of punches when boxing or including new moves in your tango sequence) is more demanding than performing comparatively more habitual goals (e.g., sticking to your old moves in both scenarios) because to reach infrequent goals, performers have to take every step required to accomplish such goals and also prevent the potential intrusion coming from more habitual actions performed in those contexts. The processes recruited to overcome the conflict between alternative action courses are collectively referred to as “cognitive control,” and they have been explored systematically by means of interference lab tasks such as Stroop (MacLeod, 1992; Stroop, 1935), flanker (Eriksen & Eriksen, 1974), or Simon (Simon & Berbaum, 1990) tasks. For instance, in a Stroop task, if a participant is told to respond to a word denoting a color by referring to the color in which it is printed, its semantic content leads to an interference that is measured as the difference in reaction times between the conditions in which both features are congruent or incongruent with each other.

One important result from the literature on cognitive control is that the efficiency of control processes is not invariant, but is rather subject to systematic changes. Thus, the effect of congruency decreases immediately after responding to an incongruent trial (i.e., the congruency sequence effect, or CSE; Gratton, Coles, & Donchin, 1992), or after responding to a large proportion of incongruent trials over a given block (i.e., list-wide proportion congruency effect, or LWPCE; Logan & Zbrodoff, 1979), a specific context (context-specific proportion congruency effect, or CSPCE; Crump, Gong, & Milliken, 2006), or even for a specific item (item-specific proportion congruency effect, or ISPCE; Jacoby, Lindsay, & Hessels, 2003). Thus, it appears that the efficiency of cognitive control becomes finely attuned to the previous experience, and it improves precisely in those conditions in which it becomes challenged.

Learning and the dynamics of cognitive control

One of the most prominent attempts to account for the control dynamics outlined above came from the conflict monitoring theory (CMT) proposed by Botvinick, Braver, Barch, Carter, and Cohen (2001). The CMT suggests that conflict signals generated on an incongruent trial trigger a temporal up-regulation of cognitive control that improves its focus on the target immediately after having encountered that conflicting trial. Adaptation to conflict can explain both CSE and LWPCE, assuming that the increase of control produced immediately after a conflict trial does not decline completely after a single trial, but tends to produce gradual and cumulative effects, modulating control over extended periods of training. However, the model has more problems dealing with control effects that appear linked to specific contexts or specific types of trial. Temporal modulations such as those proposed by the CMT are not well suited to account for changes associated with specific features, which instead call for the acquisition of enduring associations between those features and the specific parameters of control that are required under these circumstances (Blais, Robidoux, Risko, & Besner, 2007; Verguts & Notebaert, 2008).

One prediction that follows from the link between associative learning and cognitive control is that the increased control triggered by any experience of conflict should result not only in more efficient responses but also in an enhanced encoding of the relevant episodes. Specifically, if the original CMT claims that the adaptation process takes place across successive trials (Botvinick et al., 2001), such improvements in control could be expected to lead to an enhanced encoding of the episodes that come right after an incongruent trial, rather than of the event that generated the conflict. However, other interpretations have suggested that a full readjustment of control could take place within a single trial, thus supporting the prediction of an enhanced encoding of the conflicting episodes (Scherbaum, Fischer, Dshemuchadse, & Goschke, 2011). Other authors have claimed that the experience of conflict acts as a trigger for arousal responses, which improve the efficiency of associative learning and hence promote adaptation as a consequence of learning, rather than the other way around (Verguts & Notebaert, 2008). In any of these cases, regardless of whether associative learning plays an antecedent or consequent role in its dynamic relation with cognitive control, all these accounts predict a close relationship between control and memory.

Control as memory

A few other theoretical accounts have attempted to explain this learning-adaptation dynamic by adopting a full-fledged episodic standpoint, thus conceiving the observed modulations of control as the effects of priming, derived from the reinstatement of the features of a previous trial, or of the repetition of the whole cognitive set that recurs after having experienced the same settings on an immediately previous trial. Thus, rather than assuming that conflict leads to control and thus to increased learning (Botvinick et al., 2001), or that conflict leads to learning, and hence to improved control (Verguts & Notebaert, 2008, 2009), the episodic accounts claim that all cognitive processes are primed by the reinstatement of the contexts in which they were implemented, and thus that fluctuations in performance reflect different instantiations of that rule. This episodic view was originally proposed by Mayr, Awh, and Laurey (2003) to account for the CSE as a result of repetition priming, and by Hommel, Proctor, and Vu (2004) to understand this phenomenon in terms of previous experience with particular feature bindings. A more global instantiation of the same idea was put forward more recently by Egner (2014), assuming that the control adjustments triggered in a particular context become incorporated into the episodic event files, thus binding the internal cognitive state and the attentional settings applied to that context together with the features of that episode; this binding makes it easier to apply the same settings in exactly the same ways when that particular stimulus configuration recurs. In any case, the question remains open with respect to whether such generalized binding processes can support exclusively an improvement in response to a close replication of the same task or whether it could also improve memory for the identity of the target.

Conflict enhanced memory

The hypothesis of conflict-enhanced memory has been recently examined through several studies using different paradigms (Krebs, Boehler, De Belder, & Egner, 2015; Ortiz-Tudela, Milliken, Botta, LaPointe, & Lupiáñez, 2016; Ortiz-Tudela, Milliken, Jiménez, & Lupiáñez, 2018; Rosner, D’Angelo, MacLellan, & Milliken, 2015a). For instance, Krebs et al. (2015) used a face–word Stroop task, in which participants were asked to respond to the gender of a given set of faces that were overlaid with a distracting word (i.e., “MAN” or “WOMAN”). These words rendered Stroop-like congruency effects, as the gender of the face could match or mismatch the meaning of the word; accordingly, the authors measured faster responses when the word accurately indicated the gender of the target face. More important for the current purposes, the authors also reported that when participants were later asked to perform a recognition memory test on the target faces, they produced a larger proportion of high-confident recognition responses to those faces that were paired with an incongruent distracter.

Rosner and colleagues reported a similar result (Davis, Rosner, D’Angelo, MacLellan, & Milliken, 2019; Rosner et al., 2015a; Rosner, Davis, & Milliken, 2015b) using a naming task, in which participants were presented with pairs of spatially interleaved words written in two different colors and were told to read aloud the word written in one of the colors. Participants were faster when both words were identical, but they were more able to recognize those target words that had been presented with an incongruent distracter.

At variance with these previous results, Ortiz-Tudela et al. (2018) tested up to seven variations of a spatial cueing paradigm in which participants were told either to read aloud or to categorize a long series of words preceded by a visual cue that generated a spatial expectation about the target location. Even though these experiments succeeded in generating and breaking spatial expectations, as judged by the congruency effects obtained in participants’ reaction times (RTs), the authors found no evidence consistent with the hypothesis that a mismatch of such spatial expectations could be enough to trigger any enhancement in memory.

The present study

The main goal of this study was to further investigate the hypothesis that conflict enhances memory, going back to the original paradigm devised by Krebs et al. (2015), in an attempt to reproduce and extend the evidence gathered in that study. Because it is not clear whether the memory enhancement triggered by an upsurge of control should take place within a single trial (Scherbaum et al., 2011) or would be better expressed on the trial that immediately follows a conflicting trial, as originally proposed by the CMT (Botvinick et al., 2001), in Experiment 1, we conceptually replicated the original experiment, but assessed the effects of responding to a conflict trial (n) both on the recognition of the face presented on that trial (n) and on the face presented on the successive trial (n + 1), as compared with recognition of the face that immediately preceded the conflict trial (n − 1). To foreshadow the results, this experiment produced no evidence consistent with the hypothesis that conflict produced a generalized enhancement of memory, either for the conflicting trial or for the trial that immediately followed that conflict, even though it suggested that memory may be selectively enhanced for those conflicting trials that come right after another conflict trial.

In Experiment 2, we modified the procedure in an attempt to strengthen any boost in encoding directly triggered by cognitive conflict. We reasoned that if enhanced encoding was indeed triggered by conflict, but in a very mild way, maybe a single presentation of a given face in incongruent conditions was not enough to produce a significant modulation of memory; we therefore aimed at increasing any potential effects by repeating the presentation of certain faces under conditions of high or low proportion of congruency. As described above, the ISPCE has been documented in several procedures (Bugg, Jacoby, & Chanani, 2011; Jacoby et al., 2003), indicating that participants can learn to associate particular control settings to specific stimuli when they are consistently presented in conditions of high versus low conflict. Applying this reasoning to the face–word Stroop task, we expected that a repeated exposure of particular faces in conditions of high versus low conflict could lead to (1) an adaptive modulation of control, as measured by the ISPCE, and (2) larger differences in memory performance between congruent and incongruent faces when those were presented in conditions of high versus low proportion of congruency.

Experiment 1

The extent to which conflict-driven memory enhancements are restricted in time to the boost in encoding of the conflicting information or whether they could also affect information presented following the conflict is still unsolved. Accordingly, one prominent theory that explains CSE states that the detection of conflict between coactive representations triggers enhanced processing not only of the current event but also of subsequent stimuli (Gratton et al., 1992). In this experiment, we intended to test whether the up-regulation of cognitive control observed for n-lagged trials produces recognition memory benefits for the items in said trials. We used a face–word Stroop paradigm in which we measured congruency effects in recognition memory both for trial n and trial n + 1.

Method

Participants

The original effect found by Krebs et al. (2015) was observed on a sample of 20 participants. An a priori power analysis based on the result of their matched-samples t test (t = 2.29) indicated that the size of the difference between the recognition of congruent and incongruent faces amounted to a Cohen’s d of .51, an effect size that the original sample of 20 participants would be able to detect with a power (1 − β) of only .71. To increase that power to the recommended criterion of .80, we needed a sample of 28 participants. However, in order to increase that power to a target level of .90, we aimed at recruiting valid data from a total of 36 participants. We recruited 37 students from the Universidad de Santiago de Compostela to take part in the study. They signed informed consents and took part in the experiment in exchange for course credit. The study was part of a larger project that was approved by the local Ethical Committee of the University of Santiago de Compostela. One participant was removed from the sample because he or she did not understand the face gender task and simply watched the faces without responding to this task.
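The effect size quoted above is consistent with the standard conversion for a matched-samples t test, d = t / √n. The following is a minimal sketch of that conversion only, using the t value and sample size reported in the text; the function name is illustrative, and the power figures are not recomputed here because the text does not state which software or test directionality was used.

```python
import math


def cohens_d_from_paired_t(t_value: float, n_pairs: int) -> float:
    """Cohen's d for a matched-samples t test: d = t / sqrt(n)."""
    return t_value / math.sqrt(n_pairs)


# Values reported for the original study: t = 2.29, n = 20
d = cohens_d_from_paired_t(2.29, 20)
print(round(d, 2))  # 0.51
```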

Stimuli

The face stimuli were selected from the same database used by Krebs et al. (2015), the Glasgow Unfamiliar Face Database (Burton, White, & McNeill, 2010), that includes 304 stimuli corresponding to male and female faces, cropped to preserve exclusively the contour of the heads. From the overall sample, we performed an initial selection to exclude those that appeared especially distinctive, and selected a subset of 180 faces (90 male and 90 female) to be included in the study. In the familiarization and memory test phases, each face was presented alone, over a white background, with dimensions of approximately 5 × 7 cm (note that these dimensions varied slightly among different pictures, because their size is not completely uniform). In the face–word Stroop task, each face was overlaid with a congruent or an incongruent word (the Spanish words for MAN or WOMAN) written in black, Arial bold 24-point capitalized font (3.4 × 0.8 cm), located approximately over the nose area (see Fig. 1).

Procedure

To conceptually replicate the procedure of Krebs et al. (2015), participants were first presented with a familiarization task, followed by the face–word Stroop task, and a surprise memory test, which was administered after a distracter task in which participants were asked to perform an unrelated task for a period of approximately 15 min.^{Footnote 1}

Familiarization

In the familiarization task, participants had the first opportunity to view the faces that were going to appear later in the Stroop task. This was included in the experiment by Krebs et al. (2015) under the argument that responding to completely novel stimuli can reduce the interference caused by the irrelevant words, and to avoid floor effects in the memory test. Participants were asked to pay attention to the faces, and to indicate whether each face had been seen previously or not. In the present version of the task, participants used the computer mouse to click on four possible buttons represented as pictures at the bottom of the screen, that contained the legends “sure not,” “believe not,” “believe yes,” and “sure yes” to represent their response to the question of whether that particular face had been presented earlier or not. For each participant, the program randomly chose 72 out of the 90 pictures of male faces, and another 72 out of the 90 pictures of female faces, for a total of 144 faces, which were presented twice at random. Thus, an already-presented face could appear at every moment in the task, and participants were free to inspect the faces for as long as they wished before deciding on a response. The following trial appeared immediately after having responded to the previous trial, and the task continued up to the end of the 288 trials.
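The construction of the familiarization list described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' program: the face identifiers, the function name, and the choice to shuffle the doubled list in one pass (so that a repetition can occur at any point) are all hypothetical.

```python
import random


def build_familiarization_trials(male_ids, female_ids, n_per_gender=72, rng=None):
    """Pick n_per_gender faces of each gender and present each one twice."""
    rng = rng or random.Random()
    chosen = rng.sample(male_ids, n_per_gender) + rng.sample(female_ids, n_per_gender)
    trials = chosen * 2   # every selected face appears twice
    rng.shuffle(trials)   # assumption: one unconstrained random order
    return chosen, trials


# Hypothetical identifiers for the 90 male and 90 female faces
male_ids = [f"M{i:02d}" for i in range(90)]
female_ids = [f"F{i:02d}" for i in range(90)]

chosen, trials = build_familiarization_trials(
    male_ids, female_ids, rng=random.Random(0)
)
print(len(chosen), len(trials))  # 144 288
```

The faces left out of `chosen` (18 per gender, 36 in total) correspond to the pool from which new faces can later be drawn.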

Face–word Stroop task

After the familiarization task, the full set of 144 familiarized faces was presented once again in the context of a face–word Stroop task. Each of these Stroop trials was preceded by a fixation point presented for 1000 ms, centered on the position in which the irrelevant word would appear overlaid on the face. Both face and word then appeared for another period of 1000 ms, and participants were told to indicate the gender of the face regardless of the meaning of the word, using the keys “Z” and “M” from a standard QWERTY keyboard. The specific mapping between gender and responses was counterbalanced across participants, and participants were reminded of their particular mapping by two horizontal color bars (blue and pink) presented at the bottom of the screen at the relative locations corresponding respectively to male and female categories. Upon pressing a response key, the inner part of the bar corresponding to the chosen response turned grey, to provide immediate feedback on the performed response. If an error was committed, or if no response was emitted before the end of the 1000 ms exposure time, a warning error sound was emitted. Critically, the word superimposed on each face could be either congruent with the face gender (i.e., the word WOMAN on a female face) or incongruent with it (the word MAN over a female face), and the overall proportion of congruency was 50%.

Recognition memory test

After a delay of approximately 15 min, which was filled with an unrelated serial-reaction-time task, participants were presented with a surprise recognition memory test. On each of these trials, participants saw a face and were asked to judge whether it had been presented before or not, using the same categories employed during the familiarization task (i.e., “sure not,” “believe not,” “believe yes,” “sure yes”). The faces selected for the recognition task included 36 completely new faces (from here on, NEW) and four different types of faces already seen in the face–word Stroop task. These old faces were automatically selected by the program to include all the incongruent faces that occurred after a series of at least two congruent trials (from here on, CON-INC), the congruent faces that immediately preceded each of these selected incongruent trials (and that therefore occurred after another congruent trial, CON-CON), and those faces that followed the referred incongruent trials, which were further subdivided as incongruent postincongruent (from here on, INC-INC) and congruent postincongruent (from here on, INC-CON; see Fig. 2 for a visual depiction of the trial coding).

The number of trials contained in each category depended on the particular random distribution of trials generated for each participant, but it amounted to between 19 and 25 CON-INC and CON-CON trials, and the same number of postincongruent trials, which were evenly divided between INC-INC and INC-CON trials. This procedure allowed us to test recognition memory from a group of faces presented on closely neighboring trials, but differing specifically on the level of conflict experienced on that trial and on the preceding trial.
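The selection rule described above can be sketched as a single pass over the Stroop sequence. This is a minimal illustration, assuming the sequence is available as a list of booleans (True for congruent); the function name and the example sequence are hypothetical and do not come from the authors' program.

```python
def code_trials(congruent):
    """Label Stroop trials for the memory test.

    congruent: list of bools in presentation order (True = congruent).
    Returns a dict mapping trial index to one of
    'CON-INC', 'CON-CON', 'INC-INC', 'INC-CON'.
    """
    labels = {}
    for i in range(2, len(congruent)):
        # Incongruent trial preceded by at least two congruent trials
        if not congruent[i] and congruent[i - 1] and congruent[i - 2]:
            labels[i] = "CON-INC"
            labels[i - 1] = "CON-CON"  # congruent trial right before it
            if i + 1 < len(congruent):
                # Trial right after the selected incongruent trial
                labels[i + 1] = "INC-CON" if congruent[i + 1] else "INC-INC"
    return labels


# Hypothetical sequence: C C I I C C I C
sequence = [True, True, False, False, True, True, False, True]
labels = code_trials(sequence)
print(sorted(labels.items()))
```

Under this rule a trial never receives two different labels, because a trial that follows a selected incongruent trial cannot itself satisfy the conditions for CON-INC or CON-CON.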

Results

Familiarization phase

The familiarization phase was analyzed only to confirm that participants were performing the orienting task properly and to assess if the amount of time devoted to processing each face depended on whether it was new or repeated. Recognition responses of “sure not,” “believe not,” “believe yes,” and “sure yes” were coded as −2, −1, 1 and 2, respectively. Participants produced an average negative score of −0.91 for the first presentation of the faces, and a positive score of .62 for their second presentation. An ANOVA conducted on these scores showed that participants’ responses accurately discriminated between new and repeated faces, F(1, 35) = 402.08, p < .001, η_p² = .92. If recognition responses were simply taken at their qualitative value, either as “yes” or “no,” the average proportion of correct responses amounted to .76 for the new faces and .66 for the repeated faces. Both scores were significantly larger than those expected by chance, t(35) = 12.11, p < .001, for new faces, and t(35) = 7.24, p < .001 for repeated faces. Responses were also faster for repeated than for new faces (1675 vs. 1860 ms), F(1, 35) = 15.91, p < .001, η_p² = .31. Taken together, these results indicated that participants were aptly performing the familiarization task.
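The two scoring schemes used in this analysis (the signed confidence score and the qualitative yes/no accuracy) can be sketched as below. The response labels and their numeric codes are taken from the text; the function names and the example responses are hypothetical and serve only to illustrate the arithmetic.

```python
# Numeric coding of the four response categories, as described in the text
SCORES = {"sure not": -2, "believe not": -1, "believe yes": 1, "sure yes": 2}


def mean_score(responses):
    """Average signed confidence score for a list of response labels."""
    return sum(SCORES[r] for r in responses) / len(responses)


def proportion_correct(responses, is_repeated):
    """Qualitative accuracy: 'yes' is correct for repeated faces, 'no' for new ones."""
    hits = sum((SCORES[r] > 0) == is_repeated for r in responses)
    return hits / len(responses)


# Hypothetical responses to four first-presentation (new) faces
new_face_responses = ["sure not", "believe not", "believe yes", "sure not"]
print(mean_score(new_face_responses))                 # -1.0
print(proportion_correct(new_face_responses, False))  # 0.75
```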

Face–word Stroop task

Participants’ performance in this task was generally fast (537 ms) and accurate (.91 of correct responses). Proportion of correct responses and response times (RTs) were submitted to separate analyses of variance (ANOVAs), with congruency of the current trial (congruent vs. incongruent) and congruency of the previous trial (previous congruent vs. previous incongruent) as within-participants factors. For the analyses of RTs, only correct trials were included.

A Stroop effect was found both on RT and accuracy measures, as participants responded faster (517 vs. 557 ms) and more accurately (.95 vs. .87) to congruent than to incongruent trials, F(1, 35) = 45.57, p < .001, η_p² = .57, and F(1, 35) = 42.86, p < .001, η_p² = .55, respectively, for RT and proportion of hits. The effect of previous congruency was also significant for RT, showing faster responses after a congruent trial (527 vs. 547 ms), F(1, 35) = 18.39, p < .001, η_p² = .34, but not for the measure of accuracy (.912 vs. .907), F < 1. Even though the numerical pattern suggested that congruency effects were larger after a congruent trial than after an incongruent trial (13 ms and 1.9 points in accuracy), the Congruency × Previous Congruency interaction was not significant in any of the analyses, F(1, 35) = 1.65, p = .21, η_p² = .05, for RT, F < 1 for accuracy.
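The partial eta squared values reported with each F test can be recovered from the F value and its degrees of freedom through the standard identity η_p² = (F · df_effect) / (F · df_effect + df_error). The sketch below applies that identity to the RT and accuracy tests reported above; the function name is illustrative.

```python
def partial_eta_squared(f_value: float, df_effect: int, df_error: int) -> float:
    """Partial eta squared from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# F values reported above, all with (1, 35) degrees of freedom
for f_value in (45.57, 42.86, 18.39, 1.65):
    print(round(partial_eta_squared(f_value, 1, 35), 2))
# prints 0.57, 0.55, 0.34, 0.05
```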

Recognition memory test

As shown in Fig. 3, the results indicate that participants’ recognition responses discriminated clearly between new and old faces, but that much smaller differences were found among the patterns observed in response to all the remaining types of trials. In keeping with the analyses conducted by Krebs et al. (2015), we focused on participants’ high-confidence recognition responses, as we expected to obtain an improved proportion of such responses specifically for faces presented under incongruent conditions. A preliminary analysis comparing the average proportion of high-confidence recognition responses for the full set of old faces as compared with those provided in response to new faces clearly showed that participants were able to discriminate between these two types of faces, F(1, 35) = 218.33, p < .001, η_p² = .86 (.55 vs. .14).

Once adequate overall memory performance was ensured, we turned to our main analyses of interest: memory performance as a function of current and previous trial congruency at encoding. In contrast with Krebs et al.’s (2015) result, an overall comparison of the proportion of high-confidence responses across all congruent and incongruent trials showed no significant difference between them, F(1, 35) = 1.58, p = .22, η_p² = .04. Participants recognized incongruent faces (.563) slightly more frequently than congruent faces (.540), but a Bayesian analysis suggested that the evidence in favor of a difference between them was merely anecdotal, BF₁₀ = .456.

For the analysis of the Previous Congruency × Current Congruency interaction, we analyzed the proportion of high-confident recognition responses to old faces presented either in a congruent or in an incongruent Stroop trial, as a function of the congruency of the preceding Stroop trial. An ANOVA with congruency and previous congruency as repeated measures showed no effect of previous congruency (F < 1), but it showed both a significant Congruency effect (.537 vs. .577), F(1, 35) = 4.23, p = .047, η_p² = .11, and a significant Congruency × Previous Congruency interaction, F(1, 35) = 5.58, p = .024, η_p² = .14. This interaction showed a higher rate of high-confident recognition responses to faces presented under incongruent conditions selectively when they followed another incongruent trial (.523 vs. .606), F(1, 35) = 7.37, p = .011, η_p² = .17, but not when they occurred after a congruent one (.551 vs. .548), F < 1. Bayesian paired t tests confirmed that there was moderate evidence for the effect of congruency on recognition responses after an incongruent trial, BF₁₀ = 4.05, but not after a congruent trial, BF₁₀ = 0.18.

Discussion

The results of Experiment 1 showed clear congruency effects during the face–word Stroop task, but, unexpectedly, they failed to show a reliable CSE. Interestingly, the recognition task showed no evidence consistent with the hypothesis that conflict produced a generalized enhancement of memory, either for the conflicting trial or for those trials that come immediately after conflict. As noted in the introduction, this conflict-enhanced memory effect has been previously observed in similar paradigms, but the main aim of Experiment 1 was to assess whether the up-regulation of control provoked by an incongruent trial could lead to a better remembering of the face presented during the conflicting trial, or of the one presented on the following trial. Contrary to both hypotheses, we found significantly higher recognition scores only for those incongruent trials that immediately followed another incongruent trial, thus suggesting that conflict over two successive trials might be required to trigger an effective increase in recognition.

In hindsight, the failure to replicate the recognition benefit found by Krebs et al. (2015) can be taken as less of a surprise if we consider two empirical and conceptual arguments. First, on purely empirical grounds, one should notice that the behavioral effect reported by Krebs et al. was relatively small, and it was statistically significant only in the context of a multiple series of planned t tests that compared the proportion of high-confident recognition responses given to a set of congruent, neutral, and incongruent trials, without any correction for multiple comparisons. Second, on more conceptual grounds, one must take into account that recognition memory performance in this procedure can be driven not only by encoding those faces during the face–word Stroop task but also by processing of the same faces during the previous familiarization task. Note that Krebs et al.’s (2015) design and ours include a new set of faces as lures for the recognition memory phase, and thus it is impossible to disentangle effects of the two encoding phases. Moreover, if a conflict-driven enhancement is indeed taking place but is weak in nature, it is possible that a single presentation of the faces in congruent or incongruent conditions is not enough to override the effects of the familiarization task. In Experiment 2, we introduced two main changes intended to increase the impact of conflict on the measures of memory: first, we made the conflict manipulation stronger, by presenting the faces repeatedly under congruent or incongruent conditions, and, second, we made the memory task dependent exclusively on the experiences gathered within the conflict task.

Experiment 2

Experiment 2 aimed at further exploring the puzzling result of Experiment 1 by testing whether repeated exposure of faces under congruent or incongruent conditions could produce differences in remembering by virtue of triggering an additive encoding boost. To achieve this main goal, we used the same core paradigm of Experiment 1, with the following changes: First, during the Stroop task, we repeatedly presented the faces under conditions of either high or low conflict, to reinforce any possible effect of conflict-enhanced memory produced by a single presentation. In order to strengthen any congruency effect, we also moved the presentation of the distracter word earlier in time (Appelbaum, 2009). Second, only a subset of familiarized faces was passed along to the Stroop task, and the remaining ones were used as lures in the memory test; we therefore changed from a pure recognition memory task to a source memory task (Konopka & Benjamin, 2009). In doing so, we achieved the dual goal of avoiding any functional ceiling effect that could have been reached if participants were presented with a simple recognition task after having experienced multiple exposures to a reduced group of faces, and making recognition performance depend exclusively on the experience accumulated in the conditions of high versus low conflict.

Finally, to ensure that our manipulation effectively affected the amount of control exerted on each trial, we included a cognitive control manipulation that has been shown to modulate the degree of control at the item level. Instead of repeating some faces exclusively under incongruent conditions and others only under congruent conditions, we manipulated the proportion of congruency in three probabilistic levels for different items and assessed whether the items that appeared most frequently under incongruent conditions produced (1) smaller congruency effects in the Stroop trials and (2) higher levels of recognition in the memory test, as compared with those presented most frequently under congruent conditions.