A few years ago one of the authors spent some days in Florida in a small hotel on the beach. During one of his morning runs on the shore, he realized he had forgotten the name of his hotel. Having repeatedly stayed at another hotel on a similar Florida beach, the name of the previous hotel The Diplomat came to mind, blocking his memory of the correct hotel name. He only found out the name of his hotel when he got back from his run, it was The Prestige. This is an example of red herring retrieval, that is, a mental block that occurs when a wrong answer comes from memory, blocking the correct response. In this example, two factors influenced retrieval of the wrong answer: repetition (he had stayed at The Diplomat twice) and context (it was on a similar beach in Florida). In the present study we tested a laboratory version of red herring retrieval, examining the influence of two factors known to affect memory of such wrong answers’ retrieval, repetition and context.

Exposure to wrong answers has been shown to impede creative problem solving (Smith & Blankenship, 1989, 1991; Wood & Pennington, 1973), a finding referred to as a fixation effect (e.g., Kohn & Smith, 2009; Smith, 1995a, 1995b, 2003; Smith & Blankenship, 1989, 1991; Wiley, 1998). Some studies of fixation effects have presented these wrong answers, what we will refer to as red herrings, simultaneously with problems (e.g., Smith & Blankenship, 1989, 1991), others have pre-exposed participants to these misleading stimuli (e.g., Smith & Blankenship, 1991; Wood and Pennington, 1973), suggesting that such fixation effects might be attributable to retrieval of the wrong answers. Do manipulations of variables that affect long term memory retrieval of red herrings affect observed levels of fixation in creative problem solving? Although it has been implied by previous studies that the memory strength of red herrings determines the strength of the subsequent fixation effect that occurs, in fact, there have been no previous reports of tests of this principle, what we will refer to as the red herring retrieval hypothesis.

We use the term red herring rather than blocker because the term blocker implies both a cognitive process, response competition, and a result, namely, poorer performance. The presentation of misleading or wrong answers to problems might, or might not cause the intended reduction in performance, and even if they do, they may decrease problem solving performance via a different cognitive mechanism than response competition. Here, we use the term red herring to indicate the intended effect of such stimuli, that is, to divert the problem solver from the path that leads to a correct solution, just as the use of red herrings was intended to divert hunting hounds from the trails of game animals.

In the creative cognition literature (e.g., Finke, Ward, & Smith, 1992) many studies have examined fixation effects, that is, blocks to creative problem solving, which can be caused by various factors. For example, Duncker (1945) studied functional fixedness, that is, fixation based on the commonly known functions of experimental objects. In another example, Luchins and Luchins (1959) demonstrated the negative effects of mental set with their Einstellung paradigm. Smith and Blankenship (1989) experimentally induced fixation by presenting clue words intended as red herrings for creative problem solving; they found that presenting red herrings reduced initial performance on rebus problems. Although fixation can have many different causes, the experiments of Smith and Blankenship (1989) indicate that remembered red herrings can cause a fixation effect.

In addition to the experiments of Smith and Blankenship (1989), other studies have also used the method of pre-exposure to red herrings before participants attempt creative problem solving. Some studies that have used Remote Associates Test (RAT) problems (Mednick, 1962) with this pre-exposure technique include Smith and Blankenship (1991), Vul and Pashler (2007) and Wood and Pennington (1973).

Furthermore, there have also been studies that experimentally pre-exposed participants to examples of responses to a subsequent creative idea generation task (Smith, Ward, & Schumacher, 1993; Landau & Leynes, 2004), or a brainstorming task in a group (Kohn & Smith, 2011). Such studies of conformity effects are similar to studies of fixation in creative problem solving because they show that participants are constrained to using features of the pre-exposed examples. If this happens in a group, initial ideas might even create a collaborative inhibition, which will impair the performance of the whole group working on the same problem (Weldon & Bellinger, 1997). There is also a growing literature on design fixation effects (Jansson & Smith, 1991; Chrysikou & Weisberg, 2005; Purcell et al., 1993, Sio, Kotovsky, & Cagan, 2015; Vasconcelos et al., 2017), which shows that fixation to design examples, or even to past design works can create constraints when facing a task requiring a novel design solution. Thus, there are constraining effects of experimentally controlled stimuli that can impede creative cognition across a range of creative activities.

In a general sense, the RAT fixation paradigm that involves pre-exposure to red herrings resembles negative transfer paradigms that have been studied at least since Müller and Pilzecker (1900). Negative transfer is said to occur when prior learning causes poorer subsequent performance, such as poorer interpolated learning in retroactive interference experiments (e.g., Neill & Valdes, 1992). A lengthy body of literature exists that shows negative transfer using recall tests and other direct measures of memory, that is, tests in which the to-be-remembered events are explicitly referred to in the directions to participants. A smaller literature shows negative effects of prior learning on indirect measures of memory, that is, tests that do not refer to the events in question. Relative to direct measures of memory, such as cued recall, indirect measures, such as word fragment completion, anagram solution, or category production may involve different amounts of conscious recollection and automatic retrieval (Jacoby, 1998). Indirect memory measures can be affected, for example, by numbers of repetitions of target items (e.g., Jacoby & Dallas, 1981), and by manipulations of perceptual modality between encoding and test (e.g., Kirsner, Milech, & Standen, 1983; Scarborough, Gerard, & Cortese, 1979). In the present study we ask whether RAT problem solving, a test typically used to assess creativity (e.g., Benedek & Neubauer, 2013), can function as an indirect measure of memory. Given that RAT problems are solved by retrieving candidate associates to the test words, we assume that the RAT would be more of a conceptually-driven test, rather than a primarily data-driven one. Is RAT performance affected by encoding and retrieval manipulations (of red herrings), such as repetition and context reinstatement?

A negative transfer effect, with an indirect memory measure was demonstrated and studied by Smith and Tindell (1997), who used as blockers (or red herrings) words that were orthographically similar to word fragments that were subsequently tested. Smith and Tindell (1997) showed that this memory blocking effect with an indirect memory measure followed the predictions of the transfer appropriate processing theory; manipulations of level of processing at encoding did not influence the blocking effect seen on the primarily data-driven fragment completion test, but changes in perceptual modality (from auditory at encoding to visual at test) did influence the effect. Similar paradigms yielded similar results at Logan and Balota (2003), Landau and Leynes (2006), and Leynes, Brown, and Landau (2011). Lustig and Hasher (2001) reported similar findings in their study of implicit memory. Furthermore, Landau and Leynes (2006), who dubbed this finding the memory blocking effect (MBE), reported that neither time, nor depth of processing or awareness of the MBE affected the magnitude of the effect.

The other body of findings of negative transfer effects using an indirect memory measure come from exclusion tasks in the process dissociation procedure (e.g., Jacoby, Toth, & Yonelinas, 1993; Jacoby, 1998). For example, after reading a list of words, a participant may be asked to complete word stems without using words recollected from the previously encoded study list (e.g., Jacoby, 1998), or completes a cue-word+stem with a related word not from the study list (e.g., Jacoby, 1996). In these opposition tasks, automatic retrieval of forbidden items can supplant correct responding, but only when recollection of the studied responses fails. In the context of the present study, one may think of the forbidden (studied) responses on exclusion tasks as similar to red herrings that lead participants away from correct responses.

If fixation in creative problem solving can be increased by making red herrings more retrievable, then we can think of creative problem solving as a type of indirect memory measure, and fixation caused by retrieval of red herrings as analogous to other types of negative transfer effects. Our red herring retrieval hypothesis states that factors that make red herrings more retrievable should reduce creative problem solving performance, as measured with RAT problems.

Several studies have shown that pre-exposure to various types of red herrings can cause subsequent fixation or conformity effects in creative problem solving, creative idea generation and memory retrieval, however none of these studies has provided a test of the red herring retrieval hypothesis, which states that the memory strengths of red herrings determine the magnitude of a fixation effect. Alternatively, strengthening the retrievability of a red herring might decrease fixation effects if faster retrieval of red herrings allows them to be rejected more quickly. The faster one can put aside a wrong answer, the more able one will be to devote more mental resources to searching for the correct answer. We conducted the present experiments to test the different possible outcomes of increasing retrievability of red herrings.

FormalPara The present study

In the present experiments a fixation effect was elicited using previously studied red herrings, to make it difficult for participants to solve Remote Associate Test (RAT) problems (Mednick, 1962). In the RAT, often used as a measure of creative ability (e.g., Benedek & Neubauer, 2013), there are three test words in each problem (e.g., COTTAGE - SWISS - CAKE.) The task is to find a single word that can be used to make a compound word or a two word phrase with each of the three test words. The solution for this particular test item is CHEESE (COTTAGE CHEESE, SWISS CHEESE, CHEESECAKE.) To elicit a fixation effect, paired associates were assigned to each of the three test words. These paired associates were conceptually related to the test words, but were not related to the solution. For example, for the RAT problem COTTAGE - SWISS - CAKE the red herring associates made the pairs COTTAGE - hut, SWISS - chocolate, CAKE - icing. We predicted that when the red herrings are made easier to recall, the observed fixation effect would be greater.

Numerous factors can affect the accessibility of events, including encoding and test manipulations. In the present study, we tested our red herring retrieval hypothesis by manipulating an encoding variable: number of practice trials, and a test variable: context reinstatement. If the red herrings are repeated and practiced more, then more of them should be retrieved when a triad of RAT words are presented. Repetition improves both recollection and automatic retrieval of items in memory (e.g., Jacoby & Dallas, 1981). Therefore, lower solution rates were expected when there were more repetitions and practice trials of red herring words, as compared with less practice. Similarly, the context manipulation was predicted to affect retrieval of red herrings on RAT trials: when the initial encoding context was reinstated, more red herrings were expected to be retrieved (e.g., Smith, 1979; Smith & Vela, 2001; Smith & Manzano, 2010; Smith et al., 2014), causing solution rates to suffer for those RAT problems. Several studies have reported context reinstatement effects with indirect memory measures (e.g., Parker, Gellatly, & Waterman, 1999; Smith, Heath, & Vella, 1990), but whether such effects are caused by automatic retrieval or conscious recollection may still be in question (see Mulligan, 2011; Smith et al., 2018). We expect to find context dependent fixation regardless of whether our manipulations of context affect conscious recollection, automatic retrieval, or both types of retrieval processes.

The method in the present experiments used a particularly robust paradigm for context-dependent memory effects, including the use of perceptually rich contexts (photo backgrounds of various environments), small context-to-item fans (only three words per context), and intentional encoding of items (red herrings, in our case) with their associated contexts (see Smith, 2013). Thus, we expected context reinstatement of red herrings, relative to non-reinstatement, to cause poorer performance on associated RAT problems (Experiments 1 and 2), and we expected that more practice with red herrings, relative to less practice, would cause poorer RAT solution rates (Experiment 1).

Experiment 1

Participants in Experiment 1 were given four tasks that were aimed at making sure that the photo contexts were well encoded (the context encoding task), that red herrings were associated with RAT test words (the paired associates encoding task), and that test word-red herring associations were strengthened (the retrieval practice task). After the red herrings, contexts, and RAT test words were encoded, the Remote Associates Test was given; each critical RAT triad was tested by being superimposed over a test context, which either reinstated the encoding contexts of red herrings, or presented a different context than the one seen for that triad at encoding. It was predicted that more practice with red herrings, relative to less practice, and context reinstatement at test, relative to non-reinstatement, would both decrease performance on critical RAT problems, because practice and context reinstatement should improve retrieval of red herrings. Our context reinstatement prediction, however, was predicated on the assumption that the three word-word associations created for each RAT triad would not outshine (or overpower) the weaker context-to-word associations (Smith & Vela, 2001).

Method

Participants

Based on an a priori power analysis, our experiment needed approximately 72 participants to detect a medium sized effect (Cohen’s d= .5) with α = .05 and 1 − β = .95. Participants were undergraduate psychology students volunteering as participants to fulfill a course requirement. They could enroll for the particular time slots via a computerized database. The sessions were conducted with 3–15 participants at a time. Altogether we had 133 participants with 30, 32, 35 and 36 in four counterbalancing groups. The numbers of participants in the counterbalancing groups varied because participants could enroll themselves in any session of the experiment, and experiment sessions were randomly assigned to the counterbalancing conditions.

Design and materials

A total of 36 RAT problems were used in Experiment 1. A RAT problem consists of three words, and the solution is a single word that is strongly associated with each of the three test words (e.g., COTTAGE - SWISS - CAKE, solution - CHEESE). The problems were selected from the Compound Remote Associate Problem compendium published by Bowden and Jung-Beeman (2003), using problems with high normed levels of solution rates. For each triad of RAT words, red herrings were generated, one red herring word for each of the three test words, resulting in three paired associates for each RAT problem (e.g., COTTAGE-hut, SWISS-chocolate, CAKE-icing). Test words, solution words, and red herring words were unique within our set of materials; none were repeated in other problems. All of the RAT test words were presented in capital letters and all of the red herrings in lowercase letters throughout the experiment. RAT problems, their solutions and their associated red herrings are shown in the Appendix. The items were presented as a PowerPoint slideshow shown on a large screen in clear view of the participants.

A total of 72 photo contexts, or photographs of places, were used in Experiment 1. The photos showed familiar types of places, such as an airport or a restaurant, but the specific places were likely not known to our participants. Each photo context was assigned at random to a RAT problem, except that obvious pre-experimental associations of words and accompanying photos was avoided in each pairing. Stimulus words were shown in a red font and outlined with a yellow border, and they were superimposed over their accompanying photo contexts. On retrieval practice trials, the encoding contexts accompanying paired associates were the same contexts paired with those items at encoding. On the RAT, triads were tested either with the photo context of the red herrings’ encoding, or with a new photo, that is, a photo not seen before in the experiment.

A 2 (reinstatement, reinstated or new contexts at the test phase, a within-subjects variable) × 2 (repetition, zero or two retrieval practices, a within-subjects variable) × 4 (counterbalancing, a between-subjects variable) mixed design was used. The 36 RAT triads were randomly divided into four sets of nine triads, sets A, B, C, and D. In the first counterbalancing group (CB-1), items from set A were used for the zero-practice new-context condition, set B was used for the two-practice new-context condition, set C for the zero-practice reinstated-context condition, and set D for the two-practice reinstated-context condition. In CB-2, CB-3, and CB-4, each set of items was assigned to a different condition, so that across the four counterbalancing groups, each item was used in each treatment condition. The proportion of correctly solved RAT problems in the final phase of the experiment served as our dependent measure. Strict scoring was used, that is, no alternative solutions to the RAT problems were accepted.

Procedure

The PowerPoint slideshow had four sections corresponding to the experimental tasks, context encoding, paired associates encoding, paired associates retrieval practice, and finally, the Remote Associates Test. Participants were initially told that they would complete several different tasks during the experiment and that they would receive instructions when it was time for each task. Each task consisted of 36 slides, one for each of the critical items, that were presented in an order randomized in blocks of six.

For the first task, context encoding, participants gave relatedness ratings on a 1–10 scale between each set of three red herring words and their background context. The backgrounds were not conceptually related to the red herrings in any obvious way. Participants had 8-sec to look at the screen, and write down their ratings.

For the next task, paired-associates encoding, participants tried to memorize triads of paired-associates consisting of the three test words each linked with its red herring word. Each triad of paired-associates was superimposed over the same photo context that had been seen with the triad’s red herring words at encoding. Each study trial of the paired-associates encoding task was 5-sec.

Following paired-associates encoding was the paired-associates retrieval practice task. For half of the items, on each trial of the retrieval practice task, participants were shown the three RAT words superimposed over their encoding context (only for items in the two-retrieval practices condition), and they tried to recall and write down the three words that had been the responses (the red herrings.) Participants had 10-sec for each practice trial. After 10-sec, the correct responses were displayed, and participants either circled their response if they got it right, or they wrote down the solutions they could not remember. This self-correction procedure took an additional 10-sec. There were two retrieval practice trials for each item in this condition; the two practice trials were evenly distributed, with at least 8–10 slides between the two trials.

The final task was the Remote Associates Test. For each problem, participants had 15-sec to think of the solution. The word triads were superimposed over either the encoding background for the associated red herrings (reinstated context condition) or a different context that had not been seen previously in the experiment (new context condition.) The order of the three words was never the same on the RAT task as it had been on the encoding or the retrieval practice tasks. No mention was made of the previous tasks in the instructions. The method of Experiment 1 is shown on Fig. 1.

Fig. 1
figure 1

Experiment 1 method

Results

A 2 (context: reinstated, new context; a within-subjects variable) × 2 (repetition: zero, two; a within-subjects variable) analysis of variance (ANOVA) was computed with the proportion of RAT problems solved as the dependent variable.Footnote 1 The main effect of repetition was significant [F(1,132) = 5.167, p = .025, partial η2 = .038]; items that had two retrieval practices with red herrings, relative to items with no practice trials, had lower solution rates, as predicted by the red herring retrieval hypothesis. The main effect of context was not significant [F(1,132) < 1, p = .541, partial η2 = .003]; although, numerically, the reinstated context condition was slightly worse than the new context condition, as predicted, the difference was small. The context X repetition interaction was not significant [F(1,132) = 3.029, p = .084, partial η2 = .022]. Figure 2 shows the means for the four conditions. The proportion correctly solved was .44 (SD = .19) in the reinstated-zero practice condition, .38 in the reinstated-two repetition condition (SD = .2), .42 in the new-zero repetition condition (SD = .19), and .41 in the new-two repetition condition (SD = .18). Post-hoc paired comparisons of reinstated vs new context conditions were not significant in either the zero [t(132) = .69,p = .49] or two retrieval practice [t(132) = 1.61,p = .11] condition.

Fig. 2
figure 2

Mean proportion of correct solutions for the RAT as a function of the four conditions in Experiment 1. The error bars represent 95% within-subjects CI

Mean relatedness rating on a 1–10 scale between the context pictures and the red herrings, in which 1 meant extremely unrelated and 10 meant highly related, was 2.85 (SD = 1.6). A paired-samples t-test showed that participants during the second retrieval practice trial (M = .78, SD = .2) remembered the red herrings significantly better [t(255) = 37.37, p < .001, d = 2.34] than during the first retrieval practice trial (M = .46, SD = .18).

Discussion

Experiment 1 tested whether repetition and context reinstatement would modulate the strength of the fixation effect caused by presentation of red herrings. As predicted, with more retrieval practices of the red herrings, solution rates for the RAT items suffered. Two extra retrieval practice trials decreased solution rates by 4%, an effect size of partial η2 = .038. Because Experiment 1 did not include an unprimed baseline condition, it is not clear how this effect compares with the overall fixation effect that may have occurred. Experiment 2 included an unprimed condition.

The main effect of context reinstatement was not significant. The methods that we used to observe context-dependent fixation have been shown to produce robust context-dependent memory effects (e.g., (Smith, 2013)). The context stimuli that we used were photographs of real places, making them “rich” enough to yield robust effects. There were only three paired-associates per context, a small contextual fan size. Our context encoding task ensured that items and contexts had been directly associated. These methods have yielded robust context-dependent memory effects for tests of free recall (Smith & Manzano 2010, Smith et al., 2014), recognition (Shahabuddin & Smith, 2016), and paired-associates recall (Smith & Handy, 2014, 2016; Smith et al., 2014). Why, then, was the effect of context reinstatement not significant in Experiment 2

One possible reason for our failure to detect context reinstatement-enhanced fixation, is, of course, that there may not be an effect of context reinstatement on experimentally-induced fixation effects. Alternatively, context reinstatement may truly influence fixation, but effect size may be much smaller than what we observed in Experiment 1 for the effect of repetition on fixation, and we may have had insufficient power to detect the effect. We suspected a different reason, however, based on the types of associations between contexts and red herrings, and between test words and red herring words. Whereas the associations between contexts and red herrings were purely arbitrary and episodic (e.g., a photo of an airfield and the red herring word hut), the associations between test words and red herrings were semantic and meaningful (e.g., COTTAGE-hut). The stronger semantic associations, which existed even before participants enrolled in the experiment, were further strengthened in Experiment 1 by the paired-associates encoding task and the retrieval practice task. Thus, when the three test words on a RAT problem appeared, the presence of the strong word-word associates at test obscured the weaker effects of contextual stimuli, a phenomenon known as an outshining effect (e.g., McDaniel et al., 1989; Smith & Vela, 1986, 2001). An alternative to outshining might be that because of the strong association of the paired-associates, the contexts were never even encoded. The strongly associated red herrings overshadowed the arbitrary contexts at encoding (Pavlov, 1927, p.141). Although we did not test these two explanations per se, in Experiment 2 we removed the possibility of either outshining or overshadowing by having red herrings associated with contexts directly rather than pairing them with the RAT words as was done in Experiment 1.

Although the main effect of context and the context X repetition interaction were not significant in Experiment 1, the non-significant trend (p = .084) suggests that greater effects of context manipulations may occur with more retrieval practice trials in a given context. This possibility is supported by the reasoning that more context-item practice strengthens not only the items themselves, but also the item-context associations, making memory of items more dependent upon the context. Therefore, in addition to mitigating potential outshining effects in Experiment 2, we also tried to maximize the likelihood of finding context-dependent fixation effects by using two context-item retrieval practice trials for all items.

Experiment 2

In Experiment 1, triads of paired-associates, made from each RAT test word paired with its red herring (e.g., COTTAGE-hut, SWISS-chocolate, CAKE-icing), had been studied in association with photo contexts (e.g., a photo of an airfield). In Experiment 2, to avoid an outshining effect from powerful word cues, we instead paired the three red herrings for a subsequent problem (e.g., hut, chocolate, icing) with a photo context (e.g., an airfield). Participants were told that for each trial of a subsequent memory test, they would see a photograph, and they were to recall the three words that had been paired with the photo. Following two practice trials in which participants tried to recall the three words for each photo context (with post-trial corrective feedback), participants were given a test of RAT problems. A third of the RAT problems were displayed on the photo context that was paired with red herring words at encoding (primed-reinstated context), a third were shown superimposed over a new context (primed-new context), and a third did not correspond to any red herrings shown in the experiment (unprimed items). The red herring retrieval hypothesis predicted that the unprimed RAT problems would have the highest solution rate, the primed-new context problems would have the next highest solution rate, and RAT problems in the primed-reinstated context condition would have the lowest solution rate.

Furthermore, following an analysis of solution rates of RAT problems from Experiment 1, 12 of the original set of 36 RAT items were excluded from Experiment 2 because their solution rates were too low to show significantly decreased performance in the fixation conditions. For Experiment 1, we selected items with high solution rates from the Bowden and Jung-Beeman (2003) norms (above 33% in the 15 seconds per problem condition), because we expected our fixation manipulations to decrease solution rates. In Experiment 1, twelve of the RAT items had mean solution rates below 10%, a base rate too low to observe effects of fixation. Therefore we eliminated those 12 items from the stimulus materials used in Experiment 2. The remaining items in the Bowden and Jung-Beeman (2003) norm had baseline solution rates that were too low to observe fixation effects, therefore we used the reduced set of 24 items with high solution rates in Experiment 2.

Method

Participants

An a priori power analysis indicated a desired total sample size of 84 to detect a medium-sized (Cohen’s d = .5) effect with an alpha level of .25 and power of .95. Participants were undergraduate psychology students volunteering as participants to fulfill part of a course requirement. They could enroll for time slots via a computerized database. The sessions were conducted with 3–15 participants at a time. There was a total of 89 participants in Experiment 2: 30 in CB-1, 31 in CB-2 and 28 in CB-3.

Design and materials

Test condition (primed-reinstated context, primed-new context and unprimed) was a within-subjects measure, and counterbalancing (CB-1, CB-2 and CB-3) was a between-subjects measure. There were 24 items distributed in each test condition (see Appendix). In Experiment 2, the set of items (i.e., 8 RAT problems with their accompanying red herrings) in the primed-reinstated context condition in CB-1 were in the primed-new context condition in CB-2, and the unprimed (baseline) condition in CB-3. The other two sets of 8 items were similarly counterbalanced among the three counterbalancing groups. The proportion of RAT problems solved was the dependent measure.

Procedure

The procedure used in Experiment 2 was the same as described for Experiment 1, with the following exceptions.

On the context encoding task in Experiment 2, participants were instructed to memorize each set of 3 words appearing together on the screen (i.e., the 3 red herrings for a RAT problem) in association with the context photos.

For each trial of the retrieval practice task in Experiment 2, participants were shown one of the encoded photo contexts as a cue, and they tried to recall the three words that they had encoded on the previous task. Participants wrote these responses on paper, with 10-sec per retrieval practice trial. After 10-sec, the correct responses were shown on the screen, participants either circled the responses that they got correct on that trial, or they wrote down the responses they could not remember. This self-correction procedure took another 10-sec per trial. All 16 of the encoded items were tested in this way twice. The method of Experiment 2 is shown on Fig. 3.

Fig. 3
figure 3

Experiment 2 method

Results

A repeated measures one-way ANOVA was computed, to test the effects of test position (unprimed, primed-reinstated context, primed-new context), using the proportion of RAT problems solved as a dependent variable.Footnote 2 There was a significant difference among the three conditions [F(2,176) = 8.127, p < .001, partial η2 = .085] Post-hoc pairwise comparisons with a Bonferroni correction revealed, that compared to the unprimed condition (M = .539, SD = .195), solutions for the RAT problems were significantly worse when the encoding contexts of red herrings at the test phase were reinstated (M = .456, SD = .205) [t(88) = 4.011, p < .001, d = .425]. We also found that compared to the new context condition (M = .506, SD = .205) there was a significant decline in correct solutions if the encoding contexts of red herrings was reinstated at test [t(88) = 2.585, p = .034, d = .274]. The difference between the unprimed and primed-new context conditions was not significant [t(88) = 1.517, p = .399, d = .161] (Fig. 4).

Fig. 4
figure 4

Mean proportion of correct solutions for the RAT as a function of the three context conditions in Experiment 2. The error bars represent 95% within-subjects CI

A paired-samples t-test showed that participants during the second retrieval practice trial (M = .58, SD = .24) remembered the red herrings significantly better [t(169) = 21.64, p < .001, d = 1.66] than during the first retrieval practice trial (M = .22, SD = .15).

Discussion

In contrast with the results of Experiment 1, there was a clear context-dependent fixation effect observed in Experiment 2. The RAT problems tested with the encoding contexts of corresponding red herrings (i.e., the primed-reinstated context items), relative to RAT problems tested with new contexts (i.e., the primed-new context items), were solved at a significantly lower rate. The primary difference between the procedures used in Experiments 1 and 2 was that in Experiment 1, both RAT test words and photo contexts were practiced in association with red herrings, whereas in Experiment 2, only photo contexts were practiced with red herrings. In Experiment 1, the strong semantic associations between RAT test words and red herring words were made even stronger because they were practiced together. The presence of those strong verbal cues at test, regardless of the test context, caused an outshining effect (e.g., Smith, 1988) in Experiment 1. Because the associations between test words and red herrings were practiced in Experiment 1, but not Experiment 2, the verbal cues were much stronger in Experiment 1 than in Experiment 2. As a result, the cuing effects of photo contexts were not outshone by the stronger cuing effects of RAT test words in Experiment 2, as they were in Experiment 1. In addition, all items received one presentation trial and two retrieval practice trials before the critical RAT problems, based upon the suggestion in the results of Experiment 1 that multiple context-item repetitions may strengthen the context-red herring associations, making them more context-dependent.

In Experiment 2, we included a baseline control condition (i.e., the unprimed condition) in which there were no corresponding red herrings that had been encoded and practiced before the RAT was given. The percentage of RAT problems solved in the primed-reinstated context condition, relative to the unprimed condition, was 8% lower, a fixation effect size (d = .43) in the medium range (Cohen, 1977). In contrast, the 3% fixation effect in the primed-new context condition did not rise to significance, with a very small effect size (d = .16). In addition to showing that a test manipulation, context reinstatement, affects the magnitude effect, the present findings suggest that trying to solve creative problems in new contexts may be a way to relieve fixation.

General discussion

Our results are consistent with the red herring retrieval hypothesis, the notion that red herrings retrieved from memory cause fixation effects, and furthermore, that factors that improve memory of red herrings also increase fixation effects. When the test words of RAT problems (e.g., COTTAGE - SWISS - CAKE), acting as stimulus members of paired associates, were practiced with red herrings (e.g., hut, chocolate, icing), that is, words conceptually related to test words but unrelated to solutions (e.g., cheese), performance on the RAT problems decreased, a fixation or blocking effect. Furthermore, when test word-red herring pairs were practiced more (Experiment 1), and when the encoding contexts of red herrings were reinstated (Experiment 2) the observed fixation effect was greater. The degree of practice or repetition should affect an item’s storage strength (Bjork & Bjork, 1992), or that item’s potential for being remembered, whereas providing context cues should affect the item’s retrieval strength, or the amount of storage potential that can be utilized. Our experiments show that manipulations of both the storage and retrieval strength of red herrings can affect the degree of fixation that is observed.

A not completely unexpected result was the lack of a context-dependent fixation effect in Experiment 1. Previous research has shown that cued recall, using paired associates, is susceptible to the types of context manipulations that we used in the present experiments (e.g., Smith et al., 2014; Smith & Handy, 2014, 2016). Smith et al., (2014), for example, observed context-dependent cued recall using Swahili-English word pairs, Smith and Handy (2014) found the effect using face-name pairs, and Smith and Handy (2016) found it using Tagalog-English word pairs. The word pairs used in Experiment 1 of the present study, however, differed from previously reported methods because the two words of our pairs were in the same language, semantically related to each other (e.g., COTTAGE-hut, SWISS-chocolate, CAKE-icing). The presence of strong semantic cues at test has been shown to weaken or eliminate episodic context reinstatement effects, consistent with the outshining hypothesis (Smith & Vela, 2001). Consistent with this reasoning, a clear effect of context reinstatement on fixation was seen in Experiment 2 when red herrings were associated with unrelated context cues, rather than with semantically related RAT word cues, at encoding. Another reason that context effects were significant in Experiment 2, but not in Experiment 1, might have been that each of the red herrings in Experiment 2 were presented and practiced twice, all in one single context, and the repeated association of contexts with red herrings might have made them more context-dependent, a notion consistent with the non-significant interaction of context X repetition observed in Experiment 1. Although these explanations are logical, they are also speculative; the present experiments do not provide a critical test of them.

Our experiments broaden the linkage of research on creative thinking with memory research (e.g., Storm & Koppel, 2012; Koppel & Storm, 2014; Smith & Blankenship, 1989, 1991; Storm & Angello, 2010; Storm, Angello, & Bjork, 2011; Angello, Storm, & Smith, 2015). The sensitivity of RAT problem solving, a test often used to assess creative ability (e.g., Benedek & Neubauer, 2013), to manipulations of the memorability of red herrings in the present two experiments shows that this test of creative problem solving has acted as an indirect measure of memory (Richardson-Klavehn & Bjork, 1988). In contrast to direct memory measures, indirect measures are those that do not refer participants to the items being remembered, but that are nonetheless influenced by memory of those items. Performance on tests of word fragment completion and anagrams, for example, reflect the manipulated memorability of solutions to those simple types of problems (e.g., Graf & Schacter, 1985; Kelley, Jacoby, & Hollingshead, 1989), as well as the manipulated memorability of red herrings (e.g., Landau & Leynes, 2004; Smith & Tindell, 1997). It is not clear whether the fixation effects observed in the present experiments were caused by automatic retrieval or deliberate recollection of red herrings. It is clear, however, that most participants knew that red herrings were not correct solutions; red herrings were given as intrusions on only 1.85% of all RAT problems in Experiment 1, and on only 0.79% of the problems in Experiment 2. The degree to which the present findings are due to conscious recollection of red herrings, versus how much is due to unconscious, automatic retrieval, should be a focus of future research.

Implicit memory blocking effects have been mitigated by weakening the retrieval strength of red herrings (or blockers) via memory inhibition procedures, including directed forgetting (Koppel & Storm, 2012) and suppression induced forgetting (Angello et al., 2015) of red herrings (e.g., ANALOGY) that were designed to block word fragment completion (e.g., A_L__GY). an indirect test of memory (see Smith & Tindell, 1997). Mitigating fixation in creative problem solving might be accomplished the same way, that is, by inhibiting the associations between problem stimuli and red herrings. This possibility is consistent with findings that show significant correlations between an individual’s RAT problem solving performance and the magnitude of that individual’s retrieval-induced forgetting effect; perhaps individuals who are good at inhibiting memories are also good at inhibiting inappropriate responses that may come to mind (and cause fixation) while trying to solve RAT problems. The present findings, that repetitions and context reinstatement of red herrings increased fixation in creative (RAT) problem solving, are consistent with the idea that better retrieval of red herrings increases their blocking (i.e., fixation) effectiveness.

Our experimental manipulations of participants’ memory of red herrings, which moderated blocks in creative problem solving, is consistent with a considerable body of research on creative cognition that has linked fixation and constraints that limit creative thinking with the inappropriate use of prior knowledge. The fixating effects of implicit assumptions has been explored and discussed at length by Gestalt psychologists, an acknowledgment that prior knowledge can cause fixation effects such as mental set (or Einstellung), (Luchins & Luchins, 1959; Duncker, 1945; Koffka, 1935; Köhler, 1925; Wertheimer, 1945) or functional fixity (e.g., Adamson & Taylor, 1954; Maier, 1931). In a similar vein, in constraint relaxation theory (e.g., Knoblich et al., 1999) knowledge about what does and does not serve as a satisfactory solution to a creative problem is described in terms of constraints, with such constraints having been learned and remembered from experiences with similar problems in one’s past. Likewise, the theory of structured imagination (Ward, 1994), which states that creative imagination is based on one’s conceptual knowledge, describes prematurely constrained idea development resulting from taking the “path of least resistance” after beginning with an overly constrained concept. And, using RAT problems whose incorrect solutions were baseball terms Wiley (1998) showed that prior knowledge about baseball could impede creative problem solving. What the present study adds to this body of research is the use of experimental paradigms that manipulate the memory of red herrings, or inappropriate solutions.

The red herring retrieval theory, which states that creative problem solving performance depends on the retrieval strength of red herrings, is closely related to Smith and Blankenship’s forgetting fixation theory of incubation effects. This theory of incubation states that a decrease in memory of fixating ideas is the cause of incubation effects, defined as resolution of initial blocks (or fixation) as a function of forgetting red herrings (blockers) during the incubation interval. Smith and Blankenship (1989), for example, showed that longer incubation intervals caused both poorer memory of blockers and better resolution of initially unsolved rebus problems, consistent with the forgetting fixation theory of incubation effects. Theories that link the passage of time with changes in one’s mental context, that is, contextual fluctuation theories (Estes, 1955; Bower, 1972; Glenberg, 1979; Howard & Kahana, 2002; Mensink & Raaijmakers, 1989), are consistent with the notion that contextual fluctuation, after initial fixation on a problem, rather than the passage of time per se, may be what mitigates fixation during an incubation period. The findings of context-dependent fixation in Experiment 2 of the present study indicate that initial fixation might be mitigated by a change of contexts, regardless of the passage of time. Such an explanation of “incubation” effects suggests a method of enhancing incubation effects may be to present initially unsolved problems in new contexts.

In conclusion, we have shown that creative problem solving is impeded by manipulations of factors that improve memory of red herrings for those problems. In Experiment 1, repeated practice of associations between problems and red herrings caused greater fixation. In Experiment 2, practice of associations between contexts and red herrings caused greater fixation for problems tested in those practiced contexts. The fact that retrieval of fixating ideas can be triggered by problems or by contexts shows that there can be multiple sources of fixation in creative problem solving, and indicates that efforts to reduce or overcome fixation might focus on memory cues provided by problems and/or by problem solving contexts.