What kind of processing is survival processing?

Kroneisen, Meike; Rummel, Jan; Erdfelder, Edgar

doi:10.3758/s13421-016-0634-7

What kind of processing is survival processing?

Effects of different types of dual-task load on the survival processing effect

Published: 01 August 2016

Volume 44, pages 1228–1243, (2016)

Memory & Cognition

Abstract

Words judged for their relevance in a survival context are remembered better than words processed in non-survival contexts. This phenomenon is known as the survival processing effect. Recently, inconsistent results were reported on whether the size of the survival processing effect is affected by cognitive load. Whereas Kroneisen, Rummel, and Erdfelder (Memory 22: 92-102, 2014) observed that the survival processing effect vanishes under dual-task conditions, Stillman, Coane, Profaci, Howard, and Howard (Memory & Cognition 42: 175-185, 2014, Experiment 1) found that the size of the survival processing effect is essentially unaffected by a cognitively demanding secondary task. In three experiments, we investigated the differences between these studies to achieve a better understanding of dual-task effects on the survival-processing advantage. In the first experiment, we replicated Stillman et al.’s results using their dual-task conditions combined with a sample more than twice as large as theirs. In the second experiment, we compared dual-task conditions that differed regarding how strongly the secondary task taxed (a) working memory load (maintenance of one vs. several items) and (b) processing demands (switching vs. time-sharing between tasks). A third experiment focused on low (i.e., single-item) load under time-sharing processing conditions. Results consistently showed that the survival processing effect persisted under low load but vanished when the number of items held in working memory increased beyond one, irrespective of processing demands. Implications of these findings for explanations of the survival-processing advantage are discussed.

Memory research has often focused on structural mechanisms, making use of more or less abstract materials in artificial learning environments. However, it seems unlikely that human memory evolved to learn, process, and store abstract information. If the evolution of human memory was shaped by the process of natural selection, then structural properties of memory should reflect their functionality (Tooby & Cosmides, 1992). The selection pressure, that is, the adaptive problem to be solved, constrains how and why a structure develops and also the form it takes. Looking at our memory system, it seems implausible that its function is only to remember the past. It seems more plausible that we need to remember the past to predict the likelihood of events occurring in the future (Suddendorf & Corballis, 1997; Tulving, 2002). Specifically, memory could be designed to retain information relevant for future survival, for example, by remembering the location of relevant food resources or potable water.

In line with this idea, Nairne, Thompson, and Pandeirada (2007) were able to demonstrate that verbal information processed in the context of an imagined ancestral survival scenario (i.e., being stranded in the grasslands of a foreign land) is recalled better than information encoded using alternative mnemonic procedures that proved to be highly effective. This phenomenon is known as the survival processing effect (SPE). In the typical experimental design, words are rated with respect to their relevance for an ancestral grassland survival scenario characterized by predators, lack of food, and lack of potable water. After a short distractor task, memory performance for the previously rated words is tested in a surprise free-recall test. Recall performance is typically much better in the survival group compared to various control groups, for example, groups that were required to rate the same words with respect to their pleasantness (e.g., Nairne et al., 2007; Nairne, Pandeirada, & Thompson, 2008) or their self-relevance (Kostic, McFarlan, & Cleary, 2012; Nairne et al., 2008; but see Klein, 2012), that received imagery instructions instead (Nairne & Pandeirada, 2008), or that rated the relevance of the same words for a control scenario such as a moving scenario (e.g., Nairne et al., 2007) or emotionally more arousing control scenarios (e.g., Butler, Kang, & Roediger, 2009). Moreover, one of the studies from Nairne et al. (2008) was also successfully replicated as part of the reproducibility project (Renkewitz & Müller, 2015, July 13). Thus, the SPE can be considered a very stable memory phenomenon that does not depend on comparisons with a specific control condition.

However, it is still an open debate which mechanisms drive the survival processing advantage (for reviews, see Erdfelder & Kroneisen, 2014; Howe & Otgaar, 2013; Kazanas & Altarriba, 2015; Nairne & Pandeirada, 2016). As outlined by Kroneisen et al. (2014), there are different proximate explanations of the effect. According to evolutionary explanations, the SPE reflects “hardwired” automatic processes based on evolved domain-specific cognitive modules. For example, Nairne, Vasconcelos, and Pandeirada (2011) argued that the SPE can be seen as evidence that human learning and memory systems have been selectively tuned during evolution to process and retain information that is relevant to fitness. One implication of the evolutionary account is that the survival processing advantage should be very general and thus hold across a broad range of cultural and situational context conditions. In line with this prediction, the SPE has indeed been found in all countries in which it has been investigated so far (USA: e.g., Nairne et al., 2007; Germany: e.g., Kroneisen & Erdfelder, 2011; UK: e.g., Howe & Derbish, 2010; The Netherlands: e.g., Otgaar, Smeets, & van Bergen, 2010; Japan: e.g., Nouchi, 2012), and it also proved to be particularly robust across a wide range of contexts. Moreover, the SPE was replicated both in children (Aslan & Bäuml, 2012; Otgaar & Smeets, 2010, Exp. 2) and in older adults (Nouchi, 2012; Yang, Lau, & Truong, 2014; but see Otgaar, Jelicic, & Smeets, 2015, and Stillman, Coane, Profaci, Howard, & Howard, 2014, for different results). It was also found when replacing words with other to-be-remembered materials, for example, pictures (e.g., Otgaar, Smeets, & van Bergen, 2010). Thus, there is little doubt that the SPE is a robust and rather general memory phenomenon as predicted by evolutionary accounts (cf. Erdfelder & Kroneisen, 2014; Kroneisen & Erdfelder, 2016; Nairne & Pandeirada, 2016).

Evolutionary explanations stand in stark contrast to explanations arguing that survival processing may enhance memory because it recruits a powerful set of domain-general encoding processes more than typical control conditions do. One example is the richness of encoding (RE) hypothesis (Kroneisen & Erdfelder, 2011; Röer, Bell, & Buchner, 2013). According to this hypothesis, survival relevance ratings trigger richness of encoding, that is, each rated object is cognitively linked to a variety of possible functions it can have in ancestral survival contexts, both standard and atypical functions. By implication, thinking about possible survival functions of objects leads to highly distinctive and unique memory representations. In line with this hypothesis, Röer et al. (2013) found that SPE strength increases with the number of unique relevance arguments generated per item. Along the same lines, Bell, Röer, and Buchner (2015) further showed that rating the usefulness of objects compared to rating the dangerousness of objects in a survival scenario leads to better recall. Obviously, thinking about functions of objects (thus increasing the richness of encoding) results in better memory performance than thinking about the strength of emotions elicited by objects, as predicted by the RE hypothesis.

Domain-general “deep” encoding processes such as elaboration and distinctive processing assumed in the RE hypothesis involve consciously controlled forms of encoding that are cognitively demanding. By implication, they require selective attention and working memory capacity. Hence, the SPE should diminish or perhaps even vanish when working memory resources are scarce. This conflicts with the evolutionary view that implies stability of the survival processing advantage – as an evolved adaptation – irrespective of working memory resources. Evolutionary accounts often assume that psychological adaptations work automatically and unconsciously, that is, even if working memory resources are scarce. Fodor (1983), for example, claimed that psychological adaptations should meet the criteria of automaticity, domain specificity, encapsulation, inaccessibility to consciousness, speed, shallow outputs, fixed brain location, and characteristic breakdown patterns. Admittedly, not all scholars agree with these criteria (see, e.g., Barrett & Kurzban, 2006; Jung, Ruthruff, Tybur, Gaspelin, & Miller, 2012). Nevertheless, if the survival processing advantage can be traced back to an evolved adaptation although it requires working memory resources, the least we would expect is that survival processing should be prioritized by default when several demanding tasks are processed simultaneously (see also Kroneisen et al., 2014, for a detailed discussion). Hence, the SPE should still persist under working memory load, in exchange for a deterioration in secondary task performance compared to control conditions.

There are several ways to test these conflicting predictions against each other. One option is to assess the strength of the SPE in populations known to have lower working memory resources compared to healthy young adults, for example, older adults. According to evolutionary accounts, the SPE should be unaffected in older adults whereas it should be reduced according to the RE hypothesis. The available evidence is more in agreement with the latter prediction. As outlined above, Nouchi (2012) and Yang, Lau, and Truong (2014) found a significant SPE in older adults, too. However, in line with the RE hypothesis, in the experiment by Nouchi (2012) this effect was smaller in older than in younger adults as indicated by a significant interaction. Clear-cut results were also observed by Stillman et al. (2014): In none of their three experiments was there any evidence for a significant SPE in older adults. Similarly, there was also no SPE for older adults in a recent study by Otgaar, Jelicic, and Smeets (2015). However, despite these unequivocal results, one should be reluctant to discard the evolutionary account based on age group comparisons only, because young and older adults differ on many dimensions of which working memory capacity is only one. Without additional evidence, it remains unclear whether the often weaker (or even absent) SPE in older adults is, in fact, due to reduced working memory resources.

A more direct way to assess the causal role of working memory capacity in the SPE is to manipulate processing demands and working memory load experimentally, using only healthy young adults as participants. Here, the current evidence is mixed. Kroneisen et al. (2014) found no survival processing advantage when the survival task was combined with an auditory continuous choice reaction time (CRT) task (cf. Naveh-Benjamin, Guez, & Marom, 2003). Participants in the experiment by Kroneisen et al. (2014) rated words for their relevance to either a survival or a moving scenario under either standard conditions or dual-task conditions. Participants in the dual-task conditions were presented with a random sequence of high- and low-pitch tones via headphones while rating the words. They were instructed to press the spacebar whenever the same tone was presented three times in a row.

In contrast to Kroneisen et al. (2014), Stillman et al. (2014, Exp. 1, young adults) were able to detect a significant survival processing advantage with the same two scenarios even when participants performed a secondary task during the relevance rating. In their Experiment 1, the size of the SPE was essentially unaffected by whether young adults rated the words with or without a concurrent secondary task. Notably, however, their secondary task differed from the one used by Kroneisen et al. (2014). In Stillman and collaborators’ secondary task, a high-pitched tone or a low-pitched tone was presented following each relevance rating. Participants were required to count the number of high-pitched tones and report the total after all 32 words had been rated.

The purpose of the present paper is to clarify the reasons for the conflicting results of Kroneisen et al. (2014) and Stillman et al. (2014, Experiment 1), thereby contributing to the debate on whether survival processing is cognitively demanding as predicted by the RE hypothesis or automatic as predicted by evolutionary accounts of the SPE. The experiments of Kroneisen et al. (2014) and Stillman et al. (2014, Exp. 1, young adults) made use of the same basic 2 × 2 design, that is, type of scenario (survival vs. moving) was crossed with processing condition (single task vs. dual task). In this design, a significant interaction is indicative of a reduction in the size of the SPE under load. In addition, they also employed the same learning materials (i.e., word lists of Nairne et al., 2007). Apart from differences in language and nationality of the participants, the only important discrepancies between experiments concerned (1) the sample size across the four groups of the 2 × 2 design and (2) some core features of the secondary task used in the dual-task conditions of the design.

To isolate possible causes for the discrepant results, we first looked at differences in sample size. Whereas Kroneisen and collaborators analyzed data of 169 participants across the four groups, Stillman and collaborators’ conclusions concerning young adults are based on 52 participants only. Thus, enhanced sampling error alone could be a reason for not finding a significant decrease of the SPE in the dual-task condition employed in Stillman et al.’s (2014) first experiment.

To test for the possibility that enhanced sampling error is responsible for the divergent results, an exact replication of Stillman et al.’s experiment with sufficient power to detect a reduction of the SPE under dual-task conditions, if one exists, seems necessary. We thus opted to replicate their study with about 120 young adults across the four groups of the 2 × 2 between-subjects design, that is, approximately 30 participants per group. With this overall sample size and a type-1 error risk of α = .05, the power to detect a medium-sized interaction effect (f = .25, see Cohen, 1988) of type of scenario with processing condition is close to the .80 power level recommended by Cohen (1988) (for exact results, see below). In contrast, Stillman and collaborators’ experiment with the overall sample size of 52 had a power of only .42 for the same effect size.

There are two possible outcomes of the replication study. If the replication results in a significant reduction of the SPE under the dual-task conditions employed by Stillman et al. (2014, Exp. 1), similar in size to the one previously observed by Kroneisen et al. (2014), then this would suggest that sampling error alone suffices as an explanation for Stillman and collaborators’ original results, making it unnecessary to investigate differences between the secondary tasks of both experiments. If, in contrast, the replication is successful and again shows no significant reduction of the SPE under Stillman and collaborators’ dual-task conditions despite sufficient power, then this would suggest that differences in the dual-task conditions employed by Kroneisen et al. (2014) and Stillman et al. (2014, Exp. 1) are responsible for the discrepant results. Experiment 1 was designed to evaluate these possibilities.

Experiment 1

Method

Participants

Fifty-nine students from the University of Mannheim and 61 students from Heidelberg University participated for course credit or monetary compensation. Two participants were excluded due to low recall performance (number of recalled items below 5) or experience with the basic design of the survival-processing experiments. Thus, data analyses are based on the remaining 118 participants (27 male) who either received monetary incentives or course credit for compensation. Participants’ age ranged from 18 to 36 years (M = 21.55; SD = 3.33). For a medium effect size f = .25 (Cohen, 1988), α = .05, and N = 118, the power of the 2 × 2 interaction F-test is .77 (calculated with G*Power, cf. Faul, Erdfelder, Buchner, & Lang, 2009).
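The power values quoted for the 2 × 2 interaction test can be approximated without G*Power. The following is a minimal sketch, not the authors' procedure: it assumes the usual noncentral-F formulation with noncentrality λ = f² · N, one numerator degree of freedom, and N − 4 denominator degrees of freedom, and it uses only the Python standard library. All function names are illustrative. Under these assumptions the results should land close to the reported .77 for N = 118 and .42 for N = 52.

```python
import math

def _betacf(a, b, x, max_iter=500, eps=1e-12):
    """Continued fraction for the incomplete beta function (modified Lentz method)."""
    tiny = 1e-300
    def clamp(v):
        return v if abs(v) > tiny else tiny
    c = 1.0
    d = 1.0 / clamp(1.0 - (a + b) * x / (a + 1.0))
    h = d
    for m in range(1, max_iter + 1):
        m2 = 2 * m
        aa = m * (b - m) * x / ((a - 1.0 + m2) * (a + m2))           # even step
        d = 1.0 / clamp(1.0 + aa * d)
        c = clamp(1.0 + aa / c)
        h *= d * c
        aa = -(a + m) * (a + b + m) * x / ((a + m2) * (a + 1.0 + m2))  # odd step
        d = 1.0 / clamp(1.0 + aa * d)
        c = clamp(1.0 + aa / c)
        delta = d * c
        h *= delta
        if abs(delta - 1.0) < eps:
            break
    return h

def reg_inc_beta(a, b, x):
    """Regularized incomplete beta function I_x(a, b)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    front = math.exp(math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
                     + a * math.log(x) + b * math.log1p(-x))
    if x < (a + 1.0) / (a + b + 2.0):
        return front * _betacf(a, b, x) / a
    return 1.0 - front * _betacf(b, a, 1.0 - x) / b

def f_cdf(x, df1, df2):
    """CDF of the central F distribution."""
    return reg_inc_beta(df1 / 2.0, df2 / 2.0, df1 * x / (df1 * x + df2))

def f_critical(alpha, df1, df2):
    """Upper-tail critical value of the central F distribution, found by bisection."""
    lo, hi = 0.0, 1000.0
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if f_cdf(mid, df1, df2) < 1.0 - alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def interaction_power(f, n_total, n_groups=4, df1=1, alpha=0.05):
    """Power of the F-test, assuming noncentrality lambda = f^2 * N."""
    df2 = n_total - n_groups
    lam = f * f * n_total
    crit = f_critical(alpha, df1, df2)
    y = df1 * crit / (df1 * crit + df2)
    # Noncentral F CDF as a Poisson(lambda / 2) mixture of central beta CDFs.
    cdf, weight = 0.0, math.exp(-lam / 2.0)
    for j in range(100):
        cdf += weight * reg_inc_beta(df1 / 2.0 + j, df2 / 2.0, y)
        weight *= (lam / 2.0) / (j + 1)
    return 1.0 - cdf

print(round(interaction_power(0.25, 118), 2))  # present sample
print(round(interaction_power(0.25, 52), 2))   # Stillman et al.'s young-adult sample
```

The Poisson-mixture form is chosen here only because it needs nothing beyond the incomplete beta function; a statistics library with a noncentral F distribution would give the same quantity more directly.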

Apparatus and materials

Following Kroneisen et al. (2014), stimulus materials (i.e., words to be rated for their relevance) were obtained from Nairne et al.’s (2007) first experiment and translated into German. Thus, target words were 30 typical words from 30 unique categories. To absorb primacy and recency effects typically found in free recall, we added 12 buffer words drawn from the German version of the Battig and Montague norms (Mannhaupt, 1983), six at the beginning and six at the end of the list. All words, except the buffer words, were presented in random order. The survival and moving descriptions were German translations of those used by Nairne et al. (2007). The stimulus presentation was controlled by personal computers running Eprime 2.0 (Psychology Software Tools, Pittsburgh, PA, USA).

To replicate the results from Stillman et al. (2014), we developed a task analogous to the one they used. To this end, following the presentation and the rating of each individual word, participants heard either a high- or a low-pitched tone (440 Hz and 330 Hz) and were asked to count all high-pitched tones. Tones were presented immediately after the presentation of each individual word. Hence, 42 tones were presented in total. After the rating task, participants were asked to write down the overall number of high-pitched tones.

Design

A 2 (Type of scenario: survival vs. moving) × 2 (Processing condition: single task vs. dual task) between-subjects design was used. Participants were randomly assigned to one of four groups (Group 1: survival scenario, single task, n = 31, Group 2: moving scenario, single task, n = 29, Group 3: survival scenario, dual task, n = 30, Group 4: moving scenario, dual task, n = 28).

The word-rating phase consisted of 42 trials, including the 12 buffer words at the beginning and at the end of the word list. Recall performance, response latencies, relevance ratings, and performance in the secondary task served as dependent variables.

Procedure

Participants first read a scenario description (i.e., either the survival or the moving scenario, depending on the experimental condition; see Kroneisen et al., 2014). They were then asked to rate the relevance of each item with respect to the relevant scenario. After two practice trials, half of the participants got the additional instruction to rate the items while performing a secondary task. Two different tones were presented to the participants via headphones. Their task was to count the overall number of the high-pitched tones. In the dual-task conditions, participants were asked to perform the secondary task without sacrificing the primary task (i.e., rating the relevance of the words presented). To familiarize participants with the tones, each of the two tones was presented once in the offset of the rating task. This was followed by a short practice phase where three words had to be rated for relevance under dual-task conditions. Subsequently, the main rating task (either with or without the secondary task, depending on experimental condition) started. Stimuli were presented individually for 5 s each. We asked participants to rate each word on a five-point scale, with 1 indicating “absolutely not relevant” and 5 indicating “extremely relevant” for the given scenario. Participants were required to respond within 5 s. If they failed to respond in time, a warning message appeared. The actual rating task was preceded by two short practice trials. After the rating task, participants performed a distractor task (i.e., filling in an unrelated questionnaire) for 12 min and were then unexpectedly prompted with a free recall test for the words previously processed in the relevance-rating task. For this recall test, a time limit of 8 min was set for all participants. The experiment took approximately 30–35 min in total.

Results

The significance level was set to α = .05 for all statistical tests.

Recall performance

Figure 1 displays the mean proportions of words correctly remembered in the free recall test and their standard errors for all groups. A 2 (type of scenario) × 2 (processing condition) ANOVA for mean proportions of words recalled revealed a significant main effect of processing condition (F(1, 114) = 10.86, p = .001; η_p² = 0.09). Under cognitive load, recall performance was generally worse (Fig. 1). In addition, there was a significant main effect of scenario (F(1, 114) = 20.51, p < .001; η_p² = 0.15), but no significant interaction (F(1, 114) = 0.30, p = .59; η_p² = 0.003), indicating an equally strong survival processing advantage under both single and dual-task conditions (M_{Survival; single} = 17.09, SD = 3.11; M_{Moving; single} = 13.93, SD = 3.60; M_{Survival; dual} = 14.70, SD = 3.55; M_{Moving; dual} = 12.21, SD = 3.27).
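As a reading aid, the reported effect sizes can be recovered from the F statistics via the standard identity η_p² = (F · df_effect) / (F · df_effect + df_error), and the size of the survival advantage in each processing condition follows directly from the four cell means. The sketch below uses only the values printed above; the function and variable names are illustrative.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)

# F values reported for the recall ANOVA, each with df = (1, 114).
reported_f = {
    "processing condition": 10.86,
    "scenario": 20.51,
    "scenario x processing condition": 0.30,
}
for effect, f_value in reported_f.items():
    print(effect, round(partial_eta_squared(f_value, 1, 114), 3))

# Cell means reported for recall in Experiment 1.
means = {
    ("survival", "single"): 17.09,
    ("moving", "single"): 13.93,
    ("survival", "dual"): 14.70,
    ("moving", "dual"): 12.21,
}
# Survival advantage within each processing condition.
spe_single = means[("survival", "single")] - means[("moving", "single")]
spe_dual = means[("survival", "dual")] - means[("moving", "dual")]
# The interaction term tests this difference of differences.
interaction_contrast = spe_single - spe_dual
print(round(spe_single, 2), round(spe_dual, 2), round(interaction_contrast, 2))
```

The interaction contrast is the quantity that would have to differ reliably from zero for the SPE to count as reduced under load.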

Response times

We also analyzed the response times of the rating task to see whether they were affected by the processing conditions. Table 1 shows the median response times of the participants’ ratings for all conditions. A 2 (type of scenario: survival vs. moving) × 2 (processing condition: single vs. dual task) ANOVA of response times showed a significant main effect of scenario, F(1, 114) = 5.17, p = .03; η_p² = 0.04, indicating that ratings took longer in the survival condition. There was no significant main effect of processing condition (F(1, 114) = 1.42, p = .23; η_p² = 0.01) nor a significant interaction (F(1, 114) = 0.30, p = .59; η_p² = 0.003).

Table 1 Means (M) and standard errors (SEMs) of participants’ median response times, and participants’ relevance ratings (relevance rating: 1 = “absolutely not relevant”, 5 = “extremely relevant”) in Experiment 1

Rating-task results

To test whether participants’ ratings differed between scenarios or processing conditions, we compared the mean rating scores between conditions. Relevance ratings were provided for 98.39% of the presented words. As Table 1 shows, average ratings were higher for the survival scenario in comparison to the moving scenario. An ANOVA revealed a significant main effect of scenario (F(1, 114) = 8.55, p = .004; η_p² = 0.07), but neither a significant main effect of processing condition (F(1, 114) = 0.22, p = .64; η_p² = 0.002) nor a significant interaction between scenario and processing conditions (F(1, 114) = 0.13, p = .72; η_p² = 0.001).

As is common in analyses of the SPE, the data were also analyzed to determine possible congruity effects, that is, whether items with higher ratings were also better remembered. Figure 2 shows the proportion of words recalled correctly in the free recall test as a function of initial rating, type of scenario, and cognitive load. For all conditions, a significant correlation between ratings and the proportion of words recalled was found. Controlling for overall recall performance of the participants, the partial correlation between rating and recall rates was significant for the survival, single-task group (r = .16; p < .001), the moving, single-task group (r = .25; p < .001), the survival, dual-task group (r = .21; p < .001), and the moving, dual-task group (r = .31; p < .001).

Secondary-task performance

To test whether involvement in the secondary task was comparable between the two scenarios, we compared the number of correct responses to the secondary task in the survival and in the moving group. There was no performance difference between the two conditions, t(56) = 0.81, p = .42, two-tailed, η_p² = 0.01. Thus, there was no evidence that participants in the survival group prioritized the survival-rating task at the cost of the secondary task.

Discussion

In this experiment, we replicated the effect found by Stillman et al. (2014) using a sample more than twice as large as the original sample. We combined the standard survival and moving scenarios with Stillman and collaborators’ working memory load manipulation. Both scenarios have often been used to assess the size of the survival-processing effect (e.g., Nairne et al., 2007). In line with previous research, the secondary task led to a decrease in overall recall performance (e.g., Craik et al., 1996; Fernandes & Moscovitch, 2000; Kroneisen et al., 2014; Stillman et al., 2014, Exp. 1). Replicating Stillman et al. (2014, Exp. 1), we observed SPEs of similar sizes in both the single-task and the dual-task conditions, as evident from a significant main effect of scenario and no interaction. Importantly, the nonsignificant interaction effect cannot be attributed to low statistical power.

The secondary task performance also did not differ between the survival and the moving conditions. Further, neither the relevance ratings nor their latencies varied as a function of single- and dual-task conditions. Thus, there was no evidence that people prioritized the primary task over the secondary task under any of the conditions.

However, we found a significant difference in response times between scenarios irrespective of processing conditions: Relevance ratings with respect to the moving scenario required less time than survival-relevance ratings. In addition, we also observed congruity effects between relevance ratings and memory performance, that is, higher perceived item relevance was associated with better recall later on. Similar effects were already obtained in previous studies (Butler et al., 2009; Kroneisen et al., 2013; Kroneisen et al., 2014; Kroneisen & Erdfelder, 2011; but see Nairne & Pandeirada, 2011).

In a nutshell, we replicated Stillman and collaborators’ (2014, Exp. 1) results with a sample size more than twice as large as theirs, thus effectively ruling out the possibility that sampling error is responsible for the differences in results between Kroneisen et al. (2014) and Stillman et al. (2014, Exp. 1). This suggests that the discrepant results are due to differences in key features of the secondary tasks used in both experiments. However, what are the key features that might make a difference?

In the secondary task used by Stillman et al. (2014), a high-pitched tone or a low-pitched tone was presented following each relevance rating. Participants were required to count the number of high-pitched tones and report the total after all items had been rated. This secondary task thus allowed participants to switch between the primary task and the secondary task rather than to share processing time between tasks, because the tones always occurred in the inter-trial interval between the to-be-rated words. In addition, only a single number (i.e., the total score) had to be kept in active working memory and updated every time the next high-pitched tone was presented. By contrast, in the secondary task from Kroneisen et al. (2014) participants were presented with a random tone sequence consisting of two different auditory tones. Their task was to press the spacebar whenever the same tone was presented three times in a row. This CRT task was timed independently from the rating task so that participants could not simply switch back and forth between the primary task (the relevance rating) and the secondary task (the response to an auditory stimulus). Rather, participants were forced to perform both tasks concurrently all the time, that is, to share processing time between both tasks. In addition, the amount of information to be held in working memory was larger than in Stillman and collaborators’ study, as it involved the previous two tones, which had to be updated every time a new tone was presented.
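The difference in what each secondary task requires a participant to hold in mind can be made concrete with a small sketch. It is an illustration of the two decision rules as described above, not the software used in either study; the tone labels, function names, and example sequence are hypothetical.

```python
from collections import deque

HIGH, LOW = "high", "low"  # illustrative labels for the two tone pitches

def count_high_tones(tones):
    """Counting task: a single running total is the only state to maintain."""
    total = 0
    for tone in tones:
        if tone == HIGH:
            total += 1  # update the one number held in memory
    return total

def three_in_a_row_targets(tones):
    """Continuous task: indices at which the same tone has occurred three times in a row.

    The state to maintain is the identity of the previous two tones, which
    has to be refreshed after every single tone.
    """
    previous_two = deque(maxlen=2)
    targets = []
    for index, tone in enumerate(tones):
        if len(previous_two) == 2 and previous_two[0] == previous_two[1] == tone:
            targets.append(index)
        previous_two.append(tone)
    return targets

# Hypothetical nine-tone sequence. It contains no run longer than three,
# because the text does not say how longer runs were scored.
tones = [HIGH, HIGH, HIGH, LOW, HIGH, LOW, LOW, LOW, HIGH]
print(count_high_tones(tones))        # 5
print(three_in_a_row_targets(tones))  # [2, 7]
```

The sketch captures only the memory-load difference (one number versus two tones); the switching versus time-sharing difference is a matter of when the tones occur relative to the words and is not modeled here.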

In Experiment 2 we further investigated the effects of two crucial differences between the secondary tasks of Kroneisen et al. (2014) and Stillman et al. (2014) on the size of the SPE, that is (1) the necessity of time-sharing versus switching between the primary and the secondary task and (2) the amount of information to be maintained and updated in working memory (two items vs. one item).

Experiment 2

To assess how these different key features of the secondary tasks affect survival processing, we compared recall rates for the standard survival and moving conditions (Nairne et al., 2007) under three different dual-task conditions in Experiment 2. All three secondary tasks were different versions of an auditory choice reaction time task (i.e., 1-back and 2-back tasks, cf. D’Esposito & Postle, 2002). The three tasks differed in (1) whether they require maintaining, retrieving, and updating a single item versus two items in working memory (i.e., low vs. high working memory load induced by 1-back vs. 2-back tasks, respectively) and (2) whether they involve switching versus time-sharing between primary and secondary tasks. If high concurrent working memory load is crucial for the elimination of the SPE as observed by Kroneisen et al. (2014), then we would expect the SPE to vanish in the 2-back task conditions but not in the 1-back task condition. If, however, time-sharing between the primary and the secondary task is crucial, then we would expect the SPE to vanish in the condition involving time-sharing but not in the conditions requiring switching between primary and secondary tasks.
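For readers unfamiliar with n-back tasks, the sketch below shows the textbook decision rule: a stimulus is a target when it matches the one presented n positions earlier, so n items must be held and updated. This is an assumption about the general task family only; the passage above does not specify how the authors' 1-back and 2-back variants defined a target, and the function name and example sequence are hypothetical.

```python
from collections import deque

def n_back_targets(tones, n):
    """Indices at which the current tone matches the tone presented n positions earlier."""
    if n < 1:
        raise ValueError("n must be at least 1")
    held = deque(maxlen=n)  # the n most recent tones, i.e., the memory load
    targets = []
    for index, tone in enumerate(tones):
        if len(held) == n and held[0] == tone:
            targets.append(index)
        held.append(tone)  # the oldest tone drops out automatically
    return targets

# Hypothetical tone sequence.
tones = ["high", "high", "low", "high", "low", "low"]
print(n_back_targets(tones, 1))  # [1, 5]
print(n_back_targets(tones, 2))  # [3, 4]
```

Raising n from 1 to 2 leaves the stimuli and the response unchanged and alters only how many items must be maintained, which is the load manipulation the predictions above rely on.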