The Deese/Roediger–McDermott (DRM) paradigm (Deese, 1959; Roediger & McDermott, 1995) has been widely used to study false memories. The typical procedure involves presenting participants with a sequence of words that are semantically related to a common associate word (i.e., the critical lure). At test, the participants frequently remember the critical lure, despite the fact that it was never presented. For example, they may be presented with a list of words, such as sour, candy, sugar, bitter, good, taste, and so forth, and when tested, they will often remember the critical lure sweet, despite the fact that sweet was never presented. This general finding has been replicated in a number of studies that have established that the DRM effect is quite robust (see Gallo & Roediger, 2001; Marsh, McDermott, & Roediger, 2004; Norman & Schacter, 1997; Payne, Elie, Blackwell, & Neuschatz, 1996; Seamon, Luo, & Gallo, 1998; Stadler, Roediger, & McDermott, 1999). Roediger and McDermott took Deese’s six strongest lists and demonstrated robust false recall and recognition. In their second experiment, they developed an additional 18 lists, providing a total of 24 lists. Their results showed false recognition equal to veridical recognition and a level of false recall that was similar to that of correct recall. Stadler, Roediger, and McDermott conducted a norming study showing that the different lists, although constructed in similar manners, varied considerably in the probabilities at which the critical lure was falsely recalled or recognized. Subsequent studies identified some factors that determine false memories in the DRM paradigm (Brainerd, Yang, Reyna, Howe, & Mills, 2008; Madigan & Neuse, 2004; Roediger, Watson, McDermott, & Gallo, 2001).

The DRM paradigm has been limited to English-speaking participants, with few studies examining illusory memories in other languages (for exceptions, see Anastasi, De Leon, & Rhodes, 2005; Anastasi, Rhodes, Marquez, & Velino, 2005; Pérez-Mata, Read, & Diges, 2002; Zeelenberg & Pecher, 2002). Pérez-Mata et al. tested Spanish speakers using associative Spanish lists. Zeelenberg and Pecher presented participants with items from the DRM lists that had been directly translated into Dutch. Anastasi, De Leon, and Rhodes (2005; Anastasi, Rhodes, et al., 2005) used a similar method by directly translating six of the DRM lists into Spanish, Japanese, and German, in order to test foreign exchange students in their native languages. To our knowledge, normative data for Italian DRM lists comparable to those reported by Stadler et al. (1999), for English, and Anastasi, De Leon, and Rhodes, for Spanish, have not yet been provided. Considering the Italian equivalents of the critical lures used by Roediger and McDermott (1995) and Stadler et al., we developed a set of 24 Italian word lists for the present study. Then we presented each of these 24 lists to Italian-speaking individuals and tested their memory, using both recall and recognition tests and measures. Finally, we present normative data for both the list items and the critical lures for each of the lists developed.

Method

Participants

All the participants reported Italy as their country of origin and Italian as their native language. Ninety participants produced association norms, which were then used to create Italian DRM lists. Their average age was M = 28.94 years (SD = 11.06). Once the 24 DRM lists had been created, we tested them using 68 additional Italian students of Sapienza University of Rome. We split the sample of 68 participants into two groups of 34 that were used to norm the Italian lists using recall and recognition tests. All of the 68 participants lived in Italy and reported Italian as their native language. The average age of the participants was M = 23.93 years (SD = 1.98), and they were tested in groups of up to eight people each.

Materials and procedure for association norms

The Italian lists in the present study were constructed using the same general method of Anastasi, De Leon, and Rhodes (2005). Anastasi, De Leon, and Rhodes had employed a PowerPoint presentation of the lists; instead, we used an online survey to present the participants with the Italian equivalents of the 24 critical lures one at a time. We collected the responses of 90 participants in a discrete association task. Participants were instructed to write in a textbox the first word that came to mind. All of the instructions were given in Italian. We then constructed 24 DRM lists from the 15 most frequent responses to each of the 24 critical lures that had been used as cues in the preceding discrete association task. The Italian DRM lists developed are provided in the Appendix, along with each item’s approximate English translation. Some words had to be replaced by the next most frequent associate, due to replication in other lists.Footnote 1

Materials and procedure for the 24 DRM lists

Twelve lists were presented to each of the two groups of 34 participants, in order to assess the likelihood that each list would produce false memories of the critical lure during recall and recognition testing (see the Appendix).

Several sessions of the same experiment were conducted in a room of the Psychology Faculty at Sapienza University of Rome. The participants who were tested in order to norm the lists were instructed to view each word as it was presented on a 4 m2 (2 × 2) screen for 2 s and were informed that their memory for these words would be tested.Footnote 2 Participants then viewed each word of the first list presented in 80-point font, centered on the screen. Then participants were instructed to write down all of the words that they could remember. This was repeated for all 12 lists of each group. After the final recall test, the participants were instructed to compile a 72-item, paper-based, yes–no recognition test that contained 36 list items (three from each of the 12 presented lists), the 12 critical lures from the presented lists, and 24 nonlist items. The nonlist items were unrelated to the items in the lists and to the critical lures. All of the instructions were given in Italian.

Results

We used SPSS software for our data analysis.

Recall data

The list and critical lure recall data for each Italian list are presented in Table 1. This table also provides the proportions of list items and critical lures recalled in the English and Spanish lists normed in previous studies (Anastasi, De Leon, & Rhodes, 2005; Roediger & McDermott, 1995; Stadler et al., 1999). The data from the Italian lists showed great variability in the probabilities that the different lists would produce illusory memories. For example, the uomo list resulted in very few critical lures being recalled, whereas lento led to more critical lures being recalled (.71) than list items (.59). Overall, the Italian lists were more likely to produce correct recall (M = .63, SD = .21) than critical recall (M = .22, SD = .37) [t(23) = .13.11, p < .001].

Table 1 Mean proportions of critical lures and list items recalled, for each Italian list in the present study; for the English list normed by Stadler, Roediger, and McDermott (1999) and Roediger and McDermott (1995); and for the Spanish list normed by Anastasi, De Leon, and Rhodes (2005)

The level of recall in the present study differed somewhat from those reported by Stadler et al. (1999) and Anastasi, De Leon, and Rhodes (2005), but the patterns were similar. For example, Stadler et al. found that a greater proportion of list items (.60) were recalled than critical lures (.40). Likewise, for Anastasi, De Leon, and Rhodes, the list item proportion (.53) was higher than the critical lure proportion (.30).

As in Stadler et al. (1999), the lists from the present study were split into the 12 lists most likely to lead to false memories and the 12 least likely to result in false memories. List recall was not significantly higher for the bottom 12 lists (M = .63, SD = .06) than for the top 12 lists (M = .62, SD = .05) [t(23) = 0.66, p = .513]. However, not surprisingly, critical lure recall was higher for the top 12 lists (M = .31, SD = .15) then for to the bottom 12 lists (M = .13, SD = .07) [t(23) = 3.71, p = .001]. Furthermore, list recall was significantly greater than critical lure recall for both the top 12 lists [t(11) = 6.31, p < .001] and the bottom 12 lists [t(11) = 19.22, p < .001].

Recognition data

The list item and critical lure recognition data for each Italian list are presented in Table 2. This table also provides the proportions of list items and critical lures recalled in the English and Spanish lists normed in previous studies (Anastasi, De Leon, & Rhodes, 2005; Roediger & McDermott, 1995; Stadler et al., 1999). The recognition data showed that 97 % of the participants tested with the soffice list recognized the critical lure, whereas 67 % of the participants recognized the critical lure when given the uomo and ragazza lists. Overall, the participants were significantly more likely to recognize critical lures (M = .82, SD = .11) than to recognize list items (M = .56, SD = .16) [t(23) = 5.89, p < .001]. This evidence is consistent with that obtained by Stadler et al., who reported that critical lures were falsely recognized 68 % of the time, whereas list items were correctly recognized 70 % of the time. As with the recall data, the recognition data were also split into the top 12 and bottom 12 lists, on the basis of the probabilities that the critical lure would be recognized. The participants recognized critical lures more often for the top 12 lists (M = .90, SD = .04) than for the bottom 12 lists (M = .74, SD = .05) [t(11) = 8.35, p < .001]. The list item mean for the top 12 lists (M = .59, SD = .20) was not significantly different from the mean for the bottom 12 lists (M = .54, SD = .23) [t(11) = 0.52, p = .605]. Similarly to the recall data, the recognition data were comparable to those from the English and Spanish lists. In fact, Stadler et al. reported that their top lists resulted in 71 % of list items and 77 % of critical lures being recognized, whereas their bottom lists resulted in 74 % of list items and 55 % of critical lures being recognized. Similarly, the Spanish lists (Anastasi, De Leon, & Rhodes, 2005) provided 77 % of list items and 84 % of critical lures being recognized for the top lists, as compared to 70 % of list items and 54 % of critical lures for the bottom lists.

Table 2 Mean proportions of critical lures and list items recognized, for each Italian list in the present study; for the English lists normed by Stadler, Roediger, and McDermott (1999) and Roediger et al. (2001); and for the Spanish list normed by Anastasi, De Leon, and Rhodes (2005)

Correlation data

Pearson correlations were calculated for the mean proportions of critical lures and list items recalled and recognized by each participant.

These results showed a significant correlation between critical lure recall and critical lure recognition (r = .73, p < .001), since the participants tended to produce false memories at similar levels in the recall and recognition tasks. Also, a significant moderate correlation between list item recall and list item recognition (r = .25, p < .05) was found. These data are coherent with those of Stadler et al. (1999), who found the recall versus recognition correlation to be higher for critical lures (.77) than for list items (.52). Anastasi, De Leon, and Rhodes (2005) instead reported a correlation between recall and recognition only for list items (.34). Furthermore, as has been argued by previous researchers (e.g., Seamon et al., 1998; Stadler et al., 1999), recognition performance could be influenced by the previous recall task. Additional correlations were computed to determine whether false recall and false recognition among the present Italian norms were related to results from the English norms published by Stadler et al. and the Spanish norms published by Anastasi, De Leon, and Rhodes. These data revealed a moderate correlation between critical lure recognition in the present norms and the Anastasi, De Leon, and Rhodes Spanish norms (r = .43, p = .05). In a similar analysis for critical lure recall, in which these norms were compared, no significant correlations were observed.

Discussion

In the present study, we created a set of Italian associative lists using native Italian speakers. To our knowledge, no other study has developed Italian associative lists using the DRM paradigm and provided normative data for recall and recognition of these lists. Such normative data can be useful for researchers interested in investigating illusory memories in Italian speakers using the DRM paradigm (Deese, 1959; Roediger & McDermott, 1995). As Anastasi, De Leon, and Rhodes (2005) suggested, another potentially important use of these lists will be to conduct research investigating bilinguals, in order to help delineate theoretical explanations for the false memory effect. Furthermore, in the present study we used the general method proposed by Anastasi, De Leon, and Rhodes for converting the DRM lists into the Italian language. These results demonstrated the robustness of the DRM false memory paradigm in other languages. The normative data for the present Italian lists are similar to those for the English lists normed by Stadler et al. (1999) and the Spanish lists normed by Anastasi, De Leon, and Rhodes. The effectiveness of the lists in producing false memories showed variability, with some lists demonstrating high levels of false recall or recognition, whereas other lists were much less likely to lead to false memories. For example, the lento list resulted in 71 % of the participants recalling the critical lure, whereas 3 % of the participants recalled the critical lures for the uomo and ragazza lists. The latter two were the lists most unlikely to produce false recognition; in contrast, soffice was the list most likely to result in false recognition (M = 97 %). Finally, we demonstrated a significant correlation between those lists that led to high levels of false recall and those that led to false recognition. However, there were some exceptions. For example, the re list was very effective in producing false recognition (M = .92), but not for producing false recall (M = .06).

The false memory effect has already gained the attention of many memory researchers, but nevertheless, the majority of studies have tested English speakers using the DRM paradigm. The present study adds to the literature by providing a set of Italian lists that can be used by researchers interested in testing individuals who speak Italian. These norms will be useful for researchers interested in assessing illusory memories with the DRM paradigm.