Skip to main content
Log in

Variability across subjects in free recall versus cued recall

  • Published:
Memory & Cognition Aims and scope Submit manuscript

Abstract

Memory scientists usually compare mean performance on some measure(s) (accuracy, confidence, latency) as a function of experimental condition. Some researchers have made within-subject variability in task performance a focal outcome measure (e.g., Yao et al., Journal of Clinical and Experimental Neuropsychology, 38, 227–237, 2016). Here, we explored between-subject variability in accuracy as a function of experimental conditions. This work was inspired by an incidental finding in a previous study, in which we observed greater variability in accuracy of memory performance on cued recall (CR) versus free recall (FR) of English animal/object nouns (Mah et al., Frontiers in Psychology, 14, 1146200, 2023). Here we report experiments designed to assess the reliability of that pattern and to explore its causes (e.g., differential interpretation of instructions, [un]relatedness of CR word pairs, encoding time). In Experiment 1 (N = 120 undergraduates), we replicated the CR:FR variability difference with a more representative set of English nouns. In Experiments 2A (N = 117 Prolific participants) and 2B (N = 127 undergraduates), we found that the CR:FR variability difference persisted in a forced-recall procedure. In Experiment 3 (N = 260 Prolific participants), we used meaningfully related word pairs and still found greater variability in CR than in FR performance. In Experiment 4 (N = 360 Prolific participants), we equated CR and FR study phases by having all participants study pairs and, again, observed greater variability in CR than FR. The same was true in Experiment 5 (N = 120 undergraduates), in which study time was self-paced. Comparisons of variability across subjects can yield insights into the mechanisms underlying task performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data availability

All data/experiment programs are available online (https://osf.io/3tra5/).

Code availability

All analysis scripts/experiment programs are available online (https://osf.io/3tra5/).

Notes

  1. See the preregistration (https://osf.io/xfj6a) for the final word pool and details of the word selection procedure. Briefly, we began with the MRC Psycholinguistic Database (Wilson, 1988) of 21,561 nouns and then in several steps selected from that pool a set of nouns that are average on multiple dimensions (i.e., within a central mass of the database-wide distribution).

  2. Results were similar when looking at accuracy separately by test order (i.e., for those who did CR first vs. second); see Supplementary Material 3C.

  3. We also fit and compared Bayesian computational models of FR and CR performance (for this and all subsequent experiments). These analyses generally agreed with the ones reported here (see SOM 2 and our preregistration for more details about these analyses)

  4. If two out of three raters considered a word to have a salient non-noun meaning or to be too obscure, that word was removed from the pool.

  5. Before conducting Experiment 2A, we pilot tested the procedure on Prolific (N = 16). This testing revealed a high rate of exclusions (14/16 participants reported not understanding at least 75% of words), so we preregistered an additional inclusion criterion for this sample: English as a first language (self-reported on Prolific), in addition to self-reported English fluency. This had the added benefit of making our Prolific sample more comparable to our student samples in terms of language status.

  6. This was due to our sampling procedure (i.e., opening more study slots than we needed to maximize data collected while trying to anticipate exclusions), but the results were the same when including/excluding the seven additional participants.

  7. Results differed as a function of test order–the Pitman-Morgan test was significant for those that did FR before CR, but not for those who did CR before FR. This may be due to slightly lower CR performance in the latter group constraining CR variance (see Supplementary Material 4D).

  8. Perhaps this is why the corresponding Bayesian analysis did not provide compelling evidence for a CR:FR difference (see Supplementary Material 4A).

  9. Results were generally similar when comparing those who did CR before FR and vice versa, though the variability difference and CR accuracy were greater/higher for those who did FR before CR (see Supplementary Material 5D). It is possible that doing FR first (easier task) better prepares participants for CR, and this increase in accuracy (i.e., off of CR floor) serves to increase CR variability.

  10. Results were nearly identical when excluding the excess seven participants above our target N. Specifically, the Pitman–Morgan p < .001, bootstrapped CR:FR variance ratio = 1.54 (95% percentile bootstrap CI [1.34, 1.77]).

  11. 90% because our hypothesis of CR > FR variability was one-sided.

  12. We intended for the experiment program to auto-advance to the next word/pair after 30 s, but did not detect the programming error leading to the design reported in-text until after data had been collected. Study trials >30 s were rare (~4.5% of all trials), and excluding these trials or participants who had more than one such trial (n = 20) or participants who had any such trials (n = 33) did not change the results of our primary analysis. So, for subsequent analyses we did not exclude any trials/participants on this basis.

  13. The inclusion/exclusion of the additional four participants above our preregistered target N did not change the results of our primary confirmatory analysis, so they were included for all subsequent analyses.

  14. We conducted formal analysis of multimodality for FR and CR accuracy in all experiments via Hartigan’s Dip Test of Unimodality (Hartigan & Hartigan, 1985), and observed evidence against unimodality for FR and CR in all experiments other than Experiment 1 (see SOM 9)

References

* denotes Supplementary Material reference

Download references

Acknowledgements

We would like to thank Henry L. Roediger, Colleen M. Kelley, John Dunlosky, Larry Jacoby, Reed Hunt, and Roger Ratcliff for their helpful insights and suggestions.

Funding

This work was supported by an NSERC Discovery grant (#RGPIN-2016-03944) awarded to D.S.L.

Author information

Authors and Affiliations

Authors

Contributions

E.Y.M. and D.S.L. conceived of the experiments; E.Y.M. programmed the experiments, collected the data, and analyzed the data. E.Y.M. and D.S.L. drafted and revised the manuscript.

Corresponding author

Correspondence to Eric Y. Mah.

Ethics declarations

Conflicts of interest/Competing interests

We do not have any conflicts of interest to declare.

Ethics approval

All the experiments reported herein were approved by the ethics review board of the University of Victoria, and were conducted in accordance with the ethical standards as laid down in the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards.

Consent to participate

All participants who took part in the experiment consented to participate.

Consent for publication

Both authors consent to the publication of this manuscript.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open Practices Statement

The data and materials for all experiments are available online (https://osf.io/3tra5/), and all experiments were preregistered: Experiment 1 (https://osf.io/xfj6a), Experiments 2A and 2B (https://osf.io/3w6fm), Experiment 3 (https://osf.io/v67gy), Experiment 4 (https://osf.io/de7bu), and Experiment 5 (https://osf.io/my53w).

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 7.36 MB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mah, E.Y., Lindsay, D.S. Variability across subjects in free recall versus cued recall. Mem Cogn 52, 23–40 (2024). https://doi.org/10.3758/s13421-023-01440-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3758/s13421-023-01440-4

Keywords

Navigation