Skip to main content
Log in

Experimental Philosophy of Free Will and the Comprehension of Determinism

  • Published:
Review of Philosophy and Psychology Aims and scope Submit manuscript

Abstract

The experimental validity of research in the experimental philosophy of free will has been called into question. Several new, important studies (Murray et al. forthcoming; Nadelhoffer et al., Cognitive Science 44 (8): 1–28, 2020; Nadelhoffer et al., 2021; Rose et al., Cognitive Science 41 (2): 482–502, 2017) are interpreted as showing that the vignette-judgment model is defective because participants only exhibit a surface-level comprehension and not the deeper comprehension the model requires. Participants, it is argued, commit bypassing, intrusion, and fatalism errors. We respond in two ways: (1) we critique and improve existing methods for assessing deeper comprehension and (2) we develop videos to convey deterministic principles of change that succeed in significantly reducing participants’ bypassing, intrusion, and fatalism errors. Consequently, we have the best existing instrument for gauging folk intuitions about the relationship between free will and determinism.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. Though most work in this field is quantitative see Lim & Chen (2018) and Thompson (2023) for qualitative approaches.

  2. It should be noted that the word “could” (and its cognates) can be interpreted dispositionally so that “Jeremy could have done something other than rob the bank” is consistent with determinism (see Vihvelin 2013). If so, then a participant who has a dispositional interpretation of “could” would not be making a comprehension error. Whether or not participants deploy such interpretations is an empirical matter worth pursuing in future studies.

  3. We reference two papers written by this team of researchers: one from 2023 (Thomas Nadelhoffer, Samuel Murray, and Elise Murray) and one forthcoming (Samuel Murray, Elise Dykhuis, and Thomas Nadelhoffer). The names “Elise Murray” and “Elise Dykhuis” refer to the same person. We will refer to this team with “MDN”.

  4. Some issues that are tightly bound to these worries (and others) were raised by Murray & Nahmias (2014) regarding the methodology of earlier studies that suggested participants were generally incompatibilist.

  5. Monroe & Malle’s (2010) study suggests something similar. When prompting participants to give a definition of free will, 65% of participants refer to the ability to make a choice/decision while only 33% refer to doing what one desires.

  6. Of the 185 participants, two participants did not report their age and sex, while a total of 18 participants did not report their ethnicity.

  7. Across all our studies, participants completed one item assessing free will attributions and three items assessing moral responsibility beliefs after the dependent variables (bypassing, fatalism, intrusion). Participants also reported their level of confidence in their response to each item, ranging from 0 to 100. We do not report the results of these measures in this study, but see the OSF page for a full list of materials: https://osf.io/u8vwj/

  8. MDN reported an error rate of 42/75 (56%) for IntrusionORIG under Universe A/B in their forthcoming paper, but this contradicts the data from their OSF page (https://osf.io/7r9xj/) which shows an error rate of 33/75 (44%).

  9. See Supplemental Materials for error percentages, means, and standard deviations of each comprehension item across the experimental conditions, along with chi-square tests, p-values, and odds ratios.

  10. Though MDN carried out a factor analysis of four fatalism items in their 2023 study and found them to hang together, they performed the factor analysis across both Universe A/B and Supercomputer. We performed factor analyses on these fatalism items separately: one analysis for the items under Universe A/B and a separate analysis for the items under Supercomputer. See Appendix 10 for more details.

  11. The text in the second paragraph of this vignette was adapted from a study run by Murray & Nahmias (2014) to help reduce bypassing mistakes.

  12. Similar findings are reported in Knobe (2014) regarding bypassing. Our results suggest that there is a difference between participant understanding of deterministic systems with and without humans regarding fatalism and intrusion as well.

  13. The text in the first paragraph of this vignette (like the second paragraph in the Video vignette) was adapted from a study run by Murray & Nahmias (2014) to help reduce bypassing mistakes.

  14. Mean scores for Moral-2NEW (punishment) and Moral-3NEW (rehabilitation) are not statistically different from the mean score of Moral-1NEW so they are not included in our discussion.

  15. Given our claim that participant scores under Video are the most reliable in terms of assessing folk intuitions about the relationship between determinism and free will we need some way of explaining why participant scores under Supercomputer and Universe A/B differ (and are therefore less reliable for assessing folk intuitions about compatibilism and incompatibilism). In Appendix 11 we examined whether deeper comprehension scores (bypassing, fatalism, intrusion) mediated the impact of the video manipulation on participants’ free will and moral responsibility scores.

  16. This dovetails nicely with Rose and Nichols’ work (2013). They use structural equation modeling to argue that incompatibilist judgments seem to cause bypassing judgments, not the other way around.

References

  • Berniūnas, R., A. Beinorius, V. Dranseika, V. Silius, and P. Rimkevicius. 2021. The Weirdness of Belief in Free Will. Consciousness and Cognition 87 (103054): 1–15.

    Google Scholar 

  • Chen, C., S. Lee, and H.W. Stevenson. 1995. Response Style and Cross-Cultural Comparisons of Rating Scales Among East Asian and North American Students. Psychological Science 6 (3): 170–175. https://doi.org/10.1111/j.1467-9280.1995.tb00327.x.

    Article  Google Scholar 

  • Douglas, B.D., P.J. Ewell, and M. Brauer. 2023. Data quality in online human-subjects research: Comparisons between MTurk, Prolific, CloudResearch, Qualtrics, and SONA. PLoS ONE 18 (3): e0279720. https://doi.org/10.1371/journal.pone.0279720.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Furr, R.M. 2011. Scale Construction and Psychometrics for Social and Personality Psychology. SAGE Publications Ltd. https://doi.org/10.4135/9781446287866.

    Article  Google Scholar 

  • Kane, R. 1996. The Significance of Free Will. Oxford: Oxford University Press.

    Google Scholar 

  • Knobe, J. 2014. Free Will and the Scientific Vision. In Current Controversies in Experimental Philosophy, ed. E. Machery and E. O’Neill, 69–85. New York: Routledge.

    Chapter  Google Scholar 

  • Lim, D., and J. Chen. 2018. Is Compatibilism Intuitive? Philosophical Psychology 31 (6): 878–897.

    Article  Google Scholar 

  • Litman, L., J. Robinson, and T. Abberbock. 2017. TurkPrim. Com: A versatile crowdsourcing data acquisition platform for the behavioral sciences. Behavior Research Methods 49 (2): 433–442. https://doi.org/10.3758/s13428-016-0727-z.

    Article  PubMed  Google Scholar 

  • Monroe, A., and B. Malle. 2010. From Uncaused Will to Conscious Choice. Review of Philosophy and Psychology 1: 211–224.

    Article  Google Scholar 

  • Monroe, A., K. Dillon, and B. Malle. 2014. Bringing Free Will Down to Earth. Consciousness and Cognition 27: 100–108.

    Article  PubMed  Google Scholar 

  • Murray, S., Dykhuis, E., & Nadelhoffer, T. (forthcoming). Do People Understand Determinism? The Tracking Problem for Measuring Free Will Beliefs. In T. Lombrozo, J. Knobe, & S. Nichols, Oxford Studies in Experimental Philosophy. Oxford University Press. Oxford

  • Murray, D., and E. Nahmias. 2014. Explaining Away Incompatibilist Intuitions. Philosophy and Phenomenological Research 88 (2): 434–467.

    Article  Google Scholar 

  • Nadelhoffer, T., J. Shepard, E. Nahmias, C. Sripada, and L. Ross. 2014. The free will inventory: Measuring beliefs about agency and responsibility. Consciousness and Cognition 25 (1): 27–41.

    Article  PubMed  Google Scholar 

  • Nadelhoffer, T., D. Rose, W. Buckwalter, and S. Nichols. 2020. Natural Compatibilism, Indeterminism, and Intrusive Metaphysics. Cognitive Science 44 (8): 1–28.

    Article  Google Scholar 

  • Nadelhoffer, T., S. Murray, and E. Murray. 2023. Intuitions about Free Will and the Failure to Comprehend Determinism. Erkenntnis 88: 2515–2536. https://doi.org/10.1007/s10670-021-00465-y.

    Article  Google Scholar 

  • Nahmias, E., and M. Thompson. 2014. A Naturalistic Vision of Free WIll. In Current Controversies in Experimental Philosophy, ed. E. O’Neill and E. Machery, 86–103. New York: Routledge.

    Chapter  Google Scholar 

  • Nahmias, E., S. Morris, T. Nadelhoffer, and J. Turner. 2005. Surveying Freedom: Folk Intuitions about Free Will and Moral Responsibility. Philosophical Psychology 18 (5): 561–584.

    Article  Google Scholar 

  • Nichols, S., and J. Knobe. 2007. Moral Responsibility and Determinism: The Cognitive Science of Folk Intuitions. Nous 41 (4): 663–685.

    Article  Google Scholar 

  • Palan, S., and C. Schitter. 2018. Prolific. ac—A subject pool for online experiments. Journal of Behavioral and Experimental Finance 17: 22–27. https://doi.org/10.1016/j.jbef.2017.12.004.

    Article  Google Scholar 

  • Peer, E., D. Rothschild, A. Gordon, Z. Evernden, and E. Damer. 2022. Data quality of platforms and panels for online behavioral research. Behavior Research Methods 54: 1643–1662. https://doi.org/10.3758/s13428-021-01694-3.

    Article  PubMed  Google Scholar 

  • Rose, D., and S. Nichols. 2013. The Lesson of Bypassing. Review of Philosophy and Psychology 4 (4): 599–619.

    Article  Google Scholar 

  • Rose, D., W. Buckwalter, and S. Nichols. 2017. Neuroscientific Prediction and the Intrusion of Intuitive Metaphysics. Cognitive Science 41 (2): 482–502.

    Article  PubMed  Google Scholar 

  • Thompson, K. 2023. Qualitative Methods Show that Surveys Misrepresent “Ought Implies Can” Judgments. Philosophical Psychology 36 (1): 29–57.

    Article  Google Scholar 

  • Van Inwagen, P. 1983. An Essay on Free Will. Oxford: Clarendon Press.

    Google Scholar 

  • Vihvelin, K. 2013. Causes, Laws, & Free Will. New York: Oxford University Press.

    Book  Google Scholar 

Download references

Acknowledgements

Authors wish to thank Thomas Nadelhoffer and Shaun Nichols for feedback on earlier drafts of this paper. We thank friends and family for taking early versions of our studies.

Funding

The research by the second author was supported by funding from the John Templeton Foundation via a subaward through Hillsdale College’s “Launching Experimental Philosophy of Religion” project (#1003).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniel Lim.

Ethics declarations

Conflicts of interest

Authors have no known conflicts of interest to disclose.

Consent was obtained from all participants in studies reported below. Institutional review board approval was obtained for these studies.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 17 KB)

Appendices

Appendix 1

For Study 3, a factor analysis with oblimin rotation was conducted on the 7-items assessing comprehension errors (intrusion, bypassing, fatalism) with the speculation of three underlying factors. The first factor accounted for 33.55% of the variance (Eigenvalue = 2.35), with the factor loading onto the two intrusion items (0.84, 0.95). The second factor accounted for an additional 26.07% of the variance (Eigenvalue = 1.83), with the factor loading onto Bypassing 1 and 2 (0.86, 0.79, respectively). The final factor accounted for an additional 13.83% of the variance, but had a low Eigenvalue of 0.97. The final factor loaded onto Fatalism-1 at 0.81, but weakly for Fatalism-2 at 0.37. Bypassing-3 did not load onto any factor above 0.18. We removed Bypassing-3 because it did not load onto any factor above 0.40. Similarly, we removed Fatalism-2 due to not loading above 0.40 onto any factor, having a negative zero-order correlation with Fatalism-1, and producing poor internal consistency.

Appendix 2

A better way to test for fatalism would be to explicitly include information about the fixity (or non-fixity) of the past. Interestingly, in MDN’s 2023 study, such a statement (Fatalism-4ORIG) was included among the statements used to test for fatalism mistakes but is not included in their forthcoming study. Here are the four statements that were used in conjunction with Universe A/B:

FatalismORIG.

  1. 1.

    In Universe A, there is no sense in which events could have unfolded differently than they did.

  2. 2.

    In Universe A, John would have ended up having French Fries no matter what he tried to do.

  3. 3.

    In Universe A, John will eat French Fries no matter what.

  4. 4.

    In Universe A, John’s eating French Fries had to happen, even if what happened in the past had been different.

MDN decided to keep Fatalism-2ORIG and Fatalism-3ORIG in their forthcoming study because of their strong correlation.

They report a combined score for all four statements, but, without examining their supplemental materials, it is impossible to see if the participants responded differently to the four different ways fatalism is presented. Most notably for our purposes, Fatalism-4ORIG, unlike the other three statements, includes information about the past (namely that it is not fixed). Here is a summary of the means, standard deviations, and error rates for each individual FatalismORIG statement from MDN’s 2023 study Table 8.

Table 8 Means, Standard Deviations, and Error Rates for FatalismORIG statements from MDN’s 2023 study

Notice that there is less agreement with Fatalism-4ORIG compared with the first three FatalismORIG statements and that the difference in means is statistically significant (p < 0.01). This suggests that participants may answer differently when given explicit information about the fixity of the past.

While MDN performed an exploratory factor analysis on all four fatalism items, they combined the fatalism items across the Universe A/B and Supercomputer conditions. This assumes that the scale is assessing fatalism across both conditions, which was not verified. We performed an exploratory factor analysis on the four fatalism items with a varimax rotation, similar to MDN, however, we performed this EFA across each condition. For the Universe A/B condition, one factor was extracted from the data (Eigenvalue = 2.12) with 54.55% of the variance being explained. Results showed that Fatalism-1ORIG (0.72), Fatalism-2ORIG (0.69), and Fatalism-3ORIG (0.89) loaded on this factor above the threshold of 0.40 for retaining items for a scale (Furr 2011). Fatalism-4ORIG had a loading of 0.07, which did not load above the threshold. Further, a Cronbach’s alpha showed poor reliability for the four item scale (ɑ = 0.600) but good reliability for the three-item scale that omitted Fatalism-4ORIG (ɑ = 0.804). The given results suggest that Fatalism-4ORIG should be dropped when aggregating the items together.

We conducted an additional EFA with varimax rotation on the four items assessing fatalism in MDN’s 2022 study for participants who read Supercomputer. One factor was extracted from the data (Eigenvalue = 2.83) with 70.72% of the variance being explained. All four of the fatalism items loaded onto the factor above the threshold of 0.40 (0.57, 0.86, 0.89, and 0.79, respectively). A Cronbach’s alpha analysis also found good internal consistency for the four-item measure of fatalism (ɑ = 0.858). The given results suggest that aggregating the fatalism items into a single composite variable was appropriate for this data set.

Though our EFAs give us mixed results, the EFA for the FatalismORIG statements under the Universe A/B condition suggests that Fatalism-4ORIG should not be aggregated with the other three. This is reason to suspect that information about the fixity of the past can make a difference.

Appendix 3

We conducted a mediation analysis using PROCESS, a macro in SPSS that allows for multi-categorical independent variables. The independent variable was the manipulation, with Video set as the control condition and contrasted against each other vignette (Universe A/B, Supercomputer, Hybrid). The three mediators were Bypassing-combined, Intrusion-combined, and Fatalism-1NEW. Bypassing-combined consisted of Bypassing-1NEW and Bypassing-2NEW (rs = 0.640). The reverse-coded Bypassing-3NEW was not included due to producing poor internal consistency in the measure. Intrusion-combined consisted of Intrusion-1NEW and Intrusion-2NEW (rs = 0.865). Furthermore, Fatalism-2NEW was not used because, as noted above, Fatalism-1NEW and Fatalism-2NEW are conceptually and statistically different and Fatalism-2NEW does not include information about the fixity of the past. We conducted separate mediation analyses on free will and moral responsibility scores.

Results of the mediation analysis on free will scores showed that Bypassing-combined and Fatalism-1NEW scores mediated the effect of Video (vs. Universe A/B) on free will scores; specifically, Video indirectly increased free will scores compared to Universe A/B due to lowering bypassing scores. Yet, simultaneously, Video indirectly lowered free will scores compared to Universe A/B due to lowering Fatalism-1NEW scores. The mediation analysis also showed that Intrusion-combined and Fatalism-1NEW scores mediated the effect of Video (vs. Supercomputer) on free will scores. Specifically, Video lowered free will scores compared to Supercomputer due to lowering intrusion and fatalism scores. These findings suggest that manipulations that change deeper comprehension errors will indirectly affect people’s attributions of free will Figures 7 and 8.

Fig. 7
figure 7

Mediation Analysis of the Determinism Manipulation on Free Will Through Comprehension Items. Note: Unstandardized regression coefficients are presented in Fig. 1. *p < 0.05. **p < 0.01. ***p < 0.001

Results of the mediation analysis on moral responsibility scores showed a similar patterns of results. First, the mediation analysis on moral responsibility scores showed that Bypassing-combined scores mediated the effect of Video (vs. Universe A/B) on moral responsibility scores; specifically, Video indirectly increased moral responsibility scores compared to Universe A/B due to lowering bypassing scores. The mediation analysis also showed that Intrusion-combined scores mediated the effect of Video (vs. Supercomputer) on moral responsibility scores. Specifically, Video lowered moral responsibility scores compared to Supercomputer due to lowering intrusion scores. These findings suggest that, similar to above, manipulations that change deeper comprehension errors will indirectly affect people’s attributions of moral responsibility. See Table 9.

Fig. 8
figure 8

Mediation Analysis of the Determinism Manipulation on Moral Responsibility Through Comprehension Items. Note: Unstandardized regression coefficients are presented in Fig. 1. *p < 0.05. **p < 0.01. ***p < 0.001

Table 9 Indirect Effects of the Determinism Manipulation on Free Will and Moral Responsibility Scores Through Comprehension Scores

These analyses help us understand why the free will and moral responsibility scores under Supercomputer and Universe A/B differ from the scores under Video. Participants under Supercomputer make more intrusion mistakes than those under Video. This means participants under Supercomputer have a greater tendency to think that Jeremy could have done otherwise. If participants tend to believe this then it’s natural for them to then think that Jeremy has free will and is morally responsible for his actions. This explains why free will (4.33) and moral responsibility (4.85) scores under Supercomputer are higher than the corresponding scores under Video. The scores under Video are more reliable as indicators of compatibilism or incompatibilism because Video, compared to Supercomputer, significantly reduces intrusion mistakes while maintaining deeper comprehension regarding bypassing and fatalism.

Participants under Universe A/B make more bypassing mistakes than those under Video. This means participants under Universe A/B tend to think that Jeremy’s mental states (e.g., his desires) make no difference to his subsequent behavior. If participants tend to think that Jeremy’s mental states are bypassed (and therefore irrelevant) in the production of his behavior then it seems natural for these participants to then think that Jeremy does not have free will and is not morally responsible for his actions. This explains why free will (3.16) and moral responsibility (3.88) scores under Universe A/B are lower than the corresponding scores under Video. The scores under Video are more reliable as indicators of compatibilism or incompatibilism because Video significantly reduces bypassing mistakes while maintaining deeper comprehension regarding fatalism and intrusion.

Interestingly, participants under Video make fewer mistakes on Fatalism-1NEW than under Supercomputer or Universe A/B. Because this suggests that participants can properly distinguish determinism from fatalism one would think this leads to higher free will scores. The mediation analysis, however, shows that lowering scores on Fatalism-1NEW indirectly lowers free will scores under Video. This is prima facie puzzling and may suggest directions for future research.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lim, D., Nichols, R. & Wagoner, J. Experimental Philosophy of Free Will and the Comprehension of Determinism. Rev.Phil.Psych. (2024). https://doi.org/10.1007/s13164-024-00726-z

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13164-024-00726-z

Navigation