An array of practical situations involve Bayesian reasoning, which involves revising the baseline likelihood of an event happening, given new information. Examples include interpreting how a medical test result (new information) effects the prior likelihood someone has a disease, using new evidence to update (from some baseline estimate) how likely a defendant is to be guilty, and recognizing how recent interactions may have influenced a relationship partner’s previous level of satisfaction. Research on Bayesian reasoning abilities, however, have traditionally painted a distressing picture; Casscells et al. (1978) for instance found that only 18% of doctors and medical students were able to correctly use Bayesian reasoning to interpret a medical test result.
Subsequent research has established several ways to present Bayesian reasoning tasks in order to generally improve performance. Two of these key methods are giving the relevant information as whole numbers embedded within a natural sampling framework (i.e., natural frequencies) and including a pictorial representation of the information (see McDowell & Jacobs, 2017, for a meta-analysis). The facilitating effects of these task presentation formats have been demonstrated across legal, medical, and business settings (e.g., Garcia-Retamero & Hoffrage, 2013; Gigerenzer et al., 2007; Hoffrage & Gigerenzer, 1998; Hoffrage et al., 2000, 2015; Lindsey et al., 2003).
The present research focusses not on characteristics of Bayesian reasoning tasks but rather on the characteristics of the persons doing the reasoning. In particular, what are the individual differences that best predict Bayesian reasoning performance? This question is important because, although presentation format guides are undoubtedly useful (e.g., for presentations to patients, jurors, or stockholders), it is also important to know which presenters and receivers of information have the underlying dispositions and abilities to understand those presentations.
Research on individual differences associated with Bayesian reasoning ability has largely centered on numerical literacy (the ability to understand and work with numbers, often shortened to “numeracy”; e.g., Chapman & Liu, 2009), but that focus is widening now to include individual differences in other cognitive abilities. Another specific trait that appears to predict Bayesian reasoning is visuospatial ability (Brase & Hill, 2017; Kellen et al., 2013). Other studies have looked at more general constructs associated with Bayesian reasoning such as cognitive resources (Lesage et al., 2013), cognitive ability (Stanovich & West, 2000), cognitive reflection (Sirota & Juanchich, 2011), cognitive processing (Sirota et al., 2014), and working memory (Yin et al., 2020). A subsequent section develops how the diversity of these accounts, and the broad generality of their proposed predictors, present practical and theoretical challenges.
Numeracy and visuospatial ability
It makes some sense that differences in numeracy are associated with Bayesian reasoning because it is, after all, a numerically based task. There has been some disagreement, though, regarding how and why the two are related. Similar disagreement has been emerging about the interpretation of other individual differences predicting Bayesian reasoning (i.e., visuospatial ability, general cognitive abilities or resources), and the issues tend to align with two different theoretical viewpoints.
One long-standing general theoretical viewpoint about Bayesian reasoning is the ecological rationality approach (ERA) view, which stresses that people should perform better on tasks (including Bayesian reasoning) when they are presented in ways that are more consistent with the natural and evolutionary ecology of the real-world environment. This view stresses considerations of the mind being evolved to expect certain types of information (Brase et al., 1998) and mental processes being “fast and frugal”; taking advantage of structural regularities and intercorrelations that occur in the environment generally (whether evolutionarily recurrent or modern; Gigerenzer & Hoffrage, 1995). The ERA interprets the facts that Bayesian reasoning improves when tasks are given using naturally sampled frequencies (e.g., Gigerenzer & Hoffrage, 1995) and pictorial representation (e.g., Garcia-Retamero et al., 2010) as being due to those presentations better mapping onto the ecology of the natural environment (i.e., a world that predominately includes visually individuated, countable items that can be organized as they are encountered).
An ERA view suggests that measures of numeracy and visuospatial ability contribute fairly directly to Bayesian reasoning performances, as measures of how well individuals can transfer the written and drawn Bayesian tasks into an ecologically more suitable representation. Consistent with this, numeracy has been found to predict Bayesian reasoning performance across both levels of numeracy (e.g., high vs. low numeracy; Hill & Brase, 2012) and different measures of numerical literacy (Brase & Hill, 2017; Johnson & Tubau, 2013). Brase and Hill (2017) also found that multiple measures of visuospatial ability were correlated with Bayesian reasoning, and that this relationship (when dichotomized) was consistent across levels of ability.
Another long-standing theoretical viewpoint is the nested sets approach (NSA) view, which stresses the general ability to perceive the nested sets structure of any given Bayesian reasoning task and tends to generally eschew evolutionary and ecological rationality explanations. The NSA view is that both naturally sampled frequencies and pictures improve the “clarity” or help people to “see” the nested set structure of the Bayesian reasoning task. This has also variously been described as inducing a “partative formulation” (e.g., Macchi, 2000) as facilitating the “construction of a set inclusion mental model” (Evans et al., 2000), and as making nested relationships “opaque” or “transparent” (Sloman et al., 2003).
An NSA view suggests that numeracy predicts Bayesian reasoning performance indirectly, via more general abilities (e.g., being more able to see nested sets structures or even broader cognitive abilities; e.g., Sirota & Juanchich, 2011; Sirota et al., 2014). Better visuospatial ability, like the use of pictures to aid Bayesian reasoning, also helps by allowing people to be better able to understand the nested sets structure. Overall, the NSA view leads to the prediction that any variable that increases the general ability to form a good nested set task representation will therefore be associated with better Bayesian reasoning. That also implies that those variables will consistently tend to interact with each other, such that their predictive power will be diminished whenever other transparency-enhancing variables are also included as factors. In other words, if there is one general underlying factor (nested sets transparency) that is facilitated in multiple ways by specific manipulations (use of pictures, frequencies, higher numeracy, higher visuospatial ability, etc.), then the measured effectiveness of each of those manipulations will initially be strong but then appear to diminish as additional concurrent manipulations are assessed.
Different NSA views
The NSA view has more recently been incorporated by some advocates within either a mental-models framework (Johnson-Laird et al., 1999; Pighin et al., 2015) or a dual-systems framework (e.g., Barbey & Sloman, 2007; Lesage et al., 2013). Different versions of the NSA view lead to various different expectations about which individual differences will be more central and predictive of Bayesian reasoning performance.
NSA as a dual-system disposition
Some NSA views have adopted a dual-systems model. There are some variations (see Evans & Stanovich, 2013, for one overview), but dual-systems models generally propose that people’s thinking occurs in one of two broad modes: System 1 and System 2 processes. System 1 processes tend to be more implicit, intuitive, and automatic; a faster and less effortful. System 2 processes tend to be more explicit, conscious, and controlled; deliberative and effortful. According to this view, frequencies, natural sampling, and pictures all help promote a better understanding of the nested sets relationship inherent in Bayesian reasoning, and thereby invoke System 2 processes to correctly reason about the problem (e.g., Barbey & Sloman, 2007; Evans et al., 2000; Sirota, Kostovičová, & Vallée-Tourangeau, 2015; Sloman et al., 2003).
One dual-systems model of the NSA view then is that people who have a disposition to employ System 2 processes will thereby be better able to analyze and reason about Bayesian reasoning tasks. This is consistent, for example, with Sloman et al. (2003, p. 298), which uses the terminology of “inside” versus “outside” views:
“This [representing instances can reveal the set structure of a problem] is a fairly direct consequence of representing the instances of the categories in a problem. These instances make up the sets or classes that correspond (in an outside view) to the categories. Most representational schemes that identify instances and the categories they belong to will automatically also specify the set structures relating the categories.”
According to this version of the NSA view, a disposition toward greater task engagement and effort would be a strong driver of better performance because it promotes clearer or deeper thinking about the problem. This appears to correspond well to “need for cognition,” a long-standing and highly validated trait construct of people’s dispositions to deeply engage with cognitively difficult tasks. People high in need for cognition enjoy thinking about topics as they are presented, enjoy the process of thinking, and readily apply their thinking skills in effective ways (Cacioppo & Petty, 1982, 1984).
NSA as a dual-system ability
Another possible dual-systems model of the NSA view is that better abilities to access and use System 2 cognitive processes (rather than disposition to do so) should improve Bayesian reasoning (see Stanovich & West, 1998a, 1998b, 2000, 2008, for a more general approach along these lines). The exact nature of these general abilities is not well defined, but Sirota et al. (2014) claimed that two measures (the Cognitive Reflection Task and the Raven advanced progressive matrices) were appropriate and significantly predicted Bayesian reasoning performances.
The Cognitive Reflection Task (CRT; Frederick, 2005) has actually been found to predict Bayesian performance a few times (Lesage et al., 2013; Sirota & Juanchich, 2011). Sirota et al. (2014), however, employed it as a measure of “general reasoning,” and argued that this relationship indicated “the involvement of a general reasoning mechanism as postulated by the NSA, rather than the involvement of a specialized cognitive mechanism operating automatically, as posited by the ERA” (p. 201). There is considerable debate, however, about what the CRT actually measures, and it is known to partially overlap with numeracy (e.g., Campitelli & Gerrans, 2014; Thompson & Oppenheimer, 2016). Subsequent work (Sirota et al., 2018) has recognized that the CRT theoretically confounds mathematical ability (i.e., numeracy) with whatever else it measures and that it can be psychometrically problematic. Adding to this ambiguity, the analyses in Sirota et al. (2014) used a median split of the three-item CRT scores, which can be very problematic (e.g., Irwin & McClelland, 2003; MacCallum et al., 2002).
The Raven advanced progressive matrices (Raven et al., 1977) as a measure of general cognitive abilities was also a significant predictor of Bayesian reasoning performance in Sirota et al. (2014). This test also has a problematic theoretical confound, though. It consists of a series of visual geometric designs, each with a missing piece, and the test-taker is asked to pick from a set of options to complete the missing piece. There are ongoing debates about the degree to which the Raven’s results are a measure of just general cognitive ability, or if it is also assessing visuospatial ability (e.g., Colom et al., 2004; DeShon et al., 1995; Vigneau & Bors, 2005). Thus, it is again possible that the Sirota et al. (2014) results are due to a confound, this time with visuospatial ability.
Part of the challenge for this view, clearly, is how to accurately measure “general ability” given that it is an ambiguous construct. Evans and Stanovich (2013) have suggested that working memory is a defining feature of System 2 processes, but that was not assessed in Sirota et al. (2014). In any event, a good measure of working memory (as a general cognitive-processing ability) will need to not be theoretically confounded with numeracy or visuospatial ability.
NSA as a style of mental model
The NSA view has also been placed within the general framework of mental models (e.g., Johnson-Laird et al., 1999; Johnson-Laird et al., 2015). The mental-models framework explains the efficacy of format manipulations (frequencies, natural sampling, pictures, etc.) in improving Bayesian reasoning in terms of more effective construction of a mental model of the reasoning task.
As with the dual-systems framework, though, it is not always clear whether this is due to one’s disposition or one’s ability to construct mental models. One possibility is to construe greater task engagement as the driver of better performance, in much the same way as in the dual-systems framework. This interpretation is consistent with the third “fundamental principle” of mental-models theory: “with deliberation reasoners can use the meaning of assertions to flesh out mental models into fully explicit models.” (e.g., Johnson-Laird et al., 2015, p. 204). If this is the case, the Need for Cognition Scale is a viable option for assessing individual differences in the tendency to put more effort into fully fleshing out mental models.
NSA as a model ability
Another possibility for framing the NSA view within the mental-models theory is to maintain that there is a more specific ability for mental model construction of nested sets. Johnson-Laird (1983) claims multiple representation formats are possible within mental models, including tokens, spatial relations, and temporal or causal relations (p. 410). Johnson-Laird goes on to further delineate between “physical” models and “conceptual” models, with six major types of physical models and four major types of conceptual models (monadic, relational, meta-linguistic, and set-theoretic models). Apparently, then, the idea of nested set mental models as the underlying foundation for effective Bayesian reasoning is not just any mental model but a rather specific type: conceptual set-theoretic models. Indeed, this interpretation is most consistent with the mental-models view of the past 20 years (e.g., Johnson-Laird et al., 1999, 2015).
It follows, then, that individual differences in ability to construct conceptual set-theoretic mental models should be a better predictor of reasoning performance than other possible predictors. A difficulty arises at this point, though, because there are no known psychological measures of individual differences in abilities to understand and reason specifically with set-theoretic models. Part of the present research, therefore, includes the development of assessments designed to measure nested sets modeling abilities.
Hypotheses
All viewpoints reviewed here recognize that there are individual differences that should be meaningfully related to Bayesian reasoning abilities and performance.Footnote 1 The key differences are in which individual differences are primary factors in accounting for variations in Bayesian reasoning abilities. Crucially, some of these measures are likely to be intercorrelated to some extent (e.g., Brase & Hill, 2017), so looking at individual variables in isolation would risk finding significant correlations driven by unstudied third variables (e.g., see prior discussion of results in Sirota et al., 2014). The current research therefore concurrently measures multiple individual difference traits alongside the target behavior of Bayesian reasoning.
In summary, hypotheses about the relationship between various individual differences measures and Bayesian reasoning performance differ according to which theoretical view is entertained and which version of the view is considered:
-
The ERA predicts that both numeracy and visuospatial ability will best predict Bayesian reasoning performance. The predictive power of these will not be strongly affected by other individual differences, such as thinking dispositions, the ability to use nested sets, or general cognitive processing abilities.
-
The NSA as thinking disposition/mental model style views predict that one’s tendency to more extensively think about tasks (either more System 2 processing or fleshing out of one’s model) will best predict Bayesian reasoning performance. These views imply that the predictive power of thinking disposition or style (e.g., need for cognition) would dominate those other variables.
-
An NSA as set-theoretic modeling ability view predicts that the ability to construct conceptual set-theoretic models will best predict Bayesian reasoning performance, and that this relationship will not be significantly diminished by other variables.
-
An NSA as System 2 ability view predicts that general processing ability (e.g., working memory) will significantly predict Bayesian reasoning performance, and that its predictive power will largely subsume the predictive ability of other traits.
It does not appear that the later three hypotheses (all versions of the NSA view) are required to be mutually exclusive, even as they are distinct. While that is possibly problematic in terms of generating specific and testable predictions for the general NSA view, it also opens up the present opportunity to work out the more specific versions of this view. Additionally, by using concurrent assessments of multiple relevant individual difference measures it may be possible to identify even overlapping but supportive outcomes.