Representing credal imprecision: from sets of measures to hierarchical Bayesian models

Lassiter, Daniel

doi:10.1007/s11098-019-01262-8

Published: 15 February 2019

Volume 177, pages 1463–1485, (2020)

Philosophical Studies

Abstract

The basic Bayesian model of credence states, where each individual’s belief state is represented by a single probability measure, has been criticized as psychologically implausible, unable to represent the intuitive distinction between precise and imprecise probabilities, and normatively unjustifiable due to a need to adopt arbitrary, unmotivated priors. These arguments are often used to motivate a model on which imprecise credal states are represented by sets of probability measures. I connect this debate with recent work in Bayesian cognitive science, where probabilistic models are typically provided with explicit hierarchical structure. Hierarchical Bayesian models are immune to many classic arguments against single-measure models. They represent grades of imprecision in probability assignments automatically, have strong psychological motivation, and can be normatively justified even when certain arbitrary decisions are required. In addition, hierarchical models show much more plausible learning behavior than flat representations in terms of sets of measures, which—on standard assumptions about update—rule out simple cases of learning from a starting point of total ignorance.

Notes

See, for example, Halpern (2003) and Joyce (2005, 2010).
Note that the term “imprecise credences” is sometimes used to designate a specific formal model of belief based on sets of probability measures, rather than the phenomenon being modeled. To avoid confusion between the model and the thing modeled, I will avoid the term “imprecise credences” altogether, using “credal imprecision” as a name for the phenomenon and “sets-of-measures” for the formal model under discussion.
Representations based on probability intervals or on upper and lower probabilities can for present purposes be treated as a special case of sets-of-measures models.
In addition to references just cited, and among many others: Spirtes et al. (1993), Pearl (2000), Glymour (2001), Woodward (2003), Sloman (2005), Koller and Friedman (2009), Russell and Norvig (2010), Goodman et al. (2016), Danks (2014) and Icard (2017).
This is called “probabilistic dilation”: see Seidenfeld and Wasserman (1993), van Fraassen (2006) and White (2010). While this feature of sets-of-measures models is intuitively bizarre, Pedersen and Wheeler (2014) discuss important subtleties that may help to improve its plausibility.
In general, conditioning a \(\text{Beta}(a, b)\) prior on n heads/successes/wins and m tails/failures/losses yields a \(\text{Beta}(a + n, b + m)\) posterior: see e.g. Griffiths et al. (2008) and Hoff (2009).
A fourth way to deal with the inability of sets-of-measures models to allow serious learning from a starting point of ignorance is suggested by Rinard (2013): we can conclude that a precise formal model of belief states is not possible. This might well be correct, but it would be defeatist to draw this conclusion simply because sets-of-measures models cannot account for simple cases of inductive learning. In particular, the hierarchical approach that I will sketch momentarily gives us another reason not to abandon hope for a formal model of belief.
The part that still hits home is the accusation that precise credence models give rise to “very specific inductive policies” which are not justified by evidence. This is closely related to the impossibility of assumption-free learning noted above, as well as the question of whether and how rules like the Principle of Indifference can be used to justify certain choices of priors. We will return to this issue in Sect. 6 below.
For discussion of richer languages based on probabilistic programming principles that can describe hierarchical Bayesian models with uncertainty over individuals, properties, relations, etc., see for example Milch et al. (2007), Goodman et al. (2008, 2016), Tenenbaum et al. (2011), Goodman and Lassiter (2015), Pfeffer (2016) and Icard (2017).
The model is directly inspired by the Microsoft TrueSkill system that is used to rank Xbox Live players in order to ensure engaging match-ups in online games: see Bishop (2013). It is conceptually close to the more complex tug-of-war model, with quantification and inference over individuals and their properties and relations, that is explored by Gerstenberg and Goodman (2012) and Goodman and Lassiter (2015).
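
The conjugate update stated in the notes above (a Beta(a, b) prior conditioned on n successes and m failures yields a Beta(a + n, b + m) posterior) can be sketched in a few lines of Python. This is an illustrative sketch rather than code from the article: the function names, the uniform Beta(1, 1) starting point, the data of 8 heads and 2 tails, and the small family of priors standing in for a sets-of-measures state are all assumptions chosen for the example.

```python
from fractions import Fraction


def update_beta(a, b, heads, tails):
    """Condition a Beta(a, b) prior on observed counts; return the posterior (a, b)."""
    return a + heads, b + tails


def beta_mean(a, b):
    """Mean of Beta(a, b): the predictive probability of heads on the next toss."""
    return Fraction(a, a + b)


def posterior_mean_range(priors, heads, tails):
    """Update every prior in a (hypothetical) set and return the min and max posterior mean."""
    means = [beta_mean(*update_beta(a, b, heads, tails)) for a, b in priors]
    return min(means), max(means)


# Single-measure learner starting from a uniform Beta(1, 1) prior (an assumption).
posterior = update_beta(1, 1, heads=8, tails=2)
print(posterior, beta_mean(*posterior))  # (9, 3) 3/4

# A toy stand-in for a set of measures: one uniform prior plus two very
# opinionated ones. The specific members are illustrative assumptions.
prior_set = [(1, 1), (1, 1000), (1000, 1)]
low, high = posterior_mean_range(prior_set, heads=8, tails=2)
print(float(low), float(high))
```

Under these assumptions the single Beta(1, 1) learner moves to a predictive probability of 3/4, while the spread of posterior means across the three-member family still covers almost the whole unit interval. A set that contains arbitrarily opinionated priors narrows correspondingly slowly, which is one way to picture the learning problem that the abstract raises for sets-of-measures models.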

References

Al-Najjar, N. I., & Weinstein, J. (2009). The ambiguity aversion literature: A critical assessment. Economics & Philosophy, 25(3), 249–284.
Bishop, C. M. (2013). Model-based machine learning. Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 371(1984), 20120222.
Bradley, S. (2014). Imprecise probabilities. In E. N. Zalta (Ed.), The Stanford encyclopedia of philosophy (Winter 2014 ed.).
Danks, D. (2014). Unifying the mind: Cognitive representations as graphical models. Cambridge: MIT Press.
de Finetti, B. (1977). Probabilities of probabilities: A real problem or a misunderstanding? In A. Aykac & C. Brumet (Eds.), New developments in the applications of Bayesian methods. Amsterdam: North-Holland.
Elga, A. (2010). Subjective probabilities should be sharp. Philosophers’ Imprint, 10(5), 1–11.
Ellsberg, D. (1961). Risk, ambiguity, and the Savage axioms. The Quarterly Journal of Economics, 75, 643–669.
Gerstenberg, T., & Goodman, N. D. (2012). Ping pong in church: Productive use of concepts in human probabilistic inference. In Proceedings of the 34th annual conference of the Cognitive Science Society (pp. 1590–1595).
Gigerenzer, G. (1991). How to make cognitive illusions disappear: Beyond heuristics and biases. European Review of Social Psychology, 2(1), 83–115.
Glymour, C. N. (2001). The mind’s arrows: Bayes nets and graphical causal models in psychology. Cambridge: MIT Press.
Goodman, N., Mansinghka, V., Roy, D., Bonawitz, K., & Tenenbaum, J. (2008). Church: A language for generative models. Uncertainty in Artificial Intelligence, 22, 23.
Goodman, N. D., & Lassiter, D. (2015). Probabilistic semantics and pragmatics: Uncertainty in language and thought. In S. Lappin & C. Fox (Eds.), Handbook of Contemporary Semantic Theory (2nd ed.). London: Wiley-Blackwell.
Goodman, N. D., Tenenbaum, J. B., & The ProbMods Contributors. (2016). Probabilistic models of cognition. Retrieved February 14, 2019 from http://probmods.org.
Griffiths, T., Tenenbaum, J. B., & Kemp, C. (2012). Bayesian inference. In R. G. Morrison (Ed.), The Oxford handbook of thinking and reasoning (pp. 22–35). Oxford: Oxford University Press.
Griffiths, T. L., Kemp, C., & Tenenbaum, J. B. (2008). Bayesian models of cognition. In R. Sun (Ed.), Cambridge handbook of computational psychology (pp. 59–100). Cambridge: Cambridge University Press.
Griffiths, T. L., & Tenenbaum, J. B. (2006). Optimal predictions in everyday cognition. Psychological Science, 17(9), 767–773.
Grove, A. J., & Halpern, J. Y. (1998). Updating sets of probabilities. In Proceedings of the fourteenth conference on uncertainty in artificial intelligence (pp. 173–182). Morgan Kaufmann.
Hájek, A., & Smithson, M. (2012). Rationality and indeterminate probabilities. Synthese, 187(1), 33–48.
Halpern, J. Y. (2003). Reasoning about uncertainty. Cambridge: MIT Press.
Hoff, P. D. (2009). A first course in Bayesian statistical methods. Berlin: Springer.
Icard, T. (2017). From programs to causal models. In A. Cremers, T. van Gessel & F. Roelofsen (Eds.), Proceedings of the 21st Amsterdam colloquium (pp. 35–44).
Jaynes, E. (2003). Probability theory: The logic of science. Cambridge: Cambridge University Press.
Jeffrey, R. (1983). Bayesianism with a human face. In J. Earman (Ed.), Testing scientific theories (pp. 133–156). Minneapolis: University of Minnesota Press.
Joyce, J. M. (2005). How probabilities reflect evidence. Philosophical Perspectives, 19(1), 153–178.
Joyce, J. M. (2010). A defense of imprecise credences in inference and decision making. Philosophical Perspectives, 24(1), 281–323.
Kahneman, D., Slovic, P., & Tversky, A. (1982). Judgment under uncertainty: Heuristics and biases. Cambridge: Cambridge University Press.
Keynes, J. M. (1921). A treatise on probability. New York: Macmillan.
Koller, D., & Friedman, N. (2009). Probabilistic graphical models: Principles and techniques. Cambridge: MIT Press.
Kolmogorov, A. (1933). Grundbegriffe der Wahrscheinlichkeitsrechnung. Berlin: Springer.
Körding, K. P., & Wolpert, D. M. (2004). Bayesian integration in sensorimotor learning. Nature, 427(6971), 244–247.
Levi, I. (1974). On indeterminate probabilities. The Journal of Philosophy, 71(13), 391–418.
Levi, I. (1985). Imprecision and indeterminacy in probability judgment. Philosophy of Science, 52(3), 390–409.
Macmillan, N., & Creelman, C. (2005). Detection theory: A user’s guide. London: Lawrence Erlbaum.
Milch, B., Marthi, B., Russell, S., Sontag, D., Ong, D. L., & Kolobov, A. (2007). BLOG: Probabilistic models with unknown objects. In L. Getoor & B. Taskar (Eds.), Introduction to statistical relational learning (pp. 373–398). Cambridge: MIT Press.
Nisbett, R. E., & Wilson, T. D. (1977). Telling more than we can know: Verbal reports on mental processes. Psychological Review, 84(3), 231.
Pearl, J. (1988). Probabilistic reasoning in intelligent systems: Networks of plausible inference. Los Altos: Morgan Kaufmann.
Pearl, J. (2000). Causality: Models, reasoning and inference. Cambridge: Cambridge University Press.
Pedersen, A. P., & Wheeler, G. (2014). Demystifying dilation. Erkenntnis, 79(6), 1305–1342.
Pfeffer, A. (2016). Practical probabilistic programming. New York: Manning Publications.
Rinard, S. (2013). Against radical credal imprecision. Thought: A Journal of Philosophy, 2(2), 157–165.
Russell, S., & Norvig, P. (2010). Artificial intelligence: A modern approach. Englewood Cliffs: Prentice Hall.
Seidenfeld, T., & Wasserman, L. (1993). Dilation for sets of probabilities. The Annals of Statistics, 21, 1139–1154.
Simeone, O. (2017). A brief introduction to machine learning for engineers. arXiv:1709.02840v1.
Sloman, S. A. (2005). Causal models: How we think about the world and its alternatives. Oxford: OUP.
Spirtes, P., Glymour, C., & Scheines, R. (1993). Causation, prediction, and search. Cambridge: MIT Press.
Tenenbaum, J. B., Kemp, C., Griffiths, T. L., & Goodman, N. D. (2011). How to grow a mind: Statistics, structure, and abstraction. Science, 331(6022), 1279–1285.
Trommershäuser, J., Maloney, L. T., & Landy, M. S. (2008). Decision making, movement planning and statistical decision theory. Trends in Cognitive Sciences, 12(8), 291–297.
Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: Heuristics and biases. Science, 185(4754), 1124–1131.
van Fraassen, B. C. (1990). Figures in a probability landscape. In J. Dunn & A. Gupta (Eds.), Truth or consequences (pp. 345–356). Berlin: Springer.
van Fraassen, B. C. (2006). Vague expectation value loss. Philosophical Studies, 127(3), 483–491.
Vul, E., Goodman, N., Griffiths, T., & Tenenbaum, J. (2014). One and done? Optimal decisions from very few samples. Cognitive Science, 38(4), 599–637.
Walley, P. (1996). Inferences from multinomial data: Learning about a bag of marbles. Journal of the Royal Statistical Society, Series B (Methodological), 58, 3–57.
White, R. (2010). Evidential symmetry and mushy credence (pp. 161–186). Oxford: Oxford University Press.
Wilson, T. D. (2004). Strangers to ourselves: Discovering the adaptive unconscious. Cambridge: Harvard University Press.
Wolpert, D. H. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390.
Woodward, J. (2003). Making things happen: A theory of causal explanation. Oxford: Oxford University Press.

Author information

Authors and Affiliations

Department of Linguistics, Stanford University, 450 Serra Mall, Building 460, Stanford, CA, 94305, USA
Daniel Lassiter

Corresponding author

Correspondence to Daniel Lassiter.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Lassiter, D. Representing credal imprecision: from sets of measures to hierarchical Bayesian models. Philos Stud 177, 1463–1485 (2020). https://doi.org/10.1007/s11098-019-01262-8

Published: 15 February 2019
Issue Date: June 2020
DOI: https://doi.org/10.1007/s11098-019-01262-8
