Abstract
We show a somewhat surprising result concerning the relationship between the Principal Principle and its allegedly generalized form. Then, we formulate a few desiderata concerning chance-credence norms and argue that none of the norms widely discussed in the literature satisfies all of them. We suggest that the New Principle comes out as the best contender.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Chances should guide credences; to put the point more guardedly, rational agents’ credences about chances should guide their credences: this is the essence of the intuition behind the Principal Principle (Lewis, 1986). The idea can be easily illustrated in simple cases: assuming we’re discussing the credences of some rational agent, her credence in A given (just) that the chance of A is 0.3 should equal 0.3. However, moving from this to a general formulation of a sensible candidate for a norm of rationality has been a daunting task; the multitude of texts published in the last four decades on this subject attest its difficulty.
The field has certainly seen progress; a few candidates for chance-credence norms have emerged, each with its own set of problems. Some of the issues have been given a formal treatment, with mathematical theorems enlightening previous metaphysical research. In this paper we would like to bring into focus a few issues on that frontFootnote 1:
-
a somewhat surprising relationship between two familiar norms, the Principal Principle (PP) and the Generalized Principal Principle (GPP): it turns out that, despite what some writers seem to have thought, the GPP is not “more general” of the two, at least in the sense that it does not even include the PP as a special case—the PP postulates stronger requirements on agents who are not certain what the objective chance function is than the GPP does;
-
this leads to a suggestion that there are two fundamentally different readings of the PP, the static and the dynamic one, which seems to illuminate the issue;
-
the PP seems to have at least one hitherto undiscussed unintentional consequence: if an agent has an opinion about a nontrivial chance function, then at least two possible worlds have to share that function;
-
related, but different issues arise for the New Principle (NP), originally proposed to address the problem of “undermining futures” (Lewis, 1994);
-
a “General New Principle” (GNP), whose relation to the NP is prima facie the same as the one holding between the GPP and the PP, turns out to be equivalent to the NP;
-
the GPP is not preserved by Bayesian Conditionalisation (while the PP and the NP are);
-
the GPP is not as good at handling conditional chances as the PP and the NP—it is not a “dot principle”, as defined below in Section 3.3.
It would seem, then, that neither the PP nor the GPP are good candidates for norms of rationality. What is left on the table? We will argue that the New Principle (NP), which avoids most of the problems discussed in this paper, also leads to a possibly unfortunate, somewhat metaphysical constraint on the chance functions about which the agent whose credence function satisfies the principle has an opinion: if such a chance function is not “fundamentally self-deceiving”, in the precise sense defined below, it has to be the chance function for at least two possible worlds (more precisely, it has to be their ur-chance function, as described below). Whether this constraint is unreasonable is another matter, of course. This will depend on one’s metaphysical account of chance, a matter on which we would like to be neutral in this paper.
The paper is structured as follows. In the next section we present the formal framework in which the proposed norms will be discussed. Section 3 contains the discussion of the various desiderata we would like the norms to satisfy. In Sect. 4 we sum up our findings.
2 The setup and initial observations
For the purposes of this paper, we have decided to adopt the formalism used in Pettigrew (2015). There have been numerous attempts at providing some formal language suitable for phrasing chance-credence norms, and we would not like to add another one to the mix. These attempts can be roughly divided into two groups depending on whether the PP is supposed to involve conditioning on a proposition about which we assume (only) that it specifies the chance of the proposition under discussion, or whether it does its job in the context of conditioning on a proposition which specifies the whole chance function. We encounter the first of these approaches whenever we see someone phrasing the PP as entailing that \(P(A | X)=x\), where X is assumed to “say that the chance of A is x”, or something to the similar effect.Footnote 2 We encounter the second one for example when we see someone phrasing the PP in a way which involves referring to some “complete theory of chance” (whatever that would be) instead of the proposition X. Pettigrew’s method brings clarity to the latter idea: instead of “complete theories of chance”, we refer to “chance propositions” true at those worlds which share the given chance function.
Let us then recapitulate the main ingredients of the formal setup from Pettigrew (2015). An agent’s epistemic state at a time t is given by a credence function \(b_t\) which is a classical probability function. It is defined on an algebra of propositions; these are the propositions about (the truth of) which the agent in question has an opinion. For each time t there is a proposition \(E_t\) which is the agent’s total evidence at t; we assume that \(b_t(E_t)=1\), that is, the agent does not dispute her evidence.
Given a possible world w we consider the “ur-chance function” at w. The idea is that, in w, chances at time t are obtained from a single chance function upon which we conditionalise with the history of w up to t: it is that function which is dubbed the ur-chance function. We may think of it as the “initial” chance function, if time has an initial instant; it specifies, in any case, the chances at a world “before anything happens in it”, i.e., when the history of it is a tautologous proposition. It is assumed that chance functions are classical probabilities and that they are defined on the same algebra of propositions as the credence functions. Given a chance function ch, \(C_{ch}\) is the proposition “the ur-chance is given by ch”, true at exactly those worlds where the ur-chance is ch.
Note that Pettigrew does not assume throughout that propositions are sets of possible worlds: he does so only in one of his arguments supporting the NP (Proposition 12 of Pettigrew, 2015). We also want to stay neutral on this in general. However, some arguments in what follows will require the actual construction of credence functions satisfying the GPP; to do so, we need to specify the algebra of propositions, and this is perhaps most conveniently described if propositions are sets. However, what will turn out to be important is a different issue: whether chance propositions are atoms of the algebra of propositions on which the credence function is defined (if it has atoms at all). If an atom of such an algebra is identified with a set of possible worlds, it is natural to treat it as a singleton: otherwise there will be two or more worlds which assign the same truth values to all propositions about which the agent has an opinion, i.e. which are indistinguishable from her perspective.
If every world has a single ur-chance function, it follows that the propositions \(C_{ch}\) correspond to a partition of the set of all possible worlds considered by the agent. An assumption Pettigrew makes “for the sake of mathematical simplicity” (p. 178) is that the domain of the credence function contains only finitely many propositions of the form \(C_{ch}\). The points made in this paper will require from us the construction of specific credence functions satisfying GPPs, but in each case finite structures (i.e. structures containing not only finitely many \(C_{ch}\)’s, as in Pettigrew’s case, but finite, full stop) will suffice. Therefore, we deem this assumption to be relatively innocent.
Let us move, then, to the formulation of the chance-credence norms under discussion here (lifted with minimal changes from Pettigrew, 2015).
(PP) At time t, an agent ought to have a credence function \(b_t\) such that, for all ur-chance functions ch and propositions A,
$$\begin{aligned}b_t(A|C_{ch})=ch(A|E_t),\end{aligned}$$unless \(ch(E_t)=0\); in which case it ought to be that \(b_t(C_{ch})=0\).
The PP as just given is not exactly what Lewis originally had in mind; he wanted the Principle to concern reasonable initial credence functions:
(PP)\(_0\) At the beginning of her epistemic life, an agent ought to have a credence function \(b_0\) such that, for all ur-chance functions ch and propositions A,
$$\begin{aligned}b_0(A|C_{ch})=ch(A).\end{aligned}$$
Observe that PP\(_0\) entails that at the beginning of her epistemic life the agent has to have a nonzero credence in each chance proposition she has an opinion about. This strikes us as a strong assumption. However, we can perhaps mitigate it by thinking that, instead of considering a \(C_{ch}\) such that \(b_0(C_{ch})=0\), we shall just remove the possible worlds at which such a \(C_{ch}\) is true from the set of the worlds under discussion. (This is perhaps additionally justified if we assume that the agent updates via Bayesian Conditionalisation, which throughout her life would never raise her credence in such a \(C_{ch}\).)
The PP implies the PP\(_0\) if we assume that there is an instant 0 at the beginning of an agent’s epistemic life where her evidence is tautologous (we keep this assumption throughout the paper). Pettigrew notes (in his Proposition 1) that PP is entailed by PP\(_0\) in conjunction with Bayesian Conditionalisation (BC):
(BC) If \(t < t'\), it ought to be the case that, for any A,
$$\begin{aligned}b_{t'}(A) =b_t(A|E_{t'}),\end{aligned}$$provided that \(b_t(E_{t'})>0\).
To quote Lewis, the PP is supposed to capture the intuition that “certainty about chances—or conditionality on propositions about chances—makes for resilient degrees of belief about outcomes” (Lewis, 1986, p. 86). However, an agent can satisfy PP (or PP\(_0\)) without bestowing credence 1 to any chance proposition.Footnote 3 In fact, it only makes real sense to write the PP in this way if one’s intention is to talk about agents who are not certain what the actual chance function is. Otherwise, if the actual chance function was ch, then the agent’s credence \(b_t(C_{ch})\) would equal 1, and writing the formula at the heart of the PP starting with “\(b_t(A|C_{ch})\)” would be puzzling indeed. We also should not read the PP as specifying what a rational agent’s credence should be once they learn what the actual chance is: first, the PP does not by itself specify a belief update rule, but is a synchronic normFootnote 4; second, the PP might be seen to be reasonable even if learning what actual chances are is fundamentally impossible. It just requires the rational agent’s personal odds to be set in a specific way: if \(ch(A |E_t)=.75\), then for the agent \(AC_{ch}\) should be thrice as likely as \(\lnot A C_{ch}\).Footnote 5
All this is not a peculiarity of the formalism we have adopted for the current paper. Lewis himself, immediately after the above-quoted fragment about how PP is to capture the intuition about “certainty about chances—or conditionality on propositions about chances”, defines the PP as involving the expression “\(C(A|XE)=x\)”,Footnote 6 where the “C” stands for the credence function, and the “X” for the proposition that the chance of A’s holding equals x. This, again, has nontrivial consequences for many agents who are not certain of what the chance function is, that is, for those with a credence function C such that \(C(X) \ne 1\). If the principle was supposed to concern just those agents who possess certainty about the objective chance, that is, those for which \(C(X)=1\), then writing the PP as Lewis did would indeed be baffling. Of the two options from the quote, then, we should point to “conditionality on propositions about chance” as the topic of the PP.
The PP might seem to be too specific: it literally refers only to cases involving a conditional credence in a proposition given a single proposition about chance, while in reality agents typically hold numerous hypotheses about possible chance functions. And it seems that indeed, Lewis intended the ultimate form of the Principal Principle to be somewhat different. In the context of an agent entertaining various options as to the chance of a certain coin falling heads, he writes “more generally, whether or not you are sure about the chance of heads, your unconditional degree of belief that the coin falls heads is given by summing over alternative hypotheses about chance” (ibid., p. 87). The suggestion seems clear that this is supposed to be the generalized version of the PP, covering more cases. In the parlance we have adopted here let us write it as follows, “GPP” meaning “Generalized Principal Principle”Footnote 7:
(GPP) At time t, an agent ought to have a credence function \(b_t\) such that, for all ur-chance functions ch and propositions A,
$$\begin{aligned} b_t(A)=\sum _{ch: ch(E_t)>0}b_t(C_{ch})ch(A|E_t). \end{aligned}$$
The intuition seems clear: “more generally”, the credences of an agent who satisfies the GPP are weighted means of conditional credences, the weights being the credences in chance propositions, and the conditional credences being conditional on those chance propositions. This we get just by the total probability theorem. Then we switch from the conditional credences to chances by appealing to (PP), and indeed (GPP) is what we end up with (see Appendix A.1).
If the GPP was indeed “more general”, then in the special case where the agent was certain of the truth of a single chance proposition, and just one weight (equal to 1) remained, we could be excused for thinking that we should end up with the “specific” PP. However, note that it is decidedly not the case: while it is easy to deduce the GPP from the PP just by the probability calculus, the GPP does not imply the PP, in the literal sense of there being a credence \(b_t\) which satisfies the GPP but which does not satisfy the PP (see Table 1 below, as well as Appendix A.2). So, quite literally, the GPP is not “more general” than the PP, since it doesn’t cover it as a special case. The situation is a little more subtle. If for some ch \(b_t(C_{ch})=1\), then both the PP and the GPP require that \(b_t(A)=ch(A|E_t)\), that is, that credences conform to known chances. This is the job the Principal Principle is usually informally expected to perform.Footnote 8 But the PP, as already noted, postulates requirements concerning a specific coordination of credences also on part of agents who are not certain about what the objective chance is, requirements the GPP does not posit. We also see that the GPP governs both the situations of certainty and of uncertainty about objective chances. When investigating the relationship between the PP and the GPP, and the alleged higher generality of the GPP, then, the topic of certainty is something of a red herring. The two principles demand the same credence function from agents certain about what the objective chance is; of other agents, actually, it is the PP that requires more.
After inspecting the literatureFootnote 9 we believe it is worth it to reiterate the following points:
-
the GPP, known also as “Ismael’s Principle”, is already present in Lewis (1986);
-
the PP implies the GPP (Appendix A.1);
-
the GPP does not imply the PP (Table 1 and Appendix A.2).
The direction of the logical relationship is one thing; justification is another.Footnote 10 Hoefer (2007) points out that “The justification of this GPP depends on the prior justification of PP in a fairly obvious way” (fn. 27).Footnote 11 Briggs (2009a) goes further and claims that “without PP, there is no reason to accept GPP” (p. 441). We disagree. In our opinion there are situations where the GPP is motivated just by the intuition that ideally credences should conform to chances, so in situations of uncertainty about chances one should set their credences as expectations of chances; sometimes it might be the case that the ratios of certain probabilities turn out to be such that the PP fails, but still, to reiterate, credences still are expectations of chances, as they should be.
Consider the situation in Table 1. Some agent’s credence \(b_t\) is defined on the space of four possible worlds, governed by two chance functions; \(C_{ch_1} = \{w_1, w_2\}\), \(C_{ch_2} = \{w_3, w_4\}\). What is the agent’s credence in, say, \(\{w_1\}\)? The agent does not know which of the two chance functions is the actual one. Still, there are two options, so they set their credence to be the appropriate weighted mixture of the two, that is, to be their expectation of the proposition’s chance: \(b_t(\{w_1\}) = b_t(C_{ch_1})ch_1(\{w_1\}) + b_t(C_{ch_2})ch_2(\{w_1\}) = 0.5 \times 0.4 + 0.5 \times 0.25 = 0.325\). This holds for all propositions, and so the agent’s satisfies the GPP (please take our word for it for now; for details see Appendix A.2). However, the PP fails: \(b_t(\{w_1\} | C_{ch_1}) = 0.65 \ne 0.4 = ch_1(\{w_1\})\). Is this a mark of synchronic irrationality? The agent’s odds of \(\{w_1\}C_{ch_1}\) vs. \(\{w_2, w_3, w_4\}C_{ch_1}\) are \(\nicefrac {13}{7}\), while according to \(ch_1\) the odds of \(\{w_1\}\) vs. \(\{w_2, w_3, w_4\}\) are \(\nicefrac {2}{5}\). However, the \(b_t\) is a classical probability function, so there is no risk of a Dutch Book. The credences are expectations of chances, as already noted. After the agent updates the credence function in light of some evidence, there might be trouble; we should expect, e.g., that after conditionalisation on some new evidence the GPP will cease to hold. However, this might not be thought of as a problem for GPP itself as a synchronic norm of rationality; and if one wishes, other belief update methods than conditionalisation could be considered, such as imaging (Lewis, 1976) or generalized imaging (Gardenfors, 1982). Perhaps sustaining the GPP under some reasonable belief update method is not out of the question.Footnote 12
To reiterate, the supposedly “generalized” PP does not imply the “specific” one. In fact, it’s the other way round: once we assume the PP, the GPP follows. It’s not the case that the GPP covers both the situations of uncertainty, where the agent holds various hypotheses about chance, and of certainty, where the agent bestows one chance proposition credence 1, while the PP deals exclusively with the latter. The situation, as already noted, is markedly different, and perhaps we can see one aspect of it more clearly if we assume that rational agents adhere to BC. As formulated, both the PP and the GPP are static, synchronic norms: they specify how an agent’s credences at a certain time should internally cohere. However, once we assume the agent will update their credence by conditionalization, the PP tells us how such an update should proceed in situations where they learn the truth of one of the chance propositions, bringing a dynamic aspect to the picture, which is not entailed by the static one.Footnote 13 The PP, then, covers the situations of becoming certain what the objective chance is, not sharing this aspect with the GPP. Perhaps this might be the reason for the associations of the PP with certainty and the GPP with uncertainty.
The third candidate for a chance-credence norm to be discussed here is the so-called New Principle (NP) that was first put forth in response to the problem of “undermining futures”.Footnote 14 At the core of the NP lies the idea that rather than letting unconditional chances guide credence as is the case for PP, we should let the chances, conditional on theirs being the true chance, guide credence. In other words, chance is to be treated as an oracle who is aware of her ‘oracle-status’: not only do we bring her up to speed with what our evidence is in line with what the PP already suggested, but we also assume that she knows that she is the oracle and then conform our subjective credence to the probability that she would assign to the proposition in question, conditional on the evidence and on hers being the true chance function (Hall, 1994). More precisely:Footnote 15
(NP) At time t, an agent ought to have a credence function \(b_t\) such that, for all ur-chance functions ch and propositions A,
$$\begin{aligned}b_t(A|C_{ch})=ch(A|E_t C_{ch}),\end{aligned}$$unless \(ch(E_t C_{ch})=0\); in which case it ought to be that \(b_t(C_{ch})=0\).
It might seem natural at this point to investigate a “general” counterpart to the NP, one explicitly involving an agent contemplating multiple chance hypotheses—an analogue of the GPP, but involving chance functions which are “brought up to speed” in the above sense:
(GNP) At time t, an agent ought to have a credence function \(b_t\) such that, for all ur-chance functions ch and propositions A,
$$\begin{aligned} b_t(A)=\sum _{ch: ch(E_t)>0}b_t(C_{ch})ch(A|E_t C_{ch}). \end{aligned}$$
Somewhat surprisingly, it turns out that this does not bring anything new to the table: the GNP and the NP are equivalent (Appendix A.3).
3 The Desiderata
We will now proceed to list and discuss a few conditions on prospective candidates for chance-credence norms of rationality.
3.1 Being preserved by BC
The first condition is simple: a good candidate for a chance-credence norm should stipulate a condition on credence functions which is preserved by BC, a widely accepted belief update operation. (This desideratum will not make sense, of course, for someone who believes that the right chance-credence norm is PP\(_0\).)
We do not wish to claim that this is a sine qua non: we would be open, say, to a slight divergence from Bayesianism, according to which an update via BC may require an immediate “chance-credence recalibration”; perhaps mere conditionalisation may loosen the proper fit between credences and credences about chances, which would need to be re-established. Let us check the facts as to whether the conditions imposed by the discussed norms are preserved by BC. Almost trivially, the PP is so preserved (Appendix A.4). However, as shown by Pettigrew and Titelbaum (2014), GPP is not (Pettigrew and Titelbaum attribute the principle to Ismael; see also Appendix A.5): one can construct credences \(b_t\) and \(b_{t'}\) (for \(t < t'\)), such that:
-
\(b_t\) satisfies the GPP;
-
\(b_{t'}\) arises from \(b_t\) in accordance with BC;
-
but \(b_{t'}\) does not satisfy the GPP.
(In such situations, in view of the PP implying the GPP, \(b_t\) will of course not satisfy the PP.) This is, then, another argument against the GPP.
The NP is also preserved by BC (Pettigrew, 2015, Proposition 8).
3.2 Avoiding unintentional metaphysical consequences
As is well-known, the PP faces tough problems related to the issue of “undermining futures”. These were initially formulated in the philosophical context of Humean Supervenience and led to several reformulations of the principle, of which we will discuss the NP (see Lewis, 1994 and Thau, 1994 for the initial statement of the problem and first attempts at a solution; among the papers important for subsequent discussion are e.g. Hall (1994) and Meacham (2010)). The issue can, however, be seen to deal a strong blow against the PP even if no deep philosophical commitment is made: only a few mathematical assumptions are needed. Specifically: take a chance function ch and a moment t such that \(ch(C_{ch} | E_t) < 1\) (Pettigrew calls such functions “modest in the presence of \(E_t\)”). Then from the fact that \(b_t\) satisfies the PP it follows that \(b_t(C_{ch}) = 0\) (Pettigrew, 2015, Proposition 4, but see also Proposition 3). Then, if all chance functions about which the agent has an opinion are modest in the presence of \(E_t\), a \(b_t\) satisfying the PP cannot be a probability function.
Perhaps even the weaker conclusion that an agent satisfying the PP cannot hold a nonzero credence in that the actual world is governed by a modest chance function is unpalatable. Assume an agent’s space is finite. If at the start of their epistemic life they consider more than one chance functions which are nowhere zero, then in order to satisfy the PP they need to choose one of those functions and go all in, setting their credence in that this is the objective chance to 1, and setting all other chance propositions to be empty. Such a behaviour does not strike us as rational, in view of complete lack of evidence.
This needs to be contrasted with the utmost ease with which one can, given the set of chance functions, produce credence functions satisfying the NP in situations where the agent has opinions regarding arbitrary chance propositions. As shown in Proposition 6 in Pettigrew (2015), for any finite set of chance functions, for any evidence \(E_t\), you can have an agent with arbitrary credences in the corresponding chance propositions, and still satisfy the NP; the issue of modesty is not relevant here. However, we will argue that the principle has other problems, which, again, will be most easily discussed if we start with how they trouble the PP.
Before we do this, let us note that, regarding modesty, the GPP falls somewhat in between the PP and the NP: it does not force the agent to have credence 0 in all chance propositions where the chance is modest in the presence of \(E_t\), but restricts this conclusion to these cases of modest ch’s for which there exists an immodest \(ch'\) such that \(ch(C_{ch'} | E_t)>0\), a noticeably weaker constraint.Footnote 16 Of the three principles, then, the NP seems to fare the best with regard to the modesty issue, with the GPP in the second place.
Let us now consider what it means if a \(C_{ch}\) is an atom of the algebra of propositions on which the credence function is defined.Footnote 17 If propositions are sets, it may be the case that \(C_{ch}\) is a singleton: among the possible worlds in the sphere of interest for the agent, there’s just a single one whose ur-chance function is ch. In a different case, \(C_{ch}\) may contain more than one world: but still, all the propositions about which the agent has an opinion have the same truth values at those worlds, and so from the perspective of the agent they (the worlds) are identical. This holds also in the general case where propositions are not sets of worlds: if \(C_{ch}\) is an atom, then the agent does not differentiate between the worlds at which ch is the ur-chance function.
Suppose, then, that \(C_{ch}\) is an atom. If the agent satisfies the PP, then, at any instant t, \(ch(A|E_t)\) is a trivial function (i.e., it obtains for any A only the values 1 and 0). And if we consider the case when \(E_t\) is tautologous, that is, we investigate the beginning of the agent’s epistemic life, we infer that the ur-chance function ch is trivial. This is because, if \(C_{ch}\) is an atom, then for any A, \(A \wedge C_{ch}\) can only be equal to \(C_{ch}\) or the bottom element of the algebra (i.e. \(\emptyset \) if propositions are sets), to which any probability function has to assign 0. Therefore \(b_t(A | C_{ch})\), and so \(ch(A|E_t)\), can only equal 0 or 1, which in the case where \(E_t\) is tautologous leads to the conclusion that ch is trivial.Footnote 18 In short, just because of how classical conditional probability works, the PP has the following consequence:
(Triv) If a chance function is non-trivial, there have to be at least two worlds for which it is the ur-chance function.
Whether (Triv) can be called a “metaphysical” consequence might depend on one’s view on the nature of possible worlds considered by agents in the context of chance-credence norms. Lewisians might just take the word literally. Those for whom these worlds are “personal possibilities” (Hacking, 1967, Sect. 6) may differ in their opinion; ersatzists likewise. We do not wish to put too fine a point on using the term.
Note also that (Triv) is an entirely reasonable principle on many, if not most, accounts of chance. What is surprising is that it is a consequence of a principle which is ostensibly about coordinating two types of credences: those about propositions and those about chances of propositions. (PP) requires your credence \(b_t\) in the propositions “A and the chance is ch” and “the chance is ch” to be related with each other by means of a very specific ratio, given by \(ch(A|E_t)\). And if \(ch(A|E_t)\) is different from 0 or 1, why should it follow from that that an agent has to consider at least two worlds for which ch is the ur-chance? This logical relation, not the conclusion, seems surprising to us. It is decidedly not odd to have two worlds with the same chance function; it is odd for a chance-credence norm to entail this.Footnote 19
The GPP does not entail (Triv); the Appendix contains an example of a credence function satisfying the GPP and involving atomic chance propositions for nontrivial chance functions (see the end of Appendix A.2). However, a related issue may be seen to trouble the NP. Call a chance function ch fundamentally self-deceiving in the presence of \(E_t\) if \(ch(\cdot | E_t)\) is nontrivial, but \(ch(\cdot | E_t C_{ch})\) obtains only the values 0 and 1. That is, ch considers itselfFootnote 20 to be deterministic (in the presence of \(E_t\)), but it really isn’t: there is a proposition A such that \(ch(A| E_t)\) is not equal to 0 or 1. Notice that if \(C_{ch}\) is an atom, then for any t, if \(ch(\cdot | E_t)\) is nontrivial, then ch is fundamentally self-deceiving in the presence of \(E_t\). In other words, the NP leads to the following:
(Deceiv) Take a chance function ch. If there is a t such that ch is not fundamentally self-deceiving in the presence of \(E_t\) and \(ch(\cdot | E_t)\) is nontrivial, then there have to be at least two worlds for which ch is the ur-chance function.
Perhaps the biggest trouble here is the following: if \(b_t\) satisfies the NP, then for a nontrivial chance function ch, if ch is not fundamentally self-deceiving in the presence of tautologous evidence, that is, if there is even a single proposition A such that \(ch(A| C_{ch})\) is different from 0 or 1, then there have to be at least two worlds for which ch is the ur-chance function.
It might very well be, then, that (Deceiv) is an unfortunate consequence of the NP. However, we have to note that the NP can only be an improvement over the PP if there are cases where the principles differ in the values of credence they mandate. These, in turn, are exactly the cases where \(ch(A|E_t) \ne ch(A |E_t C_{ch})\), cases, in other words, where ch is self-deceiving in the presence of \(E_t\): it would appear to be saying, for example, “if I’m right, the chance of A in the presence of \(E_t\) is 0.3”, while in reality it may be any other number. (We have to note that while we label these cases as those of deception, Hall (1994), who treats chance functions as performing the role of an expert, uses more moderate language and suggests that in such situation something is “news to the expert” (p. 511).) It is interesting to see that, if we want to avoid the modesty issue by switching from the PP to the NP, we necessarily have to invite some self-deception of this sort. Note that, as mentioned above, it is easy to construct examples of \(b_t\)’s satisfying NP where the atoms of the domain of \(b_t\) are chance propositions for nontrivial ch; in such cases all chances about which the agent has an opinion are fundamentally self-deceiving. To avoid this for some chance ch, there have to be at least two worlds sharing the ch as their ur-chance; it is, again, surprising to us that this “metaphysical” conclusion should follow just from a principle about coordinating two types of credences.
To sum up, with regard to the issue stemming from \(C_{ch}\)’s being atoms, it is the GPP which comes out on top, with NP second.
3.3 Conditional chances should guide conditional credences
A good chance-credence norm should capture the idea that chances guide rational credences, and chances can fulfill this function because they are taken to track patterns of events in the world. But this is something that conditional chances do as well, under the assumption that what we conditionalise upon occurs. A good chance-credence norm, then, should accommodate this last insight as well: conditional chances should guide conditional credences. Roughly, if I learn (just) that the conditional chance of some dice coming up 3 (A) given that it comes up odd (B) is .3, I should set my credence in A given B to .3. And if I entertain more than one hypothesis about this conditional chance, my conditional credence should be an appropriate mixture. It seems that these cases should be covered by chance-credence principles: and perhaps they already are?
To start exploring this issue, let us note that in the cases discussed so far the formulations of the chance-credence norms include the phrase “for all propositions A”, and apart from some chance proposition(s) and the evidence proposition, only the proposition A is subsequently used in the statement of the norm. It is a priori possible that in some cases, that is, for some norms, we could generalize this so that we quantify over pairs of propositions A and B, and proceed to use both of these when formulating some condition involving the conditional credence in A given B (this would be a generalization since B can be chosen to be tautologous).
Let us set this up slowly. It is probably safe to say that most of us, when encountering the notion of conditional probability, inquired about the possibility of iterating it: can we rigorously speak of things like "the probability of A given B, given C"? We were then told that there were ways of doing this, albeit somewhat convoluted. The probably most natural one is based on the following Fact:
Fact 1
Suppose \(B\in \mathcal {F}\) and \(P(B)>0\). Then \(P_B : \mathcal {F} \rightarrow [0,1]\) defined as, for any \(A \in \mathcal {F}\),
is a probability function with the domain \(\mathcal {F}\).
This operation can be iterated, so that we can speak e.g. of the measure \(P_{C_B}\), for some suitably chosen B and C. Note the following Lemma:
Lemma 1
Assume \(P(C)>0\) and \(P(B)>0\). Then \(P_{C_B}(A)=P(A|BC)\).
Proof
\(\square \)
This implies, for instance, that whenever \(P(C)>0\) and \(P(B)>0\), \(P_{C_B}=P_{B_C}\).
Suppose, then, we extend the language of probability and define \(P((A|B)|C) {:}{=}P_{C_B}(A)\). Then, the above Lemma implies that \(P((A|B)|C)=P((A|C)|B)=P(A|BC)\). Let us call this way of understanding expressions like P((A|B)|C), in the context of any probability function P, be it chance or credence, the conjunctive convention.Footnote 21
We can now roughlyFootnote 22 define what we’d like to call a “dot principle”, for reasons which should become clear soon. The “chance-credence language” to which the definition refers is the language we are using to formulate the conditions on credences and chances featuring in the specifications of the chance-credence norms. It features function names ch and \(b_t\), for various real values of t, possibly with other subscripts, propositional letters A, B, \(\ldots \), as well as chance-proposition symbols \(C_{ch}\), with subscripts, if necessary. We do not believe more rigor is needed at this point.
Definition 1
((Dot principle)) A dot principle is a principle involving an equation or inequality in the chance-credence language which features one of more occurrences of the ‘\(\cdot \)’ symbol; with the understanding that the expression in question is true on both the following interpretations:
-
when ‘\(\cdot \)’ is a variable ranging over all propositions;
-
and when for any two propositions A and B, \('(A|B)'\) can be substituted uniformly for ‘\(\cdot \)’ so that the result is a formula which is true under the conjunctive convention (provided that all conditional probabilities featuring in the resulting formula are defined).
As an initial example, which will also illustrate the point that “dot principles” may be a sensible topic of discussion also if they do not mention chance at all, consider the “dot” version of BC:
(BC.) If \(t < t'\), it ought to be the case that, for any A,
$$\begin{aligned}b_{t'}(\cdot ) =b_t(\cdot |E_{t'}).\end{aligned}$$
(BC\(\cdot \)) implies (BC), of course. However, it is elementary to notice that (BC) implies (BC\(\cdot \)): \(b_{t'}(A|B)=\frac{b_{t'}(AB)}{b_{t'}(B)}{\mathop {=}\limits ^{(BC)}}\frac{b_t(AB|E_{t'})}{b_t(B|E_{t'})} =b_t(A|BE_{t'}){\mathop {=}\limits ^{conj.~conv.}}b_{t}((A|B)|E_{t'})\). And so, the regular and the “dot” versions of BC are equivalent. In other words, we do not need to add anything to BC to make it cover conditional probabilities: the original BC suffices.
What about chance-credence norms? The “dot” versions of the principles under discussion here look as follows:
(PP.) At time t, an agent ought to have a credence function \(b_t\) such that, for all ur-chance functions ch and propositions,
$$\begin{aligned}b_t(\cdot |C_{ch})=ch(\cdot |E_t),\end{aligned}$$unless \(ch(E_t)=0\); in which case it ought to be that \(b_t(C_{ch})=0\).
(GPP.) At time t, an agent ought to have a credence function \(b_t\) such that, for all ur-chance functions ch and propositions,
$$\begin{aligned}b_t(\cdot )=\sum _{ch: ch(E_t)>0}b_t(C_{ch})ch(\cdot |E_t).\end{aligned}$$
(NP.) At time t, an agent ought to have a credence function \(b_t\) such that, for all ur-chance functions ch and propositions,
$$\begin{aligned}b_t(\cdot |C_{ch})=ch(\cdot |E_t C_{ch}),\end{aligned}$$unless \(ch(E_t)=0\); in which case it ought to be that \(b_t(C_{ch})=0\).
It turns out that while both the PP and the NP are equivalent to their “dot” versions (Appendix A.6), the GPP is not equivalent to the GPP\(\cdot \) (Appendix A.7). Therefore, of these three, the GPP stands out negatively: it is the worst at handling conditional credences and chances.Footnote 23
4 Conclusions
We began the paper by exploring the relationship between the GPP and the PP, noting the diversity of views regarding this issue. We argued that the GPP has been present already in Lewis, and the recent discussions regarding the principle proposed by Ismael pertain also to the original Lewisian notion. We discussed the logical relationship between the two principles (the PP does imply the GPP, and not vice versa) and their scope (it is identical: despite some remarks cited above, both principles apply both to situations in which the agent is certain about what the objective chance function is, and to those in which the agent displays uncertainty about that matter). We also suggested that in situations of the latter kind, where the GPP is satisfied—so the agent’s credences are their expectations of chances—but the PP is not, it is not easy to find a mark of synchronic irrationality on part of the agent; therefore, despite the logical relationship between the two principles, we should not think that the GPP gets all its justification from the PP. Ideally, we’d set our credences to equal the objective chances; if we don’t know what these chances are, we should set our credences to be expectancies of chances, and this is exactly what the GPP does. Notice that the starting point for this argument is only the informal intuition about ideally setting credences as equal to chances, and not the PP as formally stated, which says a great deal more.
We also recalled that, for those who subscribe to BC, the PP actually delineates something else than a special case of the GPP: it gives us potential future credences of the agent after updating on chance propositions. The GPP seems to lack this dynamic aspect.
We then introduced the NP and its “generalized” counterpart, GNP, noting that this generalization was only superficial. Then we offered a few desiderata on chance-credence norms and checked whether the three candidates under discussion here meet them. It seems that the NP is clearly the leader here: it is preserved by the BC, in at least in one aspect it handles conditional chances correctly (i.e., it is a “dot principle”), and it is not troubled by the phenomenon of modesty.Footnote 24 However, one outstanding issue remains: if a nontrivial chance function is not fundamentally self-deceiving, there have to be at least two possible worlds for which it is the ur-chance function. Perhaps it is a small price to pay for the benefits the principle brings; and perhaps, for many chance metaphysicians, it is no price at all.
Notes
Some of those are not novel. Citations are given below in each case, when the particular matter is discussed.
Why just “entailing”, and not, say, “stating”? This is because most authors feel the need to include another proposition after the conditioning bar, encompassing “admissible evidence” received by the agent; in this paragraph we assume that that proposition is tautologous, for ease of exposition.
And of course vice versa: she may consider just a single chance proposition to be really possible, awarding it credence 1, and still not satisfy any of the two norms.
We will return to this issue below.
For similar comments in the context of the norm of Reflection, see van Fraassen (1984).
Lewis (1986, p. 87).
Pettigrew (2015) calls this “Ismael’s Principle” (IP), quoting Ismael (2008). The formalism used by Ismael is almost completely different; however, we believe it is clear that what motivates the IP are the same considerations which inspired the Lewis’ quote just given. When responding to Pettigrew in Ismael (2015), Ismael stresses that on her account chance functions are not expected to be defined for chance propositions. She writes that the principle captures the idea that “if you don’t know what the correct ur-chance function is, you obtain your credences from a weighted mixture of the chances assigned by epistemically possible ur-chance functions” (p. 197).
Cf. the SEP: “The most prominent constraint has been the idea that chances, when known, should guide rational credence, at least when other things are equal” (Eagle, 2021, Appendix A.1 on the Principal Principle).
(This footnote is largely historico-terminological and may be safely skipped by those uninterested in such minutiae.) The two points—that the idea behing the GPP is Lewis’s, and that the GPP is not more general than PP, being implied by it—do not seem to be generally accepted in the literature on chance-credence norms. As for the logical relationships, already Kyburg (1981) claimed that the principle involving credence-weighted sums was “implicit in the fourth item of Lewis’s questionnaire (i.e., in the story about the rational credence in the chance of heads given three competing chance hypotheses – comment ours), but (...) definitely not provided by the principle itself” (Kyburg, 1981, p. 775). It would seem, then, that for Kyburg the GPP is not implied by the PP, and should be given a different name, Kyburg’s choice being “the principle of integration”, since it describes a way of integrating credences in competing chance hypotheses. Roberts (2001) claims that the PP is, for Lewis, a “very special limiting case of the true principle relating chance and credence” (p. S104), which is the GPP. Briggs (2009a) protests that “GPP is a consequence of PP—not the other way around” (p. 441), accurately describing the actual logical relationship between the two principles, but without giving an example of a credence that would satisfy one and not satisfy the other. (Interestingly, Briggs (2009b) does not differentiate between the two, claiming that the “PP advises [the agent] to set her credence in A equal to a mixture of its (epistemically) possible chances, where the strength of each component is determined by the agent’s credence that it is A’s chance” (p. 83), which is what the GPP explicitly does. Since the PP implies the GPP, this is indeed literally true.) That the GPP does not imply the PP is a part of Proposition 2.1 by Pettigrew and Titelbaum (2014); however, the argument seems to be missing. (That said, the structure from p. 6 of that paper could be employed in the argument.)
Since Ismael’s use of the GPP in her (2008) paper, under the name of “The General Recipe”, the GPP has been referred to as “Ismael’s Principle” in a part of the literature. (Ismael herself claimed it was a correction to the PP which Lewis had “misstated” (Ismael, 2015, p. 197).) This has the consequence that a newcomer to the literature might not realize that some results pertaining to “Ismael’s Principle” actually relate to the original Lewisian idea. For example, it has been shown that satisfying the principle is not preserved under Bayesian conditionalisation (Pettigrew & Titelbaum, 2014; more on that below). It is, we believe, important to see that this is a problem not just for Ismael’s proposal, but for the original insight due to Lewis, which was supposed to be “more general” than the PP itself. For example, Earman (2021, p. 4) credits Ismael with the idea that “behind squabbles about specific formulations of the PP there is the more general principle that rational credence is, or as I would prefer to say, can be represented as subjective uncertainty about what the objective chances are”; that more general principle, as we have noted, is already there on p. 87 of Lewis (1986). Currently, Ismael calls the principle “the GPP” (in fact, already Pettigrew, 2012 calls a similar principle the “Generalized Principal Principle”, but attributes it to Ismael, not Lewis) and claims it is a required addition to the PP: “the simple PP, which tells us what to do if we know what the chances are, needs to be supplemented with a principle that says how to let beliefs about chance guide credence where one doesn’t know what the chances are” (Sect. 2.4 of Ismael , forthcoming), suggesting that the PP does not imply the GPP (since it needs to be supplemented by it).
We would like to thank one of the reviewers for pressing us on this.
As an interesting sidenote, Hoefer (2019) suggests that the PP is an idealization, and “not something that corresponds to the epistemic situation of a real agent” (p. 144), all relevant actual situations being handled by the GPP.
This is a matter for further research. Note the somewhat, perhaps, unusual direction of investigation: instead of finding features of probability functions which would be preserved by a particular type of linear averaging (for a survey of classic results see Genest and Zidek (1986)), we would be looking for belief update procedures which preserve the property of “being a linear averaging” in the precise GPP sense.
This “dynamic” reading can be found in some writers’ approaches to the PP; see, e.g., Earman (2018, p. 1): “suppose that you learn that the objective chance of heads on the next tip of a coin is \(\nicefrac {1}{2}\), and that you update your credence function to reflect this information. Then (according to PP) in order to count as delivering rational degrees of belief your updated credence function ought to assign degree of belief \(\nicefrac {1}{2}\) to said outcome, and this is so regardless of other things you might have learned”, and Roberts (2001, p. S100), “the basic idea underlying the PP is simply that the chance distribution is one with the following property: when a cognizer like one of us gets admissible information about this distribution, she ought to set her degrees of belief accordingly”.
See also Hacking (1967, p. 315) for a similar use of “dynamic”; “dynamic assumption” here means that the agent updates via the BC: if “conditional probability is a defined notion, then Prob(h/e) stands merely for the quotient of two probabilities. It in no way represents what I have learned after I take e as a new datum point. It is only when we make the dynamic assumption that we can conclude anything about learning from experience”.
Roughly, an undermining future is a future which “has a chance of happening, but if it happens the chances are different and this conflicts with the Principal Principle” (Weatherson, 2021). As the problem is only tangential to the topic of this essay, we will not go into further details here.
We frequently omit the “\(\wedge \)” sign in formulas.
Pettigrew (2015, Proposition 7).
If you think that’s unrealistic because you believe, perhaps due to your views regarding chance, that no chance proposition should be an atom of a proposition algebra on which credences of a rational agent are defined—then a fortiori you subscribe to the thesis (Triv) below. But then you also believe that any ur-chance function under consideration is shared by at least two worlds; this might be problematic for deterministic chances. Perhaps a mixed view is possible: chance propositions may be atomic only if the chance function in question is deterministic. We still find it interesting that we reach this conclusion also as a consequence of a chance-credence principle such as the PP.
If a proposition is an atom of an agent’s proposition algebra, then it can feature in their descriptions of the world, but is not by itself conceptually divisible by that agent. Then e.g. frequentists might be tempted to say that “the chance of Heads is 0.2” is a disjunction of many possible sequences of coin tosses, and thus certainly not an atom. However, a chance proposition \(C_{ch}\) will be a conjunction of such disjunctions; and the agent’s proposition algebra contains only the propositions they have a credence in. We do not think, then, that the option of a chance proposition being an atom should be a priori excluded. We thank the reviewers for pressing us on this.
This becomes slightly more complicated if there is no “beginning of epistemic life”, and there’s no instant at which the agent has tautologous evidence. We believe the additional complexity of the argument is not worth the gain in generality.
We would like to thank the reviewers for pressing us on this point.
Please allow us this innocent bit of anthropomorphism; if scholars in the field are fine with calling mathematical objects such as probability functions “modest”, they should also accept our “self-deception” idea.
Note that, in a sense, the total probability theorem will fail now. That is, it is easy to find a probability space in which the equation
$$\begin{aligned} P(A|B)=P((A|B)|C)P(C)+P((A|B)|{\overline{C}})P({\overline{C}}) \end{aligned}$$does not hold (with the expressions understood in accordance with the conjunctive convention).
I.e., without delineating rigorously the language in which the formulas are written.
A similar idea seems to have been discussed in Masterton (2010, Appendix A) of that thesis contains a derivation of “Miller’s Conditional Principle” from its unconditional version.
It’s not as if there are other comparably reasonable options here. However, some might object that even a single \(ch_i\) such that \(b_t(C_{ch_i})=0\) suffices for the whole sum \(\sum _{i=1}^n b_t(C_{ch_i})b_t(A|C_{ch_i})\) to be undefined; hence the proviso.
This is not a criticism: perhaps the authors thought this to be trivial. It was not trivial to us.
References
Briggs, R. (2009). The anatomy of the big bad bug. Noûs, 43(3), 428–449.
Briggs, R. (2009). The big bad bug bites anti-realists about chance. Synthese, 167(1), 81–92.
Ćurgus, B., & Jewett, R. I. (2015). Somewhat stochastic matrices. American Mathematical Monthly, 122(1), 36–42.
Eagle, A. (2021). Chance versus randomness. In E. N. Zalta (Ed.), The Stanford Encyclopedia of Philosophy (Spring 2021 ed.). Metaphysics Research Lab, Stanford University.
Earman, J. (2018). The relation between credence and chance: Lewis’ ‘Principal Principle’ is a theorem of quantum probability theory. Technical report. http://philsci-archive.pitt.edu/14822/
Earman, J. (2021). Implementing David Lewis ‘Principal Principle’: A program for investigating the relation between credence and chance. http://philsci-archive.pitt.edu/19922/
Gardenfors, P. (1982). Imaging and conditionalization. The Journal of Philosophy, 79(12), 747–760.
Genest, C., & Zidek, J. V. (1986). Combining probability distributions: A critique and an annotated bibliography. Statistical Science, 1(1), 114–135.
Hacking, I. (1967). Slightly more realistic personal probability. Philosophy of Science, 34(4), 311–325.
Hall, N. (1994). Correcting the guide to objective chance. Mind, 103(412), 505–517.
Hall, N. (2004). Two mistakes about credence and chance. Australasian Journal of Philosophy, 82(1), 93–111.
Hoefer, C. (2007). The third way on objective probability: A sceptic’s guide to objective chance. Mind, 116(463), 549–596.
Hoefer, C. (2019). Chance in the world. A Humean guide to objective chance. Oxford University Press.
Ismael, J. (2008). Raid! Dissolving the big, bad bug. Noûs, 42, 292–307.
Ismael, J. (2015). In defense of IP: A response to Pettigrew. Noûs, 49(1), 197–200.
Ismael, J. (forthcoming). Humean disillusion. In M. T. Hicks, C. Loew, & S. Jaag (Eds.), Humean laws for human agents. Oxford University Press.
Kyburg, H. E. (1981). Principle investigation. The Journal of Philosophy, 78(12), 772–778.
Lewis, D. (1976). Probabilities of conditionals and conditional probabilities. The Philosophical Review, 85(3), 297–315.
Lewis, D. (1986). A subjectivist’s guide to objective chance (with postscripts). In Philosophical papers (Vol. II, pp. 83–132). Oxford University Press.
Lewis, D. (1994). Humean supervenience debugged. Mind, 103(412), 473–90.
Liesen, J., & Mehrmann, V. (2015). Linear Algebra. Springer.
Masterton, G. (2010). Objective chance. A study in the Lewisian tradition. Filosofiska institutionen Uppsala.
Meacham, C. J. (2010). Two mistakes regarding the Principal Principle. British Journal for the Philosophy of Science, 61, 407–431.
Pettigrew, R. (2012). Accuracy, chance, and the Principal Principle. Philosophical Review, 121(2), 241–275.
Pettigrew, R. (2015). What chance-credence norms should not be. Noûs, 49(1), 177–196.
Pettigrew, R., & Titelbaum, M. G. (2014). Deference done right. Philosopher’s Imprint, 14(35), 1–19.
Roberts, J. T. (2001). Undermining undermined: Why Humean supervenience never needed to be debugged (even if it’s a necessary truth). Philosophy of Science, 68(3, Supplement: Proceedings of the 2000 Biennial Meeting of the Philosophy of Science Association. Part I: Contributed Papers), S98–S108.
Thau, M. (1994). Undermining and admissibility. Mind, 103(412), 491–503.
van Fraassen, B. C. (1984). Belief and the will. The Journal of Philosophy, 81(5), 235–256.
Weatherson, B. (2021). David Lewis. In E. N. Zalta (Ed.), The Stanford encyclopedia of philosophy. Metaphysics Research Lab, Stanford University.
Acknowledgements
We would like to thank Richard Pettigrew, Pavel Janda, Nancy Abigal Nuñez, and the participants of the 2022 Formal Epistemology Workshop (UC Irvine) for their insightful comments on earlier versions of the paper. We would also like to express our gratitude towards the anonymous reviewers for the offered suggestions, which have clearly improved the paper.
Funding
This research was supported by the grant 2019/34/E/HS1/00044, ‘Epistemic inaccuracy and foundational issues in formal epistemology’, of the National Science Centre (Poland).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
1.1 A.1: The PP implies the GPP
Proof
Assume PP. Suppose that there are exactly n propositions \(C_{ch}\) in the domain of the agent’s credence function. Then, if we treat each case where \( b_t(A|C_{ch_i})\) is undefined due to \(b_t(C_{ch_i})=0\) as contributing 0 to the first sumFootnote 25:
Thus, GPP holds. \(\square \)
1.2 A.2: The GPP does not imply the PP
To show this, it is enough to present a single credence function \(b_t\) which satisfies the GPP but does not satisfy the PP. We will first do so, and then comment on a general strategy for creating similar examples.
The first thing to notice is that if the formula
is satisfied for all A’s which are the atoms of the algebra of propositions on which \(b_t\) is defined, then it is satisfied for all elements of that algebra. Therefore, to see whether—given the functions \(\{ch_i \}_{1 \le i \le n}\) and the proposition \(E_t\)—a credence \(b_t\) satisfies the GPP, it is enough to check whether the formula (1) holds for all aforementioned atoms.
Consider, then, the setup from Table 1 (p. 8) involving four worlds \(w_1, \ldots , w_4\), where \(C_{ch_1} = \{w_1, w_2\}\), \(C_{ch_2} = \{w_3, w_4\}\), and \(B = \{w_1, w_3\}\), so that each possible world corresponds to one choice of a chance function and a truth value for B. For simplicity take \(E_t\) to be tautologous. We can establish that \(b_t\) satisfies the GPP by checking that (1) holds with the four atomic propositions substituted for A. For example, \(b_t(\{w_1\}) = 0.325 = b_t(C_{ch_1})ch_1(\{w_1\}) + b_t(C_{ch_2})ch_2(\{w_1\}) = 0.5 \times 0.4 + 0.5 \times 0.25\). However, the PP fails, since, to continue with the choice of the selected proposition, \(b_t(\{w_1\} | C_{ch_1}) = 0.65 \ne 0.4 = ch_1(\{w_1\})\).
For an example with a non-tautologous evidence proposition, take the above setup with \(E_{t'} = \{w_1, w_2, w_3\}\) and the credence \(b_{t'}\) (note the change in the time subscript, which we will use in Appendix A.5) depicted in Table 2.
Again, \(b_{t'}\) satisfies the GPP; for example, \(b_{t'}(\{w_1\}) = 0.44 = b_{t'}(C_{ch_1})ch_1(\{w_1\}|E_{t'}) + b_{t'}(C_{ch_2})ch_2(\{w_1\}|E_{t'}) = 0.64 \times 0.5 + 0.36 \times \nicefrac {1}{3} \). However, the PP fails, since \(b_{t'}(\{w_1\} | C_{ch_1}) = 0.6875 \ne 0.5 = ch_1(\{w_1\}|E_{t'})\).
Note that chance propositions are not atomic in these examples.
A general procedure for generating credence functions satisfying the GPP. Inspired by what we believe to be an incomplete argument for Proposition 11 from Pettigrew (2015) (see the remarks in A.5 below), we wanted to produce a linear-algebra-based procedure for generating credence functions which would satisfy the GPP. The original plan was to seek those which would not satisfy the PP or would not be preserved by BC; we now make the two points just by using the two simple tables. However, perhaps someone might find the following to be interesting; for the current paper it is useful in that the method can be used to show that the GPP does not have the unintended “metaphysical” consequence discussed above in Sect. 3.2.
Our structures will be finite. Consider the situation where we have some proposition \(E_t\) and n chance functions such that for any \(i,j \in \{1, \ldots , n\}\), \(ch_i(C_{ch_j} | E_t) >0\). Consider the implications of the following matrix multiplication equality (whose first element we will call below the “conditional chance matrix”):
If this indeed holds, then formula (1) is satisfied for all propositions from \(\{C_{ch_i} \}_{1 \le i \le n}\). Were they the atoms of the algebra of propositions, we would have defined the whole \(b_t\) so that it satisfied the GPP. And, for a conditional chance matrix satisfying the above conditions we can always find a vector \((b_t(C_{ch_1}), \ldots , b_t(C_{ch_n}) )\) such that the identity holds, because that matrix is a positive column-stochastic matrix, for which a (unique) positive eigenvector corresponding to the eigenvalue 1 whose entries sum up to 1 always exists (Liesen & Mehrmann, 2015, Theorem 8.16, p. 110). It would seem, then, that generating examples of credence functions satisfying the GPP given some evidence proposition \(E_t\) is a straightforward matter of mundane linear algebra calculations.
Notice, however, that the above will work only if \(E_t\) is tautologous. If it is not, there is no way for the conditional chance matrix to be positive: it will have to contain at least one row filled exclusively with 0’s. For example, if propositions are sets, if there are n possible worlds \(w_1, \ldots , w_n\), if for any i \(C_{ch_i} = \{ w_i \}\), and \(E_t = \{ w_1, \ldots , w_{n-1} \}\), then the bottom row of the conditional chance matrix has to contain only 0’s: because, for any i, \(ch_i(C_{ch_n} | E_t) = 0\). However, in such a situation, if all other entries of the matrix are positive, the conditional chance matrix is a “somewhat stochastic matrix” (Ćurgus & Jewett, 2015) whose “column variation” (ibid. 2015, p. 37) is lower than 1, and so, by the main Theorem of that paper, a unique non-negative eigenvector of that matrix with entries summing up to 1 exists, which is all we need for the GPP to be satisfied.
We can, then, take any n chance functions, choose some evidence proposition, write out the conditional chance matrix in any way that is compatible with that evidence proposition (that is, that the appropriate rows are null, and the columns add up to 1), and linear algebra guarantees the existence of a credence function satisfying the GPP under such constraints: on the assumption that the set \(\{C_{ch_i} \}_{1 \le i \le n}\) is chosen as the set of the atoms of the Boolean Algebra of propositions on which the credences are defined. Thus, each atom of the algebra of propositions is a chance proposition, and vice versa. We wrote a Python script to calculate such credences for randomly generated conditional chance matrices; we then found specific cases which allowed us to illustrate the points made in Appendices A.2, A.5 and A.7. Since we are now showing them in a direct way, we are omitting the Python-generated examples; we can make the source code available to anyone interested. However, one point remains to be made: that GPP allows for non-trivial chance functions which are ur-chances at just a single world, and thus avoids one unintended “metaphysical” consequence as per Section 3.2 above.
Consider first a situation with a tautologous \(E_t\), three possible worlds \(w_i\) such that \(C_{ch_i} = \{w_i \}\) (for \(1 \le i \le 3\)), and the following conditional chance matrix (which is conditional in name only, due to the tautologous \(E_t\)):
There is a single credence function \(b_t\) satisfying the GPP for that chance matrix: \((b_t(C_{ch_1}), b_t(C_{ch_2}),\) \(b_t(C_{ch_3})) = (0.594346, 0.203234, 0.2024197)\). It is immediate that \(b_t\) does not satisfy the PP, since the three chance functions are nontrivial, and yet the chance propositions are atomic. Therefore, not only is this a very roundabout way of showing that the GPP does not imply the PP, but also the credence shows that the GPP avoids one unintended “metaphysical” consequence from Section 3.2 above.
For an example with a non-tautologous evidence proposition, choose \(E_{t} = \{w_1, w_3 \}\) and the following conditional chance matrix:
There is again a unique credence function \(b_t\) satisfying the GPP for that chance matrix: \((b_{t}(C_{ch_1}),\) \(b_{t}(C_{ch_2}), b_{t}(C_{ch_3})) = (0.5758157, 0, 0.4241842)\). It is again immediate that \(b_{t}\) does not satisfy the PP, and concerns non-trivial chance functions each of which is an ur-chance at only a single possible world.
1.3 A.3: The GNP is equivalent to the NP
That NP implies the GNP can be easily shown by relying on the total probability theorem, analogously to the argument from Appendix A.1.
We will show that the GNP implies the NP. Assume \(b_t\) satisfies GNP. Consider a chance function \(ch_i\). Apply the GNP substituting “\(A C_{ch_i}\)" for “A”:
Thus, for any choice of ch, \(b_t(A|C_{ch})=\frac{b_t(A C_{ch})}{b_t(C_{ch})}=\frac{b_t(C_{ch})ch(A |E_t C_{ch})}{b_t(C_{ch})}=ch(A |E_t C_{ch})\). Therefore the NP is satisfied.
Summing up: the GNP and the NP are equivalent.
1.4 A.4: The PP and the NP are preserved by BC
That the NP is preserved by BC is shown by Pettigrew (2015, Proposition 8). The proof for the PP is similar: assuming that \(b_t\) satisfies (PP) and (BC) holds, and also that \(E_{t'}\subseteq E_t\), \(b_{t'}(A|C_{ch})=\frac{b_{t'}(AC_{ch})}{b_{t'}(C_{ch})}{\mathop {=}\limits ^{(BC)}}\frac{b_{t}(AC_{ch}|E_{t'})}{b_{t}(C_{ch}|E_{t'})}=\frac{b_{t}(AE_{t'}|C_{ch})}{b_{t}(E_{t'}|C_{ch})}{\mathop {=}\limits ^{(PP)}}\frac{ch(AE_{t'}|E_t)}{ch(E_{t'}|E_t)}=ch(A|E_{t'})\). Therefore \(b_{t'}\) satisfies the PP.
1.5 A.5: The GPP is not preserved by the BC
Consider the setup from Tables 1 and 2. The first of these defines a credence \(b_t\); the second displays the outcome of conditionalising on \(E_{t'}\), \(b_{t}(\cdot | E_{t'})\), as well as the (unique) function \(b_{t'}\) which satisfies the GPP under the stated assumptions. Since the two functions differ, the GPP is not preserved under the BC. We don’t even need to appeal to the aforementioned uniqueness to see this; we can take any atomic proposition to see that \(b_{t}(\cdot | E_{t'})\) fails the GPP, and since it is obtained via conditionalisation from \(b_t\) which satisfies the GPP, we obtain the desired result.
A literature note. This fact was first shown in Pettigrew and Titelbaum (2014), as pertaining to the principle (IX) which “generalizes Jenann Ismael’s General Recipe” (ibid. 2014, p. 3). There is no argument in the paper that the prior credence employed in the proof satisfies the (IX) principleFootnote 26; we decided it would not be a waste, then, to produce a smaller structure which can do the same job and to offer the argument in question [see the discussion of formula (1) in Appendix A.2].
Pettigrew (2015, Proposition 9) ostensibly gives the same result (with “IP” being our “GPP”). However, we do not believe the argument is conclusive. That (1) holds regarding the credence function discussed there is established for chance propositions only. However, that credence function is regarded in the context of an extensive example about coin tosses, which involves other propositions, and so the matter of the credence function satisfying the GPP is not settled. It would be settled, by reasoning from the first paragraph of A.2, if the chance propositions were the atoms of the algebra of propositions, but it seems that in the coin toss example (ibid., pp. 182–183) this is not the case.
1.6 A.6: The PP and the NP are “dot principles”
We will give the argument for the NP; the one for the PP is almost identical. Recall the “conjunctive convention” from p. 9 above. The reasoning is as follows: \(b_t((A|B)|C_{ch}){\mathop {=}\limits ^{conj.~conv.}}b_t(A|BC_{ch})=\frac{b_t(AB|C_{ch})}{b_t(B|C_{ch})}{\mathop {=}\limits ^{(NP)}}\frac{ch(AB|C_{ch}E_t)}{ch(B|C_{ch}E_t)}=ch(A|BC_{ch}E_t){\mathop {=}\limits ^{conj.~conv.}}ch((A|B)|C_{ch}E_t)\). Therefore, the “regular” and the “dot” versions of the New Principle are equivalent.
1.7 A.7: The GPP is not a “dot principle”
Let us write out the formula from the definition of GPP\(\cdot \) (p. 9) with “\(A | E_{t'}\)” instead of the dot, under the conjunctive convention:
![](http://media.springernature.com/lw358/springer-static/image/art%3A10.1007%2Fs11229-022-03996-1/MediaObjects/11229_2022_3996_Equ20_HTML.png)
For the last equality we employ the natural assumption that \(E_{t'} \subseteq E_{t}\) and the usual practice of not counting the contributions of expressions with undefined conditional chances. Take now the values of \(b_t\) from Table 1, the other required values from Table 2, and see that the above does not hold e.g. for “\(\{w_2\}\)” substituted for “A”: \(b_t(\{w_2\}|E_{t'}) = 0.226 \ne 0.229 = 0.5 \times 0.125 + 0.5 \times \nicefrac {1}{3} = b_t(C_{ch_1})ch_1(\{w_2\} | E_{t'}) + b_t(C_{ch_2})ch_2(\{w_2\} | E_{t'})\) (the values “.226” and “.229” are approximate). Therefore the GPP is not a “dot principle”.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wroński, L., Gyenis, Z. & Cocchiaro, M.Z. What chance-credence norms should be. Synthese 200, 508 (2022). https://doi.org/10.1007/s11229-022-03996-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11229-022-03996-1