1 Introduction

Meet Sally. Like the rest of us, Sally has beliefs, broadly construed—there’s some way she takes the world to be that’s generally responsive to her evidence, and which along with her desires guides her behaviour. This paper concerns what Sally’s beliefs might be like at the most fundamental level, and the relationship between the different types of beliefs she might be taken to have.

To get the ball rolling, I’m going to assume that Sally has at least two kinds of belief: partial and comparative. With respect to the former, Sally is, for instance, quite certain there’s an external world, uncertain about the consequences of global warming, but doubtful they’ll be good. These are attitudes directed towards individual propositions, each coming with some (possibly vague or imprecise) degree of strength that can at least sometimes be represented numerically. Her comparative beliefs, on the other hand, relate pairs of propositions, and do not come in degrees. Sally is, for example, more confident that she’ll find good coffee in Melbourne than she will in Sydney, and indeed just as confident that she’ll find good coffee in Sydney as that she’ll win the lottery next week. If there’s also a sense in which Sally has ‘all-or-nothing’ beliefs, then we’ll assume that these derive in one way or another from the facts about her partial and/or comparative beliefs.

Taking all that for granted, it’s natural to wonder about the relationship between Sally’s partial and comparative beliefs. It’s clear enough they’re closely related: if Sally is x% confident that P and y% confident that Q, then she’s more confident that P than she is that Q just in case \(x>y\); and in the other direction, if Sally just as confident that P as she is that Q, then she’s x% confident that P just in case she’s x% confident that Q. Moreover, these conditionals have a certain feel of apriority about them, so it’s reasonable to think they’re underwritten by some interesting conceptual or metaphysical connection.

According to comparativism, the facts about Sally’s partial beliefs supervene on, and indeed hold in virtue of, the facts about her comparative beliefs.Footnote 1 Comparativism comes in a range of shapes and sizes, with roots going back to the writings of Keynes (1921) and de Finetti (1931). Works favourable to comparativism include (Koopman 1940; Savage 1954; Fine 1973; Stefánsson 2017, 2018; DiBella 2018). Closely related ideas have also been defended in Joyce (2010, (2015), and Hawthorne (2016). We’ll talk more about the details in a moment; but the general thought is that an agent like Sally’s system of partial beliefs can be reduced to a ranking of propositions by their relative confidence, with numerical strengths of belief thus serving as a ‘theoretical tool’ for representing the positions of propositions within that ranking.

In Sect. 3, I’ll argue that comparativism presently lacks any adequate account of the measurement of the strengths of our beliefs. Objections and responses to my arguments will be considered in Sects. 4 and Sect. 5, I will consider the related idea that the facts about partial beliefs are determined by the facts about comparative beliefs plus some further qualitative mental state (e.g. preferences, or qualitative judgements about evidential relationships). But before we discuss any problems with comparativism, we should get a clearer idea of what comparativism is.

2 Probabilistic Comparativism

There are two essential components to any comparativist’s theory. The first, to put it roughly, is an explanation of where the numbers come from. As B.O. Koopman once expressed the idea,

... all the axiomatic treatments of intuitive probability current in the literature take as their starting point a number (usually between 0 and 1) corresponding to the ‘degree of rational belief’ or ‘credibility’ of the eventuality in question. Now we hold that such a number is in no wise a self-evident concomitant with or expression of the primordial intuition of probability, but rather a mathematical construct derived from [comparative beliefs] under very special conditions... (1940, p. 269)

The second component is an explanation of where cardinality comes from. The numbers we use to represent the strengths of belief seem to encode more-than-merely-ordinal (read: cardinal) information, and this requires explanation. Why, for instance, does it seem to make sense to say such things as Sally believes P much more than Q, or twice as much as Q, or a fraction as much as Q?

In Sect. 2.1, I’ll say more about the first component; in Sects. 2.22.3, more about the second. I note, though, that I will not try to describe every possible variety of comparativism on the market. Instead, I’ll focus my exposition on a relatively straightforward and especially common version, probabilistic comparativism.

2.1 Where the Numbers Come From

I like to think that Sally accepts probabilism, as well she should. Supposing that she does, what is it then that she accepts? Well, according to the usual gloss, probabilism says that her beliefs ought to conform to the axioms of the probability calculus. But that can’t be quite right: the axioms of the probability calculus are constraints on real-valued functions, and whatever Sally’s beliefs might be, they aren’t literally functions between propositions and real numbers.

Of course, there’s no real problem here—not yet, anyway. The usual gloss on probabilism was only meant to be elliptical. The intention all along was that Sally’s beliefs ought to be such that they’re representable by a probability function. But now what does that mean?

We can break the claim down into two parts. The first is a constraint on the set of propositions (call it \({\mathbf {B}}\)) regarding which Sally has beliefs—specifically, that it should have a ‘Boolean’ structure. A nice and general way to describe that structure is to suppose that propositions are (or can be modelled by) sets of possible worlds.Footnote 2 If we let \(\Omega\) henceforth denote an appropriately chosen set of such worlds, then probabilists will typically require:

Boolean.:

\({\mathbf {B}}\) is non-empty, and if P and Q are in \({\mathbf {B}}\), then \(\Omega \setminus P\) (henceforth: \(\lnot P\)), \(P\cup Q\), and \(P\cap Q\) are also in \({\mathbf {B}}\)

The second (and more interesting) part is a constraint on the beliefs themselves. Specifically, Sally’s beliefs ought to be such as can be represented by a function \(pr :{\mathbf {B}}\mapsto {\mathbb {R}}\) satisfying:

Normalisation.:

\(pr (\Omega )=1\)

Non-Negativity.:

\(pr (P)\ge 0\)

Additivity.:

If \(P\cap Q = \varnothing\), then \(pr (P\cup Q) = pr (P)+ pr (Q)\)

So what would Sally’s beliefs have to be like, exactly, to be probabilistically representable?

Comparativism offers an answer. It’s not the only possible answer, but it’s not an intrinsically implausible one either, and historically it has been extremely influential. The comparativist says that Sally’s partial beliefs are nothing over and above her comparative beliefs. Consequently, a probability function represents Sally’s beliefs just when the ordering of the numerical values assigned to the propositions she believes corresponds exactly to the ordering induced over those propositions by her comparative beliefs.

Assume henceforth that \({\mathbf {B}}\) is Boolean, and assume furthermore that Sally’s full suite of comparative beliefs can be represented with a single binary relation \(\succsim\) defined over \({\mathbf {B}}\), where

$$\begin{aligned} P\succsim Q\text { iff } \text {Sally has at least as much confidence in }P\text { as in }Q, \end{aligned}$$

and where \(\succ\) and \(\sim\) stand for the more confident and equally confident comparatives respectively,

$$\begin{aligned} P\succ Q \text { iff } (P\succsim Q)\, \& \,\lnot (Q\succsim P),\qquad \, P\sim \,Q \text { iff } (P\succsim \,Q)\, \& \,(Q\succsim \,P) \end{aligned}$$

We’ll refer to \(\succsim\) as Sally’s confidence ranking. Given this, say that a real-valued function f (not necessarily a probability function) agrees with \(\succsim\) just in case

$$\begin{aligned} P\succsim \,Q \text { iff } f(P)\ge \,f(Q) \end{aligned}$$

Thus, if we suppose that Sally’s beliefs reduce to her comparative beliefs, the notion of agreement provides us with an unambiguous sense in which those beliefs can be represented by a probability function. It also gives clear meaning to Koopman’s assertion above that the numbers used to represent strengths of belief are just ‘mathematical constructs’ designed to help us reason about what, according to comparativism, is ultimately a system of purely ordinal judgements of relative confidence.

Supposing we accept all this, then an important benefit is that we’re able to supply an alternative and unambiguous formulation of exactly what it is that probabilism requires. Since (de Finetti 1931), we’ve known that probabilistic agreement requires Sally’s confidence ranking to satisfy (for all \(P,Q,R\in {\mathbf {B}}\)):

Weak Order.:

\(\succsim\) is transitive and complete

Non-Triviality.:

\(\Omega \succ \varnothing\)

Minimality.:

\(P\succsim \varnothing\)

Monotonicity.:

If \(R\cap (P\cup \,Q)=\varnothing\), then \(P\succsim \,Q\) iff \((P\cup \,R)\succsim (Q\cup \,R)\)

We need something a little stronger than Monotonicity if we want necessary and sufficient conditions for probabilistic agreement (Kraft et al. 1959; Scott 1964), and we need something even stronger still if we want there to be only one probability function that agrees with the confidence ranking (see Fishburn 1986). But the details of those further (and more complicated) conditions need not concern us—what’s listed is more than enough to get a handle on the kind of shape a confidence ranking has to have to be probabilistically representable.Footnote 3

2.2 Where Cardinality Comes From: The Transformation Argument

Now you might worry that something’s still missing from the picture I’ve just described: comparativists still owe us an explanation of where cardinality comes from. This is sometimes raised as a special challenge for comparativism (e.g. Meacham and Weisberg 2011, p. 659), and it will be the primary focus of my critical discussion in Sect. 3 as well.

Imagine for example that Sally is about to roll an ordinary six-sided die. Now consider:

Ordinal.:

Sally is more confident of rolling \(\ge\)2 than of rolling a 1

Interval.:

Sally is much more confident of rolling \(\ge\)2 than of rolling a 1

Ratio.:

Sally is five times as confident of rolling \(\ge\)2 than of rolling a 1

To get an initial sense of why one might think there’s a problem here, suppose we have a probability function, \(pr\), which agrees with \(\succsim\), such that

$$\begin{aligned} pr(\text {rolling}\ge 2)= \frac{5}{6},\quad \,pr(\text {rolling}\ 1)=\frac{1}{6} \end{aligned}$$

So pr also seems to capture the truth of Interval and Ratio. However, any order-preserving transformation of \(pr\) will also agree with \(\succsim\), and only information that’s common to each of these functions—the ordering—can be taken to represent something doxastically ‘real’. We know that neither ratios nor ratios of intervals need be preserved across arbitrary order-preserving transformations. Therefore, it seems, only claims like Ordinal can have genuine doxastic meaning, whereas claims like Interval and Ratio seem to depend on an arbitrary choice of scale. Call this the transformation argument; the upshot is supposed to be that comparativism cannot explain how partial beliefs manage to carry cardinal information.

The transformation argument is flawed. Comparativists have long-standing explanatory strategy for explaining cardinal information that’s entirely consistent with their view, one which is based on standard methodologies for explaining cardinality in basic extensive quantities like length and mass. (This explanation can be found in numerous locations, though it’s discussed in particular depth in (Fine 1973, pp. 68ff.) So, in the remainder of this section I’ll first explain why the transformation argument is flawed by showing why a parallel argument fails in the case of length; following that I’ll describe the standard ‘probabilistic comparativist’ explanation of cardinality.

We’ll focus on characterising ratios of lengths, since once those are defined it’s straightforward to define ratios of intervals of lengths. Hence, consider ‘\(\alpha\) is five times longer than \(\beta\)’. Our task is to provide truth conditions for this sentence without presupposing any cardinal information. Towards that end, suppose we have at hand five further objects, \(\gamma _1\)\(\gamma _5\), each of which is exactly as long as \(\beta\), and which share no parts with one another. You might refer to these as \(\beta\)’s length-duplicates. Glossing over a few complications, it’s plausible enough to say that ‘\(\alpha\) is five times longer than \(\beta\)’ would be true if, and only if, were you to take these five length-duplicates of \(\beta\) and lay them end-to-end, then the result would be as long as \(\alpha\):

figure a

This suggests that it’s possible to give at least rough-and-ready truth conditions for a claim about ratios of lengths in purely relational terms: ‘\(\alpha\) is five times longer than \(\beta\)’ just means \(\alpha\) is as long as 5 disjoint length-duplicates of \(\beta\) laid end-to-end. Note, in particular, that the truth conditions say nothing whatsoever about how lengths are numerically represented.

That’s the basic idea, and simple variations on the same strategy can be used to provide truth conditions for any other rational ratio comparison as well. (For example, what we’ve said already entails that \(\gamma _1\) and \(\gamma _2\) laid end-to-end will be \(\frac{2}{5}\) as long as \(\alpha\), and \(\frac{2}{3}\) as long as \(\gamma _3\), \(\gamma _4\), and \(\gamma _5\) laid end-to-end.) With a bit of mathematical trickery we can extend the strategy to provide truth conditions for irrational ratio comparisons as well.

But regardless of whether we generalise it to all real ratio comparisons or just stick with rational ratios, we’ll need to make two general assumptions. The first is existential: if we’re going to fix a length ratio between any pair of objects, then the strategy requires that there are always enough length-duplicates of the appropriate length lying about that we can lay end-to-end. That’s the rough version; we don’t need to worry too much about what the precise version looks like for this discussion. What matters, simply, is that the more length ratios you want to characterise, the more objects of varying lengths you’re going to need. And that’s reasonable enough, given there’s quite a few objects in the universe for us to play with. The second assumption concerns the structure of the ordinal length relations themselves, and how those relations interact with the physical operation of laying objects end-to-end. In short: the is at least as long as relation should behave like \(\ge\), and the laid end-to-end operation should relate to the is at least as long as relation in the same way \(+\) relates to \(\ge\).

We can state this requirement more exactly. Let \(\succsim ^\star\) designate the is at least as long as relation, with \(\sim ^\star\) and \(\succ ^\star\) defined in the usual way. Then, let \(\alpha \oplus \beta\) designate some object that’s as long as two disjoint \(\gamma _1\), \(\gamma _2\) laid end-to-end, where \(\gamma _1\sim ^\star \alpha\) and \(\gamma _2\sim ^\star \beta\). Furthermore, say that \(\alpha\) has zero length just in case for all \(\beta\), \(\alpha \oplus \beta \sim ^\star \beta\). Then, we need that for all \(\alpha\), \(\beta\), \(\gamma\),Footnote 4

Weak Order.:

\(\succsim ^\star\) is transitive and complete

Positivity.:

\(\alpha \oplus \beta \succ ^\star \alpha\) whenever \(\beta\) has non-zero length

Weak Commutativity.:

\((\alpha \oplus \beta )\sim ^\star (\beta \oplus \alpha )\)

Weak Associativity.:

\(\alpha \oplus (\beta \oplus \gamma )\sim ^\star (\alpha \oplus \beta )\oplus \gamma\)

Monotonicity.:

\(\alpha \succsim ^\star \beta\) iff \(\alpha \oplus \gamma \succsim ^\star \beta \oplus \gamma\)

Archimedean.:

Nothing is infinitely longer than anything else

Each of these conditions is plausible, and one of the key results in the theory of measurement is that if they’re all satisfied, and the appropriate existential condition is also satisfied, then there’s a way f of assigning real numbers to objects such that for all such objects \(\alpha ,\beta\),

  1. 1.

    \(f(\alpha )\ge \,0\)

  2. 2.

    \(f(\alpha )\ge \,f(\beta ) \text { iff } \alpha \succsim ^\star \beta\)

  3. 3.

    \(f(\alpha \oplus \beta )=f(\alpha )+f(\beta )\)

Call any f satisfying these three properties an additive measure of \(\succsim ^\star\).

Now it’s important to be clear how the existence of an additive measure relates to the transformation argument. It’s well known that additive measures are unique up to positive similarity transformations—that is, if f is an additive measure then so too is \(f^\star\) just in case

$$\begin{aligned} f^\star (\alpha )=r\cdot \,f(\alpha ),\quad r>0 \end{aligned}$$

And it’s also well known that positive similarity transformations preserve ratios. Consequently, if length is measured using any additive measure f, then \(f(\alpha )=n\cdot \,f(\beta )\) just in case \(\alpha\) is as long as n of \(\beta\)’s length-duplicates laid end-to-end. But none of these facts yet explain why ‘\(\alpha\) is n times longer than \(\beta\)’ has genuine real-world meaning, simply because there’s nothing about the nature of length itself that forces the use of an additive measure (cf. Krantz et al. 1971, pp. 11–12, 99ff; Luce and Narens 1984).

For concreteness, where \({\mathbf {O}}\) is the set of objects, say that f is an accurate measure of the length relation \(\succsim ^\star\) just in case it maps \({\mathbf {O}}\) onto some \({\mathbb {R}}^\star \subseteq {\mathbb {R}}\) such that \(\succsim ^\star\) is represented by \(\ge\) and \(\oplus\) is represented by some operation \(\circ :({\mathbb {R}}^\star \times {\mathbb {R}}^\star )\mapsto {\mathbb {R}}^\star\). The additive measures are then a special class of the accurate measures—those in which \({\mathbf {O}}\) is mapped onto a (subset of) the non-negative reals \({\mathbb {R}}_{\ge 0}\) and \(\oplus\) is represented with \(+\). But there’s an infinite variety of equally accurate non-additive measures that we could have chosen to measure length with instead. Consider, for example, a multiplicative measure \(f'\), whereby:

  1. 1.

    \(f'(\alpha )\ge \,1\)

  2. 2.

    \(f'(\alpha )\ge \,f'(\beta ) \text { iff } \alpha \succsim ^\star \beta\)

  3. 3.

    \(f'(\alpha \oplus \beta )=f'(\alpha )\times \,f'(\beta )\)

The two numerical systems \(\langle {\mathbb {R}}_{\ge 0},\ge ,+\rangle\) and \(\langle {\mathbb {R}}_{\ge 1},\ge ,\times \rangle\) are isomorphic to one another, so a multiplicative measure will be accurate whenever an additive measure is. But multiplicative measures are not unique up to positive similarity transformations, and \(\alpha\) will be as long as n of \(\beta\)’s length-duplicates laid end-to-end iff \(f'(\alpha )=f'(\beta )^n\).

More generally, given that \(\circ\) can be any operation on some \({\mathbb {R}}^\star \subseteq {\mathbb {R}}\), every order-preserving transformation \(f''\) of any accurate measure f will itself count as accurate relative to some \({\mathbb {R}}^\star\) and \(\circ\). In the vast majority of these cases the operation \(\circ\) will seem ‘unnatural’ by comparison to \(+\) or \(\times\), and only the most masochistic of us would want represent \(\oplus\) using it—but that doesn’t change the fact that we could in principle do so if were we so inclined.

There’s no fact of the matter as to which measure f, \(f'\), \(f''\), ..., is the correct one to use, only a choice borne of convention and convenience. So let’s say that a measure is adequate just in case it’s accurate and it represents \(\oplus\) in a convenient way. Additive measures of length are adequate, but then so too are multiplicative measures (albeit slightly less so). The measure \(f''\) generated by an arbitrary order-preserving transformation of f (or of \(f'\)) will in most cases be highly inadequate, though it’ll be no less accurate because of this.

So we can now finally say exactly where the transformation argument is going wrong. The error lies in thinking that what we’ve been calling ‘ratio information’ in our partial beliefs depends in any interesting way on the numbers we use to represent the strengths of those beliefs. This is a mistake, as the example with length shows. There’s nothing stopping us from using an additive measure f or a multiplicative measure \(f'\) to measure the length relation \(\succsim ^\star\), but

$$\begin{aligned} f(\alpha )=n\cdot \,f(\beta ) \text { iff } f'(\alpha )=f'(\beta )^n \end{aligned}$$

That is: there’s no deep connection between what we’ve determined to be the truth conditions for claims about ‘ratios’ of lengths and the specific numerical relationships that hold between the numbers assigned by different but equally accurate ways of measuring length. We call it ‘ratio information’ because we’re accustomed to using additive measures, but had physics developed slightly differently we’d be calling it ‘power information’.

Ultimately, it’s the structure of the underlying system of length relations that explains why a claim like ‘\(\alpha\) is five times longer than \(\beta\)’ has real-world meaning. And it’s that underlying structure which is established by the existence of an additive measure—the fact that additive measures are also unique up to a positive similarity transformation is entirely besides the point. By the same token, if comparativism is right, then it’s the confidence ranking itself that contains cardinal information if anything does; how we choose to represent that information numerically is irrelevant, and it’s the failure to appreciate this that leads to the erroneous transformation argument.

2.3 Where Cardinality Comes From: The SCEC

But comparativism isn’t out of the woods yet! Showing that the same style of explanation of cardinal information can in principle be made to work for comparative beliefs requires some heavy duty assumptions about \({\mathbf {B}}\) and \(\succsim\), analogous to the existential and structural assumptions needed to guarantee the existence of an additive measure of length.

Before anything else, it needs to be shown that there’s some operation on propositions (qua relata of \(\succsim\)) that can serve as an ‘addition’ operation in the same way that \(\oplus\) serves as an ‘addition’ operation for length. Now such an operation is easy to find, if Sally’s confidence ranking agrees with a probability function—for then the restriction of the union operation to disjoint propositions will behave like addition with respect to that ranking. This is an immediate consequence of Additivity. Consequently, comparativists have typically pointed towards the union of disjoint propositions as their proposed qualitative analogue of addition (e.g. Fine 1973, p. 68; Krantz et al. 1971, p. 200; Stefánsson 2018; DiBella 2018; though cf. Elliott 2020, Sect. 3.2, for a more general operation).

And indeed, if \(\succsim\) agrees with a probability function, then for all disjoint \(P,Q,R\in {\mathbf {B}}\),

Weak Order.:

\(\succsim\) is transitive and complete

Positivity.:

\((P\cup \,Q)\succ \,Q\) whenever \(P\succ \varnothing\)

Weak Commutativity.:

\((P\cup \,Q)\sim (Q\cup \,P)\)

Weak Associativity.:

\((P\cup (Q\cup \,R))\sim ((P\cup \,Q)\cup \,R)\)

Monotonicity.:

\(P\succsim \,Q\) iff \((P\cup \,R)\succsim (Q\cup \,R)\)

Archimedean.:

Sally is not infinitely more confident that P than that Q

So if Sally’s confidence ranking agrees with some probability function \(pr\), then ‘Sally believes P n times as much as Q’ whenever \(P\succsim \,(R_1\cup \,\dots \ \cup \,R_n)\), where the \(R_1,\dots ,R_n\) are disjoint and \(R_1\sim \,\dots \ \sim \,R_n\sim \,Q\). Here the \(R_1,\dots ,R_n\) are Q’s confidence-duplicates, and it follows that \(pr(P)=n\cdot \,pr(Q)\). Hence, \(\succsim\) has an ‘additive’ structure with respect to the union of disjoint propositions, which can thus be adequately represented using the additive measure pr.

We have arrived at what I’ll call the standard comparativist explanation of cardinality, or SCEC. If the SCEC is on the right track, then there’s a close analogy between the measurement of partial belief and the measurement of length, and this undoubtedly lends some plausibility to the comparativist’s thesis (cf. Fine 1976, 1973, pp. 15–16; Stefánsson 2017, 2018). After all, there’s something obviously compelling about explaining ratios of strengths of belief using a method that’s consonant with how such things are usually described for other basic physical quantities like length.

But we should be careful not to overstate what the foregoing discussion establishes. Nothing so far suffices to show that if \(\succsim\) agrees with a probability function, then Sally has, say, at least twice as much confidence in P as in Q if or only if \(P\succsim (R_1\cup \,R_2)\), where \(R_1\) and \(R_2\) are disjoint confidence-duplicates of Q. Still less have comparativists given us any detailed story on what to say when \(\succsim\) doesn’t agree with a probability function. Rather, the SCEC is a ‘how possibly’ story—an account of how comparativism might explain cardinality given some very strong assumptions about the shape of \(\succsim\) and its relationship with \(\cup\). It’s enough to establish that comparativism might in principle be able to account for cardinality, at least sometimes, but by no means does it establish that the SCEC is correct.

It is also important to note that there’s more than one way to explain how partial beliefs might come to contain meaningful cardinal information. Denying the SCEC does not commit one to the manifestly absurd idea that there is no qualitative explanation of where the numbers come from and how they manage to encode cardinal information. Other well-established methods for understanding ratio information do not follow the SCEC’s pattern—i.e., they do not require theorists to locate some operation (like \(\oplus\) or \(\cup\)) which shares structural characteristics with \(+\).

For example, in additive conjoint measurement (Luce and Tukey 1964; Krantz et al. 1971), cardinal information for a given quantity q can be determined by reference to how that quantity interacts with another quantity \(q'\) to produce an ordering \(\succsim ^\dagger\) with respect to some further quantity \(q^\dagger\) distinct from either q or \(q'\). No real-world ‘addition’ operation is needed here: structural properties of the \(\succsim ^\dagger\)-ordering are used to establish that intervals and/or ratios have meaning for q (and for \(q'\)), given background theoretical assumptions about how q and \(q'\) interact to produce \(\succsim ^\dagger\). Kahneman and Tversky (1979) have suggested that conjoint measurement theory can be used to simultaneously explain the measurement of both partial beliefs and utilities in terms of how they interact to produce preferences over choices, and a representation theorem which simultaneously builds subjective probabilities and utilities out of preferences (e.g., Savage 1954) can help to provide the foundations for such a view.

A distinct but closely related strategy explains the measurement of partial belief on the pattern of dimensionless quantities. Unlike conjoint measurement, where the cardinal properties of two quantities q and \(q'\) are defined by relation to a third quantity \(q^\dagger\), most dimensionless quantities q of interest are defined in terms of ratios of differences in a single distinct quantity \(q'\). Consider Mach numbers. A Mach number is not a unit of speed; rather, it’s a ratio that represents the speed of an object travelling through a medium relative to the speed of sound in that medium. If we let \(s\) be any measure of speed on at least an interval scale, then relative to a given medium we can define an object’s Mach number M:

$$\begin{aligned} M (\mathrm {object}) \,=\, \frac{ s (\mathrm {object}) - s (\mathrm {stationary})}{ s (\mathrm {sound}) - s (\mathrm {stationary})} \end{aligned}$$

Along these lines, I prefer an approach that originates with Ramsey (1931): beliefs are causally tied to preferences in such a way that cardinal information can be extracted from their relationship. On a very simplified version of this approach, the degree of belief Sally has towards P is a ratio that represents the impact the belief has on the utility of a gamble conditional on P relative to that gamble’s best and worst outcomes—where u measures the Sally’s preferences measured on an interval scale, Sally is indifferent between Q and \(\langle\) \(R\) if P, S otherwise\(\rangle\), and she prefers R to S, then

$$\begin{aligned} pr (P) \,=\, \frac{ u (Q)- u (S)}{ u (R)- u (S)} \end{aligned}$$

Note that saying this has no implications regarding the relative fundamentality of beliefs, utilities, and preferences. The claim is not that the facts about partial beliefs are nothing over and above the facts about preferences; nor is the claim that beliefs have no other functional roles other than those in relation to preferences. Instead, the claim need only be that the meaningfulness of ratios and intervals of strengths of belief is explicable by reference to that part of the typical causal role that partial beliefs play in relation to preferences.

Comparativists will no doubt have objections to these alternatives. I’ve defended both in other works, and I’m not going to do it again here (see Elliott 2017, 2019b). The present point is just that an explanation of ratios of strengths of beliefs need not be based on the model of the measurement of length at all. Comparativists don’t have a monopoly on scientifically respectable explanations of cardinal information!

3 The Case Against the SCEC

The overarching goal for the following discussion will be to show that the SCEC isn’t quite as illuminating as we might have hoped. Broadly speaking, I’ll argue for this in two ways.

First, there appear to be conceptually possible and theoretically-relevant distinct states of partial belief, with corresponding and apparently meaningful differences in cardinal information, which correspond to the very same probabilistically representable confidence ranking (Sects. 3.13.4). Hence, the SCEC cannot explain the differences in cardinal information. More generally, these arguments suggest that comparative beliefs do not suffice to determine the facts about partial beliefs, which alone rules out any strict version of comparativism.

Second, there appear to be conceptually possible and theoretically-relevant distinct states of partial belief that correspond to no probabilistically representable confidence ranking whatsoever (Sect. 3.5). So, again, the SCEC cannot explain the cardinal information present in those systems of belief.

3.1 The SCEC and Almost Omniscience

Suppose that Sally is almost omniscient:

Example 1

Sally is ideally rational, and her comparative beliefs satisfy all the requirements for agreement with a probability function. Furthermore, she is almost omniscient, in the sense that she’s narrowed down which possible world she inhabits to exactly two possibilities: \(\omega _1\) and \(\omega _2\). While Sally’s got some confidence in each, she’s much more confident that the actual world is \(\omega _1\) than that it’s \(\omega _2\).

The notion of almost omniscience should make sense; in fact, a minor variation on it already exists in the literature in the case of David Lewis’ two gods (1979, pp. 520–1). And we could easily imagine each one of Lewis’ gods being more or less confident regarding which of the two (centred) worlds they inhabit by some substantial amount, even if the exact amount is itself imprecise to some degree. (The point here won’t hinge on whether the strengths of belief are precise.) So I take it that the situation described is conceivable.

However, the SCEC cannot explain how an almost omniscient Sally might be much more confident that the actual world is \(\omega _1\) than that it’s \(\omega _2\). A probability function \(pr\) will agree with \(\succsim\) if and only if, for all P in \({\mathbf {B}}\),

$$\begin{aligned} pr (P)={\left\{ \begin{array}{ll} \,0,&\,\,\text {if neither }\omega _1\text { nor }\omega _2\text { are in }P,\\ \,x,&\,\,\text {if }\omega _1\text { is in }p,\text { but }\omega _2\text { is not in }P,\\ \,y,&\,\,\text {if }\omega _2\text { is in }p,\text { but }\omega _1\text { is not in }P,\\ \,1,&\,\,\text {if both }\omega _1\text { and }\omega _2\text { are in }P, \end{array}\right. }\quad \text { where }1>x>0.5>y>0 \end{aligned}$$

Now the problem here is not that there are some probability functions satisfying this condition where the different between x and y is large (arbitrarily close to 1), and some where the difference is small (arbitrarily close to 0). Either type of probability function would agree with Sally’s confidence ranking, so they’re just as accurate as one another. But the numbers are irrelevant. The reason the SCEC cannot give us any explanation of cardinality in this case is that one of the key background assumptions isn’t satisfied.

Recall that explaining cardinal information in the case of length requires an important existential assumption—roughly, that there are enough length-duplicates of objects of varying lengths for us to ‘add’ together. The analogous requirement is not satisfied in Example 1: for any P such that \(\omega _1\in \,P\) or \(\omega _2\in \,P\), there are no appropriate ‘confidence-duplicates’ of P—and so there’s not enough propositions around to ‘add’. And yet it certainly seems conceivable that Sally could be almost omniscient and believe one proposition much more than another. It’s not like, by virtue of knowing almost everything there is to know, Sally loses the ability to believe one thing much more than another.

So if the situation described in Example 1 is conceivable, then we’ve found an initial intuitive problem for comparativism. Just imagine that Sally has a friend Bob with the same confidence ranking, but who’s only a little more confident that the actual world is \(\omega _1\) than that it’s \(\omega _2\). Since it’s conceptually possible for Sally and Bob to have the very same confidence ranking but distinct doxastic states, it follows that there might be more to an agent’s beliefs than their comparative beliefs. Moreover, the missing information is cardinal information, which due to the lack of appropriate ‘confidence-duplicates’ in \({\mathbf {B}}\) cannot be explained by the SCEC.

There’s a few ways one might respond to this example. One might agree that the described situation is possible, and that the facts about comparative beliefs alone aren’t always enough to ground the facts about partial beliefs. Thus one might shift to a weaker view according to which the facts about Sally’s partial beliefs hold in virtue of the facts about her comparative beliefs plus some further qualitative phenomenon. I’ll come back to this idea in Sect. 5, since it’s an important response to several of the following examples as well. Until then, we focus on the stricter version of comparativism.

Another obvious response would be to deny that there’s any strong link between conceivability and possibility. Comparativism is a supervenience thesis, and if conceivability doesn’t entail possibility (at least in this case) then there’s simply no problem to worry about. Well, if that’s the route you want to take, then so be it—here’s not the place to decide fundamental questions of philosophical methodology, and I’ll leave it for comparativists to provide well-motivated reasons to reject any conceivability-possibility link here.

There is a third response, however, which I think is more interesting: perhaps the intuitive sense that there is cardinal information in this case lacks an appropriate theoretical foundation, and for that reason might be rightly ignored. As Fine notes,

... from the viewpoint of the theory of measurement it is only reasonable to insist upon an additive scale (probability) for uncertainty if this numerical relationship [i.e. \(pr (P\cup \,Q) = pr (P)+ pr (Q)\) for disjoint P and Q] reflects an underlying empirical relationship between uncertainties. (1973, p. 24)

That is: claims relating to the presence of cardinal information in our degrees of belief need to be founded appropriately in theory, and according to comparativism that foundation just is the structure of an agent’s confidence ranking. Without an appropriate theoretical foundation, who’s to say that any intuitions we have regarding this and similar examples aren’t just an illusion brought about by a lack of awareness regarding how probability functions actually serve to measure strengths of belief—similar to how some people might mistakenly think it makes sense to say that if it’s 30 °C during the day and 15 °C in the evening, then it’s twice as hot during the day as it is in the evening.

This is an important response, and I suspect it’s one that many comparativists will reach for. To provide evidence that any anti-comparativist intuitions regarding Example 1 aren’t a mere illusion, then, we need a theoretical justification for positing more structure than can be captured by a confidence ranking.

3.2 The SCEC and Decision Theory

Towards that end, consider the following example. Say that a non-empty proposition P is an atom for Sally just in case she has no beliefs regarding any non-empty propositions stronger than P. Furthermore, say that \({\mathbf {B}}\) is atomic just in case every non-empty proposition in \({\mathbf {B}}\) is identical to the union of some collection of atoms. Now consider:

Example 2

At \(t_1\), Sally has probabilistically representable beliefs with respect to an atomic algebra \({\mathbf {B}}\), with atoms \(A_1,A_2,A_3\). Her confidence ranking includes \((A_1\cup {A_2})\,\sim \,A_3\,\succ \,A_2\,\succ \,A_1\). Furthermore, Sally has more than twice as much confidence in \(A_2\) as in \(A_1\). At \(t_2\), she changes her beliefs slightly: while her comparative beliefs remain the same as they were at \(t_1\), she’s now just a little more confident regarding \(A_2\) than she is regarding \(A_1\).

Sally’s situation should again be conceivable. In pictorial form, where the size of the boxes represent the relevant strengths of belief,

figure b

The probabilistic comparativists cannot agree that Sally’s beliefs have changed; instead, they have to say that what seems here like a conceivable difference between Sally at \(t_1\) and at \(t_2\) is in fact impossible. But we can go beyond mere conceivability intuitions in this case, since we can have theoretically well-motivated reasons for saying that Sally attaches more confidence to \(A_1\) at \(t_1\) than she does at \(t_2\).

A probability \(pr\) agrees with Sally’s confidence ranking at either time iff

$$\begin{aligned} pr (A_1)+ pr (A_2)\,=\, pr (A_3)\,=\,0.5\,>\, pr (A_2)\,>\,0.25\,>\, pr (A_1)\,>\,0 \end{aligned}$$

There’s no shortage of probability functions satisfying this condition, and each predicts a different set of preferences when it’s (1) taken to model a possible belief state, and (2) combined with any standard model of rational preference formation, e.g. orthodox expected utility theory.

Suppose for instance that at both \(t_1\) and \(t_2\), Sally faces choices which have the same decision-theoretic structure:

 

\({A_1}\)

\({A_2}\)

\({A_3}\)

Option \(\alpha\)

\(-2x\)

x

x

Option \(\beta\)

0

0

x

Now imagine that at \(t_1\) Sally strictly chooses \(\alpha\); and at \(t_2\) she makes the opposite choice. The expected utility of \(\alpha\) is greater than the expected utility of \(\beta\) just in case \(pr (A_2)>\frac{1}{3}\), so a natural explanation for the change is that at \(t_1\) Sally has more than twice the confidence in \(A_2\) as she does in \(A_1\), while at \(t_2\) she doesn’t. More generally, we could easily imagine that Sally’s dispositions to choose regarding an infinite range of possible choice situations consistently point towards her believing \(A_2\) more than twice as much as \(A_1\) at \(t_1\), and less than twice as much at \(t_2\). Sally prefers \(\delta\) to \(\gamma\) at \(t_1\), and \(\gamma\) to \(\delta\) at \(t_2\):

 

\({A_1}\)

\({A_2}\)

\({A_3}\)

Option \(\gamma\)

2x

\(-x\)

x

Option \(\delta\)

0

0

x

There’s a natural explanation for why Sally would have dispositions ‘as if’ she believed \(A_2\) more than twice as much as \(A_1\) at \(t_1\) and less than twice as much at \(t_2\)—because she does indeed believe \(A_2\) more than twice as much as \(A_1\) at \(t_1\) and less than twice as much at \(t_2\). That explanation is off-limits to comparativism.

The point of all this is that not only is there a conceivable cardinal difference between Sally’s beliefs at \(t_1\) and \(t_2\), but that such differences can do useful theoretical work. If Sally has more than twice the confidence in \(A_2\) as she does in \(A_1\), then the utilities of the outcomes at \(A_2\) get more than twice the weighting as the utilities of the outcomes at \(A_1\)—and since the difference in utility between x and 0 is half the difference between 0 and \(-2x\), the cardinal differences in the weightings for \(A_1\) and \(A_2\) between \(t_1\) and \(t_2\) makes a difference to her preferences. There’s a theoretical foundation for the intuition that Sally’s beliefs changed from \(t_1\) to \(t_2\), despite no change in her comparative beliefs.

Cardinal differences between ordinally-equivalent systems of partial belief can show up in how those beliefs interact with utilities in the generation of preferences according to the standard theory of decision making. Those differences cannot be explained by reference to Sally’s comparative beliefs with respect to unions of disjoint equally-ranked propositions. Indeed, the SCEC in principle cannot help us explain any cardinal relationships between the strengths of her beliefs for \(A_1\) and \(A_2\) (at either time), since again the required ‘confidence-duplicates’ are missing. So, in at least some cases, the SCEC fails to adequately recapture one of the key theoretical reasons for supposing that our beliefs carry cardinal information.Footnote 5

3.3 The SCEC and Radical Pyrrhonianism

Is the SCEC at least still successful in those special cases where there’s no shortage of ‘confidence-duplicates’, and subsequently the confidence ranking agrees with exactly one probability function? Well, no—I don’t think so. Consider:

Example 3

Although her confidence ranking agrees with exactly one probability function, Sally is not an ideal Bayesian agent. After reading a little too much radical Pyrrhonian literature, she insists it’s never rational to be fully certain of anything: one should always reserve some slight doubt (say, 1%) that even the most firm of logical truths might be false, and that any logical falsehood might be true. Moreover, her preferences consistently reflect her new-found commitments—for instance, she’d prefer being given \(\$90\) outright to the gamble \(\langle \$100\) if \(P\vee \lnot \,P\), \(-\$1000\) otherwise\(\rangle\), and she prefers \(\langle \$100\) if \(P\vee \lnot \,P\), \(\$100,000\) otherwise\(\rangle\) to being given \(\$1000\) outright.

Here’s one explanation of Sally’s preferences: where \(pr\) is the unique probability function that agrees with Sally’s confidence ranking, her partial beliefs are in fact modelled by the non-additive \(pyr\), where

$$\begin{aligned} pyr (P)=0.98\times pr (P)+0.01 \end{aligned}$$

Think of \(pyr\) as \(pr\) squished down by 1% on either side. How would this help to explain Sally’s preferences? Because those preferences make sense given \(pyr\).Footnote 6 We may want to say that Sally is epistemically irrational, since she fails to attach complete certainty to self-evident tautologies. But she is at least pragmatically rational enough to choose appropriately given her slightly misled beliefs.

Such an explanation is off-limits for comparativism: \(pr\) and \(pyr\) are ordinally equivalent, so comparativists are committed to saying that \(pr\) represents Sally’s beliefs iff \(pyr\) does. They might say that at most one of \(pr\) and \(pyr\) constitutes an adequate representation of her beliefs, but both must be accurate. And any agent with beliefs accurately represented by the probability function \(pr\) ought to prefer \(\langle \$100\) if \(P\vee \lnot \,P\), \(-\$1000\) otherwise\(\rangle\) to being given $90 outright, and prefer \(\$1000\) outright to \(\langle \$100\) if \(P\vee \lnot \,P\), \(\$100,000\) otherwise\(\rangle\). Consequently, Sally must—for some reason—have chosen irrationally given her beliefs. But that’s hardly satisfying. We have to posit some irrationality somewhere in order to explain Sally’s preferences. If there’s a doxastically relevant difference between \(pr\) and \(pyr\), then the former explanation leaves us with an agent that makes sense: it’s easy to imagine an agent so committed to radical Pyrrhonianism that they doubt even the most obvious logical truths. The latter explanation, on the other hand, leaves us with an agent whose preferences are bafflingly nonsensical given what she supposedly believes.

I should note that my argument here does not fall foul of the error discussed by Joyce (2015, pp. 418–9) in his defence against a closely related objection to comparativism—the error of re-scaling our model of Sally’s beliefs without making adjustments to how expected utilities are calculated, thus giving the misleading impression that \(pyr\) generates different predictions about Sally’s preferences when they’re plugged into the standard decision-theoretic model. I agree this would be an error—it would be an instance of the very same error underlying the transformation argument. We can all agree that the scale we use to measure belief is a matter of stipulation. It doesn’t really matter if we represent Sally’s partial beliefs on a 0-to-1 scale, a 0.01-to-0.99 scale, or a \(\sqrt{2}\)-to-\(\pi\) scale. Likewise it doesn’t really matter if we represent Sally’s partial beliefs using an additive scale or a multiplicative scale (and so on), so long as we make the appropriate adjustments elsewhere in our theories to accommodate those changes.

However, we get to say that \(pyr\) is a mere ‘re-scaling’ of \(pr\) only under the substantive assumption that whatever Sally believes least (most) of all, she believes to the least (greatest) extent possible. This kind of assumption is required for the SCEC’s construction of ratio information: since \(P\cup \varnothing =P\) for all P, \(\varnothing\) needs to be treated as the identity element on any additive representation of belief (i.e. \(pr(\varnothing )=0\)). Yet it’s not a mere matter of stipulation that having at most as much confidence in P as in anything else is the same doxastic state as being 0% confident that P, since—as Example 3 suggests—it’s not obvious that P’s sitting at the bottom of Sally’s confidence ranking plays the same functional or theoretical roles that 0% confidence is supposed to play. These are substantive claims, and they need to be established by argument.

3.4 The SCEC and Non-additive Beliefs

Perhaps the foregoing examples are too far-fetched. Not to worry: whenever \(\succsim\) agrees with a probability function \(pr\), there’s any number of order-preserving transformations of \(pr\) with values in the 0-to-1 range inclusive, and it would be implausible for comparativists to claim that there’s no theoretical basis for treating at least some of these as representing distinct doxastic states as \(pr\).

Consider the following non-additive order-preserving transformation of the probability function \(pr\):

$$\begin{aligned} cap (P)= {\left\{ \begin{array}{ll} \,\frac{3}{4}\times \, pr (P),&\,\,\text {whenever }0\le \, pr (P)<0.5\\ \,1-\big (\frac{3}{4}\times \, pr (\lnot \,P)\big ),&\,\,\text {whenever }1\ge \, pr (P)>0.5\\ \, pr (P)&\,\,\text {otherwise} \end{array}\right. } \end{aligned}$$

\(cap\) is a capacity. Specifically, a capacity like \(cap\) satisfies the usual Normalisation and Non-negativity conditions that any probability function must satisfy, but replaces the Additivity condition with the weaker:

$$\begin{aligned} \text {If }P\subseteq \,Q, \text { then } cap (P)\le \, cap (Q) \end{aligned}$$

Every probability function agrees with a confidence ranking, and every such confidence ranking agrees with numerous non-probabilistic capacities. And, importantly, these ordinally-equivalent representations of belief will each predict different patterns of preference on any of a range of natural generalisations of ordinary expected utility theory which make room for partial beliefs represented by capacities—e.g. (Schmeidler 1989; Sarin and Wakker 1992; Elliott 2017), or (Pruss forthcoming).

Thus, suppose that Sally’s confidence ranking agrees with exactly one probability function \(pr\), and that \(P_1\), \(P_2\), and \(P_3\) are equally ranked and pairwise disjoint, with \((P_1\cup \,P_2\cup \,P_3)\sim \Omega\). According to the SCEC, Sally will have half as much confidence in \(P_1\) as she does in \(P_2\cup \,P_3\), as represented by \(pr\):

$$\begin{aligned} pr (P_1)\,=\,\frac{1}{3},\quad pr (P_2\cup \,P_3)\,=\,\frac{2}{3} \end{aligned}$$

In this sense, \(pr\) represents the ‘additive’ structure of \(\succsim\) in relation to the union of disjoint propositions. But we could easily imagine preferences over appropriately structured choice situations which suggest that Sally has only \(\frac{1}{3}\) as much confidence in \(P_1\) as she does in \(P_2\cup \,P_3\), as captured by \(cap\):

$$\begin{aligned} cap (P_1)\,=\,\frac{1}{4},\quad cap (P_2\cup \,P_3)\,=\,\frac{3}{4} \end{aligned}$$

In this case, it’s the non-additive \(cap\) that does a better job of capturing the cardinal information implicit in how her beliefs relate to her preferences. The fact that \(\succsim\) has an ‘additive’ structure doesn’t entail that the best representation of Sally’s beliefs is a function that satisfies Additivity.

3.5 The SCEC and Irrational Rankings

Most capacities do not agree with any \(\succsim\) that agrees with a probability function, but nevertheless seem to do a perfectly good job of representing possible belief states. In particular, there are many capacities that agree with a confidence ranking that does not satisfy the Monotonicity condition that the SCEC crucially relies upon. For instance:

Example 4

Relative to Sally’s comparative beliefs, \(P_1,\dots ,P_{100}\) and \(Q_1,\dots ,Q_{101}\) are two sequences of pairwise disjoint propositions, where \(P_1\,\sim \,\dots \ ,\sim \,P_{100}\,\sim \,Q_1\,\sim \,\dots \ ,\sim \,Q_{101}\). However, due to an accounting error, Sally has as much confidence in \(P_1\cup \dots \ \cup \,P_{100}\) as in R, and as much confidence in R as in \(Q_1\cup \dots \ \cup \,Q_{101}\). Moreover, her preferences fit with what we’d expect if Sally has beliefs represented by the capacity \(irr\), which is additive with respect to the \(P_i\) but sub-additive by \(\frac{100}{101}\%\) for significantly large unions of the \(Q_i\).

A confidence ranking like this is no doubt conceptually possible. However, it’s easy to see that the union of disjoint propositions cannot behave like doxastic analogue of addition for any confidence ranking that agrees with \(irr\). For suppose that it did. Then Sally would believe R 100 times as much as \(P_1\), and she would believe R 101 times as much as \(Q_1\). But she also believes \(P_1\) as much as she believes \(Q_1\), and she obviously doesn’t believe R 1% more than itself. Hence, \(\succsim\) doesn’t have the right structure for the SCEC to apply.

Should this mean that Sally doesn’t have partial beliefs with respect to the \(P_i\) and \(Q_i\), or that those partial beliefs don’t carry any determinate and meaningful cardinal information? Why should we say that, when we can plug the capacity \(irr\) into any decision model that makes room for non-additive capacities and have the determinate cardinal information represented by that function do useful theoretical work? And if that information is meaningful, then its being so cannot be explained by the SCEC.

The more general challenge here, of course, is for comparativism to plausibly explain cardinal information in the face of ordinary human irrationality. There is a substantial amount of work which suggests that ordinary agents’ comparative beliefs are not probabilistically representable, and that they’ll often fail to satisfy the Monotonicity condition. There are many examples to draw from, but one of the most striking—and robust—is the conjunction fallacy:Footnote 7

P.:

Linda is a bank teller

Q.:

Linda is active in the feminist movement

\(P\cap \,Q\).:

Linda is a bank teller and active in the feminist movement

A large proportion of people—the exact percentage doesn’t matter—when they are asked to judge the relative probabilities of these propositions, seem to commit the single conjunction fallacy: they will say that they judge \(P\cap \,Q\) to be more probable than one of the other propositions (usually P).

The propositions P and Q in this example are not disjoint, but it doesn’t take much work to show that instances of the conjunction fallacy run up against the general proposal that the confidence Sally has in the union of disjoint propositions will be the sum of the confidences she has for those propositions individually. For in this case, \(P=(P\cap \,Q)\cup (P\cap \lnot \,Q)\), and \((P\cap \,Q)\cap (P\cap \lnot \,Q)=\varnothing\). Hence, for the analogy with the measurement of length to hold, we would need:

$$\begin{aligned} pr (P)\,=\,\,P\,\big ((P\cap \,Q)\cup (P\cap \lnot \,Q)\big )\,=\, pr (P\cap \,Q)+ pr (P\cap \lnot \,Q) \end{aligned}$$

However, \((P\cap \,Q)\succ \,P\), so if \(pr\) agrees with \(\succsim\), then \(pr (P\cap \,Q)> pr (P)\)—which would require that \(pr (P\cap \lnot \,Q)<0\), a nonsensical assignment on anyone’s view.

So we have good evidence that ordinary agents’ confidence rankings sometimes falsify Monotonicity, and consequently don’t have the appropriate structure to support the measurement analogy. Yet this surely doesn’t prevent such agents from having beliefs with meaningful cardinal information. A person who falls foul of the conjunction fallacy might still, for example, be a little more confident that \(P\cap \,Q\) than that P; while another might be much more confident that \(P\cap \,Q\) than that P. I take it that this is intuitively obvious—or, at least, that if we’re going to say otherwise, then compelling reasons would be required. The people who commit the conjunction fallacy don’t suddenly lose their capacity to believe the relevant propositions with meaningful cardinal differences in strength. Just ask them to bet on P and on \(P\cap \,Q\), and see those differences at work.

4 Objections and Replies

I’ve discussed the limits of the SCEC when the requisite structural assumptions are satisfied but the existential assumptions are not (Sects. 3.13.2); when the existential and structural conditions are both satisfied (Sects. 3.33.4); and when the structural conditions are not satisfied (Sect. 3.5). In some of these cases the SCEC fails to supply any meaningful cardinal information at all, while in others it supplies the wrong cardinal information.

So it seems that comparativism still lacks an adequate response to the cardinality challenge after all. The reason, I think, is that cardinal information just isn’t grounded in the structure of \(\succsim\) in relation to the union of disjoint propositions, as proposed by the SCEC. The analogy with the measurement of length is misleading. Partial beliefs are more than just a ‘mathematical construct’ for representing comparative beliefs, and cardinal information is better explained by reference to the functional role partial beliefs play in relation to preferences.

In this section, I’ll consider some potential objections and responses to the arguments of the previous section. Following that, I’ll discuss the view according to which the facts about partial belief supervene on the facts about comparative beliefs plus some other qualitative phenomenon.

4.1 Impossible Worlds

My argument in Sect. 3.5, that the conjunction fallacy is inconsistent with any confidence ranking that supports the analogy with the measurement of length, relies on the assumption that \(\Omega\) is a set of logically possible worlds. This assumption is used to guarantee that \((P\cap \,Q)\cap (P\cap \lnot \,Q)=\varnothing\) and \(P=(P\cap \,Q)\cup (P\cap \lnot \,Q)\).

However, if \(\Omega\) were to include enough impossible worlds of the right kind, then the argument would be invalid. More generally, we know that any apparently non-probabilistic (complete) confidence ranking with respect to propositions drawn from one set of worlds \(\Omega\) can always be re-expressed as a probabilistically representable confidence ranking where the probability function in question is defined for propositions drawn from a larger space of worlds, \(\Omega ^+\). See, for example, Cozic (2006) and Halpern and Pucella (2011). Elliott (2019a) shows that for the fully general result to hold, \(\Omega ^+\) needs to include not only logically impossible worlds, but also ‘incomplete’ worlds—i.e., worlds that leave some matters unspecified. So perhaps comparativists might maintain the measurement analogy if they let propositions be characterised as sets of possible and impossible/incomplete worlds.

I have raised this objection only to acknowledge it, and then set it aside. In other works, I have argued that letting \(\Omega\) include logically impossible worlds creates special problems in the probabilistic context (Elliott 2019a), and will in fact severely undermine the comparativist’s analogy with the measurement of length rather than support it (Elliott 2020). I won’t repeat those arguments here, but let me add two further points.

First, impossible worlds aren’t going to help with any of the problems discussed in Sects. 3.13.4, where \(\succsim\) is probabilistically representable. Second, and more importantly, if saving the measurement analogy from the threat of irrational confidence rankings requires the use of logically impossible and incomplete worlds, then that is a significant theoretical cost for the view. There are general reasons to worry about the use of sets of possible and impossible/incomplete worlds as models of belief content (e.g. Bjerring 2014; Bjerring and Schwarz 2017), so comparativists might not want to put all their eggs into this one basket.

4.2 Approximating Probabilistic Agreement

While Sally’s confidence ranking doesn’t support the measurement analogy exactly in Example 4, it at least approximates one that does. So maybe we could use the cardinal information extracted from the probabilistically representable ranking or rankings that \(\succsim\) most closely approximates to explain how Sally’s beliefs still manage to support some indeterminate form of cardinal information? (cf. Stefánsson 2017, p. 576, fn. 6.)

Now, if the goal were merely to explain how Sally’s beliefs contain some cardinal information or other, whether determinate or indeterminate, then something like this kind of response to Example 4 might suffice. But the point of Example 4 was not that comparativism has no way of making sense of cardinal information in some form whenever \(\succsim\) doesn’t satisfy Monotonicity. Of course there are many ways one might preserve some semblance of cardinality in these cases. That’s obvious—what’s not obvious is whether this will be enough. My argument was that the SCEC cannot get us the right cardinal information in cases like this—specifically, cases where (1) Sally’s confidence ranking does not agree with any additive function, yet (2) at least one non-additive function seems to do a good job of representing her partial beliefs as reflected by fit with the facts about her preferences.

Sally’s confidence ranking in Example 4 will indeed approximate a ranking that agrees with some probability function; for instance,

$$\begin{aligned} pr (P_1\cup \dots \ \cup \,P_{100})\,=\, pr (Q_1\cup \dots \ \cup \,Q_{101})- pr (Q_1) \end{aligned}$$

And \(pr\) will in turn approximate \(irr\). But—and this is the important point—it won’t be \(irr\). It’s not implausible to think that \(irr\) represents a possible system of beliefs with determinate cardinal information, especially inasmuch as that information might be reflected in the consequences it has for Sally’s preferences. According to \(irr\), Sally has the same confidence regarding \(Q_1\cup \dots \ \cup \,Q_{101}\) as she does regarding \(P_1\cup \dots \ \cup \,P_{100}\); according to \(pr\), she doesn’t. So \(pr\) misrepresents her beliefs, and moreover it generates the wrong predictions about her preferences. So how, exactly, is an approximation like pr going to help us get at the right cardinal information?

4.3 Disjunctivism

Next up is what we might call disjunctivism. The idea is this: if \(\succsim\) doesn’t satisfy the requisite structural and/or existential assumptions needed for the measurement analogy under the assumption that it’s the union of disjoint propositions that’s serving as the doxastic analogue of addition, then perhaps there will be an alternative operation we could appeal to in those cases instead. We thus have a disjunctive explanation of cardinality: we let the union of disjoint propositions be our doxastic analogue of addition whenever the right conditions obtain, and look elsewhere when they don’t.

Obviously, disjunctivism won’t be enough to solve any of the problems raised in Sects. 3.13.4, which involve ordinally-equivalent pairs of representations. But more importantly, disjunctivism undermines one of comparativism’s main selling points. What makes the SCEC compelling—to the extent that it is—is that it’s straightforwardly analogous to the well-established explanations of cardinal information that we find in the case of basic physical quantities like length or mass. But there’s nothing at all like disjunctivism for any quantities in the sciences—i.e., where we sometimes appeal to one operation as the real-world analogue of addition, and sometimes appeal to another, depending on whatever works in the moment. The reason for this is obvious: it would make the meaning of cardinal information for that quantity too unstable.

The same applies to comparative beliefs. If different operations on propositions are supposed to explain cardinality for different agents, with the choice of each operation being contingent on what’s appropriate for that agent’s idiosyncratic confidence ranking, then both interpersonal and intrapersonal comparisons of belief would become quite useless in general. Before we could know what it means for Sally to believe P twice as much as Q, we would first have to take into account her entire confidence ranking, work out what the relevant operation should be, and only then give some doxastic meaning to the statement. Without knowledge of the overall structure of her confidence ranking, then, such a claim would only tell us:

  1. 1.

    \(P\succ \,Q\)

  2. 2.

    There’s some binary operation \(\circ\) on \({\mathbf {B}}\) that shares certain structural characteristics with \(+\) relative to \(\succsim\) such that for \(Q',Q''\) where \(Q\sim \,Q'\sim \,Q''\), \(Q'\circ {Q''}=P\)

The latter is utterly uninformative, and the former we don’t need cardinal information to express! Worse still, there’s no guarantee that the cardinal information we get out will track the most obvious implications of believing P twice as much as Q (e.g. being willing to bet twice as much on the former as on the latter). Disjunctivism is a non-starter.

4.4 ‘I Only Want to Model Ideally Rational Agents’

A final response to examples that involve non-ideal agents is to limit the intended explanatory scope of comparativism. The basic idea behind this response is that we can (at least for now) safely ignore irrational agents for the purposes of current philosophical theorising. There seem to be two versions of this thought: first, that we can ignore non-ideal agents because what matters for most contemporary philosophical purposes is that we have an explanation of cardinality for ideally rational agents; or second, that at this stage it’s perfectly reasonable to limit our theories to cases of ideal rationality where we can expect matters to be simpler and more manageable, and de-idealise at some point later on once we’ve got a handle on the easier cases. In support of both versions of the response, though, it’s noted that the SCEC does seem to work well for ideally rational agents—at least when there are sufficiently many ‘confidence-duplicates’ to guarantee unique probabilistic agreement.

Probabilism tells us that an ideally rational agent will have at least n times as much confidence in P as in Q if she has at least as much confidence in P as she does in \(R_1\cup \,\dots \ \cup \,R_n\), where the \(R_1,\dots ,R_n\) are disjoint and \(R_1\sim \,\dots \ \sim \,R_n\sim \,Q\). So probabilism predicts that the SCEC will generate the right results for ideally rational agents. But probabilism is common ground for most comparativists and non-comparativists alike. So the question is whether this fact reflects some interesting explanatory relationship between the meaning of ‘n times as much confidence’ and confidence rankings over disjoint unions, or whether it’s just a consequence of the claim that a rational system of partial beliefs ought to be representable by a function that satisfies Additivity.

If it does reflect an interesting explanatory relationship, then presumably that same relationship should also hold for non-ideal agents. We don’t want to have a disjunctivist explanation, with one kind of theory for the ideal agent and a wholly separate theory for the non-ideal agent. Moreover, it would be unreasonable to say that Sally doesn’t have partial beliefs encoding interesting cardinal information just because she isn’t ideally rational. I have partial beliefs with meaningful cardinal information, and I’m far from ideally rational. So, comparativists should be able to show at least that the SCEC or something much like it is plausibly generalisable. For if there doesn’t seem much hope for generalising the SCEC to non-ideal agents, then we’ve got good reasons to think that the explanation is false—even in the case of ideally rational agents.

In Sect. 3.5, I’ve discussed cases where \(\succsim\) fails to satisfy Monotonicity. In (Elliott 2020) I’ve shown that it’s possible to generalise the SCEC to some degree, such that \(\succsim\) need only satisfy a weaker condition R-Coherence. However it’s also shown in that work that R-Coherence is a minimal condition on any (non-disjunctive) explanation of cardinality that preserves the basic structure of the SCEC in cases where \(\succsim\) is uniquely probabilistically representable. Both Example 4 and the case of the conjunction fallacy involve confidence rankings that fail to satisfy R-Coherence under very natural assumptions. The upshot is that the strategy of the SCEC seems to essentially require that our comparative beliefs will satisfy quite strong and empirically dubious conditions.

It seems, then, that the prospects for generalising the SCEC to non-ideal agents are not particularly strong. Or at the very least: constructing a plausible explanation of cardinal information that extends to both ideal and non-ideal, potentially irrational agents remains a serious challenge for comparativism.

5 Supplemented Comparativism

Let supplemented comparativism denote the view that partial beliefs supervene on comparative beliefs plus something else (whatever that may be, so long as it doesn’t trivialise the whole affair). Some obvious possibilities might be:

  • Judgement-supplemented comparativism the facts about Sally’s beliefs are determined by her comparative beliefs and her qualitative judgements regarding probabilistic dependence and/or evidential relationships; for instance, that P is evidence for Q. (We’ll call these ‘evidential-dependence judgements’.)Footnote 8

  • Evidence-supplemented comparativism the facts about Sally’s beliefs are determined by her comparative beliefs and her history of evidence.

  • Preference-supplemented comparativism the facts about Sally’s beliefs are determined by her comparative beliefs plus facts relating to her preferences.Footnote 9

Will some version of supplemented comparativism fare better in response to the above problems than non-supplemented comparativism?

I cannot discuss every possible style of supplemented comparativism, so allow me instead to make a few general points on the matter. First, it’s important to be clear what the problems are supposed to be. In particular, I want to distinguish between the following two challenges for comparativism:

  1. 1.

    Granularity there seem to be meaningfully distinct systems of partial belief that correspond to the same system of comparative belief

  2. 2.

    Cardinality there seems to be meaningful cardinal information in some systems of partial belief that the SCEC cannot explain

Now it’s obvious that some form of supplemented comparativism will be better placed to deal with the granularity challenge. Trivially, if \(\succsim\) doesn’t contain enough information to determine the facts about her beliefs, then \(\succsim\) plus something else might. That’s not very interesting, and without a detailed theory to play with it’s hard to say much more on the matter. But the target of my arguments throughout has been the SCEC, and supplemented comparativism isn’t going to help save the SCEC from those arguments.

Consider judgement-supplemented comparativism, and recall Example 1: Sally is almost omniscient, but she has much more confidence that the actual world is \(\omega _1\) than that it’s \(\omega _2\). Now imagine that Sally’s evidential-dependence judgements are such that the uniquely correct probabilistic representation \(pr\) of her beliefs must be such that \(pr (\{\omega _1\})\gg pr (\{\omega _2\})\). Sally’s friend Bob has the same confidence ranking, but his dependence judgements are such that his uniquely best probabilistic representation \(pr '\) is such that \(pr '(\{\omega _1\})\approx pr '(\{\omega _2\})\). Great: judgement-supplemented comparativism has managed to make distinctions between belief states where ordinary non-supplemented comparativism could not. Of course it can—it has more resources to play with.

But what explains the sense in which Sally has much more confidence that the actual world is \(\omega _1\), while Bob doesn’t? The fact that \(pr (\{\omega _1\})\gg pr (\{\omega _2\})\) and \(pr '(\{\omega _1\})\approx pr '(\{\omega _2\})\) does not constitute an explanation, no matter how unique \(pr\) and \(pr '\) happen to be relative to Sally’s and Bob’s confidence rankings plus evidential-dependence judgements. The reason should by now be clear: the numbers are irrelevant. The mere fact that judgement-supplemented comparativism can distinguish between ordinally-equivalent representations of partial belief does not thereby entail that it has a satisfactory explanation of the cardinal information those distinctive representations seem to capture. Either ‘much more confidence’ reflects in a natural way some underlying real-world relationship that’s independent of the numbers we use to represent it, or it’s meaningless. So what is the real-world relationship that explains the cardinal information apparently present in Sally’s (or Bob’s) beliefs?

Obviously, the judgement-supplemented comparativist cannot make appeal to the SCEC, because in cases of almost omniscience there aren’t enough disjoint propositions at varying ranks around to ‘add’. The same applies for Example 2: no explanation in terms of the structure of \(\succsim\) can account for the differences in cardinal information between Sally’s beliefs at \(t_1\) and \(t_2\), simply because there are no differences in Sally’s comparative beliefs at \(t_1\) and at \(t_2\). For the same reason, no such explanation can account for the differences in cardinal information that seem to be represented in the differences between \(pr\), \(pyr\), and \(cap\) (from Sects. 3.33.4), since these are ordinally equivalent.

Judgement-supplemented comparativism owes us a compelling and general explanation of cardinal information just as much as non-supplemented comparativism does. I don’t know what such an explanation would look like, and frankly I’m doubtful that the judgement-supplemented comparativist will be able to come up with one that’s as intuitively compelling and consonant with standard methodologies as the SCEC was supposed to be.Footnote 10 But that’s neither here nor there. The main target of my arguments has been the SCEC, and the SCEC is just as problematic for judgement-supplemented comparativism as it is for non-supplmented comparativism.

The very same points apply, for the very same reasons, to any other variety of X-supplemented comparativism you care to think of. The fundamental problem is that the SCEC appeals only to comparative beliefs over disjoint unions. Regardless of whatever else we throw into the supervenience base for partial beliefs, the SCEC cannot account for differences in cardinal information represented in distinct but ordinally-equivalent representations of partial belief; nor can it account for cardinality in cases where the comparative beliefs lack the requisite structure. No ‘further fact’ view is going to magically make the SCEC apply even when Sally’s confidence ranking fails to satisfy Monotonicity, or when there aren’t enough ‘confidence-duplicates’ around to ‘add’. A different style of explanation is needed.

If the only takeaway messages of this paper are (a) the facts about partial beliefs carry more information than the facts about comparative beliefs, and moreover (b) the SCEC is inadequate, then I’ll be happy. The central challenge that I’ve been pushing is that comparativism currently lacks an adequate explanation of cardinal information. Until such time as supplemented comparativism offers us something new, that challenge extends to it as well.

6 Conclusion

Koopman was right about this at least: the numbers we use to refer to and reason about strengths of beliefs are not essential to them. Nobody thinks that there are numbers literally in the head—that numerical strengths of beliefs are somehow metaphysically sui generis and we just have to treat their ratios and intervals as intrinsically meaningful. All parties to this debate agree that the numbers are just a way of representing strengths of belief, while their ratios and intervals must in addition represent some closely related and fundamentally non-numerical psychological phenomenon by virtue of some abstract structure that phenomenon shares with the relevant numerical operations and relations. The hard part is saying what that structure could be.

Whatever the right account is, though, I think it’s unlikely to be found in the meagre resources allowed by (non-supplemented) comparativism. I doubt that the standard comparativist explanation of cardinality correctly locates the actual qualitative phenomena that underlies how our partial beliefs come to carry meaningful cardinal information. The purported analogy between the measurement of beliefs and the measurement of length is deeply misleading. A better explanation is needed.