1 Introduction

Suppose each individual member of some group (expert panel, court, jury etc.) assigns probabilities to some events. How can these individual probability assignments be aggregated into a collective probability assignment? Classically, this problem has been modelled as the aggregation of probability functions, which are defined on some \(\sigma \)-algebra of events, a set of events that is closed under negation and countable disjunction (and thereby also under countable conjunction). Each individual submits a probability function on the given \(\sigma \)-algebra, and these probability functions are then aggregated into a single collective probability function. One of the best-known solutions to this aggregation problem is linear pooling, where the collective probability function is a linear average of the individual probability functions. Linear pooling has several salient properties. First, if all individuals unanimously assign probability 1 (or probability 0) to some event, this probability assignment is preserved collectively (‘consensus preservation’). Second, the collective probability for each event depends only on individual probabilities for that event (‘event-wise independence’). Third, all events are treated equally: the pattern of dependence between individual and collective probability assignments is the same for all events (‘neutrality’).

In many practical applications, however, not all events are equal. In particular, the events in a \(\sigma \)-algebra may fall into two categories (whose boundaries may be drawn in different ways). On the one hand, there are events that correspond to intuitively basic propositions, such as ‘it will rain’, ‘it will be humid’, or ‘atmospheric \(\hbox {CO}_{2}\) causes global warming’. On the other hand, there are events that are intuitively non-basic. These can be viewed as combinations of basic events, for instance via disjunction (union) of basic events, conjunction (intersection), or negation (complementation). It is not obvious that when we aggregate probabilities, basic and non-basic events should be treated alike.

For a start, we may conceptualize basic and non-basic events differently, in analogy to the distinction between atomic and composite propositions in logic (the latter being logical combinations of the former). Second, the way we assign probabilities to non-basic events is likely to differ from the way we assign probabilities to basic events. When we assign a probability to a conjunction or disjunction, this typically presupposes the assignment of probabilities to the underlying conjuncts or disjuncts. For example, the obvious way to assign a probability to the event ‘rain or heat’ is to ask what the probability of rain is, what the probability of heat is, and whether the two are correlated.Footnote 1 If this is right, the natural method of making probabilistic judgments is to consider basic events first and to consider non-basic events next. Basic events serve as ‘premises’: we first assign probabilities to them, and then let these probability assignments constrain our probability assignments for other, non-basic events.

In this paper, we propose an approach to probability aggregation that captures this idea: the premise-based approach. Under this approach, the group first assigns collective probabilities to all basic events (the ‘premises’) by aggregating the individiduals’ probabilities for them; and then it assigns probabilities to all other events, constrained by the probabilities of the basic events. If the basic events are ‘rain’ and ‘heat’, then, in a first step, the collective probabilities for these two events are determined by aggregating the individual probabilities for them. In a second step, the collective probabilities for all other events are assigned. For example, the collective probability of ‘rain and heat’ might be defined as a suitable function of the collective probability of ‘rain’, the collective probability of ‘heat’, and an estimated rain/heat-correlation coefficient, which could be the result of aggregating the rain/heat-correlation coefficients encoded in the individual probability functions.

This proposal can be expressed more precisely by a single axiom, which does not require the (inessential) sequential implementation just sketched, but focuses on a core informational restriction: the collective probability of any ‘premise’ (basic event) should depend solely on the individual probabilities for this premise, not on individual probabilities for other events. We call this axiom independence on premises. Our axiomatic analysis of premise-based aggregation is inspired by binary judgment-aggregation theory, where the premise-based approach has also been characterized by a restricted independence axiom, for instance by Dietrich (2006), Mongin (2008), and Dietrich and Mongin (2010). For less formal discussions of premise-based aggregation, see Kornhauser and Sager (1986), Pettit (2001), List and Pettit (2002), and List (2006).

The way in which we have just motivated the premise-based approach and the corresponding axiom is bound to prompt some questions. In particular, although the distinction between ‘basic’ and ‘non-basic’ events is arguably not ad hoc, there is no purely formal criterion for drawing that distinction.Footnote 2 However, there is another, less controversial motivation for the premise-based approach. Our central axiom—independence on premises—privileges particular events, called the ‘premises’. We have so far interpreted these in a very specific way, taking them to correspond to basic events and to constitute the premises in an individual’s probability-assignment process. But we can give up this interpretation and define a ‘premise’ simply as an event for which it is desirable that the collective probability depend solely on the event-specific individual probabilities. If ‘premises’ are defined like this, then our axiom—independence on premises—is justified by definition (though of course we can no longer offer any guidance as to which events should count as premises).Footnote 3

We show that premise-based opinion pooling imposes significant restrictions on how the collective probabilities of the premises can be determined. At the same time, these restrictions are not undesirable; they do not lead to ‘undemocratic’ or ‘degenerate’ forms of opinion pooling. Specifically, given certain logical connections between the premises, independence on premises, together with a unanimity-preservation requirement, implies that the collective probability for each premise is a (possibly weighted) linear average of the individual probabilities for that premise, where the vector of weights across different individuals is the same for each premise. We present several variants of this result, which differ in the nature of the unanimity-preservation requirement and in the kinds of connections that are assumed to hold between premises. In some variants, we do not obtain the ‘linearity’ conclusion, but only a weaker ‘neutrality’ conclusion: the collective probability for each premise must be a (possibly non-linear) function of the individual probabilities for that premise, where this function is the same for each premise. These results are structurally similar to, but interpretively different from those in our companion paper (Dietrich and List 2017), to which we shall refer as ‘Part I’. Furthermore, our results stand in contrast with existing results on the premise-based approach in binary judgment aggregation. When judgments are binary, independence on premises leads to dictatorial aggregation under analogous conditions (see especially Dietrich and Mongin 2010).

Our results apply regardless of which events are deemed to serve as premises. In the extreme case in which all events count as premises, the requirement of independence on premises reduces to the familiar event-wise independence axiom (sometimes called the strong setwise function property), and our results reduce to a classic characterization of linear pooling (see Aczél and Wagner 1980; McConway 1981; see also Wagner 1982, 1985; Aczél  et al. 1984; Genest 1984a; Mongin 1995; Chambers 2007).Footnote 4

2 The framework

We consider a group of \(n\ge 2\) individuals, labelled \(i=1,\ldots ,n\), who have to assign collective probabilities to some events.

The agenda: a \(\sigma \)-algebra of events We consider a non-empty set \(\Omega \) of possible worlds (or states). An event is a subset A of \(\Omega \); its complement (‘negation’) is denoted \(A^{c}:=\Omega \backslash A\). The set of events to which probabilities are assigned is called the agenda. We assume that it is a \(\sigma \)-algebra, \(\Sigma \subseteq 2^{\Omega }\), i.e., a set of events that is closed under complementation and countable union (and by implication also countable intersection). The simplest non-trivial example of a \(\sigma \)-algebra is of the form \(\Sigma =\{A,A^{c},\Omega ,\varnothing \}\), where \(\varnothing \subsetneq A\subsetneq \Omega \). Another example is the set \(2^{\Omega }\) of all events; this is a commonly studied \(\sigma \)-algebra when \(\Omega \) is finite or countably infinite. A third example is the \(\sigma \)-algebra of Borel-measurable sets when \(\Omega ={\mathbb {R}}\).

An example Let us give an example similar to the lead example in Part I, except that we now take the agenda to be a \(\sigma \)-algebra. Let the set \(\Omega \) of possible worlds be the set of vectors \(\{0,1\}^{3}\backslash \{(1,1,0)\}\) with the following interpretation. The first component of each vector indicates whether atmospheric \(\hbox {CO}_{2}\) is above some threshold (1 = ‘yes’ and 0 = ‘no’), the second component indicates whether there is a mechanism to the effect that if atmospheric \(\hbox {CO}_{2}\) is above that threshold, then Arctic summers are ice-free, and the third component indicates whether Arctic summers are ice-free. The triple (1, 1, 0) is excluded from \(\Omega \) because it would represent an inconsistent combination of characteristics. Now the agenda is \(\Sigma =2^{\Omega }\).

The opinions: probability functions Opinions are represented by probability functions on \(\Sigma \). Formally, a probability function on \(\Sigma \) is a function \(P:\Sigma \rightarrow [0,1]\) such that \(P(\Omega )=1\) and P is \(\sigma \)-additive (i.e., \(P(A_{1}\cup A_{2}\cup \ldots )=P(A_{1})+P(A_{2})+\cdots \) for every sequence of pairwise disjoint events \(A_{1},A_{2},\ldots \in \Sigma \)). We write \({\mathcal {P}} _{\Sigma }\) to denote the set of all probability functions on \(\Sigma \).

Opinion pooling Given the agenda \(\Sigma \), a combination of probability functions across the individuals, \((P_{1},\ldots ,P_{n})\), is called a profile (of probability functions). An (opinion) pooling function is a function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\), which assigns to each profile \((P_{1},\ldots ,P_{n})\) a collective probability function \(P=F(P_{1},\ldots ,P_{n})\), also denoted \(P_{P_{1},\ldots ,P_{n}}\). An example of \(P_{P_{1},\ldots ,P_{n}}\) is the arithmetic average \(\frac{1}{n}P_{1}+\cdots +\frac{1}{n}P_{n}\).

Some logical terminology We conclude this section with some further terminology. Events distinct from \(\varnothing \) and \(\Omega \) are called contingent. A set S of events is consistent if its intersection \(\cap _{A\in S}A\) is non-empty, and inconsistent otherwise; S entails an event B if the intersection of S is included in B (i.e., \(\cap _{A\in S}A\subseteq B\)).

3 Axiomatic requirements on premise-based opinion pooling

We now introduce the axioms that we require a premise-based opinion pooling function to satisfy.

3.1 Independence on premises

Before we introduce our new axiom of independence on premises, let us recall the familiar requirement of (event-wise) independence. It requires that the collective probability for any event depend only on the individual probabilities for that event, independently of the probabilities of other events.

Independence For each event \(A\in \Sigma \), there exists a function \(D_{A}:[0,1]^{n}\rightarrow [0,1]\) (the local pooling criterion for A) such that, for all \(P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }\),

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)=D_{A}(P_{1}(A),\ldots ,P_{n}(A)). \end{aligned}$$

This requirement can be criticized—in the classical framework where the agenda is a \(\sigma \)-algebra—for being normatively unattractive. Typically only some of the events in the \(\sigma \)-algebra \(\Sigma \) correspond to intuitively basic propositions such as ‘the economy will grow’ or ‘atmospheric \(\hbox {CO}_{2}\) causes global warming’. Other events in \(\Sigma \) are combinations of basic events, such as ‘the economy will grow or atmospheric \(\hbox {CO}_{2}\) causes global warming’. The non-basic events can get enormously complicated: they can be conjunctions of (finitely or countably infinitely many) basic events, or disjunctions, or disjunctions of conjunctions, and so on. It seems natural to privilege the basic events over the other, more ‘artificial’ events by replacing the independence requirement with a restricted independence requirement that quantifies only over basic events. Indeed, it seems implausible to apply independence to composite events such as ‘the economy will grow or atmospheric \(\hbox {CO}_{2}\) causes global warming’, since this would prevent us from using the probabilities of each of the constituent events in determining the overall probability.

By restricting the independence requirement to basic events, we treat these as premises in the collective probability-assignment process, first aggregating individual probabilities for basic events and then letting the resulting collective probabilities constrain the collective probabilities of all other events. (The probabilities of the premises constrain those other probabilities because the probability assignments in their entirety must be probabilistically coherent.)

Formally, consider a sub-agenda of \(\Sigma \), denoted X, which we interpret as containing the basic events, called the premises. By a sub-agenda we mean a subset of \(\Sigma \) which is non-empty and closed under complementation (i.e., it forms an ‘agenda’ in the generalized sense discussed in Part I). We introduce the following axiom:

Independence on X (‘on premises’). For each \(A\in X\), there exists a function \(D_{A}:[0,1]^{n}\rightarrow [0,1]\) (the local pooling criterion for A) such that, for all \(P_{1},\ldots ,P_{n} \in {\mathcal {P}}_{\Sigma }\),

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)=D_{A}(P_{1}(A),\ldots ,P_{n}(A)). \end{aligned}$$

In the climate-change example of Sect. 2, the sub-agenda of premises might be defined as \(X=\{A_{1},A_{1}^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}\), where \(A_{1}\) is the event that atmospheric \(\hbox {CO}_{2}\) is above the critical threshold, \(A_{2}\) is the event that there is a mechanisms by which \(\hbox {CO}_{2}\) concentrations above the threshold cause ice-free Arctic summers, and \(A_{3}\) is the event of ice-free Arctic summers. Conjunctions such as \(A_{1}\cap A_{2}\) are not included in the set X of premises here. As a result, independence on X allows the collective probability for any such conjunction to depend not only on the experts’ probabilities for that conjunction, but also, for instance, on their probabilities for the underlying conjuncts (together with auxiliary assumptions about correlations between them).Footnote 5

We have explained why event-wise independence should not be required for non-basic events. But why should we require it for basic events (premises)? We offer three reasons:

  • First, if we accept the idea that an individual’s probabilistic belief about a given premise is not influenced by, but might influence, his or her beliefs about other events, then we may regard those other beliefs as either by-products of, or unrelated to, the individual’s belief about the premise in question. It then seems reasonable to treat those other beliefs as irrelevant to the question of what collective probability to assign to that premise. (More precisely, any beliefs about other events provide no relevant additional information once the individual’s belief about the premise is given.)

  • Second, the premise-based approach can be motivated by appealing to the idea of a ‘rational collective agent’ that forms its probabilistic beliefs by reasoning from premises to conclusions. This kind of collective reasoning can be implemented by first aggregating the probabilities for the premises and then letting these constrain the probabilities assigned to other events. In the case of binary judgment aggregation, Pettit (2001) has described this process as the ‘collectivization of reason’.

  • Third, as mentioned in the introduction, one might simply define the premises as the events for which it is desirable that the collective probabilities depend solely on the event-specific individual probabilities. This would render the requirement of independence on premises justified by definition.

3.2 Consensus preservation on premises

Informally, our second axiomatic requirement says that whenever there is unanimous agreement among the individuals about the probability of certain events, this agreement should be preserved collectively. We distinguish between different versions of this requirement. The most familiar one is the following:

Consensus preservation For all \(A\in \Sigma \) and all \(P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }\), if, for all i, \(P_{i}(A)=1\), then \(P_{P_{1},\ldots ,P_{n}}(A)=1\).Footnote 6

A second, less demanding version of the requirement is restricted to events in the sub-agenda X of premises.

Consensus preservation on X (‘on premises’) For all \(A\in X\) and all \(P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }\), if, for all i, \(P_{i}(A)=1\), then \(P_{P_{1},\ldots ,P_{n}}(A)=1\).

Restricting consensus preservation in this way may be plausible because a consensus on any event outside X may be considered less compelling than a consensus on a premise in X, for reasons similar to those for which we restricted event-wise independence to premises. A consensus on a non-basic event could be ‘spurious’ in the sense that there might not be any agreement on its basis (see Mongin 2005).Footnote 7

We also consider a third version of consensus preservation, which is still restricted to premises, but refers to conditional probabilities. It says that if all individuals assign a conditional probability of 1 to some premise given another, then this should be preserved collectively.Footnote 8

Conditional consensus preservation on X (‘on premises’) For all \(A,B\in X\) and all \(P_{1},\ldots ,P_{n}\in {\mathcal {P}} _{\Sigma }\), if, for all i, \(P_{i}(A|B)=1\) (provided \(P_{i}(B)\ne 0\)), then \(P_{P_{1},\ldots ,P_{n}}(A|B)=1\) (provided \(P_{P_{1},\ldots ,P_{n}}(B)\ne 0\)).

Conditional consensus preservation on X is equivalent to another requirement. This says that if all individuals agree that some premise implies another with probabilistic certainty (i.e., the probability of the first event occurring without the second is zero), then that agreement should be preserved collectively.

Implication preservation on X (‘on premises’) For all events \(A,B\in X\) and all \(P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }\), if, for all i, \(P_{i}(A\backslash B)=0\), then \(P_{P_{1},\ldots ,P_{n} }(A\backslash B)=0\).

The equivalence between conditional consensus preservation on X and implication preservation on X follows from the fact that the clause ‘\(P_{i}(A|B)=1\) (provided \(P_{i}(B)\ne 0\))’ is equivalent to ‘\(P_{i} (B\backslash A)=0\)’, and the clause ‘\(P_{P_{1},\ldots ,P_{n}}(A|B)=1\) (provided \(P_{P_{1},\ldots ,P_{n}}(B)\ne 0\))’ is equivalent to ‘\(P_{P_{1},\ldots ,P_{n} }(B\backslash A)=0\)’. Thus the statement of conditional consensus preservation on X can be reduced to that of implication preservation on X (except that the roles of A and B are swapped).

This equivalence also illuminates the relationship between conditional consensus preservation on X and consensus preservation on X, because the former, re-formulated as implication preservation on X, clearly implies the latter. Simply note that, in the statement of implication preservation on X, taking \(B=A^{c}\) yields \(P(A\backslash B)=P(A)\), so that a unanimous zero probability of any event A in X must be preserved, which is equivalent to consensus preservation on X.

In fact, conditional consensus preservation on X, when re-formulated as implication preservation on X, is also easily seen to be equivalent to a further unanimity-preservation requirement, which refers to unanimous assignments of probability 1 to a union of two events in X (just note that \(A\backslash B\) has probability 0 if and only if \(A^{c}\cup B\) has probability 1). This also shows that conditional consensus preservation on X is logically weaker than consensus preservation in its original form (on all of \(\Sigma \)), since it does not require preservation of unanimous assignments of probability 1 to intersections of two events in X, or unions or intersections of more than two events in X.

The following proposition summarizes the logical relationships between the different consensus-preservation requirements (in part (a)) and adds another simple but useful observation (in part (b)).

Proposition 1

  1. (a)

    For any sub-agenda X of \(\Sigma \) , conditional consensus preservation on X

    • implies consensus preservation on X;

    • is implied by (global) consensus preservation;

    • is equivalent to implication preservation on X, and to each of the following two requirements:

      $$\begin{aligned}&[\forall i \,\, P_{i}(A\cup B)=1]\Rightarrow P_{P_{1},\ldots ,P_{n}}(A\cup B)=1,\, \textit{for all }\,\,A,B\in X,\,\,P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma };\\&[\forall i\,\, P_{i}(A\cap B)=0]\Rightarrow P_{P_{1},\ldots ,P_{n}}(A\cap B)=0,\, \textit{for all }\,A,B\in X, \,\,P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }. \end{aligned}$$
  2. (b)

    For the maximal sub-agenda \(X=\Sigma \), all of these requirements are equivalent.

4 A class of applications

So far, all our examples of opinion pooling problems have involved events represented by propositions in natural language, such as ‘it will rain’. As argued in Part I, the assumption that the agenda is a \(\sigma \)-algebra is often unnatural in such cases. But there is a second class of applications, in which it is more natural to define the agenda as a \(\sigma \)-algebra \((\Sigma )\) and to restrict the independence requirement to some sub-agenda X. Suppose we wish to estimate the distribution of a real-valued or vector-valued variable, such as rainfall or the number of insurance claims in some period. Here, the set of worlds \(\Omega \) could be \({{\mathbb {R}}}\), \({\mathbb {Z}}\), \({{\mathbb {N}}}\), or \(\{0,1,\ldots ,m\}\), or it could be \({{\mathbb {R}}}^{k}\), \({\mathbb {Z}}^{k}\), \({{\mathbb {N}}}^{k}\), or \(\{0,1,\ldots ,m\}^{k}\) (for natural numbers m and k). In such cases, the focus on the \(\sigma \)-algebra of events is more realistic. First, we may need a full probability distribution on that \(\sigma \)-algebra. Second, individuals may be able to come up with such a probability distribution, because, in practice, they can do the following:

  • first choose some parametric class of probability functions (e.g., the class of Gaussian distributions if \(\Omega ={{\mathbb {R}}}\), Poisson distributions if \(\Omega ={\mathbb {N}}\), or binomial distributions if \(\Omega =\{0,1,\ldots ,m\}\));

  • then estimate the relevant parameter(s) of the distribution (e.g., the mean and standard deviation in the case of a Gaussian distribution).

Because the agenda in this kind of application (e.g., the \(\sigma \)-algebra of Borel sets over \({{\mathbb {R}}}\), or the power set of \({\mathbb {N}}\)) contains very complicated events, it would be implausible to require event-wise independent aggregation for all such events. For instance, suppose \(\Omega ={{\mathbb {R}}}\), and consider the event that a number’s distance to the nearest prime exceeds 37. It would seem artificial to determine the collective probability for that event without paying attention to the probabilities of other events. Here, the sub-agenda X on which event-wise independence is plausible is likely to be much smaller than the full \(\sigma \)-algebra \(\Sigma \).

Let us give a concrete example. Let \(\Sigma \) consist of the Borel-measurable subsets of \(\Omega ={\mathbb {R}}\). A natural sub-agenda of \(\Sigma \) is \(X=\cup _{\omega \in {\mathbb {R}}}\{(-\infty ,\omega ],(\omega ,\infty )\}\). If we require independence on X with a uniform decision criterion \(D=D_{A}\) (\(A\in X\)), where \(D(t_{1},\ldots ,t_{n})=\frac{1}{n}t_{1}+\cdots +\frac{1}{n}t_{n}\), we obtain a unique pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\), because the collective probabilities for X uniquely extend to a probability function on the entire \(\sigma \)-algebra \(\Sigma \). Alternatively, one might require independence on the smaller sub-agenda \(X=\cup _{\omega \in \{-1,+1\}}\{(-\infty ,\omega ],(\omega ,\infty )\}\), still with the same uniform decision criterion D. This under-determines the pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\), because probability assignments for X do not uniquely extend to all of \(\Sigma \). To fill this gap, one might define the collective probability function as the unique normal distribution which assigns the specified probabilities to \((-\infty ,-1]\) and \((-\infty ,+1]\), as determined by the decision criterion D.Footnote 9

Let us summarize how the present kinds of applications differ from the above-mentioned applications involving events represented by natural-language propositions such as ‘it will rain’ or ‘atmospheric \(\hbox {CO}_{2}\) causes global warming’:

  1. 1.

    \(\Omega \) is a subset of \({\mathbb {R}}\) or of a higher-dimensional Euclidean space \({\mathbb {R}}^{k}\), rather than a set of ‘possible worlds’ specified by natural-language descriptions;

  2. 2.

    it is often natural to arrive at a probability function by choosing a parametric family of such functions (such as the family of Gaussian distributions) and then specifying the relevant parameter(s), while this approach would seem ad hoc in the other kind of application;

  3. 3.

    in practice, we may be interested in a probability function on the entire \(\sigma \)-algebra (e.g., in order to compute the mean of the distribution and other moments), rather than just in the probabilities of specific events.

5 When is opinion pooling neutral on premises?

We now show that, if there are certain kinds of interconnections among the premises in X, any pooling function satisfying independence on X and consensus preservation in one of the senses introduced must be neutral on X. This means that the pattern of dependence between individual and collective probability assignments is the same for all premises. In the next section, we turn to the question of whether our axioms imply linear pooling on premises, over and above neutrality.

Formally, a pooling function for agenda \(\Sigma \) is neutral on \(X (\subseteq \Sigma )\) if there exists some function \(D:[0,1]^{n}\rightarrow [0,1]\)—the local pooling criterion for events in X—such that, for every profile \((P_{1},\ldots ,P_{n})\in {\mathcal {P}}_{\Sigma }^{n}\), the collective probability of any event A in X is given by

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)=D(P_{1}(A),\ldots ,P_{n}(A)). \end{aligned}$$

If \(X=\Sigma \), neutrality on X reduces to neutrality in the familiar global sense, briefly mentioned in the introduction.

Our first result uses the strongest consensus-preservation requirement we have introduced, namely ‘global’ consensus preservation (on all of \(\Sigma \)). Here, we obtain the neutrality conclusion as soon as the sub-agenda of premises satisfies a very mild condition: it is ‘non-nested’. We call a sub-agenda X nested if it has the form \(X=\{A,A^{c}:A\in X_{+}\}\) for some set of events \(X_{+}\) which is linearly ordered by set-inclusion, and non-nested otherwise. For instance, \(X=\{A,A^{c}\}\) is nested (take \(X_{+}:=\{A\}\)), as is \(X=\{A,A^{c},A\cap B,(A\cap B)^{c}\}\) (take \(X_{+}=\{A,A\cap B\}\)). By contrast, \(X=\{A,A^{c},B,B^{c}\}\) is non-nested when the events A and B are logically independent. Also, the above-mentioned sub-agenda \(X=\{A_{1},A_{1}^{c},A_{2},A_{2}^{c},A_{3} ,A_{3}^{c}\}\) in our climate-change example is non-nested. Further examples are given in Part I.

Theorem 1

  1. (a)

    For any non-nested (finite)Footnote 10 sub-agenda X of the \(\sigma \) -algebra \(\Sigma \), every pooling function \(F:{\mathcal {P}} _{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma } \) satisfying independence on X and (global) consensus preservation is neutral on X.

  2. (b)

    For any nested sub-agenda X of the \( \sigma \) -algebra \(\Sigma \) (where X is finite and distinct from \( \{\varnothing ,\Omega \}\)), there exists a pooling function \( F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X and (global) consensus preservation but violating neutrality on X.

The possibilities arising for nested X are illustrated by variants of the two pooling functions constructed in Sect. 4, where \(\Sigma \) is the Borel \(\sigma \)-algebra on \(\Omega ={\mathbb {R}}\) and X is one of the nested sub-agendas \(\cup _{\omega \in {\mathbb {R}}}\{(-\infty ,\omega ],(\omega ,\infty )\}\) and \(\cup _{\omega \in \{-1,+1\}}\{(-\infty ,\omega ],(\omega ,\infty )\}\). To obtain pooling functions that are not neutral on X, as described in part (b), we must avoid the use of a uniform decision criterion on all elements of X.Footnote 11 Theorem 1 continues to hold if we weaken consensus preservation to conditional consensus preservation on premises, as shown next:

Theorem 2

  1. (a)

    For any non-nested (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \) , every pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X and conditional consensus preservation on X is neutral on X.

  2. (b)

    For any nested sub-agenda X of the \(\sigma \) -algebra \(\Sigma \) (where X is finite and not \(\{\varnothing ,\Omega \}\)), there exists a pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X and conditional consensus preservation on X but violating neutrality on X.

However, if we weaken the consensus-preservation requirement further—namely to consensus preservation on X—then the neutrality conclusion follows only if the events within the sub-agenda X exhibit stronger interconnections. Specifically, the set X must be ‘path-connected’, as originally defined in binary judgment-aggregation theory (often under the name ‘total blockedness’; see Nehring and Puppe 2010). To define path-connectedness formally, we begin with a preliminary notion. Given the sub-agenda X, we say that an event \(A\in X\) conditionally entails another event \(B\in X\)—written \(A\vdash ^{*}B\)—if there is a subset \(Y\subseteq X\) (possibly empty, but not uncountably infinite) such that \(\{A\}\cup Y\) entails B, where, for non-triviality, \(Y\cup \{A\}\) and \(Y\cup \{B^{c}\}\) are each consistent. In our climate-change example with sub-agenda \(X=\{A_{1},A_{1} ^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}\), \(A_{1}\) conditionally entails \(A_{3}\) (take \(Y=\{A_{2}\}\)), but none of \(A_{1}^{c}\), \(A_{2}^{c}\), and \(A_{3}\) conditionally entails any event in X other than itself.

We call the sub-agenda X path-connected if any two events \(A,B\in X\backslash \{\varnothing ,\Omega \}\) can be connected by a path of conditional entailments, i.e., there exist events \(A_{1},\ldots ,A_{k}\in X\) (\(k\ge 1\)) such that \(A=A_{1}\vdash ^{*}A_{2}\vdash ^{*}\cdots \vdash ^{*}A_{k}=B\), and non-path-connected otherwise. For example, suppose \(X=\{A,A^{c} ,B,B^{c},C,C^{c}\}\), where \(\{A,B,C\}\) is a partition of \(\Omega \) (and \(A,B,C\ne \varnothing \)). Then X is path-connected. For instance, to see that there is a path from A to B, note that \(A\vdash ^{*}C^{c}\) (take \(Y=\varnothing \)) and  \(C^{c}\vdash ^{*}B~\) (take \(Y=\{A^{c} \}\)). Many sub-agendas are not path-connected, including all nested sub-agendas X (\(\ne \{\varnothing ,\Omega \}\)) and the sub-agenda \(X=\{A_{1},A_{1}^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}\) in the climate-change example.

Theorem 3

  1. (a)

    For any path-connected (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \), every pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}} _{\Sigma }\) satisfying independence on X and consensus preservation on X is neutral on X.

  2. (b)

    For any non-path-connected (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \), there exists a pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X  and consensus preservation on X  but violating neutrality on X.

6 When is opinion pooling linear on premises?

Our next question is whether, and for which sub-agendas X, our requirements on an opinion pooling function imply linearity on premises, over and above neutrality. Formally, a pooling function for agenda \(\Sigma \) is called linear on X \((\subseteq \Sigma )\) if there exist real-valued weights \(w_{1},\ldots ,w_{n}\ge 0\) with \(w_{1}+\cdots +w_{n}=1\) such that, for every profile \((P_{1},\ldots ,P_{n})\in {\mathcal {P}}_{\Sigma }^{n}\), the collective probability of any event A in X is given by

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)= {\displaystyle \sum _{i=1}^{n}} w_{i}P_{i}(A). \end{aligned}$$

If \(X=\Sigma \), linearity on X reduces to linearity in the global sense, familiar from the established literature.

As in the case of neutrality, whether our axioms imply linearity on a given sub-agenda X depends on how the events in X are connected and which consensus-preservation requirement we impose on the pooling function. Again, our first result uses the strongest consensus-preservation requirement and applies to a very large class of sub-agendas.

Theorem 4

  1. (a)

    For any non-nested (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \) with \(\left| X\backslash \{\Omega ,\varnothing \}\right| >4\), every pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma } \) satisfying independence on X and (global) consensus preservation is linear on X.

  2. (b)

    For any other sub-agenda X of the \(\sigma \) -algebra \(\Sigma \) (where X is finite and distinct from \(\{\varnothing ,\Omega \}\)), there exists a pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X and (global) consensus preservation but violating linearity on X .

If we weaken consensus preservation to conditional consensus preservation on X, the linearity conclusion still follows, but only if the sub-agenda X is ‘non-simple’—a condition stronger than non-nestedness, but still weaker than path-connectedness.Footnote 12 The notion of non-simplicity also comes from binary judgment-aggregation theory, where the non-simple agendas are those that are susceptible to majority inconsistencies, the judgment-aggregation analogues of Condorcet’s paradox (e.g., Nehring and Puppe 2010; Dietrich and List 2007). Formally, a sub-agenda X is non-simple if it has a minimal inconsistent subset \(Y\subseteq X\) of more than two (but not uncountably many) events, and simple otherwise. (A set Y is minimal inconsistent if it is inconsistent but all its proper subsets are consistent.) For example, the sub-agenda \(X=\{A_{1} ,A_{1}^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}\) in our climate-change example is non-simple, since its three-element subset \(Y=\{A_{1},A_{2},A_{3}^{c}\}\) is minimal inconsistent. By contrast, a sub-agenda of the form \(X=\{A,A^{c}\}\) is simple.

Theorem 5

  1. (a)

    For any non-simple (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \), every pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma } \) satisfying independence on X and conditional consensus preservation on X is linear on X.

  2. (b)

    For any simple sub-agenda X of the \(\sigma \) -algebra \(\Sigma \) (where X is finite and distinct from \(\{\varnothing ,\Omega \}\)), there exists a pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X and conditional consensus preservation on X but violating linearity on X.

Finally, if we impose only the weakest of our three consensus-preservation requirements—consensus preservation on X—then the linearity conclusion follows only if the sub-agenda X is path-connected and satisfies an additional condition. A sufficient such condition is ‘partitionality’. A sub-agenda X is partitional if some subset \(Y\subseteq X\) partitions \(\Omega \) into at least three non-empty events (where Y is finite or countably infinite), and non-partitional otherwise. As an illustration, recall our earlier example of a sub-agenda given by \(X=\{A,A^{c} ,B,B^{c},C,C^{c}\}\), where \(\{A,B,C\}\) partitions \(\Omega \) (with \(A,B,C\ne \varnothing \)). This sub-agenda is both path-connected (as mentioned above) and partitional.

Theorem 6

  1. (a)

    For any path-connected and partitional (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \), every pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X and consensus preservation on X is linear on X.

  2. (b)

    For any non-pathconnected (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \) , there exists a pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X and consensus preservation on X but violating linearity on X.

It is clear from part (b) that path-connectedness of the premises is necessary for the linearity conclusion to follow. The other condition, partitionality, is not necessary. But it is not redundant:

Proposition 2

For some path-connected and non-partitional (finite) sub-agenda X of the \(\sigma \) -algebra \(\Sigma \) , there exists a pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence on X (even neutrality on X ) and consensus preservation on X but violating linearity on X.Footnote 13

7 Classic results as special cases

As should be evident, if we apply our results to the maximal sub-agenda \(X=\Sigma \), we obtain classic results (by Aczél and Wagner 1980; McConway 1981) as special cases. To see why this is the case, note three things. First, when \(X=\Sigma \), our various conditions on the sub-agenda X all reduce to a single condition on the size of the \(\sigma \)-algebra \(\Sigma \).

Lemma 1

For the maximal sub-agenda \(X=\Sigma \) (where \(\Sigma \ne \{\Omega ,\varnothing \}\)), the conditions of non-nestedness, non-simplicity, path-connectedness, and partitionality are all equivalent, and they all hold if and only if \(\left| \Sigma \right| >4\), i.e., if and only if \(\Sigma \) is not of the form \(\{A,A^{c},\Omega ,\varnothing \} \).

Second, when \(X=\Sigma \), independence, neutrality, and linearity on X reduce to independence, neutrality, and linearity in the familiar ‘global’ sense, as already noted. Third, our various consensus-preservation requirements all become equivalent, by Proposition 1.

In consequence, our six theorems reduce to two classic results:Footnote 14

  • Theorems 13 reduce to the result that all pooling functions satisfying independence and consensus preservation are neutral if \(\left| \Sigma \right| >4\), but not if \(\left| \Sigma \right| =4\);

  • Theorems 46 reduce to the result that all pooling functions satisfying independence and consensus preservation are linear if \(\left| \Sigma \right| >4\), but not if \(\left| \Sigma \right| =4\).

The case \(\left| \Sigma \right| <4\) is uninteresting because it means that \(\Sigma \) is the trivial \(\sigma \)-algebra \(\{\Omega ,\varnothing \}\). Let us slightly re-formulate these two results:

Corollary 1

For the \(\sigma \)-algebra \(\Sigma \),

  1. (a)

    if \(\left| \Sigma \right| >4\), every pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence and consensus preservation is linear (and by implication neutral);

  2. (b)

    if \(\left| \Sigma \right| =4\), there exists a pooling function \(F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }\) satisfying independence and consensus preservation but violating neutrality (and thereby also violating linearity).