Probabilistic opinion pooling generalized. Part two: the premise-based approach

Dietrich, Franz; List, Christian

doi:10.1007/s00355-017-1035-y

Original Paper
Open access
Published: 10 April 2017

Volume 48, pages 787–814, (2017)
Social Choice and Welfare

Abstract

How can several individuals’ probability functions on a given $\sigma $-algebra of events be aggregated into a collective probability function? Classic approaches to this problem usually require ‘event-wise independence’: the collective probability for each event should depend only on the individuals’ probabilities for that event. In practice, however, some events may be ‘basic’ and others ‘derivative’, so that it makes sense first to aggregate the probabilities for the former and then to let these constrain the probabilities for the latter. We formalize this idea by introducing a ‘premise-based’ approach to probabilistic opinion pooling, and show that, under a variety of assumptions, it leads to linear or neutral opinion pooling on the ‘premises’.

1 Introduction

Suppose each individual member of some group (expert panel, court, jury etc.) assigns probabilities to some events. How can these individual probability assignments be aggregated into a collective probability assignment? Classically, this problem has been modelled as the aggregation of probability functions, which are defined on some $\sigma $-algebra of events, a set of events that is closed under negation and countable disjunction (and thereby also under countable conjunction). Each individual submits a probability function on the given $\sigma $-algebra, and these probability functions are then aggregated into a single collective probability function. One of the best-known solutions to this aggregation problem is linear pooling, where the collective probability function is a linear average of the individual probability functions. Linear pooling has several salient properties. First, if all individuals unanimously assign probability 1 (or probability 0) to some event, this probability assignment is preserved collectively (‘consensus preservation’). Second, the collective probability for each event depends only on individual probabilities for that event (‘event-wise independence’). Third, all events are treated equally: the pattern of dependence between individual and collective probability assignments is the same for all events (‘neutrality’).

In many practical applications, however, not all events are equal. In particular, the events in a $\sigma $-algebra may fall into two categories (whose boundaries may be drawn in different ways). On the one hand, there are events that correspond to intuitively basic propositions, such as ‘it will rain’, ‘it will be humid’, or ‘atmospheric $\hbox {CO}_{2}$ causes global warming’. On the other hand, there are events that are intuitively non-basic. These can be viewed as combinations of basic events, for instance via disjunction (union) of basic events, conjunction (intersection), or negation (complementation). It is not obvious that when we aggregate probabilities, basic and non-basic events should be treated alike.

For a start, we may conceptualize basic and non-basic events differently, in analogy to the distinction between atomic and composite propositions in logic (the latter being logical combinations of the former). Second, the way we assign probabilities to non-basic events is likely to differ from the way we assign probabilities to basic events. When we assign a probability to a conjunction or disjunction, this typically presupposes the assignment of probabilities to the underlying conjuncts or disjuncts. For example, the obvious way to assign a probability to the event ‘rain or heat’ is to ask what the probability of rain is, what the probability of heat is, and whether the two are correlated.^{Footnote 1} If this is right, the natural method of making probabilistic judgments is to consider basic events first and to consider non-basic events next. Basic events serve as ‘premises’: we first assign probabilities to them, and then let these probability assignments constrain our probability assignments for other, non-basic events.

In this paper, we propose an approach to probability aggregation that captures this idea: the premise-based approach. Under this approach, the group first assigns collective probabilities to all basic events (the ‘premises’) by aggregating the individuals’ probabilities for them; and then it assigns probabilities to all other events, constrained by the probabilities of the basic events. If the basic events are ‘rain’ and ‘heat’, then, in a first step, the collective probabilities for these two events are determined by aggregating the individual probabilities for them. In a second step, the collective probabilities for all other events are assigned. For example, the collective probability of ‘rain and heat’ might be defined as a suitable function of the collective probability of ‘rain’, the collective probability of ‘heat’, and an estimated rain/heat-correlation coefficient, which could be the result of aggregating the rain/heat-correlation coefficients encoded in the individual probability functions.
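The two-step procedure just described can be illustrated with a minimal sketch. It rests on assumptions that the text does not fix: premises are pooled by an equally weighted linear average, the individual correlation coefficients are averaged in the same way, and the conjunction is computed from the correlation of the two indicator variables, clipped to the Fréchet bounds to keep the result coherent. The function names and the numbers are hypothetical.

```python
from math import sqrt

def pool_premise(values, weights=None):
    """Step 1 (assumed rule): weighted linear average of individual values."""
    n = len(values)
    weights = weights or [1.0 / n] * n
    return sum(w * v for w, v in zip(weights, values))

def conjunction_probability(p, q, rho):
    """Step 2: P(R and H) from P(R)=p, P(H)=q and indicator correlation rho."""
    raw = p * q + rho * sqrt(p * (1 - p) * q * (1 - q))
    lower, upper = max(0.0, p + q - 1.0), min(p, q)
    # Clip to the Frechet bounds so that the value stays coherent.
    return min(max(raw, lower), upper)

# Hypothetical opinions of two individuals.
rain = [0.6, 0.8]   # individual probabilities of 'rain'
heat = [0.5, 0.3]   # individual probabilities of 'heat'
rho = [0.2, 0.0]    # individual rain/heat correlation coefficients

p_rain = pool_premise(rain)
p_heat = pool_premise(heat)
p_both = conjunction_probability(p_rain, p_heat, pool_premise(rho))
```

Only the first step is constrained by the axiom introduced below; the second step is one of many coherent ways of extending the premise probabilities.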

This proposal can be expressed more precisely by a single axiom, which does not require the (inessential) sequential implementation just sketched, but focuses on a core informational restriction: the collective probability of any ‘premise’ (basic event) should depend solely on the individual probabilities for this premise, not on individual probabilities for other events. We call this axiom independence on premises. Our axiomatic analysis of premise-based aggregation is inspired by binary judgment-aggregation theory, where the premise-based approach has also been characterized by a restricted independence axiom, for instance by Dietrich (2006), Mongin (2008), and Dietrich and Mongin (2010). For less formal discussions of premise-based aggregation, see Kornhauser and Sager (1986), Pettit (2001), List and Pettit (2002), and List (2006).

The way in which we have just motivated the premise-based approach and the corresponding axiom is bound to prompt some questions. In particular, although the distinction between ‘basic’ and ‘non-basic’ events is arguably not ad hoc, there is no purely formal criterion for drawing that distinction.^{Footnote 2} However, there is another, less controversial motivation for the premise-based approach. Our central axiom—independence on premises—privileges particular events, called the ‘premises’. We have so far interpreted these in a very specific way, taking them to correspond to basic events and to constitute the premises in an individual’s probability-assignment process. But we can give up this interpretation and define a ‘premise’ simply as an event for which it is desirable that the collective probability depend solely on the event-specific individual probabilities. If ‘premises’ are defined like this, then our axiom—independence on premises—is justified by definition (though of course we can no longer offer any guidance as to which events should count as premises).^{Footnote 3}

We show that premise-based opinion pooling imposes significant restrictions on how the collective probabilities of the premises can be determined. At the same time, these restrictions are not undesirable; they do not lead to ‘undemocratic’ or ‘degenerate’ forms of opinion pooling. Specifically, given certain logical connections between the premises, independence on premises, together with a unanimity-preservation requirement, implies that the collective probability for each premise is a (possibly weighted) linear average of the individual probabilities for that premise, where the vector of weights across different individuals is the same for each premise. We present several variants of this result, which differ in the nature of the unanimity-preservation requirement and in the kinds of connections that are assumed to hold between premises. In some variants, we do not obtain the ‘linearity’ conclusion, but only a weaker ‘neutrality’ conclusion: the collective probability for each premise must be a (possibly non-linear) function of the individual probabilities for that premise, where this function is the same for each premise. These results are structurally similar to, but interpretively different from those in our companion paper (Dietrich and List 2017), to which we shall refer as ‘Part I’. Furthermore, our results stand in contrast with existing results on the premise-based approach in binary judgment aggregation. When judgments are binary, independence on premises leads to dictatorial aggregation under analogous conditions (see especially Dietrich and Mongin 2010).

Our results apply regardless of which events are deemed to serve as premises. In the extreme case in which all events count as premises, the requirement of independence on premises reduces to the familiar event-wise independence axiom (sometimes called the strong setwise function property), and our results reduce to a classic characterization of linear pooling (see Aczél and Wagner 1980; McConway 1981; see also Wagner 1982, 1985; Aczél et al. 1984; Genest 1984a; Mongin 1995; Chambers 2007).^{Footnote 4}

2 The framework

We consider a group of $n\ge 2$ individuals, labelled $i=1,\ldots ,n$, who have to assign collective probabilities to some events.

The agenda: a $\sigma $-algebra of events. We consider a non-empty set $\Omega $ of possible worlds (or states). An event is a subset A of $\Omega $; its complement (‘negation’) is denoted $A^{c}:=\Omega \backslash A$. The set of events to which probabilities are assigned is called the agenda. We assume that it is a $\sigma $-algebra, $\Sigma \subseteq 2^{\Omega }$, i.e., a set of events that is closed under complementation and countable union (and by implication also countable intersection). The simplest non-trivial example of a $\sigma $-algebra is of the form $\Sigma =\{A,A^{c},\Omega ,\varnothing \}$, where $\varnothing \subsetneq A\subsetneq \Omega $. Another example is the set $2^{\Omega }$ of all events; this is a commonly studied $\sigma $-algebra when $\Omega $ is finite or countably infinite. A third example is the $\sigma $-algebra of Borel-measurable sets when $\Omega ={\mathbb {R}}$.

An example. Let us give an example similar to the lead example in Part I, except that we now take the agenda to be a $\sigma $-algebra. Let the set $\Omega $ of possible worlds be the set of vectors $\{0,1\}^{3}\backslash \{(1,1,0)\}$ with the following interpretation. The first component of each vector indicates whether atmospheric $\hbox {CO}_{2}$ is above some threshold (1 = ‘yes’ and 0 = ‘no’), the second component indicates whether there is a mechanism to the effect that if atmospheric $\hbox {CO}_{2}$ is above that threshold, then Arctic summers are ice-free, and the third component indicates whether Arctic summers are ice-free. The triple (1, 1, 0) is excluded from $\Omega $ because it would represent an inconsistent combination of characteristics. Now the agenda is $\Sigma =2^{\Omega }$.
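For concreteness, this example can be enumerated directly. The sketch below represents worlds as tuples and events as frozensets of worlds; the names `OMEGA`, `SIGMA`, `event`, and `power_set` are illustrative choices, not notation from the paper.

```python
from itertools import combinations, product

# Seven possible worlds: all 0/1 triples except the excluded (1, 1, 0).
OMEGA = frozenset(w for w in product((0, 1), repeat=3) if w != (1, 1, 0))

def event(index):
    """The event that component `index` of the world equals 1."""
    return frozenset(w for w in OMEGA if w[index] == 1)

# CO2 above threshold, mechanism exists, ice-free Arctic summers.
A1, A2, A3 = event(0), event(1), event(2)

def power_set(worlds):
    """All subsets of a finite set of worlds, i.e. the agenda 2^Omega."""
    items = sorted(worlds)
    return {frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)}

SIGMA = power_set(OMEGA)
```

Excluding (1, 1, 0) is what makes the first two events jointly entail the third: `A1 & A2` is a subset of `A3`.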

The opinions: probability functions. Opinions are represented by probability functions on $\Sigma $. Formally, a probability function on $\Sigma $ is a function $P:\Sigma \rightarrow [0,1]$ such that $P(\Omega )=1$ and P is $\sigma $-additive (i.e., $P(A_{1}\cup A_{2}\cup \ldots )=P(A_{1})+P(A_{2})+\cdots $ for every sequence of pairwise disjoint events $A_{1},A_{2},\ldots \in \Sigma $). We write ${\mathcal {P}} _{\Sigma }$ to denote the set of all probability functions on $\Sigma $.

Opinion pooling. Given the agenda $\Sigma $, a combination of probability functions across the individuals, $(P_{1},\ldots ,P_{n})$, is called a profile (of probability functions). An (opinion) pooling function is a function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$, which assigns to each profile $(P_{1},\ldots ,P_{n})$ a collective probability function $P=F(P_{1},\ldots ,P_{n})$, also denoted $P_{P_{1},\ldots ,P_{n}}$. An example of $P_{P_{1},\ldots ,P_{n}}$ is the arithmetic average $\frac{1}{n}P_{1}+\cdots +\frac{1}{n}P_{n}$.
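When $\Omega $ is finite and $\Sigma =2^{\Omega }$, a probability function is fixed by the masses it assigns to single worlds, so a pooling function can be sketched in a few lines. The following is an illustrative implementation of linear pooling under that assumption; the dictionary representation and the names `prob` and `linear_pool` are hypothetical.

```python
def prob(P, event):
    """Probability of an event (a set of worlds) under the mass function P."""
    return sum(P[w] for w in event)

def linear_pool(profile, weights=None):
    """Weighted linear average of a profile; equal weights by default."""
    n = len(profile)
    weights = weights or [1.0 / n] * n
    worlds = profile[0].keys()
    return {w: sum(wt * P[w] for wt, P in zip(weights, profile))
            for w in worlds}

# Two hypothetical individuals on a three-world space.
P1 = {"a": 0.5, "b": 0.5, "c": 0.0}
P2 = {"a": 0.1, "b": 0.3, "c": 0.6}
P = linear_pool([P1, P2])
```

Averaging the masses world by world yields the same result as averaging the probabilities event by event, because probabilities of events are sums of masses.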

Some logical terminology. We conclude this section with some further terminology. Events distinct from $\varnothing $ and $\Omega $ are called contingent. A set S of events is consistent if its intersection $\cap _{A\in S}A$ is non-empty, and inconsistent otherwise; S entails an event B if the intersection of S is included in B (i.e., $\cap _{A\in S}A\subseteq B$).
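For finite families of events these notions reduce to set operations. The sketch below is illustrative only; it adopts the convention (an assumption, not stated in the text) that the intersection of the empty family is $\Omega $, and the function names are hypothetical.

```python
from functools import reduce

def intersection(events, omega):
    """Intersection of a finite family of events; Omega for the empty family."""
    return reduce(lambda acc, e: acc & frozenset(e), events, frozenset(omega))

def is_contingent(a, omega):
    """An event is contingent if it is neither empty nor all of Omega."""
    return 0 < len(a) < len(omega)

def is_consistent(events, omega):
    """A set of events is consistent if its intersection is non-empty."""
    return len(intersection(events, omega)) > 0

def entails(events, b, omega):
    """S entails B if the intersection of S is included in B."""
    return intersection(events, omega) <= frozenset(b)

OMEGA = {1, 2, 3, 4}
A, B, C = {1, 2}, {2, 3}, {2, 3, 4}
```

Here `{A, B}` is consistent and entails `C`, since the intersection `{2}` is contained in `C`. An inconsistent set entails every event, which is why the conditional-entailment notion used later adds explicit consistency conditions.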

3 Axiomatic requirements on premise-based opinion pooling

We now introduce the axioms that we require a premise-based opinion pooling function to satisfy.

3.1 Independence on premises

Before we introduce our new axiom of independence on premises, let us recall the familiar requirement of (event-wise) independence. It requires that the collective probability for any event depend only on the individual probabilities for that event, independently of the probabilities of other events.

Independence. For each event $A\in \Sigma $, there exists a function $D_{A}:[0,1]^{n}\rightarrow [0,1]$ (the local pooling criterion for A) such that, for all $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$,

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)=D_{A}(P_{1}(A),\ldots ,P_{n}(A)). \end{aligned}$$

This requirement can be criticized—in the classical framework where the agenda is a $\sigma $-algebra—for being normatively unattractive. Typically only some of the events in the $\sigma $-algebra $\Sigma $ correspond to intuitively basic propositions such as ‘the economy will grow’ or ‘atmospheric $\hbox {CO}_{2}$ causes global warming’. Other events in $\Sigma $ are combinations of basic events, such as ‘the economy will grow or atmospheric $\hbox {CO}_{2}$ causes global warming’. The non-basic events can get enormously complicated: they can be conjunctions of (finitely or countably infinitely many) basic events, or disjunctions, or disjunctions of conjunctions, and so on. It seems natural to privilege the basic events over the other, more ‘artificial’ events by replacing the independence requirement with a restricted independence requirement that quantifies only over basic events. Indeed, it seems implausible to apply independence to composite events such as ‘the economy will grow or atmospheric $\hbox {CO}_{2}$ causes global warming’, since this would prevent us from using the probabilities of each of the constituent events in determining the overall probability.

By restricting the independence requirement to basic events, we treat these as premises in the collective probability-assignment process, first aggregating individual probabilities for basic events and then letting the resulting collective probabilities constrain the collective probabilities of all other events. (The probabilities of the premises constrain those other probabilities because the probability assignments in their entirety must be probabilistically coherent.)

Formally, consider a sub-agenda of $\Sigma $, denoted X, which we interpret as containing the basic events, called the premises. By a sub-agenda we mean a subset of $\Sigma $ which is non-empty and closed under complementation (i.e., it forms an ‘agenda’ in the generalized sense discussed in Part I). We introduce the following axiom:

Independence on X (‘on premises’). For each $A\in X$, there exists a function $D_{A}:[0,1]^{n}\rightarrow [0,1]$ (the local pooling criterion for A) such that, for all $P_{1},\ldots ,P_{n} \in {\mathcal {P}}_{\Sigma }$,

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)=D_{A}(P_{1}(A),\ldots ,P_{n}(A)). \end{aligned}$$

In the climate-change example of Sect. 2, the sub-agenda of premises might be defined as $X=\{A_{1},A_{1}^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}$, where $A_{1}$ is the event that atmospheric $\hbox {CO}_{2}$ is above the critical threshold, $A_{2}$ is the event that there is a mechanism by which $\hbox {CO}_{2}$ concentrations above the threshold cause ice-free Arctic summers, and $A_{3}$ is the event of ice-free Arctic summers. Conjunctions such as $A_{1}\cap A_{2}$ are not included in the set X of premises here. As a result, independence on X allows the collective probability for any such conjunction to depend not only on the experts’ probabilities for that conjunction, but also, for instance, on their probabilities for the underlying conjuncts (together with auxiliary assumptions about correlations between them).^{Footnote 5}
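What the axiom demands can be checked on a single instance. The sketch below uses equally weighted linear pooling (which satisfies independence on every event, not only on premises) and two hypothetical profiles that agree on each expert's probability for $A_{1}$ but differ elsewhere; the collective probability of $A_{1}$ is then the same in both. All masses are invented for illustration.

```python
from itertools import product

# Worlds of the climate-change example, in a fixed order.
OMEGA = [w for w in product((0, 1), repeat=3) if w != (1, 1, 0)]
A1 = {w for w in OMEGA if w[0] == 1}

def make(masses):
    """Mass function from a list of masses, one per world in OMEGA order."""
    return dict(zip(OMEGA, masses))

def prob(P, event):
    return sum(P[w] for w in event)

def linear_pool(profile):
    n = len(profile)
    return {w: sum(P[w] for P in profile) / n for w in OMEGA}

# Both profiles give expert 1 P(A1) = 0.6 and expert 2 P(A1) = 0.2.
profile_1 = [make([0.1, 0.1, 0.1, 0.1, 0.2, 0.2, 0.2]),
             make([0.2, 0.2, 0.2, 0.2, 0.1, 0.1, 0.0])]
profile_2 = [make([0.4, 0.0, 0.0, 0.0, 0.6, 0.0, 0.0]),
             make([0.0, 0.0, 0.0, 0.8, 0.0, 0.0, 0.2])]

pooled_1 = linear_pool(profile_1)
pooled_2 = linear_pool(profile_2)
```

Here the local pooling criterion for $A_{1}$ is the average $D_{A_{1}}(t_{1},t_{2})=\frac{1}{2}t_{1}+\frac{1}{2}t_{2}$. A single instance illustrates the axiom but does not establish it for a given pooling function.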

We have explained why event-wise independence should not be required for non-basic events. But why should we require it for basic events (premises)? We offer three reasons:

First, if we accept the idea that an individual’s probabilistic belief about a given premise is not influenced by, but might influence, his or her beliefs about other events, then we may regard those other beliefs as either by-products of, or unrelated to, the individual’s belief about the premise in question. It then seems reasonable to treat those other beliefs as irrelevant to the question of what collective probability to assign to that premise. (More precisely, any beliefs about other events provide no relevant additional information once the individual’s belief about the premise is given.)
Second, the premise-based approach can be motivated by appealing to the idea of a ‘rational collective agent’ that forms its probabilistic beliefs by reasoning from premises to conclusions. This kind of collective reasoning can be implemented by first aggregating the probabilities for the premises and then letting these constrain the probabilities assigned to other events. In the case of binary judgment aggregation, Pettit (2001) has described this process as the ‘collectivization of reason’.
Third, as mentioned in the introduction, one might simply define the premises as the events for which it is desirable that the collective probabilities depend solely on the event-specific individual probabilities. This would render the requirement of independence on premises justified by definition.

3.2 Consensus preservation on premises

Informally, our second axiomatic requirement says that whenever there is unanimous agreement among the individuals about the probability of certain events, this agreement should be preserved collectively. We distinguish between different versions of this requirement. The most familiar one is the following:

Consensus preservation. For all $A\in \Sigma $ and all $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$, if, for all i, $P_{i}(A)=1$, then $P_{P_{1},\ldots ,P_{n}}(A)=1$.^{Footnote 6}

A second, less demanding version of the requirement is restricted to events in the sub-agenda X of premises.

Consensus preservation on X (‘on premises’). For all $A\in X$ and all $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$, if, for all i, $P_{i}(A)=1$, then $P_{P_{1},\ldots ,P_{n}}(A)=1$.

Restricting consensus preservation in this way may be plausible because a consensus on any event outside X may be considered less compelling than a consensus on a premise in X, for reasons similar to those for which we restricted event-wise independence to premises. A consensus on a non-basic event could be ‘spurious’ in the sense that there might not be any agreement on its basis (see Mongin 2005).^{Footnote 7}

We also consider a third version of consensus preservation, which is still restricted to premises, but refers to conditional probabilities. It says that if all individuals assign a conditional probability of 1 to some premise given another, then this should be preserved collectively.^{Footnote 8}

Conditional consensus preservation on X (‘on premises’). For all $A,B\in X$ and all $P_{1},\ldots ,P_{n}\in {\mathcal {P}} _{\Sigma }$, if, for all i, $P_{i}(A|B)=1$ (provided $P_{i}(B)\ne 0$), then $P_{P_{1},\ldots ,P_{n}}(A|B)=1$ (provided $P_{P_{1},\ldots ,P_{n}}(B)\ne 0$).

Conditional consensus preservation on X is equivalent to another requirement. This says that if all individuals agree that some premise implies another with probabilistic certainty (i.e., the probability of the first event occurring without the second is zero), then that agreement should be preserved collectively.

Implication preservation on X (‘on premises’). For all events $A,B\in X$ and all $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$, if, for all i, $P_{i}(A\backslash B)=0$, then $P_{P_{1},\ldots ,P_{n} }(A\backslash B)=0$.

The equivalence between conditional consensus preservation on X and implication preservation on X follows from the fact that the clause ‘$P_{i}(A|B)=1$ (provided $P_{i}(B)\ne 0$)’ is equivalent to ‘$P_{i} (B\backslash A)=0$’, and the clause ‘$P_{P_{1},\ldots ,P_{n}}(A|B)=1$ (provided $P_{P_{1},\ldots ,P_{n}}(B)\ne 0$)’ is equivalent to ‘$P_{P_{1},\ldots ,P_{n} }(B\backslash A)=0$’. Thus the statement of conditional consensus preservation on X can be reduced to that of implication preservation on X (except that the roles of A and B are swapped).
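The equivalence of the two clauses can be spelled out for an arbitrary probability function P as follows (a short derivation added for illustration). If $P(B)\ne 0$, then

$$\begin{aligned} P(A|B)=1\ \Longleftrightarrow \ P(A\cap B)=P(B)\ \Longleftrightarrow \ P(B\backslash A)=P(B)-P(A\cap B)=0. \end{aligned}$$

If instead $P(B)=0$, the clause ‘$P(A|B)=1$ (provided $P(B)\ne 0$)’ holds vacuously, and $P(B\backslash A)\le P(B)=0$, so both clauses hold. In either case, the two clauses have the same truth value.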

This equivalence also illuminates the relationship between conditional consensus preservation on X and consensus preservation on X, because the former, re-formulated as implication preservation on X, clearly implies the latter. Simply note that, in the statement of implication preservation on X, taking $B=A^{c}$ yields $P(A\backslash B)=P(A)$, so that a unanimous zero probability of any event A in X must be preserved, which is equivalent to consensus preservation on X.

In fact, conditional consensus preservation on X, when re-formulated as implication preservation on X, is also easily seen to be equivalent to a further unanimity-preservation requirement, which refers to unanimous assignments of probability 1 to a union of two events in X (just note that $A\backslash B$ has probability 0 if and only if $A^{c}\cup B$ has probability 1). This also shows that conditional consensus preservation on X is logically weaker than consensus preservation in its original form (on all of $\Sigma $), since it does not require preservation of unanimous assignments of probability 1 to intersections of two events in X, or unions or intersections of more than two events in X.

The following proposition summarizes the logical relationships between the different consensus-preservation requirements (in part (a)) and adds another simple but useful observation (in part (b)).

Proposition 1

(a) For any sub-agenda X of $\Sigma $, conditional consensus preservation on X
- implies consensus preservation on X;
- is implied by (global) consensus preservation;
- is equivalent to implication preservation on X, and to each of the following two requirements:
  $$\begin{aligned}&[\forall i \,\, P_{i}(A\cup B)=1]\Rightarrow P_{P_{1},\ldots ,P_{n}}(A\cup B)=1,\, \textit{for all }\,\,A,B\in X,\,\,P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma };\\&[\forall i\,\, P_{i}(A\cap B)=0]\Rightarrow P_{P_{1},\ldots ,P_{n}}(A\cap B)=0,\, \textit{for all }\,A,B\in X, \,\,P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }. \end{aligned}$$
(b) For the maximal sub-agenda $X=\Sigma $, all of these requirements are equivalent.

4 A class of applications

So far, all our examples of opinion pooling problems have involved events represented by propositions in natural language, such as ‘it will rain’. As argued in Part I, the assumption that the agenda is a $\sigma $-algebra is often unnatural in such cases. But there is a second class of applications, in which it is more natural to define the agenda as a $\sigma $-algebra $(\Sigma )$ and to restrict the independence requirement to some sub-agenda X. Suppose we wish to estimate the distribution of a real-valued or vector-valued variable, such as rainfall or the number of insurance claims in some period. Here, the set of worlds $\Omega $ could be ${{\mathbb {R}}}$, ${\mathbb {Z}}$, ${{\mathbb {N}}}$, or $\{0,1,\ldots ,m\}$, or it could be ${{\mathbb {R}}}^{k}$, ${\mathbb {Z}}^{k}$, ${{\mathbb {N}}}^{k}$, or $\{0,1,\ldots ,m\}^{k}$ (for natural numbers m and k). In such cases, the focus on the $\sigma $-algebra of events is more realistic. First, we may need a full probability distribution on that $\sigma $-algebra. Second, individuals may be able to come up with such a probability distribution, because, in practice, they can do the following:

- first choose some parametric class of probability functions (e.g., the class of Gaussian distributions if $\Omega ={{\mathbb {R}}}$, Poisson distributions if $\Omega ={\mathbb {N}}$, or binomial distributions if $\Omega =\{0,1,\ldots ,m\}$);
- then estimate the relevant parameter(s) of the distribution (e.g., the mean and standard deviation in the case of a Gaussian distribution).

Because the agenda in this kind of application (e.g., the $\sigma $-algebra of Borel sets over ${{\mathbb {R}}}$, or the power set of ${\mathbb {N}}$) contains very complicated events, it would be implausible to require event-wise independent aggregation for all such events. For instance, suppose $\Omega ={{\mathbb {R}}}$, and consider the event that a number’s distance to the nearest prime exceeds 37. It would seem artificial to determine the collective probability for that event without paying attention to the probabilities of other events. Here, the sub-agenda X on which event-wise independence is plausible is likely to be much smaller than the full $\sigma $-algebra $\Sigma $.

Let us give a concrete example. Let $\Sigma $ consist of the Borel-measurable subsets of $\Omega ={\mathbb {R}}$. A natural sub-agenda of $\Sigma $ is $X=\cup _{\omega \in {\mathbb {R}}}\{(-\infty ,\omega ],(\omega ,\infty )\}$. If we require independence on X with a uniform decision criterion $D=D_{A}$ ($A\in X$), where $D(t_{1},\ldots ,t_{n})=\frac{1}{n}t_{1}+\cdots +\frac{1}{n}t_{n}$, we obtain a unique pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$, because the collective probabilities for X uniquely extend to a probability function on the entire $\sigma $-algebra $\Sigma $. Alternatively, one might require independence on the smaller sub-agenda $X=\cup _{\omega \in \{-1,+1\}}\{(-\infty ,\omega ],(\omega ,\infty )\}$, still with the same uniform decision criterion D. This under-determines the pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$, because probability assignments for X do not uniquely extend to all of $\Sigma $. To fill this gap, one might define the collective probability function as the unique normal distribution which assigns the specified probabilities to $(-\infty ,-1]$ and $(-\infty ,+1]$, as determined by the decision criterion D.^{Footnote 9}
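The last construction can be made explicit. If the collective probabilities $p_{-}$ for $(-\infty ,-1]$ and $p_{+}$ for $(-\infty ,+1]$ satisfy $0<p_{-}<p_{+}<1$ (an assumption added here; outside this range no normal distribution fits), then the mean $\mu $ and standard deviation $\sigma $ solve $-1=\mu +\sigma z_{-}$ and $+1=\mu +\sigma z_{+}$, where $z_{\pm }$ are the standard-normal quantiles of $p_{\pm }$. The sketch below uses Python's standard-library `statistics.NormalDist`; the function names and the individual probabilities are hypothetical.

```python
from statistics import NormalDist

def D(*t):
    """Uniform decision criterion: the arithmetic average."""
    return sum(t) / len(t)

def fit_normal(p_low, p_high, x_low=-1.0, x_high=1.0):
    """Normal distribution with cdf(x_low) = p_low and cdf(x_high) = p_high."""
    if not 0.0 < p_low < p_high < 1.0:
        raise ValueError("need 0 < p_low < p_high < 1")
    standard = NormalDist()
    z_low, z_high = standard.inv_cdf(p_low), standard.inv_cdf(p_high)
    sigma = (x_high - x_low) / (z_high - z_low)
    mu = x_low - sigma * z_low
    return NormalDist(mu, sigma)

# Hypothetical individual probabilities for the two premises.
p_low = D(0.10, 0.20)    # collective probability of (-inf, -1]
p_high = D(0.70, 0.90)   # collective probability of (-inf, +1]
collective = fit_normal(p_low, p_high)
```

The fitted distribution reproduces the pooled probabilities on the premises exactly; its probabilities for all other Borel sets are a by-product of the parametric assumption rather than of the axioms.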

Let us summarize how the present kinds of applications differ from the above-mentioned applications involving events represented by natural-language propositions such as ‘it will rain’ or ‘atmospheric $\hbox {CO}_{2}$ causes global warming’:

1. $\Omega $ is a subset of ${\mathbb {R}}$ or of a higher-dimensional Euclidean space ${\mathbb {R}}^{k}$, rather than a set of ‘possible worlds’ specified by natural-language descriptions;
2. it is often natural to arrive at a probability function by choosing a parametric family of such functions (such as the family of Gaussian distributions) and then specifying the relevant parameter(s), while this approach would seem ad hoc in the other kind of application;
3. in practice, we may be interested in a probability function on the entire $\sigma $-algebra (e.g., in order to compute the mean of the distribution and other moments), rather than just in the probabilities of specific events.

5 When is opinion pooling neutral on premises?

We now show that, if there are certain kinds of interconnections among the premises in X, any pooling function satisfying independence on X and consensus preservation in one of the senses introduced must be neutral on X. This means that the pattern of dependence between individual and collective probability assignments is the same for all premises. In the next section, we turn to the question of whether our axioms imply linear pooling on premises, over and above neutrality.

Formally, a pooling function for agenda $\Sigma $ is neutral on $X (\subseteq \Sigma )$ if there exists some function $D:[0,1]^{n}\rightarrow [0,1]$—the local pooling criterion for events in X—such that, for every profile $(P_{1},\ldots ,P_{n})\in {\mathcal {P}}_{\Sigma }^{n}$, the collective probability of any event A in X is given by

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)=D(P_{1}(A),\ldots ,P_{n}(A)). \end{aligned}$$

If $X=\Sigma $, neutrality on X reduces to neutrality in the familiar global sense, briefly mentioned in the introduction.

Our first result uses the strongest consensus-preservation requirement we have introduced, namely ‘global’ consensus preservation (on all of $\Sigma $). Here, we obtain the neutrality conclusion as soon as the sub-agenda of premises satisfies a very mild condition: it is ‘non-nested’. We call a sub-agenda X nested if it has the form $X=\{A,A^{c}:A\in X_{+}\}$ for some set of events $X_{+}$ which is linearly ordered by set-inclusion, and non-nested otherwise. For instance, $X=\{A,A^{c}\}$ is nested (take $X_{+}:=\{A\}$), as is $X=\{A,A^{c},A\cap B,(A\cap B)^{c}\}$ (take $X_{+}=\{A,A\cap B\}$). By contrast, $X=\{A,A^{c},B,B^{c}\}$ is non-nested when the events A and B are logically independent. Also, the above-mentioned sub-agenda $X=\{A_{1},A_{1}^{c},A_{2},A_{2}^{c},A_{3} ,A_{3}^{c}\}$ in our climate-change example is non-nested. Further examples are given in Part I.
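For a finite sub-agenda, nestedness can be tested by brute force: pick one event from each complementary pair and check whether the picks form a chain under set-inclusion. This suffices because any subfamily of a linearly ordered $X_{+}$ is itself linearly ordered. The sketch below is illustrative; the helper names are hypothetical and the search is exponential in the number of pairs.

```python
from itertools import product

def with_complements(events, omega):
    """Close a finite family of events under complementation."""
    omega = frozenset(omega)
    events = [frozenset(e) for e in events]
    return set(events) | {omega - e for e in events}

def is_chain(events):
    """True if the events are linearly ordered by set-inclusion."""
    ordered = sorted(events, key=len)
    return all(a <= b for a, b in zip(ordered, ordered[1:]))

def is_nested(X, omega):
    """True if X = {A, A^c : A in X_plus} for some chain X_plus."""
    omega = frozenset(omega)
    pairs = {frozenset({a, omega - a}) for a in X}
    # Try every way of choosing one representative per complementary pair.
    return any(is_chain(choice)
               for choice in product(*(tuple(p) for p in pairs)))

# A and B are logically independent on a four-world space.
OMEGA = {0, 1, 2, 3}
A, B = frozenset({0, 1}), frozenset({1, 2})
X_single = with_complements([A], OMEGA)
X_conj = with_complements([A, A & B], OMEGA)
X_indep = with_complements([A, B], OMEGA)
```

On these inputs, `X_single` and `X_conj` are nested, while `X_indep` is not, matching the three examples in the text.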

Theorem 1

(a) For any non-nested (finite)^{Footnote 10} sub-agenda X of the $\sigma $-algebra $\Sigma $, every pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and (global) consensus preservation is neutral on X.
(b) For any nested sub-agenda X of the $\sigma $-algebra $\Sigma $ (where X is finite and distinct from $\{\varnothing ,\Omega \}$), there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and (global) consensus preservation but violating neutrality on X.

The possibilities arising for nested X are illustrated by variants of the two pooling functions constructed in Sect. 4, where $\Sigma $ is the Borel $\sigma $-algebra on $\Omega ={\mathbb {R}}$ and X is one of the nested sub-agendas $\cup _{\omega \in {\mathbb {R}}}\{(-\infty ,\omega ],(\omega ,\infty )\}$ and $\cup _{\omega \in \{-1,+1\}}\{(-\infty ,\omega ],(\omega ,\infty )\}$. To obtain pooling functions that are not neutral on X, as described in part (b), we must avoid the use of a uniform decision criterion on all elements of X.^{Footnote 11} Theorem 1 continues to hold if we weaken consensus preservation to conditional consensus preservation on premises, as shown next:

Theorem 2

(a) For any non-nested (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $, every pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and conditional consensus preservation on X is neutral on X.
(b) For any nested sub-agenda X of the $\sigma $-algebra $\Sigma $ (where X is finite and not $\{\varnothing ,\Omega \}$), there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and conditional consensus preservation on X but violating neutrality on X.

However, if we weaken the consensus-preservation requirement further—namely to consensus preservation on X—then the neutrality conclusion follows only if the events within the sub-agenda X exhibit stronger interconnections. Specifically, the set X must be ‘path-connected’, as originally defined in binary judgment-aggregation theory (often under the name ‘total blockedness’; see Nehring and Puppe 2010). To define path-connectedness formally, we begin with a preliminary notion. Given the sub-agenda X, we say that an event $A\in X$ conditionally entails another event $B\in X$—written $A\vdash ^{*}B$—if there is a subset $Y\subseteq X$ (possibly empty, but not uncountably infinite) such that $\{A\}\cup Y$ entails B, where, for non-triviality, $Y\cup \{A\}$ and $Y\cup \{B^{c}\}$ are each consistent. In our climate-change example with sub-agenda $X=\{A_{1},A_{1} ^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}$, $A_{1}$ conditionally entails $A_{3}$ (take $Y=\{A_{2}\}$), but none of $A_{1}^{c}$, $A_{2}^{c}$, and $A_{3}$ conditionally entails any event in X other than itself.

We call the sub-agenda X path-connected if any two events $A,B\in X\backslash \{\varnothing ,\Omega \}$ can be connected by a path of conditional entailments, i.e., there exist events $A_{1},\ldots ,A_{k}\in X$ ($k\ge 1$) such that $A=A_{1}\vdash ^{*}A_{2}\vdash ^{*}\cdots \vdash ^{*}A_{k}=B$, and non-path-connected otherwise. For example, suppose $X=\{A,A^{c},B,B^{c},C,C^{c}\}$, where $\{A,B,C\}$ is a partition of $\Omega $ (and $A,B,C\ne \varnothing $). Then X is path-connected. For instance, to see that there is a path from A to B, note that $A\vdash ^{*}C^{c}$ (take $Y=\varnothing $) and $C^{c}\vdash ^{*}B$ (take $Y=\{A^{c}\}$). Many sub-agendas are not path-connected, including all nested sub-agendas X ($\ne \{\varnothing ,\Omega \}$) and the sub-agenda $X=\{A_{1},A_{1}^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}$ in the climate-change example.
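The two definitions above can be tried out on small finite examples. The sketch below is an illustration only, assuming a finite $\Omega $ with events as frozensets; the function names (`meet`, `conditionally_entails`, `is_path_connected`) are hypothetical. It searches all subsets $Y\subseteq X$ for a witness of $A\vdash ^{*}B$ and then takes the transitive closure of the resulting relation.

```python
from itertools import combinations

def meet(events, omega):
    """Intersection of a collection of events (all of omega if it is empty)."""
    result = frozenset(omega)
    for e in events:
        result &= e
    return result

def conditionally_entails(a, b, agenda, omega):
    """A |-* B: some Y within the agenda makes {A} u Y entail B, while
    Y u {A} and Y u {B^c} are each consistent (non-empty intersection)."""
    b_c = frozenset(omega) - b
    events = list(agenda)
    for size in range(len(events) + 1):
        for y in combinations(events, size):
            common = meet(y, omega)
            if (a & common) and (b_c & common) and (a & common) <= b:
                return True
    return False

def is_path_connected(agenda, omega):
    """Every non-trivial event reaches every other via a chain of |-* steps."""
    full = frozenset(omega)
    nodes = [e for e in agenda if e and e != full]
    reach = {a: {a} | {b for b in nodes
                       if conditionally_entails(a, b, agenda, omega)}
             for a in nodes}
    changed = True
    while changed:  # transitive closure by repeated expansion
        changed = False
        for a in nodes:
            new = set().union(*(reach[b] for b in reach[a]))
            if not new <= reach[a]:
                reach[a] |= new
                changed = True
    return all(reach[a] == set(nodes) for a in nodes)

# The partition example: Omega = {1, 2, 3}, A = {1}, B = {2}, C = {3}.
omega = {1, 2, 3}
partition_agenda = {frozenset(s)
                    for s in ({1}, {2}, {3}, {2, 3}, {1, 3}, {1, 2})}
binary_agenda = {frozenset({1}), frozenset({2, 3})}  # nested: {A, A^c}
print(is_path_connected(partition_agenda, omega),
      is_path_connected(binary_agenda, omega))
```

Because every subset of X is enumerated, the cost grows exponentially with the size of the sub-agenda; the sketch is meant for checking hand-sized examples, not for practical use.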

Theorem 3

(a) For any path-connected (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $, every pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and consensus preservation on X is neutral on X.
(b) For any non-path-connected (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $, there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and consensus preservation on X but violating neutrality on X.

6 When is opinion pooling linear on premises?

Our next question is whether, and for which sub-agendas X, our requirements on an opinion pooling function imply linearity on premises, over and above neutrality. Formally, a pooling function for agenda $\Sigma $ is called linear on X $(\subseteq \Sigma )$ if there exist real-valued weights $w_{1},\ldots ,w_{n}\ge 0$ with $w_{1}+\cdots +w_{n}=1$ such that, for every profile $(P_{1},\ldots ,P_{n})\in {\mathcal {P}}_{\Sigma }^{n}$, the collective probability of any event A in X is given by

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)= {\displaystyle \sum _{i=1}^{n}} w_{i}P_{i}(A). \end{aligned}$$

If $X=\Sigma $, linearity on X reduces to linearity in the global sense, familiar from the established literature.
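To make the formula concrete, here is a minimal sketch of linear pooling on a finite set of worlds. It assumes that each probability function is given as a dictionary from worlds to probabilities; the names `prob` and `linear_pool` and the numerical profile are illustrative choices, not taken from the paper. Exact fractions are used so that the defining identity can be checked without rounding error.

```python
from fractions import Fraction as F

def prob(p, event):
    """Probability of an event (a set of worlds) under p."""
    return sum(p[w] for w in event)

def linear_pool(profile, weights):
    """Weighted average of the individual probability functions."""
    if any(w < 0 for w in weights) or abs(sum(weights) - 1) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    worlds = profile[0].keys()
    return {w: sum(wt * p[w] for wt, p in zip(weights, profile))
            for w in worlds}

# Hypothetical two-person profile on Omega = {1, 2, 3}.
profile = [
    {1: F(1, 2), 2: F(1, 2), 3: F(0)},
    {1: F(1, 4), 2: F(1, 4), 3: F(1, 2)},
]
weights = [F(1, 3), F(2, 3)]
pooled = linear_pool(profile, weights)

# The collective probability of any event is the weighted average
# of the individual probabilities of that same event.
event = {1, 2}
print(prob(pooled, event),
      sum(w * prob(p, event) for w, p in zip(weights, profile)))
```

Since the pooled function is a convex combination of probability functions, it is itself a probability function, and the identity holds for every event, not only for those in a given sub-agenda X.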

As in the case of neutrality, whether our axioms imply linearity on a given sub-agenda X depends on how the events in X are connected and which consensus-preservation requirement we impose on the pooling function. Again, our first result uses the strongest consensus-preservation requirement and applies to a very large class of sub-agendas.

Theorem 4

(a) For any non-nested (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $ with $\left| X\backslash \{\Omega ,\varnothing \}\right| >4$, every pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and (global) consensus preservation is linear on X.
(b) For any other sub-agenda X of the $\sigma $-algebra $\Sigma $ (where X is finite and distinct from $\{\varnothing ,\Omega \}$), there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and (global) consensus preservation but violating linearity on X.

If we weaken consensus preservation to conditional consensus preservation on X, the linearity conclusion still follows, but only if the sub-agenda X is ‘non-simple’—a condition stronger than non-nestedness, but still weaker than path-connectedness.^{Footnote 12} The notion of non-simplicity also comes from binary judgment-aggregation theory, where the non-simple agendas are those that are susceptible to majority inconsistencies, the judgment-aggregation analogues of Condorcet’s paradox (e.g., Nehring and Puppe 2010; Dietrich and List 2007). Formally, a sub-agenda X is non-simple if it has a minimal inconsistent subset $Y\subseteq X$ of more than two (but not uncountably many) events, and simple otherwise. (A set Y is minimal inconsistent if it is inconsistent but all its proper subsets are consistent.) For example, the sub-agenda $X=\{A_{1} ,A_{1}^{c},A_{2},A_{2}^{c},A_{3},A_{3}^{c}\}$ in our climate-change example is non-simple, since its three-element subset $Y=\{A_{1},A_{2},A_{3}^{c}\}$ is minimal inconsistent. By contrast, a sub-agenda of the form $X=\{A,A^{c}\}$ is simple.
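Non-simplicity can likewise be tested by brute force on finite examples. The sketch below rests on a modelling assumption of ours: the climate-change sub-agenda is represented by truth-value triples $(a_{1},a_{2},a_{3})$, excluding only the combination in which $A_{1}$ and $A_{2}$ hold but $A_{3}$ fails. The function names are hypothetical.

```python
from itertools import combinations

def consistent(events, omega):
    """A set of events is consistent if its intersection is non-empty."""
    common = frozenset(omega)
    for e in events:
        common &= e
    return bool(common)

def minimal_inconsistent_subsets(agenda, omega):
    """Yield inconsistent subsets all of whose proper subsets are consistent.
    Consistency is inherited by subsets, so removing one element suffices."""
    events = list(agenda)
    for size in range(1, len(events) + 1):
        for y in combinations(events, size):
            if not consistent(y, omega) and all(
                    consistent(y[:i] + y[i + 1:], omega) for i in range(size)):
                yield set(y)

def is_non_simple(agenda, omega):
    return any(len(y) > 2 for y in minimal_inconsistent_subsets(agenda, omega))

# Assumed model of the climate-change example: worlds are triples of
# truth values, with (1, 1, 0) ruled out.
worlds = [(a1, a2, a3) for a1 in (0, 1) for a2 in (0, 1) for a3 in (0, 1)
          if not (a1 and a2 and not a3)]

def event(k, value):
    return frozenset(w for w in worlds if w[k] == value)

A1, A2, A3 = (event(k, 1) for k in range(3))
A1c, A2c, A3c = (event(k, 0) for k in range(3))
climate = {A1, A1c, A2, A2c, A3, A3c}
print(is_non_simple(climate, worlds), is_non_simple({A1, A1c}, worlds))
```

In this model the three-element set $\{A_{1},A_{2},A_{3}^{c}\}$ is found among the minimal inconsistent subsets, in line with the example in the text, whereas a sub-agenda of the form $\{A,A^{c}\}$ has only the two-element minimal inconsistent subset $\{A,A^{c}\}$.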

Theorem 5

(a) For any non-simple (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $, every pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and conditional consensus preservation on X is linear on X.
(b) For any simple sub-agenda X of the $\sigma $-algebra $\Sigma $ (where X is finite and distinct from $\{\varnothing ,\Omega \}$), there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and conditional consensus preservation on X but violating linearity on X.

Finally, if we impose only the weakest of our three consensus-preservation requirements—consensus preservation on X—then the linearity conclusion follows only if the sub-agenda X is path-connected and satisfies an additional condition. A sufficient such condition is ‘partitionality’. A sub-agenda X is partitional if some subset $Y\subseteq X$ partitions $\Omega $ into at least three non-empty events (where Y is finite or countably infinite), and non-partitional otherwise. As an illustration, recall our earlier example of a sub-agenda given by $X=\{A,A^{c} ,B,B^{c},C,C^{c}\}$, where $\{A,B,C\}$ partitions $\Omega $ (with $A,B,C\ne \varnothing $). This sub-agenda is both path-connected (as mentioned above) and partitional.
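Partitionality is also easy to test on finite examples. The following sketch assumes a finite $\Omega $ with events as frozensets; the function name `is_partitional` is hypothetical. It uses the fact that a family of non-empty events partitions $\Omega $ exactly when the events cover $\Omega $ and their sizes add up to $\left| \Omega \right| $.

```python
from itertools import combinations

def is_partitional(agenda, omega):
    """Some subset of the agenda partitions omega into >= 3 non-empty events."""
    full = frozenset(omega)
    events = [e for e in agenda if e]  # discard the empty event
    for size in range(3, len(events) + 1):
        for y in combinations(events, size):
            covers = frozenset().union(*y) == full
            disjoint = sum(len(e) for e in y) == len(full)
            if covers and disjoint:
                return True
    return False

# The partition example: A = {1}, B = {2}, C = {3} on Omega = {1, 2, 3}.
omega = {1, 2, 3}
partition_agenda = {frozenset(s)
                    for s in ({1}, {2}, {3}, {2, 3}, {1, 3}, {1, 2})}
binary_agenda = {frozenset({1}), frozenset({2, 3})}  # only {A, A^c}
print(is_partitional(partition_agenda, omega),
      is_partitional(binary_agenda, omega))
```

A sub-agenda of the form $\{A,A^{c}\}$ can never be partitional, since it offers at most a two-cell partition.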

Theorem 6

(a) For any path-connected and partitional (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $, every pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and consensus preservation on X is linear on X.
(b) For any non-path-connected (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $, there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X and consensus preservation on X but violating linearity on X.

It is clear from part (b) that path-connectedness of the premises is necessary for the linearity conclusion to follow. The other condition, partitionality, is not necessary. But it is not redundant:

Proposition 2

For some path-connected and non-partitional (finite) sub-agenda X of the $\sigma $-algebra $\Sigma $, there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence on X (even neutrality on X) and consensus preservation on X but violating linearity on X.^{Footnote 13}

7 Classic results as special cases

As should be evident, if we apply our results to the maximal sub-agenda $X=\Sigma $, we obtain classic results (by Aczél and Wagner 1980; McConway 1981) as special cases. To see why this is the case, note three things. First, when $X=\Sigma $, our various conditions on the sub-agenda X all reduce to a single condition on the size of the $\sigma $-algebra $\Sigma $.

Lemma 1

For the maximal sub-agenda $X=\Sigma $ (where $\Sigma \ne \{\Omega ,\varnothing \}$), the conditions of non-nestedness, non-simplicity, path-connectedness, and partitionality are all equivalent, and they all hold if and only if $\left| \Sigma \right| >4$, i.e., if and only if $\Sigma $ is not of the form $\{A,A^{c},\Omega ,\varnothing \} $.

Second, when $X=\Sigma $, independence, neutrality, and linearity on X reduce to independence, neutrality, and linearity in the familiar ‘global’ sense, as already noted. Third, our various consensus-preservation requirements all become equivalent, by Proposition 1.

In consequence, our six theorems reduce to two classic results:^{Footnote 14}

Theorems 1–3 reduce to the result that all pooling functions satisfying independence and consensus preservation are neutral if $\left| \Sigma \right| >4$, but not if $\left| \Sigma \right| =4$;
Theorems 4–6 reduce to the result that all pooling functions satisfying independence and consensus preservation are linear if $\left| \Sigma \right| >4$, but not if $\left| \Sigma \right| =4$.

The case $\left| \Sigma \right| <4$ is uninteresting because it means that $\Sigma $ is the trivial $\sigma $-algebra $\{\Omega ,\varnothing \}$. Let us slightly re-formulate these two results:

Corollary 1

For the $\sigma $-algebra $\Sigma $,

(a) if $\left| \Sigma \right| >4$, every pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence and consensus preservation is linear (and by implication neutral);
(b) if $\left| \Sigma \right| =4$, there exists a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ satisfying independence and consensus preservation but violating neutrality (and thereby also violating linearity).
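Part (b) can be illustrated with a small numerical sketch. The construction below is our own illustration rather than the paper's proof: for $\Sigma =\{\varnothing ,A,A^{c},\Omega \}$ it borrows the squared-average decision criteria mentioned in the note on Theorem 1(b), using one criterion for A and the dual criterion for $A^{c}$. The function names `d_a`, `d_a_c`, and `pool` are hypothetical.

```python
from fractions import Fraction as F

def d_a(t):
    """Local criterion for A: the squared average of the P_i(A)."""
    return (sum(t) / len(t)) ** 2

def d_a_c(s):
    """Local criterion for A^c, applied to the P_i(A^c)."""
    return 1 - (sum(1 - x for x in s) / len(s)) ** 2

def pool(probs_of_a):
    """Collective probabilities of A and A^c, each computed only from the
    individual probabilities of that same event (independence)."""
    probs_of_a_c = [1 - t for t in probs_of_a]
    return {"A": d_a(probs_of_a), "A^c": d_a_c(probs_of_a_c)}

half = [F(1, 2), F(1, 2)]
collective = pool(half)
print(collective["A"], collective["A^c"])
```

The two collective probabilities always sum to 1, and unanimous certainty of either event is preserved, so the pooling function is well defined, independent, and consensus-preserving. It is not neutral, because the two criteria disagree on the same input: when every individual assigns probability 1/2, the criterion for A returns 1/4 while the criterion for $A^{c}$ returns 3/4.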

Notes

The correlation might be due to causal effects between, or common causes of, rain and heat.
One could construct basic events from non-basic events, using the operations of negation and disjunction. Formally, while the basic events typically form a generating system of the $\sigma $-algebra, there exist many alternative generating systems, and usually none of them is canonical in a technical sense. The task of determining the ‘basic’ events therefore involves some interpretation and may be context-dependent and open to disagreement. One might, however, employ a syntactic criterion which counts an event as ‘basic’ if, in a suitable language (perhaps one deemed ‘natural’), it can be expressed by an atomic sentence (one that is not a combination of other sentences using Boolean connectives). An event expressible by the sentence ‘it will rain or it will snow’ would then count as non-basic. This syntactic criterion relies on our choice of language, which, though not a purely technical matter, is arguably not ad hoc. An n-place connective (e.g., the two-place connective ‘or’) is called Boolean or truth-functional if the truth-value of every sentence constructed by applying this connective to n other sentences is determined by the truth values of the latter sentences. For instance, ‘or’ is Boolean since ‘p or q’ is true if and only if ‘p’ is true or ‘q’ is true. Many languages, especially ones that mimic natural language, contain non-Boolean connectives, for instance non-material conditionals for which the truth-value of ‘if p then q’ is not always determined by the truth-values of p and q. When the sentence ‘if p then q’ is not truth-functionally decomposable, an event represented by it would count as ‘basic’ under the present syntactic criterion. The sentence ‘${\hbox {CO}_{2}}$ emissions cause global warming’ can be viewed as the non-material (specifically, causal) conditional ‘if p then q’, hence would describe a basic event. See Priest (2001) for an introduction to non-classical logic.
The terminology ‘premise’ is still justified, though not in the sense of ‘premise of individual probability assignment’ (since we no longer assume that premises are basic in the individuals’ formation of probabilistic beliefs), but in the sense of ‘premise of collective probability assignment’ (because the collective probabilities for these events are determined independently of the probabilities of other events and then constrain other collective probabilities).
Historically, linear pooling goes back at least to Stone (1961). Linear pooling is by no means the only plausible way to aggregate subjective probabilities. Other approaches include geometric and, more generally, externally Bayesian pooling (e.g., McConway 1978; Genest 1984b; Genest et al. 1986; Russell et al. 2015; Dietrich 2016), multiplicative pooling (Dietrich 2010; Dietrich and List 2016), supra-Bayesian pooling (e.g., Morris 1974), and pooling of ordinal probabilities (Weymark 1997). For literature reviews, see Genest and Zidek (1986), Clemen and Winkler (1999) and Dietrich and List (2016).
These auxiliary assumptions might be given exogenously; or they might be determined endogenously based on the experts’ probability functions (e.g., based on how dependent or independent the conjuncts are according to these probability functions).
Equivalently, one can demand the preservation of any unanimously assigned probability 0.
In Part I, we make the opposite move of extending consensus preservation to events outside the agenda, i.e., we extend it to events constructible from events in the agenda using conjunction (intersection), disjunction (union), or negation (complementation). In the present paper, there is no point in extending consensus preservation to other events, since there are no events outside the agenda constructible from events in it (as a $\sigma $-algebra, the agenda is closed under the relevant operations).
We are indebted to Richard Bradley for suggesting this formulation of the requirement.
For those special profiles of individual probability functions for which the collective probabilities for $(-\infty ,-1]$ and $(-\infty ,+1]$ coincide or one of them is zero or one, there is no such normal distribution. A different, non-normal extension must then be used.
The finiteness assumption in Theorems 1(a), 1(b), 2(a), 2(b), 3(a), 4(a), 4(b), 5(a), and 6(a) could be replaced by the assumption that the $\sigma $-algebra generated by X is $\Sigma $ (rather than a sub-$\sigma $-algebra of $\Sigma $). It might be that some of these finiteness assumptions (or their substitutes)—especially in Theorems 1(b), 2(b), and 4(b)—could be dropped.
For example, for every event of the form $A=(-\infty ,\omega ]$, we might use the decision criterion defined by $D_{A}(t_{1},\ldots ,t_{n} )=(\frac{1}{n}t_{1}+\cdots +\frac{1}{n}t_{n})^{2}$, and for every event of the form $A=(\omega ,\infty )$, we might use the decision criterion defined by $D_{A}(t_{1},\ldots ,t_{n})=1-(\frac{1}{n}(1-t_{1})+\cdots +\frac{1}{n} (1-t_{n}))^{2}$.
To be precise, path-connectedness implies non-simplicity as long as $X\ne \{\varnothing ,\Omega \}$.
In this proposition, we assume that the agenda $\Sigma $ is not very small, i.e., contains more than $2^{3}=8$ events (e.g., $\Sigma =2^{\Omega }$ with $\left| \Omega \right| >3$). Note that, as $\Sigma $ is a $\sigma $-algebra, it has the size $2^{k}$ for some $k\in \{1,2,3,\ldots \}$ or is infinite.
We require no restriction to a finite $\Sigma $, as observed in footnote 10.
In this case, each opinion function in ${\mathcal {P}}_{X}$ is extendable not just to a probability function on $\sigma (X)$, but also to one on $\Sigma $. Probability theorists will be aware that the extendability of a probability function to a larger $\sigma $-algebra cannot be taken for granted.
In that proof it suffices to choose the $Q_{A}$s appropriately, since each $Q_{A}$ equals $P(\cdot |A)$, provided $P(A)\ne 0$.
That proof took the four mentioned events to be singleton, but nothing depends on this.

References

Aczél J, Wagner C (1980) A characterization of weighted arithmetic means. SIAM J Algebra Discr Methods 1(3):259–260
Aczél J, Ng CT, Wagner C (1984) Aggregation theorems for allocation problems. SIAM J Algebra Discr Methods 5(1):1–8
Chambers C (2007) An ordinal characterization of the linear opinion pool. Econ Theor 33(3):457–474
Clemen RT, Winkler RL (1999) Combining probability distributions from experts in risk analysis. Risk Anal 19(2):187–203
Dietrich F (2006) Judgment aggregation: (im)possibility theorems. J Econ Theory 126(1):286–298
Dietrich F (2010) Bayesian group belief. Soc Choice Welf 35(4):595–626
Dietrich F (2016) A theory of Bayesian groups. Working paper
Dietrich F, List C (2007) Arrow’s theorem in judgment aggregation. Soc Choice Welf 29(1):19–33
Dietrich F, List C (2016) Probabilistic opinion pooling. In: Hitchcock C, Hajek A (eds), Oxford Handbook of Probability and Philosophy, Oxford University Press, Oxford
Dietrich F, List C (2017) Probabilistic opinion pooling generalized. Part one: general agendas. Soc Choice Welf. doi:10.1007/s00355-017-1034-z (this issue)
Dietrich F, Mongin P (2010) The premise-based approach to judgment aggregation. J Econ Theory 145(2):562–582
Genest C (1984a) Pooling operators with the marginalization property. Can J Stat 12(2):153–163
Genest C (1984b) A characterization theorem for externally Bayesian groups. Ann Stat 12(3):1100–1105
Genest C, McConway KJ, Schervish MJ (1986) Characterization of externally Bayesian pooling operators. Ann Stat 14(2):487–501
Genest C, Zidek JV (1986) Combining probability distributions: a critique and annotated bibliography. Stat Sci 1(1):114–135
Kornhauser LA, Sager LG (1986) Unpacking the Court. Yale Law J 96(1):82–117
List C, Pettit P (2002) Aggregating sets of judgments: an impossibility result. Econ Philos 18(1):89–110
List C (2006) The discursive dilemma and public reason. Ethics 116(2):362–402
McConway KJ (1978) The combination of experts’ opinions in probability assessments: some theoretical considerations. Ph.D. thesis, University College London
McConway KJ (1981) Marginalization and linear opinion pools. J Am Stat Assoc 76(374):410–414
Mongin P (1995) Consistent Bayesian aggregation. J Econ Theory 66:313–351
Mongin P (2005) Spurious unanimity and the Pareto principle. LSE Choice Group working paper series, London School of Economics
Mongin P (2008) Factoring out the impossibility of logical aggregation. J Econ Theory 141(1):100–113
Morris PA (1974) Decision analysis expert use. Manag Sci 20:1233–1241
Nehring K, Puppe C (2010) Abstract Arrovian aggregation. J Econ Theory 145(2):467–494
Pettit P (2001) Deliberative democracy and the discursive dilemma. Philos Issue 11:268–299
Priest G (2001) An introduction to non-classical logic. Cambridge University Press, Cambridge
Russell JS, Hawthorne J, Buchak L (2015) Groupthink. Philos Stud 172(5):1287–1309
Stone M (1961) The opinion pool. Ann Math Stat 32(4):1339–1342
Wagner C (1982) Allocation, Lehrer models, and the consensus of probabilities. Theor Decis 14(2):207–220
Wagner C (1985) On the formal properties of weighted averaging as a method of aggregation. Synthese 62(1):97–108
Weymark J (1997) Aggregating ordinal probabilities on finite sets. J Econ Theory 75(2):407–432

Author information

Authors and Affiliations

Paris School of Economics and CNRS, Paris, France
Franz Dietrich
London School of Economics, London, UK
Christian List

Corresponding author

Correspondence to Christian List.

Additional information

We are grateful to the referees and the editor for very helpful and detailed comments. Although we are jointly responsible for this work, Christian List wishes to note that Franz Dietrich should be considered the lead author, to whom the credit for the present mathematical proofs is due. This paper is the second of two self-contained, but technically related companion papers inspired by binary judgment-aggregation theory. Both papers build on our earlier, unpublished paper ‘Opinion pooling on general agendas’ (September 2007). Dietrich was supported by a Ludwig Lachmann Fellowship at the LSE and the French Agence Nationale de la Recherche (ANR-12-INEG-0006-01). List was supported by a Leverhulme Major Research Fellowship (MRF-2012-100) and a Harsanyi Fellowship at the Australian National University, Canberra.

Appendix A: Proofs

We now give all proofs. In Sect. A.1, we prove parts (a) of all our theorems by reducing them to results in Part I. In Sects. A.2, A.3, A.4, and A.5, we prove parts (b) of Theorems 1, 3, 4, and 5. Parts (b) of Theorems 2 and 6 require no separate proofs: Theorem 2(b) follows from Theorem 1(b) (since consensus preservation implies conditional consensus preservation on X by Proposition 1) and Theorem 6(b) follows from Theorem 3(b) (since non-neutrality on X implies non-linearity on X). In Sect. A.6, we prove Proposition 2.

A.1 Proof of part (a) of each theorem

We now prove Theorems 1(a) to 6(a). To do so, we first relate premise-based opinion pooling to opinion pooling on a general agenda as introduced in Part I. We begin by generalizing the present paper’s framework to agendas that need not be $\sigma $-algebras. In general, an agenda is a non-empty set X of events (each of which is of the form $A\subseteq \Omega $), where X is closed under complementation (i.e., $A\in X\Leftrightarrow A^{c}\in X $). It contains the events on which opinions are formed. Given an agenda X, an opinion function is a function $P:X\rightarrow [0,1]$ which is coherent, i.e., extendable to a probability function on the $\sigma $-algebra $\sigma (X)$ generated by X (i.e., the smallest $\sigma $-algebra which includes X, constructible by closing X under countable unions and complements). Let ${\mathcal {P}}_{X}$ be the set of all opinion functions for agenda X. If X is a $\sigma $-algebra, ${\mathcal {P}}_{X}$ consists of all probability functions on X, in line with the notation used above. An opinion pooling function for agenda X is a function ${\mathcal {P}} _{X}^{n}\rightarrow {\mathcal {P}}_{X}$ which assigns to each profile $(P_{1},\ldots ,P_{n})$ of individual opinion functions a collective opinion function, usually denoted $P_{P_{1},\ldots ,P_{n}}$. We call the pooling function linear and neutral, respectively, if it is linear and neutral on X in line with the definition above.

Crucially, a pooling function for a $\sigma $-algebra $\Sigma $ induces new pooling functions for any sub-agendas X on which it is independent. Formally, a pooling function $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ for agenda $\Sigma $ is said to induce the pooling function $F^{\prime }:{\mathcal {P}}_{X}^{n}\rightarrow {\mathcal {P}}_{X}$ for (sub-)agenda X if F and $F^{\prime }$ generate the same collective opinions within X, i.e.,

$$\begin{aligned} F^{\prime }(P_{1}|_{X},\ldots ,P_{n}|_{X})=F(P_{1},\ldots ,P_{n})|_{X}\text { for all }P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma } \end{aligned}$$

(and if, in addition, ${\mathcal {P}}_{X}=\{P|_{X}:P\in {\mathcal {P}}_{\Sigma }\}$, where this addition holds automatically whenever X is finite or $\sigma (X)=\Sigma $).^{Footnote 15} Our axiomatic requirements on a pooling function for agenda $\Sigma $—i.e., independence on a sub-agenda X and various consensus requirements—should be compared with the following requirements on a pooling function for the agenda X (introduced and discussed in Part I). The first two requirements are unrestricted versions of independence and consensus preservation:

Independence For each event $A\in X$, there exists a function $D_{A}:[0,1]^{n}\rightarrow [0,1]$ (the local pooling criterion for A) such that $P_{P_{1},\ldots ,P_{n}}(A)=D_{A}(P_{1} (A),\ldots ,P_{n}(A))$ for any $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{X}$.

Consensus preservation For all $A\in X$ and $P_{1} ,\ldots ,P_{n}\in {\mathcal {P}}_{X}$, if $P_{i}(A)=1$ for all individuals i, then $P_{P_{1},\ldots ,P_{n}}(A)=1$.

Note the following criterion for the existence of induced pooling functions:

Lemma 2

(cf. Part I, Lemma 14) If a pooling function for a $\sigma $-algebra $\Sigma $ is independent on a sub-agenda X (where X is finite or $\sigma (X)=\Sigma $), then it induces a pooling function for agenda X.

The next two axiomatic requirements are two different extensions of consensus preservation, namely to either implicitly revealed or unrevealed beliefs. An individual i’s explicitly revealed beliefs are given by the individual’s submitted opinion function $P_{i}$. Her implicitly revealed beliefs are given by the probabilities of events in $\sigma (X)\backslash X$ which are implied by her explicitly revealed beliefs, i.e., hold under every extension of $P_{i}$ to a probability function on $\sigma (X)$. If, for instance, $P_{i}$ assigns probability 1 to $A\in X$, then the agent implicitly reveals certainty of all events $B\supseteq A$ in $\sigma (X)\backslash X$. The following axiom extends consensus preservation to implicitly revealed beliefs:

Implicit consensus preservation For all $A\in \sigma (X)$ and all $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{X}$, if each $P_{i}$ implies certainty of A (i.e., $\overline{P}_{i}(A)=1$ for every extension $\overline{P}_{i}$ of $P_{i}$ to a probability function on $\sigma (X)$), then so does $P_{P_{1} ,\ldots ,P_{n}}$.

By contrast, individual i’s unrevealed beliefs are any probabilistic beliefs which she privately holds relative to events in $\sigma (X)\backslash X$ and which cannot be inferred from the submitted opinion function $P_{i}$ because different extensions of $P_{i}$ assign different probabilities to the events in question. The following axiom requires the collective opinion function to be compatible with any unanimously held certainty of an event—including any unrevealed certainty, which is not implied by the submitted opinion functions but is consistent with them. This ensures that no consensus (not even an unrevealed consensus) is ever overruled.

Consensus compatibility For all $A\in \sigma (X)$ and all $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{X}$, if each $P_{i}$ is consistent with certainty of A (i.e., $\overline{P}_{i}(A)=1$ for some extension $\overline{P}_{i}$ of $P_{i}$ to a probability function on $\sigma (X)$), then so is $P_{P_{1},\ldots ,P_{n}}$.

A final requirement pertains to conditional beliefs. Note that, based on individual i’s opinion function $P_{i}$, the conditional belief $P_{i}(A|B)=P_{i}(A\cap B)/P_{i}(B)$ of one agenda event A given another B (where $P_{i}(B)\ne 0$) may be undefined, since we may have $A\cap B\not \in X$ so that $P_{i}(A\cap B)$ is undefined. Hence, if the agent happens to be privately certain of A given B, then this conditional certainty may be unrevealed. Our axiom of conditional consensus compatibility requires that any (possibly unrevealed) unanimous conditional certainty should not be overruled. In fact, we require something subtly stronger: any set of (possibly unrevealed) unanimous conditional certainties should not be overruled (see Part I for details).

Conditional consensus compatibility For all $P_{1} ,\ldots ,P_{n}\in {\mathcal {P}}_{X}$, and all finite sets S of pairs (A, B) of events in X, if every opinion function $P_{i}$ is consistent with certainty of A given B for all (A, B) in S (i.e., some extension $\overline{P}_{i}$ of $P_{i}$ to a probability function on $\sigma (X)$ satisfies $\overline{P}_{i}(A|B)=1$ for all pairs $(A,B)\in S$ such that $P_{i}(B)\ne 0$), then so is the collective opinion function $P_{P_{1},\ldots ,P_{n}}$.

The following lemma shows how properties of a pooling function for a $\sigma $-algebra translate into corresponding properties of an induced pooling function for a sub-agenda:

Lemma 3

(cf. Part I, Lemma 12) Suppose pooling function F for $\sigma $-algebra $\Sigma $ induces pooling function $F^{\prime }$ for sub-agenda X (where X is finite or $\sigma (X)=\Sigma $). Then:

$F^{\prime }$ is independent (respectively neutral, linear) if and only if F is independent (respectively neutral, linear) on X,
$F^{\prime }$ is consensus-preserving if and only if F is consensus-preserving on X,
$F^{\prime }$ is consensus-compatible if F is consensus-preserving,
$F^{\prime }$ is conditional-consensus-compatible if F is conditional-consensus-preserving on X.

This lemma follows from a more general result:

Lemma 4

(cf. Part I, Lemma 13) Consider a $\sigma $-algebra $\Sigma $ and a sub-agenda X (where X is finite or $\sigma (X)=\Sigma $). Any pooling function for X is

(a) induced by some pooling function for agenda $\Sigma $,
(b) independent (respectively neutral, linear) if and only if every inducing pooling function for agenda $\Sigma $ is independent (respectively neutral, linear) on X, where ‘every’ can be replaced by ‘some’,
(c) consensus-preserving if and only if every inducing pooling function for agenda $\Sigma $ is consensus-preserving on X, where ‘every’ can be replaced by ‘some’,
(d) consensus-compatible if and only if some inducing pooling function for agenda $\Sigma $ is consensus-preserving,
(e) conditional-consensus-compatible if and only if some inducing pooling function for agenda $\Sigma $ is conditional-consensus-preserving on X

(where in (d) and (e) the ‘only if’ claim assumes that X is finite).

Proof of parts (a) of Theorems 1–6

Using the above translation machinery, one can reduce Theorem 1(a) to Part I’s Theorem 1(a), Theorem 2(a) to Part I’s Theorem 2(a), and so on until Theorem 6(a). Since the reduction is analogous for each theorem, we only spell it out for Theorem 1. Let X be a non-nested finite sub-agenda of the $\sigma $-algebra agenda $\Sigma $, and let $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ be independent on X and (globally) consensus preserving. By Lemma 2, F induces a pooling function $F^{\prime }$ for agenda X, which is independent and consensus-compatible by Lemma 3, hence neutral by Part I’s Theorem 1(a). So F is neutral on X by Lemma 3. $\square $

A.2 Proof of Theorem 1(b)

We now write $\mathbf {1}$ and $\mathbf {0}$ for the n-dimensional vectors $(1,\ldots ,1)$ and $(0,\ldots ,0)$, respectively. We draw on a measure-theoretic fact:

Lemma 5

(cf. Part I, Lemma 15) Every probability function on a finite sub-$\sigma $-algebra of $\sigma $-algebra $\Sigma $ can be extended to a probability function on $\Sigma $.

Proof of Theorem 1(b)

Consider a finite nested sub-agenda $X\ne \{\varnothing ,\Omega \}$ of the $\sigma $-algebra agenda $\Sigma $. (As will become clear, finiteness could be replaced by the assumption that $\sigma (X)=\Sigma $. Under this alternative assumption, the ‘Claim’ below can be skipped, and the rest of the proof remains almost unaffected.) We construct a pooling function $(P_{1},\ldots ,P_{n})\mapsto P_{P_{1},\ldots ,P_{n}}$ for agenda $\Sigma $ with all relevant properties. Without loss of generality, let $\varnothing ,\Omega \in X$.

Claim. If Theorem 1(b) holds in the case that $\sigma (X)=\Sigma $, then it holds in general.

Let Theorem 1(b) hold in the special case. Let $\Sigma ^{\prime }:=\sigma (X) $ ($\subseteq \Sigma $). By assumption, there is a pooling function $F^{\prime }:{\mathcal {P}}_{\Sigma ^{\prime }}^{n}\rightarrow {\mathcal {P}}_{\Sigma ^{\prime }}$ with all relevant properties. Let $\mathcal {A}$ be the set of atoms of the (finite) $\sigma $-algebra $\Sigma ^{\prime }$. We define $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ as follows. Consider $P_{1},\ldots ,P_{n} \in {\mathcal {P}}_{\Sigma }$. Let $P^{\prime }:=F^{\prime }(P_{1}|_{\Sigma ^{\prime } },\ldots ,P_{n}|_{\Sigma ^{\prime }})$. For all $A\in \mathcal {A}$ such that $P^{\prime }(A)\ne 0$, there is an individual $i_{A}$ such that $P_{i_{A} }(A)\ne 0$, since otherwise everyone assigns probability one to $\Omega \backslash A$ while $P^{\prime }(\Omega \backslash A)\ne 1$, violating consensus-preservation. By Lemma 5, $P^{\prime }$ can be extended to a probability function P on $\Sigma $. As is clear from that lemma’s proof (in Part I), we may assume without loss of generality that^{Footnote 16}

$$\begin{aligned} P(\cdot |A)=P_{i_{A}}(\cdot |A)\quad \text { for each } \quad A\in \mathcal {A} \quad \text { such that }\quad P(A)\ne 0. \end{aligned}$$

Now let $F(P_{1},\ldots ,P_{n})$ be this P. It remains to show that the pooling function F just defined inherits all relevant properties from $F^{\prime }$. This is clear for independence on X and non-neutrality on X. To show that F is (globally) consensus-preserving, consider $B\in \Sigma $ and $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$ such that $P_{1}(B)=\cdots =P_{n}(B)=1$. To show that $P(B)=1$, where $P:=F(P_{1},\ldots ,P_{n})$, note first that $P(B)=\sum _{A\in {\mathcal {A}}:P(A)\ne 0}P(B|A)P(A)$. Here (in the notation above) each P(B|A) equals $P_{i_{A}}(B|A)$, which equals 1 as $P_{i_{A} }(B)=1$. So $P(B)=\sum _{A\in {\mathcal {A}}:P(A)\ne 0}P(A)=1$. This proves the claim.
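The extension step in this claim can be illustrated on a finite example. The following is only a minimal sketch under stated assumptions: a four-point $\Omega $, a two-atom sub-$\sigma $-algebra, an illustrative coarse pool, and the hypothetical names `extend`, `atoms` and `coarse`, none of which come from the paper. It copies, for each atom A with positive collective probability, the conditional distribution $P_{i_{A}}(\cdot |A)$ of some individual with $P_{i_{A}}(A)\ne 0$, and then checks that unanimous probability 1 for an event B is preserved.

```python
from fractions import Fraction as Fr

def extend(coarse, atoms, profiles):
    """Extend `coarse` (masses on the atoms of a finite sub-algebra) to
    point masses on a finite Omega, copying for each atom A with positive
    mass the conditional distribution P_i(.|A) of some individual i
    with P_i(A) != 0."""
    fine = {}
    for name, atom in atoms.items():
        for w in atom:
            fine[w] = Fr(0)
        if coarse[name] == 0:
            continue
        # Consensus preservation of the coarse pool guarantees such a donor;
        # if none existed, next() would raise StopIteration.
        donor = next(p for p in profiles if sum(p[w] for w in atom) != 0)
        donor_mass = sum(donor[w] for w in atom)
        for w in atom:
            fine[w] = coarse[name] * donor[w] / donor_mass
    return fine

# Assumed example: Omega = {0, 1, 2, 3}, atoms A1 = {0, 1} and A2 = {2, 3}.
atoms = {"A1": (0, 1), "A2": (2, 3)}
P1 = {0: Fr(1, 2), 1: Fr(0), 2: Fr(1, 2), 3: Fr(0)}
P2 = {0: Fr(1, 4), 1: Fr(0), 2: Fr(3, 4), 3: Fr(0)}
coarse = {"A1": Fr(3, 8), "A2": Fr(5, 8)}  # illustrative coarse pool
P = extend(coarse, atoms, [P1, P2])

# Both individuals assign probability 1 to B = {0, 2}; so does the extension.
print(P[0] + P[2])
```

In this sketch the donor is simply the first individual with positive probability on the atom; the proof only needs that some such individual exists.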

Now let $\sigma (X)=\Sigma $, drawing on the above ‘Claim’. As X is nested, we may express it as $X=\{A,A^{c}:A\in X_{+}\}$ for some subset $X_{+}\subseteq X$ which is linearly ordered and contains both $\varnothing $ and $\Omega $.

As an ingredient of our construction, we consider any pooling function for agenda $\Sigma $ which is neutral (at least) on X and consensus-preserving and whose pooling criterion on X, denoted $D:[0,1]^{n}\rightarrow [0,1]$, is at least weakly increasing in each argument. (For instance, we might use dictatorship by individual 1, given by $(P_{1},\ldots ,P_{n})\mapsto P_{1}$, with pooling criterion given by $D(t_{1},\ldots ,t_{n})=t_{1}$.) As $X\ne \{\varnothing ,\Omega \}$, there is some $A\in X\backslash \{\Omega ,\varnothing \}$. As $A\ne \Omega ,\varnothing $, there are $P_{1},\ldots ,P_{n} \in {\mathcal {P}}_{\Sigma }$ which all assign probability $1/2$ to A (hence to $A^{c}$), so that the collective probabilities of A and of $A^{c}$ are each given by $D(1/2,\ldots ,1/2)$. As these probabilities sum to 1, it follows that

$$\begin{aligned} D(1/2,1/2,\ldots ,1/2)=1/2. \end{aligned}$$

(1)

We now transform this pooling function, which is neutral on X, into a pooling function $(P_{1},\ldots ,P_{n})\mapsto P_{P_{1},\ldots ,P_{n}}$ which is non-neutral on X, but still independent on X and consensus-preserving. To do so, we consider a function $T:[0,1]\rightarrow [0,1]$ such that (i) $T(1/2)\ne 1/2$, (ii) $T(0)=0$ and $T(1)=1$, (iii) T is at least weakly increasing, and (iv) T is Lipschitz continuous, i.e., there is a $K>0$ such that $\left| T(x)-T(y)\right| \le K\left| x-y\right| $ for all $x,y\in [0,1]$. (T could be defined by $T(x)=\min \{2x,1\}$.) Now consider any $ P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$. We have to define $P_{P_{1},\ldots ,P_{n}}$. We write P for the result of applying the neutral pooling function to $(P_{1},\ldots ,P_{n})$. To anticipate, our definition will imply that

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(C)=T(P(C)) \quad \text { whenever } \quad C\in X_{+}. \end{aligned}$$

As a first step towards our definition, we define $P_{P_{1},\ldots ,P_{n}}$ on the subdomain

$$\begin{aligned} {{\widetilde{X}}}:=\{A\cap B:A,B\in X\}=\{B\backslash A:A,B\in X_{+} \quad \text { such that } \quad A\subseteq B\}. \end{aligned}$$

The restriction of $P_{P_{1},\ldots ,P_{n}}$ to ${\widetilde{X}}$, to be denoted g, is defined as follows. Each $C\in {\widetilde{X}}$ is uniquely representable as $C=B\backslash A$ with $A,B\in X_{+}$ and $A\subseteq B$ (and $A=B=\varnothing $ if $C=\varnothing $), and we let

$$\begin{aligned} g(C)&=T(P(B))-T(P(A))\\&=T(D(P_{1}(B),\ldots ,P_{n}(B)))-T(D(P_{1}(A),\ldots ,P_{n}(A))). \end{aligned}$$

It follows that

$$\begin{aligned} g(C)=\left\{ \begin{array} [c]{ll} T(P(C))=T(D(P_{1}(C),\ldots ,P_{n}(C))) &{}\quad \text {if}\,\, C\in X_{+}\\ 1-T(P(C^{c}))=1-T(D(P_{1}\left( C^{c}\right) ,\ldots ,P_{n}\left( C^{c}\right) )) &{}\quad \text {if}\,\, C\in X\backslash X_{+}, \end{array} \right. \end{aligned}$$

(2)

because, firstly, each $C\in X_{+}$ can be written as $C\backslash \varnothing $ where $C,\varnothing \in X_{+}$, and, secondly, each $C\in X\backslash X_{+}$ can be written as $\Omega \backslash C^{c}$ where $\Omega ,C^{c}\in X_{+}$ and where $T(P(\Omega ))=T(1)=1$.
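The definition of g can be made concrete on a small nested agenda. The sketch below rests on assumptions that are not in the paper: $\Omega =\{0,1,2,3\}$, the chain $X_{+}$ of initial segments, equal-weight linear pooling as the neutral ingredient D, the suggested $T(x)=\min \{2x,1\}$, and the hypothetical names `CHAIN`, `prob` and `steps`. It computes g on the successive differences of the chain and on one event of $X\backslash X_{+}$.

```python
from fractions import Fraction as Fr

def T(x):
    # Transformation suggested in the text: T(x) = min{2x, 1}.
    return min(2 * x, Fr(1))

def prob(p, event):
    # Probability of a finite event under point masses p.
    return sum((p[w] for w in event), Fr(0))

def D(ts):
    # Assumed neutral ingredient: equal-weight linear pooling criterion.
    return sum(ts, Fr(0)) / len(ts)

# X_+ as a chain: {} c {0} c {0,1} c {0,1,2} c Omega.
CHAIN = [frozenset(range(k)) for k in range(5)]

def g(A, B, profiles):
    """g(B \\ A) = T(D(P_1(B),...,P_n(B))) - T(D(P_1(A),...,P_n(A)))
    for A, B in X_+ with A a subset of B."""
    tB = T(D([prob(p, B) for p in profiles]))
    tA = T(D([prob(p, A) for p in profiles]))
    return tB - tA

P1 = {w: Fr(1, 4) for w in range(4)}
P2 = {0: Fr(1, 2), 1: Fr(1, 2), 2: Fr(0), 3: Fr(0)}
profiles = [P1, P2]

# g on the successive differences {0}, {1}, {2}, {3} of the chain.
steps = [g(CHAIN[k], CHAIN[k + 1], profiles) for k in range(4)]
# g on the event {1, 2, 3} = Omega \ {0}, which lies in X \ X_+.
g_complement = g(CHAIN[1], CHAIN[4], profiles)
print(steps, g_complement)
```

Here the values on the successive differences are non-negative and sum to $g(\Omega )=1$, and the value on $\{1,2,3\}$ equals $1-T(P(\{0\}))$, in line with (2).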

Note that ${{\widetilde{X}}}$ is a semi-ring on $\Omega $, since (i) $\varnothing \in {\widetilde{X}}$, (ii) $C,C^{\prime }\in {\widetilde{X}}\Rightarrow C\cap C^{\prime }\in {\widetilde{X}}$, and (iii) for all $C,C^{\prime } \in {\widetilde{X}}$, the difference $C\backslash C^{\prime }$ is a union of finitely many—in fact, at most two—events in ${\widetilde{X}}$. We next show that the function g on this semi-ring is $\sigma $-additive. First, g is finitely additive, i.e., for all disjoint $C_{1},C_{2}\in {\widetilde{X}} $, if $C_{1}\cup C_{2}\in {\widetilde{X}}$, then $g(C_{1}\cup C_{2} )=g(C_{1})+g(C_{2})$, by definition of g and additivity of P. To show $\sigma $-additivity, consider pairwise disjoint $C_{1},C_{2},\ldots \in {\widetilde{X}}$ such that $\cup _{m=1}^{\infty }C_{m}\in {\widetilde{X}}$. We have to show that

$$\begin{aligned} \delta _{M}:=g(\cup _{m=1}^{\infty }C_{m})-\sum _{m=1}^{M}g(C_{m})\rightarrow 0\quad \text { as } \quad M\rightarrow \infty . \end{aligned}$$

For all $M\in \{1,2,\ldots \}$, note that the difference $\left( \cup _{m=1}^{\infty }C_{m}\right) \backslash \left( \cup _{m=1}^{M}C_{m}\right) =\cup _{m=M+1}^{\infty }C_{m}$ need not belong to ${\widetilde{X}}$, but can be partitioned into a finite set $\mathcal {C}^{M}$ of events in ${\widetilde{X}}$ (as $\cup _{m=1}^{\infty }C_{m}$ belongs to the semi-ring ${\widetilde{X}}$). So, $\mathcal {C}^{M}\cup \{C_{1},\ldots ,C_{M}\}$ partitions $\cup _{m=1}^{\infty }C_{m} $. Careful inspection of g’s definition reveals that $\delta _{M}=\sum _{C\in \mathcal {C}^{M}}g(C)$. So, as $g(C)\le KP(C)$ for each $C\in {\widetilde{X}}$ (by definition of g and property (iv) of T), $\delta _{M}\le K\sum _{C\in \mathcal {C}^{M}}P(C)=KP(\cup _{m=M+1}^{\infty }C_{m})$. As $M\rightarrow \infty $ we have $P(\cup _{m=M+1}^{\infty }C_{m})\rightarrow 0$ (by $\sigma $-additivity of P), and so $\delta _{M}\rightarrow 0$, as required.

As g is non-negative, $\sigma $-additive, and also $\sigma $-finite (i.e., $\Omega $ is a union of countably many events in ${\widetilde{X}}$ of finite g-measure, which trivially holds as $\Omega \in {\widetilde{X}}$), Caratheodory’s Extension Theorem tells us that g extends uniquely to a measure on $\sigma ({\widetilde{X}})=\sigma (X)=\Sigma $. Let $P_{P_{1},\ldots ,P_{n}}$ be this extension. $P_{P_{1},\ldots ,P_{n}}$ is indeed a probability function since $P_{P_{1},\ldots ,P_{n}}(\Omega )=1$ as $\Omega \in {\widetilde{X}}$ and $g(\Omega )=T(1)=1$.

Finally, we must prove that the pooling function $(P_{1},\ldots ,P_{n})\mapsto P_{P_{1},\ldots ,P_{n}}$, as just defined, is independent on X, (globally) consensus-preserving, and non-neutral on X.

Independence on X This holds because, for all $P_{1},\ldots ,P_{n} \in {\mathcal {P}}_{\Sigma }$, the function $P_{P_{1},\ldots ,P_{n}}$ extends g, which satisfies (2). Note that the pooling criterion $D_{C}$ for $C\in X_{+}$ is defined as $T\circ D$, while the pooling criterion $D_{C}$ for $C\in X\backslash X_{+}$ is defined by $\mathbf {t}\mapsto 1-T\circ D(\mathbf {1} -\mathbf {t})$.

Non-neutrality on X Here it suffices to show that, for some $C\in X\backslash \{\Omega ,\varnothing \}$, the pooling criteria $D_{C}$ and $D_{C^{c}}$ differ. This follows from the following argument. First, $X\backslash \{\Omega ,\varnothing \}\ne \varnothing $ as $X\ne \{\varnothing ,\Omega \}$. So we may pick $C,C^{c}\in X\backslash \{\Omega ,\varnothing \}$; say, assume $C\in X_{+}$ and $C^{c}\in X\backslash X_{+}$. So, as just shown, $D_{C}=T\circ D$ and $D_{C^{c}}=1-T\circ D(\mathbf {1}-\cdot )$. Hence $D_{C}\ne D_{C^{c}}$, since $D_{C}(1/2,\ldots ,1/2)\ne D_{C^{c}}(1/2,\ldots ,1/2)$, as is clear from the fact that $T(1/2)\ne 1/2$ and that

$$\begin{aligned} D_{C}(1/2,\ldots ,1/2)&=T\circ D(1/2,\ldots ,1/2)=T(1/2),\\ D_{C^{c}}(1/2,\ldots ,1/2)&=1-T\circ D(1-1/2,\ldots ,1-1/2)\\&=1-T\circ D(1/2,\ldots ,1/2)=1-T(1/2). \end{aligned}$$
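The difference between the two pooling criteria can be checked numerically. This is a hedged sketch, assuming three individuals, dictatorship by individual 1 as the neutral criterion D, and $T(x)=min\{2x,1\}$; the names `D_C` and `D_Cc` are illustrative labels for the criteria of an event in $X_{+}$ and of its complement.

```python
from fractions import Fraction as Fr

def T(x):
    # T(x) = min{2x, 1}, so T(1/2) = 1 != 1/2.
    return min(2 * x, Fr(1))

def D(ts):
    # Assumed neutral ingredient: dictatorship by individual 1.
    return ts[0]

def D_C(ts):
    # Criterion for an event C in X_+: T o D.
    return T(D(ts))

def D_Cc(ts):
    # Criterion for the complement: t -> 1 - T o D(1 - t).
    return 1 - T(D([1 - t for t in ts]))

half = [Fr(1, 2)] * 3
zeros = [Fr(0)] * 3
ones = [Fr(1)] * 3
print(D_C(half), D_Cc(half))
```

With these choices the two criteria agree at $\mathbf {0}$ and $\mathbf {1}$ but take the values $T(1/2)=1$ and $1-T(1/2)=0$ at $(1/2,\ldots ,1/2)$.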

Consensus preservation Let $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$ and $A\in \Sigma $ such that $P_{1}(A)=\cdots =P_{n}(A)=1$. We show that $P_{P_{1},\ldots ,P_{n}}(A)=1$. Let P be the result of pooling $P_{1},\ldots ,P_{n}$ using the (at least on X) neutral pooling function defined above. As that pooling function is consensus-preserving, $P(A)=1$. It suffices to show that $P_{P_{1},\ldots ,P_{n}}\le KP$, as this implies that $P_{P_{1},\ldots ,P_{n}} (A^{c})\le KP(A^{c})=K(1-P(A))=K(1-1)=0$, so that $P_{P_{1},\ldots ,P_{n}}(A)=1$. Now, to show that $P_{P_{1},\ldots ,P_{n}}\le KP$, note first that, by property (iv) of T, $g\le KP|_{{\widetilde{X}}}$, and so $KP|_{{\widetilde{X}}}-g\ge 0$. Since g and $KP|_{{\widetilde{X}}}$, and hence also $KP|_{{\widetilde{X}}}-g$, are $\sigma $-additive, $\sigma $-finite and non-negative functions on the semi-ring ${\widetilde{X}}$, each of them extends uniquely to a measure on $\sigma ({\widetilde{X}})=\Sigma $ by Caratheodory’s Extension Theorem. The first two extensions are $P_{P_{1},\ldots ,P_{n}}$ and KP, respectively. So the third one must be $KP-P_{P_{1},\ldots ,P_{n}}$. Hence $KP-P_{P_{1},\ldots ,P_{n}}\ge 0$, and thus $P_{P_{1},\ldots ,P_{n}}\le KP$. $\square $
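The domination argument $P_{P_{1},\ldots ,P_{n}}\le KP$ can be checked on a finite example, where the extension from the semi-ring is just a sum of point masses. The sketch assumes $\Omega =\{0,1,2,3\}$ with the chain of initial segments as $X_{+}$, equal-weight linear pooling as the neutral function, $T(x)=\min \{2x,1\}$ with Lipschitz constant $K=2$, and the hypothetical helper `pooled_masses`.

```python
from fractions import Fraction as Fr

K = 2  # Lipschitz constant of T(x) = min{2x, 1}

def T(x):
    return min(2 * x, Fr(1))

def prob(p, event):
    return sum((p[w] for w in event), Fr(0))

# X_+ as a chain: {} c {0} c {0,1} c {0,1,2} c Omega.
CHAIN = [frozenset(range(k)) for k in range(5)]

def pooled_masses(profiles):
    """Return the neutral (equal-weight linear) pool p and the point masses
    of the transformed pool, read off from g on the differences of the chain."""
    p = {w: sum((q[w] for q in profiles), Fr(0)) / len(profiles)
         for w in range(4)}
    masses = {}
    for k in range(4):
        # The point k is the difference CHAIN[k + 1] \ CHAIN[k].
        masses[k] = T(prob(p, CHAIN[k + 1])) - T(prob(p, CHAIN[k]))
    return p, masses

# Both individuals assign probability 1 to A = {0, 1}.
P1 = {0: Fr(1, 2), 1: Fr(1, 2), 2: Fr(0), 3: Fr(0)}
P2 = {0: Fr(1, 4), 1: Fr(3, 4), 2: Fr(0), 3: Fr(0)}
p, masses = pooled_masses([P1, P2])
print(masses)
```

In this example every point mass of the transformed pool is at most K times the corresponding mass of P, so the complement of A, which has P-probability 0, also has transformed probability 0.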

A.3 Proof of Theorem 3(b)

Let X be a non-path-connected and finite sub-agenda of the $\sigma $-algebra $\Sigma $. As in the proof of Theorem 1(b), we begin by proving that we may assume without loss of generality that $\sigma (X)=\Sigma $.

Claim 1 If Theorem 3(b) holds when $\sigma (X)=\Sigma $, then it holds in general.

Assume Theorem 3(b) holds if $\sigma (X)=\Sigma $ and let $\Sigma ^{\prime }:=\sigma (X)$ ($\subseteq \Sigma $). By assumption, there exists $F^{\prime }:{\mathcal {P}}_{\Sigma ^{\prime }}^{n}\rightarrow {\mathcal {P}}_{\Sigma ^{\prime }}$ which, on X, is independent, consensus-preserving, and non-neutral. Consider some $F:{\mathcal {P}}_{\Sigma }^{n}\rightarrow {\mathcal {P}}_{\Sigma }$ which, for any $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$, generates a probability function in ${\mathcal {P}}_{\Sigma }$ extending $F^{\prime }(P_{1}|_{\Sigma ^{\prime } },\ldots ,P_{n}|_{\Sigma ^{\prime }})$ (where such an extension exists by Lemma 5 and finiteness of $\Sigma ^{\prime }$). The so-defined F inherits all relevant properties from $F^{\prime }$: it is, on X, independent, consensus-preserving, and non-neutral. This proves the claim.

Now let $\sigma (X)=\Sigma $. Notationally, for any sub-$\sigma $-algebra $\bar{\Sigma }\subseteq \Sigma $, let $\mathcal {A}(\bar{\Sigma })$ be its set of atoms (i.e., minimal elements of $\bar{\Sigma }\backslash \{\varnothing \}$). We now define a pooling function for agenda $\Sigma $ and show that it has the desired properties. As an ingredient of the definition, let $D^{\prime }:[0,1]^{n}\rightarrow [0,1]$ and $D^{\prime \prime }:[0,1]^{n} \rightarrow [0,1]$ be the local pooling criteria of two distinct linear pooling functions; and let $\bar{A}\in X\backslash \{\varnothing ,\Omega \}$ be an event (which exists since X is not path-connected) such that $\bar{A}\vdash \vdash ^{*}A$ fails for at least one $A\in X\backslash \{\varnothing ,\Omega \}$, where $\vdash \vdash ^{*}$ is the transitive closure of $\vdash ^{*}$. Consider any $(P_{1},\ldots ,P_{n})\in {\mathcal {P}}_{\Sigma }^{n}$. To define $P_{P_{1} ,\ldots ,P_{n}}\in {\mathcal {P}}_{\Sigma }$, we start by defining probability functions on two sub-$\sigma $-algebras of $\Sigma $, denoted $\Sigma ^{\prime }$ and $\Sigma ^{\prime \prime }$ and defined as the $\sigma $-algebras generated by the sets

$$\begin{aligned} X^{\prime }&:=\{A\in X : \bar{A}\vdash \vdash ^{*}B\text { for both }B\in \{A,A^{c}\}\},\\ X^{\prime \prime }&:=\{A\in X : \bar{A}\vdash \vdash ^{*}B\text { for no }B\in \{A,A^{c}\}\}, \end{aligned}$$

respectively. ($X^{\prime }$ and $X^{\prime \prime }$ might be empty, in which case $\Sigma ^{\prime }$ and $\Sigma ^{\prime \prime }$, respectively, are $\{\varnothing ,\Omega \}$.) Let $P_{P_{1},\ldots ,P_{n}}^{\prime }\in {\mathcal {P}} _{\Sigma ^{\prime }}$ and $P_{P_{1},\ldots ,P_{n}}^{\prime \prime }\in {\mathcal {P}} _{\Sigma ^{\prime \prime }}$ be defined by

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}^{\prime }(A)&=D^{\prime }(P_{1}(A),\ldots ,P_{n}(A))\text { for all }A\in \Sigma ^{\prime },\\ P_{P_{1},\ldots ,P_{n}}^{\prime \prime }(A)&=D^{\prime \prime }(P_{1} (A),\ldots ,P_{n}(A))\text { for all }A\in \Sigma ^{\prime \prime }. \end{aligned}$$

These two functions are indeed probability functions (on $\Sigma ^{\prime }$ and $\Sigma ^{\prime \prime }$, respectively), as they are linear averages of probability functions.

Claim 2 The $\sigma $-algebras $\Sigma ^{\prime }$ and $\Sigma ^{\prime \prime }$ are logically independent, that is: if $A^{\prime }\in \Sigma ^{\prime }$ and $A^{\prime \prime }\in \Sigma ^{\prime \prime }$ are non-empty, so is $A^{\prime }\cap A^{\prime \prime }$.

Suppose the contrary. Then, as each non-empty element of $\Sigma ^{\prime }$ includes an atom of $\Sigma ^{\prime }$ and hence a non-empty intersection of events in $X^{\prime }$, and similarly for $\Sigma ^{\prime \prime }$, there are consistent sets $Y^{\prime }\subseteq X^{\prime }$ and $Y^{\prime \prime }\subseteq X^{\prime \prime }$ such that $Y^{\prime }\cup Y^{\prime \prime }$ is inconsistent. Let Y be a minimal inconsistent subset of $Y^{\prime }\cup Y^{\prime \prime }$. Then Y is not a subset of $Y^{\prime } $ or $Y^{\prime \prime }$, as $Y^{\prime }$ and $Y^{\prime \prime }$ are consistent. So there are $A\in Y\cap X^{\prime }$ and $B\in Y\cap X^{\prime \prime }$. Note that $A\vdash ^{*}B^{c}$, a contradiction since $A\in X^{\prime }$ and $B^{c}\in X^{\prime \prime }$. This proves Claim 2.

We now extend $P_{P_{1},\ldots ,P_{n}}^{\prime }$ and $P_{P_{1},\ldots ,P_{n}} ^{\prime \prime }$ to a probability function on the $\sigma $-algebra $\tilde{\Sigma }:=\sigma (\Sigma ^{\prime }\cup \Sigma ^{\prime \prime } )=\sigma (X^{\prime }\cup X^{\prime \prime })$, in such a way that the events in $\Sigma ^{\prime }$ are probabilistically independent of those in $\Sigma ^{\prime \prime }$. By Claim 2, the atoms of $\tilde{\Sigma }$ are precisely the intersections of an atom of $\Sigma ^{\prime }$ and one of $\Sigma ^{\prime \prime }$: $\mathcal {A}(\tilde{\Sigma })=\{A^{\prime }\cap A^{\prime \prime }:A^{\prime }\in \mathcal {A}(\Sigma ^{\prime }),A^{\prime \prime }\in \mathcal {A} (\Sigma ^{\prime \prime })\}$. Let $\tilde{P}_{P_{1},\ldots ,P_{n}}$ be the unique measure on $\tilde{\Sigma }$ that behaves as follows on the atoms:

$$\begin{aligned} \tilde{P}_{P_{1},\ldots ,P_{n}}(A^{\prime }\cap A^{\prime \prime })=P_{P_{1} ,\ldots ,P_{n}}^{\prime }(A^{\prime })P_{P_{1},\ldots ,P_{n}}^{\prime \prime } (A^{\prime \prime }), \end{aligned}$$

(3)

for all $A^{\prime }\in \mathcal {A}(\Sigma ^{\prime }),$ $A^{\prime \prime } \in \mathcal {A}(\Sigma ^{\prime \prime })$. Now $\tilde{P}_{P_{1},\ldots ,P_{n}} $ is a probability function as

$$\begin{aligned} {\displaystyle \sum _{A\in \mathcal {A}(\tilde{\Sigma })}} \tilde{P}_{P_{1},\ldots ,P_{n}}(A)&= {\displaystyle \sum _{A^{\prime }\in \mathcal {A} (\Sigma ^{\prime }),A^{\prime \prime }\in \mathcal {A}(\Sigma ^{\prime \prime })}} P_{P_{1},\ldots ,P_{n}}^{\prime }(A^{\prime })P_{P_{1},\ldots ,P_{n}}^{\prime \prime }(A^{\prime \prime })\\&= {\displaystyle \sum _{A^{\prime }\in \mathcal {A} (\Sigma ^{\prime })}} P_{P_{1},\ldots ,P_{n}}^{\prime }(A^{\prime })\underbrace{ {\displaystyle \sum _{A^{\prime \prime }\in \mathcal {A}(\Sigma ^{\prime \prime })}} P_{P_{1},\ldots ,P_{n}}^{\prime \prime }(A^{\prime \prime })}_{=1}=1. \end{aligned}$$

Check that restricting $\tilde{P}_{P_{1},\ldots ,P_{n}}$ to $\Sigma ^{\prime }$ and $\Sigma ^{\prime \prime }$ yields $P_{P_{1},\ldots ,P_{n}}^{\prime }$ and $P_{P_{1},\ldots ,P_{n}}^{\prime \prime }$, respectively. So

$$\begin{aligned} \tilde{P}_{P_{1},\ldots ,P_{n}}(A)=\left\{ \begin{array} [c]{ll} D^{\prime }(P_{1}(A),\ldots ,P_{n}(A)) &{} \text {for all}\,\,A\in \Sigma ^{\prime }\\ D^{\prime \prime }(P_{1}(A),\ldots ,P_{n}(A)) &{} \text {for all}\,\,A\in \Sigma ^{\prime \prime }. \end{array} \right. \end{aligned}$$

(4)
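The product construction in (3) and the restriction property behind (4) can be illustrated on a toy example. The sketch assumes two atoms for $\Sigma ^{\prime }$ and three for $\Sigma ^{\prime \prime }$ with arbitrary illustrative probabilities; the atom labels and the names `P_prime`, `P_dprime` and `P_tilde` are hypothetical.

```python
from fractions import Fraction as Fr
from itertools import product

# Assumed pooled probabilities on the atoms of Sigma' and Sigma''.
P_prime = {"a1": Fr(1, 3), "a2": Fr(2, 3)}
P_dprime = {"b1": Fr(1, 2), "b2": Fr(1, 4), "b3": Fr(1, 4)}

# Equation (3): each atom of the joint algebra is an intersection of one
# atom from each side and receives the product of the two probabilities.
P_tilde = {(a, b): P_prime[a] * P_dprime[b]
           for a, b in product(P_prime, P_dprime)}

# Restricting P_tilde to either side amounts to summing out the other side.
marg_prime = {a: sum(P_tilde[(a, b)] for b in P_dprime) for a in P_prime}
marg_dprime = {b: sum(P_tilde[(a, b)] for a in P_prime) for b in P_dprime}

print(sum(P_tilde.values()), marg_prime == P_prime, marg_dprime == P_dprime)
```

The product form is what makes events in $\Sigma ^{\prime }$ probabilistically independent of those in $\Sigma ^{\prime \prime }$; it relies on Claim 2, which ensures that every such intersection of atoms is non-empty.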

Before we can extend $\tilde{P}_{P_{1},\ldots ,P_{n}}$ to the full $\sigma $-algebra $\Sigma $, we prove another claim. For all $A\in X$ such that $\bar{A}\vdash \vdash ^{*}A$ but not $\bar{A}\vdash \vdash ^{*}A^{c}$, define

$$\begin{aligned} A_{P_{1},\ldots ,P_{n}}:=\left\{ \begin{array} [c]{ll} A &{}\quad \text {if} \,\,P_{i}(A)>0\,\, \text {for some}\,\, i\\ A^{c} &{}\quad \text {if}\,\, P_{i}(A)=0\,\,\text {for all}\,\, i. \end{array} \right. \end{aligned}$$

Claim 3 For all atoms C of $\tilde{\Sigma }$ ($=\sigma (X^{\prime }\cup X^{\prime \prime })$) with $\tilde{P}_{P_{1},\ldots ,P_{n}}(C)>0$, the event $C\cap \left( \cap _{A\in X:\bar{A}\vdash \vdash ^{*}A\text { and not }\bar{A}\vdash \vdash ^{*}A^{c}}A_{P_{1},\ldots ,P_{n}}\right) $ is an atom of $\Sigma $.

Let C be as specified, and write $C_{P_{1},\ldots ,P_{n}}$ for the event in question. As noted above, $C=A^{\prime }\cap A^{\prime \prime }$ with $A^{\prime }\in \mathcal {A}(\Sigma ^{\prime })$ and $A^{\prime \prime }\in \mathcal {A} (\Sigma ^{\prime \prime })$. By ${\tilde{P}}_{P_{1},\ldots ,P_{n}}(C)>0$ and (3), we have $P_{P_{1},\ldots ,P_{n}}^{\prime }(A^{\prime })>0$ and $P_{P_{1},\ldots ,P_{n}}^{\prime \prime }(A^{\prime \prime })>0 $. Since $A^{\prime }\in \mathcal {A}(\Sigma ^{\prime })$, we may write $A^{\prime } =\cap _{A\in Y^{\prime }}A$ for some set $Y^{\prime }\subseteq X^{\prime }$ containing exactly one member of each pair $A,A^{c}\in X^{\prime }$. Similarly, $A^{\prime \prime }=\cap _{A\in Y^{\prime \prime }}A $ for some set $Y^{\prime \prime }\subseteq X^{\prime \prime }$ containing exactly one member of each pair $A,A^{c}\in X^{\prime \prime }$. Note also that $\cap _{A\in X:\bar{A} \vdash \vdash ^{*}A\text { and not }\bar{A}\vdash \vdash ^{*}A^{c}} A_{P_{1},\ldots ,P_{n}}$ can be written as $\cap _{A\in Y_{P_{1},\ldots ,P_{n}}}A$, where the set

$$\begin{aligned} Y_{P_{1},\ldots ,P_{n}}=\{A_{P_{1},\ldots ,P_{n}}:A\in X,\text { }\bar{A}\vdash \vdash ^{*}A,\text { not }\bar{A}\vdash \vdash ^{*}A^{c}\} \end{aligned}$$

consists of exactly one member of each pair $A,A^{c}\in X\backslash (X^{\prime }\cup X^{\prime \prime })$. So $C_{P_{1},\ldots ,P_{n}}=\cap _{A\in Y^{\prime }\cup Y^{\prime \prime }\cup Y_{P_{1},\ldots ,P_{n}}}A$, where the set $Y^{\prime }\cup Y^{\prime \prime }\cup Y_{P_{1},\ldots ,P_{n}}$ consists of exactly one member of each pair $A,A^{c}\in X$. So, since $\Sigma =\sigma (X) $, $C_{P_{1},\ldots ,P_{n}}$ is an atom or is empty. Hence it suffices to show that $C_{P_{1},\ldots ,P_{n} }\ne \varnothing $. Suppose the contrary. Then $Y^{\prime }\cup Y^{\prime \prime }\cup Y_{P_{1},\ldots ,P_{n}}$ is inconsistent, hence has a minimal inconsistent subset Y. We distinguish two cases and derive a contradiction in each.

Case 1 There is some $B\in Y\cap Y_{P_{1},\ldots ,P_{n}}$ with $\bar{A}\vdash \vdash ^{*}B$. Consider some $B^{\prime }\in Y\backslash \{B\}$. We have (i) not $\bar{A}\vdash \vdash ^{*}B^{\prime }$ (otherwise by $B^{\prime }\vdash ^{*}B^{c}$ we would have $\bar{A}\vdash \vdash ^{*}B^{c}$, hence $B\in X^{\prime }$, a contradiction as $B\in Y_{P_{1},\ldots ,P_{n}}$). Further, (ii) $\bar{A}\vdash \vdash ^{*}(B^{\prime })^{c}$ (as $\bar{A}\vdash \vdash ^{*}B$ and $B\vdash ^{*}(B^{\prime })^{c}$). By (i) and (ii), letting $A:=(B^{\prime })^{c}$, the event $A_{P_{1},\ldots ,P_{n}}$ ($\in \{A,A^{c}\}$) is well-defined. Since $Y_{P_{1},\ldots ,P_{n}}$ contains $A_{P_{1},\ldots ,P_{n}}$ ($\in \{A,A^{c}\}$) and contains $B^{\prime }=A^{c}$ but not $(B^{\prime })^{c}=A$, we must have $A_{P_{1},\ldots ,P_{n}}=A^{c}$. So, for all i, $P_{i}(A)=0$ and hence $P_{i}(B^{\prime })=1$. Note that this holds for all $B^{\prime }\in Y\backslash \{B\}$. So $P_{i}(\cap _{B^{\prime }\in Y}B^{\prime })=P_{i}(B)$ for all i. Hence, as Y is inconsistent, $P_{i}(B)=0$ for all i. Thus $B_{P_{1},\ldots ,P_{n}}=B^{c}$. So $B^{c}\in Y_{P_{1},\ldots ,P_{n}}$, a contradiction as $B\in Y_{P_{1},\ldots ,P_{n}}$.

Case 2 There is no $B\in Y\cap Y_{P_{1},\ldots ,P_{n}}$ with $\bar{A}\vdash \vdash ^{*}B$. Then all $B\in Y\cap Y_{P_{1},\ldots ,P_{n}}$ take the form $A_{P_{1},\ldots ,P_{n}}=A^{c}$, so that $P_{i}(A)=0$ for all i, i.e., $P_{i}(B)=1$ for all i. So, (*) $P_{i}(\cap _{B\in Y}B)=P_{i}(\cap _{B\in Y\backslash Y_{P_{1},\ldots ,P_{n}}}B)$ for all i. Now, either (i) $Y\subseteq Y_{P_{1},\ldots ,P_{n}}\cup Y^{\prime }$, or (ii) $Y\subseteq Y_{P_{1},\ldots ,P_{n} }\cup Y^{\prime \prime }$, because otherwise there are $A^{\prime }\in Y^{\prime }$ and $A^{\prime \prime }\in Y^{\prime \prime }$, and $A^{\prime }\vdash ^{*}(A^{\prime \prime })^{c}$, whence $\bar{A}\vdash \vdash ^{*}(A^{\prime \prime })^{c}$, a contradiction as $(A^{\prime \prime })^{c}\in X^{\prime \prime }$. First suppose case (i) holds. Then $Y\backslash Y_{P_{1},\ldots ,P_{n}}\subseteq Y^{\prime }$, and so (*) implies that (**) $P_{i}(\cap _{B\in Y}B)\ge P_{i}(\cap _{B\in Y^{\prime }}B)=P_{i}(A^{\prime })$ for all i. Since by assumption $\tilde{P}_{P_{1},\ldots ,P_{n}}(A^{\prime })>0$, there is (by (4)) at least one i with $P_{i}(A^{\prime })>0$, hence by (**) with $P_{i}(\cap _{B\in Y}B)>0$. So $\cap _{B\in Y}B\ne \varnothing $, i.e., Y is consistent, a contradiction. Similarly, in case (ii), one can show that Y is consistent, a contradiction. This completes the proof of Claim 3.

Let $P_{P_{1},\ldots ,P_{n}}$ be the unique measure on $\Sigma $ behaving as follows on any atom B of $\Sigma $. If B takes the form given in Claim 3, i.e., $B=C\cap \left( \cap _{A\in X:\bar{A}\vdash \vdash ^{*}A\text { and not }\bar{A}\vdash \vdash ^{*}A^{c}}A_{P_{1},\ldots ,P_{n}}\right) $ where $C\in \mathcal {A}(\tilde{\Sigma })$ and $\tilde{P}_{P_{1},\ldots ,P_{n}}(C)>0$, then let $P_{P_{1},\ldots ,P_{n}}(B):=\tilde{P}_{P_{1},\ldots ,P_{n}}(C)$. Otherwise let $P_{P_{1},\ldots ,P_{n}}(B):=0$.

Claim 4 $P_{P_{1},\ldots ,P_{n}}$ extends $\tilde{P}_{P_{1},\ldots ,P_{n}}$ (in particular, is a probability function).

It suffices to show that $P_{P_{1},\ldots ,P_{n}}$ coincides with $\tilde{P}_{P_{1},\ldots ,P_{n}}$ on $\mathcal {A}(\tilde{\Sigma })$. Consider any $C\in \mathcal {A}(\tilde{\Sigma })$. As $\Sigma $ is a refinement of $\tilde{\Sigma } $,

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(C)= {\displaystyle \sum _{B\in \mathcal {A}(\Sigma ):B\subseteq C}} P_{P_{1},\ldots ,P_{n}}(B). \end{aligned}$$

(5)

There are two cases.

Case 1 $\tilde{P}_{P_{1},\ldots ,P_{n}}(C)=0$. Then, for all $B\in \mathcal {A}(\Sigma )$ with $B\subseteq C$, we have $P_{P_{1},\ldots ,P_{n} }(B)=0$ (by definition of $P_{P_{1},\ldots ,P_{n}}$), and so by (5) we have $P_{P_{1},\ldots ,P_{n}}(C)=0=\tilde{P}_{P_{1},\ldots ,P_{n}}(C)$, as desired.

Case 2 $\tilde{P}_{P_{1},\ldots ,P_{n}}(C)>0$. Then, among all atoms $B\in \mathcal {A}(\Sigma )$ with $B\subseteq C$, there is by definition of $P_{P_{1},\ldots ,P_{n}}$ exactly one such that $P_{P_{1},\ldots ,P_{n}}(B)>0$ (namely $B=C\cap (\cap _{A\in X:\bar{A}\vdash \vdash ^{*}A\text { and not }\bar{A} \vdash \vdash ^{*}A^{c}}A_{P_{1},\ldots ,P_{n}})$), and $P_{P_{1},\ldots ,P_{n} }(B)=\tilde{P}_{P_{1},\ldots ,P_{n}}(C)$. So by (5) $P_{P_{1},\ldots ,P_{n} }(C)=\tilde{P}_{P_{1},\ldots ,P_{n}}(C)$. This completes the proof of Claim 4.

Claim 5 For all $A\in X$ such that $\bar{A}\vdash \vdash ^{*}A$ and not $\bar{A}\vdash \vdash ^{*}A^{c}$, $P_{P_{1},\ldots ,P_{n}}(A)$ is 1 if, for some individual i, $P_{i}(A)>0$, and 0 otherwise.

By definition of $P_{P_{1},\ldots ,P_{n}}$, all atoms of $\Sigma $ with positive probability are subsets of the event $\cap _{A\in X:\bar{A}\vdash \vdash ^{*}A\text { and not }\bar{A}\vdash \vdash ^{*}A^{c}}A_{P_{1},\ldots ,P_{n}}$. So this event has probability 1. Hence, for all $A\in X$ such that $\bar{A} \vdash \vdash ^{*}A$ and not $\bar{A}\vdash \vdash ^{*}A^{c}$, we have $P_{P_{1},\ldots ,P_{n}}(A_{P_{1},\ldots ,P_{n}})=1$, and so

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}(A)=\left\{ \begin{array} [c]{ll} 1 &{}\,\text {if}\,\,A_{P_{1},\ldots ,P_{n}}=A,\,\, i.e.,\quad \text {if}\,\, P_{i}(A)>0\,\, \text {for some}\,\, i\\ 0 &{} \,\text {if}\,\, A_{P_{1},\ldots ,P_{n}}=A^{c},\,\, i.e.,\quad \text {if}\,\, P_{i}(A)=0\,\, \text {for all}\,\, i. \end{array} \right. \end{aligned}$$

This proves Claim 5.

By Claim 4, we have constructed a well-defined pooling function $(P_{1} ,\ldots ,P_{n})\mapsto P_{P_{1},\ldots ,P_{n}}$ for agenda $\Sigma $. By (4) and Claims 4 and 5, we know its behaviour on the entire sub-agenda X: the pooling function is independent on X and the local pooling criteria $D_{A}$ of events $A\in X$ are given by

(i)
the linear criterion $D^{\prime }$ if $A\in X^{\prime }$,
(ii)
the different linear criterion $D^{\prime \prime }$ if $A\in X^{\prime \prime }$,
(iii)
a non-linear criterion ${\hat{D}}$ (taking the value 0 at $\mathbf {0}$ and the value 1 everywhere else) if $\bar{A}\vdash \vdash ^{*}A$ but not $\bar{A}\vdash \vdash ^{*}A^{c}$,
(iv)
the different non-linear criterion $1-\hat{D}(\mathbf {1} -\cdot )$ if not $\bar{A}\vdash \vdash ^{*}A$ but $\bar{A} \vdash \vdash ^{*}A^{c}$.

These pooling criteria also ensure consensus preservation on X, since each of them takes the value 1 at $\mathbf {1}$ and the value 0 at $\mathbf {0}$. To check non-neutrality, it suffices to show that at least two of the four different types of events (i)–(iv) do indeed occur. This is so because $\bar{A}$ is of type (i) or (iii) and because by assumption there exists an $A\in X $ such that not $\bar{A}\vdash \vdash ^{*}A$, i.e., such that A has type (ii) or (iv). $\square $
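The four local pooling criteria (i) to (iv) can be compared directly. The following sketch assumes two individuals and picks, purely for illustration, dictatorship by individual 1 and equal-weight averaging as the two distinct linear criteria; the names `D1`, `D2`, `D_hat` and `D_hat_dual` are hypothetical labels for $D^{\prime }$, $D^{\prime \prime }$, ${\hat{D}}$ and $1-\hat{D}(\mathbf {1}-\cdot )$.

```python
from fractions import Fraction as Fr

def D1(ts):
    # Type (i): linear criterion with weights (1, 0).
    return ts[0]

def D2(ts):
    # Type (ii): a different linear criterion with weights (1/2, 1/2).
    return (ts[0] + ts[1]) / 2

def D_hat(ts):
    # Type (iii): 0 at the zero vector and 1 everywhere else.
    return Fr(0) if all(t == 0 for t in ts) else Fr(1)

def D_hat_dual(ts):
    # Type (iv): t -> 1 - D_hat(1 - t).
    return 1 - D_hat([1 - t for t in ts])

criteria = [D1, D2, D_hat, D_hat_dual]
ZERO = [Fr(0), Fr(0)]
ONE = [Fr(1), Fr(1)]

# A probability vector at which all four criteria disagree.
t = [Fr(1, 2), Fr(0)]
values = [c(t) for c in criteria]
print(values)
```

All four criteria agree at $\mathbf {0}$ and $\mathbf {1}$, while at the vector $(1/2,0)$ they take four different values, so any two types that occur in X already witness non-neutrality.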

A.4 Proof of Theorem 4(b)

Consider any finite sub-agenda $X\ne \{\varnothing ,\Omega \}$ (of the $\sigma $-algebra agenda $\Sigma $) which is nested or satisfies $\left| X\backslash \{\varnothing ,\Omega \}\right| \le 4$. If X is nested, the claim follows from Theorem 1(b), as non-neutrality on X implies non-linearity on X. Now assume $\left| X\backslash \{\varnothing ,\Omega \}\right| \le 4$. We reduce the claim to Part I’s Theorem 4(b). By that result, there is a pooling function $F^{\prime }$ for agenda X which is independent, consensus compatible, and not linear. By Lemma 4, $F^{\prime }$ is induced by a pooling function for agenda $\Sigma $ which is independent on X, (globally) consensus-preserving, and not linear on X. $\square $

A.5 Proof of Theorem 5(b)

Consider a simple sub-agenda X of $\sigma $-algebra $\Sigma $, where X is finite and not $\{\varnothing ,\Omega \}$. We construct a pooling function which, on X, is independent (in fact, neutral), conditional-consensus-preserving, and non-linear. We may assume without loss of generality that $\sigma (X)=\Sigma $, because the ‘Claim’ in the proof of Theorem 1(b) holds analogously here as well.

As an ingredient of the construction, we use an arbitrary pooling function $(P_{1},\ldots ,P_{n})\mapsto P_{P_{1},\ldots ,P_{n}}^{\mathrm {lin}}$ which, at least on X, is linear and conditional-consensus-preserving. The function could be simply given by $(P_{1},\ldots ,P_{n})\mapsto P_{1}$, which is even globally linear and conditional-consensus-preserving. Let $D^{\mathrm {lin}}$ be its pooling criterion for all events in X. To anticipate, the pooling function $(P_{1},\ldots ,P_{n})\mapsto P_{P_{1},\ldots ,P_{n} }$ to be constructed will have the pooling criterion $D:[0,1]^{n} \rightarrow [0,1]$ for each event in X, where

$$\begin{aligned} D(t_{1},\ldots ,t_{n}):=\left\{ \begin{array} [c]{ll} 0 &{}\quad \text {if}\,\, D^{\mathrm {lin}}(t_{1},\ldots ,t_{n})<1/2,\\ 1/2 &{}\quad \text {if}\,\, D^{\mathrm {lin}}(t_{1},\ldots ,t_{n})=1/2,\\ 1 &{}\quad \text {if}\,\, D^{\mathrm {lin}}(t_{1},\ldots ,t_{n})>1/2. \end{array} \right. \end{aligned}$$

(6)
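The threshold criterion in (6) is easy to evaluate. The sketch below assumes two individuals and equal weights for the linear ingredient $D^{\mathrm {lin}}$ (the text allows any linear criterion, such as dictatorship by individual 1); the names `D_lin`, `HALF` and `weights` are illustrative.

```python
from fractions import Fraction as Fr

HALF = Fr(1, 2)

def D_lin(ts, weights):
    # Assumed linear ingredient: weighted average of the probabilities.
    return sum((w * t for w, t in zip(weights, ts)), Fr(0))

def D(ts, weights):
    """Criterion (6): round the linear pool to 0, 1/2 or 1."""
    v = D_lin(ts, weights)
    if v < HALF:
        return Fr(0)
    if v == HALF:
        return HALF
    return Fr(1)

weights = [HALF, HALF]
examples = [
    D([Fr(1, 4), Fr(1, 2)], weights),  # linear pool 3/8 < 1/2
    D([Fr(3, 4), Fr(1, 4)], weights),  # linear pool exactly 1/2
    D([Fr(1), Fr(1, 2)], weights),     # linear pool 3/4 > 1/2
]
print(examples)
```

Unanimous probability $1/4$ is mapped to 0 rather than $1/4$, so D is not a linear criterion; and $D(\mathbf {t})+D(\mathbf {1}-\mathbf {t})=1$ holds in each case, as is needed for an event and its complement to receive probabilities summing to 1.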

Consider any $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$. We must define $P_{P_{1},\ldots ,P_{n}}$. We use the following notation (which suppresses the parameters $P_{1},\ldots ,P_{n}$):

$$\begin{aligned} p(A)&:=P_{P_{1},\ldots ,P_{n}}^{\mathrm {lin}}(A)\text { for all }A\in \Sigma ,\\ X_{\ge 1/2}&:=\{A\in X:p(A)\ge 1/2\},\\ X_{>1/2}&:=\{A\in X:p(A)>1/2\},\\ X_{=1/2}&:=\{A\in X:p(A)=1/2\}. \end{aligned}$$

Notice that for all $A\in X$ we have $A\in X_{>1/2}\Rightarrow A^{c} \not \in X_{>1/2}$ and $A\in X_{=1/2}\Leftrightarrow A^{c}\in X_{=1/2}$. We now prove two claims (which use X’s simplicity).

Claim 1 $X_{=1/2}$ can be partitioned into two (possibly empty) sets $X_{=1/2}^{1}$ and $X_{=1/2}^{2}$ such that (i) each $X_{=1/2}^{j}$ satisfies $p(A\cap B)>0$ for all $A,B\in X_{=1/2}^{j}$ and (ii) each $X_{=1/2}^{j}\cup X_{>1/2}$ is consistent (whence $X_{=1/2}^{j}$ contains exactly one member of each pair $A,A^{c}\in X_{=1/2}$).

To show this, note first that $X_{=1/2}$ has a subset Y such that $p(A\cap B)>0$ for all $A,B\in Y$ (e.g., $Y=\varnothing $). Among all such subsets $Y\subseteq X_{=1/2}$, let $X_{=1/2}^{1}$ be a maximal one, and let $X_{=1/2}^{2}:=X_{=1/2}\backslash X_{=1/2}^{1}$. By definition, $X_{=1/2}^{1}$ and $X_{=1/2}^{2}$ form a partition of $X_{=1/2}$. We show that (i) and (ii) hold.

(i)
Property (i) holds for $X_{=1/2}^{1}$ by definition, and for $X_{=1/2}^{2}$ by the following argument. Let $A,B\in X_{=1/2}^{2}$ and for a contradiction let $p(A\cap B)=0$. By the maximality property of $X_{=1/2}^{1} $, there are $A^{\prime },B^{\prime }\in X_{=1/2}^{1}$ such that $p(A\cap A^{\prime })=0$ and $p(B\cap B^{\prime })=0$. Thus, $p(A\cap C)=p(B\cap C)=0$ where $C:=A^{\prime }\cap B^{\prime }$. Since the intersection of any two of the sets A, B, C has zero p-probability, we must have $p(A)+p(B)+p(C)=p(A\cup B\cup C)\le 1$, a contradiction because $p(A)=p(B)=1/2$ and $p(C)=p(A^{\prime }\cap B^{\prime })>0$ (the latter because $X_{=1/2}^{1}$ satisfies (i)).
(ii)
For a contradiction, let some $X_{=1/2}^{j}\cup X_{>1/2}$ be inconsistent. Then (since X and hence $X_{=1/2}^{j}\cup X_{>1/2}$ are finite) there is a minimal inconsistent subset $Y\subseteq X_{=1/2}^{j}\cup X_{>1/2}$. Since X is simple, we have $\left| Y\right| \le 2$, say $Y=\{A,B\}$. Since $A\cap B=\varnothing $, we have $p(A)+p(B)=p(A\cup B)\le 1$. And since $p(A),p(B)\ge 1/2$, it follows that $p(A)=p(B)=1/2$, i.e., $A,B\in X_{=1/2}^{j}$. Hence, by (i), we have $p(A\cap B)>0$, a contradiction as $A\cap B=\varnothing $.

Claim 2 $\cap _{C\in X_{=1/2}^{1}\cup X_{>1/2}}C$ and $\cap _{C\in X_{=1/2}^{2}\cup X_{>1/2}}C$ are atoms of the $\sigma $-algebra $\Sigma $, i.e., ($\subseteq $-)minimal elements of $\Sigma \backslash \{\varnothing \}$ (they are the same atoms if and only if $X_{=1/2}=\varnothing $, i.e., if and only if $X_{=1/2}^{1}=X_{=1/2}^{2}=\varnothing $).

To show this, first write X as $\{C_{j}^{0},C_{j}^{1}:j=1,\ldots ,J\}$, where $J=\left| X\right| /2$ and each pair $C_{j}^{0},C_{j}^{1}$ consists of an event and its complement. We may write $\Sigma $ as

$$\begin{aligned} \Sigma =\{\cup _{(k_{1},\ldots ,k_{J})\in K}(C_{1}^{k_{1}}\cap \cdots \cap C_{J}^{k_{J}}):K\subseteq \{0,1\}^{J}\}. \end{aligned}$$

(7)

Recall that $\Sigma $ is the $\sigma $-algebra generated by X. The inclusion ‘$\supseteq $’ in (7) is obvious, and the inclusion ‘$\subseteq $’ holds because the right side of (7) includes X (since any $C_{j}^{k}\in X$ can be written as the union of all $C_{1}^{k_{1}}\cap \cdots \cap C_{J}^{k_{J}}$ for which $k_{j}=k$) and is a $\sigma $-algebra (check closedness under taking unions and complements).

From (7) and the pairwise disjointness of the intersections of the form $C_{1}^{k_{1}}\cap \cdots \cap C_{J}^{k_{J}}$, it is clear that every consistent such intersection is an atom of $\Sigma $. Now $\cap _{C\in X_{=1/2}^{j}\cup X_{>1/2}}C$ is (for $j\in \{1,2\}$) precisely such a consistent intersection. Indeed, $\cap _{C\in X_{=1/2}^{j}\cup X_{>1/2}}C$ is consistent by Claim 1, and contains a member of each pair $A,A^{c}$ in X. The latter holds by Claim 1 if $p(A)=p(A^{c})$ ($=1/2$), and otherwise because there is a $B\in \{A,A^{c}\}$ with $p(B)>1/2$, i.e., with $B\in X_{>1/2} \subseteq X_{=1/2}^{j}\cup X_{>1/2}$. This proves Claim 2.

We can now define $P_{P_{1},\ldots ,P_{n}}$. By Claim 1, we may pick $\omega ^{1}\in \cap _{C\in X_{=1/2}^{1}\cup X_{>1/2}}C$ and $\omega ^{2}\in \cap _{C\in X_{=1/2}^{2}\cup X_{>1/2}}C$, where we assume that $\omega ^{1}=\omega ^{2}$ if $X_{=1/2}=\varnothing $, i.e., if $\cap _{C\in X_{=1/2}^{1}\cup X_{>1/2}} C=\cap _{C\in X_{=1/2}^{2}\cup X_{>1/2}}C=\cap _{C\in X_{>1/2}}C$. Let $\delta _{\omega ^{1}}$ and $\delta _{\omega ^{2}}$ be, respectively, the Dirac measures on $\Sigma $ at $\omega ^{1}$ and $\omega ^{2}$, given for all $A\in \Sigma $ by $\delta _{\omega ^{j}}(A)=1$ if $\omega ^{j}\in A$ and $\delta _{\omega ^{j}}(A)=0$ if $\omega ^{j}\notin A$. Let

$$\begin{aligned} P_{P_{1},\ldots ,P_{n}}:=\frac{1}{2}\delta _{\omega ^{1}}+\frac{1}{2}\delta _{\omega ^{2}}, \end{aligned}$$

where $\omega ^{1}$ and $\omega ^{2}$ depend on $P_{1},\ldots ,P_{n}$ via $X_{=1/2}^{1},X_{=1/2}^{2},X_{>1/2}$. So $P_{P_{1},\ldots ,P_{n}}(A)$ is 1 or 1/2 or 0 depending on whether A ($\in \Sigma $) contains both, exactly one, or none of $\omega ^{1}$ and $\omega ^{2}$; and $P_{P_{1},\ldots ,P_{n}}=\delta _{\omega }$ if $\omega ^{1}=\omega ^{2}=\omega $, i.e., if $X_{=1/2}=\varnothing $. We finally show that the so-defined pooling function $(P_{1},\ldots ,P_{n})\mapsto P_{P_{1},\ldots ,P_{n}}$ has all desired properties.
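As an illustration of this definition, here is a minimal sketch of an equal-weight mixture of two Dirac measures on a finite set of worlds. The function names `dirac` and `collective` and the worlds 1 and 4 are hypothetical; the sketch takes $\omega ^{1}$ and $\omega ^{2}$ as given and does not compute them from $X_{=1/2}^{1},X_{=1/2}^{2},X_{>1/2}$.

```python
from fractions import Fraction


def dirac(point):
    """Dirac measure at `point`: 1 on events containing it, 0 otherwise."""
    return lambda event: Fraction(1 if point in event else 0)


def collective(omega1, omega2):
    """Equal-weight mixture (1/2) * delta_{omega1} + (1/2) * delta_{omega2}."""
    d1, d2 = dirac(omega1), dirac(omega2)
    half = Fraction(1, 2)
    return lambda event: half * d1(event) + half * d2(event)


# Hypothetical worlds omega^1 = 1 and omega^2 = 4.
P = collective(1, 4)
print(P({1, 2}), P({1, 4}), P({2, 3}))  # 1/2 1 0
```

When the two worlds coincide, the mixture reduces to a single Dirac measure, matching the case $X_{=1/2}=\varnothing $.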

Independence on X We in fact show something stronger, i.e., neutrality on X with pooling criterion D given in (6). Let $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$, $A\in X$ and $(t_{1},\ldots ,t_{n} ):=(P_{1}(A),\ldots ,P_{n}(A))$. We prove that $P_{P_{1},\ldots ,P_{n}}(A)=D(t_{1} ,\ldots ,t_{n})$ by considering three cases and using the above notation p, $X_{>1/2},$ $X_{=1/2}^{1},X_{=1/2}^{2},\omega ^{1},\omega ^{2}$.

Case 1 $p(A)=D^{\mathrm {lin}}(t_{1},\ldots ,t_{n})<1/2$. Here $D(t_{1},\ldots ,t_{n})=0$. So we must prove that $P_{P_{1},\ldots ,P_{n}}(A)=0$, i.e., that $\omega ^{1},\omega ^{2}\not \in A$. Assume for a contradiction that $\omega ^{1}\in A$ (the proof is analogous if we instead assume $\omega ^{2}\in A$). Then A includes $\cap _{C\in X_{=1/2}^{1}\cup X_{>1/2}}C$, as this set contains $\omega ^{1}$ and is by Claim 2 an atom of $\Sigma $. So $A^{c} \cap [\cap _{C\in X_{=1/2}^{1}\cup X_{>1/2}}C]=\varnothing $. Hence the set $\{A^{c}\}\cup X_{=1/2}^{1}\cup X_{>1/2}$ is inconsistent, so has a minimal inconsistent subset Y. As X is simple, $\left| Y\right| \le 2$. Now $\varnothing \not \in Y$ as $A^{c}\ne \varnothing $ (by $p(A^{c} )=1-p(A)>1/2$) and as all $B\in X_{=1/2}^{1}\cup X_{>1/2}$ are non-empty (by $p(B)\ge 1/2$). So $\left| Y\right| =2$. Further, Y is not a subset of $X_{=1/2}^{1}\cup X_{>1/2}$, as this set is consistent by Claim 1. So $Y=\{A^{c},B\}$ for some $B\in X_{=1/2}^{1}\cup X_{>1/2}$. As $A^{c}\cap B=\varnothing $ and as $p(A^{c})=1-p(A)>1/2$ and $p(B)\ge 1/2$, we have $p(A^{c}\cup B)=p(A^{c})+p(B)>1/2+1/2=1$, a contradiction.

Case 2 $p(A)=D^{\mathrm {lin}}(t_{1},\ldots ,t_{n})>1/2$. Then $D(t_{1},\ldots ,t_{n})=1$. Hence we must prove that $P_{P_{1},\ldots ,P_{n}}(A)=1$, i.e., that $P_{P_{1},\ldots ,P_{n}}(A^{c})=0$. The latter follows from Case 1 as applied to $A^{c}$, since $p(A^{c})=1-p(A)<1/2$.

Case 3 $p(A)=D^{\mathrm {lin}}(t_{1},\ldots ,t_{n})=1/2$. Then $D(t_{1},\ldots ,t_{n})=1/2$. So we must prove that $P_{P_{1},\ldots ,P_{n}}(A)=1/2$, i.e., that A contains exactly one of $\omega ^{1}$ and $\omega ^{2}$. As $p(A)=1/2$, exactly one of $X_{=1/2}^{1}$ and $X_{=1/2}^{2}$ contains A and the other one contains $A^{c}$, by Claim 1. Say $A\in X_{=1/2}^{1}$ and $A^{c}\in X_{=1/2}^{2}$ (the proof is analogous if instead $A\in X_{=1/2}^{2}$ and $A^{c}\in X_{=1/2}^{1}$). So $A\supseteq \cap _{C\in X_{=1/2}^{1}\cup X_{>1/2}}C$, whence $\omega ^{1}\in A$. Further, $\omega ^{2}\notin A$ because A is disjoint from $A^{c}$, hence from its subset $\cap _{C\in X_{=1/2} ^{2}\cup X_{>1/2}}C$ which contains $\omega ^{2}$.

Non-linearity on X Pooling cannot be linear, since otherwise for any fixed $A\in X\backslash \{\Omega ,\varnothing \}$ ($\ne \varnothing $) the collective probabilities $P_{P_{1},\ldots ,P_{n}}(A)$ could take any given values $t\in [0,1]$ (for instance by letting $P_{1}(A)=\cdots =P_{n}(A)=t$), a contradiction, since by definition $P_{P_{1},\ldots ,P_{n}}(A)\in \{0,1/2,1\}$.
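The pooling criterion D, as it can be read off from the three cases in the proof of independence, rounds the linear average $D^{\mathrm {lin}}$ to 0, 1/2 or 1. The sketch below illustrates this and the non-linearity argument. It assumes equal weights for $D^{\mathrm {lin}}$ unless others are passed, which the proof does not require, and the function names are hypothetical.

```python
from fractions import Fraction


def d_lin(ts, weights=None):
    """Linear average of the individual probabilities in `ts`."""
    n = len(ts)
    if weights is None:
        weights = [Fraction(1, n)] * n  # assumption: equal weights
    return sum(w * Fraction(t) for w, t in zip(weights, ts))


def d(ts, weights=None):
    """Threshold criterion: 0, 1/2 or 1 as d_lin is <, = or > 1/2."""
    avg = d_lin(ts, weights)
    half = Fraction(1, 2)
    if avg < half:
        return Fraction(0)
    if avg > half:
        return Fraction(1)
    return half


# Unanimous probability 3/10: linear pooling returns 3/10, D returns 0.
unanimous = [Fraction(3, 10)] * 3
print(d_lin(unanimous), d(unanimous))  # 3/10 0
print(d([Fraction(1, 4), Fraction(3, 4)]))  # 1/2
```

Since `d` only ever returns 0, 1/2 or 1, it cannot reproduce a unanimous value such as 3/10, which is the point of the non-linearity argument.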

Conditional-consensus-preservation on X Let $A,B\in X$ and $P_{1},\ldots ,P_{n}\in {\mathcal {P}}_{\Sigma }$ such that $P_{i}(A\cup B)=1$ for all i. We show that $P_{P_{1},\ldots ,P_{n}}(A\cup B)=1$, which establishes conditional-consensus-preservation on X by Proposition 1(a). For all i, $P_{i}(A)+P_{i}(B)\ge P_{i}(A\cup B)=1$, and hence $P_{i}(A)\ge 1-P_{i}(B)=P_{i}(B^{c})$. So, as $D^{\mathrm {lin}}:[0,1]^{n}\rightarrow [0,1]$ takes a linear form with non-negative coefficients and hence is weakly increasing in every component,

$$\begin{aligned} D^{\mathrm {lin}}(P_{1}(A),\ldots ,P_{n}(A))&\ge D^{\mathrm {lin}}\left( P_{1} \left( B^{c}\right) ,\ldots ,P_{n}\left( B^{c}\right) \right) \\&=D^{\mathrm {lin}}(\mathbf {1})-D^{\mathrm {lin}}\left( P_{1}(B),\ldots ,P_{n}(B)\right) \\&=1-D^{\mathrm {lin}}\left( P_{1}(B),\ldots ,P_{n}(B)\right) . \end{aligned}$$

So, with p as defined earlier, $p(A)\ge 1-p(B)$, i.e., $p(A)+p(B)\ge 1$. We distinguish between three cases.

Case 1 $p(A)>1/2$. Then, by the above proof of independence on X, $P_{P_{1},\ldots ,P_{n}}(A)=1$. So $P_{P_{1},\ldots ,P_{n}}(A\cup B)=1$, as desired.

Case 2 $p(B)>1/2$. Then, again by the above proof of independence on X, $P_{P_{1},\ldots ,P_{n}}(B)=1$. Hence, $P_{P_{1},\ldots ,P_{n}}(A\cup B)=1$, as desired.

Case 3 $p(A),p(B)\le 1/2$. Then, as $p(A)+p(B)\ge 1$, we have $p(A)=p(B)=1/2$. Let $X_{>1/2},X_{=1/2}^{1},X_{=1/2}^{2},\omega ^{1},\omega ^{2}$ be as defined above. Note that $A,B\in X_{=1/2}^{1}\cup X_{=1/2}^{2}$. It cannot be that A and B are both in $X_{=1/2}^{1}$: otherwise $A^{c}$ and $B^{c}$ are both in $X_{=1/2}^{2}$ by Claim 1, whence $p(A^{c}\cap B^{c})>0$ (again by Claim 1), a contradiction since

$$\begin{aligned} p\left( A^{c}\cap B^{c}\right) =p\left( \left( A\cup B \right) ^{c}\right) =1-p(A\cup B)=1-1=0 \end{aligned}$$

(where $p(A\cup B)=1$ because $p(A\cup B)=P_{P_{1},\ldots ,P_{n}}^{\mathrm {lin} }(A\cup B)$ and $P_{i}(A\cup B)=1$ for all i). Analogously, it cannot be that A and B are both in $X_{=1/2}^{2}$. So one of A and B is in $X_{=1/2}^{1}$ and the other one in $X_{=1/2}^{2}$; say $A\in X_{=1/2}^{1}$ and $B\in X_{=1/2}^{2}$ (the proof is analogous otherwise). So $A\supseteq \cap _{C\in X_{=1/2}^{1}\cup X_{>1/2}}C$ and $B\supseteq \cap _{C\in X_{=1/2} ^{2}\cup X_{>1/2}}C$, and hence $\omega ^{1}\in A$ and $\omega ^{2}\in B$. Thus $\omega ^{1},\omega ^{2}\in A\cup B$, whence $P_{P_{1},\ldots ,P_{n}}(A\cup B)=1$. $\square $
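The inequality $p(A)+p(B)\ge 1$ used in this proof, and the boundary situation of Case 3, can be checked on a small numerical example. This is only a sketch: the two probability functions `P1` and `P2`, the events `A` and `B`, and the equal-weight linear average are hypothetical choices made for illustration.

```python
from fractions import Fraction as F


def prob(p, event):
    """Probability of `event` under the finite distribution `p`."""
    return sum(p[w] for w in event)


def linear_pool(profile, event):
    """Equal-weight linear average (an assumption of this sketch)."""
    return sum(prob(p, event) for p in profile) / len(profile)


# Two hypothetical individuals on worlds 1..4; world 4 has probability 0.
P1 = {1: F(1, 2), 2: F(1, 4), 3: F(1, 4), 4: F(0)}
P2 = {1: F(0), 2: F(1, 4), 3: F(3, 4), 4: F(0)}
A, B = {1, 2}, {3}

# Everyone assigns probability 1 to the union of A and B.
print(prob(P1, A | B), prob(P2, A | B))  # 1 1

p_A = linear_pool([P1, P2], A)
p_B = linear_pool([P1, P2], B)
print(p_A, p_B, p_A + p_B >= 1)  # 1/2 1/2 True
```

Here both averages equal 1/2, so the example falls under Case 3 of the proof.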

A.6 Proof of Proposition 2

Consider the $\sigma $-algebra agenda $\Sigma $, and let $\left| \Sigma \right| >2^{3}=8$, i.e., $\left| \Sigma \right| \ge 2^{4} =16$. Then $\Sigma $ includes a partition of $\Omega $ into four non-empty events. Let X be the sub-agenda consisting of all unions of two of these four events. In the proof of Part I’s Proposition 2 we construct a pooling function for this agenda X which is neutral, consensus-preserving, and non-linear.^{Footnote 17} By Lemma 4, this pooling function is induced by a pooling function for agenda $\Sigma $ which, on X, is neutral, consensus-preserving, and non-linear. $\square $
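The sub-agenda X used in this proof can be enumerated explicitly for a concrete partition. The sketch below uses a hypothetical four-cell partition of $\{1,2,3,4\}$ and a hypothetical helper `two_cell_unions`; it only lists the events in X and does not reproduce the pooling function constructed in Part I.

```python
from itertools import combinations


def two_cell_unions(partition):
    """All unions of two distinct cells of `partition`."""
    cells = [frozenset(c) for c in partition]
    return {a | b for a, b in combinations(cells, 2)}


# Hypothetical partition of Omega into four non-empty events.
PARTITION = [{1}, {2}, {3}, {4}]
OMEGA = frozenset().union(*PARTITION)
X = two_cell_unions(PARTITION)

print(len(X))  # 6
# X is closed under complementation, as an agenda must be.
print(all(OMEGA - event in X for event in X))  # True
```

The six events form three complementary pairs, each event being the union of two cells and its complement the union of the other two.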

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Dietrich, F., List, C. Probabilistic opinion pooling generalized. Part two: the premise-based approach. Soc Choice Welf 48, 787–814 (2017). https://doi.org/10.1007/s00355-017-1035-y

Published: 10 April 2017
Issue Date: April 2017
DOI: https://doi.org/10.1007/s00355-017-1035-y
