1 Introduction

Social choice theory is the study of mechanisms for collective decision making [35]. This includes voting rules as mechanisms to collectively make political decisions, and consequently social choice theory is chiefly associated with the disciplines of political science and economics. But similar mechanisms can also be used to make decisions in multiagent systems, to coordinate the actions of individual agents, to resolve conflicts between them, and to bundle their information and expertise [8]. Closely related applications of social choice theory in computer science furthermore include recommender systems [29], Internet search engines [2], and crowdsourcing [21].

This widening of the scope of social choice theory has renewed interest in the formal foundations of the field. As we are designing ever more specialised social choice mechanisms for novel types of tasks, better tools to analyse the formal properties of these mechanisms are needed. Specifically, there is now a growing literature on the formal verification of social choice mechanisms by means of logical modelling and the use of techniques from automated reasoning [1, 5, 9, 12, 13, 16, 24, 34, 37, 38]. (We will review some of the contributions to this field in Sect. 6).

An obvious yardstick against which to measure different approaches to the formalisation of social choice frameworks is Arrow’s Theorem [3], the seminal result in the field, which shows that it is impossible to design preference aggregation mechanisms for three or more alternatives that are Pareto efficient and for which the relative ranking of two alternatives is based only on the rankings for the same two alternatives submitted by the individual voters. For instance, recent work has modelled the Arrovian framework in propositional logic [34], first-order logic [16], higher-order logic [24, 38], and a tailor-made modal logic [1]. Some of this work has resulted in methods to prove Arrow’s Theorem either automatically [34] or semi-automatically [24, 38], while other work has generated logical formalisations of the theorem that are easily accessible to humans and thus helpful in deepening our understanding of social choice [1, 16]. A shortcoming of the latter contributions, however, is that they have so far not resulted in a full proof of Arrow’s Theorem or similar results within the chosen logical framework itself.Footnote 1 Rather, such work has proceeded by showing that a given logical system is complete w.r.t. an appropriate class of models of social choice theory, thereby proving that a rendering of Arrow’s Theorem in the logical language in question must be a theorem of that logic. That is, such work has derived results about a given logic by means of reference to existing “semantic” proofs of Arrow’s Theorem. The ultimate goal of such research, however, must be the opposite: to use the logic to derive proofs for Arrow’s Theorem and similar results.

In this paper, we close this gap by providing a Hilbert-style syntactic proof of Arrow’s Theorem within a simple tailor-made modal logic that is shown to be complete. We have opted for a Hilbert calculus, rather than, say, an approach based on natural deduction, because Hilbert calculi are still the systems used most widely by modal logicians and thus facilitate comparison to proof systems for other logics, and because this choice allows for a particularly compact presentation of our assumptions. Having said this, other proof systems have other advantages (e.g., in view of readability of proofs or implementability) and thus certainly also have a place in the study of social choice theory. Our logic of choice is a fragment of the modal logic of social choice functions proposed by Troquard et al. [37]. Troquard et al. have used their (full) logic to reason about the strategy-proofness of voting rules (but it has not previously been applied to Arrow’s Theorem). This logic can be used to model a (resolute) social choice function (SCF), i.e., a function that maps any given profile of preference orders to a single winning alternative. While Arrow originally formulated his theorem for social welfare functions, i.e., functions that map any given profile of preference orders to a single social preference order [3], we will instead work with a standard variant of the theorem for SCF’s [35]. Arguably, SCF’s (returning a top alternative rather than a full ranking of all alternatives) are relevant to a wider range of applications. In any case, known techniques to prove either version of the theorem are very similar [12, 35]. Thus, our work also suggests how one might construct a similar syntatcic proof of Arrow’s Theorem for social welfare functions, using, for instance, a logic such as that of Ågotnes et al. [1].

Besides encoding and proving Arrow’s Theorem, we also cover two further seminal impossibility results from social choice theory, namely Sen’s Theorem [32] on the impossibility of a Paretian liberal and the Muller–Satterthwaite Theorem [23], thereby demonstrating the generality and flexibility of our approach. Both of these theorems have so far received only very little attention in the literature on logics for social choice, with the notable exception of the work of Tang and Lin [34]. Sen’s Theorem shows that the Pareto principle, by which unanimously held preferences should be respected, and a very weak form of liberalism, by which there should be certain private issues that only concern a single agent and that therefore should be dictated by that agent, are incompatible. The Muller–Satterthwaite Theorem shows that the only SCF’s that satisfy a particular—strong but intuitively appealing—form of monotonicity are the dictatorships and those SCF’s that bar certain alternatives from winning, even if they are preferred by all agents. Arguably, these are three of the four most important classical impossibility results in social choice theory. The fourth, the Gibbard–Satterhwaite Theorem [15, 31] on the impossibility of devising a strategy-proof SCF, is outside the scope of this paper as it requires us to model both declared preferences (as for the three theorems covered here) and actual preferences, so as to be able to distinguish truthful agents from agents engaging in strategic manipulation. The modal logic of SCF’s we are working with can only model one type of preference. This is intended and appropriate for our purposes. However, the full original logic of Troquard et al. [37] can model these two layers of preferences—indeed, this is the main objective it had been designed for originally. Our work, together with the fact that the Gibbard–Satterthwaite Theorem may be considered a relatively simple corollary to the Muller–Satterthwaite Theorem requiring only a proof showing that strategy-proofness implies strong monotonicity [12], therefore strongly suggests that proving the Gibbard–Satterthwaite Theorem in the full logic of Troquard et al. using an extension of our approach is possible in principle.

Our proofs are presented as human-readable recipes for how to construct a fully formal derivation inside the modal logic of SCF’s of the three impossibility theorems discussed. These recipes can be transformed into machine-readable proofs relatively easily, and it is therefore possible in principle to have the proofs verified automatically by a proof-checker for this logic. In this sense, our contribution narrows the gap between, on the one hand, work on logics for modelling social choice [1, 16, 37] and, on the other, work on automated reasoning for social choice [9, 13, 24, 34, 38]. Having said this, there currently is no work on automated theorem proving for the modal logic we are working with, so while narrowed, aforementioned gap has not yet been fully closed. As a further step in this direction, we also discuss how to translate from modal logic into propositional logic. While this does result in a blow-up of the size of the representation of theorems (meaning that we lose readability for humans) it makes it possible for us to use standard tools, particularly SAT solvers, to automatically reason about these theorems. This perspective provides a close connection to the approach pioneered by Tang and Lin [34], and later refined by others [9, 13], of automatically proving results in social choice theory using SAT solvers.

The remainder of this paper is organised as follows. Section 2 recalls the definition of a SCF, and then introduces our logic of SCF’s and establishes completeness for it. This is followed up in Sect. 3, where we show how various concepts of interest for social choice theory can be modelled in this logic. This includes a discussion of the universal domain assumption and encodings of desirable properties of SCF’s, such as Pareto efficiency and monotonicity. The three theorems are encoded and then proved in Sect. 4. The translation into propositional logic, offering a means of implementation via a SAT solver, is presented in Sect. 5. Finally, Sect. 6 discusses related work in some detail and Sect. 7 concludes.

2 A modal logic of social choice functions

In this section, we recall the formal definition of a SCF and introduce the fragment of the logic put forward by Troquard et al. [37] required to define such a SCF, adapting some of their notation and terminology to our purposes. We then demonstrate that the known completeness theorem for the full logic extends to the fragment that is of interest to us here. Finally, we discuss the limitations of this logic in view of expressing properties of families of SCF’s ranging over electorates of varying size, as well as how to overcome these limitations in practice.

2.1 Social choice functions

Let \(N=\{1,\ldots ,n\}\) be a finite set of agents (or individuals) and let X be a finite set of alternatives (or candidates). To vote, each agent \(i\in N\) expresses her preferences by supplying a linear order \(\succcurlyeq _i\) over X, i.e., a binary relation that is reflexive, antisymmetric, complete, and transitive.Footnote 2 Let \(\mathscr {L}(X)\) denote the set of all such linear orders. We shall also refer to \(\succcurlyeq _i\) as the ballot provided by agent i, to stress the fact that this is the preference declared by the agent, but not necessarily her true preference. A profile is an n-tuple \((\succcurlyeq _1,\ldots ,\succcurlyeq _n)\in \mathscr {L}(X)^n\) of such ballots, one for each agent.

Definition 1

A resolute social choice function is a function \(F : \mathscr {L}(X)^n \rightarrow X\) mapping any given profile of ballots to a single winning alternative.

Examples for resolute SCF’s are well-known voting rules, such as the Borda rule or the plurality rule [35]—when combined with a suitable tie-breaking rule that ensures that there always is just a single winner. Under the Borda rule, for instance, an agent assigns as many points to a given alternative as she lists other alternatives below it (with the alternatives obtaing the most points winning). Ties may be broken, for instance, by using the ballot of the first agent.

2.2 Language

Troquard et al. [37] have introduced a modal logic, which they call \({\varLambda }^{\mathrm{scf}}[N,X]\), to reason about resolute SCF’s (mapping declared preferences to winners) as well as the agents’ truthful preferences. This logic can be used to model strategic behaviour in voting. Here we are not specifically interested in this strategic component, but rather in the purely aggregative aspect of social choice, i.e., in the question of whether a given SCF fairly aggregates individual ballots into a social decision. For the purposes of the present paper, we shall refer to the relevant fragment of the logic of Troquard et al. as L[NX], the logic of SCF’s parametrised by N and X. Next, we define the language, i.e., the set of well-formed formulas, of this logic.

This language is built on top of two types of atomic propositions. First, for every \(i\in N\) and \(x,y\in X\), \(p^i_{x\succcurlyeq y}\) is an atomic proposition (with the intuitive meaning that agent i prefers x to y). \({\textit{Pref}}\,[N,X]{:}= \{p^i_{x\succcurlyeq y} \mid i\in N\ \text{ and }\ x,y\in X\}\) is the set of all such propositions. Second, by a slight abuse of notation, every alternative \(x\in X\) is also an atomic proposition (with the intuitive meaning that x wins). Besides the usual propositional connectives, we have a modal operator \(\Diamond _C\) for every coalition of agents \(C\subseteq N\) (with the intuitive meaning that C can ensure the truth of a given formula, provided the others do not alter their ballots). The following definition summarises how the language is constructed.

Definition 2

The set of well-formed formulas \(\varphi \) in the language of L[NX] is generated by the following Backus-Naur Form (where \(p\in {\textit{Pref}}\,[N,X]\), \(x\in X\) and \(C\subseteq N\)):

$$\begin{aligned} \varphi \quad :{:}= \quad p \mid x \mid \top \mid \lnot \varphi \mid \varphi \vee \varphi \mid \Diamond _C\varphi \end{aligned}$$

Additional propositional connectives and a dual modal operator are defined in the usual manner: \(\varphi \wedge \psi \) is short for \(\lnot (\lnot \varphi \vee \lnot \psi )\), \(\varphi \rightarrow \psi \) is short for for \(\lnot \varphi \vee \psi \), \(\varphi \leftrightarrow \psi \) is short for for \((\varphi \rightarrow \psi )\wedge (\psi \rightarrow \varphi )\), \(\bot \) is short for \(\lnot \top \), and \(\Box _C\varphi \) is short for \(\lnot \Diamond _C\lnot \varphi \). For \(i\in N\), we write \(\Diamond _i\) as a shorthand for \(\Diamond _{\{i\}}\) and \(\Box _i\) as a shorthand for \(\Box _{\{i\}}\).

The full logic of Troquard et al. [37] includes an additional pair of modal operators to speak about true preferences.

2.3 Semantics

The semantics of the logic is a standard possible-worlds semantics for modal logics, defined in terms of a set of possible worlds, a family of accessibility relations, and a valuation function [6]. We first give a short high-level description intended for readers familiar with such semantics, and then provide complete formal definitions.

First, the set of possible worlds is the set of all possible profiles—which is fully determined by N and X. The semantics of atomic propositions of the form \(p^i_{x\succcurlyeq y}\) will be defined solely in terms of this set of possible worlds: \(p^i_{x\succcurlyeq y}\) is true at a given world/profile w, if agent i prefers x to y in w. Only to model the truth of atomic propositions of the form x will we require a valuation function. Valuation functions here are SCF’s: x is true at world/profile w if the SCF in question maps profile w to the winning alternative x. Finally, for every coalition \(C\subseteq N\), there is an accessibility relation between worlds/profiles: w is connected to \(w'\) if they differ only w.r.t. the preferences of agents in C. These accessibility relations will be used to define the semantics of modal formulas of the form \(\Diamond _C\varphi \) in the usual manner.

Definition 3

A model is a triple \(M=\langle N,X,F\rangle \), consisting of a finite set of agents N with \(n=|N|\), a finite set of alternatives X, and a SCF \(F:\mathscr {L}(X)^n\rightarrow X\).

For fixed sets N and X, we sometimes write \(M_F\) for the model \(M=\langle N,X,F\rangle \) based on the SCF F. From now on we shall use the terms ‘world’ and ‘profile’ interchangeably. We are now ready to define what it means for a formula \(\varphi \) to be true at a world \(w=(\succcurlyeq _1,\ldots ,\succcurlyeq _n)\) in a given model M.

Definition 4

Let \(M=\langle N,X,F\rangle \) be a model. We write \(M,w\models \varphi \) to express that the formula \(\varphi \) is true at the world \(w=(\succcurlyeq _1,\ldots ,\succcurlyeq _n)\in \mathscr {L}(X)^n\) in M. The satisfaction relation \(\models \) is defined inductively:

  • \(M,w \models p^i_{x\succcurlyeq y}\) iff \(x\succcurlyeq _i y\)

  • \(M,w \models x\) iff \(F(\succcurlyeq _1,\ldots ,\succcurlyeq _n)=x\)

  • \(M,w \models \lnot \varphi \) if \(M,w\not \models \varphi \)

  • \(M,w \models \varphi \vee \psi \) iff \(M,w\models \varphi \) or \(M,w\models \psi \)

  • \(M,w \models \Diamond _C\varphi \) iff \(M,w' \models \varphi \) for some world \(w'\!=\!{(\succcurlyeq '_1,\ldots ,\succcurlyeq '_n)}\in \mathscr {L}(X)^n\) with \({\succcurlyeq _i}={\succcurlyeq '_i}\) for all agents \(i\in N{\setminus } C\).

That is, \(\Diamond _C\varphi \) is true at w, if the agents in C can make \(\varphi \) true by changing their own ballots (assuming none of the other agents change as well). Thus, \(\Box _C\varphi \) is true at w if \(\varphi \) holds at every world that is reachable from w by only the agents in C changing their ballots.

In some sense, the truth of every formula of the form \(p^i_{x\succcurlyeq y}\) is under the control of agent i. Because of this feature, this kind of logic is sometimes classified as a logic of propositional control. The motivation underlying such logics is essentially game-theoretic: every individual is conceived as having “control” over a set of atomic propositions. The choice of a particular truth value for these atomic propositions can be seen as an action of the individual, and therefore a valuation of all the atomic propositions of this sort corresponds to a strategy profile. For more details and motivations on logics of propositional control we refer to the work of van der Hoek and Wooldridge [19], Gerbrandy [14], Balbiani et al. [4] and Troquard et al. [37], amongst others. We also note that these logics are closely related to Pauly’s coalition logic [27], Boolean games [7, 18], and the Ceteris Paribus Logic of Grossi et al. [17].

Let \(\varphi \) be a formula in the language based on N and X. Then \(\varphi \) is called satisfiable, if there exist a SCF F and a world \(w\in \mathscr {L}(X)^n\) such that \(M_F,w\models \varphi \). It is called true in the model M, denoted \(M\models \varphi \), if \(M,w\models \varphi \) for every world \(w\in \mathscr {L}(X)^n\). Finally, it is called valid, denoted \(\models \varphi \), if \(M\models \varphi \) for every model M based on N and X.

The logic of Troquard et al. [37] is known to be decidable and this result immediately extends to the fragment of their logic discussed here:

Proposition 1

Determining whether a formula in the language of L[NX] is valid is a decidable problem.

Proof

Since N and X are fixed, we can enumerate all models and check for each of them whether our formula is true at every world in the model. \(\square \)

2.4 Axiomatisation and completeness

Next, we review the axiomatisation due to Troquard et al. [37], restricted to the fragment L[NX] discussed here, and then adapt their completeness result to this fragment. The first few axioms ensure that the propositions of the form \(p^i_{x\succcurlyeq y}\) really encode linear orders.

  1. (1)

    \(p^i_{x\succcurlyeq x}\)    (reflexivity)

  2. (2)

    \(p^i_{x\succcurlyeq y} \leftrightarrow \lnot p^i_{y\succcurlyeq x}\) for \(x\not =y\)    (antisymmetry and completeness)

  3. (3)

    \(p^i_{x\succcurlyeq y}\wedge p^i_{y\succcurlyeq z} \rightarrow p^i_{x\succcurlyeq z}\)    (transitivity)

Here x, y and z range over atomic propositions in X, and i ranges over agents. Before we continue with the axiomatisation, let us first introduce a couple of additional language constructs to refer to ballots and profiles within the logical language. Consider a profile \(w=(\succcurlyeq _1,\ldots ,\succcurlyeq _n)\in \mathscr {L}(X)^n\). For a given agent \(i\in N\), let \(x_1,x_2,\ldots ,x_m\) be a permutation of the elements of X such that \(x_1\succcurlyeq _i x_2 \succcurlyeq _i \cdots \succcurlyeq _i x_m\). Then \({\textit{ballot}}_i(w)\) is defined as the following formula:

$$\begin{aligned} {\textit{ballot}}_i(w):=p^i_{x_1\succcurlyeq x_2} \wedge p^i_{x_2\succcurlyeq x_3} \wedge \cdots \wedge p^i_{x_{m-1}\succcurlyeq x_m} \end{aligned}$$

Thus, \({\textit{ballot}}_i(w)\) is true at world \(w'\) if and only if w and \(w'\) agree as far as the ballot of agent i is concerned. Note that \({\textit{ballot}}_i(w)\) is a purely syntactic representation of a semantic notion (namely, agent i’s preference order \(\succcurlyeq _i\)). Similarly, we define \({\textit{profile}}(w)\) as the following formula:

$$\begin{aligned} {\textit{profile}}(w):={\textit{ballot}}_1(w) \wedge {\textit{ballot}}_2(w) \wedge \cdots \wedge {\textit{ballot}}_n(w) \end{aligned}$$

Hence, the formula \({\textit{profile}}(w)\) is true at world w, and only there. This shows that nominals, i.e., formulas uniquely identifying worlds [6], are definable within this logic. Furthermore, due to the finiteness of X and N, there can be only finitely many formulas of type \({\textit{profile}}(w)\) that are consistent with the axioms.

Let \(N^w_{x\succcurlyeq y} :{=} \{i\in N \mid x\succcurlyeq _i y\}\) denote the set of agents that prefer x over y in profile \(w=(\succcurlyeq _1,\ldots ,\succcurlyeq _n)\). By a slight abuse of notation, we use the same expression as a construct of our language:

$$\begin{aligned} N^w_{x\succcurlyeq y}:{=}\bigwedge \left\{ p^i_{x\succcurlyeq y} \mid x\succcurlyeq _i y\ \text {in}\ w\right\} \end{aligned}$$

We write \(N^w_{x\succcurlyeq y}\) to denote both the set of agents and the formula; the context will disambiguate the intended meaning. Note that \(\bigwedge _{x,y\in X} N^w_{x\succcurlyeq y}\) is logically equivalent to \({\textit{profile}}(w)\): this reflects the fact that a profile can either be presented by specifying the preferences of each individual or by specifying the sets of agents preferring one alternative over another, for all pairs of alternatives.

For any two alternatives \(x,y\in X\), we define \({\textit{profile}}(w)(x,y)\) as the formula fixing the relative ordering of x and y for all agents as in profile w:

$$\begin{aligned} {\textit{profile}}(w)(x,y):=N^w_{x\succcurlyeq y} \wedge N^w_{y\succcurlyeq x} \end{aligned}$$

This formula will be used to express the fact that two profiles ‘agree’ on the preferences concerning the alternatives x and y.

We now state the remaining axioms defining the logic L[NX]:

  1. (4)

    all propositional tautologies

  2. (5)

    \(\Box _i(\varphi \rightarrow \psi )\rightarrow (\Box _i\varphi \rightarrow \Box _i\psi ) \quad \) (K(i))

  3. (6)

    \(\Box _i\varphi \rightarrow \varphi \quad \) (T(i))

  4. (7)

    \(\varphi \rightarrow \Box _i\Diamond _i\varphi \quad \) (B(i))

  5. (8)

    \(\Diamond _i\Box _j\varphi \leftrightarrow \Box _j\Diamond _i\varphi \quad \)(confluence)

  6. (9)

    \(\Box _{C_1}\Box _{C_2}\varphi \leftrightarrow \Box _{C_1\cup C_2} \varphi \quad \) (union)

  7. (10)

    \(\Box _\emptyset \varphi \leftrightarrow \varphi \quad \) (empty coalition)

  8. (11)

    \((\Diamond _i p\wedge \Diamond _i\lnot p)\rightarrow (\Box _j p\vee \Box _j\lnot p)\), where \(i\not =j \quad \) (exclusiveness)

  9. (12)

    \(\Diamond _i {\textit{ballot}}_i(w) \quad \) (ballot)

  10. (13)

    \(\Diamond _{C_1}\delta _1\wedge \Diamond _{C_2}\delta _2\rightarrow \Diamond _{C_1\cup C_2} (\delta _1\wedge \delta _2) \quad \) (cooperation)

  11. (14)

    \(\bigvee _{x\in X}(x\wedge \bigwedge _{y\in X{\setminus }\{x\}} \lnot y) \quad \) (resoluteness)

  12. (15)

    \(({\textit{profile}}(w)\wedge \varphi )\rightarrow \Box _N({\textit{profile}}(w)\rightarrow \varphi ) \quad \) (functionality)

Here \(\varphi \) and \(\psi \) range over arbitrary formulas, x over atomic propositions in X, i and j over agents, \(C_1\) and \(C_2\) over coalitions, and w over profiles. In axiom (11), p is ranging only over atomic propositions in the set \({\textit{Pref}}\,[N,X]\), and in axiom (13) \(\delta _1\) and \(\delta _2\) do not contain any common atoms.

Axioms (4)–(8) describe well-known properties of normal modal logics [6]. Axiom (9) describes the capability of a coalition to enforce a certain formula in terms of the capabilities of its sub-coalitions. Axiom (10) states that the empty coalition cannot enforce any formula. Axiom (11) enforces a division among the atomic propositions of the shape \(p^i_{x\succcurlyeq y}\): if an atom is controlled by an agent i, then other agents cannot change its value. Axiom (12) ensures that every agent can express every possible preference. Due to axiom (13), if two formulas \(\delta _1\) and \(\delta _2\) do not contain a common atom and two coalitions \(C_1\) and \(C_2\) can each enforce one of the formulas, then the joint coalition can enforce the conjunction \(\delta _1\wedge \delta _1\). Axiom (14) expresses that any outcome associated with a profile must be a single winning alternative. Thus, this axioms encodes the resoluteness of the SCF in question. Finally, axiom (15) ensures that every profile is associated with a single outcome, i.e., it encodes the fact that the SCF being modelled must be a function.

The inference rules of the logic are modus ponens and necessitation w.r.t. all modalities of the form \(\Box _i\) [6]:

  • (MP) from \(\varphi \rightarrow \psi \) and \(\varphi \), infer \(\psi \)

  • (Nec\(_i\)) from \(\vdash \varphi \), infer \(\vdash \Box _i\varphi \)

Here we write \(\vdash \varphi \) to express that a well-formed formula \(\varphi \) in the language parametrised by N and X is a theorem of the logic L[NX], in the sense that it can be derived from axioms (1)–(15), together with the above inference rules. The \(\vdash \varphi \) appearing in the second rule thus indicates that the rule can only be applied to theorems. We define a set of formulas \(\varGamma \) to be consistent if we cannot derive a contradiction from it. The theorems of L[NX] coincide with the valid formulas:

Theorem 1

(Completeness) The logic L[NX] is sound and complete w.r.t. the class of models of SCF’s.

Proof

(sketch) Since our logic is a fragment of \({\varLambda }^{\mathrm{scf}}[N,X]\), the soundness result due to Troquard et al. [37] applies directly. The same is not true for completeness. However, as we shall outline next, mutatis mutandis, the proof of Troquard et al. [37] for the richer logic can be adapted to our fragment.

First, we show the existence of an isomorphism between the models of Definition 3 and particular Kripke models. The latter structures are tuples \(\langle W, (R_C)_{C\subseteq N}\rangle \) where W is the set of profiles and \(R_C\subseteq W\times W\) are relations defined as

$$\begin{aligned} w R_C w' \quad \hbox {iff}\quad {w\upharpoonright N{\setminus } C} = {w'\upharpoonright N{\setminus } C}, \end{aligned}$$

where \(w\upharpoonright N{\setminus } C\) is the profile w restricted to only the individuals outside of C. Intuitively, \(w R_C w'\) holds if all the agents outside of C express the same preferences in w and \(w'\).

Second, given a consistent formula \(\varphi \), we build a maximally consistent set \(\varGamma _\varphi \) containing it using the usual Lindenbaum construction.

Define \(\text { Cluster}(\varGamma _\varphi )\) to be the set of maximally consistent sets that describe the same SCF:

Finally, we consider the submodel of the canonical model generated by \(\text { Cluster}(\varGamma _\varphi )\). Let us call this submodel \(M_\varphi \). It remains to be checked that:

  • the Truth Lemma holds for \(M_\varphi \),

  • there is a bijection between profiles and states of \(M_\varphi \),

  • \(M_\varphi \) is one of the aforementioned particular Kripke models corresponding to the models of our logic.

The first item is shown in the customary way, while the other items are proven by exploiting the axioms. \(\square \)

2.5 Representing families of social choice functions

To complete the outline of the expressive capabilities of L[NX], we illustrate how it is possible to encode a SCF as a formula. Given a SCF F, its representation will be:

$$\begin{aligned} \rho ^F = \bigwedge \left\{ {\textit{profile}}(w)\rightarrow x \mid w\in \mathscr {L}(X)^n\ \text{ and }\ F(w)=x \right\} \end{aligned}$$

That is, \(\rho ^F\) is simply the conjunction, over all profiles w, of implications between a formula describing w and a formula identifying the winning alternative for profile w under F. In other words, we need to have the full graph of the function, that is, the full set of input-output pairs, to be able to encode F in the language. This is indeed possible, because, strictly speaking, \(\rho ^F\) represents the function only for a fixed number of alternatives and a fixed number of agents. Moreover, since we are able to encode any set of input-output pairs, we can represent any SCF in the language.

Unfortunately, for the very same reason, \(\rho ^F\) cannot be taken as a proper representative of a SCF, because it only tells us what the output of the function is in a very limited case: when the alternatives are exactly those in X and when the agents are exactly those in N. In practice, however, we are interested in families of SCF’s. If, say, F is the Borda rule and X and N both have cardinality 3, then \(\rho ^F\) will only express the workings of the Borda rule for three alternatives and three agents. A full representation of the Borda rule (which formally is a family of SCF’s in the sense of Definition 1), however, should contain the information necessary to compute the output from any given profile. It should be a conjunction of all the formulas \(\rho ^F\) for all possible choices of X and N. But even assuming that we had all such sets of pairs, there are countably many \(\rho \)’s of this kind, and our logical language does not contain countable conjunctions. Given that the language is not powerful enough to encode an algorithmic specification, there is no hope that our logic, or a similar logic, will do better than using \(\rho ^F\) in representing SCF’s. Indeed, this restriction to specific sets of alternatives and agents is a recognised limitation of most existing logic-based approaches to modelling frameworks of social choice [12].

Interestingly, however, this problem affects the representations of the properties of SCF’s only partially. Since most of the properties do not directly refer to the specific number of alternatives and agents, we can formulate the properties leaving X and N as parameters. The same can be done when proving the relative dependencies between properties. This means that, to prove that property \(P_1\) entails \(P_2\), we prove that, for fixed choices of X and N, there is a proof in the logic from the formula encoding \(P_1\) to the formula encoding \(P_2\) (both these formulas are instantiated to X and N themselves). This is the approach we shall take here.

3 Modelling features of social choice theory

In this section, we show how to model several important concepts of social choice theory in our logic. We start by proving the Universal Domain Lemma, which demonstrates that there exists a formula in our language that expresses that for every possible preference profile there exists a world where it is realised, and that is a theorem of our logic. This simple but important result will be used throughout the paper. We then pause to introduce and encode a notion that will feature in several properties and proofs in the next (sub)sections, the concept of a decisive coalition. Finally, we formalise the main properties featuring in the classical impossibility theorems we want to prove, particularly Pareto efficiency, independence of irrelevant alternatives, strong monotonicity, and liberalism. For each property we suggest an encoding in the logic and prove that it indeed captures the corresponding semantic notion.

Throughout, we exploit freely the finiteness of the language: we will use big conjunctions and disjunctions to quantify over individuals, alternatives, and profiles.

3.1 The Universal Domain Lemma

The following lemma states that all the possible profiles are also possible worlds in the semantics. This fact, which is implicit in our definition of a SCF, is called the universal domain condition in Arrow’s original work [3].

Lemma 1

(Universal Domain Lemma) For every possible profile \(w\in \mathscr {L}(X)^n\), we have that \(\vdash \Diamond _N {\textit{profile}}(w)\).

Proof

Take any profile w. Then \({\textit{ballot}}_1(w)\) encodes the preferences of the first agent. We have, by axiom (12), that \(\Diamond _1 {\textit{ballot}}_1(w)\), and similarly for the second agent we get \(\Diamond _2 {\textit{ballot}}_2(w)\). Because \({\textit{ballot}}_1(w)\) and \({\textit{ballot}}_2(w)\) contain different atoms (the former only atoms with superscript 1, the latter only atoms with superscript 2), we can apply axiom (13) and obtain \(\Diamond _{\{1,2\}}({\textit{ballot}}_1(w)\wedge {\textit{ballot}}_2(w))\). We can repeat this reasoning for all the finitely many agents in N to prove \(\Diamond _N{\textit{profile}}(w)\). \(\square \)

3.2 Decisive coalitions

We will call a coalition of agents \(C\subseteq N\) decisive over a pair of alternatives \((x, y)\in X^2\) if the members of C preferring x to y is a sufficient condition for preventing y from winning. We use the following formula to encode decisiveness of C over (xy):

$$\begin{aligned} C{\textit{dec}}(x,y):{=}\left( \bigwedge _{i\in C} p^i_{x\succcurlyeq y}\right) \rightarrow \lnot y \end{aligned}$$

If C is decisive on every pair, we will simply write \(C{\textit{dec}}\). Along the same lines, we define a weakly decisive coalition C for (xy) as a coalition that can bar y from winning if exactly the agents in C prefer x to y. We encode weak decisiveness of C over (xy) as follows:

$$\begin{aligned} C{\textit{wdec}}(x,y):{=}\left( \bigwedge _{i\in C} p^i_{x\succcurlyeq y} \wedge \bigwedge _{i\not \in C} p^i_{y\succcurlyeq x} \right) \rightarrow \lnot y \end{aligned}$$

The reader can easily check that these syntactic notions match the semantic ones; for example, in the case of decisiveness we have that \(C{\textit{dec}}(x,y)\) is true in the model \(M_F\) if and only if the coalition C is decisive over that pair of alternatives for the corresponding SCF F.

3.3 Pareto efficiency

We now introduce several properties that one might reasonably want to require a SCF to satisfy. The first is Pareto efficiency, expressing the desideratum that, if all the agents rank an alternative x above another alternative y, then y should not win.

Definition 5

A SCF F is Pareto efficient if, for every profile \(w\in \mathscr {L}(X)^n\) and every pair of distinct alternatives \(x,y\in X\) with \(N^w_{x\succcurlyeq y} = N\), we obtain \(F(w)\ne y\).

This is formalised as follows:

$$\begin{aligned} {\textit{Par}}:{=}\bigwedge _{x\in X} \bigwedge _{y\in X{\setminus }\{x\}} \left[ \left( \bigwedge _{i\in N} p^i_{x\succcurlyeq y}\right) \rightarrow \lnot y \right] \end{aligned}$$

Observe that \({\textit{Par}}\) is equivalent to \(N{\textit{dec}}\), i.e., to saying that the grand coalition N is decisive on every pair.

Lemma 2

For every SCF F, \(M_F\vDash {\textit{Par}}\) if and only if F is Pareto efficient.

Proof

Straightforward. \(\square \)

3.4 Independence of irrelevant alternatives

Our next property of interest is independence of irrelevant alternatives (IIA). It expresses the intuitively desirable property of a SCF F that, for every two profiles and for every two alternatives x and y, if the outcome of F in the first profile is x and the two profiles are identical as far as the preferences of the agents over x and y are concerned, then the outcome of F in the second profile should not be y. The original formulation of IIA given by Arrow [3] was applied to social welfare functions rather than SCF’s. Our definition is the most natural adaptation of Arrow’s idea to SCF’s. It has also been used by Taylor [35], amongst others.

Definition 6

A SCF F satisfies IIA if, for every pair of profiles \(w, w'\in \mathscr {L}(X)^n\) and every pair of distinct alternatives \(x,y\in X\) with \(N^w_{x\succcurlyeq y}=N^{w'}_{x\succcurlyeq y}\), it is the case that \(F(w)=x\) implies \(F(w')\ne y\).

We formalise this property in our logic as follows:

$$\begin{aligned} {\textit{IIA}}:{=}\bigwedge _{w\in \mathscr {L}(X)^n}\bigwedge _{x\in X}\bigwedge _{y\in X{\setminus }\{x\}} \left[ \Diamond _N({\textit{profile}}(w)\wedge x)\rightarrow ({\textit{profile}}(w)(x,y)\rightarrow \lnot y) \right] \end{aligned}$$

That is, if in some world (reachable via the \(\Diamond _N\)-modality) we observe profile w with alternative x winning, then in the present world, if it agrees with w as far as the relative ranking of x and y is concerned, y cannot be the winner.

The following lemma formally establishes the correspondence between the syntactic formulation of IIA and its semantic definition.

Lemma 3

For every SCF F, \(M_F\vDash {\textit{IIA}}\) if and only if F satisfies the property of independence of irrelevant alternatives.

Proof

From right to left, assume F satisfies IIA. We want to prove every conjunct of the formula \({\textit{IIA}}\). So take any generic world \(w'\) such that \(M_F,w'\vDash \Diamond _N({\textit{profile}}(w)\wedge x)\). We want to show that \(M_F,w' \vDash ({\textit{profile}}(w)(x,y)\rightarrow \lnot y)\). So suppose \(M_F,w'\vDash {\textit{profile}}(w)(x,y)\), which entails \(N^w_{x\succcurlyeq y} = N^{w'}_{x\succcurlyeq y}\). By the semantics of \(\Diamond _N\), there is a world \(w''\) such that \(M_F, w''\vDash {\textit{profile}}(w)\wedge x\), which entails \(N^w_{x\succcurlyeq y} = N^{w''}_{x\succcurlyeq y}\). Thus, also \(N^{w'}_{x\succcurlyeq y} = N^{w''}_{x\succcurlyeq y}\). From \(M_F,w''\vDash x\) we can infer \(F(w'')=x\). Now we can apply IIA to \(w''\) and \(w'\) and obtain \(F(w')=x\) and thus \(F(w')\not =y\). Again by the semantics, this is tantamount to \(M_F, w'\vDash \lnot y\).

From left to right, assume \(M_F\vDash {\textit{IIA}}\). Take any two profiles \(w, w'\) and two alternatives xy with \(N^w_{x\succcurlyeq y}=N^{w'}_{x\succcurlyeq y}\). Now assume \(F(w)= x\). We thus have \(M_F, w\vDash {\textit{profile}}(w)\wedge x\) and, by the semantics of \(\Diamond _N\), also \(M_F, w'\vDash \Diamond _N({\textit{profile}}(w)\wedge x)\). Using modus ponens and formula \({\textit{IIA}}\), we get \(M_F, w'\vDash ({\textit{profile}}(w)(x,y)\rightarrow \lnot y)\). But we assumed \(N^w_{x\succcurlyeq y}=N^{w'}_{x\succcurlyeq y}\), hence \(M_F, w'\vDash {\textit{profile}}(w)(x,y)\) and thus \(M_F, w'\vDash \lnot y\), which by the semantics entails \(F(w')\ne y\). \(\square \)

3.5 Strong monotonicity

Next is a monotonicity property known as Maskin monotonicity or strong monotonicity. It requires that, whenever alternative x wins in a given profile and we (weakly) improve the standing of x vis-à-vis all other alternatives, then x should still win in the new profile—even if the relative rankings of other alternatives change in the profile as well. While its formal definition is similar to that of IIA, there are subtle differences: we are now quantifying over all other alternatives y rather than considering one specific such alternative.

Definition 7

A SCF F is strongly monotonic if, for every pair of profiles \(w, w'\in \mathscr {L}(X)^n\) and every alternative \(x\in X\), it is the case that \(F(w)=x\) and \(N^w_{x\succcurlyeq y}\subseteq N^{w'}_{x\succcurlyeq y}\) for all \(y\in X{\setminus }\{x\}\) together imply \(F(w')=x\).

This property can be encoded as follows:

$$\begin{aligned} {\textit{SM}}:{=} \bigwedge _{w\in \mathscr {L}(X)^n} \bigwedge _{x\in X} \left[ \Diamond _N({\textit{profile}}(w)\wedge x) \;\wedge \; \left( \bigwedge _{y\in X{\setminus }\{x\}}N^w_{x\succcurlyeq y}\right) \;\rightarrow \; x \right] \end{aligned}$$

Lemma 4

For every SCF F, \(M_F\vDash SM\) if and only if F is strongly monotonic.

Proof

From left to right, suppose \(M_F\vDash SM\) is the case. Now, fix two profiles w and \(w'\) and an alternative x, assume that \(F(w)=x\), and assume that \(N^w_{x\succcurlyeq y}\subseteq N^{w'}_{x\succcurlyeq y}\) for all \(y\in X{\setminus }\{x\}\). Due to \(F(w)=x\), we have \(M_F, w\vDash {\textit{profile}}(w)\wedge x\) and, by the semantics of \(\Diamond _N\), also \(M_F, w'\vDash \Diamond _N({\textit{profile}}(w)\wedge x)\). By the second assumption, namely \(N^w_{x\succcurlyeq y}\subseteq N^{w'}_{x\succcurlyeq y}\), we obtain that the second conjunct of \({\textit{SM}}\), namely \(\bigwedge _{y\in X{\setminus }\{x\}} N^w_{x\succcurlyeq y} \), is also true at \(w'\). From the validity of \({\textit{SM}}\) we can conclude \(M_F, w'\vDash x\) and hence \(F(w')=x\).

From right to left, say that F is strongly monotonic and fix w and x. Take a generic profile \(w'\) and assume \(M_F, w'\vDash \Diamond _N[x\wedge {\textit{profile}}(w)]\wedge \bigwedge _{y\in X{\setminus }\{x\}} N^w_{x\succcurlyeq y}\). Due to the first conjunct we know that \(F(w)=x\), while in light of the second we can conclude that \(N^w_{x\succcurlyeq y}\subseteq N^{w'}_{x\succcurlyeq y}\) for all \(y\in X{\setminus }\{x\}\), because by the semantics all the supporters of x over y in w still support x over y in \(w'\). By strong monotonicity we get \(F(w')=x\) and \(M_F, w'\vDash x\). Since \(w'\) was generic we can conclude that \({\textit{SM}}\) is a validity in \(M_F\). \(\square \)

3.6 Surjectivity

Our most basic property is surjectivity. It expresses the desideratum that every alternative should be the winner for at least one profile.

Definition 8

A SCF F is surjective if, for every alternative \(x\in X\) there exists a profile \(w\in \mathscr {L}(X)^n\) such that \(F(w) =x\).

We can encode surjectivity as follows:

$$\begin{aligned} {\textit{Sur}}:{=}\bigwedge _{x\in X} \bigvee _{w\in \mathscr {L}(X)^n} \Diamond _N({\textit{profile}}(w)\wedge x) \end{aligned}$$

Lemma 5

For every SCF F, \(M_F\vDash {\textit{Sur}}\) if and only if F is surjective.

Proof

Straightforward. \(\square \)

3.7 Liberalism

The idea that a form of liberalism can be modelled as a property of SCF’s is due to Sen [32]. He postulated that every agent should have the power to determine the relative ranking of at least two alternatives x and y. For example, x might be the state of the world in which Barack Obama is president of the United States of America and you paint the walls of your bedroom in pink, and y might be the state of the world where Barack Obama is president of the United States of America and you paint the walls of your bedroom in white. Then you should have the power of excluding one of x and y from being the collectively chosen alternative (which of course does not mean that the other one of the two necessarily needs to be chosen). In this case, we say that you are (two-way) decisive on x and y.

Definition 9

A SCF F satisfies the property of liberalism if, for every individual \(i\in N\) there exist two distinct alternatives \(x,y\in X\) for which i is two-ways decisive.

The property of liberalism can be encoded as follows:

$$\begin{aligned} {\textit{Lib}}:{=}\bigwedge _{i\in N} \bigvee _{x\in X} \bigvee _{y\in X{\setminus }\{x\}} \left( \{i\}{\textit{dec}}(x,y)\wedge \{i\}{\textit{dec}}(y,x)\right) \end{aligned}$$

Lemma 6

For every SCF F, \(M_F\vDash {\textit{Lib}}\) if and only if F satisfies liberalism.

Proof

From left to right, suppose \(M_F\vDash {\textit{Lib}}\). Suppose for the sake of contradiction that F does not satisfy liberalism. If there is an individual i that is not two-ways decisive on any pairs then for every pair there is a profile w such that the outcome F(w) is in conflict with the preferences of i (say, \(x\succcurlyeq _i y\) and \(F(w)=y\)). This means that \(\{i\}{\textit{dec}}(x,y)\wedge \{i\}{\textit{dec}}(y,x)\) cannot be a validity in the model \(M_F\), and the same holds for all the pairs, so \(\bigvee _x \bigvee _{y\ne x} (\{i\}{\textit{dec}}(x,y)\wedge \{i\}{\textit{dec}}(y,x))\) cannot be a validity either, for our fixed i. This in turn entails that \(M_F\vDash {\textit{Lib}}\) is not the case, which constitutes the desired contradiction.

From right to left, say that F satisfies liberalism. Fixing an agent i, it is easy to check that, calling xy the alternatives for which i is decisive, we must have \(\{i\}{\textit{dec}}(x,y)\wedge \{i\}{\textit{dec}}(y,x)\) as a validity on the model \(M_F\). Thus, also \(\bigvee _x \bigvee _{y\ne x} (\{i\}{\textit{dec}}(x,y)\wedge \{i\}{\textit{dec}}(y,x))\) is a validity, and the same holds for every i, so we get the validity of \({\textit{Lib}}\).\(\square \)

3.8 Dictatorships

Finally, we will require one undesirable property of SCF’s. A dictatorship is a SCF for which one individual, the dictator, can enforce their top alternative as the outcome. Denote with \({ top}^w_i\) that alternative \(x\in X\) for which \(x\succcurlyeq _i y\) for all other alternatives \(y\in X\) in profile \(w=(\succcurlyeq _1,\ldots ,\succcurlyeq _n)\).

Definition 10

A SCF F is a dictatorship if there exists an agent \(i\in N\) (the dictator) such that, for every profile \(w\in \mathscr {L}(X)^n\), we obtain \(F(w)={ top}^w_i\).

The property of being a dictatorship is encoded by the following formula:

$$\begin{aligned} {\textit{Dic}}:{=}\bigvee _{i\in N} \bigwedge _{x\in X} \bigwedge _{y\in X{\setminus }\{x\}}\left( p^i_{x\succcurlyeq y} \rightarrow \lnot y\right) \end{aligned}$$

Observe that \({\textit{Dic}}\) is equivalent to \(\bigvee _{i\in N} \{i\}{\textit{dec}}\), i.e., a SCF is dictatorial if and only if there exists an individual that is decisive on every pair.

Lemma 7

For every SCF F, \(M_F\vDash {\textit{Dic}}\) if and only if F is a dictatorship.

Proof

From right to left, suppose F is a dictatorship, and call the dictator i. Take any world \(w=(\succcurlyeq _1,\ldots ,\succcurlyeq _n)\). We want to show that the disjunct corresponding to i is true at w. Thus, for any two distinct alternative xy we want to show that \(p^i_{x\succcurlyeq y} \rightarrow \lnot y\) is true at w. First, if \(x\succcurlyeq _i y\), then \({ top}^w_i \not = y\) and thus, due to F being a dictatorship of i, we have \(F(w)\not =y\). By the semantics, this entails \(M_F, w\vDash \lnot y\) and thus \(M_F, w\vDash p^i_{x\succcurlyeq y}\rightarrow \lnot y\). Second, if \(x\not \succcurlyeq _i y\), then \(M_F, w\not \vDash p^i_{x\succcurlyeq y}\), and the implication holds vacuously.

From left to right, suppose \(M_F\vDash {\textit{Dic}}\). Then one of the disjuncts must be valid, say for agent i. Suppose \(x={ top}^w_i\) under profile w. Then \(M_F, w\vDash \bigwedge _{y\in X{\setminus }\{x\}} p^i_{x\succcurlyeq y}\). Since (the disjunct referring to i in) the condition \({\textit{Dic}}\) is true at w, we obtain \(M_F, w\vDash \bigwedge _{y\in X{\setminus }\{x\}} \lnot y\). By resoluteness, this entails \(M_F,w\vDash x\) and thus \(F(w)=x\). \(\square \)

Note that, in the presence of axiom (14), encoding resoluteness, the disjunction in the formula \({\textit{Dic}}\) is actually an exclusive one, i.e., not only must there be some dictator, but there must be exactly one dictator.Footnote 3

4 Impossibility theorems

We are now ready to state the three major impossibility theorems we are interested in as formulas in the language of our modal logic of SCF’s. For each of them, we then demonstrate how to construct a full proof of the theorem within the axiomatic system we have seen to be complete for our logic (cf. Theorem 1). We start with Arrow’s Theorem and then prove the Muller–Satterthwaite Theorem as a corollary. The third theorem, Sen’s Theorem on the impossibility of a Paretian liberal, is mathematically much simpler and also admits a relatively short proof in our logic.

Before we begin, we need to make one important remark concerning the expressivity of our logic. Given that the language of L[NX] is parametrised by the set of individuals N and the set of alternatives X, strictly speaking the aforementioned theorems, which all apply to scenarios with arbitrary numbers of individuals and alternatives (provided those numbers are sufficiently large), cannot be stated or proven within the logic. To prove each of these impossibility theorems in their full generality we have to resort to a meta-argument, using a proof schema, to show that, for each choice of N and X, it is possible to prove a version of the theorem in the logic instantiated to those two parameters. The same proviso also holds for the properties of SCF’s featuring in the previous section: rather than being formulas in the logic, they are schemas of the representations of the properties in the logic.

4.1 Encoding Arrow’s Theorem

First published in 1951, Arrow’s Theorem is widely regarded as the seminal contributions to social choice theory [3]. The original theorem concerns social welfare functions, i.e., functions mapping profiles of (weak) preference orders (permitting indifference between alternatives) to single collective preference orders. The version we present here is adapted for preference orders that do not permit indifferences between alternatives and to SCF’s (which return a single winning alternative rather than a collective order). We refer to Taylor [35] for an extensive discussion of this variant of the theorem. From a mathematical point of view, both variants are essentially equivalent and can be proven using the same methods [12, 35]. We focus on linear orders (not permitting indifferences), because most standard voting rules impose this requirement on ballots [35]. We furthermore focus on SCF’s, because the problem of choosing a single best alternative is more pervasive in applications than that of choosing a full ranking over alternatives.

Arrow showed that, rather surprisingly, any SCF for three or more alternatives that is Pareto efficient and that satisfies the property of independence of irrelevant alternatives must be dictatorial.

Theorem 2

(Arrow’s Theorem) Any SCF for at least three alternatives that satisfies IIA and the Pareto condition is a dictatorship.

We now proceed to code a proof of Arrow’s Theorem in our logic. We will use a familiar technique, based on the concept of decisive coalitions (as defined in Sect. 3.2), to guide our search for a proof [12, 33]. What is novel about our approach is that we show that this technique can be fully embedded into a formal derivation of the axiomatic system for L[NX] presented earlier. We offer an outline on the main steps of the proof, from which a complete formal derivation can be recovered.

The proof is based on two main lemmas. The first lemma shows that, under certain conditions, a coalition being weakly decisive over a specific pair of alternatives implies that the same coalition is (not only weakly) decisive over all pairs.

Lemma 8

Consider a language parametrised by X such that \(|X|\geqslant 3\). Then for any coalition \(C\subseteq N\) and any two distinct alternatives \(x,y\in X\), we have that:

$$\begin{aligned} \vdash {\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{wdec}}(x,y)\rightarrow C{\textit{dec}}\end{aligned}$$

Proof

Suppose x, y, \(x'\) and \(y'\) are distinct alternatives.Footnote 4 To prove \(C{\textit{dec}}\) we need to prove each of the conjuncts in the following formula:

$$\begin{aligned} \bigwedge _{x\in X}\bigwedge _{y\in X{\setminus }\{x\}}\left[ \left( \bigwedge _{i\in C}p^i_{x\succcurlyeq y}\right) \rightarrow \lnot y\right] \end{aligned}$$

Denote by \(C'\) one of the possible subsets of \(N{\setminus } C\) preferring \(x'\) over \( y'\). Now consider the following derivation:

  1. (1)

    \((\bigwedge _{i\in C} p^i_{x'\succcurlyeq y'}) \rightarrow [(\bigwedge _{i\in C} p^i_{x'\succcurlyeq y'})\wedge \bigvee _{C'\subseteq N{\setminus } C}((\bigwedge _{i\in C'} p^i_{x'\succcurlyeq y'})\wedge (\bigwedge _{i\not \in C'\cup C} p^i_{y'\succcurlyeq x'}))]\)

    By finiteness of agents and alternatives and the theorems \(p^i_{x'\succcurlyeq y'}\vee p^i_{y'\succcurlyeq x'}\) for all \(i\in N\) we can, rearranging conjunctions and disjunctions, prove the second line of the formula; the implication follows.

  2. (2)

    \((\bigwedge _{i\in C} p^i_{x'\succcurlyeq y'})\rightarrow \bigvee _{C'\subseteq N{\setminus } C} \,[ (\bigwedge _{i\in C} p^i_{x'\succcurlyeq y'}) \wedge (\bigwedge _{i\in C'} p^i_{x'\succcurlyeq y'}) \wedge (\bigwedge _{i\not \in C'\cup C} p^i_{y'\succcurlyeq x'})]\)

    by distributivity from (1)

  3. (3)

    This part of the proof contains the derivation of the following formula, for every coalition \(C'\subseteq N{\setminus } C\):

    \({\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{wdec}}(x,y) \rightarrow [ (\bigwedge _{i\in C} p^i_{x'\succcurlyeq y'})\wedge (\bigwedge _{i\in C'} p^i_{x'\succcurlyeq y'}) \wedge (\bigwedge _{i\not \in C'\cup C} p^i_{y'\succcurlyeq x'}) \rightarrow \lnot y']\)

    We will present the derivation for any such \(C'\) below.

  4. (4)

    \({\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{wdec}}(x,y) \rightarrow \bigvee _{C'\subseteq N{\setminus } C} [(\bigwedge _{i\in C} p^i_{x'\succcurlyeq y'})\wedge (\bigwedge _{i\in C'} p^i_{x'\succcurlyeq y'}) \wedge (\bigwedge _{i\not \in C'\cup C} p^i_{y'\succcurlyeq x'}) \rightarrow \lnot y']\)

    by propositional reasoning from all the instances of (3)

  5. (5)

    \({\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{wdec}}(x,y)\rightarrow [(\bigwedge _{i\in C} p^i_{x'\succcurlyeq y'})\rightarrow \lnot y'] \)

    by propositional reasoning from (2) and (4)

We still need to show (all the finitely many instances of) step (3). We prove each of them in the following way. Consider a specific profile \(w=(\succcurlyeq _1,\ldots ,\succcurlyeq _n)\) for which we can rearrange the conjuncts in the formula \({\textit{profile}}(w)\) as follows:

$$\begin{aligned} {\textit{profile}}\left( w\right)= & {} \left( \bigwedge _{i\in C} p^i_{x\succcurlyeq y}\right) \wedge \left( \bigwedge _{i\in N}\left( p^i_{x'\succcurlyeq x}\wedge p^i_{y\succcurlyeq y'}\right) \right) \wedge \\&\left( \bigwedge _{i\in C\cup C'}p^i_{x'\succcurlyeq y'}\right) \wedge \left( \bigwedge _{i\not \in C} p^i_{y\succcurlyeq x}\right) \wedge \left( \bigwedge _{i\not \in C\cup C'} p^i_{y'\succcurlyeq x'}\right) \wedge \alpha \end{aligned}$$

Here \(\alpha \) is the formula expressing the fact that all the other alternatives (if any) are ranked by all agents below \(x,y,x',y'\). We are now ready to present a derivation for a specific coalition \(C'\):

  1. (a)

    For any \(z\in X{\setminus }\{x,y,x',y'\}\):

    \({\textit{Par}}\wedge {\textit{profile}}(w)\rightarrow \lnot x \wedge \lnot y'\wedge \lnot z\)

    from formula \({\textit{Par}}\), the second part of \({\textit{profile}}(w)\), and \(\alpha \)

  2. (b)

    \(C{\textit{wdec}}(x,y)\wedge {\textit{profile}}(w)\rightarrow \lnot y\)

    by definition of \(C{\textit{wdec}}(x,y)\)

  3. (c)

    \({\textit{Par}}\wedge C{\textit{wdec}}(x,y)\rightarrow ({\textit{profile}}(w)\rightarrow x')\)

    by axiom (14), encoding resoluteness, with (a) and (b)

  4. (d)

    \(\Diamond _N {\textit{profile}}(w)\) by the Universal Domain Lemma

  5. (e)

    \({\textit{Par}}\wedge C{\textit{wdec}}(x,y)\rightarrow \Diamond _N( {\textit{profile}}(w)\wedge x')\)

    by standard modal reasoning from (c) and (d)

  6. (f)

    \({\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{wdec}}(x,y)\rightarrow \Diamond _N( {\textit{profile}}(w)\wedge x')\)

    by propositional reasoning from (e)

  7. (g)

    \({\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{wdec}}(x,y)\rightarrow [({\textit{profile}}(w)(x',y')\rightarrow \lnot y')]\)

    from (f) and formula \({\textit{IIA}}\) w.r.t. \(x'\) and \(y'\)

But \({\textit{profile}}(w)(x',y')\) consists of the following conjuncts:

$$\begin{aligned} \left( \bigwedge _{i\in C} p^i_{x'\succcurlyeq y'}\right) \wedge \left( \bigwedge _{i\in C'} p^i_{x'\succcurlyeq y'}\right) \wedge \left( \bigwedge _{i\not \in C'\cup C} p^i_{y'\succcurlyeq x'}\right) \end{aligned}$$

Hence, we may infer that this latter formula entails \(\lnot y'\). This shows step (3) and concludes the proof. \(\square \)

The next lemma establishes a syntactic counterpart of what is known as the Contraction Lemma in the literature [33]. It says that, under certain conditions, for any way of splitting a decisive coalition of two or more agents into two sub-coalitions, one of those sub-coalitions must also be decisive.

Lemma 9

(Contraction lemma) Consider a language parametrised by X such that \(|X|\geqslant 3\). Then for any coalition \(C\subseteq N\) and any two coalitions \(C_1\) and \(C_2\) that form a partition of C, we have that:

$$\begin{aligned} \vdash {\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{dec}}\rightarrow (C_1{\textit{dec}}\vee C_2{\textit{dec}}) \end{aligned}$$

Proof

Consider C, \(C_1\) and \(C_2\) as in the statement of the lemma (i.e., \(C=C_1\cup C_2\) and \(C_1\cap C_2=\emptyset \)) and let xyz be three distinct alternatives. Now consider any profile w for which \({\textit{profile}}(w)\) has the following form:

$$\begin{aligned} {\textit{profile}}(w)= & {} \left( \bigwedge _{i\not \in C_2} p^i_{x\succcurlyeq y}\right) \wedge \left( \bigwedge _{i\in C_1} p^i_{x\succcurlyeq z}\right) \wedge \left( \bigwedge _{i\in C_1\cup C_2}p^i_{y\succcurlyeq z}\right) \\&\left( \bigwedge _{i\in C_2} p^i_{y\succcurlyeq x}\right) \wedge \left( \bigwedge _{i\not \in C_1} p^i_{z\succcurlyeq x}\right) \wedge \left( \bigwedge _{i\not \in C_1\cup C_2}p^i_{z\succcurlyeq y}\right) \wedge \alpha \end{aligned}$$

Here \(\alpha \) encodes the fact that all other alternatives (if any) are ranked by all agents below xyz. By propositional reasoning and the fact that in profile w all agents in C prefer y over z we can derive:

$$\begin{aligned} C{\textit{dec}}\rightarrow ({\textit{profile}}(w)\rightarrow \lnot z) \end{aligned}$$
(1)

For any other alternative k different from x or y, we can derive:

$$\begin{aligned} {\textit{Par}}\rightarrow ({\textit{profile}}(w)\rightarrow \lnot k) \end{aligned}$$
(2)

This is so because \(\alpha \) in \({\textit{profile}}(w)\) encodes the fact that all other alternatives are ranked by all agents below xyz. Formulas (1) and (2), together with axiom (14), encoding resoluteness, enforce that x or y must be the outcome:

$$\begin{aligned} {\textit{Par}}\wedge C{\textit{dec}}\rightarrow (\left( {\textit{profile}}(w)\rightarrow x)\vee ({\textit{profile}}(w)\rightarrow y)\right) \end{aligned}$$
(3)

As an aside, we note that we know (again from resoluteness) that this disjunction must be exclusive. By the Universal Domain Lemma, we have that \(\Diamond _N{\textit{profile}}(w)\) is a theorem, and thus, using standard modal reasoning on formula (3), we obtain:

$$\begin{aligned} {\textit{Par}}\wedge C{\textit{dec}}\rightarrow (\Diamond _N({\textit{profile}}(w)\wedge x) \vee \Diamond _N({\textit{profile}}(w)\wedge y)) \end{aligned}$$
(4)

Now propositional reasoning together with \({\textit{IIA}}\), first w.r.t. the pair (xz) and then w.r.t. the pair (yx), allows us to derive from formula (4) the following formula:

$$\begin{aligned} {\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{dec}}\rightarrow \left( ({\textit{profile}}(w)(x,z)\rightarrow \lnot z) \vee ({\textit{profile}}(w)(y,x)\rightarrow \lnot x)\right) \end{aligned}$$

Recall that in \({\textit{profile}}(w)\) the agents in \(C_1\) are the only ones supporting x over z. Hence, \(({\textit{profile}}(w)(x,z)\rightarrow \lnot z)\) means that \(C_1\) is weakly decisive for the pair (xz). Likewise, the agents in \(C_2\) are the only ones supporting y over x; thus \(({\textit{profile}}(w)(y,x)\rightarrow \lnot x)\) means that \(C_2\) is weakly decisive for the pair (yx). In this fashion we can conclude that:

$$\begin{aligned} {\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{dec}}\rightarrow (C_1{\textit{wdec}}(x,z)\vee C_2{\textit{wdec}}(y,x)) \end{aligned}$$
(5)

We can now use Lemma 8 and propositional reasoning on formula (5) to derive:

$$\begin{aligned} {\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{dec}}\rightarrow (C_1{\textit{dec}}\vee C_2{\textit{dec}}) \end{aligned}$$

We have thus shown that \({\textit{Par}}\wedge {\textit{IIA}}\wedge C{\textit{dec}}\rightarrow (C_1{\textit{dec}}\vee C_2{\textit{dec}})\) must be a theorem of the logic. Note that the disjunction is still exclusive. \(\square \)

We can now state and prove a syntactic counterpart of Arrow’s Theorem:

Theorem 3

(Arrow’s Theorem) Consider a logic L[NX] with a language parametrised by X such that \(|X|\geqslant 3\). Then we have:

$$\begin{aligned} \vdash {\textit{Par}}\wedge {\textit{IIA}}\rightarrow {\textit{Dic}}\end{aligned}$$

Proof

As mentioned earlier, \({\textit{Par}}\) is equivalent to \(N{\textit{dec}}\). Exploiting the formula \({\textit{IIA}}\), we can apply the Contraction Lemma and prove that one of two disjoint subsets of N is decisive. Repeating the process finitely many times (we have finitely many agents), we can show that one of the singletons that form N is decisive. But this is tantamount to saying that there exist a decisive agent, i.e., a dictator, so the formula \(\bigvee _{i\in N}\{i\}{\textit{dec}}\), which is equivalent to \({\textit{Dic}}\). Hence, the formula \({\textit{Par}}\wedge {\textit{IIA}}\rightarrow {\textit{Dic}}\) can be derived as a theorem of the logic L[NX] for any set X with \(|X|\geqslant 3\) as claimed. \(\square \)

Note that throughout the proof we have made implicit use of the condition \(|X|\geqslant 3\) when assuming the availability of three distinct alternatives (in fact, in the proof of Lemma 8 we have only gone through the most interesting case, requiring at least four alternatives).

As we already mentioned, the proof provided here is not, strictly speaking, a full syntactic proof of Arrow’s Theorem within the logic, because the language is parametric in the set of agents N and the set of alternatives X. Nevertheless, apart from the proviso on the number of alternatives stated in Theorem 3, our proof is independent of the choice of N and X; that is to say, this proof can be used as a template to prove the appropriate instance of Arrow’s Theorem in any logic L[NX] for N and X such that \(|X|\geqslant 3\).

Due to Theorem 1 establishing completeness of the logic and Lemmas 23, and 7 establishing the correctness of our representation of the Arrovian conditions within the logic, Theorem 3 is equivalent to the usual, semantic, rendering of Arrow’s Theorem for SCF’s stated as Theorem 2. Thus, our purely syntactic proof constitutes an independent proof of the theorem. This shows that the logic L[NX] is a useful tool for reasoning about nontrivial concepts in social choice. In the remainder of this section we offer further support for this assertion, by proving two additional results.

4.2 Encoding the Muller–Satterthwaite Theorem

The Muller–Satterthwaite Theorem [23] establishes that, when there are at least three alternatives, the only SCF’s that are strongly monotonic—and that do not rule out some of the alternatives as potential winners to begin with (by failing surjectivity)—are the dictatorships. Like Arrow’s Theorem, this result shows that certain intuitively appealing properties of SCF’s simply cannot be realised in general. We directly give a syntactic formulation of this important result in our logic.

Theorem 4

(Muller–Satterthwaite Theorem) Consider a logic L[NX] with a language parametrised by X such that \(|X|\geqslant 3\). Then we have:

$$\begin{aligned} \vdash {\textit{SM}}\wedge {\textit{Sur}}\rightarrow {\textit{Dic}}\end{aligned}$$

Proof

It is enough to show that \(\vdash {\textit{SM}}\wedge {\textit{Sur}}\rightarrow {\textit{Par}}\wedge {\textit{IIA}}\). Then, by the syntactic derivation of Arrow’s Theorem given earlier, we obtain \({\textit{Dic}}\). We begin by showing that the two premises entail IIA. If we can show that

$$\begin{aligned} SM\rightarrow \left[ \Diamond _N({\textit{profile}}(w)\wedge x)\rightarrow ({\textit{profile}}(w)(x,y)\rightarrow \lnot y)\right] \end{aligned}$$

for any w and any distinct xy then we have that the two premises prove the conjunction of all such consequents, which is IIA.

The general strategy is the following: first we construct a profile \(w''\) which ranks the alternatives xy above all others and preserves the ordering of w encoded in \({\textit{profile}}(w)(x,y)\); second, by \({\textit{SM}}\), we conclude that x must be the outcome in this profile \(w''\); third we show that, for any profile \(w'\) that agrees with \({\textit{profile}}(w)(x,y)\) , if the outcome at \(w'\) is y then, again by \({\textit{SM}}\), the outcome at \(w''\) is also y; this last passage contradicts the fact that x is the outcome at \(w''\), hence y cannot be the outcome at any such \(w'\). We proceed to encode this reasoning.

First, construct a formula representing \(w''\):

$$\begin{aligned} {\textit{profile}}(w''):={\textit{profile}}(w)(x,y) \wedge \bigwedge _{i\in N}\bigwedge _{z\ne x,y} \left( p^i_{x\succcurlyeq z}\wedge p^i_{y\succcurlyeq z}\right) \end{aligned}$$

By construction, we clearly have that:

$$\begin{aligned} {\textit{profile}}(w'')\rightarrow \bigwedge _{k\in X{\setminus }\{x\}} N^w_{x\succcurlyeq k} \end{aligned}$$

Together with \({\textit{SM}}\), this latter formula readily entails the following implication:

$$\begin{aligned} {\textit{SM}}\wedge \Diamond _N({\textit{profile}}(w)\wedge x)\wedge {\textit{profile}}(w'')\rightarrow x \end{aligned}$$

By the Universal Domain Lemma, we know that we have \(\Diamond _N {\textit{profile}}(w'')\). Thus:

$$\begin{aligned} {\textit{SM}}\wedge \Diamond _N({\textit{profile}}(w)\wedge x)\rightarrow \Diamond _N ({\textit{profile}}(w'')\wedge x) \end{aligned}$$
(6)

This concludes the first two parts, showing that x must be the outcome for the profile \(w''\). We now reason by contradiction, assuming

$$\begin{aligned} SM\wedge \Diamond _N({\textit{profile}}(w)\wedge x)\wedge {\textit{profile}}(w)(x,y)\wedge y \end{aligned}$$

and deriving \(\Diamond _N ({\textit{profile}}(w'')\wedge y)\), in contradiction with formula (6), thereby forcing us to conclude that the following holds:

$$\begin{aligned} SM\wedge \Diamond _N({\textit{profile}}(w)\wedge x)\wedge {\textit{profile}}(w)(x,y)\rightarrow \lnot y \end{aligned}$$

Clearly, we can derive from the axioms that:

$$\begin{aligned} {\textit{profile}}(w)(x,y)\wedge y \rightarrow \bigvee _{w'} \left[ {\textit{profile}}(w')\wedge {\textit{profile}}(w)(x,y) \wedge y\right] \end{aligned}$$

That is to say, there is a profile \(w'\) containing the preferences in \({\textit{profile}}(w)(x,y)\) for which the outcome is y. By the Universal Domain Lemma, we can put a diamond in front of the profile formula. Hence, after some rearrangement we obtain:

$$\begin{aligned} {\textit{profile}}(w)(x,y)\wedge y \rightarrow \bigvee _{w'}\Diamond _N [{\textit{profile}}(w') \wedge y] \end{aligned}$$
(7)

Notice now that the part inside the disjunction looks like the first formula in the antecedent of \({\textit{SM}}\), formulated for variable y. Upon inspection we can also check that for all such \(w''\), we get:

$$\begin{aligned} {\textit{profile}}(w'')\rightarrow \bigwedge _{k\ne x} N^{w'}_{x\succcurlyeq k} \end{aligned}$$

Thus, we know that by applying \({\textit{SM}}\) we obtain:

$$\begin{aligned} {\textit{SM}}\wedge \Diamond _N [{\textit{profile}}(w') \wedge y]\wedge {\textit{profile}}(w'')\rightarrow y \end{aligned}$$

Now we can push \({\textit{SM}}\) inside the disjunction in formula (7), use the Universal Domain Lemma to get \(\Diamond _N {\textit{profile}}(w'')\), and apply the latter formula to conclude that each of the disjuncts entails \(\Diamond _N ({\textit{profile}}(w'')\wedge y)\). But then the whole disjunction entails it and we can derive:

$$\begin{aligned} {\textit{profile}}(w)(x,y)\wedge y \rightarrow \Diamond _N \left( {\textit{profile}}(w'')\wedge y\right) \end{aligned}$$

This contradicts formula (6), since only one alternative can be the outcome and \(x\ne y\). Hence, we have derived \({\textit{IIA}}\).

Now for the derivation of \({\textit{Par}}\). It is enough to show that \({\textit{SM}}\wedge {\textit{Sur}}\) entails each conjunct of the following form:

$$\begin{aligned} \left( \bigwedge _{i\in N} p^i_{x\succcurlyeq y}\right) \rightarrow \lnot y \end{aligned}$$
(8)

From \({\textit{Sur}}\) we know that \(\bigvee _{w}\Diamond _N ({\textit{profile}}(w)\wedge x)\). Construct the profile \(w''\) which is the same as w but with the difference that x has been ranked over y by all agents:

$$\begin{aligned} {\textit{profile}}(w''):{=}\bigwedge _{i\in N} p^i_{x\succcurlyeq y} \wedge \bigwedge _{z,k\ne x}{\textit{profile}}(w)(z,k)\wedge \bigwedge _{y>_i z} p^i_{x\succcurlyeq z} \wedge \bigwedge _{z>_i y} {\textit{profile}}(w)(x,z) \end{aligned}$$

where \(y>_i z\) in the subscript is just notation to mean that y is ranked over z by i in w and similarly for \(z>_i y\). Clearly by this formula we have that if \(z>_i y\) in w then in \(w''\) their ranking in unchanged, while for \(z=y\) and \(y>_i z\) now x is ranked above z; thus

$$\begin{aligned} {\textit{profile}}(w)\rightarrow \bigwedge _{y\in X\backslash \{x\}} N^w_{x\succcurlyeq y} \end{aligned}$$

Hence by \({\textit{SM}}\) we have that \(\Diamond _N ({\textit{profile}}(w)\wedge x)\wedge {\textit{profile}}(w'')\rightarrow x\), that is, x is still the outcome in \(w''\). Hence, every disjunct in \({\textit{Sur}}\) entails \(\Diamond _N({\textit{profile}}(w'')\wedge x)\). Now notice that the antecedent in the formula (8), namely \((\bigwedge _{i\in N} p^i_{x\succcurlyeq y})\), by construction is just \({\textit{profile}}(w'')(x,y)\). So pushing the latter into the disjunction we obtain that each disjunct entails:

$$\begin{aligned} \Diamond _N\left( {\textit{profile}}(w'')\wedge x\right) \wedge {\textit{profile}}(w'')(x,y) \end{aligned}$$

But this is the antecedent of \({\textit{IIA}}\), hence each disjunct actually entails \(\lnot y\). Therefore, the whole disjunction entails \(\lnot y\), and we have proved the desired implication (8). \(\square \)

4.3 Encoding Sen’s approach to rights

Sen’s Theorem [32] shows that it is impossible to satisfy both the property of Pareto efficiency and the property of liberalism. Unlike the other impossibility theorems discussed, this result does not depend on any assumptions regarding the number of alternatives. We again give directly a syntactic formulation using our logic.

Theorem 5

(Sen’s Theorem) Consider any logic L[NX]. Then we have:

$$\begin{aligned} \vdash \lnot ({\textit{Par}}\wedge {\textit{Lib}}) \end{aligned}$$

Proof

Our derivation will mirror the standard proof of the theorem [12, 32]. It is sufficient to show that \(({\textit{Par}}\wedge {\textit{Lib}})\) entails a contradiction. To make the notation lighter we will use the following abbreviation, meaning that an agent i is two-way decisive over the pair (xy):

$$\begin{aligned} {\textit{Lib}}^i(x,y) :{=}\{i\}{\textit{dec}}(x,y)\wedge \{i\}{\textit{dec}}(y,x) \end{aligned}$$

Consider only two of the conjuncts of \({\textit{Lib}}\), say for agents \(i_1\) and \(i_2\). If we can prove that these two conjuncts together with \({\textit{Par}}\) entail a contradiction then we are done. Begin by rearranging the conjunction of disjunctions in the definition of \({\textit{Lib}}\) into a disjunction of conjunctions. For two agents this will look like this:

$$\begin{aligned} \bigvee _{x_1, x_2, y_1, y_2} \left( {\textit{Lib}}^{i_1}(x_1,y_1)\wedge {\textit{Lib}}^{i_2}(x_2,y_2)\right) \end{aligned}$$
(9)

This formula essentially says that there are two pairs of elements on which the two agents are respectively two-way decisive. If we can prove that each of the disjuncts entails a contradiction, then by the laws of disjunction we can infer that the whole formula entails a contradiction. Note that we can push \({\textit{Par}}\) inside such a conjunction. Therefore, the task at hand is to show that formulas of the following shape entail a contradiction for every choice of the four alternatives:

$$\begin{aligned} {\textit{Par}}\wedge \left( {\textit{Lib}}^{i_1}(x_1,y_1)\wedge {\textit{Lib}}^{i_2}(x_2,y_2)\right) \end{aligned}$$

We focus on the cases, i.e., the disjuncts, in which these are all distinct alternatives; the cases of two or three alternatives follow via a similar argument with some alternatives being merged into one. For each choice of \(x_1, x_2, y_1, y_2 \in X\) build the profile \(w_{x_{1,2}, y_{1,2}}\) with the following properties:

  • Individual \(i_1\) ranks \(x_1\) above \(y_1\).

  • Individual \(i_2\) ranks \(x_2\) above \(y_2\).

  • All individuals rank \(y_1\) above \(x_2\) and also \(y_2\) above \(x_1\).

  • All individuals rank \(x_1, x_2, y_1, y_2\) above all other alternatives.

These properties correspond to the following formulas:

  • \(p^{i_1}_{x_1\succcurlyeq y_1}\)

  • \(p^{i_2}_{x_2\succcurlyeq y_2}\)

  • \(\bigwedge _{i\in N} (p^{i}_{y_1\succcurlyeq x_2}\wedge p^{i}_{y_2\succcurlyeq x_1})\)

  • \(\bigwedge _{i\in N}(p^{i}_{x_1\succcurlyeq z}\wedge p^{i}_{x_2\succcurlyeq z}\wedge p^{i}_{y_1\succcurlyeq z}\wedge p^{i}_{y_2\succcurlyeq z})\) for all other alternatives \(z\in X\)

Therefore, they will be part of a big conjunction forming \({\textit{profile}}(w_{x_{1,2}, y_{1,2}})\). Clearly, by combining the latter two of the above formulas with the formula representing the Pareto condituon, we we can derive the following two formulas:

  • \(\left( {\textit{Par}}\wedge \bigwedge _{i\in N} \left( p^{i}_{y_1\succcurlyeq x_2}\wedge p^{i}_{y_2\succcurlyeq x_1}\right) \right) \rightarrow \left( \lnot x_2\wedge \lnot x_1\right) \)

  • \(\left( {\textit{Par}}\wedge \bigwedge _{i\in N}\left( p^{i}_{y_1\succcurlyeq z}\wedge p^{i}_{y_2\succcurlyeq z}\wedge p^{i}_{x_1\succcurlyeq z}\wedge p^{i}_{x_2\succcurlyeq z}\right) \right) \rightarrow \lnot z\) for all other alternatives \(z\in X\)

Thus, we can derive:

$$\begin{aligned} \left( {\textit{Par}}\wedge {\textit{profile}}(w_{x_{1,2}, y_{1,2}})\right) \rightarrow \left( \lnot x_2\wedge \lnot x_1\wedge \lnot z\right) \end{aligned}$$

It is also easy to prove that the following two formulas hold:

  • \({\textit{Lib}}^{i_1}(x_1,y_1)\wedge p^{i_1}_{x_1\succcurlyeq y_1}\rightarrow \lnot y_1\)

  • \({\textit{Lib}}^{i_2}(x_2,y_2)\wedge p^{i_2}_{x_2\succcurlyeq y_2}\rightarrow \lnot y_2\)

Recall that the formulas \(p^{i_1}_{x_1\succcurlyeq y_1}\) and \(p^{i_2}_{x_2\succcurlyeq y_2}\) are also contained in \({\textit{profile}}(w_{x_{1,2}, y_{1,2}})\). Hence, summing up what we have seen so far, we obtain:

$$\begin{aligned} \left[ {\textit{Par}}\wedge {\textit{Lib}}^{i_1}(x_1,y_1) \wedge {\textit{Lib}}^{i_2}(x_2,y_2) \wedge {\textit{profile}}(w_{x_{1,2}, y_{1,2}})\right] \rightarrow \left[ \lnot x_1\wedge \lnot x_2\wedge \lnot y_1\wedge \lnot y_2\wedge \overline{Z}\right] \end{aligned}$$

In the above formula, we use \(\overline{Z}\) as a shorthand for the conjunction \(\bigwedge _{z\in X{\setminus }\{x_1,x_2,y_1,y_2\}}\lnot z\). The consequent of the implication above is a negation of all the alternatives in X, a formula that is inconsistent with the first part of axiom (14), the axiom encoding resoluteness of the SCF. Hence, we obtain:

$$\begin{aligned} \left[ {\textit{Par}}\wedge {\textit{Lib}}^{i_1}(x_1,y_1) \wedge {\textit{Lib}}^{i_2}(x_2,y_2) \wedge {\textit{profile}}(w_{x_{1,2}, y_{1,2}})\right] \rightarrow \bot \end{aligned}$$
(10)

Due to the Universal Domain Lemma, we know that the formula \(\Diamond _N {\textit{profile}}(w_{x_{1,2}, y_{1,2}})\) is a theorem of the logic. So if we are given \({\textit{Par}}\wedge {\textit{Lib}}^{i_1}(x_1,y_1)\wedge {\textit{Lib}}^{i_2}(x_2,y_2)\), we can certainly deduce:

$$\begin{aligned} {\textit{Par}}\wedge {\textit{Lib}}^{i_1}(x_1,y_1)\wedge {\textit{Lib}}^{i_2}(x_2,y_2) \wedge \Diamond _N {\textit{profile}}(w_{x_{1,2}, y_{1,2}}) \end{aligned}$$

By this formula, formula (10), and modal reasoning we can conclude:

$$\begin{aligned} \left[ {\textit{Par}}\wedge {\textit{Lib}}^{i_1}(x_1,y_1)\wedge {\textit{Lib}}^{i_2}(x_2,y_2) \wedge \Diamond _N {\textit{profile}}(w_{x_{1,2}, y_{1,2}})\right] \rightarrow \Diamond _N \perp \end{aligned}$$

Since \(\Diamond _N \perp \rightarrow \perp \) is a theorem of normal modal logic we get:

$$\begin{aligned} \left[ {\textit{Par}}\wedge {\textit{Lib}}^{i_1}(x_1,y_1)\wedge {\textit{Lib}}^{i_2}(x_2,y_2)\right] \rightarrow \perp \end{aligned}$$

Thus, we have shown that one of the disjuncts of formula (9) implies a contradiction. Repeating the same proof for every permutation of the four alternatives, we can thus prove that the whole disjunction entails a contradiction. Thus, also \(({\textit{Par}}\wedge {\textit{Lib}})\) entails a contradiction and we are done. \(\square \)

5 Implementing the logic

In this section we expand on the possibility of implementing the logic. As we will see, it is possible to translate the language of L[NX] into classical propositional logic, and more specifically into the propositional language used by Tang and Lin [34]. This paves the way for the application of SAT solvers to check the validity of formulas in our logic, thereby allowing for a fully automated check of the validity of the theorems formulated in this paper.

The language for modelling social choice functions used by Tang and Lin consists of two predicates: p(ixyw), expressing that in profile w agent i prefers x over y, and s(xw), expressing that alternative x is the winner in profile w. In full generality, these predicates belong to a multi-sorted first order logic with variables for agents, alternatives, and profiles. However, when the number of agents and alternatives is fixed, we can translate the quantified formulas into propositional formulas substituting for the variables all the finitely many constants; this is how Tang and Lin obtain a propositional language that can be fed into a SAT solver. Formulas in the resulting propositional language are also evaluated on the models of Definition 4:

  • \(M\models p(i,x,y, w)\) iff \(x\succcurlyeq _i y\) in profile w

  • \(M \models s(x,w)\) iff \(F(w)=x\)

We show here how to adapt the so-called Standard Translation [6] from modal logic into first-order logic to a translation from our modal language into the multi-sorted first-order logic with predicates p(ixyw) and s(xw). Once this is done, the formulas of the latter language can be turned into propositional clauses and checked following the approach of Tang and Lin [34]. Consider the following translation of the language of L[NX] into the language with predicates p(ixyw) and s(xw). The translation is parametric in w, a variable ranging over profiles:

$$\begin{aligned} t_w(p^i_{x\succcurlyeq y})\quad&\mapsto \quad p(i,x,y, w)\\ t_w(\lnot \varphi )\quad&\mapsto \quad \lnot t_w(\varphi ) \\ t_w(\varphi \wedge \psi )\quad&\mapsto \quad t_w(\varphi )\wedge t_w(\psi ) \\ t_w(x)\quad&\mapsto \quad s(x,w)\\ t_w(\Diamond _C \varphi ) \quad&\mapsto \quad \exists w' \left( \bigwedge _{i\in N\backslash C}\bigwedge _{x\ne y\in X} \left[ t_w\left( p^i_{x\succcurlyeq y}\right) \leftrightarrow t_{w'}\left( p^i_{x\succcurlyeq y}\right) \right] \wedge t_{w'}(\varphi ) \right) \end{aligned}$$

The other propositional connectives are handled accordingly. The following lemma establishes the connection between the satisfiability problem for L[NX] and the satisfiability problem for the propositional language associated with the same parameters.

Proposition 2

For every formula \(\varphi \) in the language of L[NX], profile w, and SCF F, \(\varphi \) is satisfiable at \(M_F, w\) if and only if \(t_w(\varphi )\) is satisfiable at \(M_F\).

Proof

The proof proceedss by induction on the complexity of \(\varphi \). The base cases are immediate by the translation and the semantics; we expand only on the case of the modality.

First, suppose \(\Diamond _C \varphi \) is satisfiable. Then there are a SCF F and a profile w such that \(M_F, w\models \Diamond _C \varphi \), which in turn entails that there is another profile \(w'\) with \({\succcurlyeq _i}={\succcurlyeq '_i}\) for all \(i\in N{\setminus } C\) such that \(M_F ,w' \models \varphi \). By the induction hypothesis, \(t_{w'}(\varphi )\) is satisfiable at \(M_F\). Since \({\succcurlyeq _i}={\succcurlyeq '_i}\) is the case, we will have that \(\bigwedge _{i\in N\backslash C}\bigwedge _{x,y\in X} [t_w(p^i_{x\succcurlyeq y}) \leftrightarrow t_{w'}(p^i_{x\succcurlyeq y})]\) is true at \(M_F\). So we can conclude that \(t_w(\varphi )\) is satisfiable at \(M_F\) when \(w'\) is the witness of the existential quantifier.

For the other direction, suppose \(t_w(\Diamond _C \varphi )\) is satisfiable at \(M_F\). Then there exists a profile \(w'\) such that \(\bigwedge _{i\in N{\setminus } C}\bigwedge _{x,y\in X} [t_w(p^i_{x\succcurlyeq y}) \leftrightarrow t_{w'}(p^i_{x\succcurlyeq y})]\) and \(t_{w'}(\varphi )\) are true. From the second formula and induction hypothesis we get that \(M_F ,w' \models \varphi \), while from the first we can conclude that \({\succcurlyeq _i}={\succcurlyeq '_i}\) for all \(i\in N{\setminus } C\). Thus, \(M_F, w\models \Diamond _C \varphi \). \(\square \)

The reader may now wonder: why are we using modal logic at all, if we can collapse everything to propositional logic? The key here is size: the readability of the formulas of L[NX], and therefore its usefulness as a tool for formalisation, is lost in the translation into propositional logic.

To make this point precise, we inductively define a function s assigning a size to each formula in a modal propositional language: the size of propositional atoms is 1, and the size of any other formula is the sum of the sizes of its immediate subformulas plus 1. For example, the size of \(p\wedge \lnot q\) is \(1+1+(1+1)=4\). It is easy to see that the size contributed by the propositional atoms and the boolean connectives remains constant during the translation: \(s\left( t_w(p^i_{x\succcurlyeq y})\right) = s\left( p(i,x,y, w)\right) = 1= s(p^i_{x\succcurlyeq y})\), and similarly for the other cases. For the modality, however, we have a significant difference. The formula \(\Diamond _C \varphi \) has size \(s(\varphi )+1\), while its translation \(t_w(\Diamond _C \varphi ) \) has size

$$\begin{aligned} |\mathscr {L}(X)|^n \times \left[ s(t_w(\varphi ))+1+ (|N{\setminus } C| \times (|X^2|-|X|) \times 4 ) -1\right] \end{aligned}$$

This formula comes from the definition of the translation. First of all, after X and N have been fixed, we have to transform the existential quantifier into a big disjunction over all possible profiles; this explains the multiplication with the factor \(|\mathscr {L}(X)|^n\). Within the square brackets, we have to add the size of the translation of \(\varphi \) to the size of the formula \(\bigwedge _{i\in N{\setminus } C}\bigwedge _{x,y\in X} [t_k(p^i_{x\succcurlyeq y}) \leftrightarrow t_{k'}(p^i_{x\succcurlyeq y})]\), plus 1 because of the conjunction. Now let us look at the latter formula. If we take the bi-implication between atomic propositions as primitive, the inner formula has size 3 (otherwise it would be even greater). This needs to be multiplied with the size of the complement of C and the size of \(|X^2|\) minus all the pairs in the diagonal (we consider \(x\ne y\)). Counting the conjunction associated with each of the instances of the bi-implication and subtracting 1 for the additional conjunction that we are considering, we arrive at the formula above.

The reader can get a feel of the blow-up by considering the following example. Let us analyse the simple case in which there are three alternatives and two agents, and where C is a singleton. We take \(\varphi = x\), an atomic proposition, so that \(s(\varphi ) = 1\) and \(s(\Diamond _C \varphi )=2\). On the other hand, the size of the translation into propositional logic is \(s(t_k(\Diamond _C \varphi )) = (3!)^2 \times [1+ 1\times (9-3)\times 4] = 36\times 25 = 900\). Clearly, formulas of such size are unwieldy for humans; their best use is for automated reasoning.

Thus, the logic L[NX] can fulfill two roles in the study of social choice theory. First, as demonstrated in the main part of this paper it is a convenient formalism in which to cast proofs of theorems regarding the characterisation of SCF’s in terms of basic properties. Second, as demonstrated in this section, it can serve as a convenient interface between social choice theory and propositional logic, with L[NX] ensuring readability and the propositional counterpart allowing for the use of standard computational tools, particularly SAT solvers, to automatically reason about the SCF’s.

6 Related work

The idea of using formal methods to subject social procedures to the same kind of formal analysis routinely applied to algorithms and software systems can be traced back to, at least, the work of Parikh [25, 26]. The two main arguments motivating this kind of enterprise are obvious and well known: formal analysis will deepen our understanding of social procedures; and formal analysis can increase our confidence in the correctness of social procedures. Pauly [28] has suggested a third argument that is specific to the use of logic in social choice theory: the expressive power of a logical language required to express a choice-theoretic property (such a IIA) is a relevant criterion in judging the interestingness of a characterisation result making use of such a property. A fourth argument fueling this line of research is that it has the potential to uncover entirely new characterisation and impossibility results [9, 13, 34]—results that are of independent interest to economists [10].

Successful applications of logic and automated reasoning to social choice theory have included the automated verification of the correctness of practical algorithms for implementing voting rules [5] and the automated search for new impossibility theorems in the domain of ranking sets of objects [13]. However, most work to date has focussed on the Arrovian framework of preference aggregation and the challenges of representing Arrow’s Theorem in a variety of logical frameworks [1, 16], of verifying the correctness of existing proofs for the theorem [24, 38], and of finding new such proofs [34]. Indeed, Arrow’s Theorem is arguably the best yardstick against which to measure new formal methods for reasoning about problems of social choice. The work of Lange et al. [20] on the use of automated reasoning in different areas of economic theory, such as auctions and cooperative games, demonstrates that the basic concepts and techniques developed for the seemingly narrow domain of Arrovian preference aggregation can have a ripple-on effect on the use of formal methods in economics more widely.

Regarding Arrow’s Theorem, starting at the top as far as the expressive power of the logical systems employed is concerned, Nipkow [24] and Wiedijk [38] have shown how to verify existing proofs for the theorem in higher-order logic proof assistants. Grandi and Endriss [16] have shown that classical first-order logic is sufficiently expressive to model all aspects of Arrow’s Theorem, with the sole exception being the requirement that the set of agents be finite (the theorem is not valid for infinite electorates; cf. the use of induction in the proof of Theorem 3). In particular, modelling IIA does not require second-order quantification. At the most extreme end of the spectrum, Tang and Lin [34] have shown that the theorem can be embedded into classical propositional logic, albeit only for a fixed set of agents and a fixed set of alternatives. This embedding itself ceases to be useful for deepening our understanding of social choice (as it involves thousands of clauses, even for the simplest case of \(|N|=2\) and \(|X|=3\)). Instead, the great significance of the work of Tang and Lin derives from the fact that they have been able to provide a fully automated proof of the theorem based on this embedding. The work of Ågotnes et al. [1], like our own work, is orthogonal to these other contributions, in that they design a new tailor-made logic for social choice theory, rather than encoding those concepts into already existing logics. Note that Troquard et al. [37], the originators of the logic \({\varLambda }^{\mathrm{scf}}[N,X]\) we have used here, have themselves not attempted to model Arrow’s Theorem.

Examples for work in this vein addressing results other than Arrow’s Theorem are still rare. Tang and Lin [34] have extended their approach to proving Arrow’s Theorem also to the Muller–Satterthwaite Theorem and to Sen’s Theorem. Nipkow [24], besides treating Arrow’s Theorem, also has verified a proof of the Gibbard–Satterthwaite Theorem using a higher-order logic proof assistant. Grandi and Endriss [16] also formalise Sen’s Theorem.

To date, the approaches to modelling Arrow’s Theorem in logical frameworks in Hilbert-style calculi, namely the contributions of Ågotnes et al. [1] and of Grandi and Endriss [16], have not yet yielded a complete proof of the theorem within that same logical framework, although Ågotnes et al. [1] do succeed in providing a syntactic proof of a relevant lemma. In as yet unpublished work, Perkov [30] has outlined a natural deduction proof of Arrow’s Theorem using the language of Ågotnes et al. [1]. There currently are no results of this kind available for either the Muller–Satterthwaite Theorem or Sen’s Theorem.

A recent survey on logic and social choice theory [12] has identified three critical points in existing work on logics for modelling concepts in social choice: (1) whether the approach does not require us to fix the sets of agents and alternatives upfront, (2) whether the universal domain assumption can be expressed in an elegant manner, and (3) whether the approach facilitates automation. Regarding point (1), as discussed in Sect. 2.5, our logic is indeed subject to the common limitation of requiring us to fix the cardinalities of N and X before even the notion of a well-formed formula can be defined, but we have also demonstrated that in practice this limitation can be overcome by working with schemas parametrised by N and X. Point (2) is convincingly taken care of by Lemma 1, the Universal Domain Lemma. Point (3), finally, is addressed in Sect. 5, where we show how to reduce the satisfiability problem of the logic L[NX] to the satisfiability problem for propositional logic. Of course, to directly develop automated reasoning tools for L[NX], thereby foregoing the need for translation and the associated blow-up in the problem size, is still of some interest. Evidence for the claim that also this direction is feasible and promising is given by Troquard [36], who has initiated a study of algorithms for model checking for the full logic \({\varLambda }^{\mathrm{scf}}[N,X]\), including a prototype implementation.

7 Conclusion

We have shown how to obtain a syntactic proofs of Arrow’s Theorem [3], the Muller–Satterthwaite Theorem [23], and Sen’s Theorem [32] within a simple modal logic for speaking about basic concepts of preference aggregation. The logic in question is a fragment of a logic introduced by Troquard et al. [37], which we have shown to be complete by adapting their original completeness proof. While prior work has been successful in applying tools from logic and automated reasoning to social choice theory, this is the first human-readable formalisation of the framework of preference aggregation that allows for a direct derivation of these important impossibility theorems. Inspired by the work of Tang and Lin [34], we furthermore have suggested a pragmatic approach to implementing automated reasoning tools for the logic via a translation into propositional logic.

Because of the central role of Arrow’s Theorem in particular, not only in social choice theory at large, but also more specifically in the emerging literature on logics for social choice, where it has served as a yardstick for assessing the suitability of a variety of approaches to logical modelling, we believe that our work constitutes a useful step towards the longterm aim of the field. This aim is to offer tangible computer-aided support for reasoning about methods for collective decision making, be it in the context of political decision making, economic interaction, or multiagent systems.

Our results suggest two important directions for future work. First, it certainly is possible, at least in principle, to encode most of the commonly studied desiderata for voting rules, similar to those modelled in Sect. 3, in the logic considered here. To what extent this might allow us to verify whether a given voting rule satisfies a given property or to re-prove other classical results in social choice theory, such as May’s Theorem on the characterisation of the simple majority rule [22], are intriguing open questions. Second, our demonstration of the usefulness of modal logics of social choice underlines the importance of further developing the reasoning machinery for such logics, including optimised implementations, also beyond the translation-based approach advocated in Sect. 5.