Keywords

1 Introduction

Axiomatic studies of decision procedures (social welfare functions, social choice rules, multiple criteria decision-making methods (MCDM), etc.) have provided many lessons for aggregating voters’ preferences over alternatives. The choice of an appropriate procedure is, however, still a challenging problem and there seems no agreement on the best procedure, even among social choice theorists (Nurmi [1]).

Fig. 1.
figure 1

A sketch of a two-step PCBPC

Part of the reason for this difficulty may be that axiomatic characterizations do not, at least in some cases, directly answer which is the appropriate decision procedure. For instance, theorems such as “\( f \) is the unique social choice rule that satisfies criteria \( A \) and \( B \)”, or “\( g \) is the unique social choice rule that satisfies criteria \( A \) and \( C \)” deepen our understanding of these rules. But the choice between \( f \) and \( g \) is still controversial when there is no agreement on the relative desirability of \( B \) and \( C \).

This observation implies the necessity of a decision support system that deals with voters’ preferences over criteria. To this end, the present paper provides a formal model for procedural choice based on preferences over criteria (PCBPC). Although each of the axiomatic studies has provided the justifications for particular axioms, Nurmi [1] was the first to study the system for PCBPC (see also de Almeida et al. [2]; application of this method for business contexts is found in de Almeida and Nurmi [3]). Based on the performance matrix of voting rules (Table 1), Nurmi [1] proposes two different methods to create collective rankings/choices of voting rules based on the voters’ preferences over criteria. One method is to first determine the rankings of criteria with an ordinary decision procedure (e.g., Borda’s or Kemeny’s rule) and then determine the rankings of decision procedures (p. 247–248 in Nurmi [1]). The other method is to determine the rankings of decision procedures directly using MCDM methods (p. 250–251 in Nurmi [1]). As Nurmi points out, these methods are designed for situations where voters have linear preferences over criteria and some other methods can be used instead. The present study first aims to provide a formal model for PCBPC in order to make clear the foundations and implicit assumptions of PCBPC. Next, we characterize the counting rule which chooses those decision procedures with the greatest support when voters have dichotomous preferences over criteria.

Table 1. Whether a SCR satisfies a given criterion (1) or not (0) (from de Almeida et al. [2])

Let us review the literature related to procedural choice. Some authors study the desirable way of procedural choice when the preferences over decision procedures are consequentially induced from the preferences over alternatives (Rae [4]; Koray [5]; Barbera and Jackson [6]; Suzuki and Horita [7]). Others begin with the preferences over decision procedures (Dietrich [8]; Diss and Merlin [9]). Compared with these studies, PCBPC is new in that it is based on the preferences over criteria.

As Nurmi [1] points out, the PCBPC model looks similar to a MCDM model such as PROMETHEE at first glance, because both of them deal with a performance matrix such as Table 1. In our view, the biggest difference between them is that our PCBPC model is based on the social choice theoretic approach: preferences are supposed to be binary relations rather than numerical utility functions; each concept (criterion, aggregation rule, etc.) is defined through set theory; and axioms in social choice theory turn out to be important (as we will argue).

2 Designing a Formal Model for PCBPC

Let \( N = \left\{ {1,2, \ldots ,n} \right\} \) be the set of voters \( (n \ge 2) \) and \( {\mathcal{F}} \) be the set of feasible decision procedures. The elements of \( {\mathcal{F}} \) can be social choice rules, social welfare functions, social decision functions, MCDM methods, etc. A criterion \( C \) (for \( {\mathcal{F}} \)) is a proposition on the elements of \( {\mathcal{F}} \). More formally, a criterion \( C \) is a function that returns “true” or “false” for each \( f \in {\mathcal{F}} \) (\( C\left( f \right) = {\text{true}} \) means that \( f \in {\mathcal{F}} \) satisfies the criterion).

The most straightforward approach to PCBPC would be simply to use ordinary decision procedures (e.g., Borda’s or Kemeny’s rule) by regarding criteria as alternatives and then use another procedure to yield the rankings/choices of decision procedures:

Although this approach is intuitive, it can be controversial because of the differences between criteria and alternatives in the ordinary sense. The most essential difference is in the logical relationships between criteria, i.e., some sets of criteria are consistent, but others are not. As is well known, Arrow [10] ‘s impossibility theorem says that Pareto, independence of irrelevant alternatives, and non-dictatorship reach a contradiction in the design of social welfare functions (with universal domains). Therefore, to approve of all three criteria equally is logically impossible even though the society members may unanimously agree about them all. On the other hand, the combination of anonymity, neutrality, and positive responsiveness is consistent (May [11]) and so these three can be approved simultaneously. This implies that unsatisfactory results can happen if the two steps are independently determined. In other words, the set of admissible collective judgments of criteria should be restricted by the logical relationships of decision procedures at hand. In order to make matters simple, our model assumes that the PCBPC method returns the collective choices on decision procedures directly from the set of preference profiles over criteria (and not via the collective judgments of criteria).

Assumption 1 (Outline of a PCBPC Method)

A PCBPC method is a single-step procedureFootnote 1 that returns a nonempty subset of \( {\mathcal{F}} \) for any profile of voters’ preferences over criteria.

When examining voters’ preferences over criteria, however, to require such logical consistency seems too demanding. Suppose \( {\mathcal{F}} = \left\{ {{\text{Amendment}},{\text{Copeland}}} \right\} \). Then, Table 1 says that criteria a-d are logically equivalent in the sense that \( f \) satisfies criterion x if and only if \( f \) satisfies criterion y, for all \( f \in {\mathcal{F}} \) and \( x,y \in \left\{ {a,b,c,d} \right\} \). In this case, logical consistency of voters’ preferences over criteria would demand that a: the Condorcet winner criterion and d: Monotonicity must be preferred indifferently. This sounds too demanding especially when the voter is not familiar with those criteria. Assumption 2 allows voters to have any preference irrespective of the consistency:

Assumption 2 (Universal Domain)

The domain, or the set of admissible preference profiles, of the PCBPC method is universal.

Another justification of Assumption 2 is that voters’ preferences over criteria can be independent of \( {\mathcal{F}} \). For instance, a voter who favors plurality might prefer the criterion of monotonicity to the Condorcet winner criterion because plurality satisfies only the former. Assumption 2 is necessary for allowing preferences caused by such external factors. In summary, Assumption 2 says that voters may approve even an inconsistent set of criteria (e.g., Pareto, independence of irrelevant alternatives, and non-dictatorship for social welfare functions), and they may approve several criteria independently that are coincidentally equivalent under \( {\mathcal{F}} \).

In the present article, we restrict our attention to dichotomous preferences:

Assumption 3 (Dichotomous Preferences)

Each voter is presumed to have a dichotomous preference over criteria, i.e., each voter either approves or disapproves of each criterionFootnote 2.

According to Assumption 3, \( i \)‘s preference over criteria can be expressed by arraying those criteria that \( i \) approves, e.g., \( \left( {C_{i,1} ,C_{i,2} , \ldots ,C_{{i,a_{i} }} } \right) \). For any criterion \( C \), let \( {\mathcal{F}}_{C} = \left\{ {f \in {\mathcal{F}/}f\;{\text{satisfies}}\;C} \right\}. \)

Assumption 4 (Input to the PCBPC Method: Preferences over Criteria)

The input to the PCBPC method is the preference profile over criteria. A voter’s preference over criteria is expressed as a multiset of \( {\mathfrak{P}}\left( {\mathcal{F}} \right) \) such as

$$ {\mathcal{X}}_{i} = \left[ {{\mathcal{F}}_{{C_{i,1} }} ,{\mathcal{F}}_{{C_{i,2} }} , \ldots ,{\mathcal{F}}_{{C_{{i,a_{i} }} }} } \right], $$
(1)

where \( C_{i,1} ,C_{i,2} , \ldots ,C_{{i,a_{i} }} \) are interpreted as criteria that the voter \( i \) approves.

Let us give some comments on this assumption.

  1. (1)

    Roughly speaking, Assumption 4 means that the names of the criteria do not matter. Let us explain why. According to Assumption 4, the PCBPC method looks at not the criterion \( C \) itself but \( {\mathcal{F}}_{C} \). In the choices of Copeland and Dodgson, for example, there is supposed to be no difference between criteria b and d in Table 1 because for the two voting rules, satisfying b is equivalent to satisfying d. In this example, Assumption 4 demands that “a voter approves criterion b” is equivalent to “a voter approves criterion d”. The point is that Copeland is supported by one reason, no matter which of b or d is approved. In this sense, Assumption 4 is an axiom that treats equivalent criteria equally.

  2. (2)

    It is also noteworthy that the meaning of a criterion \( C \) is measured within the set of feasible decision procedures \( {\mathcal{F}} \). Consequently, the “same” criterion, say Condorcet winner criterion (CW), may be translated into different sets if different \( {\mathcal{F}} \)‘s are given: for instance, \( {\mathcal{F}}_{\text{CW}} = {\mathcal{F}} \) when \( {\mathcal{F}} \) is made up of amendment, Copeland, and Dodgson, while \( {\mathcal{F}}_{\text{CW}}^{'} = \phi \) if \( {\mathcal{F}^{\prime}} \) is made up of plurality, Borda, and approval (see Table 1). The difference reflects the logical strength of CW in the two cases (CW is extremely weak in the former case and extremely strong in the latter).

In this sense, Assumption 4 demands that the meaning of criteria be argued with respect to \( {\mathcal{F}} \), which can be interpreted as the domain of discourse (what kind of decision procedures are considered). This view is based on the usual practice of social choice theory. One can find many examples where impossibility theorems are conquered by expanding the domain \( {\mathcal{F}} \). One of the most famous cases is that the combination of Pareto, independence of irrelevant alternatives, and non-dictatorship (as well as unrestricted domain) is found to be consistent if social decision functions are considered (Sen [12]). This shows that we cannot argue the consistency without specifying the domain of discourse.

  1. (3)

    There may be some objections to this assumption. The strongest of them would be “Doesn’t it pay little attention to other important features of criteria, such as logical relationships between criteria?” The answer is “yes” and “no”. For one thing, the assumption neglects the relationship outside \( {\mathcal{F}} \). The fact that criterion a is stronger than b in the choice of \( {\mathcal{F}^{\prime}} = \left\{ {{\text{Amendment}},{\text{Copeland}},{\text{Dodgson}}} \right\} \) is not considered in the choice of \( {\mathcal{F}} = \left\{ {{\text{Amendment}},{\text{Copeland}}} \right\} \) (the two criteria are equivalent under \( {\mathcal{F}} \)). At the same time, however, the relationships inside \( {\mathcal{F}} \) are fully included. This is because the logical relationships between \( C \) and \( C^{\prime} \) under \( {\mathcal{F}} \) can be fully expressed by \( {\mathcal{F}}_{C} \) and \( {\mathcal{F}}_{{C^{\prime}}} \)Footnote 3

  2. (4)

    A multiset \( \left[ {A,B,C, \ldots ,Z} \right] \) is a set admitting repetition. Note that the ordering does not matter: \( \left[ {A,B} \right] \ne \left[ {A,B,B} \right] = \left[ {B,A,B} \right] = \left[ {B,B,A} \right]. \)

More formally, we say that \( {\mathcal{M}} \) is a multiset of a set \( S \) if \( {\mathcal{M}} \) is a function from \( S \) to \( {\mathbb{Z}}_{ \ge 0} \) (the set of all nonnegative integers), where \( {\mathcal{M}}\left( A \right) = k \) means that \( A \) appears \( k \) timesFootnote 4. For distinction, square brackets are used to express multisets.

  1. (5)

    We use multisets instead of ordinary sets so that we can admit the approval of some criteria that are coincidentally equivalent under \( {\mathcal{F}} \) (recall Assumption 2).

  2. (6)

    Note also that our model does not restrict the set of criteria. Each voter can select his/her own set of criteria (e.g., those rules that he/she knows). This is one of the main differences between the present model and Nurmi [1] ‘s, which considers a fixed set of criteria.

Let \( {\mathfrak{D}} \) be the set of all multisets of \( {\mathfrak{P}}\left( {\mathcal{F}} \right) \). Based on Assumption 1-4, we obtain the definition of the PCBPC method (from now on, we call it the aggregation rule).

Definition 1

An aggregation rule is a function from \( {\mathfrak{D}}^{n} \) to the set of all nonempty subsets of \( {\mathcal{F}} \).

3 Results

We use script letters \( {\mathcal{X}},{\mathcal{Y}}, \ldots \) to represent preference profiles and script letters with subscripts \( {\mathcal{X}}_{i} ,{\mathcal{X}}_{j} , \ldots \) to represent preferences. Subsets of \( {\mathcal{F}} \) are denoted by capital letters \( X,Y, \ldots \)

Definition 2 (Axioms for Aggregation Rules)

An aggregation rule \( F \) is said to satisfy

  1. 1)

    anonymity (A) if for all \( i,j \in N \) and \( {\mathcal{X}}_{i} ,{\mathcal{X}}_{j} \in {\mathfrak{D}} \) such that \( C \in {\mathcal{X}}_{j} \), if \( {\mathcal{X}}_{i}^{'} = {\mathcal{X}}_{i} \cup \left[ C \right] \) and \( {\mathcal{X}}_{j}^{'} = {\mathcal{X}}_{j} { \setminus }\left[ C \right] \), thenFootnote 5 \( F\left( {{\mathcal{X}}_{i} ,{\mathcal{X}}_{j} ,{\mathcal{X}}_{ - i,j} } \right) = F\left( {{\mathcal{X}}_{i}^{'} ,{\mathcal{X}}_{j}^{'} ,{\mathcal{X}}_{ - i,j} } \right) \).

  2. 2)

    neutrality (N) if for all \( f,g \in {\mathcal{F}} \), if \( {\mathcal{X}^{\prime}} \in {\mathfrak{D}}^{n} \) is a profile obtained from \( {\mathcal{X}} \in {\mathfrak{D}} \) by swapping the positions of \( f \) and \( g \), then \( f \in F\left( {\mathcal{X}} \right) \Leftrightarrow g \in F\left( {{\mathcal{X}^{\prime}}} \right) \).

  3. 3)

    strict monotonicity (SM) if for all \( i \in N \), \( {\mathcal{X}} = \left( {{\mathcal{X}}_{1} ,{\mathcal{X}}_{2} , \ldots ,{\mathcal{X}}_{n} } \right) \in {\mathfrak{D}}^{n} \), and \( X \subseteq {\mathcal{F}} \), if \( F\left( {\mathcal{X}} \right) \cap X \ne \phi \), then \( F\left( {{\mathcal{X}^{\prime}}} \right) = F\left( {\mathcal{X}} \right) \cap X \), where \( {\mathcal{X}^{\prime}} \) is a profile obtained from \( {\mathcal{X}} \) by adding \( X \) to someone’s preference.

  4. 4)

    partition consistency (PC) if for all \( i \in N \), disjoint \( X,Y \subseteq {\mathcal{F}} \) such that \( X \cup Y \in {\mathcal{X}}_{i} \), \( F\left( {{\mathcal{X}}_{i} ,{\mathcal{X}}_{ - i} } \right) = F\left( {{\mathcal{X}}_{i}^{'} ,{\mathcal{X}}_{ - i} } \right) \) where \( {\mathcal{X}}_{i}^{'} = \left( {{\mathcal{X} \setminus }\left[ {X \cup Y} \right]} \right) \cup \left[ {X,Y} \right] \).

As usual, A and N demand that the results are not susceptible to the names of the voters or decision procedures in \( {\mathcal{F}} \), respectively.

SM is also a straightforward modification of strict monotonicity for social choice rules. It demands that if a criterion \( C \) is additionally approved by someone, the results will change in favor of the decision procedures supported by the criterion.

The basic idea of PC is that representations of equivalent criteria in different ways should not affect the results. For instance, consider a voter who is in favor of those decision procedures satisfying at least one of either the Condorcet winner criterion (CW) or the Condorcet loser criterion (CL). The union of them can be expressed in several waysFootnote 6:

$$ CW \vee CL = \left( {CW \wedge \neg CL} \right) \vee CL = CW \wedge \left( {CL \wedge \neg CW} \right). $$
(2)

For this rephrasing, PC demands that each of the following is regarded as the same in the eyes of the aggregation rule:

  1. 1)

    Approve the united axiom \( CW \vee CL \);

  2. 2)

    Approve \( \left( {CW \wedge \neg CL} \right) \) and \( CL \); and

  3. 3)

    Approve \( CW \) and \( \left( {CL \wedge \neg CW} \right) \).

Another justification of PC is as follows. Suppose that criterion d in Table 1 (monotonicity) is favored by the society members. At one time, however, the importance of criterion i (invulnerability to the no-show paradox) is argued and the group is divided into three subgroups: [i] those who prefer \( d \wedge i \), [ii] those who prefer \( d \wedge \neg i \) (maybe those who do not approve criterion i), and [iii] those who prefer both (maybe those who do not care about criterion i). In this case, we can say either that voters in [iii] approve \( d \) or that they approve both \( \left( {d \wedge i} \right) \) and \( \left( {d \wedge \neg i} \right) \). PC demands that these two ways of preference revelation yield the same outcomes.

For a preference profile \( {\mathcal{X}} = \left( {{\mathcal{X}}_{1} ,{\mathcal{X}}_{2} , \ldots ,{\mathcal{X}}_{n} } \right) \in {\mathfrak{D}}^{n} \), we define the appearance of \( f \in {\mathcal{F}} \) at \( {\mathcal{X}}_{i} \) as

$$ a\left( {f:{\mathcal{X}}_{i} } \right): = \sum\nolimits_{{X \subseteq {\mathcal{F}}\;{\text{s}}.{\text{t}}.\;f \in X}} {{\mathcal{X}}_{i} \left( X \right).} $$
(3)

Also, the appearance of \( f \in {\mathcal{F}} \) at \( {\mathcal{X}} = \left( {{\mathcal{X}}_{1} ,{\mathcal{X}}_{2} , \ldots ,{\mathcal{X}}_{n} } \right) \) is

$$ a\left( {f:{\mathcal{X}}} \right): = \sum\nolimits_{i \in N} {a\left( {f:{\mathcal{X}}_{i} } \right).} $$
(4)

In words, \( a\left( {f:{\mathcal{X}}_{i} } \right) \) and \( a\left( {f:{\mathcal{X}}} \right) \) represent how many criteria are approved that support \( f \) (as noted in (4) of Assumption 4, \( {\mathcal{X}}_{i} \left( X \right) \) denotes how many times \( X \) appears in \( {\mathcal{X}}_{i} \)). An aggregation rule \( F \) is called the counting rule, denoted by \( F_{c} \), if it selects the decision procedures with the highest appearances:

$$ F\left( {\mathcal{X}} \right) = \mathop {\text{argmax}}\nolimits_{{f \in {\mathcal{F}}}} \,a\left( {f:{\mathcal{X}}} \right). $$
(5)

In the later arguments, we use

$$ A\left( {\mathcal{X}} \right) = \sum\nolimits_{{f \in {\mathcal{F}}}} {a\left( {f:{\mathcal{X}}} \right).} $$
(6)

Example 1

Consider a group of three voters, denoted by 1-3. Suppose voter 1 approves criteria a, b, and c in Table 1 (maybe a person who prefers the Condorcet-related criteria) and voter 2 approves e and h (maybe a person who prefers Arrow’s framework), and voter 3 approves d and i. Then, among the 13 procedures in Table 1, Copeland, Kemeny, and Black get the highest appearances of 5 pts and thus selected by the counting rule.

Theorem 1

An aggregation rule \( F \) satisfies A, N, SM, and PC if and only if it is the counting rule.

[Proof of Theorem 1]

The ‘if’ part is straightforward. We prove the ‘only if’ part. Let \( F \) be an aggregation rule satisfying A, N, SM, and PC. The proof is made by an induction on \( A\left( {\mathcal{X}} \right) \).

Suppose that \( A\left( {\mathcal{X}} \right) = 0 \). For any \( f,g \in {\mathcal{F}} \), \( {\mathcal{X}} \) remains the same when the positions of \( f \) and \( g \) are swapped. So, N demands that \( f \in F\left( {\mathcal{X}} \right) \Leftrightarrow g \in F\left( {\mathcal{X}} \right) \) for all \( f,g \in {\mathcal{F}} \). Since \( F\left( {\mathcal{X}} \right) \ne \phi \) by definition, it follows that \( F\left( {\mathcal{X}} \right) = {\mathcal{F}} \).

Suppose that the “if” part holds when \( A\left( {{\mathcal{X}^{\prime}}} \right) \le k \). Suppose \( A\left( {\mathcal{X}} \right) = k + 1 \). Let \( {\mathcal{X}}^{0} = {\mathcal{X}} = \left( {{\mathcal{X}}_{1} ,{\mathcal{X}}_{2} \ldots ,{\mathcal{X}}_{n} } \right) \), \( {\mathcal{X}}_{i} = \left[ {X_{i,1} ,X_{i,2} , \ldots ,X_{{i,a_{i} }} } \right] \), and \( E: = \left\{ {f \in {\mathcal{F}|}a\left( {f:{\mathcal{X}}} \right) = k + 1} \right\} \). The proof is completed if we show \( F\left( {\mathcal{X}} \right) = E \).

For each \( e \in E \), we choose \( i\left( e \right) \in N \), \( j\left( e \right) \in \left\{ {1,2, \ldots ,a_{i\left( e \right)} } \right\} \), and \( c\left( e \right) = X_{i\left( e \right),j\left( e \right)} \) such that \( e \in X_{i\left( e \right),j\left( e \right)} \) (such \( X_{i\left( e \right),j\left( e \right)} \) exists because \( a\left( {e:{\mathcal{X}}} \right) = k + 1 > 0 \)). Let \( {\mathcal{X}}^{1} \) be a profile obtained from \( {\mathcal{X}} \) by deleting each \( e \in E \) from \( c\left( e \right) \). By the assumption of induction, we have

$$ F\left( {{\mathcal{X}}^{1} } \right) = \left\{ {f \in {\mathcal{F}}\;\text{|}\;a\left( {f:{\mathcal{X}}^{1} } \right) = k} \right\}. $$
(7)

Note that \( E \subseteq F\left( {{\mathcal{X}}^{1} } \right) \). Next, we construct \( {\mathcal{X}}^{2} \) by adding \( E \) to \( {\mathcal{X}}_{1}^{1} \) (voter \( 1 \)‘s preference at \( {\mathcal{X}}^{1} \)). With SM, we have \( F\left( {{\mathcal{X}}^{2} } \right) = F\left( {{\mathcal{X}}^{1} } \right) \cap E = E \). Let \( E = \left\{ {e_{1} ,e_{2} , \ldots ,e_{k} } \right\} \). Divide the added \( E \) to singletons in \( {\mathcal{X}}_{1}^{2} \) to make \( {\mathcal{X}}_{1}^{3} \):

$$ {\mathcal{X}}_{1}^{2} = \left[ { \ldots ,E} \right] \mapsto {\mathcal{X}}_{1}^{3} = \left[ { \ldots ,\left\{ {e_{1} } \right\},\left\{ {e_{2} } \right\}, \ldots ,\left\{ {e_{k} } \right\}} \right]. $$
(8)

By repeating PC, we have that \( F\left( {{\mathcal{X}}^{3} } \right) = F\left( {{\mathcal{X}}^{2} } \right) = E \). Construct \( {\mathcal{X}}^{4} \) by shifting each \( \left\{ e \right\} \) \( \left( {e = e_{1} ,e_{2} , \ldots ,e_{k} } \right) \) from \( {\mathcal{X}}_{1}^{3} \) to \( {\mathcal{X}}_{i\left( e \right)}^{3} \). With A, it follows that \( F\left( {{\mathcal{X}}^{3} } \right) = F\left( {{\mathcal{X}}^{4} } \right) \). With PC, we can restore each \( X_{i\left( e \right),j\left( e \right)} \) without changing the results: \( F\left( {{\mathcal{X}}^{4} } \right) = F\left( {{\mathcal{X}}^{0} } \right) \). So, we have \( F\left( {{\mathcal{X}}^{0} } \right) = E \). ■

[Independence of the axioms]

To verify the independence of each axiom used in Theorem 1, we provide several examples. The proof is omitted because they are straightforward.

  • A, SM, PC but not N: constant rule \( F\left( {\mathcal{X}} \right) = \left\{ f \right\} \) for some fixed \( f \in {\mathcal{F}} \).

  • N, SM, PC, but not A: fix a voter \( i^{*} \in N \). With a slight abuse of notation, let

$$ F_{C} \left( {\mathcal{X}} \right) \cap {\mathcal{X}}_{{i^{ *} }} = \left\{ {f \in F_{C} \left( {\mathcal{X}} \right)\;\text{|}\;\exists X \in {\mathcal{X}}_{{i^{ *} }} \;{\text{s}}.{\text{t}}.\;f \in X} \right\}. $$
(9)

Define an aggregation rule \( F \) as:

$$ F\left( {\mathcal{X}} \right) = \left\{ {\begin{array}{*{20}l} {F_{c} \left( {\mathcal{X}} \right)} \hfill & {if\;F_{C} \left( {\mathcal{X}} \right) \cap {\mathcal{X}}_{{i^{ *} }} = \phi } \hfill \\ {F_{C} \left( {\mathcal{X}} \right) \cap {\mathcal{X}}_{{i^{ *} }} } \hfill & {otherwise.} \hfill \\ \end{array} } \right. $$
(10)
  • A, N, SM, but not PC: An alternative definition of \( F_{c} \) is as follows. Given a profile \( {\mathcal{X}} = \left( {{\mathcal{X}}_{1} ,{\mathcal{X}}_{2} , \ldots ,{\mathcal{X}}_{n} } \right) \) with \( {\mathcal{X}}_{i} = \left[ {X_{i,1} , \ldots ,X_{{i,a_{i} }} } \right] \), \( F_{c} \) searches for each \( f \) among \( X_{i,1} ,X_{i,2} , \ldots ,X_{{i,a_{i} }} \) and assigns one point to \( f \) every time it is found in some \( X_{i,j} \). Those decision procedures with the highest points are selected.

Slightly modifying the algorithm above, assign \( 1/\left| {X_{i,j} } \right| \) points every time \( f \) is found in some \( X_{i,j} \). Such a rule satisfies A, N, SM, but not PC.

  • A, N, PC, but not SM: constant rule \( F\left( {\mathcal{X}} \right) = {\mathcal{F}} \). Another example is the aggregation rule that chooses the decision procedures with the greatest and the second greatest number of appearances.

4 Discussion

In Sect. 3, we provide a characterization of counting rule \( F_{C} \) from the four axioms introduced. In other words, we justify the choice of those decision procedures with highest supports. The significance of the theorem depends on the plausibility of the assumptions as well as the present model. This section discusses what happens if the foundations of Theorem 1 is slightly changed, which clarifies the limitations and the significance of the present model.

  1. (1)

    What is a criterionan atomic element or a composite?

First, let us discuss the foundation of PC. Let \( u\left( {\mathcal{X}} \right) \) be the set of criteria that all individuals agree at profile \( {\mathcal{X}} \): \( u\left( {\mathcal{X}} \right) = \left\{ {X \subseteq {\mathcal{F}/}X \in {\mathcal{X}}_{i} \;{\text{for}}\;{\text{all}}\;i \in N} \right\}. \)

We say that an aggregation rule satisfies unanimity on criteria (UC) if \( {\mathcal{F}}\left( {\mathcal{X}} \right) \subseteq \left( {\bigcap\nolimits_{{X \in u\left( {\mathcal{X}} \right)}} X } \right) \) holds whenever \( \bigcap\nolimits_{{X \in u\left( {\mathcal{X}} \right)}} {X \ne \phi } \). In words, if there exist unanimously agreed criteria and they are consistent in the sense that some decision procedure(s) satisfy them all, the winners must be among such decision procedures. Though it looks plausible, this axiom is inconsistent with PC.

Theorem 2 (Impossibility of PC and UC)

If \( \left| {\mathcal{F}} \right| \ge 3 \) , there is no aggregation rule that satisfies both PC and UC.

(Proof of Theorem 2) Assume to the contrary that \( F \) satisfies UC and PC. Let \( f,g,h \) be distinct elements in \( {\mathcal{F}} \). Let \( {\mathcal{X}}_{1} = \left[ {\left\{ {f,h} \right\},\left\{ {g,h} \right\}} \right] \) and \( {\mathcal{X}}_{i} = \left[ {\left\{ f \right\},\left\{ g \right\}} \right] \) for all \( i \ge 2 \). Let \( {\mathcal{X}}_{1}^{'} = \left[ {\left\{ f \right\},\left\{ h \right\},\left\{ {g,h} \right\}} \right] \) and \( {\mathcal{X}}_{1}^{''} = \left[ {\left\{ {f,h} \right\},\left\{ g \right\},\left\{ h \right\}} \right] \). With PC, we have \( F\left( {\mathcal{X}} \right) = F\left( {{\mathcal{X}}_{1}^{'} ,{\mathcal{X}}_{ - 1} } \right) = F\left( {{\mathcal{X}}_{1}^{''} ,{\mathcal{X}}_{ - 1} } \right). \) But UC implies that \( F\left( {{\mathcal{X}}_{1}^{'} ,{\mathcal{X}}_{ - 1} } \right) \subseteq \left\{ f \right\} \) and \( F\left( {{\mathcal{X}}_{1}^{''} ,{\mathcal{X}}_{ - 1} } \right) \subseteq \left\{ g \right\} \). Contradiction. ■

Readers might find this result somewhat bewildering, because unanimity is one of the most plausible axioms in social choice theory. Thus, one might become concerned with the soundness of PC. Our answer is that these two criteria are based on totally different points of view, which should be called atomic or composite standpoints. Recall that PC demands that different ways to represent a criterion, e.g., \( C_{1} \vee C_{2} = \left( {C_{1} \wedge \neg C_{2} } \right) \vee C_{2} \), do not affect the results. Consequently, it is possible to partition criteria into pieces without changing the results (see the proof of Theorem 1). Put another way, PC sees each criterion as a composite object that can be decomposed (i.e., a composite standpoint). On the other hand, UC focuses on the form of each criterion as it is, implicitly distinguishing a criterion from its decomposition. In this sense, UC is based on an atomic standpoint: a criterion is indivisible. The impossibility shown in Theorem 2 can be interpreted as the (huge) gap between these two standpoints.

Indeed, the impossibility soon disappears if we think of unanimity based on a composite point of view instead. Let us say that an aggregation rule satisfies unanimity on procedures (UP) if whenever there exists a decision procedure \( f \) that is unanimously supported by every criterion in the profile, i.e., \( f \in X \) for all \( X \in {\mathcal{X}}_{1} ,{\mathcal{X}}_{2} , \ldots ,{\mathcal{X}}_{n} \), then the winners must be found among such \( f \)‘s. One can easily see that UP and PC are consistent, because \( F_{c} \) satisfies them both.

In the rest of this discussion, we argue how PCBPC can be based on the atomic standpoint. To do this, we need to show another impossibility (Theorem 3 below).

  1. (2)

    To distinguish criteria that are logically equivalent under \( {\mathcal{F}} \)

Theorem 3 (Impossibility of A and UC)

There is no aggregation rule that satisfies both A and UC.

(Proof of Theorem 3)

Assume to the contrary that such an aggregation rule \( F \) exists. Let \( {\mathcal{X}}_{1} = \left[ {\left\{ f \right\},\left\{ f \right\},\left\{ g \right\},\left\{ g \right\}} \right] \), \( {\mathcal{X}}_{2} = \phi \), and \( {\mathcal{X}}_{i} = \left[ {\left\{ f \right\},\left\{ g \right\}} \right] \) for all \( i \ge 3 \). Let \( {\mathcal{X}}_{1}^{'} = \left[ {\left\{ f \right\},\left\{ g \right\},\left\{ g \right\}} \right] \), \( {\mathcal{X}}_{2}^{'} = \left[ {\left\{ f \right\}} \right] \), \( {\mathcal{X}}_{1}^{''} = \left[ {\left\{ f \right\},\left\{ f \right\},\left\{ g \right\}} \right] \), and \( {\mathcal{X}}_{2}^{''} = \left[ {\left\{ g \right\}} \right] \). Then, A implies that \( F\left( {{\mathcal{X}}_{1} ,{\mathcal{X}}_{2} ,{\mathcal{X}}_{ - 1,2} } \right) = F\left( {{\mathcal{X}}_{1}^{'} ,{\mathcal{X}}_{2}^{'} ,{\mathcal{X}}_{ - 1,2} } \right) = F\left( {{\mathcal{X}}_{1}^{''} ,{\mathcal{X}}_{2}^{''} ,{\mathcal{X}}_{ - 1,2} } \right) \). However, UC demands that \( F\left( {{\mathcal{X}}_{1}^{'} ,{\mathcal{X}}_{2}^{'} ,{\mathcal{X}}_{ - 1,2} } \right) \subseteq \left\{ f \right\} \) and \( F\left( {{\mathcal{X}}_{1}^{''} ,{\mathcal{X}}_{2}^{''} ,{\mathcal{X}}_{ - 1,2} } \right) = \left\{ g \right\} \). Contradiction. ■

In the proof, two assumptions—the explicit and the implicit—play key roles. The explicit assumption is the very heart of anonymity, i.e., who approves a certain criterion (\( \left\{ f \right\} \) and \( \left\{ g \right\} \) in the proof), must not affect the result. The implicit assumption is the repeated appearances of logically equivalent criteria (two \( \left\{ f \right\} \)‘s and two \( \left\{ g \right\} \)‘s are included in a voter’s preference), which is in fact the consequence of Assumption 2. As one can see, the proof of Theorem 3 fails if such repetition is not allowedFootnote 7. Therefore, in the last part of this discussion, we argue PCBPC can be based on the atomic standpoint by dropping Assumption 2.

  1. (3)

    What if we drop Assumption 2?

Finally, we give an informal discussion of what happens if we drop Assumption 2. Specifically, we consider a modified model that prohibits us from claiming logically equivalent criteria in a voter’s preference. This restriction can be expressed by substituting \( {\mathbb{Z}}_{ \ge 0} \) with \( \left\{ {0,1} \right\} \) in the co-domain of \( {\mathcal{X}}_{i} \).

We define Nurmi’s aggregation rule \( F_{N} \) below. Note that the rule is slightly modified from Nurmi [1] so that it fits our dichotomous preference model.

Let \( {\mathcal{X}} = \left( {{\mathcal{X}}_{1} ,{\mathcal{X}}_{2} , \ldots ,{\mathcal{X}}_{n} } \right) \) be a profile and \( {\mathcal{C}} \) be the finite set of criteria at hand. Each voter’s preference is supposed to be a subset of \( {\mathcal{C}} \). Let \( a\left( {C:{\mathcal{X}}} \right) \) be the number of voters who approve criterion \( C \in {\mathcal{C}} \).

[Step 1] Determination of collective ranking over \( {\mathcal{C}} \).

Define an weak orderingFootnote 8 \( { \gtrsim } \) over \( {\mathcal{C}} \) such that for all \( C,D \in {\mathcal{C}} \),

$$ C{ \gtrsim }D \Leftrightarrow a\left( {C:{\mathcal{X}}} \right) \ge a\left( {D:{\mathcal{X}}} \right). $$
(11)

Suppose that all elements in \( {\mathcal{C}} \) are arrayed from the greatest to the least as \( C_{1} \sim C_{2} \sim \cdots \sim C_{k} { \gtrsim }D_{1} \sim D_{2} \sim \cdots D_{l} { \gtrsim }E_{1} \sim E_{2} \sim \cdots \sim E_{m} { \gtrsim }F_{1} \sim \cdots \).

[Step 2] Collective choice over \( {\mathcal{F}} \) based on the ranking generated in Step 1.

Let \( {\mathcal{F}}_{1}^{'} = {\mathcal{F}}_{{C_{1} }} \cap {\mathcal{F}}_{{C_{2} }} \cap \cdots \cap {\mathcal{F}}_{{C_{k} }} \). If \( {\mathcal{F}}_{1}^{'} \) is empty, \( F_{N} \left( {\mathcal{X}} \right) = {\mathcal{X}} \). If \( {\mathcal{F}}_{1}^{'} \) is a singleton, \( F\left( {\mathcal{X}} \right) = F_{1}^{'} \).

Otherwise, let \( {\mathcal{F}}_{2}^{'} = {\mathcal{F}}_{1}^{'} \cap {\mathcal{F}}_{{D_{1} }} \cap {\mathcal{F}}_{{D_{2} }} \cap \cdots \cap {\mathcal{F}}_{{D_{l} }} \). If \( {\mathcal{F}}_{2}^{'} \) is empty, \( F_{N} \left( {\mathcal{X}} \right) = {\mathcal{F}}_{1}^{'} \). If \( {\mathcal{F}}_{2}^{'} \) is a singleton, \( F\left( {\mathcal{X}} \right) = F_{2}^{'} \).

Otherwise, let \( {\mathcal{F}}_{3}^{'} = {\mathcal{F}}_{2}^{'} \cap {\mathcal{F}}_{{E_{1} }} \cap {\mathcal{F}}_{{E_{2} }} \cap \cdots \cap {\mathcal{F}}_{{E_{m} }} \). If \( {\mathcal{F}}_{3}^{'} \) is empty, \( F_{N} \left( {\mathcal{X}} \right) = {\mathcal{F}}_{2}^{'} \). If \( {\mathcal{F}}_{3}^{'} \) is a singleton, \( F\left( {\mathcal{X}} \right) = {\mathcal{F}}_{3}^{'} \). Otherwise, consider \( {\mathcal{F}}_{4}^{'} \) and the process goes in this way until it finds a singleton (if the intersection of all elements in \( C \) includes more than one element, then let \( F_{N} \left( {\mathcal{X}} \right) \) be the intersection).

Note that all voters (decision procedures) are treated equally in the algorithm, which means that \( F_{N} \) satisfies A (N). Furthermore, it also satisfies UC. To see this, note that the basic idea of \( F_{N} \) is to search for a singleton subset of \( {\mathcal{F}} \) by descending from “the greatest” criteria (i.e., the greatest elements of \( {\mathcal{C}} \) with respect to \( { \gtrsim } \)) to “the least” criteria. Throughout this process, we have \( {\mathcal{F}}_{1}^{'} \supseteq {\mathcal{F}}_{2}^{'} \supseteq {\mathcal{F}}_{3}^{'} \cdots \). As unanimously agreed criteria, if they existed, would belong to \( {\mathcal{F}}_{1}^{'} \), UC holds under \( F_{N} \). As we noted earlier, this process is based on a different point of view from the counting rule \( F_{c} \), or the present paper’s framework (Assumptions 1-4). To characterize a multi-step aggregation rule like \( F_{N} \) could be an interesting future topic.

5 Concluding Remarks

The present article provides a formal model for PCBPC, or procedural choice based on preferences over criteria. Theorem 1 characterizes the counting rule, which chooses the decision procedures with the highest number of supports by approved criteria, with four axioms: anonymity, neutrality, strict monotonicity and partition consistency. The first three are well-known and essentially the same as those used in Dietrich [8] ‘s characterization of “counting rule” for a procedural choice based on preferences over “decision procedures”Footnote 9. As in Dietrich’s view, the use of anonymity and neutrality may be justified on the grounds that procedural choice is made only by the judgments within the decision body. Whether certain voters or decision procedures have competence against others should not be decided a priori; that is a matter of collective choice. SM is also important to exclude constant rules or such absurd rules that choose those decision procedures with minimal support.

Our model focuses only on dichotomous preferences (Assumption 3). While this is a typical assumption in voting theory, many studies on MCDM often consider cardinal preferences over criteria. Therefore, to look for anonymous, neutral, strict monotonic, and partition consistent aggregation rules under more complicated preferences over criteria (e.g., linear preferences or even fuzzy preferences) is an interesting topic for future research both from technical and practical points of view.