1 Introduction

Two principles that social-aggregation rules should adhere to are majoritarianism and the recognition of voters' preference intensity. These ideals safeguard the interests of a simple majority as well as protect a minority possessing significantly robust preferences, respectively. The delicate balance between these principles fuels the ongoing discourse between proponents of the majority rule (MR) and supporters of certain scoring rules, particularly the Borda rule (BR) and the prevalent plurality rule (PR).Footnote 1

Traditionally, the advocacy for MR, BR, and PR has been grounded in axiomatic rationales, demonstrating that these aggregation rules exclusively fulfill the desirable prerequisites concerning the connection between individual and social preferences. There are two distinct justifications for MR: one is the epistemic stance, rooted in Condorcet's Jury Theorem (as referenced in List and Goodin 2001); the other is the consequentialist-utilitarian perspective (as highlighted in Brighouse & Fleurbaey, 2010; Nakada et al., 2023; Rae, 1969), which is particularly pertinent during the constitutional stage, a phase dominated by the veil of ignorance.

Historically, there has been no direct comparison between MR and either BR or PR, considering their inherent drawbacks: the disregarding of preference intensity and the violation of majoritarianism, respectively. This study aims to bridge this gap by first examining MR and BR, and then comparing MR and PR in light of their costs. 'Cost' here represents the degradation of the principle each rule infringes. The two main findings reported below are PR's supremacy over MR and MR superiority over BR. These results cast a fresh perspective on the age-old debate between Condorcet and Borda and their advocates, who fervently criticized PR before championing, in order, MR and BR. Moreover, the first finding also revives the debate about the merits of PR which despite its apparent practical benefits, received limited support from 22 preeminent voting-theory experts who ranked 18 different voting rules, as evidenced by Laslier (2012).

2 The novelty of our approach

Let society N consist of n voters, n > 2, and suppose that the set of social alternatives, X, has m elements, m > 2. \({R}_{i}\) denotes the preference relation of individual i, which is assumed to be strict ordering, and R = (R \(_{1}\), …,\({R}_{n}\)) is a preference profile. An aggregation rule is a mapping from the set of possible profiles to the set of possible reflexive and complete social-preference relations. Here we do allow for indifference and the typical social-preference relation is R. The score of an alternative x under the Borda rule and the Plurality rule on which we focus is denoted by B(x) and P(x). A majority prefers y to x when the number of individuals who prefer y to x, N(y,x), is larger than the number of individuals who prefer x to y.

Our combined approach of ordinal and ranking-based utilitarianism is very different from the typical unrestricted utilitarian approach that works mostly with the standard principles (Benthamite, Rawlsian, etc.), which is very difficult to apply in comparing alternative voting rules. More explicitly, the typical utilitarian approach compares the expected social welfare (Benthamite, Rawlsian, etc.,) obtained under different voting rules. In contrast, in our approach, the social planner compares MR to a scoring rule based on expected deviation from the two fundamental democratic principles that are assumed to be of equal significance and not on some standard utilitarian principle. Since MR implies ordinal utilities, the proposed measure of the expected deviation of a scoring rule from the majority principle is naturally defined in terms of the number of individuals who prefer one alternative to another one, and not in terms of cardinal interpersonally non-comparable individual utilities. Since scoring rules imply some restricted form of cardinal and personally comparable utilities, the proposed measure of the expected deviation of MR from the principle of allowing expression of preference intensity is naturally defined in terms of the ranking-based utilities of the two scoring rules on which we focus.Footnote 2 Both measures of deviation from the fundamental principles depend on the assumed preference distributions in the population and they take into account all possible preference profiles and all possible compared alternatives.

Our focus is on the most widely studied monotonic scoring rule, BR, and the most common weakly monotonic scoring rule, PR. Given the plausibility of a complete veil of ignorance in the constitutional stage, BR is a reasonable representative scoring rule.Footnote 3 Thus, we first compare MR with BR and then compare MR with PR.Footnote 4

BR violates the majority principle in those instances, namely, preference profiles and pairs of compared alternatives, where it protects the minority effectively by taking into account its higher preference intensity rather than the majority’s lower preference intensity. While the emphasis in Baharad and Nitzan (2002) is on the different degrees of majority-decisiveness amelioration that different scoring rules provide, in the current study the focus is on the comparison between MR and the two most common scoring rules based on the severity of the problems they cause: disregarding expression of preferences, which implies prevention of effective expression of preference intensity by the minority, and violating the majority principle. Let us now define the severity of the two problems in a way that enables a comparison of the “costs” associated with applying the two alternative democratic voting rules and, in turn, the preference of MR or BR.Footnote 5

3 The severity of violating the two fundamental principles

One possibility is to measure the severity of a problem by the probability of its occurrence. Baharad and Nitzan (2007, 2011) take such an approach, focusing on comparing alternative scoring rules. Gehrlein and Lepelley (2011, 2017) apply this criterion in assessing different election paradoxes. In our study we focus on the Condorcet–Borda (binary–positional) controversy and the analysis rests on two criteria. First, the comparison between the prior likelihoods of the compared rules to be superior in terms of the deviation from the aforementioned fundamental democratic principles. Second, we take into account not only the severity of a problem in terms of the probability of its occurrence but also in terms of the expected erosion of the two foundational democratic principles. Erosion of the majority principle takes into account all possible preference profiles and any pair of alternatives where the majority’s preference is overlooked. Erosion of the second principle, namely, respect of voters’ preference intensity, takes into account all possible preference profiles and any pair of alternatives where the disregard of preference intensity implies that the minority’s preference intensity is disregarded, even though it is larger than that of the majority.

Our analysis may yield a flexible, “case-dependent” choice between the two aggregation rules. One rule may be superior for certain preference profiles and pairs of alternatives whereas the other rule may prevail for others.Footnote 6 The implementation of such flexibility, requiring the practical partitioning of the set of pairs of alternatives, may involve considerable difficulties. Our objective, therefore, is to identify the preferred aggregation rule, just one of the two rules and not a flexible, case-dependent rule, based on its larger likelihood to be superior and its lower expected violation of a fundamental principle. That is, the expected severity of the problem that it raises should be lower than that of the problem associated with using the alternative aggregation rule. The expected severity of the compared rules is referred to as their expected cost. The main contribution of our study is the clarification of the expected costs of the rules and the use of these expected costs to determine the superiority of one of them.

Given a specific situation, namely preference profile R and pair of alternatives x and y, we first measure the corresponding cost of applying MR in terms of the erosion of the principle that preference intensity must be taken into account, and, in particular, the minority’s preference intensity, C(MR, R, x, y); then we measure the cost of applying BR in terms of the erosion of majoritarianism, C(BR, R, x, y); The application of MR is warranted in a specific situation if C(MR, R, x, y) < C(BR, R, x, y); the application of BR is warranted if C(BR, R, x, y) < C(MR, R, x, y). And in case C(MR, R, x, y) = C(BR, R, x, y), the use of either MR or BR is justified.Footnote 7 This may ensure an ideal flexible situation-dependent balanced democracy that applies in every particular situation the aggregation rule associated with the lower cost. As already noted, however, such a flexible situation-dependent aggregation rule is difficult to implement because it requires information about the voters’ actual preference profile. Therefore, we impose the restriction that the same aggregation rule must be applied to any pair of alternatives and any preference profile. Given this restriction, let us turn to the comparison of MR with BR.

3.1 Proposed measures of erosion of the two principles

Consider, first, the cost of applying MR. It involves the possible erosion of the principle of allowing expression of preference intensity, which implies that the minority should win when its preference intensity exceeds that of the majority (Principle 1). This principle is eroded when, given a specific preference profile, MR and BR yield different social preferences between two alternatives x and y.Footnote 8

Let us present a natural and intuitive measurement of erosion.

Suppose that, given a specific preference profile, alternative x is preferred over y under MR and that alternative y is preferred over x under BR because the score of y, B(y), is higher than the score of x, B(x). The positive difference between these scores, (B(y)—B(x)), is referred to as the unrealized advantage of the preference intensity of the minority. The first measure is the share of the minority’s preference intensity that erodes due to the use of MR that yields the social preference of alternative x, despite its inferiority to y according to BR. This inferior alternative should not be preferred according to Principle 1. The proposed measure of erosion of Principle 1, for a particular preference profile and two given alternatives x and y, given that MR and BR result in different social preferences is the unrealized advantage of the preference intensity of the minority relative to its total intensity.

$${\mathbf{Measure}} \, {\mathbf{1}}: \, \left( {{\text{B}}\left( {\text{y}} \right){-\!\!-}{\text{B}}\left( {\text{x}} \right)} \right)/{\text{B}}(y).$$

Given a specific preference profile and pair of alternatives, Measure 1 represents the relative erosion of the minority’s ability to effectively express its preference intensity and ensure the superiority that its preferred alternative would have enjoyed had BR been used as the aggregation rule.

To sum up, the proposed measure of the cost of MR in a particular situation is the rate of reduction in the more intense minority’s preference of the socially inferior alternative in terms of preference intensity relative to the socially superior alternative under MR.

Consider now the second majority principle: it requires that x is socially preferred over y by BR when a majority prefers x over y. The cost of applying BR involves the possible erosion of this principle, Principle 2, and again, this erosion is realized when, given a specific preference profile R, MR and BR yield different social preferences between two alternatives x and y. That is, B(x) >B(y), however a majority prefers y to x; the number of individuals who prefer y to x, N(y,x), is larger than N(x,y). Analogously to the measurement of erosion of Principle 1, for a specific preference profile and pair of alternatives x and y, the measurement of erosion of Principle 2 takes the form:

$${\mathbf{Measure}} \, {\mathbf{2}} = \, \left( {{\text{N}}\left( {y,x} \right) - {\text{N}}\left( {x,y} \right)} \right)/{\text{N}}\left( {y,x} \right)$$

The proposed measure of the cost of BR is the disregarded advantage of the majority obtained by alternative y (which is superior to alternative x under MR), divided by the actual majority of y,Footnote 9 In other words, Measure 2 represents the erosion of the majority’s ability to effectively express its preference and ensure the superiority that its preferred alternative would have enjoyed had MR been used as the aggregation rule.

Note that the lower bound (0) and the upper bound (1) of the two measures represent no violation and maximal violation of the relevant principle.

Given a preference profile and a pair of alternatives, a comparison of the costs of applying MR and BR is the basis for selecting one aggregation rule over the other. The preferred rule is the less costly one.Footnote 10

3.2 Illustration

Example

Suppose N = {1,2,3}, X = { w,x,y,z} and the preference profile is R = (\({R}_{1},{R}_{2},{R}_{3}\)),

$${R}_{1}:y{R}_{1}z{R}_{1}w{R}_{1}x$$
$${R}_{2}:x{R}_{2}y{R}_{2}z{R}_{2}w$$
$${R}_{3}:z{R}_{3}y{R}_{3}w{R}_{31}x$$

Let us determine simple majority relation and Borda social preference relation corresponding to profile R:

It can be verified that

$$y\;R^{maj} \;z\;R^{maj\;} w\;R^{maj} \;x$$

Let B(s) denote the Borda score of alternative s. In the above example, B(w) = 2, B(x) = 3, B(z) = 6 and B(y) = 7. Hence,

$$y {R}^{B}z {R}^{B}x R{}^{B}w$$

Case 1: Consider the comparison of y and z. In such a case,

$${\text{C}}\left( {{\text{MR}},{\mathbf{R}}} \right)\, = \,{\text{C}}\left( {{\text{BR}},{\mathbf{R}}} \right)\, = \,0$$

Thus, either rule may be used.

Case 2: Consider the comparison of w and x. In such a case,

$${\text{C}}\left( {{\text{MR}},{\mathbf{R}}} \right)\, = \,\left( {{\text{B}}\left( x \right) - {\text{B}}\left( w \right)} \right)/{\text{B}}\left( x \right)\, = \,\left( {{3}{-}{2}} \right)/{3}\, = \,{1}/{3}$$

C(BR,R)=(N(w,x)) - N(x,w) / N(w,x) =(2-1)/2=1/2. Since C(MR, R) = 1/3 < C(BR, R) = 1/2, the justified aggregation rule for the comparison of x and w is MR.

3.3 Possible alternative measures

Finally note that one could think about alternative measures, such as the absolute deviation between the scores or the support of the alternatives preferred by BR and MR, B(y)-B(x) or N(y)-N(x) or the relative score or support of these alternatives, B(y)/B(x) or N(y)/N(x). The former alternatives for Measure 1 and Measure 2 would not allow a meaningful comparison between them. The obvious reason is that such absolute deviations cannot be compared because they apply different notions; one relates to scores and the other to majority and minority support. The latter alternative measures are not vulnerable to such criticism, that is, they can be compared. However, they do not capture the significant aspect implied by the proposed definitions of the two measures, namely, the relative disregarded or unexploited advantage of the minority and the majority when the social preference differs from their preference.

Nevertheless, the measures that involve the ratios B(y)/B(x), P(y)/P(x) and N(y)/N(x) are meaningful. In the case of B(y)/B(x) and P(y)/P(x), this formulation shifts the emphasis from the proportion of the eroded scores of the minority to assessing the severity of relative social injustice against the minority. Its preference intensity is not respected, despite being B(y)/B(x) or P(y)/P(x) times stronger than that of the majority. In the case of N(y)/N(x), this formulation shifts the emphasis from the proportion of the eroded advantage of the majority to assessing the severity of relative social injustice against the majority. Its advantage is not respected, despite being N(y)/N(x) times stronger than that of the minority. We ran simulations verifying that our findings are robust to the application of these alternative metrics to Measure 1 and Measure 2.

The current formulation assigns equal importance to all the compared pairs of alternatives when applying scoring rules. The erosion is calculated as the proportion of the scores eroded out of the scores assigned to the rejected option. This form of "local" normalization doesn't create a common global scale for all the compared pairs, but rather each pair is considered in isolation and the erosion strength for all pairs is given equal weight in the overall expectation calculation. This has its merit as it evaluates each aggregation rule over the choice between any two pairs of alternatives, irrespective of their popularity. We leave to future research the option of giving more importance to the erosion of choices with high scores relative to those with low scores. Several potential avenues for achieving this include:

  1. 1.

    Focusing solely on cases where each rule results in a different winner and calculating erosion exclusively for the pair of winners.

  2. 2.

    Investigating the implementation of a global normalization factor. In the majority rule, the erosion measure naturally interprets what percentage of voters' preferences were eroded when normalized by the total number of voters. We can potentially develop a similar measure for scoring rules by consistently normalizing erosion based on the average score that voters can assign to each option. This would establish a unified scale for measuring erosion across all pairs of alternatives.

4 Main findings

In our study, the definite preference of a rule is based on its superiority in terms of the two criteria: the likelihood of being less costly and the difference between the expected costs of the two rules. According to the two possible criteria, the conclusion may hinge on the number of voters n and the number of alternatives m. The question is how n and m affect the desirability of MR and BR under the veil of ignorance in the constitutional stage regarding the actual preference profile and the compared alternatives. Before turning to the identification of the preferred rule according to the two possible criteria, we describe the particular statistical model used to generate the preference profiles of the voters.

4.1 The simulation

Preference profiles can be generated by several probabilistic models.Footnote 11 We base our results on the Cubic and the spatial Euclidean Box models. These approaches have the common feature of generating preferences to reflect or approximate data samples in real elections.Footnote 12 The results presented below are based on the Cubic model (the results for the Euclidean Box model are essentially the same and are presented in Appendix A). In the former model, for a particular number of voters, n, and m alternatives (the candidates’ positions), we generate a matrix of size n*m where each number is sampled from the uniform distribution over the \(\left[\mathrm{0,1}\right]\) segment. This matrix represents the utility for each voter for each candidate and it yields the preference profile of the voters. In the Euclidean Box model, for a particular number of voters, n, and m alternatives, we independently and uniformly sample the alternatives and the positions of the voters from the Box on the assumption that a voter’s utility for a candidate is a decreasing function of the distance between the candidate’s position and the voter’s position.Footnote 13 For a particular case of m alternatives and a preference profile R, the percentage of pairs of alternatives that result in different preferences by MR and BR in which MR is the superior rule (the erosion of principle 1 by MR, Measure 1, is smaller than the erosion of principle 2 by BR, Measure 2) is equal to:

$$r/t$$

where

$$r = \left|\left\{ \left(x,y\right):N\left(x,y\right)>N\left(y,x\right) , B\left(y\right)>B\left(x\right) and \mu \left(x,y\right)< \nu \left(y,x\right)\right\}\right|$$
$$\begin{array}{*{20}l} {B\left( x \right)}\hfill & {{\text{= The\;Borda\;count\;of\;}}x} \hfill \\ {N\left( {x,y} \right)} \hfill & { = \left| {\{ i:xR_{i} y\} } \right|} \hfill \\ {\mu \left( {x,y} \right)} \hfill & { = \frac{{B\left( y \right) - B\left( x \right)}}{{B\left( y \right)}}} \hfill \\ {\nu \left( {x,y} \right)} \hfill & { = \frac{{N\left( {y,x} \right) - N\left( {x,y} \right)}}{{N\left( {y,x} \right)}}} \hfill \\ \end{array}$$
$$t=\left|\left\{ \left(x,y\right):N\left(x,y\right)>N\left(y,x\right)\,and\, B\left(y\right)>B\left(x\right)\right\}\right|$$

Note that t is the number of pairs of alternatives that result in different preferences by MR and BR \(.\) Profiles in which BR has equal scores for alternatives x and y are discarded and not considered to create erosion. We decided to only count cases where there is a clear winner and a clear dispute between the results. The same policy is applied later for PR.

For a particular case of m alternatives and a preference profile R, the expected costs of MR and BR, the expected erosion of the fundamental principles by these rules applying measures 1 and 2, over all possible pairs of the alternatives that result in different preference by MR and BR are:

$$\begin{array}{cc}& \frac{1}{{\text{t}}}\sum_{\begin{array}{c}x,y:N\left(x,y\right)>N\left(y,x\right)\\ B\left(y\right)>B\left(x\right)\end{array}}\mu \left(x,y\right) \, \mathrm{and }\,\frac{1}{{\text{t}}}\sum_{\begin{array}{c}x,y:N\left(x,y\right)>N\left(y,x\right)\\ B\left(y\right)>B\left(x\right)\end{array}}\nu \left(x,y\right)\\ & \end{array}$$

In the simulation, we computed the percentage of pairs of alternatives in which MR is superior to BR and the means of the difference of the expected costs of MR and BR in the 100,000 generated cases of the n voters’ preference profiles R. Analogous computations were made for comparison between MR and PR.Footnote 14 For each preference profile, we have compared all possible pairs of the alternatives.Footnote 15

Before delving into the simulated results, let us present a theoretical subsection that contains analytical results that prove the superiority of MR to BR and the superiority of PR to MR in terms of the above two criteria for the case of three alternatives and n voters, n = 3,5,7,9,11,13,15,21,31,41,51. These analytical results make it possible to better infer the results of the simulation that are almost identical for m = 3.

4.2 Analytical results for the case of three alternatives

When m = 3, there are 6 possible strict individual preference relations (rankings) denoted by 1,2, …,6. The possible preference profiles specify the selection of these rankings by the n voters. Since the Cubic model assumes that, for every voter, all preference relations have equal probability, we can disregard the identity of the voters as the voting is anonymous and focus on distinct preference profiles. Each distinct profile is given by (k1,k2,….k6), where ki is the number of voters selecting preference relation i, i = 1, …,6, and k1 + k2 + … + k6 = n. The expected occurrence of such a distinct profile can be obtained by using the multinomial theorem. This number, the multinomial coefficient, which is denoted by (k1, k2, k3,…., k6)! is equal to n!/k1!*k2!*….*k6!. For each distinct profile, we calculate Measure 1 and Measure 2 for all the pairs of alternatives that result in different preferences by MR and BR, and then compute r/\(t\). Assigning to each percentage of such a distinct profile a weight that is equal to its multinomial coefficient and summing up all the weighted percentages corresponding to these distinct profiles, we divide the sum of the weighted percentages by the total number of these distinct profiles and obtain the expected percentage of pairs of alternatives in which MR is superior to BR in all the distinct profiles that result in different preference by MR and BR.

Given the multinomial coefficients of the possible distinct preference profiles, and using the above formulas of the expected costs of MR and BR over all possible pairs of the alternatives for a particular distinct profile, we can compute the expected difference of these costs in all possible distinct profiles (k1,k2,….k6), such that k1 + k2.. + k6 = n.

Calculating the expected percentage of pairs of alternatives in which MR is superior to BR and calculating the difference between the expected costs of MR and BR, for m = 3 and n = 3,5,7,9,11,13,15,21,31,41,51, we obtain the following results:

For three alternatives then, MR is unambiguously superior to BR because, for any n, it is more likely to be superior in pairwise comparisons of the alternatives and its expected cost is smaller (Tables 1, 2).

Table 1 The expected percentage of pairs of alternatives in all the distinct profiles in which BR is superior to MR for 3 candidates
Table 2 Difference between the expected costs of BR and MR over all the pairs that result in erosion, which indicates the advantage of MR for 3 candidates

Applying the multinomial coefficients and the analogous formulas for the comparison between MR and PR, we present in Tables 3 and 4 the expected percentage of pairs of alternatives in which PR is superior to MR and the difference between the expected costs of PR and MR.

Table 3 The expected percentage of pairs of alternatives in all the distinct profiles in which PR is superior to MR for 3 candidates
Table 4 Difference between the expected costs of PR and MR over all the pairs that result in erosion, which indicates the advantage of PR for 3 candidates

For three alternatives then, PR is unambiguously superior to MR because, for any n, it is more likely to be superior in pairwise comparisons of the alternatives and its expected cost is smaller.

4.3 General simulation results: the superiority of MR over BRFootnote 16

The simulation results establish the superiority of MR in terms of the above two criteria when the two aggregation rules yield different outcomes. This is true for any combination of the number of alternatives m and the number of voters n when the number of voters is odd, n = 3,5,7,9,11,13,15,21,31,41,51,1001,10001 and m = 3,4,…,10.Footnote 17

Table 5 presents the results that illustrate the inferiority of BR in terms of the percentage of cases in which it is superior to MR when the two aggregation rules yield different outcomes. This percentage can be considered as an estimate of the a-priori likelihood that BR is the superior aggregation rule when two alternatives are compared, taking into account all preference profiles that result in the erosion of Principles 1 and 2, that is, MR and BR yield different preferences between the compared alternatives. For any n and m, the a-priori likelihood of MR being superior to BR is larger than 50%. Note that our findings are consequential because erosion, i.e., divergent outcomes by MR and BR is a likely possibility. More specifically, for m > 3, n > 3, it is obtained in at least 28% of the preference profiles and in 6% of the pairwise comparisons. The likelihood increases with both m and n and for m = 10, n = 10,001, it is obtained in 100% of the preference profiles and in 17% of the pairwise comparisons. The Tables in Appendix B present these erosion likelihoods for all the combinations of m and n.

Table 5 Percentage of pairwise comparisons in which BR is superior to MR 

For a small number of alternatives and a large number of voters, the superiority of MR tends to be less significant. For example, for three alternatives and 51 as well as 100,001 voters, the likelihood of MR being the preferred rule is 1−0.41 = 0.59. For 10 alternatives and 10,001 voters, this likelihood increases to 0.68. The results suggest that when the electorate is sufficiently large, the extent of the superiority of MR converges to a limit.

Table 6 confirms the superiority of MR by presenting the mean of the difference in the costs of BR and MR, which is always positive; for any combination of m and n, the expected difference is positive. That is, the expected erosion of Principle 1 by MR is always smaller than the expected erosion of Principle 2 by BR, taking into account all possible comparisons of the generated alternatives and preference profiles under any combination of m and n. The Table also presents in brackets two more statistics: the minimum and maximum difference between the costs of BR and MR .

Table 6 The mean of the differences between the expected costs of BR and MR, which indicates the advantage of MR
Table 7 Percentage of pairwise comparisons in which PR is superior to MR
Table 8 The mean of the differences between the expected costs of PR and MR, which indicates the advantage of PR

This difference referred to as the advantage of MR, ranges from the lowest advantage of 0.001 when m = 3 and n = 10,001 to the largest advantage of 0.34 when m = 10 and n = 3. These results confirm all the findings adduced based on the first criterion. However, it seems that for sufficiently large numbers of voters, the difference between the mean costs of BR and MR although still positive decreases with n and becomes negligible. In the alternative Euclidean Box model, a salient observation emerges from our simulation results: the discrepancy in the means of the expected costs of BR and MR amplifies notably for expansive electorates, as delineated in the second Table in Appendix A. To elucidate, for m = 5 and n = 10,001, the anticipated advantage of MR escalates to 2% from the initial value of 0.2% in the Cubic distribution simulation. This phenomenon underscores the robustness of our conclusions, particularly when grounded in more realistic assumptions about preference profiles, as expounded upon in footnote 14. A worthwhile task for future research is the analytical study of the limit behavior of the superiority of MR over BR when the number of voters is sufficiently large.

The Table clarifies that the expected cost of MR, the expected violation of Principle 1, is smaller than the expected cost of BR, the expected violation of Principle 2, for any m and n. This implies that in the constitutional stage, where the number of alternatives and their identity, the number of voters, and their preferences are all unknown, there is a very good reason to apply MR rather than BR if one focuses on the comparison between their fundamental weaknesses, namely, violation of one of the two fundamental democratic principles: majoritarianism and suitable recognition of preference intensity. Recall that the comparison between MR and BR takes into account only the situations where these rules result in different social preferences.

The underlying reason for MR's superiority can be elucidated by delving into its inherent expressive capability. Under MR, the comparison between two alternatives is based upon the electorate's allocation of n uniform scores to these alternatives. Contrarily, under the Borda Rule (BR), this juxtaposition is predicated upon the n voters' potential allocation of as many as m disparate scores to these alternatives. MR, in comparison to BR, possesses a constrained proficiency in encapsulating and delineating voter preferences. It is plausible to conjecture that this more restricted expressive capability is advantageous because it contributes to the lower erosion of principle 1 (Measure 1) relative to the higher magnitude of degradation of majoritarianism by BR (Measure 2).

Two unequivocal findings come to light regarding the effect of m and n on the advantage of MR. For a given number of voters n, the advantage of MR increases with the number of alternatives m. For a given number of alternatives m, the advantage of MR decreases with the number of voters n.

Note that the maximum difference between the costs of BR and MR is always positive. But the minimum difference can also be positive, which indicates that in all pairwise comparisons between the alternatives MR is the superior rule. This is the case when m = 3, n = 5, and when m = 4, n = 3. In the remaining combinations of m and n, the minimum difference is negative, indicating specific cases where BR is the preferred rule and the maximum difference is positive, indicating that MR is the preferred rule.

Finally, for m = 3, we confirmed that the simulation results are very close to the analytical ones. We have increased the reliability of the simulation results by generating 1,000,000 preference profiles instead of 100,000 and obtained that, for the combinations of 3 alternatives and n voters, the average deviation of the simulation results from the analytical results,  in terms of the first criterion,  was 0.27%. The average deviation of the simulation results from the analytical results,  in terms of the second criterion,  was only 0.16%.

4.4 The superiority of PR over MR

The most common and best-known scoring rule is the plurality rule, PR, which is an extreme weakly monotonic scoring rule, that is, only the score assigned to the best alternative exceeds that of the second-best and all other alternatives, so significance is assigned only to every voter’s most preferred alternative. Applying the same methodology for comparing MR and PR based on their costs, the extent of erosion of the principle they violate, PR emerges unambiguously as a superior aggregation rule when n ≤ 10,001.

Our findings establish the superiority of PR in terms of its higher likelihood to be superior (Table 7) and its lower mean cost (Table 8), all simulation outcomes taken into account. That is, for any combination of a number of alternatives m and a number of voters n, PR outperforms MR in terms of the two criteria we have applied in the previous section. The negative difference between the means of the expected costs of PR and MR represents the advantage of PR. This advantage ranges from − 0.57 which is the largest advantage, when m = 10 and n = 7, to − 0.002 which is the smallest advantage, when m = 3 and n = 10,001.

Table 7 illustrates the superiority of PR in terms of the expected a priori likelihood of being superior in pairwise comparisons of alternatives when the two aggregation rules yield different outcomes. This likelihood always exceeds 0.56. This is of significance because the likelihood of erosion now is considerably higher than that in the preceding section. More specifically, for m > 3 and n > 3, it is obtained in at least 55% of the preference profiles and in 13% of the pairwise comparisons. The likelihood increases with both m and n and for m = 10, n = 10,001, it is obtained in 100% of the preference profiles and in 35% of the pairwise comparisons. The last two tables in Appendix B present these erosion likelihoods for all the combinations of m and n.

Table 8 yields two unequivocal findings regarding the effect of m and n on the advantage of PR. For a given number of voters n, n > 3, the advantage of PR increases with the number of alternatives m. For a given number of alternatives m, the advantage of PR decreases with the number of voters n.

As in the comparison between MR and BR, we conjecture that the superiority of PR is partly due to its limited expressive capability. Under MR, the comparison between two alternatives is contingent upon the electorate's distribution of n uniform scores to these alternatives. Contrarily, under PR, the comparison typically hinges on voters allocating fewer than n such uniform scores. That is, PR allows electorates to only indicate their top-preferred choice, thereby limiting the expressiveness of their preferences. This characteristic endows PR with a distinct advantage when one is evaluating the potential severity of degrading majoritarianism, as outlined in Measure 2, relative to the erosion of principle 1 by MR as articulated in Measure 1.

Although our results establish that PR is superior to MR which is in turn superior to BR, we cannot naively conclude that PR is superior to BR, not in general or when MR is used as the yardstick. The reason is the following. In the comparisons between MR and BR or PR, the majoritarian principle is fully respected by MR, and the consideration of preference intensity is respected, albeit in a particular form by the prominent scoring rules on which we focus. Comparison between BR and PR cannot be carried out by applying the two measures we have proposed because both of these scoring rules respect a particular form of preference intensities yet violate majoritarianism. We could compare, however, PR and BR based on an alternative single criterion: the extent of their consistency with majoritarianism. As is well known, in terms of this criterion BR is superior to PR,Footnote 18 but of course, this result does not imply transitivity as the framework used in our study is different from that applied in such a comparison between the two scoring rules.

The results provide a novel justification for the widely used PR beyond its practical advantages. There are certainly other scoring rules that are superior to MR. The identification of this set of scoring rules is a task worth pursuing in future research.

5 Conclusion

The root cause of the fervent discussion regarding the use of the simple-majority rule versus a scoring rule stems from the interplay between two critical principles under our focus: the ability for voters to express preference intensity, thus offering some defense for the minority (Principle 1), and safeguarding the majority by honoring majoritarianism (Principle 2). The Borda method of counts is a monotonic scoring rule. The plurality rule is weakly monotonic. These rules have drawn the most interest as two scoring rules. Since these principles cannot be concurrently upheld, adherence to Principle 1 (or Principle 2) inherently results in the infringement of Principle 2 (or Principle 1). In practical terms, if MR is applied, Principle 1 is compromised, while the use of BR disrupts Principle 2. These violations can be construed as the costs incurred by employing these widely examined aggregation rules. In applying a sensible measure of these costs, the predicted cost of the two rules in instances of outcome divergence, and leveraging an intuitive concept of relative erosion of Principles 1 and 2, our research offers a key contribution by asserting that PR outperforms MR which, in turn, is superior to BR based on the proposed metrics of expected erosion of two fundamental democratic principles. The primary findings indicate that when a rule has a more restricted capacity to express preferences, it benefits from a reduced undermining of the principle it breaches. Specifically, PR's more constrained nature compared to MR makes it superior. Similarly, MR's relative limitation in expressing preferences compared to BR gives it an advantage. Our study not only rekindles the historic Borda-Condorcet debate by adding novel support to MR but also revives the debate about the merits of PR challenging the minimal endorsement PR garnered from 22 esteemed voting rule experts who evaluated 18 well-established voting rules, as articulated by Laslier (2012).

Our conclusions rest on the presumption that both principles are assigned equal significance.Footnote 19 They are derived from simulations that utilize the Cubic model, along with a more feasible, realistic probabilistic model to generate alternatives and voter preferences, the Box model. These outcomes augment the comprehensive discourse on the advantages and drawbacks of the three most rigorously scrutinized aggregation rules, offering an innovative justification for favoring PR over MR and MR over BR during the constitutional phase, where the veil of ignorance reigns. They cast fresh insights on the attractiveness of PR and BR, considering MR as the alternative to these scoring rules. The first finding rationalizes the actual revealed superiority of PR over MR. With the second finding, it becomes possible to decide between the approaches of Condorcet and Borda and to explain the low prevalence of BR relative to MR