1 Introduction

Ever since Hotelling (1929) and Downs (1957), game-theoretic models of electoral competition tend to focus on the ideological position on which candidates run. However, in practice, the electorate may be interested not only in the ideological position of a candidate but also in her ability to get things done. For example, suppose that the median voter has the opinion that economic reforms are sorely needed. Yet, powerful vested interests oppose such reforms. The median voter may then very well prefer a candidate with a policy platform that differs somewhat from his own but who is able to carry through substantial reforms, over a candidate with exactly the same preferences who fails to get anything done once elected.

In this paper, we develop a game-theoretic model to study how electoral competition is affected when voters care about the ability of the candidate elected. In our model, politicians run in an election. Each has an ability that is private information. More able candidates are more likely to achieve something once in office. For example, a highly competent politician may try to reduce bureaucracy by half, whereas a less able politician might find it challenging to even reduce it by 10%. During the election campaign, candidates make binding commitments to the plans they intend to implement if elected. Election platforms then serve as a signal of the true ability of a candidate. More able politicians will run on more ambitious platforms. From their platforms, voters can infer the ability of the politician.Footnote 1 Rationally, voters choose the most ambitious candidate.Footnote 2

We assume that more ambitious plans yield a higher pay-off to the electorate when successful. However, while more ambitious plans are prone to failure, more able politicians are likelier to pull off a plan. Of course, the assumption that plans are either completely successful or not at all, is a simplifying one. Still, we feel that in many cases, it is not too far from the truth.Footnote 3

In the political economy literature, it is routinely assumed that politicians simply choose a policy that they can implement when elected. This is where our model fundamentally differs. In our world, a politician that tries to implement a policy has no guarantee that she will be successful in doing so: this will depend on her ability.Footnote 4 During the election she merely makes a commitment that she will try to implement her plan. Her effort to do so is observable.

The equilibrium of our model has politicians making more ambitious promises than they would make without the need to signal their ability. This overambition effect is bad news for the electorate: it would be better off if a politician tried to implement a plan that she can actually handle, rather than a plan that is overambitious. But election promises also entail good news: by comparing the different plans, the electorate is at least able to pick the best politician. This is a selection effect. We show that the overambition effect may dominate the selection effect. As a result, the electorate is then better off picking a random candidate and letting her implement the plan that she thinks best, rather than having an election to pick the best candidate who will implement an overambitious plan. This is more likely if the distribution of abilities is skewed towards high values, if the number of candidates is high, if there are primary elections, or if there are high private benefits from office.

We are not the first to study how election campaigns affect electoral competition. In Banks (1990) and Haan (2004) election promises serve as signal of a politician’s true position on the Hotelling line. In Harrington (1993) and Aragonès et al. (2007) chances of reelection are affected by the extent to which politicians keep election promises. Yet, in these papers private information concerns policy preferences rather than ability.

In Rogoff and Sibert (1988) politicians also have private information concerning their ability. More competent politicians require less revenue to deliver the same amount of public goods. The incumbent uses tax policy as a signal. This gives rise to political business cycles. In Rogoff (1990) the incumbent can use both taxes and government spending as a signal. However, these papers do not study electoral competition between multiple candidates that can signal their ability.

One implication of our model is that having more candidates in an election may have an adverse effect on social welfare. In ((Lizzeri and Persico, 2005)), as the number of parties increases, each party focuses its electoral promises on a narrower constituency, so special interest politics replace more efficient policies with more diffuse benefits.

In our model, election promises are enforceable commitments. This notion is supported by Pétry and Collette (2009). Their meta-analysis of 21 studies on election promises reveals that approximately 67% were fulfilled, indicating that elected candidates make a substantial effort to deliver on their promises. Analysing data from 69 elections in 14 countries, Matthieß (2020) shows that a government party that fulfils more election pledges is more likely to prevent electoral losses. She proposes two possible mechanisms. First, voters may want to reward politicians that have kept their promises. Second, they may regard this as a sign of the politician’s competence. The latter also drives the result in our model.

That politicians may not keep their election promises is also the focus of some laboratory studies, that also study the difference between elected officials and randomly selected ones. In Geng et al. (2011) elections do not lead to higher promises regarding the amount of funds to be transferred to voters, than randomly appointing a politician. In Corazzini et al. (2014) candidates in an election do promise significantly more than those randomly appointed. Walkowitz and Weiss (2017) find that elected officials shirk less on fulfilling their promises, but find no difference in promises made. Born et al. (2018) find that promises are more likely to be kept if reelection is possible and if the politician was elected rather than randomly appointed. In Feltovich and Giovannoni (2015) officials with lower transfers do worse in subsequent elections. In Fehrler et al. (2020), politicians with a lower aversion to lying are more likely to win their party’s primary, as they are willing to invest more. Hence, those ending up in politics are also those less likely to keep their promises.

The remainder of this paper is structured as follows. Section 2 gives the set-up of the model. In Sect. 3, we analyze a simple example with 2 candidates and a particular distribution of abilities. The electorate is then always better of picking a random candidate, rather than having an election in which candidates make costly promises. In Sect. 4, we study a more general set-up, and derive the exact conditions under which the electorate prefers a random candidate. Section 5 studies a number of extensions. In Sect. 5.1 candidates obtain private benefits of being in office, while in Sect. 5.2, candidates compete in primary elections before being put forward by political parties in the general election. Both scenarios only strengthen our main result, in the sense that the electorate is more likely to want to pick a random candidate. Section 5.3 shows how our model can allow for the choice of ideological position. Section 6 concludes.

2 Set-up of the Model

Consider an election in which n candidates participate. For ease of exposition, we assume that voters do not care about the ideological disposition of a candidate. This is just a simplifying assumption: it is straightforward to extend the model to one in which the choice of platform does play a role, as we will do in Sect. 5.3. For now, we assume that voters are only interested in a candidate’s ability to ‘get the job done’. In the context of our model, ‘the job’ refers to some project that needs to be implemented. For example, this may be a country in which bureaucracy has gone out of control. Alternatively, corruption may have become a problem. In either case, it is obvious that something has to be done, yet politicians differ in their ability to successfully tackle the problem.

The ability of politician i is denoted \(\theta _{i}\). This is private information. It is common knowledge that \(\theta _{i}\) is drawn from a continuous cumulative probability distribution F on [0, 1]. For simplicity, we will assume that \(F(\theta )=\theta ^{\alpha }\), with \(\alpha\) some strictly positive parameter. This specification allows for ample flexibility, while still being able to yield closed-form solutions. In the election campaign, each candidate chooses a platform. This is some policy x that the politician promises to implement once elected, with \(x\ge 0\). Crucially, we assume that politicians can commit to policy x. Thus, if a politician has promised to try to implement a policy x during the campaign, she has no other choice but to do so if elected. One justification for this assumption is that a politician’s reputation will suffer greatly if she makes an election promise, but does not even make an effort to try to implement that promise. In any case, the purpose of this paper is precisely to study how the possibility of such binding commitments affect the electoral process.

Some policies are more ambitious than others. For example, one politician may run on a platform to root out corruption entirely, while another may promise to cut it in half. Of course, politicians also have to come up with detailed plans as to how they think to achieve these objectives. If such a plan is implemented, there is a probability that it will fail. Naturally, a more able politician will be more likely to pull off an ambitious plan than a less able one.

We therefore make the following assumptions. Suppose that a politician with ability \(\theta _{i}\) tries to implement some policy x. If she is successful, the payoff to the electorate is x. A higher x thus corresponds to a more ambitious plan. If she is unsuccessful, we are back to the status quo, which has payoff 0. We assume that the probability that the politician will be successful equals \(\theta _{i}-x.\)Footnote 5 This captures that a more able politician (i.e. one with a higher \(\theta\)) is more likely to pull off any project than a less able one. The expected payoff to the electorate of a project x implemented by a politician with ability \(\theta _{i}\) then is \(W(\theta _i,x)=(\theta _{i}-x)x\): the probability that the politician is successful multiplied by their payoff if that occurs.

We assume that the utility of an elected politician is proportional to how successful she turns out to be. There can be many reasons for this. A politician that delivers a higher payoff to her electorate will be more likely to be reelected, to be remembered as an outstanding politician, or otherwise to obtain future benefits. Thus, a politician that would run unopposed in an election, would set x to maximize \(U_{i}(\theta _i,x)=\gamma \left( \theta _{i}-x\right) x,\) with \(\gamma\) some strictly positive parameter. Her first-best choice is to propose a project \(\hat{x}_{i}=\theta _{i}/2,\) a choice that is also first-best from the point of view of the electorate.

A politician that does not run unopposed, however, will take into account that the policy x that she proposes in the election campaign will serve as a signal of her true ability. A politician thus faces a trade-off: if she proposes a plan that is more ambitious than her first-best, she will be more likely to get elected, as the electorate will expect more ambitious plans to come from more able politicians. However, if she does get elected, her payoff will be lower as her plan is over-ambitious. This trade-off is very much like the trade-off that a bidder in an first-price sealed bid auction faces (see e.g. Krishna, 2010: a higher bid implies a higher probability of winning the auction, but also a lower payoff upon winning. In our set-up, proposing a higher x implies a higher probability of winning the election (as one’s x is more likely to be the highest among all candidates), but implies a lower pay-off upon winning (the highest possible payoff would be secured by implementing \(\hat{x}_i\) as defined above, any higher x would yield a lower payoff).

Our model builds on the existing literature studying signaling in elections, such as Rogoff’s (1990) classic paper. We follow Rogoff (1990) by developing a game-theoretic model of electoral competition in which candidates can send a costly signal regarding their ability, about which they are privately informed, to the electorate by making particular policy choices. Our model differs from Rogoff’s (1990) in two important ways: (1) instead of a single politician having private information about their ability, in our model, all candidates are privately informed about their ability; (2) rather than having two politician types (high ability, low ability), our setting contains a continuum of politician types.

In the next section, we study a simple example with 2 politicians and a specific distribution of possible abilities. We will show that, in this context, picking a politician at random yields higher welfare than having an election in which both candidates make commitments during the election campaign. In Sect. 4, we study a more general set-up.

3 A Simple Example

Consider the set-up described in the previous section, with two politicians. Assume that \(F(\theta )=\theta ^{3}\). We look for the equilibrium platform function \(x(\theta ).\) As noted, the full information outcome has \(\hat{x}=\theta /2.\) To derive a symmetric perfect Bayesian equilibrium, we use standard techniques from mechanism design theory.Footnote 6 We start by assuming an equilibrium exists in which politicians with a higher ability \(\theta\) commit to a strictly higher policy x and in which the electorate always picks the politician committing to the higher policy. We then derive the utility that a type \(\theta _{T}\) obtains if she behaves as if she were a type \(\theta _{A}\), and the other politician behaves according to the equilibrium. For an equilibrium we then need that the type \(\theta _{T}\) finds it optimal to behave as herself, so she chooses to optimally set \(\theta _{A}=\theta _{T}\).

First of all let us assume that, when confronted with two election platforms \(x_{1}\) and \(x_{2}\), it is always a best response for the electorate to choose the candidate with the highest x.Footnote 7 Thus, just as in an auction the bidder with the highest bid wins the auction, we have in our model that the candidate with the ‘highest platform’ wins the election.

Suppose that in a tentative equilibrium a politician with true type \(\theta _{T}\) behaves as if she had type \(\theta _{A}\). Given that the other candidate sticks to the equilibrium strategy, this candidate would then win the election if the other candidate had an ability lower than \(\theta _{A}\). This occurs with probability \(F(\theta _{A})\). If it occurs, this candidate’s utility will be \(\gamma \left( \theta _{T}-x(\theta _{A})\right) x(\theta _{A})\). Defining \(v(\theta _{T},\theta _{A})\) as the expected utility of a type \(\theta _T\) that behaves as a type \(\theta _T\), we thus have

$$\begin{aligned} v(\theta _{T},\theta _{A})=F(\theta _{A})\gamma \left( \theta _{T}-x(\theta _{A})\right) x(\theta _{A}).\end{aligned}$$

With \(F(\theta )=\theta ^{3},\) we have

$$\begin{aligned} v(\theta _{T},\theta _{A})=\theta _{A}^{3}\gamma \left( \theta _{T}-x(\theta _{A})\right) x(\theta _{A}). \end{aligned}$$

Taking the derivative with respect to \(\theta _{A}\):

$$\begin{aligned} \frac{\partial v}{\partial \theta _{A}}=\theta _{A}^{3}\gamma \left( \theta _{T}-2x(\theta _{A})\right) x^{\prime }(\theta _{A})+3\gamma \theta _{A}^{2}\left( \theta _{T}-x(\theta _{A})\right) x(\theta _{A}). \end{aligned}$$

Equilibrium requires that the type \(\theta _T\) chooses to behave as her own type, i.e. that this first-order condition is solved for \(\theta _{A}=\theta _{T},\) so

$$\begin{aligned} \theta _{T}^{3}\gamma \left( \theta _{T}-2x(\theta _{T})\right) x^{\prime }(\theta _{T})+3\gamma \theta _{T}^{2}\left( \theta _{T}-x(\theta _{T})\right) x(\theta _{T})=0. \end{aligned}$$

It can be verified that this differential equation is solved by

$$\begin{aligned} x(\theta )=\frac{4}{5}\theta , \end{aligned}$$

using the boundary condition \(x(0)=0.\) Note that this implies that politicians are indeed overambitious in order to win the election: with complete information, each politician would simply run on her first best \(x=\theta /2.\)

We now compare the outcome of this model with election campaigns to a world in which politicians for some reason are not able to make commitments during the election. In such a world, voters do not obtain any information about the ability of candidates, and can do no better than simply picking a politician at random. That politician, say she has type \(\theta ,\) will then implement \(x=\theta /2,\) which yields \(\theta ^{2}/4\) to the electorate. The equilibrium expected payoff to the electorate of picking a random candidate is

$$\begin{aligned} W_R^*=\int _{0}^{1}\frac{\theta ^{2}}{4}f\left( \theta \right) d\theta =\int _{0}^{1}\frac{3}{4}\theta ^{4}d\theta =\frac{3}{20}. \end{aligned}$$

In the world with election platforms, the electorate can simply pick the best politician. The probability distribution of her quality is the expected value of the highest of two draws from distribution F. Denote the cumulative distribution as \(G(\theta )\equiv F\left( \theta \right) ^{2}.\) In this case, we thus have \(G(\theta )=\theta ^{6},\) corresponding to density \(g(\theta )=6\theta ^{5}\). As we showed above, the winning politician will implement \(x=\frac{4}{ 5}\theta ,\) which yields a payoff to the electorate that equals \(\left( \theta -\frac{4}{5}\theta \right) \left( \frac{4}{5}\theta \right) =\frac{4}{ 25}\theta ^{2}.\) The equilibrium expected payoff from the election process then is

$$\begin{aligned} W_E^*=\int _{0}^{1}\frac{4\theta ^{2}}{25}g(\theta )d\theta =\int _{0}^{1}\frac{24 }{25}\theta ^{7}d\theta =\frac{3}{25}. \end{aligned}$$

which is lower than \(W_R^*\). Intuitively, we have a trade-off. In a world with election platforms, the electorate is able to pick the best politician, which is good news. We will call this the selection effect. However, the winning politician will always implement a policy that is overambitious, which is bad news. We will call this the overambition effect. In the example we consider here, the bad news dominates: the overambition effect is stronger than the selection effect. In the next section, we give a more general analysis and derive results under which the electorate would be better off not having an election.

4 The General Model

In this section, we construct the optimal selection mechanism in our general model. We thus consider any power distribution of abilities \(F(\theta )=\theta ^{\alpha },\) with \(\alpha >0,\) and \(\theta \in\) [0, 1],  and n risk neutral politicians that compete to be elected. Figure 1 illustrates how a change in \(\alpha\) affects the cumulative density function from which abilities are drawn: \(\alpha =1\) implies a uniform distribution; as \(\alpha\) increases (decreases) the distribution of abilities is more skewed towards higher (lower) abilities.

Fig. 1
figure 1

Power Distribution of Abilities

We restrict our attention to symmetric perfect Bayesian equilibria in which politicians with a higher ability \(\theta\) commit to a strictly higher policy x and where the electorate prefers to pick the politician offering the highest policy x, consistent with their beliefs about the politicians’ abilities deduced from the politicians’ equilibrium strategy.Footnote 8

If a politician with true type \(\theta _{T}\) behaves as if she has type \(\theta _{A},\) her expected utility would equal

$$\begin{aligned} v(\theta _{T},\theta _{A})=\left( \theta _{A}\right) ^{\alpha \left( n-1\right) }\gamma \left( \theta _{T}-x(\theta _{A})\right) x(\theta _{A}). \end{aligned}$$

Taking the derivative wrt \(\theta _{A}\):

$$\begin{aligned} \frac{\partial v}{\partial \theta _{A}}=\left( \theta _{A}\right) ^{\alpha \left( n-1\right) }\gamma \left( \theta _{T}-2x(\theta _{A})\right) x^{\prime }(\theta _{A})+\alpha \left( n-1\right) \left( \theta _{A}\right) ^{\alpha \left( n-1\right) -1}\gamma \left( \theta _{T}-x(\theta _{A})\right) x(\theta _{A}). \end{aligned}$$

Equilibrium then requires \(\theta _{A}=\theta _{T}\), so that

$$\begin{aligned} \theta _{T}\left( \theta _{T}-2x(\theta _{T})\right) x^{\prime }(\theta _{T})+\alpha \left( n-1\right) \left( \theta _{T}-x(\theta _{T})\right) x(\theta _{T})=0. \end{aligned}$$

It can be easily verified that this differential equation is solved by

$$\begin{aligned} x(\theta )=\frac{1+\alpha \left( n-1\right) }{2+\alpha \left( n-1\right) } \theta , \end{aligned}$$
(1)

using the boundary condition \(x(0)=0.\)Footnote 9 Again, this implies that politicians are always overambitious in order to win the election, as we always have \(x(\theta )>x^{*}(\theta )=\theta /2\) if \(n\ge 2.\)

Expected welfare is given by

$$\begin{aligned} W(\alpha ,n)=E\left\{ x(\theta ^{(1)})\left( \theta ^{(1)}-x(\theta ^{(1)})\right) \right\} \end{aligned}$$

where \(\theta ^{(1)}\) denotes the first-order statistic out of n draws from F. So

$$\begin{aligned} W\left( {\alpha ,n} \right)& \equiv \, E\left\{ {\frac{{1 + \alpha \left( {n - 1} \right)}}{{\left( {2 + \alpha \left( {n - 1} \right)} \right)^{2} }}\left( {\theta ^{{(1)}} } \right)^{2} } \right\}\\ &= \, \frac{{1 + \alpha \left( {n - 1} \right)}}{{\left( {2 + \alpha \left( {n - 1} \right)} \right)^{2} }}\int_{0}^{1} {\left( {\theta ^{{(1)}} } \right)^{2} } dF(\theta ^{{(1)}} )^{n}\\ &= \, \frac{{\left( {\left( {n - 1} \right)\alpha + 1} \right)n\alpha }}{{\left( {\left( {n - 1} \right)\alpha + 2} \right)^{2} \left( {n\alpha + 2} \right)}}.\end{aligned}$$
(2)

Rather than having an election in which several candidates compete by proposing a policy, the electorate could also decide to pick a candidate at random and let this candidate choose the policy. Note that this is analytically equivalent to having an election in which exactly one candidate participates, so \(n=1.\) This sole candidate will run on platform \(x(\theta )=\theta /2,\) as can also be readily verified from (1). This is also her preferred policy. Expected welfare then equals \(W(\alpha ,1)=\alpha /4\left( \alpha +2\right) ,\) which follows directly from (2).

Table 1 reports welfare for selected values for n and \(\alpha\). The first thing to note is that the result that we derived in Sect. 3 is not general. For \(\alpha =1/2\) or \(\alpha =1,\) the electorate is strictly better off with an election with two candidates, rather than by picking a candidate at random, i.e. having \(n=1\). With \(\alpha =2,\) the electorate is exactly indifferent between one or two candidates. Hence, in that case, the selection effect and the overambition effect exactly cancel out. Adding more candidates, however, does make the electorate strictly worse off: beyond two candidates, the overambition effect again dominates the selection effect. For lower values of \(\alpha ,\) a similar mechanism is at work; for low values of n,  the selection effect dominates, but for large values, the overambition effect does. With \(\alpha =1/2,\) the optimal number of candidates from a welfare point of view, equals \(n=5.\) If \(\alpha =1\), the optimal number is \(n=3.\) Hence, at least for the values of \(\alpha\) considered here, the optimal number of candidates seems to decrease with \(\alpha .\)

Table 1 Expected welfare \(W(\alpha ,n)\) for selected values of \(\alpha\) and n
Fig. 2
figure 2

Expected welfare \((\alpha ,n)\) for selected values of \(\alpha\) and n

Figure 2 also illustrates these insights. For selected values of \(\alpha\) and n, we have plotted the resulting expected welfare. The left-hand panel focuses on higher values of \(\alpha\), while the left-hand panel focuses on lower values. Both panels clearly show the optimal number of candidates. For \(\alpha >2\), the electorate is clearly better off having just one candidate. As \(\alpha\) decreases, the optimal number of candidates increases.

In the remainder of this section, we will make these insights more precise, and prove them.

Theorem 1

The optimal number of candidates \(n^{*}\) is finite. Let \(N^{*}\subset \mathbb {N}\) be the set of the number of candidates from which welfare is maximized. Then \(\left| N^{*}\right| \in \{1,2\}\). If \(\left| N^{*}\right| =2\), the two elements in \(N^{*}\) are adjacent.

Proof

The first part of the theorem follows immediately from the observation that \(W(\alpha ,1)>0\) and \(\lim _{n\rightarrow \infty }W(\alpha ,n)=0\) for all \(\alpha >0\). To prove the other claims in the theorem, we treat n as a continuous variable for the moment and check how the welfare function W varies with it. Notice that \(W(\alpha ,n)\) is a differentiable function in n for \(n\ge 1\). Taking the derivative of \(W(\alpha ,n)\) with respect to n yields

$$\begin{aligned} \frac{\partial W(\alpha ,n)}{\partial n}=-\alpha \frac{\alpha \left( n-1\right) \left( 2\alpha +n^{2}\alpha ^{2}-6\right) -4}{\left( \left( n-1\right) \alpha +2\right) ^{3}\left( n\alpha +2\right) ^{2}}. \end{aligned}$$
(3)

Define

$$\begin{aligned} z(\alpha ,n)\equiv \alpha \left( n-1\right) \left( 2\alpha +n^{2}\alpha ^{2}-6\right) -4. \end{aligned}$$
(4)

Note that \(\frac{\partial W(\alpha ,n)}{\partial n}=0\) if \(z(\alpha ,n)=0.\) Because z is strictly increasing in n, there is at most one \(\tilde{n} \in \mathbb {R}\) for which \(\frac{\partial W(\alpha ,\tilde{n})}{\partial n}=0\). Also note that for constant \(\alpha\), W is single peaked in n. The reason is that for all \(n>\tilde{n}\) [\(n<\tilde{n}\)], \(z(\alpha ,n)>0\) [\(z(\alpha ,n)<0\)]. Because W is single peaked in n, the set \(N^{*}\) of optimal number of candidates is constructed as follows. If \(\tilde{n}\le 1\), \(N^{*}=\{1\}\). Otherwise, \(n^{*} \in \arg \max _{{n \in \left\{ {\left\lfloor {\tilde{n}} \right\rfloor ,\left\lceil {{\text{ }}\tilde{n}} \right\rceil } \right\}}} W\left( {\alpha ,n} \right) \equiv N^{*} \subset \left\{ {\left\lfloor {\tilde{n}} \right\rfloor ,\left\lceil {{\text{ }}\tilde{n}} \right\rceil } \right\}\).

The following theorem specifies for which \(\alpha\) the electorate is always strictly better off by picking a candidate at random.

Theorem 2

If \(\alpha >2\), the electorate is always strictly better off by picking a candidate at random rather than having an election in which \(n\ge 2\) candidates participate. For \(\alpha <2\), an election with two candidates makes the electorate better off than picking a candidate at random.

Proof

Consider a move from 1 to 2 candidates. Using (2) the net effect on welfare of such a move is

$$\begin{aligned} \Delta W=\frac{\left( 2\alpha -\alpha +1\right) 2\alpha }{\left( 2\alpha -\alpha +2\right) ^{2}\left( 2\alpha +2\right) }-\frac{\left( \alpha -\alpha +1\right) \alpha }{\left( \alpha -\alpha +2\right) ^{2}\left( \alpha +2\right) }=\frac{\left( 2-\alpha \right) \alpha }{4\left( 2+\alpha \right) ^{2}}. \end{aligned}$$

This is negative if \(\alpha >2\) and 0 for \(\alpha =2\). To see that welfare is also lower with a higher number of candidates, note from (4) that \(z(\alpha ,2)>0\) for \(\alpha \ge 2,\) hence from (3), we then have W decreasing in n. For \(\alpha <2\), \(\Delta W>0\), so that an election with two candidates makes the electorate better off than picking a candidate at random.

For picking a random candidate to maximize the electorate’s welfare we thus need that \(\alpha\) is sufficiently high, hence that the distribution of potential candidate is sufficiently skewed to the right. In that case, picking a particularly bad candidate at random is sufficiently unlikely.

As noted, Table 1 also suggests that the optimal number of candidates is decreasing in \(\alpha .\) We can show that this is generally true for a continuous proxy \(\tilde{n}\) for the optimal number of bidders \(n^{*}\), i.e., if \(\tilde{n}\le 1\), \(n^{*}=1\). Otherwise, \(n^{*} \in \arg \max _{{n \in \left\{ {\left\lfloor {\tilde{n}} \right\rfloor ,\left\lceil {{\text{ }}\tilde{n}} \right\rceil } \right\}}} W\left( {\alpha ,n} \right)\).

Theorem 3

Let \(\tilde{n}\in \mathbb {R}\) for which \(\frac{\partial W(\alpha ,\tilde{n})}{\partial n}=0\). Then \(\tilde{n}\) is unique and decreasing in \(\alpha .\)

Proof

From the proof of Theorem 1\(\tilde{n}\) is unique and implicitly defined by

$$\begin{aligned} z(\alpha ,\tilde{n})\equiv \alpha \left( \tilde{n}-1\right) \left( 2\alpha + \tilde{n}^{2}\alpha ^{2}-6\right) -4\equiv 0\text {.} \end{aligned}$$
(5)

That implies that

$$\begin{aligned} \frac{{d\tilde{n}}}{{d\alpha }} &= \, \frac{{ - \partial z/\partial \alpha }}{{\partial z/\partial \tilde{n}}}\, = \;\frac{{ - \left( {\tilde{n} - 1} \right)\left( {4\alpha + 3\tilde{n}^{2} \alpha ^{2} - 6} \right)}}{{\alpha \left( {2\alpha - 6 + 3\tilde{n}^{2} \alpha ^{2} - 2\tilde{n}\alpha ^{2} } \right)}}\\ &= \, \frac{{ - 2\left( {\tilde{n} - 1} \right)\left( {2\alpha + \tilde{n}^{2} \alpha ^{2} - 6} \right) - \left( {\tilde{n} - 1} \right)\left( {6 + \tilde{n}^{2} \alpha ^{2} } \right)}}{{\alpha \left( {2\alpha - 6 + \tilde{n}^{2} \alpha ^{2} } \right) + \alpha \left( {2\tilde{n}^{2} \alpha ^{2} - 2\tilde{n}\alpha ^{2} } \right)}}.\end{aligned}$$

From (5), we have that at the optimum

$$\begin{aligned} 2\alpha +\tilde{n}^{2}\alpha ^{2}-6=\frac{4}{\alpha \left( \tilde{n} -1\right) } \end{aligned}$$

hence

$$\begin{aligned} \frac{d\tilde{n}}{d\alpha }=\frac{-\frac{8}{\alpha }-\left( \tilde{n} -1\right) \left( 6+\tilde{n}^{2}\alpha ^{2}\right) }{\frac{4}{\tilde{n}-1} +2\alpha ^{3}\left( \tilde{n}-1\right) \tilde{n}}<0, \end{aligned}$$

since the numerator is clearly negative, and the denominator clearly positive.

Taken together, our results imply that, for sufficiently high n,  the overambition effect always dominates the selection effect. Hence, regardless of the exact parameters of our model, it is never true that having more candidates is always better for welfare. Moreover, this effect becomes more pronounced for higher \(\alpha .\) Once \(\alpha \ge 2,\) the optimal number of candidates simply equals 1, and the electorate would rather pick a candidate at random rather than having an election. Intuitively, a higher value of \(\alpha\) implies that there is a higher probability of having candidates with high ability. An individual candidate with high ability is then more likely to face a contestant that also has high ability. This implies that she has to exaggerate even more to convince the electorate that she really is the best candidate. Hence, under such circumstances, a candidate will be even more overambitious, which we can also see from (1). This implies that the overambition effect is more likely to dominate the selection effect.

5 Extensions

In this section, we study a number of extensions to our basic model. Section 5.1 considers the case in which candidates obtain benefits from being in office, over and beyond the benefits that accrue to them due to the payoff to the policy that they implement. In Sect. 5.2, we study the case in which candidates first compete in primary elections to be put forward by political parties for the general election. The findings in Sects. 5.1 and 5.2 strengthen the results derived in our basic model, in that the electorate is more likely to want to pick a candidate at random rather than to have an election. Section 5.3 shows that it is easy to extend our model to allow for the choice of platform. Such an extension does not affect our results.

5.1 Benefits from Office

So far, we have assumed that the only payoff for the politician is related to the payoffs to the project that he implements while in office. Yet, there may be additional benefits to being in office as such, and that are unrelated to the project that is implemented. For example, when in office, a politician can channel funds to preferred projects, appoint cronies to influential positions with the implicit understanding that there will be some quid pro quo in the future, etc. In this section, we study the effect of such benefits from office on our analysis.

We assume that benefits from office are given by \(b\gamma ,\) where b reflects the importance of these benefits, and the multiplication by \(\gamma\) just amounts to a normalization. We assume that b is small enough relative to \(\alpha\) and n to always obtain internal solutions. The utility function of a candidate with true ability \(\theta _{T}\) that behaves as if her ability is \(\theta _{A},\) is now given by

$$\begin{aligned} v(\theta _{T},\theta _{A})=\left( \theta _{A}\right) ^{\alpha \left( n-1\right) }\gamma \left( \left( \theta _{T}-x(\theta _{A})\right) x(\theta _{A})+b\right) . \end{aligned}$$

Taking the first-order condition with respect to \(\theta _{A}\) and then imposing \(\theta _{A}=\theta _{T}\) yields

$$\begin{aligned} \theta _{T}\left( \theta _{T}-2x\left( \theta _{T}\right) \right) x^{\prime }\left( \theta _{T}\right) +\eta \left( \left( \theta _{T}-x\left( \theta _{T}\right) \right) x\left( \theta _{T}\right) +b\right) =0, \end{aligned}$$
(6)

where we have defined \(\eta \equiv \alpha (n-1).\)

We now have the following result:

Theorem 4

Suppose the distribution of abilities is given by \(F(\theta )=\theta ^{\alpha }\) on [0, 1]. Then for private benefits b sufficiently high, the electorate is always strictly better off by picking a candidate at random rather than having an election in which \(n\ge 2\) candidates make costly campaign promises.

Proof

First, suppose that the benefits from office are \(b\gamma \theta ^{2}\) instead of \(b\gamma\). The utility function of a candidate with true ability \(\theta _{T}\) that behaves as if her ability is \(\theta _{A},\) is now given by

$$\begin{aligned} v(\theta _{T},\theta _{A})=\left( \theta _{A}\right) ^{\alpha \left( n-1\right) }\gamma \left( \left( \theta _{T}-\tilde{x}(\theta _{A})\right) \tilde{x}(\theta _{A})+b\theta _{T}^{2}\right) . \end{aligned}$$

Taking the first-order condition with respect to \(\theta _{A}\) and then imposing \(\theta _{A}=\theta _{T}\) yields

$$\begin{aligned} \theta _{T}\left( \theta _{T}-2\tilde{x}\left( \theta _{T}\right) \right) \tilde{x}^{\prime }\left( \theta _{T}\right) +\eta \left( \left( \theta _{T}- \tilde{x}\left( \theta _{T}\right) \right) \tilde{x}\left( \theta _{T}\right) +b\theta _{T}^{2}\right) =0\text {.} \end{aligned}$$
(7)

Now assume that this differential equation has a linear solution \(\tilde{x}\left( \theta \right) =\zeta \theta\) for some \(\zeta >0\). Plugging in \(\tilde{x}\left( \theta \right) =\zeta \theta\) in (7) yields

$$\begin{aligned} \theta \left( \theta -2\zeta \theta \right) \zeta +\eta \left( \left( \theta - \zeta \theta \right) \zeta \theta +b\theta ^{2}\right) =0 \Rightarrow \left( 1-2\zeta \right) \zeta +\eta \left( \left( 1- \zeta \right) \zeta +b\right) =0\text {,} \end{aligned}$$
(8)

which implies

$$\begin{aligned} \zeta =\left( \frac{\eta +1}{2\eta +4}+\frac{1}{2} \sqrt{\left( \frac{\eta +1}{\eta +2}\right) ^{2}+\frac{4b\eta }{\eta +2}} \right) \text {,} \end{aligned}$$
(9)

so that

$$\begin{aligned} \tilde{x}\left( \theta \right) =\left( \frac{\eta +1}{2\eta +4}+\frac{1}{2} \sqrt{\left( \frac{\eta +1}{\eta +2}\right) ^{2}+\frac{4b\eta }{\eta +2}} \right) \theta \text {.} \end{aligned}$$
(10)

From the differential equations (6) and (7), it follows that \(x\left( \theta \right) >\tilde{x}\left( \theta \right)\) for all \(\theta \in (0,1]\) and \(n\ge 2\). The reason is that \(x\left( 0\right) =\tilde{x}\left( 0\right) =0\), and \(x\left( \theta \right) =\tilde{x}\left( \theta \right) \Rightarrow x^{\prime }\left( \theta \right) >\tilde{x}^{\prime }\left( \theta \right)\) for all \(\theta \in [0,1]\). Note that for \(n=1\), \(x\left( \theta \right) =\tilde{x}\left( \theta \right) =\frac{1}{2}\theta\). So, if we can show that welfare is maximized at \(n=1\) in the case that the benefits from office are \(b\gamma \theta ^{2}\), then we have that welfare is also maximized at \(n=1\) for the true benefits from office \(b\gamma\), since \(\beta \gamma \theta ^2 < \beta \gamma\) for all relevant \(\theta\).

Suppose in general that equilibrium promises in an election are given by \(x(\theta )=\) \(\beta \theta .\) Note from (10) that promises are of this form in our model with private benefits from office. Welfare then equals

$$\begin{aligned} W(\alpha ,n)& = E\left\{ {x(\theta ^{{(1)}} )\left( {\theta ^{{(1)}} - x(\theta ^{{(1)}} )} \right)} \right\} = \beta \left( {1 - \beta } \right)E\left\{ {\left( {\theta ^{{(1)}} } \right)^{2} } \right\}\\ &= \beta \left( {1 - \beta } \right)\int_{0}^{1} {\left( {\theta ^{{(1)}} } \right)^{2} } dF(\theta ^{{(1)}} )^{n} = \beta \left( {1 - \beta } \right)\frac{{\alpha n}}{{\alpha n + 2}} \end{aligned}$$

When picking a candidate at random, this candidate will again implement policy \(\theta /2\) and generate welfare equal to \(\alpha /\left( 4\alpha +8\right) ,\) as was the case in our standard model. Hence, the electorate is better off by picking a candidate at random if

$$\begin{aligned} \beta \left( 1-\beta \right) \frac{\alpha n}{\alpha n+2}<\frac{\alpha }{ 4\alpha +8} \end{aligned}$$

or

$$\begin{aligned} \beta >\frac{1}{2}+\frac{1}{2}\sqrt{1-\frac{4n\alpha +8}{4n\alpha +8n}} \equiv \beta ^{*} \end{aligned}$$

Note that for \(b\) sufficiently large, \(\eta > \beta ^{*}.\) Taken together, this implies the required result.

Thus, having benefits from office only strengthens our results: we now have that the electorate is always better off picking a candidate at random rather than having an election, provided that private benefits are large enough. Private benefits make winning the election more valuable for a given candidate, which implies that she is willing to put up an even more overambitious plan, that ultimately will yield a low expected utility not only to her, but also to the electorate. The stronger the private benefits, the stronger the overambition effect. Theorem 4 implies that it is always possible to have private benefits that are so strong that the overambition effect dominates the selection effect.

5.2 Primary Elections

In our analysis so far, we have assumed that any candidate can run in a general election. In practice, this is often not the case. Candidates have to win the primary election of a large political party in order to stand any chance in the general election. In the US, a candidate for almost any elected office first needs to win the nomination from either the Democratic or the Republican party to stand a serious chance. In the UK, a candidate first has to win the nomination of either the Labour or the Conservative Party, or perhaps the Liberal Democrats. Other examples abound. In this subsection, we show that such a set-up will only strengthen the results derived in our basic model.

Suppose there are two parties, 1 and 2. They will both field one candidate in the general election, and decide upon this candidate through some internal process. Naturally, each party will try to field the candidate with the highest ability. Suppose that for each party \(\nu\) candidates are running to obtain the candidacy of that party. Each candidate has an ability that is drawn from the cdf \(F(\theta )=\theta ^{\alpha }\) on [0,1]. For simplicity, we assume that party is able to fully observe the quality of a candidate of its own party. Party barons already have ample experience with the members of their own party, and through such intensive interaction, are able to fully assess their quality, something the general public is not able to do. That implies that, from the point of view of the electorate, the distribution of the ability of the candidate that party i will field in the general election is that of the first-order statistic out of \(\nu\) draws from F,  i.e. \(F_{i}(\theta )=\theta ^{\alpha \nu }.\) In the general election, the electorate then has a choice between two candidates with an ability that is drawn from that distribution. Theorem 2 then immediately implies

Corollary 1

Consider a general election in which two parties field a candidate that has been selected from \(\nu\) internal candidates that each have an ability that is drawn from \(F(\theta )=\theta ^{\alpha }\) on [0, 1]. Then the electorate is better off by picking a candidate at random rather than having an election if \(\alpha \nu >2.\)

In Sect. 4, we argued that the electorate is more likely to want to pick a candidate at random if the distribution of abilities is skewed towards high values. But if we first let parties pick the best candidate from among their ranks, we exactly achieve such skewness.

5.3 Ideologies

In this subsection we show that our basic model straightforwardly extends to a case in which not only the abilities of politicians matter, but also their ideology. Suppose that ideological preferences of voters are uniformly distributed on [0, 1],  where 0 represents the most left-wing, and 1 represents the most right-wing position. The position of the median voter is then given by \(p=1/2.\) Suppose that the utility to the voter located at L when a candidate is elected who offers ideological position p,  has ability \(\theta\), and promises to implement policy x,  is given by

$$\begin{aligned} U\left( L,p,\theta ,x\right) =-\left( p-L\right) ^{2}+x\left( \theta -x\right) . \end{aligned}$$
(11)

As in the standard Downs (1957) model, we assume that candidates have no ideology preference. The timing of the game is as follows:

  1. 1.

    The candidates independently choose an ideological position in the interval [0,1] and commit to a policy \(x\ge 0\).

  2. 2.

    A voter located at \(L\in [0,1]\) votes for candidate i if and only if \(i\in \text {argmax}_j E_{\hat{\theta }_j}\{U(L,p_j,\hat{\theta }_j,x_j)\}\), where \(p_j\) is candidate j’s position, \(\hat{\theta }_j\) is the voter’s belief about candidate j’s ability, and \(x_j\) is the policy candidate j commits to.

We obtain the following result:

Theorem 5

Suppose the ideological preferences of the voters are uniformly distributed on [0, 1] and the distribution of abilities is given by \(F(\theta )=\theta ^{\alpha }\) on [0, 1]. Then, a symmetric perfect Bayesian equilibrium exists in which all candidates choose the ideological position that is the preference of the median voter, i.e., \(p=1/2\); a candidate having ability \(\theta\) chooses the policy according to equation (1); the voters pick the candidate offering the highest x.

Proof

Suppose that all candidates choose \(p=1/2\) and make commitments according to (1). Suppose that candidate i considers a defection in one of the two dimensions (the policy \(x_1\) and the ideological position \(p_1\)). By construction, given that all candidates choose position 1/2,  proposing a policy different from (1) decreases the expected utility of candidate i, hence she has no incentive to do so. Now, consider candidate i considering her ideological position \(p_{1}\not =1/2\) while proposing a policy according to (1). Given (1), candidate i wins the election if she is preferred by the median voter and hence by the majority of the electorate. This is the case if

$$\begin{aligned} -(p_{1}-1/2)^{2}+\theta _{1}^{2}\frac{n\alpha -\alpha +1}{\left( n\alpha -\alpha +2\right) ^{2}}>\theta _{-1}^{2}\frac{n\alpha -\alpha +1}{\left( n\alpha -\alpha +2\right) ^{2}}, \end{aligned}$$

where \(\theta _{-1}\) is the highest \(\theta\) of all other candidates: \(\theta _{-1}\equiv \max \{\theta _{2},\ldots ,\theta _{n}\}\). This probability is maximized at \(p_{1}=1/2\), which is a contradiction to the assumption that \(p_{1}\not =1/2\). Analogously, it does not pay to defect in both dimensions at the same time.

As the choices of platform and ideological position are separable, we can focus on the choice of platform without having to take the choice of position into account. Therefore, all results for the original model carry over, in particular Theorems 1–3. Note that this conclusion does not hinge on the additive separability that we assume in (11). Any specification where the median voter’s utility is decreasing in distance from 1/2,  but increasing in \(x(\theta -x)\), will do.

6 Concluding Remarks

In this paper, we proposed a game-theoretic model in which candidates compete in elections when voters care about the winner’s ability to implement policies. We have shown that the candidates are inclined to come up with overambitious policy proposals in an attempt to convince the electorate that they are of the utmost competence. In that process, however, politicians may be so ambitious that the electorate would be better off by just picking an average candidate rather than going for the best politician that will attempt to implement some grandiose plan that is likely to fail.

We have thus shown that the disadvantage of having overambitious promises being implemented may dominate the advantage of being able to choose the best candidate. In other words, the overambition effect of election platforms may very well outweigh the selection effect of elections. This is particularly likely if the distribution of abilities is skewed toward high values, or if the number of candidates is large. We also showed that the electorate is always better off picking a candidate at random rather than having elections if the private benefits from office are high enough.

Intriguingly, our model thus gives a rationale for sortition, the assignment of political offices by lot. It was widely used by the Ancient Greeks. In Athens, in the fifth and fourth centuries B.C., virtually all administrative positions were filled in this way. It was also used in renaissance Venice and Florence. Ostfeld (2020) highlights modern sortition practices.Footnote 10 We provide a new argument for sortition: it prevents the use of costly overambitious campaign promises.

Nevertheless, our results should not necessarily be construed as an argument to abolish elections altogether. We would rather like to think of our analysis as an argument against election campaigns, rather than one against elections. At least, it suggests that society may better off if politicians postpone detailed policy proposals until after the election.