1 Introduction

Most government policies have redistributive consequences. Besides direct policies such as social expenditures, more indirect redistributive tools like regulating the provision of goods and services also result in a redistribution of welfare across society. How do governments choose the combination of redistributive policies?

A study of the trade-off between direct and indirect redistribution carries two key challenges. First, the analysis should incorporate the notion that the well-being of the poor largely depends on regulated goods and services.Footnote 1 As a result, inequality and poverty play different roles in determining the composition of redistribution. On the one hand, inequality creates incentives to increase social expenditures when higher-income citizens bear the burden. On the other hand, a high poverty rate offers a political reward for subsidizing regulated services that are essential in the consumption bundle of the poor.Footnote 2,Footnote 3 In this paper, we propose a theoretical characterisation of the preferences of the poor that captures these ideas and provides a neat mechanism whereby poverty affects both regulation and social expenditures.

The second challenge is to provide robust empirical evidence on the effect of inequality on redistribution that accounts for changes in poverty. The difficulty arises because, even when conceptually different, poverty and inequality are highly correlated in practice. We then propose an estimation strategy that tests the claim that poverty is an omitted variable that empirical specifications should include. We support this claim using a panel dataset that exploits changes in inequality and poverty during the Great Recession.

The literature on redistribution in democracies rests to a great extent on the political incentives that income inequality creates. On one side, seminal works state that in a progressive or proportional tax system, the wider is the income distribution, the higher are the electoral incentives to increase public expenditures and taxation (Meltzer and Richard 1981; Persson and Tabellini 1994; Alesina and Rodrik 1994). On the other side, there is also a literature claiming that heterogeneous beliefs towards inequality (Benabou and Tirole 2006; Alesina and Angeletos 2005), and other-regarding preferences (Galasso 2003; Dhami and Al-Nowaihi 2010a, b; Alesina et al. 2012; Epper et al. 2020) break, or even reverse, the positive relationship between income inequality and direct redistribution.

We contribute to the literature by recognizing that governments have access not only to direct but also to indirect redistributive policies. Thus poverty affects the relationship between inequality and direct redistribution. For example, an increase in poverty may counteract the positive effect of inequality on social expenditures, shifting the redistribution towards indirect policies. In this way, our paper contributes with a new explanation for the frequent mismatch between the empirical evidenceFootnote 4 and the implications of the pioneering literature on inequality and redistribution in democracies. Recent work builds on heterogeneous societies to propose alternative explanations for this discrepancy. For instance, the way differences in experienced income inequality affect the preferences for redistribution (Roth and Wohlfart 2018), the effect of immigration on natives’ attitude towards redistribution (Alesina et al. 2019), how campaign contributions create political incentives in favour of the rich (Campante 2011), and other insights within the framework of behavioural economics (Luebker 2014).

Our theory disentangles the effects of poverty and inequality on the trade-off between direct and indirect redistribution within a simple model of representative democracy (Besley and Coate 1997a; Osborne and Slivinski 1996) with two types of policies. The indirect policy is the price of a regulated good; for example, the tariff of a public utility. The direct policy is a universal income transfer. The consolidated costs of the government redistribution are subject to the budget constraint imposed by the revenues collected from a linear income tax.Footnote 5

In this article, we highlight the attribute of poverty that best explains redistribution. For the poor, regulated goods and services are essential. We capture this idea, assuming quasilinear preferences such that, when income is low enough, the consumption of the regulated good increases with income. Thus, our model defines the poor and the poverty line through consumption behaviour, and this characterisation drives our theoretical results.

One of these results is that the trade-off between direct and indirect redistribution depends on the income threshold that divides the rich and the poor, and the swing voters within each of these groups. The relevance of both the poverty line and the multiple decisive voters becomes clear when considering a population comprising only non-poor citizens. If so, the median income is the only swing voter, and therefore income inequality drives the composition of the redistribution. Regulation and income transfers work as substitutes that satisfy the same end, i.e. they respond to the incentives that inequality creates.Footnote 6

Instead, our model allows for the policy to affect the rich and the poor in different ways. As a result, it changes the characteristics of the distribution of citizens that are critical as follows. When the proportion of poor citizens is large, the model provides conditions for smaller income transfers and more redistribution through regulation. This outcome suggests it is politically efficient for governments in poor societies to offer less generous direct benefits with universal access along with high indirect benefits like low tariffs of public utilities. Further, the relationship between redistribution and inequality depends crucially on poverty, possibly to the extent where the effect of inequality on policies goes in opposite directions depending on the size of the poverty rate. As an example of our results, Proposition 2 supports that an increase in inequality leads to smaller income transfers when poverty is high, and larger income transfers when poverty is low.

Does the evidence support these results? We test the effects of inequality and poverty on redistribution using data on taxes and expenditures from Spanish municipalities from 2006 to 2015. Examining municipalities instead of countries helps reduce the unobserved heterogeneity that comes from institutional differences and other country-level unobserved characteristics. Still, we face a major identification challenge posed by reverse causality. Changes in the income distribution trigger changes in public policy components, and also changes in public policy affect the income distribution (Martínez-Vázquez et al. 2012; Smeeding 2006; Keane and Prasad 2002; Aaberge et al. 2019). We tackle endogeneity by exploiting the heterogeneous exposure of municipalities to the Great Recession. We show that municipalities with a larger share of individuals in their early careers were more exposed to the Great Recession. Besides, the recession has affected female and male workers differently (De la Rica and Rebollo-Sanz 2017; Bonhomme and Hospido 2013). Municipalities with higher shares of females in the population experienced lower income inequality growth.Footnote 7

When instrumenting with these demographics, we find that inequality increases the share of social expenditures in the budget of municipalities while poverty decreases it. We find a similar pattern when testing the regulation of transport, albeit with results that go in the opposite direction. Poverty increases indirect redistribution while inequality decreases it. As we proxy poverty as the population’s proportion with income below 60% of the median, the relevance of these results relies on how well the variation of this measure of poverty, and the instruments, captures the variation in the consumption-poverty rate we propose in our theoretical model. We cannot directly corroborate this, but we do show that omitting poverty biases the coefficients, even to the point of reversing the signs of the estimates.

Our empirical exercise shows that, accounting for poverty, high inequality provides incentives to engage in direct redistribution through social expenditures. In this way, we contribute to the large body of empirical work that proposes different identification strategies for the relationship between income inequality and redistributive policies (for example, Ramcharan 2010; Lupu and Pontusson 2011; Boustan et al. 2013; Karadja et al. 2017).

The rest of the paper proceeds as follows. Section 2 lays out the model, describes the preferences of poor citizens and builds the budget constraint of the government. Section 3 develops a benchmark without poor citizens. Section 4 explores a general environment where society comprises distinct groups of citizens based on their consumption behaviour. This section provides empirical evidence of the effects of poverty and inequality on the composition of government redistribution. It contains the major results of the paper. Section 5 concludes and opens further research questions. Appendix A describes the data and provides robustness checks. The Online Appendix contains all the theoretical proofs.

2 Preliminaries

We develop a model to illustrate how the composition of governments’ redistribution depends not only on inequality but also on the poverty rate. To capture the notion of composition, we consider two policies; the price of a regulated good or service and a universal income transfer. We depart from the idea that the consumption behaviour of the poor differs from that of the rest of the population to explore its effect on the relative importance of regulation and income transfers for redistribution.

Formally, a community of a unit mass of voters (\({\mathbb {V}}\)) elects a policymaker from a large but finite set of potential candidates (\({\mathbb {C}}\)) to implement a policy. Each citizen, either candidate or voter, consumes two goods: good x produced by a linear technology with a unit price; and the regulated good y, subject to decreasing average costs.

Once in office, the elected candidate chooses a policy vector \(\mathbf{q} \in Q\) with three components: the price of the regulated good,Footnote 8\(p \in {\mathbb {R}}_{+}\); a lump-sum income transfer of \(r \in {\mathbb {R}}\) per capita; and a linear tax rate on income, \(t \in [ 0, 1 ]\). All subject to the government budget constraint by which the sum of expenditures on income transfers and subsidies to the price of the regulated good cannot exceed the total tax revenues.

The regulated good in our model captures the trade-off between price and lump-sum redistributions. For this purpose, this hypothetical good must satisfy two characteristics; the government (or the elected candidate) sets the price; and the good is essential, in the sense that the consumption increases with income for poor citizens but not for the non-poor ones. Are there real-world goods or services that suit this description? We think public utilities are good examples of regulated goods. Of course, in reality, governments do not directly set the price of, say, electricity. Still, a price-setting government is an accurate approximation for public utilities.Footnote 9 The same applies to public transport, where local governments usually hold significant power in setting fares. That is the case in most cities in Spain. Also, we can consider both electricity and public transport as essential services.

We identify voters with their exogenous income \(\omega \in \Omega \subset {\mathbb {R}}_{+}\), which distribution responds to the CDF \(F(\omega )\). Candidates are also concerned with how sustainable is the provision of the good under regulation. We represent this attribute of candidates by some number \(\gamma \in \Gamma \subseteq (0,1]\) that weighs the candidate’s value of the profit.Footnote 10 This feature captures the idea that candidates differ not only on their preferences over income distribution but also on their preferences over sustainability. All traits are common knowledge. Voters know the value \(\gamma\) and the income associated with each candidate, and candidates know the voters’ income distribution.

Preferences are represented by the utility function \(V: Q \times \Omega \times \Gamma \rightarrow {\mathbb {R}}\). To compete in elections, any potential candidate must pay the same entry cost \(\delta > 0\). All citizens derive utility from the policy implemented, regardless of the policymaker’s identity.

The political process has three stages. First (entry-stage), potential candidates decide whether they compete in the election (and pay \(\delta\)) or they stay out. \({\mathbb {E}} \subseteq {\mathbb {C}}\) denotes the set of self-declared candidates. Second, each voter casts one vote for one of the self-declared candidates. The appointed policymaker is the candidate who receives the most votes, with ties to be decided by a fair coin toss. If only one candidate runs, they are automatically appointed. Last, the winning candidate implements her preferred policy. The default policy \({ \mathbf{q}_0 }\) prevails if \({\mathbb {E}}=\emptyset\).

Under sincere voting, all the strategic behaviour occurs at the entry stage.Footnote 11 Therefore, the solution concept is Nash equilibrium.

2.1 Defining the poor

Subsidising the prices of regulated goods and services aims to improve the well-being of the poor. This statement conveys that poor citizens consume regulated goods differently from the rest of society, and therefore, we can identify the poor by their consumption behaviour. We start from the basic notion that regulated goods are essential for the poor; they dispense with all goods but the regulated ones.

To model this idea, we assume consumers’ preferences are represented by a quasilinear utility function in the form \(\psi (y)+x\), where the non-linearity is over the regulated good, y, with \(\psi '(y)>0\) and \(\psi ''(y)<0\). Then, an interior solution results in the following indirect utility function associated with \(\psi (y)+x\) and the consumer’s budget constraint,

$$\begin{aligned} \begin{aligned} V( { \mathbf{q} } ;\,\omega ,0) = v(p) + (1-t) \omega + r \end{aligned} \end{aligned}$$
(1)

where \({ \mathbf{q} } = (p,r,t)\) is the policy vector comprising the regulated price, the income transfer, and the tax rate. Notice that \(V( { \mathbf{q} } ;\,\omega ,0)\) indicates \(\gamma =0\) for voters. In addition, \(v (p) = \psi (\psi ^{'-1} (p) ) - \psi ^{'-1} (p)p\) is the indirect utility of consuming the optimal quantity of the regulated good, \(\psi (\psi ^{'-1} (p) )\), minus the budget spent on the regulated good, \(\psi ^{'-1} (p)p\). Given the assumptions on \(\psi (\cdot )\), v satisfies \(-v'(p)=\psi (p)^{'-1}>0\) and \(-v''(p)<0\). The optimal consumption of good x is equal to the residual income given by \((1-t) \omega + r - \psi ^{'-1} (p)p\).

However, an interior solution is not guaranteed. By quasilinearity, the marginal utility of consuming the regulated good is decreasing while that of the linear good is constant. Therefore, when the support of the income distribution includes values low enough, there is a fraction of citizens that spend the entire budget on the regulated good. As a result, the demands of the regulated and the linear goods satisfy the general forms,

$$\begin{aligned} \begin{aligned} x^*&= x(1,p,t,r;\,\omega ) = max \bigg \{ (1-t) \omega + r - \psi ^{'-1} (p)p; \,0 \bigg \} \\ y^*&= y(1,p,t,r;\,\omega ) = min \bigg \{ \psi ^{'-1} (p); \, \frac{(1-t) \omega + r}{p} \bigg \} \end{aligned} \end{aligned}$$

For a given policy vector \({ \mathbf{q} }\), there exists an income threshold, the poverty line \(\omega ^0 ({ \mathbf{q} })\), that divides the community into two groups. One consists of those citizens with income higher than \(\omega ^0\), namely “the rich” (R), who have preferences represented by function 1. The other is the group of citizens with income below \(\omega ^0({ \mathbf{q} })\), namely “the poor” (P), who spend all the budget on the regulated good. The following indirect utility function represents the preferences of the group P (for \(p>0\)),

$$\begin{aligned} \begin{aligned} V( { \mathbf{q} } ;\,\omega ,0) = \psi \bigg ( \frac{(1 - t) \omega + r}{p} \bigg ) \end{aligned} \end{aligned}$$
(2)

Citizens at the poverty line are simultaneously poor and rich. As a result, at \(\omega ^0\), the demand for the regulated good by a rich citizen (\(-v'(p)=\psi ^{'-1} (p)\), derived from (1) must equal the demand by a poor citizen (the total real net income that is the argument of function (2). The outcome, for \(t \in [ 0, 1 )\), is the following expression for the endogenous poverty line,

$$\begin{aligned} \omega ^0({ \mathbf{q} }) = \frac{\psi ^{'-1}(p)p - r}{1 - t}. \end{aligned}$$
(3)

The endogenous poverty rate is then the mass of citizens with income below \(\omega ^0({ \mathbf{q} })\) in 3, that is \(F[\omega ^0({ \mathbf{q} })]\). Note that mass equals rate (or percentage) because we normalise the total mass of voters to one.

Therefore, to focus on the main channel whereby poverty affects redistribution, we characterise the poor more by the bundle they consume than the income they receive. As a result, policy changes affect the well-being of poor and rich citizens differently, and also shift the poverty line. This is how our model relates a compound redistributive policy to income inequality and the poverty rate.

2.2 Candidates’ preferences and the government budget constraint

Candidates have two attributes; their exogenous income and some number \(\gamma \in (0,1]\) that weighs their value of the profit. The subset of policies preferred by a generic candidate \((\omega ,\gamma )\) is,

$$\begin{aligned} \begin{aligned} \mathbf{q } \in \underset{ \mathbf{q } \in Q }{{\text {arg max}}} \, \bigg \{ V( \mathbf{q } ;\,\omega , \gamma )&= v(p) + ( 1 - t) \omega + r + \gamma \Pi (\mathbf{q })\\ s.t. \,&\Pi (\mathbf{q }) + t{\overline{\omega }} - r \ge 0 \bigg \}. \end{aligned} \end{aligned}$$
(4)

The profits of the supplier are \(\Pi ( \mathbf{q} ) = p \big [ y^P (\mathbf{q}) + y^R(p) \big ] - K\); where \(y^k\) is the demand for good y with \(k = P,R\) referring to the poor and the rich. K represents the fixed costs.Footnote 12\({\overline{\omega }}=\int _{\omega ^-}^{\infty } \omega d F(\omega )\) is the average income; \(F(\omega )\) is the fraction of the community with income less than \(\omega\), and \(\omega ^-\) the lowest level of income in the society. Thus, \(py^P(\mathbf{q})=( 1 - t) \int _{\omega ^-}^{\omega ^0( \mathbf{q } )} \omega dF(\omega )+rF(\omega ^0(\mathbf{q }))\), since the poor spend all the income on y.

The relation between policies and the different groups presents three main features. First, the impact of policies on group P is different from the effect on R. This is clear when we define the vector of voters’ utilities \(\big (u_{jc} \big )_{j \in {\mathbb {V}}}\)Footnote 13 with \(u_{jc} = V( \mathbf{q_c } ;\,\omega ^j,0)\), induced by the policy of generic candidate c.

$$\begin{aligned} u_{jc} = \left\{ \begin{matrix} v(p_c) + (1 - t_c) \omega ^j + r_c &{} if &{} j \in R \\ \psi \bigg ( \frac{(1 - t_c) \omega ^j + r_c}{p_c} \bigg ) &{} if &{} j \in P \end{matrix}\right. \end{aligned}$$
(5)

Second, the demands for the regulated good by groups P and R determine the budget constraintFootnote 14 as follows,

$$\begin{aligned} \begin{aligned} B(\mathbf{q }) = \underbrace{( 1 - t) \int _{\omega ^-}^{\omega ^0( \mathbf{q } )} \omega dF(\omega ) + a^P(\mathbf{q }) r - a^R(\mathbf{q }) p v'(p) - K}_{{\text {Profit from the Provision of the Regulated Good}}} + t {\overline{\omega }} - r \ge 0 \end{aligned} \end{aligned}$$
(6)

where \(a^P(\mathbf{q }) = F(\omega ^0(\mathbf{q }))\) and \(a^R(\mathbf{q }) = 1 - F(\omega ^0(\mathbf{q }))\) are the masses of voters in group P and R, respectively. The budget constraint in 6 conveys that the sum of the profit accruing from the provision of the regulated good to the poor and the rich, and the tax collection (\(t {\overline{\omega }}\)) cannot be lower than the total amount of income redistributed (r). Note the profits generated from the provision of the regulated good enter directly into the government budget. We think this feature captures, in a simplified way, that, for the government, increasing redistribution via prices of regulated goods comes at a cost. This cost goes to the government budget directly when the good is publicly provided, or indirectly via subsidies to private providers. Of course, the second interpretation would be valid only if the provision raised negative profits and the government fully compensated the private provider.

Last, the definition of profit (loss) provides information on the relationship between candidates’ objective functions and the size of groups P and R,

$$\begin{aligned} \begin{aligned} V( \mathbf{q } ;\,\omega ^c, \gamma ^c)&= v(p) + ( 1 - t) \omega ^c + r \\&\quad + \gamma ^c \bigg [ ( 1 - t) \int _{\omega ^-}^{\omega ^0(\mathbf{q })} \omega dF(\omega ) + a^P(\mathbf{q }) r - a^R(\mathbf{q }) p v'(p) - K \bigg ] \end{aligned} \end{aligned}$$
(7)

A larger poor group has two effects on the value functions of candidates; it increases the profits for the total income of the poor \(( 1 - t) \int _{\omega ^-}^{\omega ^0(\mathbf{q })} \omega dF(\omega ) + a^P(\mathbf{q }) r\) is larger, and it reduces the revenues from selling the regulated good to a smaller rich group \(- a^R(\mathbf{q }) p v'(p)\).

The rest of the paper characterises the expected outcomes of this setting and provides empirical evidence that supports the results.

3 Benchmark: a community with no poor

In this section, we study the composition of redistribution in a society where all citizens have income high enough to consume strictly positive quantities of all goods, i.e. a society with no poor. This exercise provides a standard of reference to compare with the main results of the paper.

We restrict the attention to the subset of policies that satisfy \(\omega ^0(\mathbf{q})\le \omega ^-\) or \(-p v'(p) \le \omega ^{-}(1-t)+ r\), where \(\omega ^-\) is the lowest level of income in the society. The absence of poor citizens allows projecting preferences over policies on voters’ income. In this environment, our setting results in a solution that coincides with the policy preferred by the median income voter, as shown in Lemma A.1 in the Online Appendix.

This result leads to a straightforward characterisation of the situation where one candidate runs unopposed. The existence of one-candidate equilibrium requires the only candidate, say i, to have an incentive to run in elections (\(\delta \le V( \mathbf{q_i};\,\omega ^i,\gamma ^i)- V( \mathbf{q_0};\,\omega ^i,\gamma ^i)\)); and any other candidate, \(k\ne i\), with the same or higher vote share, to stay out (\(\frac{1}{2}[V( \mathbf{q_k};\,\omega ^k,\gamma ^k)- V( \mathbf{q_i};\,\omega ^k,\gamma ^k)] \le \delta\) or \(V( \mathbf{q_k};\,\omega ^k,\gamma ^k)- V( \mathbf{q_i};\,\omega ^k,\gamma ^k) \le \delta\)).Footnote 15

The general idea is that the only candidate must be the one who is closest to the median voter; when candidates’ preferences for sustainability are controlled for. To illustrate this result, we consider the situation where all potential candidates have the same sustainability concern, i.e. the same \(\gamma\). In this case, the optimal price of the regulated good is the same for all candidates; and the optimal income transfer r is weakly decreasing in income for both candidates and voters.Footnote 16 Thus, a low transfer must coincide with a high median income. Proposition A.1 in the Online Appendix formalizes these ideas. The next subsection extends the analysis to the context of two-candidate equilibrium.

3.1 Two-candidate equilibria under zero poverty

In this subsection, we develop a simple approach to analyse the two-candidate equilibria in a society with no poor.Footnote 17 The outcomes of this approach corroborate the intuition that, with no poor, prices of regulated goods and income transfers behave as substitutes: the relative responses to changes in income inequality depend on budget incentives and the characteristics of potential candidates.

We start the analysis with the characterisation of the citizens’ space. The community comprises a unit mass of voters defined by the set \({\mathbb {V}} = \{ (\omega , \gamma ) \in \Omega \times \Gamma \mid \gamma = 0 \}\), and a feasible space of potential candidates given by \({C}^A = \{ (\omega , \gamma ) \in \Omega \times \Gamma \mid \gamma \in ( 0 , 1 ] \}\). Then, society is represented by the union \({\mathbb {V}} \cup {\mathbb {C}}^A\) which is convex for it forms a rectangle as the one showed in Fig. 1a.Footnote 18

A second characteristic that makes the model tractable is the linearity of preferences w.r.t. citizens’ traits. In other words, for any pair of candidates and their respective policies, there exists a line segment, namely the indifference line, that defines two convex subspaces consisting of citizens who prefer one policy or the other. The intersection of this line segment with the space of voters (the voters’ horizontal line) defines the vote share of each candidate. Lemma A.2. in the Online Appendix formalises this result. Corollary A.1 in the Online Appendix shows that when the profits are increasing in the price of the regulated good, the slope of the indifference line is negative (positive) if the indifference line intersects the space of voters to the right (left) of the mean income.

Fig. 1
figure 1

Zero poverty and two-candidate equilibria. The space of potential candidates is depicted in gray, while the space of voters is the black horizontal line. The blue line represents all the citizens (voters and potential candidates) that are indifferent between candidates i and k

In any two-candidate equilibrium, the indifference line must intersect the space of voters at the median income (see Fig. 1b). This result follows straightforwardly from the condition for which each candidate must get one-half of the total vote; and from Lemma A.2, whereby the intersection of the indifference line with the space of voters defines the vote share of each candidate. The next lemma shows the decisive role of the median income on the choice of the redistributive policy when there are no poor citizens in society. It also provides an expression for the minimum distance between the candidates that ensures they get positive payoffs from competing in elections (see Fig. 1c).

Lemma 1

(Two-candidate equilibria: incentive compatibility) Any equilibrium with two candidates (i and k such that \(\omega ^0(\mathbf{q}_i)\le \omega ^-\) and \(\omega ^0(\mathbf{q}_k)\le \omega ^-\)) satisfies,

(a):

the indifference line intersects the space of voters at the median income,\((\omega ^m , 0) \in I(\{ i , k \})\);

(b):

if \(\omega _i<{\overline{\omega }}<\omega _k\) and \(\omega ^m > {\overline{\omega }}\), then \(\gamma ^i<\gamma ^k\);

(c):

there exist two lines, \(I^- ( \{ i , k \})\) and \(I^+ (\{ i , k \})\) parallel to \(I ( \{ i , k \})\), that define the minimum horizontal distance between the candidates’ locations. This distance is given by \(\tfrac{4\delta }{|t_i - t_k|}\).

Lemma 1 helps locate the candidates in equilibrium, but these locations are not unique. The multiplicity of equilibria is two-fold: there exist a priori multiple pairs of candidates that define the same indifference line and multiple indifference lines that correspond to different equilibria. Notwithstanding, departing from the conditions provided in Lemma 1, we capture the effects of income inequality on the components of the redistributive policies and the potential location of candidates in equilibrium. As we formally do in the next proposition.

Proposition 1

(Inequality and the composition of redistribution: the zero-poverty case) Any two-candidate equilibrium in a society with no poor citizens satisfies,

(a):

if \(\omega ^m<(>) \,{\overline{\omega }}\), then the slope of the indifference line is increasing (decreasing) in the distance \(|\omega ^m - {\overline{\omega }} |\);

(b):

for any continuous change in the income distribution such that the new distribution \(F'\) satisfies \(F(x) < F'(x)\) with \(x = F^{-1}(\frac{1}{2})\), the new candidate, say a, will prefer both lower (higher) income transfer and lower (higher) price of the regulated good if \(-p_{a}v'(p_{a})-r_{a} > (<) -p_{-a}v'(p_{-a}) - r_{-a}\), where \(-a\) indicates the candidate other than a.

The general message of Proposition 1 is that regulated prices and income transfers are substitute components of redistributive policies: both serve the higher demand for redistribution when inequality increases.

Part b of Proposition 1 explores a change where the new distribution \(F'\) shows a greater probability that income takes a value less than or equal to the old median \(F^{-1}(\frac{1}{2})\). If F and \(F'\) share the same mean, then the condition implies that \(F'\) has a higher mean-to-median ratio than F, which is the inequality measure used by Meltzer and Richard 1981. This change would also be compatible with an increase in inequality in the Lorenz sense if, for instance, derived from a regressive transfer. Note the Gini index we use in our empirical exercise is compatible with the Lorenz Criterion. Then, by Proposition 1, a higher income inequality drives a new equilibrium where the candidate with the greater revenue from the regulated good provision (\(-pv'(p)\)) relative to transfers (r) prefers both lower tariff and lower transfers. Thus, they redistribute relatively more through a lower price of the regulated good. Instead, after the increase in inequality, the new candidate with a smaller revenue relative to transfers prefers higher tariffs and transfers, i.e. they redistribute more through income transfers.

The non-poor outcome is driven not only by the relative traits of the candidates but also by the functional form of the revenues that make regulation a more effective redistributive policy when the price of the regulated good is relatively high. We can draw a similar intuition from the result that the distance \(|\omega ^m - {\overline{\omega }} |\) increases the slope of the indifference line when \(\omega ^m < {\overline{\omega }}\), i.e. \(I^m(\{i,k\})\) moves counterclockwise. Then, we should expect a new subset of equilibrium candidates with lower \(\gamma\)’s and higher \(\omega\)’s, which contains more candidates preferring lower prices of the regulated good and smaller transfers as a response to the increased level of inequality. This interpretation is valid as long as the distance between the mean and median incomes accurately measures inequality. Of course, this is not always the case.

The general conclusions of this section are as follows. When there are no poor citizens in society, the median income is the only decisive voter. In this environment, regulation and income transfers are substitute components of the redistributive policy, in the sense that both respond to an increase in inequality similarly. In contrast, in the next section, we show poor citizens change not only the decisive voters in society but also the relationship between the components of the redistributive policy.

4 Poverty, inequality and redistribution

This section presents the main results of the paper. The goal is to characterise, both theoretically and empirically, the way poverty and inequality affect the components of redistributive policies.

As for the theory, we come back to the general version of the model by considering the existence of poor citizens for whom the consumption of the regulated good is monotone in income. In contrast to the non-poor benchmark, this environment can induce equilibria with multiple swing voters. Also, it may result in multiple poverty lines.

Indeed, a two-candidate equilibrium involves one poverty line per candidate. Since the poverty line is endogenous, two different policies result in two lines. Moreover, as the preferences of the poor are distinct from those of the rich, we cannot apply a separation argument to localize a unique swing voter for both groups. Thus multiple swing voters may arise. How many? On the rich side, it is straightforward that there is at most one swing voter’s income. Our benchmark analysis in Sect. 3 builds on this case. The next lemma shows that, on the side of the poor, there is also at most one swing voter, which makes all in all four potential income cut-offs: the rich and the poor swing voters, and one poverty line per candidate.

Lemma 2

(The poor and the rich: swing voters and cut-offs) For any pair of candidates i and k there exist at most four income cut-offs: one poor swing voter \(\omega ^{P*}(i,k)\), one rich swing voter \(\omega ^{R*}(i,k)\), and the poverty lines \(\omega ^0(k)\) and \(\omega ^0(i)\).

This lemma searches for the income in each group that is indifferent between the two candidates and then checks whether a separation argument applies. By separation argument, we mean that in a specific group all citizens with income strictly greater, or smaller, than the swing voter prefer the same candidate. Since the lemma proves that this is the case, we have at most one rich swing voter and one poor swing voter. The lemma’s proof in the Online Appendix provides the expressions for the poor swing voter \(\omega ^{*P}(i,k)\), the rich swing voter \(\omega ^{*R}(i,k)\), and the poverty lines under candidate k, \(\omega ^0(k)\), and i, \(\omega ^0(i)\), as functions of candidates’ policies.

Lemma 2 has two main corollaries. First, the income cut-offs define the grouping of voters as follows. The “Very poor” (“Very rich”) comprises all voters with income smaller (greater) than the poor swing voter’s \(\omega ^{*P}(i,k)\) (rich swing voter’s \(\omega ^{*R}(i,k)\)). The “Poor middle class” (“Rich middle class”) comprises all voters with income greater (smaller) than the poor swing voter’s \(\omega ^{*P}(i,k)\) (rich swing voter’s \(\omega ^{*R}(i,k)\)) and smaller (greater) than the poverty line under candidate k (i), \(\omega ^0 (k)\) (\(\omega ^0 (i)\)). The “Poor & rich” contains all voters lying in between the poverty lines \(\omega ^0 (k)\) and \(\omega ^0 (i)\). Figure 2 displays these groups on the voters’ line.

Fig. 2
figure 2

Swing voters and poverty lines. For candidates i and k, there are at most four cut-offs: one poor swing voter, \(\omega ^{*P}(i,k)\), one rich, \(\omega ^{*R}(i,k)\), and the poverty lines \(\omega ^{0}(i)\) and \(\omega ^{0}(k)\). The resulting groups are very poor, poor middle class, poor & rich, rich middle class, and very rich

A second corollary draws the incentive compatibility conditions for the two-candidate equilibria in every possible cut-offs scenario. The starting general condition is that candidates have an incentive to compete in elections. Both must have a one-half probability of winning the election. This condition, together with the distribution of voters \(F(\cdot )\), locate the actual candidates and the cut-offs. An example is a case where there is only one cut-off that coincides with the poor swing voter, \(\omega ^{*P}(i,k)\). Then, it must be both that the “Very poor” represents half of the voters \(F(\omega ^{*P}(i,k))=\frac {1}{2}\), and that they vote for i when the “Poor & Rich” prefers k, and consequently, all income greater than \(\omega ^{*P}(i,k)\) must prefer k too. On the other side, when the only cut-off is the rich swing voter \(\omega ^{*R}(i,k)\), the equilibria satisfy both the “Very rich” are half of the voters \(1-F(\omega ^{*R}(i,k))=\frac {1}{2}\) and they vote for i when the “Poor & Rich” prefers k. Here again, we cannot discard the multiplicity of equilibria. Therefore, we do not look for unique locations of candidates but insights on the factors that affect the policy components in equilibrium. Corollary A.2 in the Online Appendix presents the six alternative incentive compatibility conditions. The first two correspond to single poor and rich swing voters; the remaining four to multiple swing voters.

The incentive compatibility conditions, the budget constraint, and the cut-offs as functions of the policy components shape different systems of equations that the two-candidate equilibria must satisfy. If we take differentials of these systems, we can explore the simultaneous changes in policy components and income distribution parameters. In the next lemma, we show that the solutions of these systems for all types of equilibrium take a common general form.

Lemma 3

(General form of policy differentials) Consider a two-candidate equilibrium with candidates i and k such that \(\omega ^0(k)< \omega ^0(i)\). Then, for a fixed location of candidate i, the differentials of the policy components of candidate k take the form

$$\begin{aligned} \bigg ( \Upsilon _a(i,k) MR_R - \Lambda _a(i,k) \bigg )dr_k= &\, - \frac{\Lambda _a(i,k) t_k}{[1-F(\omega ^0(k))]}\times d{\overline{\omega }} \quad \text {and}\\ \bigg ( \Upsilon _a(i,k) MR_R - \Lambda _a(i,k) \bigg )dp_k= & \, - \frac{\Upsilon _a(i,k) t_k}{[1-F(\omega ^0(k))]}\times d{\overline{\omega }} \end{aligned}$$
(8)

where \(MR_R=-v''(p_k)p_k-v'(p_k)\) is the marginal revenue derived from the demand of the regulated good by the rich, \(\Upsilon _a(i,k)\) and \(\Lambda _a(i,k)\) are functions of candidates’ policies with \(a=1,2,3,4,5,6\) denoting the incentive compatibility conditions enumerated in Corollary A.2.

Equilibrium policies change only if the income distribution changes. For instance, suppose that the mean income changes; i.e., \(d{\overline{\omega }} \ne 0\). Then, Lemma 3 states that \(dp_k\) and \(dr_k\) depend on the mass of poor voters \(F(\omega ^0)\), the marginal revenue from the rich’s demand of the regulated good and the size of the government measured by \(t_k\). Further, every type of equilibrium, defined by conditions \(a=1,\ldots ,6\) of Corollary A.2 , results in different functions \(\Upsilon _a\) and \(\Lambda _a\) that determine the sign of the relation between mean income and policy components. To illustrate, suppose that \(\Upsilon _a\) and \(\Lambda _a\) are such that \(\Upsilon _a(i,k)MR_R - \Lambda _a(i,k)<0\), \(\Upsilon _a(i,k) > 0\), and \(\Lambda _a(i,k) > 0\). In this case, Lemma 3 states that an increase in the average income \({\overline{\omega }}\) pushes up both the income transfer \(r_k\) and the tariff of the regulated good \(p_k\). That translates into more direct redistribution and less indirect redistribution through regulation. Moreover, the size of these effects hinges on the mass of poor citizens \(F(\omega ^0)\).

Still, there are further changes in income distribution, other than mean-income changes, that involve poverty and inequality.

The next proposition explores their effects on the policy components for equilibria with one swing voter.

Proposition 2

(One-cut-off equilibrium and policy components) Consider a one-cut-off equilibrium with candidates i and k. Then, for any continuous change in the income distribution such that the new distribution \(F'\) satisfies \(F(x)<F'(x)\) with \(x=\omega ^{*b}(i,k)\) and \(b=P,R\), it must be that;

(a):
$$\begin{aligned} \begin{aligned} dr_k&< L \bigg (\Upsilon _1, \Lambda _1, F'[\omega ^0(k)],MR_R \bigg )\times d{\overline{\omega }} \, \text { and} \\ dp_k&< M \bigg (\Upsilon _1, \Lambda _1, F'[\omega ^0(k)],MR_R \bigg )\times d{\overline{\omega }} \end{aligned} \end{aligned}$$
(9)

if the cut-off is \(\omega ^{*P}(i,k)\) and \(\frac{\omega ^{*P}(i,k)(1-t_i)+r_i}{p_i}>MR_R\), or;

(b):
$$\begin{aligned} \begin{aligned} dr_k&> L \bigg (\Upsilon _2, \Lambda _2, F'[\omega ^0(k)],MR_R \bigg )\times d{\overline{\omega }} \, \text { and} \\ dp_k&> M \bigg (\Upsilon _2, \Lambda _2, F'[\omega ^0(k)],MR_R \bigg )\times d{\overline{\omega }} \end{aligned} \end{aligned}$$
(10)

if the cut-off is \(\omega ^{*R}(i,k)\);

with \(L(\cdot ), M (\cdot )>0\).

This result states that the policy effect of a change in the income distribution associated with higher inequality depends on whether the swing voter is rich or poor. Under a poor swing voter, more inequality leads to more redistribution through regulation and less income transfer. The outcome reverses with a rich swing voter. All these policy changes are relative to the mean income since, for a fixed tax rate, the mean income determines the tax revenue.

Proposition 2 requires only a greater mass of citizens below the swing voter, and so it is compatible with different changes in the income distribution. The relevance of this approach becomes apparent when looking at the measures of poverty and inequality in the empirical exercise. Poverty is the percentage of the population with income lower than 60% of the median, say income-poverty, while the Gini index measures inequality. Recall poverty in our theoretical model is the proportion of the population that consumes only the regulated good, say consumption-poverty. Consider now the case of a mean-preserving increase in the mass of individuals with less income than the poor swing voter. Under certain conditions, like regressive transfers, this change involves higher inequality in the Lorenz sense. Also, higher inequality as measured in our empirical analysis since the Gini index is Lorenz-consistent. We can now interpret Proposition 2: increased inequality results in less direct redistribution, like social expenditures, and lower prices of regulated goods like transport. We should expect opposite effects when the swing voter is rich or non-poor, as in Spain from 2006 to 2015, where social expenditures have increased with inequality.

What role does poverty play? Consumption-poverty in our model captures the incentive for the poor to consume a bundle that relies on regulated goods and services. Then Proposition 2 shows the consequences of such incentive: the sign of the effect of inequality on the use of different redistributive policies depends on how politically relevant poverty is. Our empirical exercise represents consumption-poverty using income-poverty as a proxy. Thus, the relevance of our estimations relies on how well the population with income below 60% of the total median captures the consumption behaviour of the poor we aim to highlight. We cannot corroborate this directly. What we find is that omitting poverty biases the coefficients, even reversing the signs of the correlations between inequality and redistribution.

The next proposition is the analogous to Proposition 2 for the cases with multiple cut-offs.

Proposition 3

(Multiple-cut-offs equilibria and policy components) In any multiple-cut-off equilibrium with candidates i and k where \(\frac{\omega ^{*P}(i,k)+r_i}{p_i}>MR_R\),

$$\begin{aligned} \begin{aligned} dr_k&= L \bigg (\Upsilon _a, \Lambda _a, F[\omega ^0(k)],MR_R \bigg )\times d{\overline{\omega }} \, \text { and} \\ dp_k&= M \bigg (\Upsilon _a, \Lambda _a, F[\omega ^0(k)],MR_R \bigg )\times d{\overline{\omega }} \end{aligned} \end{aligned}$$
(11)

where \(L(\cdot ), M(\cdot )>0\) and \(a=3,4,5,6\) denotes the incentive compatibility conditions enumerated in Corollary A.2.

In this proposition, we provide a general version of equilibrium conditions as equalities. The reason is that, in multiple cut-offs environments, changes not only in the mass of poor citizens but also in the relative size of the other groups of citizens alter the equilibrium policy. Therefore, a general characterisation of a meaningful change in the distribution is less straightforward than in the case with one swing voter. The takeaway, common to all the multiple cut-offs equilibria, is that we should expect more income transfers and a higher price of the regulated good as a response to a ceteris paribus increase in the mean income.

Other changes in the distribution also convey our general message on the relevance of consumption-poverty in explaining the relation between inequality and redistribution. For instance, consider the class of equilibria with two swing voters, \(\omega ^P(i,k)\) and \(\omega ^R(i,k)\). Then, using the lexicon of Fig. 2, an increase in the size of the “Very Poor” relative to the three middle class groups, results in policy changes \(dr_k<L\times d{\overline{\omega }}\) and \(dp_k<M\times d{\overline{\omega }}\). That is more redistribution through regulation and less through income transfers for any change that preserves (or increases) the mean income.

The main idea of the theory is that when governments have discretion in the use of regulation, the components of a redistributive policy depend not only on inequality but also on the mass of the poor citizens, relative to other groups in society. This general notion is embodied in Propositions 2 and 3 above. Next, we test empirically the effects of poverty and inequality on the use of two different redistributive policies, social expenditures, and transport prices.

4.1 Empirical evidence

In this section, we provide empirical evidence for the model’s implications using a novel panel of Spanish municipalities for the period 2006–2015. We have compiled these data combining information from multiple sources. We measure poverty and income inequality using a 4% sample of all Social Security records provided in the Muestra Continua de Vidas Laborales (MCVL). These contain earnings information, mainly from wages, for over a million individuals per year. Because of individual de-anonymization concerns, only municipalities with at least 40K inhabitants are identifiable. Besides, our data exclude Navarra and the Basque Country, as these operate outside the national common fiscal regime, implying that no earnings tax data is available for their residents. Overall, our analysis is representative of Spanish largerFootnote 19 municipalities other than those of Navarra and the Basque Country. To these data, we add municipality expenditures and taxes from local budgets, both provided by the Ministry of Finance.Footnote 20 We have also collected data on demographics from the continuous censusFootnote 21 that we used to measure exposure to the Great Recession. Last, from the Home Office, we have obtained electoral data with which we construct political controls for our regressions.Footnote 22

We use data from the Great Recession of 2008 to identify exogenous variations of our variable of interests, poverty and inequality. Table 1 provides descriptive statistics that account for the overall impact of the recession on poverty, inequality, demographics and political variables before, during, and after the shock. Both poverty and inequality have increased, although the growth rate of the latter has been larger. Also, the political environment has changed. The vote share of the main left-wing party declined and that of the main right-wing party increased. In addition, participation in elections and vote fragmentation experienced a decline. As for public policies, municipalities have decreased the share of their total expenditure on housing and other social purposes, our outcomes of interest.Footnote 23

Table 1 Descriptive statistics

4.1.1 Empirical strategy

The fundamental challenge for identification is that of reverse causality.Footnote 24 Changes in the income distribution trigger changes in public policy components, and also changes in public policy affect the income distribution (Wenli and Sarte 2004). We tackle this endogeneity by exploiting demographic heterogeneity in exposure to the Great Recession across Spanish municipalities. In particular, we use the higher exposure to the Great Recession of municipalities with a larger share of individuals in their early career, together with the differential effect of the Great Recession across genders (see De la Rica and Rebollo-Sanz 2017; Bonhomme and Hospido 2013).

Table 2 Municipality demographics, inequality and poverty

The burst of the construction bubble that predominantly destroyed male employment with lower-middle wages largely explains the increase in inequality during the great recession (Bonhomme and Hospido 2017). Table 2 confirms this claim, showing that municipalities with a larger share of females within the working-age population have experienced lower inequality. The Great Recession also had a higher impact on those citizens in their early career. Table 2 shows a positive correlation between inequality and the share of individuals in age 25–35 (young for shorter) within the 25-to-65 population (working-age population). Bell and Blanchflower (2011) provide evidence of the negative effect of the Great Recession on early career workers for Spain and other OECD countries.Footnote 25

The first stage of our estimation strategy in Table 2 is the regression of municipality-level inequality or poverty on the share of female and young adults in the working-age population. Importantly, we exploit the interaction of these demographics with post-recession dummies. For example, our first stage for inequality has the following form,

$$\begin{aligned} \begin{aligned} ineq_{mt} =&\, Demo_{mt} \theta ^\top + \varpi _t + {{\,\mathrm{\mathbbm {1}}\,}}[t>= 2008]Demo_{mt}\vartheta _1^\top \\&+ {{\,\mathrm{\mathbbm {1}}\,}}[t >= 2011]Demo_{mt}\vartheta _2^\top + X_{mt}\sigma ^\top + \mu _{m} + \xi \end{aligned} \end{aligned}$$
(12)

Demo stands for a vector containing the age and female composition of the municipality’s working-age population. These demographics have not only a stationary effect on inequality, captured by \(\theta\), but also an amplifying effect because of the recession, captured by \(\vartheta _1\) and \(\vartheta _2\). We allow for municipality-level fixed unobserved heterogeneity and national-level time-varying shocks through \(\mu\) and \(\varpi\), respectively. Something to highlight from (12) is that we exploit both the first impact and the second-dip of the great recession (see Ortega and Pe nalosa 2014).Footnote 26

The results in Table 2 are as follows. First, a larger number of young citizens as a proportion of the working-age population increases inequality. In particular, a one percentage point increase in the young share produces a 0.854% increase in the Gini index. Second, a rise in the female’s share negatively affects inequality. A one percentage point increase in the female share decreases the Gini index by 1.19%. Figure 3 shows these relations in scatter-plots of inequality deviations w.r.t. to the municipality means for post-2008 demographics. Demographic heterogeneity results in different exposure to the Great Recession not only in terms of inequality but also in terms of poverty.Footnote 27 The bottom panel of Table 2 shows that a one percentage point rise in the share of young citizens increases poverty by 1.613%. Besides, the female share is associated with 1.7% less poverty.

Fig. 3
figure 3

Inequality growth and demographics. Scatter plot of Gini five year log-changes (y-axis) on share of female (x-axis, left figure) and those aged 25–35 (x-axis, right figure) in the working age population. All residualized with respect to year averages. Best-fit line in red

The assertion that municipality demographics influence both inequality and poverty comes as no surprise, given the strong correlation between inequality growth and poverty growth. Figure 4 shows that those municipalities that experienced larger inequality growth over the Great Recession also experienced larger growth in poverty. Next, we use these results to identify the effect of inequality on redistribution while controlling for poverty. In particular, we use municipality variations in age and gender composition to instrument changes in poverty and inequality. This allows us to identify both the effect of poverty and inequality on redistribution. With this strategy, we can compare estimates of the effect of inequality on redistribution with and without accounting for changes in poverty. Our major claim is that failing to control for poverty biases inequality estimates.

Fig. 4
figure 4

Inequality and poverty. Municipality level five year inequality and poverty log-differences. Inequality measured using the Gini index, poverty is the proportion of the municipality population with income lower than 60% the national median. Inequality and poverty growth figures depicted are residuals from regressions with year dummies

Before introducing our estimates, we further discuss our identification assumptions. The validity of our instruments relies on the direct effect of demographics on redistribution being time-invariant, rather than on policy changes being orthogonal to municipality demographics. We assume that all changes in the effect of demographics on redistribution over the great recession work through changes in the income distribution. This reflects in our specification which restricts the policy-demographic gradient, \(\alpha\) in the equation below, to be constant across time,

$$\begin{aligned} \begin{aligned} Policy_{mt} =&\,{\beta _1} Inequality_{mt} + {\beta _2} Poverty_{mt} \\&+ Demo_{mt} \alpha ^\top + X_{mt} \varsigma ^\top + \gamma _t + \psi _m + {\epsilon }_{mt} \end{aligned} \end{aligned}$$
(13)

Thus our empirical model in (13) allows demographics to affect policy, although it constraints the direct demographics effects on policy to be time-constant. Namely, we exclude interactions of demographics and the Great Recession from our main equation. As a result, our proposed IV estimator allows the policy to respond directly to changes in the municipality demographic composition and use the interaction of the demographic composition and the post-recession period as an instrument for the income distribution.

To explain our use of multiple instruments, we start with the first stages. Assuming a single instrument z and abstracting from control variables,

$$\begin{aligned} \begin{aligned} Inequality_{mt} =&\,\pi z_{mt} + \xi ^I_{mt} \\ Poverty_{mt} =&\,\varpi z_{mt} + \xi ^P_{mt}. \\ \end{aligned} \end{aligned}$$
(14)

Endogeneity of the income distribution implies the moment conditions \({{\,\mathrm{\mathbbm {E}}\,}}[\xi ^I\epsilon ] \ne 0\), \({{\,\mathrm{\mathbbm {E}}\,}}[\xi ^P\epsilon ] \ne 0\), while if our instrument is valid, then \({{\,\mathrm{\mathbbm {E}}\,}}[z\epsilon ] = 0\). We can use the system of equations formed by (13) and (14) to investigate the effect of failing to account for differences in poverty on the IV estimator of the effect of inequality on redistribution. Suppose that \(\varpi \ne 0\), i.e. the inequality instrument also shifts poverty.Footnote 28 Then, the use of z as an instrument for inequality in a specification that omits poverty leads to an IV estimator, \({\tilde{\beta }}_{IV}\), with the following property,Footnote 29

$$\begin{aligned} {\tilde{\beta }}_{IV} \xrightarrow []{p} \beta _1 + \underbrace{\beta _2 \dfrac{\varpi }{\pi }}_{\text {OVB}}. \end{aligned}$$
(15)

Omitting poverty leads to a bias proportional to the ratio of the poverty first-stage to the inequality first-stage. For example, our female share interacted with the dummy post-2008 would cause a bias proportional to 1.430 (\(\approx\)1.194 / 1.708). In turn, the share of young citizens results in a bias proportional to 1.910 (\(\approx\) 1.931/.854).Footnote 30

Next, we present the estimates of poverty and inequality on direct and indirect redistribution that result from our empirical strategy. The evidence supports the claim that failing to account for poverty introduces a bias in the estimates of the effect of inequality on redistributive policies.

4.1.2 Inequality, poverty and direct redistribution

Our theoretical setting opens the possibility for diverse effects of poverty and inequality on the use of direct redistribution. Here, we capture direct redistribution by municipality-level social expenditure, in particular social services and housing. In Spanish municipality budgets, the entry social-services expenditures results from the aggregation of expenditures on all activities involved in the administration of social services, the social reintegration of those marginalised, and the promotion of gender equality. Housing expenditures comprise all services related to housing and urbanism, such as access to housing, public housing, and help to land purchasers.Footnote 31

We first report the OLS estimates of the effects of poverty and inequality on the social and housing share of total municipality expenditures.Footnote 32 Column (1) of Table 3 shows a negative, small and statistically insignificant effect of inequality on housing expenditure. This estimate turns positive and grows in magnitude once we control for poverty, as shown in column (2) that also presents a negative and significant correlation between poverty and housing expenditure. Using only housing expenditure (excluding urbanization), we find a larger effect of inequality and a small and statistically insignificant negative effect of poverty; see columns (3) and (4).Footnote 33 Finally, regressing social expenditures on either inequality or inequality and poverty produces a positive, although statistically insignificant partial correlation estimate.

Table 3 Inequality, poverty and municipality expenditures OLS estimates

To address the endogeneity bias, we now report the results of our instrumental variables strategy. We start with the estimates of the sole effect of inequality on housing expenditure for then comparing with the specification that includes both poverty and inequality. The estimates of inequality instrumented with the share of young citizens in Columns (1) and (2) of Table 4 show a negative effect of inequality on housing that is strongly significant. Panel A in column (1) reports the baseline specification, i.e. with a single interaction and no controls other than the fixed effects. We find that a 10% increase in inequality reduces housing expenditure by 9 percentage points (p.p.).Footnote 34 Introducing controls for demographics and political characteristics barely alters this estimate. Again, we get very similar estimates when adding the interaction of demographics with the second Great Recession dip that took place after 2011, column (2) of Table 4. Last, when we instrument inequality using the share of females, we obtain an estimate of the opposite sign but this time statistically insignificant.

Our major claim is that it is the omission of poverty that creates these differences and not the validity of our instruments. We can preliminarily see this by looking at the first stage estimates. By results in Table 2, if we include poverty in the main equation and instrument inequality with the share of young citizens, the bias is 1.34 (\(\approx\)1.91/1.43) times the bias produced when instrumenting inequality with the share of females. In this case, the switch in sign observed between estimates with different instruments (columns (1) and (3) of Table 4) suggests a positive effect of inequality on housing expenditure, and a negative effect of poverty.

Table 4 Inequality and municipality housing expenditure IV estimates

Our main specification provides robust evidence of a positive effect of inequality on direct redistribution when we account for poverty. The left-half of Table 5 reports the effect of inequality and poverty on housing expenditure using all the instruments. In particular, when instrumenting with the share of females interacted with the two recession dummies, the results support a positive (negative) effect of inequality (poverty) on housing expenditure. Pooling all instruments together produces similar estimates, and all these are robust to controls for demographics and political characteristics. Also, we provide estimates of the effects of inequality and poverty on social expenditures (right-half of Table 5). We find a positive effect of inequality, and our results suggest a mild effect of poverty on social expenditure. Importantly, the estimates of the effect of inequality are of similar magnitude as those in Table 7 where we exclude poverty. In numbers, a 10% increase in inequality increases expenditure on social services in a range between 7 and 3 p.p., depending on the controls included. These results are robust to all instruments, i.e. the estimates have the same sign and are of similar magnitudes.

Table 5 Inequality and municipality expenditure IV estimates, controlling for poverty

Our empirical strategy may raise three major concerns. The first is the under-identification of our proposed system of equations. This problem is relevant since the Kleibergen and Paap (2006) reduced rank statistic for weak instruments is below the i.i.d. critical values in Stock and Yogo (2005) and when instruments are weak, the IV estimator is likely biased (Bound et al. 1995). We follow the recommendation by Andrews et al. (2019) and report identification-robust Anderson and Rubin (1949) confidence sets in Figs. 7 and 8. These figures show that for housing expenditure, 90% confidence sets are bounded from below at \(\beta _{inequality} = 0\), and from above at \(\beta _{poverty} = 0\). With social services expenditure, the 90% confidence set crosses zero in both dimensions, although most of the confidence set lies on the sub-space of positive (negative) inequality (poverty) effects. These results further support that inequality increases direct redistribution while poverty decreases it.

A second concern is that our demographic instruments may have different effects on inequality in different income levels of the distribution. For example, Schwabish et al. (2006) finds that the relationship between inequality and social spending depends on whether inequality increases at the top (90/50 percentile ratio) or bottom (50/10 percentile ratio) of the income distribution. If this is the case, then changes observed in Table 4 may reflect heterogeneous treatment effects. We show that this is unlikely in Tables 8 and 9, by regressing changes in expenditures on different measures of inequality, including the top (90/50 percentile ratio) and the bottom (50/10 percentile ratio) of the income distribution. These estimates show that instrumenting inequality with the share of young produces a positive effect of inequality on social service expenditure, regardless of the type of inequality, top or bottom. We find a similar result when we instrument inequality with the female share. As for housing expenditure, instrumenting inequality with the share of young (female) produces negative (positive) and statistically significant (insignificant) effects. As a robustness check, we also estimate the effect of inequality on social and housing expenditures using a Gini index that accounts for long-term unemployment (columns (1) and (5) of Tables 8 and 9). This exercise results in estimates that are similar to those using the general definition of inequality in Tables 4 and 7.

The last concern is about the validity of our instruments. Those young individuals that were more exposed to the recession may likely have migrated to municipalities with more generous social services. The literature on welfare-benefits-induced migration (e.g. Kennan and Walker 2010; De Giorgi and Pellizzari 2009) documents that welfare access provides weak incentives to geographical mobility. Still, this literature has focused on mobility across areas considerably larger than municipalities. We address this concern by using demographics measured in 2006 before any sign of the upcoming recession was apparent. Results show that the estimates produced with lagged demographics (Tables 10, 11) are very close to the baseline ones (Tables 4, 7).

4.1.3 Inequality, poverty and indirect redistribution

How can we identify the effect of inequality on indirect redistribution? Our theoretical model offers two fundamental insights. We should control for poverty, and we should consider a policy that works as the price regulation of goods or services. One broad policy area that satisfies the second insight is the regulation on transport, both private and public. To the best of our knowledge, there is no systematic collection of data on the finances and prices of public transport for Spain. Instead, we use tax rates on private vehicles as proxies for the prices of road usage.Footnote 35

The choice of tax rates on private road vehicles has three advantages. First, compared to tax revenues, tax rate information better identifies a pricing policy since the former confounds the effects of changes in both the tax rate and the tax base. This feature is relevant for Spain, where during the Great Recession, not only income but also consumption inequality experienced significant changes (Anghel et al. 2018). Second, vehicle taxes are a good proxy for the price of a public service given that the use of roads is subject to decreasing average costs of provision. Last, the tax rate we consider does not charge sales and purchases but vehicle potential use. In Spain, taxes on vehicles depend on two characteristics, engine power and type of vehicle.Footnote 36 This allows us to focus on the use of roads by high-power cars,Footnote 37 which clearly is not a necessity. We refer to the high-power-car tariff as the “large-cars” tariff.

Opposite to the effect on public transport fares, our theoretical framework suggests we should expect tariffs on private vehicles to depend positively on poverty and negatively on inequality; for the poor consume intensively public transport and not private vehicles (Asensio et al. 2003). In Table 6 we provide IV estimates of the effect of inequality on log-tariffs for the largest car segment. When we instrument inequality with the municipality share of the young, we find an insignificant effect with an unclear sign. Instead, when the instrument is the share of females, we find a 6–9% decrease on the tariff per 10% increase in inequality. Given the first stages in Table 2, the change observed in these estimates suggests a positive effect of poverty, so that estimates in Table 6 that do not control for poverty, have an upward bias. IV estimates of the effect of inequality and poverty reported in Table 12 corroborate this result. We find a negative effect of inequality on the large-cars tariff, with a 10% increase in inequality decreasing the tariff by 8–18%, depending on the added controls. Poverty has a positive effect, with a 10% increase positively affecting the tariff by 2–9%.Footnote 38

5 Final remarks

In this paper, we propose a theoretical and empirical approach to study the effects of poverty and inequality on the mix of government redistributive policies. Our model builds upon the claim that the poor have identifiable consumption patterns. In particular, they consume regulated goods more intensively than the rest of the citizens. This feature prevents an exogenous definition of poverty. That is why we have chosen a theoretical model that allows for endogenously defined groupings. This choice has a cost in terms of the precision of the model’s predictions. Also, it makes it difficult to arrive at a neatly reduced form for empirical analysis. Future work might take on these two key challenges.

Our theoretical results convey that inequality and poverty may create diverging incentives for the combinations of indirect and direct redistributive policies. This idea has great significance for the rich literature on redistribution in democracies, for it provides further insights into the relationship between inequality and redistribution. Importantly, it suggests that a robust study of such a relationship should account for the effects of poverty.

Our empirical results are consistent with the notion that a greater mass of poor citizens creates incentives to redistribute more through regulation at the expense of social expenditures. Using data at the municipality level, we provide evidence whereby inequality increases direct redistribution and decreases indirect redistribution when we control for poverty. We further show that poverty and inequality may have opposite effects on redistribution; despite being strongly correlated. This evidence provides a further explanation for previous divergent results on the redistributive effects of inequality.

Still reminds the question of how all other types of government policy respond to incentives brought by changes in the income distribution. Understanding the trade-offs involved in the menu of government redistributive policies and their welfare implications is of critical relevance for identifying more particular relationships between income distribution and public policy. Future work can depart from the framework here introduced to better understand these trade-offs.