1 Introduction

Income tax progressivity helps to attain a more equal distribution of income, wealth and consumption. Additionally, in the presence of uninsurable uncertainty (whether because it is of aggregate nature, or being of idiosyncratic type, insurance markets are missing), progressive taxation provides some partial insurance and less volatile household consumption over time. The counterpart, however, is that progressive taxation introduces incentive distortions for labor supply, saving and investment decisions of private economic agents.

The optimal degree of progressivity of income taxation has been, and still is, a live issue among academic researchers which point out challenging results in several directions. But it is also an issue of concern among policy practitioners as the recent crisis has forced many governments to look for new sources of revenue, and higher and more progressive taxes seem to be on their schedules, raising the corresponding policy debate.

The issue dates back to Mirrlees (1971) seminal paper, so that the economic literature has produced some major works on the subject since then. Hubbard and Judd (1986) find that an exemption and a higher marginal tax rate can in some cases improve efficiency relative to a proportional tax, so that some degree of progressivity may be desirable. Conesa and Krueger (2006) find that the optimal US total income tax code is well approximated by a flat tax rate of 17.2 % and a fixed deduction of about $9,400. In a similar set-up, Conesa et al. (2009) conclude that, also for the US case, the optimal capital income tax (flat) rate is 36 %, while the optimal progressive labor income tax consists of a flat tax of 23 % with a deduction of $7,200. In a closely related paper, Peterman (2014) finds that the introduction of human capital causes a 4.7 % increase in the optimal capital tax and a notably flatter optimal labor income tax. Carroll and Young (2011) find that increases in the progressivity of the income tax schedule are associated with long-run distributions with greater aggregate income, wealth, capital and labor, and lower income inequality and higher wealth inequality. Diamond and Saez (2011) and Kindermann and Krueger (2014) suggest that the optimal labor income tax rate for the top earners would imply a marginal tax rate as high as 73 % or even 90 %. Bakis et al. (2014) find that when the transitional dynamics are ignored, the optimal tax policy for the long-run steady state is moderately regressive; when the transition path is considered, however, the optimal tax reform is much more progressive. Along similar lines, some other authors suggest that the revenue-maximizing marginal tax rate for the top earners should be at least 36.9 % (see Guner et al. 2014b), or even around 53 % (see Badel and Huggett 2014). It is worth pointing out that a feature common to all these previous references is that economic growth is either absent or made exogenous.

Switching to works of endogenous growth, and focusing on the US economy, Caucutt et al. (2003) build up an OLG model where growth is driven by human capital investment. They find that reductions in the progressivity of labor income tax can in themselves have positive growth effects and decrease inequality: the annual per capita growth rate along the balanced-growth path would rise by up to 0.52 % by eliminating progressivity. Benabou (2002) sets up an infinitely-lived agents model of human capital accumulation, concluding that long run growth would be maximized if the average marginal tax rate for labor income were 34.8 %, giving rise to a 0.5 % increment in the long run growth rate (see Tang and King 2005 for a correction). Li and Sarte (2004) modify Rebelo (1991) and Barro (1990) original endogenous growth models to account for progressive taxes, finding that the progressivity decrease implied by the 1986 Tax Reform Act helped raise U.S. per capita GDP growth between 0.12 and 0.34 %.

Echevarria (2012) builds a two-period OLG economy with aggregate uncertainty and in which young individuals’ savings in the form of physical capital drive growth through an AK technology, individuals’ age being the only (trivial) source of heterogeneity as individuals pertaining to the same generation are all alike. Assuming inelastic labor supply and recursive preferences, necessary and sufficient conditions are obtained for the introduction of a progressive income tax in a flat income tax economy to induce a reduction in the growth rate, the intertemporal rate of substitution in consumption proving to be a key parameter. The model is numerically illustrated for the U.S. economy, showing that the long-run GDP growth rate would be maximized under a regressive income tax.

In this paper I set up an economy model that partly draws on Echevarria (2012). As in that paper, I assume a two-period, OLG economy populated by non-altruistic individuals in which government expenditure, representing a constant fraction of GDP, is financed via a progressive personal income tax, physical capital accumulation in an AK fashion being the engine of growth. Therefore, this set up differs from that in Caucutt et al. (2003) where human capital accumulation gives the growth generating mechanism. Once physical capital accumulation is adopted, the AK specification proves useful as it simplifies both the algebra and the numerical computation because transitional dynamics are precluded.

Additionally, here I introduce some added features. First, young individuals’ labor supply is elastic. This is a must ingredient when, as in this paper, labor and capital income end up being taxed differently. This is so not because labor and capital income are taxed separately as in Conesa et al. (2009), but because in the simple economy that I build up young (active) individuals live only on labor income, old (retired) individuals live only on capital income, and, in general, young and old individuals’ incomes are different.Footnote 1 This assumption alone implies one key difference with respect to Echevarria (2012). The (negative) growth effect of income tax progressivity is reinforced, as a higher tax progressivity is shown to lead to substantial reductions in individuals’ labor supply which further reduce young individuals’ savings.

Second, I assume that individuals differ in their skills or innate labor productivities, thereby allowing for an intra-generational heterogeneity absent in that paper and which, in turn, allows us to approximate the observed income distribution. Along the same lines, market or before-tax income distribution now becomes endogenous, as the tax policy now does influence labor and savings decisions of different type individuals in a different way.

Third, perfect foresight is assumed. When risk aversion is properly treated, so that attitudes toward risk and intertemporal consumption substitutability are independently modelled, previous works in the literature have proven that the effect of income taxation on the equilibrium growth rate mainly depends on the elasticity of intertemporal substitution in consumption, while the role played by aggregate uncertainty or individuals’ risk aversion is a minor one. In other words, deterministic models offer a reasonable first approximation to the effects of taxes on growth. See, e.g. Echevarria (2012, 2013), Mauro (1995) or Smith (1996).

Fourth, the progressive tax schedule introduced by Li and Sarte (2004) is here replaced by that in Heathcote et al. (2014). Despite being similar in nature, the latter formulation allows one to set both a lower and an upper bound on the progressivity index which have natural interpretations: proportional taxation and maximum redistribution in that net-of-tax incomes are equalized across taxpayers, respectively.

And, fifth, when discussing welfare implications of tax progressivity, a special reference is made to the destination that the government selects for tax proceeds. Contrary to the usual assumption that tax revenues are just wasted (so that no public investment is made nor a consumption good entering the individuals’ utility function is provided), in this paper I consider the case in which tax revenues are used to publicly provide a consumption good.Footnote 2 As I will show, even when this implies no change for the equilibrium private allocation nor the equilibrium growth rate, the social welfare maximizing progressivity of the income tax code does change.

The model here introduced is, therefore, characterized by basic features that, although borrowed from previous works, when jointly considered make it depart from those. It is an overlapping generations economy, but as opposed to Caucutt et al. (2003), households are non-altruistic and have perfect foresight, growth is generated by physical capital accumulation, the progressivity of the income tax applies not only to labor income, but also to capital income, and progressive tax rates are endogenously obtained rather than ex ante imposed. And unlike Echevarria (2012), perfect foresight, intra-generational heterogeneity and elastic labor supply are assumed.

The results that I obtain are of a quantitative nature. The model economy is first calibrated to mimic some stylized facts of the U.S. economy. Then, the following revenue-neutral tax experiment is run: assuming that aggregate tax revenues are a constant fraction of the economy’s aggregate output, I analyze both the individuals’ and the aggregate economy’s responses to (unannounced) changes in the progressivity of the income tax code. The numerical results obtained are an illustration of the effects of income tax progressivity upon economic growth, income inequality, and social welfare.

The main predictions that the numerical analysis suggests fall under three main categories: individual, aggregate, and social welfare related. A summary follows.

Concerning individual effects, increases in income tax progressivity induce drops in labor supply of all types of individuals, and reductions in savings of all individuals except for the least skilled, as a result of higher average tax rates borne by all individuals except for the least skilled and lower average tax rates borne by the least skilled who become subsidized.

Regarding aggregate effects, increments in income tax progressivity lead to a fall in aggregate savings, which (given the growth mechanism in this model economy) implies a lower equilibrium growth rate. Thus, eliminating the progressivity and moving to a flat tax regime would raise the equilibrium annual growth rate of per capita GDP from 1.92 to 1.95 %. Additionally, higher income tax progressivity reduces the inequality in the distribution of both pre-tax and after-tax incomes. Thus, introducing a proportional income tax code in the benchmark economy would increase the Gini indices of market and net-of-tax incomes by 1 and 3.8 %, respectively.

Finally, concerning social welfare effects, optimality imposes a political choice as income tax progressivity leads to three effects: two positive, one negative. First, a more equal distribution of net income and consumption levels across different types of individuals. Second, higher levels of leisure. And, third, a lower growth rate of consumption. The size of this negative effect crucially depends on the social discount factor: as the social planner values more future generations’ well-being, the optimal level of the income tax progressivity will fall. Thus, if the social discount factor is almost zero, then the optimal progressivity level will almost imply complete income redistribution with a welfare gain equivalent to a 62.45 % increment in lifetime consumption and a negligible (close to zero) annual growth rate of per capita GDP. However, if the social discount factor is high enough (around 1.2), the progressivity level will sharply fall (although still above the benchmark economy value). In this case, the resulting small fall in the annual growth rate of per capita GDP (from 1.92 to 1.87 %) would imply a welfare loss equivalent to a 15.01 % fall in lifetime consumption.

In addition, if the government tax proceeds are devoted to the public provision of some private good rather than discarded, the optimal progressivity level will also depend on how households value this public provision relative to private consumption. Thus, a higher valuation implies a lower optimal progressivity level, as progressivity also reduces the growth rate of the government provided consumption. At any rate, and for the parameter values considered, optimality requires some degree of progressivity. And, not only do the optimal progressivity levels themselves depend negatively such social and individual preference parameters, but so also do the respective quantitative gains.

The rest of the paper is organized as follows. Section 2 describes the economy. Section 3 solves the equilibrium growth rate. Section 4 discusses welfare analysis. Section 5 provides numerical results. Section 6 concludes. An “Appendix” discusses the results under the assumption that the government publicly provides a private consumption good financed with the tax proceeds.

2 The economy

There are two sectors in the economy: a private one (households and firms) which makes its decisions in a perfectly competitive market framework; and a government which levies a progressive income tax to finance some exogenous level of expenditure which, in this benchmark economy, is neither productive nor enters households’ preferences. Later on, as a sensitivity analysis exercise, I will discuss how this assumption might affect the results of the paper, in particular those related to individuals’ welfare.

As for households, this is an OLG economy, populated by a continuum of young individuals and a continuum of old individuals which coexist at any time, in which population is assumed to grow at an exogenous, constant rate, \(m\ge 0\). Therefore, individuals differ in age and in their innate labor ability. This way, income redistribution through progressive taxation takes place across individuals of different generations and of different labor productivities living at the same time.

The productive sector is represented by a continuum of competitive firms of measure one. All firms use the same production technology of constant returns to scale in capital and labor and are exposed to a positive externality given by the aggregate stock of capital per unit of labor. The appropriate choice of parameters will make firms exhibit an aggregate AK technology in equilibrium, thereby allowing for the existence of sustained growth.

2.1 Households

Suppose an \(i\)-th type individual born at time \(t\) who lives for 2 periods and obtains utility from young and old period consumption (\(c_{1,t}^i \) and \(c_{2,t+1}^i \), respectively) and disutility from young period labor, \(n_t^i \in (0,1)\), where total time endowment per period is normalized to unity. In their second period, individuals are assumed to be retired.Footnote 3 More precisely, I assume that \(i\)-th individual’s preferences are represented by the following utility function.

$$\begin{aligned} U(c_{1,t}^i ,c_{2,t+1}^i ,n_t^i )=\frac{\left( {c_{1,t}^i }\right) ^{1-\sigma _1 }\left( {1-n_t^i }\right) ^{\sigma _2 }}{1-\sigma _1 }+\beta \frac{\left( {c_{2,t+1}^i }\right) ^{1-\sigma _1 }}{1-\sigma _1 }, \end{aligned}$$
(1)

for \(i=1,2,{\ldots },I, \, I\) denoting the number of individual types, where \(\beta \in (0,1)\) denotes the subjective discount factor, and parameters \(\sigma _1 \) and \(\sigma _2 \) are such that if \(\sigma _1 \in (0,1)\), then \(\sigma _2 \in (0,1)\); and if \(\sigma _1 >1\), then \(\sigma _2 <0\). Some remarks follow Eq. (1). First, the intertemporal elasticity of substitution for consumption is given by IES = \(1{/}\sigma _1 \). And, second, the preferences over life-time consumption and leisure there represented are compatible with balanced growth along the steady state. Note that labor supply along the balanced growth path must be constant, while per capita first-period consumption and the wage rate grow at the same rate: i.e. the marginal rate of substitution between \(c_t^i\) and \(1-n_t^i\) must fall at the same rate as the inverse of the (net-of-tax) wage rate.

Progressive income taxation implies that individuals with different incomes face different tax rates. In this simple economy, there are two sources of individual heterogeneity: age and labor skills. Thus, the average tax rate that a young (resp. old) \(i\)-th type individual is charged is denoted by \(\tau _1^i \) (resp. \(\tau _2^i )\), its precise nature being discussed below. Tax rates are assumed time-invariant for simplicity, so that they are not affected by a time subscript.Footnote 4

Denoting the \(i\)-th type individual’s skill parameter by \(\theta ^i>0\), the first-period savings at \(t\) by \(s_t^i \), the wage rate per unit of labor at \(t\) by \(w_t \), and the net-of-depreciation interest rate paid at \(t+1\) by \(r_{t+1} \), one obtains the first- and second-period individual budget constraints as

$$\begin{aligned} c_{1,t}^i +s_t^i =(1-\tau _1^i )\theta ^in_t^i w_t , \end{aligned}$$
(2)

and

$$\begin{aligned} c_{2,t+1}^i =\left[ {1+(1-\tau _2^i )r_{t+1} } \right] s_t^i , \end{aligned}$$
(3)

respectively for \(i=1,2,{\ldots },I\).

Thus, substituting \(c_{1,t}^i \) and \(c_{2,t+1}^i \) from Eqs. (2) and (3), respectively, into the objective function in Eq. (1), and maximizing the resulting equation with respect to \(s_t^i \) and \(n_t^i \) yields the following set of two first-order necessary (and, along with Eqs. (2)–(3), sufficient) optimality conditions

$$\begin{aligned} \left( {c_{1,t}^i }\right) ^{-\sigma _1 }\left( 1-n_t^i \right) ^{\sigma _2 }=\beta \left( {c_{2,t+1}^i }\right) ^{-\sigma _1 }\left[ {1+\left( {1-\tau _2^i -s_t^i \times \frac{\partial \tau _2^i }{\partial s_t^i }}\right) r_{t+1} } \right] , \end{aligned}$$
(4)

and

$$\begin{aligned} \left( 1-n_t^i \right) \theta ^iw_t \left[ {1-\tau _1^i -n_t^i \times \frac{\partial \tau _1^i }{\partial n_t^i }} \right] =\frac{\sigma _2 }{1-\sigma _1 }c_{1,t}^i , \end{aligned}$$
(5)

for \(i=1,2,{\ldots },I\). Equation (4) represents the standard Euler equation for optimal consumption plans for two consecutive periods. Notice the term \(\partial \tau _2^i /\partial s_t^i \) on the right-hand side of Eq. (4): if the income tax schedule is progressive, the tax rate that an \(i\)-th type old individual faces becomes endogenous (increasing in his/her savings, \(s_t^i )\). Equation (5) represents the standard optimal consumption-leisure choice: the marginal utility of consumption times the foregone consumption units as a result of one additional unit of leisure, the latter being \(\theta ^iw_t \left( {1-\tau _1^i-n_t^i \times \frac{\partial \tau _1^i }{\partial n_t^i }}\right) \), must equal the marginal utility of leisure. Similarly to Eq. (4), the term \(\partial \tau _1^i /\partial n_t^i \) must show up in Eq. (5), denoting the endogenous nature of the tax rate charged to an \(i\)-th type young individual.

2.2 Firms

Let us suppose that a profit maximizing firm \(f\) acts competitively in the output and production factor (capital and labor) markets without adjustment costs in production inputs. Formally, the problem this firm faces at time \(t\) is written as

$$\begin{aligned} \mathop {\max }\limits _{\left\{ Y_t^f ,N_t^f ,K_t^f \right\} } Y_t^f -w_t N_t^f -(r_t +\delta )K_t^f \end{aligned}$$
(6)
$$\begin{aligned} s.t. \qquad Y_t^f =A\left( {K_t^f }\right) ^\alpha \left( {N_t^f}\right) ^{1-\alpha }k_t^{1-\alpha } ,\quad A>0 \end{aligned}$$

where \(Y_t^f \) denotes output, \(N_t^f \) denotes labor (in efficiency units), \(K_t^f \) denotes physical capital, \(\alpha \in (0,1)\) denotes the capital income share, and \(\delta \) stands for the physical capital depreciation rate. Some remarks concerning production technology follow. First, I assume that all firms (uniformly distributed on the interval \([0,1])\) exhibit the same production technology. Second, I also assume that there is a positive externality in the production process so that f-th firm’s output depends not only on the inputs hired by that firm, but also on the average number of units of capital per unit of labor (in efficiency units), for the whole economy, \(k_t \equiv K_t /N_t \), where \(K_{t} \equiv \int _{{0,1]}} K_{t}^{f} df \), and \(N_{t} \equiv \int _{{0,1]}} N_{t}^{f} df\). The intended consequence is that this economy will display an \(AK\) technology in equilibrium, where \(Y_t=AK_t \), thereby allowing for sustained economic growth which will be constant, i.e. with no transitional dynamics.Footnote 5

The solution to the problem in Eq. (6) is given by the factor price equations

$$\begin{aligned} r_t =\alpha A-\delta ,\quad \text{ and }\quad w_t =(1-\alpha )Ak_t . \end{aligned}$$
(7)

Thus, the user cost of capital will be constant, \(\alpha A\), but the wage rate, \(w_t =(1-\alpha )Ak_t \), will grow at the same rate as the aggregate stock of capital per unit of labor.

2.3 Government

A government sector is introduced in the following way: it taxes capital and labor incomes with the same tax code, capital depreciation is completely deductible, and tax revenues are used to finance an exogenous stream of public expenditure, \(G_t \), which (for the sake of analytical convenience) is expressed as a constant proportion, \(\gamma >0\), of aggregate output (i.e. \(G_t =\gamma Y_t)\).

Concerning the income tax code, I will suppose this to be of the particular class introduced by Benabou (2002) and Heathcote et al. (2014), where the tax rate that the \(i\)-th individual is charged depends on how his/her income, \(y_{a,t}^i \), is related to average (i.e. per capita) income, \(y_t \).Footnote 6 Thus, the average tax rate paid by individual \(i\) with age \(a\) and income \(y_{a,t}^i \) is given by

$$\begin{aligned} \tau _a^i =1-\xi \left( {\frac{y_{a,t}^i }{y_t }}\right) ^{-\phi }, \end{aligned}$$
(8)

for \(i=1,2,{\ldots }I\), for some \(\xi >0\) (to be endogenously obtained when solving the macroeconomic equilibrium), and \(\phi \ge 0\), where \(a =\) {1, 2} for young and old individuals respectively. Thus, Eq. (8) implies that the marginal tax rate faced by this individual, \(\partial \left( {\tau _a^i \times y_{a,t}^i }\right) {/}\partial y_{a,t}^i \), equals \(\hat{\tau }_a^i =1-(1-\phi )(1-\tau _a^i )\). This way, the ratio \((\hat{\tau }_a^i -\tau _a^i ){/}(1-\tau _a^i )=\phi \) provides us with a natural indicator of the progressivity of the tax schedule, independent of the income level at which it is evaluated. Note that if \(\phi =0\), income taxation is proportional and \(\hat{\tau }_a^i =\tau _a^i \); \(\phi >0\) if and only \(d\tau _a^i {/}dy_{a,t}^i >0\), i.e. the tax schedule is progressive; and if \(\varphi =1\), after-tax income for \(i\)-th type individual equals \(\left( 1-\tau _a^i \right) y_{a,t}^i = \xi y_t \): i.e. income is completely redistributed. Consequently, this tax scheme naturally suggests two bounds for the progressivity measure, 0 and 1. The specification of the tax rates in Eq. (8) allows us to rewrite the first-order necessary conditions in Eqs. (4) and (5) as

$$\begin{aligned} \left( {c_{1,t}^i }\right) ^{-\sigma _1 }\left( 1-n_t^i \right) ^{\sigma _2 }=\beta \left( {c_{2,t+1}^i }\right) ^{-\sigma _1 }\left[ {1+(1-\phi )(1-\tau _2^i )r_{t+1} } \right] , \end{aligned}$$
(9)
$$\begin{aligned} \left( 1-n_t^i \right) \theta ^iw_t (1-\phi )(1-\tau _1^i )=\frac{\sigma _2}{1-\sigma _1 }c_{1,t}^i , \end{aligned}$$
(10)

for \(i=1,2,{\ldots }I\). Simple inspection of Eqs. (9) and (10) shows that along the balanced growth path, at which the interest rate is constant [see Eq. (7)], \(c_{1,t}^i \), \(c_{2,t}^i \) and \(w_t \) must grow at the same rate, thereby guaranteeing that \(n_t^i =n^i\) for all \(t\).

3 Competitive equilibrium

Once both the private sector and the government have been introduced, I next solve for the competitive equilibrium.

Labor market equilibrium Equilibrium in the labor market is trivially obtained. Using \(J_t\) to denote the number of young individuals at \(t\), equilibrium in the labor market (expressed in efficiency units) is given by

$$\begin{aligned} N_t =J_t \times \tilde{n}_t , \end{aligned}$$
(11)

where the left-hand side denotes aggregate labor demand and the right-hand side represents aggregate labor supply, \(\tilde{n}_t \) standing for average effective labor supply per young individual, i.e. \(\tilde{n}_{t} \equiv \sum \nolimits _{{i = 1}}^{I} \theta ^{i} p^{i} n_{t}^{i}\), and \(p^i\) denoting the (time invariant) proportion of \(i\)-th type individuals (i.e. \(p^i\ge 0\) and \(\sum \nolimits _{{i = 1}}^{I} p^{i} = 1\)).

Goods market equilibrium As is standard in 2-period OLG models with no financial assets at birth, equilibrium in the goods market requires that the aggregate of young individuals’ savings be equal to the next period’s aggregate stock of capital. Therefore, it must be the case that \(\tilde{s}_t =(1+m)\hat{k}_{t+1} \), where \(\tilde{s}_{t} \equiv \sum \nolimits _{{i = 1}}^{I} p^{i} s_{t}^{i}\) represents average savings per young individual, and \(\hat{k}_t \equiv K_t/J_t \), i.e. the aggregate stock of capital per young individual as of time \(t\).

Tax rates in equilibrium We are now ready to solve for the tax rates that young and old individuals face. Firstly, from Eqs. (2) and (7), the \(i\)-th type young individual’s gross income at time \(t\) equals

$$\begin{aligned} y_{1,t}^i =(1-\alpha )\theta ^in_t^i Ak_t , \end{aligned}$$
(12)

for \(i=1,2,{\ldots },I\). Secondly, from Eqs. (3) and (7), one obtains that the \(i\)-th type old individual’s (taxable) income at time \(t\) equals

$$\begin{aligned} y_{2,t}^i =s_{t-1}^i (\alpha A-\delta ), \end{aligned}$$
(13)

for \(i=1,2,{\ldots }I\). Third, the aggregate income across young individuals as of time \(t\) is given by \(\sum \nolimits _{i}^{I} p^{i} J_{t} y_{{1,t}}^{i} = J_{t} (1 - \alpha )A\hat{k}_{t}\), where I have used Eqs. (11) and (12). Denoting the number of old individuals at time \(t\) as \(V_t \), the aggregate income across old individuals at time \(t\) is given by \(\sum \nolimits _{i}^{I} p^{i} V_{t} y_{{2,t}}^{i} = V_{t} (\alpha A - \delta )(1 + m)\hat{k}_{t}\), where I have used the equilibrium condition in the goods market and Eq. (13). Therefore, noting that \(V_t =J_t (1+m)^{-1}\), aggregate (taxable) income at time \(t\) is given by \((A-\delta )J_t\hat{k}_t \).Footnote 7 Or, equivalently, per capita income at time \(t\), \(y_t \), equals

$$\begin{aligned} y_t =\frac{(A-\delta )\hat{k}_t }{1+(1+m)^{-1}}. \end{aligned}$$
(14)

Fourth, from Eqs. (12) to (14) it is straightforward to obtain the relative before-tax incomes of the \(i\)-th type young and old individuals, \(y_{1,t}^i {/}y_t \) and \(y_{2,t}^i {/}y_t \) respectively, as

$$\begin{aligned} \frac{y_{1,t}^i }{y_t }=\frac{(1-\alpha )A\left[ {1+(1+m)^{-1}} \right] }{A-\delta }\eta ^i,\quad \frac{y_{2,t}^i }{y_t}=\frac{(\alpha A-\delta )(2+m)}{A-\delta }\mu ^i, \end{aligned}$$
(15)

for \(i=1,2,{\ldots }I\), where \(\eta ^i\equiv \theta ^in_t^i /\tilde{n}_t \) and \(\mu ^i\equiv s_{t-1}^i /\tilde{s}_{t-1} \) denote the shares respectively of (effective) labor supply and savings of the \(i\)-th type individual. Therefore, relative before-tax incomes do depend on government policy [characterized by parameters \(\gamma \), \(\xi \) and \(\phi \)] through the effect of tax rates on relative labor supplies (the \(\eta ^i\)’s) and savings (the \(\mu ^i\)’s). Substituting \(y_{1,t}^i{/}y_t \) and \(y_{2,t}^i{/}y_t \) from Eq. (15) into Eq. (8) finally yields the equilibrium average tax rates, \(\tau _1^i \) and \(\tau _2^i \), respectively as

$$\begin{aligned} \tau _1^i =1-\xi \left\{ {\frac{(1-\alpha )A\left[ {1+(1+m)^{-1}} \right] }{A-\delta }\eta ^i} \right\} ^{-\phi },\quad \hbox { and} \end{aligned}$$
(16)
$$\begin{aligned} \tau _2^i =1-\xi \left\{ {\frac{(\alpha A-\delta )(2+m)}{A-\delta }\mu ^i} \right\} ^{-\phi }, \end{aligned}$$
(17)

for \(i=1,2,{\ldots }I\). Note that relative before-tax incomes (and, therefore, the tax rates) are time-invariant (because, as shown below, the \(\eta ^i\)’s and the \(\mu ^i\)’s are also time invariant).

Fifth, I impose the condition of government budget balance on a period basis, \(\gamma Y_{t} = \sum \nolimits _{i}^{I} J_{t} \times p^{i} \times \tau _{{1,t}}^{i} \times y_{{1,t}}^{i} + \sum \nolimits _{i}^{I} V_{t} \times p^{i} \times \tau _{{2,t}}^{i} \times y_{{2,t}}^{i}\), where the left-hand side denotes government spending. The right-hand side represents income tax proceeds: the first term denotes taxes paid by the young, while the second term represents taxes paid by the old.

It can be shown that the average (income weighted) of the difference between 1 and the individuals’ marginal tax rates is given by \((1 - \phi ) \times \big [\sum \nolimits _{i}^{I} p^{i} \times (1 - \tau _{{1,t}}^{i} ) \times y_{{1,t}}^{i} + \,(1 + m)^{{ - 1}} \sum \nolimits _{i}^{I} p^{i} \times (1 - \tau _{{2,t}}^{i} ) \times y_{{2,t}}^{i} \big ]{/}\big [\sum \nolimits _{i}^{I} p^{i} \times y_{{1,t}}^{i} + (1 + m)^{{ - 1}} \sum \nolimits _{i}^{I} p^{i} \times y_{{2,t}}^{i} \big ]\). But the term multiplying \(1-\phi \) is nothing but the average of the difference between 1 and the individuals’ average tax rates. Therefore, \(\phi \) represents not only the progressivity measure at the individual level, but also economy-wide.

Growth In this paper I solve for the equilibrium growth rate of the stock of capital per young individual between date \(t\) and \(t+1\), i.e. \(g=\left( \hat{k}_{t+1} -\hat{k}_t \right) {/}\hat{k}_t \). Given a constant population growth rate, the growth rate of the stock of capital per capita is also \(g\), and so is the growth rate of the stock of capital per unit of labor. Thus, the condition for equilibrium in the goods market can be rewritten as

$$\begin{aligned} \tilde{s}_t =(1+m)(1+g)\hat{k}_t . \end{aligned}$$
(18)

Some remarks follow. First, the equilibrium growth rate, \(g\), is constant always, a standard feature of AK growth models. And, second, all per capita variables grow at the same rate. Thus, from Eq. (14) the (gross) rate of growth of \(y_t \) is given by \(y_{t+1}{/}y_t =1+g\). From Eqs. (2), (7) and (8), one has that \(c_{1,t}^i{/}\hat{k}_t = (1-\tau _1^i )\eta ^i(1-\alpha )A-\mu ^i(1+m)(1+g)\) is constant, for \(i=1,2,{\ldots }I\). Further, from Eq. (2) and (7), one obtains that \(s_t^i{/}\hat{k}_t \) is also constant and so must be \(\tilde{s}_t{/}\hat{k}_t \) [see Eq. (18)]. Finally, from Eqs. (3), (7) and (18) it follows that \(c_{2,t}{/}\hat{k}_t = \mu ^i(1+m)(1-\tau _2^i )(\alpha A-\delta )\) is constant too. Therefore, if policy parameters \(\gamma , \, \xi \) and \(\phi \) are constant, then \(y_t , \, c_{1,t} , \, s_t^i , \, c_{2,t} , \, \hat{k}_t \) and \(k_t\) would grow at the same rate, \(g\).

We are now in a position to define rigorously the competitive equilibrium.

A competitive equilibrium for this economy is a set of sequences of allocations \(\left\{ c_{1,t}^i ,n_t^i ,c_{2,t}^i ,s_t^i ,k_{t+1} ,y_t \right\} _{t=0,\;i=1}^{t=\infty ,\;i=I} \), and factor prices for labor and capital \(\left\{ w_t ,r\right\} _{t=0}^{t=\infty } \), a constant growth rate for per capita variables, \(g\), and constant income tax rates, \(\left\{ \tau _1^i ,\tau _2^i \right\} _{i=1}^{i=I} \), such that for a government expenditure to \(GDP\) ratio, \(\gamma \), tax code parameters \(\xi \) and \(\phi \), and some initial \(\hat{k}_0 >0\), at any time \(t\), households maximize utility [Eqs. (2), (3), (9) and (10) hold]; firms maximize profits [Eq. (7) holds]; the government budget equations hold [Eqs. (16) and (17) plus budget balance hold]; markets for labor and physical capital clear [Eqs. (11) and (18) hold], per capita income is given by Eq. (14); and the growth rate \(g\) is given by \(g=\left( \hat{k}_{t+1} -\hat{k}_t\right) {/}\hat{k}_t \), where \(\hat{k}_t \equiv k_t \times \tilde{n}\).

Given that the economy grows over time, and in order to solve the model numerically, per capita variables must be redefined relative to some variable that grows at the same rate so that the ratios along the balanced-growth path remain constant. A natural candidate for that purpose is the stock of capital per young individual, \(\hat{k}_t \). Therefore, the competitive equilibrium is defined in terms of transformed variables, so that some of the equations in Definition 1 must be rewritten accordingly. Thus, first- and second-period budget constraints in Eqs. (2) and (3) become

$$\begin{aligned} \hat{c}_1^i +\hat{s}^i=\theta ^in^i(1-\tau _1^i )\hat{w}, \end{aligned}$$
(19)
$$\begin{aligned} \hat{c}_2^i =\hat{s}^i(1+g)^{-1}[1+(1-\tau _2^i )r], \end{aligned}$$
(20)

for \(i=1,2,{\ldots }I\), respectively, where, by definition, \(\hat{c}_1^i \equiv c_{1,t}^i{/}\hat{k}_t \), \(\hat{c}_2^i \equiv c_{2,t}^i{/}\hat{k}_t \), \(\hat{s}^i\equiv s_t^i{/}\hat{k}_t \), and \(\hat{w}\equiv (1-\alpha )A{/}\tilde{n}\). Similarly, the Euler equation in (9) and the optimal consumption-leisure choice equation in (10) become

$$\begin{aligned} \left( {\hat{c}_1^i }\right) ^{-\sigma _1 }(1-n^i)^{\sigma _2 }=\beta \left[ \hat{c}_2^i (1+g)\right] ^{-\sigma _1 }\left[ {1+(1-\varphi )(1-\tau _2^i )r} \right] \end{aligned}$$
(21)
$$\begin{aligned} (1-n^i)\theta ^i\hat{w}(1-\phi )(1-\tau _1^i )=\frac{\sigma _2}{1-\sigma _1 }\hat{c}_1^i \end{aligned}$$
(22)

for \(i=1,2,{\ldots }I\). The goods market equilibrium condition in Eq. (18) becomes

$$\begin{aligned} \hat{s}=(1+g)(1+m), \end{aligned}$$
(23)

where \(\hat{s} \equiv \sum \nolimits _{{i = 1}}^{I} p^{i} \hat{s}^{i} = \tilde{s}_{t} /\hat{k}_{t}\), and per capita income in Eq. (14) is given by \(\hat{y}\equiv y_t{/}\hat{k}_t\quad =(A-\delta ){/}[1+(1+m)^{-1}]\). The rest of the equations in Definition 1 need not be rewritten.

The endogenous variables in Definition 1 are highly non-linear functions of parameters in household preferences, firms’ technological constraint and government tax policy. As a consequence, the very setup of the model prevents one from obtaining analytical results, so that numerical methods must be used to solve for the equilibrium and the response of the economy to government policy changes.

4 Welfare

A final issue to consider is that of welfare, i.e. what the relationship is between income tax code progressivity and social welfare. Assume that at time \(t\) the government sought the progressivity index that maximized a discounted sum of the average life-cycle utility of all current and future generations, \(\sum \nolimits _{{j = - 1}}^{\infty } D^{j} \,\sum \nolimits _{{i = 1}}^{I} p^{i} \,J_{{t + j}} \,U\left( c_{{1,t + j}}^{i} ,c_{{2,t + j + 1}}^{i} ,n_{{t + j}}^{i} \right) \), for some social discount factor \(D>0\). After normalizing the number of young individuals at time \(t\) so that \(J_t \equiv 1\), recalling that \(m\) denotes the population growth rate, grouping the contemporaneous terms together and ignoring the constant term \(\left( {c_{1,t-1}^i }\right) ^{1-\sigma _1 } \ ( {1-n_{t-1}^i })^{\sigma _2 }{/}\left( 1-\sigma _1 \right) \), the objective function can be rewritten as

$$\begin{aligned} \sum \limits _{{j = 0}}^{\infty } \hat{D}^{j} \sum \limits _{{i = 1}}^{I} p^{i} \left\{ {\frac{{\left( {c_{{1,t + j}}^{i} } \right) ^{{1 - \sigma _{1} }} \left( {1 - n_{{t + j}}^{i} } \right) ^{{\sigma _{2} }} }}{{1 - \sigma _{1} }} + \frac{\beta }{{\hat{D}}}\frac{{\left( {c_{{2,t + j}}^{i} } \right) ^{{1 - \sigma _{1} }} }}{{1 - \sigma _{1} }}} \right\} , \end{aligned}$$
(24)

for \(\hat{D}\equiv D(1+m)\) (see De la Croix and Michel 2002, chap. 2). Dividing the expression in Eq. (24) by \(\hat{k}_t^{1-\sigma _1 } =[\hat{k}_{t+j} (1+g)^{-j}]^{1-\sigma _1 }\) and rearranging terms, one obtains the (normalized) social welfare function to be maximized as

$$\begin{aligned} \sum \limits _{{j = 0}}^{\infty } \hat{D}^{j} \sum \limits _{{i = 1}}^{I} p^{i} \left\{ {\frac{{\left[ \hat{c}_{1}^{i} (1 + g)\right] ^{{1 - \sigma _{1} }} \left( {1 - n^{i} } \right) ^{{\sigma _{2} }} }}{{1 - \sigma _{1} }} + \frac{\beta }{{\hat{D}}}\frac{{\left[ \hat{c}_{2}^{i} (1 + g)\right] ^{{1 - \sigma _{1} }} }}{{1 - \sigma _{1} }}} \right\} , \end{aligned}$$
(25)

where \(\hat{c}_1^i\) and \(\hat{c}_2^i \) have been defined right after Definition 1 in Sect. 3.Footnote 8 Finally, after further rearranging Eq. (25), this can be rewritten as

$$\begin{aligned} \textit{SWF}(\phi ,D)&\equiv \sum \limits _{{i = 1}}^{I} p^{i} \left\{ {\frac{{\left[ \hat{c}_{1}^{i} (\phi )\right] ^{{1 - \sigma _{1} }} [1 - n^{i} (\phi )]^{{\sigma _{2} }} }}{{1 - \sigma _{1} }} + \frac{\beta }{{\hat{D}}}\frac{{\left[ \hat{c}_{2}^{i} (\phi )\right] ^{{1 - \sigma _{1} }} }}{{1 - \sigma _{1} }}} \right\} \nonumber \\&\times \sum \limits _{{j = 0}}^{\infty } \left\{ {\hat{D}\left[ {1 + g(\phi )} \right] ^{{1 - \sigma _{1} }} } \right\} ^{j} , \end{aligned}$$
(26)

where individual optimal decisions, \(\hat{c}_1^i \), \(n^i\) and \(\hat{c}_2^i \), as well as the growth rate, \(g\), are made explicitly dependent on \(\phi \), and function \(SWF\) is explicitly made dependent of \(\phi \) and \(D\). For the problem to be well defined, \(SWF(\phi ,D)\) must be bounded above; or, in other words, \(D\), \(m\), \(\sigma _1 \) and equilibrium \(g\) must be such that \(D<(1+g)^{\sigma _1 -1}{/}(1+m)\).Footnote 9 Thus, assuming that this condition holds, Eq. (26) leads to

$$\begin{aligned} \textit{SWF}(\phi ,D)&\equiv \frac{1}{( {\sigma _1 -1})\left\{ {\hat{D}\left[ {1+g(\phi )} \right] ^{1-\sigma _1 }-1} \right\} }\nonumber \\&\times \sum \limits _{{i = 1}}^{I} p^{i} \left\{ {\left[ \hat{c}_{1}^{i} (\phi )\right] ^{{1 - \sigma _{1} }} [1 - n^{i} (\phi )]^{{\sigma _{2} }} + \frac{\beta }{{\hat{D}}}\left[ \hat{c}_{2}^{i} (\phi )\right] ^{{1 - \sigma _{1} }} } \right\} . \end{aligned}$$
(27)

Finding an analytical expression relating \(\textit{SWF}(\phi ,D)\) to \(\phi \) would be unfeasible. As usually seen in the literature, though, one way to quantify the gain in utility as a result of some given tax policy change relative to the benchmark (or status quo) case consists of calculating the proportional change in life-time benchmark consumption that would leave agents indifferent between the two situations (in short, the consumption-equivalent variation, which can always be computed numerically).

Before obtaining such a variation, some explanation about how a change in \(\phi \) is implemented in this economy is needed. Imagine that the economy were in benchmark economy at \(t-1\), and that an unannounced change in \(\phi \) were introduced at time \(t\). Young individuals (born at time \(t)\) would make their optimal plans about consumption, saving and labor supply according to the new tax regime (characterized by \(\phi =\phi ^*)\), so that the economy would be in the new steady state at \(t\)+1. Old individuals at time \(t\) (born at \(t-1)\) however, would have already made their saving (and first-period consumption and labor supply) decisions at \(t-1\) according to the old tax regime (characterized by \(\phi =\phi ^0)\), so that they would be forced to change their optimal plans for second-period consumption. The assumption made is that old individuals at time \(t\) would pay taxes with the new progressivity index, \(\phi ^*\), but with a transitory value for parameter \(\xi \) in Eq. (8) which guaranteed that government budget would balance at time \(t\).

Thus, denoting that change by \(\lambda \), it can be shown after some algebra that \(\lambda \) is given by

$$\begin{aligned} \lambda (\phi ^{ * } ,\phi ^{0} ,D) = \left[ {\frac{{\Omega + \hat{D}\left[ {1 + g(\phi ^{ * } )} \right] ^{{1 - \sigma _{1} }} \times SWF(\phi ^{ * } ,D)}}{{SWF(\phi ^{0} ,D)}}} \right] ^{{\frac{1}{{1 - \sigma _{1} }}}} - 1, \end{aligned}$$
(28)

for \(\sigma _1 \ne 1\), where

$$\begin{aligned} \Omega \equiv \sum \limits _{{i = 1}}^{I} p^{i} \left\{ {\frac{{\left[ \hat{c}_{1}^{i} (\phi ^{ * } )\right] ^{{1 - \sigma _{1} }} [1 - n^{i} (\phi ^{ * } )]^{{\sigma _{2} }} }}{{1 - \sigma _{1} }} + \frac{\beta }{{\hat{D}}}\frac{{\left[ \tilde{c}_{2}^{i} (\phi ^{0} ,\phi ^{ * } )\right] ^{{1 - \sigma _{1} }} }}{{1 - \sigma _{1} }}} \right\} , \end{aligned}$$
(29)

\(\tilde{c}_2^i (\phi ^0,\phi ^*)\) denoting the (normalized) second-period consumption for an \(i\)-th type individual who is born in a \(\phi ^0\) tax regime and dies in a \(\phi ^*\) tax regime.

5 Results

5.1 Calibration

Here I set the parameter values for the benchmark case that I use in the simulation exercises in the next Section. In choosing the appropriate values, I will make the model economy mimic some stylized facts of the US economy. Table 1 shows the parameter values, Table 2 compares the simulated equilibrium aggregate values to the target values of the corresponding variables, and Table 3 shows the benchmark individual equilibrium.Footnote 10

Preferences Parameter \(\beta \) is set equal to 0.7397. I obtain this value by assuming a yearly subjective discount factor of 0.99 (just slightly \(<\)1), and that 1 model period represents 30 years of calendar time. The reason for this choice (above more often used values in the literature) is that a negative relationship between the subjective discount factor and the equilibrium interest rate appears.Footnote 11 I set \(\sigma _1 =2\), which implies IES = 0.5, “the median of the estimates in the literature.”Footnote 12 Parameter \(\sigma _2 \) is obtained endogenously and equals \(-\)2.2445. In this way, the average labor supply, \(\sum \nolimits _{i}^{I} p^{i} n^{i}\), equals 1/3, a standard choice (see e.g. Conesa et al. 2009; Fuentes-Albero et al. 2009).Footnote 13 The additional result is a Marshallian labor supply elasticity of 0.003 with respect to the after-tax wage rate for the average individual, i.e. that with an average skill level. Thus, differences among the labor supplies of the 5 individuals that I will introduce a few lines below turn out to be negligible: \(n^1 = 0.3330\), \(n^2 = 0.3332\), \(n^3 = 0.3333\), \(n^4 = 0.3334\) and \(n^5 = 0.3336\). Some remarks on this issue follow. First, this can be no surprise, but a direct consequence of the preferences represented by Eq. (1). Ideally, labor supply elasticity should also be targeted in the calibration exercise and replicated by this model economy, rather than obtained and ex-post compared to observed values. But, of course, compatibility of preferences with endogenousbalanced growth path imposes tight restrictions on those as is well known in the profession (see King et al. 1988). And, second, empirical microeconomic estimates of the elasticity of hours worked for the U.S. economy are substantially low, ranging between \(-\)0.1 and 0.2 for men and single women, and 0.1 and 0.3 for married women (see Kimball and Shapiro 2010; McClelland and Mok 2012).Footnote 14

Table 1 Parameter values: benchmark case
Table 2 Simulated and target aggregate values
Table 3 Equilibrium individual variables

Technology I assume \(\alpha = 0.377\), thus representing the capital income share.Footnote 15 Parameter \(A\) is obtained endogenously and equals 32.820. This allows one to obtain a (30-year) growth rate of per capita GDP of \(g = 76.92\) %, corresponding to an annual growth rate of \(g_a = 1.92\) %.Footnote 16 I endogenously obtain \(\delta \) in a straightforward manner. Consider the law of motion for the aggregate stock of capital, \(K_{a,t} \), on an annual basis, \(I_{a,t} = K_{a,t+1} -(1-\delta _a )K_{a,t} \), where \(I_{a,t} \) denotes gross investment at period \(t\) and \(\delta _a \) stands for the (annual) physical capital depreciation rate. Assuming a balanced growth rate path for \(K_{a,t} \), so that \(K_{a,t+1} = K_{a,t} (1+g_a )(1+m_a )\), where \(m_a \) denotes the annual population growth rate, and (once again) that \(1\) model period represents 30 years of calendar time, one has that \(I_{a,t}{/}Y_{a,t} = [(1+g_a )(1+m_a )-(1-\delta _a )]/(A/30)\), where \(Y_{a,t} = (A/30)K_{a,t} \) trivially denotes annual GDP at period \(t.\) Equivalently, \(I_{a,t} /Y_{a,t} = [(1+g_a )(1+m_a )-(1-\delta )^{1/30}]{/}(A/30)\). Finally, assuming an annual population growth rate of \(0.0105\) (to be justified a few lines below), setting \(\delta = 0.996\) allows one to mimic the observed value for \(I_{a,t} /Y_{a,t} \), 0.181.Footnote 17

Government I set the government consumption share of GDP, \(\gamma \), equal to 15.43 %, the observed average value for the sample period running between 1980 and 2000.Footnote 18 As for the income tax progressivity index, I set \(\phi = 0.036\), the estimated value from Internal Revenue Service micro data for 2000 reported in, (Guner et al. (2014a), Table 10, p. 15). The resulting equilibrium value for parameter \(\xi \) in Eq. (8) turns out to equal 0.8437.

Demographics According to the IMF World Economic Outlook Database, October 2010, the average annual population growth rate between 1980 and 2007 was 0.0105. Following the above assumption on model periods and calendar time, I set \(m = 0.368\). As for the skill distribution, \(\theta ^1\) is normalized to 1. The remaining skill parameters are set to pin down the income distribution. More precisely, assuming a uniform distribution of five types of individuals as in Li and Sarte (2004) (i.e. \(\left\{ p^i\right\} _{i=1}^I =0.2\), \(I = 5\)), the chosen \(\theta \)’s are able to replicate the observed shares of lifetime income (wage earnings), net of taxes and transfers, corresponding to the quintiles in 1986.Footnote 19 Given that the model period represents 30 years, it makes sense to target the distribution of the sum of discounted labor incomes over the same time interval, rather than the observed income distribution in some given single period. As a result of the unavailability of that piece of data, however, the target is proxied by the whole lifetime net-of-tax earnings and matched by the distribution of net-of-tax labor incomes across active (i.e. first-period) individuals. The resulting \(\theta \)’s are \(\theta ^2 = 1.560\), \(\theta ^3 = 1.959\), \(\theta ^4 = 2.420\), and \(\theta ^5 = 3.676\). As by-products, the model implies both a higher Gini inequality index for market incomes than for after-tax incomes (0.250 vs. 0.243), and very close Gini income inequality indices for labor and capital incomes, the former being slightly higher (0.234 vs. 0.229, respectively).Footnote 20

Three natural patterns of individual behavior arise from the inspection of Table 3. First, more skilled individuals end up enjoying higher levels of consumption (in both periods), generating higher savings, and facing higher tax rates (both average and marginal). Second, young (i.e. active) individuals are systematically charged higher tax rates than adult (i.e. retired) agents. And, third, labor supply is (almost) invariant across different type individuals.

5.2 Findings

The numerical experiment run in this paper consists of analyzing the effects of changes in income tax progressivity upon long-term growth when this is based on physical capital accumulation, income inequality, and social welfare for a wide enough range of non-negative values of \(\phi \) and \(D\). Three previous remarks are in order. First, this is a revenue-neutral tax experiment in that changes in \(\phi \) are accompanied by the necessary changes in parameter \(\xi \), so that total tax revenues represent a constant fraction, \(\gamma \), of aggregate output [see Eq. (8) and the condition for government budget balance].Footnote 21 Second, the theoretically sensible range of values for \(\phi \) is the closed interval [0,1] where, as stated in Sect. 2.3, \(\phi = 0\) represents the flat (proportional) tax case; and \(\phi = 1\) characterizes the complete redistribution case. Note, however, that the latter extreme case implies that marginal tax rates equal 1, so that the first order necessary condition in Eq. (10) admits no defined solution. And, third, as already mentioned in Sect. 2.2, the AK feature of the aggregate technology prevents the existence of transitional dynamic effects. Results follow in this order: individual, aggregate, and welfare effects.

Individual effectsFirst, concerning the tax rates, this model economy predicts that, as expected, and for a given \(\phi \), higher levels of skills are associated with higher incomes and, consequently, higher average tax rates. And, for any given individual type, active (i.e. young) individuals bear higher tax rates than retirees (i.e. old).Footnote 22 Concerning the response obtained after changes in \(\phi \), as the tax code is made more progressive, the average tax rates faced by young individuals of types 4 and 5 become higher, while type 3 young individuals’ remain fairly invariant, and type 1 and 2 young individuals’ fall. Actually, type 1 individuals become subsidized for values of \(\phi \) higher than around 0.23 (i.e. negative average tax rates). As a result, the dispersion of average tax rates faced by young individuals rises which, in turn, will help explain the effect of changes in the tax progressivity upon (net-of-tax) income inequality (see Fig. 1). As for old individuals’ average tax rates, responses to a changing \(\phi \) are not monotonic. For low enough values (below around 0.55), the pattern followed by type 5 individuals’ is increasing in \(\phi \), while types 1 through 4 individuals’ tax rates are decreasing, so that tax rates of type 1 and type 2 individuals would eventually become negative, as anticipated a few lines above. For high values of \(\phi \), however, the distorting effect would be so strong, that further increments in \(\phi \) would lead to lower average tax rates for type 5 adults (because of the large reduction in their savings) and higher tax rates (but still negative) for type 1 and type 2 adults (see Fig. 2).

Fig. 1
figure 1

Progressivity and average tax rates (young)

Fig. 2
figure 2

Progressivity and average tax rates (old)

Second, concerning the labor supply decision, the distortionary effects implied by higher tax progressivity induce a sharp reduction in the labor supply of all types of individuals, with no (substantial) differences across different type individuals (see Fig. 3 where individual labor supplies (almost) overlap each other). As pointed out in the Introduction, this calls for the need to allow for endogenous labor supply when analyzing the effects of taxation on economic growth when this is driven by physical capital accumulation and investment is financed out of young (workers’) savings, even though (as is the case) labor supply elasticity is close to zero. Heathcote et al. (2014) also find that higher income tax progresssivity discourages labor supply. Along these lines, empirical evidence has found that even with low (zero) Marshallian labor supply elasticity, large Hicks elasticities imply large negative labor supply effects of progressive taxation (see Keane 2011, p. 983).

Fig. 3
figure 3

Progressivity and labor supply

Third, as for first-period savings, those of individuals of types 3 through 5 fall as a result of increasing progressivity, in particular most skilled ones’, type 5. Type 2 individuals’ remain fairly constant (except for high levels of \(\phi \), around 0.45, getting lower thereafter); and those of Type 1, i.e. the least skilled, show an increasing pattern (except for unlikely high values of \(\phi \) around 0.8). This last result is well explained because (as we have just seen above), old type 1 and type 2 individuals face diminishing tax rates as \(\phi \) increases, becoming subsidized at some point (see Fig. 4). Needless to say, the response of average first-period savings to be discussed a few lines below is critically relevant in this model economy as economic growth is physical capital driven.

Fig. 4
figure 4

Progressivity and savings

Aggregate effectsFirst, parameter \(\xi \) displays a fairly invariant pattern with respect to \(\phi \) as it plays a minor role in the tax code: given the reactions of individuals’ choices after changes in the progressivity parameter, \(\xi \) is just endogenously adjusted so as to obtain budget balance (see Table 4, column 2).

Table 4 Tax progressivity

Second, as advanced above, the model economy predicts a positive relationship between \(\phi \) and equilibrium leisure. The intuition behind the result is found as a compound effect on young individuals’ marginal tax rates and consumption levels after rewriting Eq. (22) as \([\sigma _2{/}(1-\sigma _1 )](1-n^i)=\hat{c}_1^i{/}[\theta ^i\hat{w}(1-\hat{t}_1^i )]\). Thus, the numerical simulation shows that, following an increment in \(\phi \), for most (resp. least) skilled individuals both first-period consumption, \(\hat{c}_1^i \), and the net-of-marginal-tax wage rate in efficiency units, \(\theta ^i\hat{w}(1-\hat{t}_1^i )\), turn out to fall (resp. rise) in such a way that the net effect on the right-hand side of the last equation is always positive (see Table 4, column 3). As a by-product the model predicts a positive relationship between the (normalized) wage rate, \(\hat{w}\), and \(\phi \): from Eq. (7) and the definitions of \(\hat{w}\equiv w_t{/}\hat{k}_t \), \(\hat{k}_t \equiv K_t{/}N_t \) and \(\tilde{n} \equiv \sum \nolimits _{{i = 1}}^{I} \theta ^{i} p^{i} n^{i}\) , one has that \(\hat{w}=(1-\alpha )A/\tilde{n}\).

Third, young individuals’ aggregate savings relative to aggregate output fall as a result of increasing levels of progressivity as pointed out above: except for the least skilled individuals, higher values of \(\phi \) end up inducing lower first-year savings, in particular for the most skilled workers (see Table 4, column 4).

Fourth, and as a natural consequence of the above (given the role played by physical capital accumulation in this model), the model predicts a negative monotonic relationship between progressivity and growth [see Eq. (18)]. Thus, having denoted the annual growth rate by \(g_a \), the movement from the benchmark economy (i.e. with a progressive income tax code) to a proportional income tax economy (i.e. \(\phi =0)\) would make \(g_a \) rise from 1.92 to 1.95 %, a gain well below than that suggested by Caucutt et al. (2003) (see Table 4, column 5). Available empirical evidence for OECD countries supports a negative relationship between progressivity of personal income taxes and growth (see, e.g. Arnold 2008, Table 2). One likely reason why the growth effects of lowering the tax progressivity are lower than in Caucutt et al. (2003) is that in the absence of intergenerational links, the distortionary effects of capital income taxation are minimized.Footnote 23

And, fifth, as for the income distribution, the model also predicts a positive relationship between income tax progressivity and equality of income distribution, and higher inequality of before-tax than after-tax incomes. More precisely, the Gini indices of market and net-of-tax incomes, denoted by \(G_b \) and \(G_a \) respectively, display a decreasing pattern relative to \(\phi \) (see Table 4, columns 6 and 7). This is a natural result because, as discussed above, increments in income tax progressivity lead to increasing tax rates for more skilled agents and lower tax rates for less skilled. After all, given that tax revenues represent a constant fraction of GDP, a higher progressivity must lead to a reduction in net-of-tax income inequality. In short, income inequality would be maximized under a proportional tax scheme, \(\phi =0\), and any increment in personal income tax progressivity would reduce inequality.

Welfare effects Concerning welfare analysis, simple inspection of Eq. (27) shows that higher progressivity leads to three different effects, one of which turns out to display a negative sign, the second and the third one displaying a positive one. On the one hand, increments in \(\phi \) induce lower growth rates and, therefore, reduce the well-being of current individuals and future generations’. This effect is captured by a lower first term of the right-hand side of Eq. (27). Note that the social discount factor, \(D\), is a crucial element here: as \(D\) increases, so that future generations’ well-being is more valued, this lower growth effect of progressivity becomes more relevant, thereby calling for a lower progressivity. On the other hand, higher progressivity levels give rise to lower inequality of after-tax incomes and, therefore, lower dispersion of optimal consumption levels across individuals of different types, and more importantly, as the numerical results reveal, higher levels of leisure. The latter effect is captured by a higher second term of the right-hand side of Eq. (27). The relationship between these three effects could be interpreted as follows: a social planner would always value increasing equality, but that would have the cost lowering consumption growth; increasing leisure would be a (non-negigible) side effect.Footnote 24

Table 5 summarizes the results. The table shows, for alternative values of \(D\) (ranging between 0.1 and 1.2, including \(\beta = 0.7397\)), and in addition to the aggregates of Table 4, the social welfare maximizing progressivity index, \(\phi ^W\), and the associated welfare gain relative to the benchmark case, \(\lambda ^W\).Footnote 25 The pattern shown in Table 5 is clear. First, the optimal degree of progressivity is always positive. Second, that optimal progressivity level is inversely related to how society values future generations’ well-being.Footnote 26 And, third, quantitative welfare gains at the optimum, in terms of equivalent variation in consumption, largely depend on the social discount factor: as future well-being is more valued, the welfare gains associated with optimal progressivity levels tend to fall.

Table 5 Tax progressivity

To illustrate how relatively important these three effects of tax progressivity are, one can think about the following two experiments. Assume, first, that (almost) complete redistributive taxes were optimal, so that \(\varphi ^W = 0.9\) (which is actually the case when \(D = 0.1\)), with an associated welfare gain of 62.45 % in terms of equivalent variation in consumption (see Table 5, column 2). In this case, the welfare gain exclusively due to eliminating consumption inequality equals 20.45 % as an approximation.Footnote 27

Assume, additionally, that such a low social discount factor meant that the growth effect (which influences future consumption) were negligible. In this case, most of the welfare gain, 42.0 % \(=\) 62.45 % \(-\) 20.45 %, would be attributed to the increment in leisure. Consider, instead, that \(\varphi ^W = 0.9\) were optimal regardless of the social discount factor and different values for \(D\) were tried. Note that, by construction, the aggregates (leisure, growth and net-of-tax income inequality, in particular) would be the same as those in Table 4, last row. This time differences in welfare gains (or losses) would be exclusively due to differences in the valuation of the fall of the equilibrium growth rates of future consumption (see Table 6). Thus, complete redistribution would be welfare enhancing only for sufficiently low social discount factors: \(D \le \) 0.5. For instance, the aforementioned gain in welfare of 62.45 % when \(D = 0.1\), would become a welfare loss of 75.33 % for \(D = 0.7\).

Table 6 Complete redistribution (\(\phi =0.9)\), social discount factor and welfare

Two additional cases are worth pointing out. Consider, first, that \(D = \beta \), i.e. the subjective discount factor (see Table 5, column 9). The welfare maximizing progressivity index, 0.45, and the associated welfare gain, 18.1 %, sharply fall compared to just above discussed case of \(D = 0.1\). Despite both inequality of after-tax incomes and labor supply fall compared to the benchmark economy (from 0.243 to 0.14, and from 0.33 to 0.22 respectively), the equilibrium annual growth rate falls (from 1.92 to 1.46 %). Thus, given the relatively high weight on future generations’ wellbeing, the reduction in the growth rate explains the lower values of \(\varphi ^W\) and \(\lambda ^W\). Finally, consider the case of the highest admissible weight on future generations, namely \(D = 1.2\).Footnote 28 (see Table 5, column 10). Social welfare is maximized by a very low progressivity index, 0.1, leading to a welfare loss compared to the benchmark economy: \(\lambda ^W = -15.01\) %. The intuition is straightforward: inequality of after-tax incomes and labor supply are almost the same as in the benchmark economy (0.243 vs. 0.23, and 0.33 vs. 0.32 respectively). However, the equilibrium growth rate is lower than in the benchmark economy (1.92 vs. 1.87 %) and, and this is the key, the high social discount factor heavily penalizes drops in the growth rate.

So far I have assumed that government spending represents a constant proportion of GDP and that it is neither productive nor enters households’ preferences: two assumptions often made in model economies focusing on taxation issues. These lead to two unsatisfactory and related to each other consequences: higher growth rates imply that more (current and future) resources in absolute terms are being wasted, and that welfare effects of income tax policy only depend on how the tax burden is shared by households but not on what the government does with tax revenues. Concerning the latter, several alternatives are open. The government might provide some infrastructure in the form of public capital goods thereby influencing the equilibrium growth rate à la Barro (1990); or it might subsidize human capital accumulation thereby affecting equilibrium growth à la Benabou (2002); or it might provide some public consumption good valued by households, so that government expenditure in equilibrium would grow at the same rate as the economy. Bearing in mind how sustained equilibrium is obtained in this model economy, it seems reasonable to choose the last among these three alternatives. This issue is covered in the “Appendix” at the end of the paper. There I show that, first, for a given value of the social discount factor, a higher valuation on the part of individuals for the consumption of the publicly provided good implies a lower welfare maximizing progressivity index. And, second, for a given valuation of the consumption of the publicly provided good, a lower social discount of future generations’ well-being calls for a lower welfare maximizing progressivity index. (see the “Appendix” for details).

6 Conclusions

This paper has analyzed the effects of income tax progressivity on long-run economic growth, income inequality and social welfare.

The theoretical model used for this purpose is characterized by several key features that, when they are considered together, make the paper differ from the existing literature. The model economy does not allow for analytical solution, so that all the results are numerical in nature. The quantitative implications of the model have been illustrated after calibrating its parameters to mimic some basic stylized facts of the US economy.

The findings of the paper can be classified into three main categories: individual, aggregate, and social welfare related results. Increments in income tax progressivity induce a drop in labor supply of all types of individuals (despite the zero labor supply elasticity implied by preferences). Higher progressivity also reduces savings of all individuals except the least skilled ones’. This result squares with that on tax rates: least skilled individuals’ average tax rates fall (even becoming negative, so that least skilled individuals become subsidized), while the rest of individuals’ average tax rates rise.

As far as the aggregate effects are concerned, increases in income tax progressivity lead to a fall in aggregate savings, which, given how the growth is modeled, necessarily translates into a drop in the equilibrium growth rate. For instance, eliminating the progressivity and moving to a flat tax regime would raise the equilibrium annual growth rate of per capita GDP from 1.92 to 1.95 %. Furthermore, increasing the income tax progressivity reduces the inequality in the distribution of both market and net-of-tax incomes. Introducing a proportional income tax code would increase the Gini indices of before-tax and after-tax incomes by 1 and 3.8 % respectively.

Finally, regarding social welfare effects, optimality suggests the need of a political choice between a more equal distribution of net income and consumption levels across different types of individuals and higher levels of leisure and lower growth rates. The size of the second, negative, effect critically depends on the social discount factor: if the social planner discounts less future generations’ well-being, the optimal level of progressivity will be lower. In addition, if the government tax proceeds are devoted to the public provision of some private good rather than disposed, the optimal progressivity level will also depend on how households value this public provision relative to private consumption: a higher valuation also implies a lower optimal progressivity level, as progressivity also reduces the growth rate of the government-provided consumption. At any rate, and for the parameter values considered, optimality requires some degree of progressivity.

The paper can be improved in at least three directions: (1) increasing the number of life-time periods to gain more realism, (2) allowing for a richer skill distribution which would allow us to reproduce the observed wage distribution and (3) allowing for a different treatment between labor and capital incomes in the spirit of Conesa et al. (2009), thus abandoning the “comprehensive” or “global” approach.