Skip to main content
Log in

The effects of population aging on optimal redistributive taxes in an overlapping generations model

  • Published:
International Tax and Public Finance Aims and scope Submit manuscript

Abstract

The impact of population aging on the steady-state solution to an Ordover and Phelps (J. Public Econ. 12:1–26, 1979) overlapping generations optimal nonlinear income tax problem with two types of worker and quasilinear-in-leisure preferences is investigated. A decrease in the rate of population growth, which leads to an aging population, increases the relative price of consumption per person in retirement, which tends to decrease optimal consumption for retirees of both skill types. Nevertheless, it is also shown that the optimal marginal income tax rates are independent of the rate of population growth. In addition, the steady-state interest rate unambiguously declines when the rate of population growth declines. Resulting adjustments in production plans have an ambiguous effect on the aggregate wage rate. This article identifies factors contributing to an increase in the aggregate wage when the population ages, namely normality of consumption in retirement, complementarity between capital and labor in production, and a large capital deepening effect relative to the increase in dependency owing to demographic change. Depending on the sign of this wage effect, ambiguities may arise in the direction of change in the optimal steady-state consumption and production plans. However, when the dependency effect is sufficiently strong, it is possible to sign the direction of change in all production and consumption plans. Moreover, regardless of the direction of change in optimal consumption plans, the absolute value of the changes in consumption plans are smaller for low-skilled workers than for high-skilled when utility is time-separable and preferences exhibit decreasing absolute risk aversion. Adopting, instead, a quasilinear-in-consumption specification of preferences sharpens the comparative statics of consumption allocations, but introduces ambiguity into the effect of the rate of population growth on the optimal marginal income tax rate.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

Notes

  1. See, however, McDaniel (2003) for a critical assessment of “apocalyptic demography” in the Canadian context.

  2. See Myles (1995, pp. 509–514) for a textbook treatment of this analysis.

  3. Throughout this analysis, subscripts are used to denote the type of an individual and superscripts denote the date of birth of an individual. Quantities denoted without subscripts are within-period aggregates.

  4. See Dillén and Lundholm (1996) for an exposition of a two-period model in which the taxation authority sets an optimal linear tax schedule for workers who supply labor in both time periods. Apps and Rees (2006), Berliant and Ledyard (2005), and Brett and Weymark (2008b) study nonlinear income taxes with labor supply in two periods and the potential for a ratchet effect.

  5. The total population size, N, can be incorporated into the welfare weights.

  6. Weymark (1987, p. 1171) provides a detailed discussion justifying the exact form of the reduced-form welfare weights.

  7. One technical complication remains. There is no guarantee that the solution procedure outlined here guarantees that \(\tilde{y}_{1}>0\). I assume this to be the case throughout the remainder of the analysis. With this assumption, all elements of the optimal program can be shown to be positive.

  8. Weymark (1987) does not give an explicit statement of the analogous result. However, combining his equations (37) and (A.1) for any unbunched individuals yields a generalization of (A.7) used in the proof of Proposition 1.

  9. A few back-of-the-envelope calculations reveal that it really is an empirical question as to which of these terms would dominate. The capital stock is roughly three times GDP in Canada. A 2% annual growth rate, compounded over 20 years, results in n≈0.5. This results in a capital spreading effect of roughly 4.5 times GDP. Consumption-in-retirement is meant to capture the entire post-work period. If, say, 20% of the population is retired and the retirement period is 20 years, then \(\tilde{x}\) might be four times the value of yearly consumption, which is somewhat less than four times GDP. Given the closeness of these “ballpark” figures, it is difficult to say which effects dominate for a particular economy without careful empirical investigation.

  10. Formally, both the numerator and denominator in the final term on the left-hand side of (20) increase.

  11. Linear homogeneity and strict concavity of f imply f ky >0.

  12. This sub-section is meant to illustrate how some of the results depend on the form of the utility function. A full analysis of the alternative model is beyond the scope of this article.

  13. The calculations presented here are more than the minimum required to prove the Proposition. However, they are needed later on.

  14. See Intriligator (1971, p. 158) for an analogous calculation in the context of consumer theory.

References

  • Apps, P., & Rees, R. (2006). Repeated optimal nonlinear income taxation. Unpublished manuscript, University of Sydney and University of Munich.

  • Berliant, M., & Ledyard, J. O. (2005). Optimal dynamic nonlinear income taxes with no commitment. Downloadable from the Working Paper Archive at Washington University at St. Louis, http://econpapers.repec.org/paper/wpawuwppe/0403004.htm.

  • Boadway, R., & Pestieau, P. (2007). Tagging and redistributive taxation. Annales d’Économie et de Statistique, 83–84, 123–147.

    Google Scholar 

  • Boadway, R., Cuff, K., & Marchand, M. (2000). Optimal income taxation with quasi-linear preferences revisited. Journal of Public Economic Theory, 2, 435–460.

    Article  Google Scholar 

  • Brett, C., & Weymark, J. A. (2008a). The impact of changing skill levels on optimal nonlinear income taxes. Journal of Public Economics, 92, 1765–1771.

    Article  Google Scholar 

  • Brett, C., & Weymark, J. A. (2008b). Optimal nonlinear taxation of income and savings without commitment (Discussion paper 08-W05). Vanderbilt University.

  • Brett, C., & Weymark, J. A. (2008c). Public good provision and the comparative statics of optimal nonlinear income taxation. International Economic Review, 49, 255–290.

    Article  Google Scholar 

  • Cutler, D. M., Poterba, J. M., Sheiner, L. M., & Summers, L. H. (1990). An aging society: Opportunity or challenge? Brookings Papers on Economic Activity, 1, 1–56.

    Article  Google Scholar 

  • Dillén, M., & Lundholm, M. (1996). Dynamic income taxation, redistribution, and the ratchet effect. Journal of Public Economics, 59, 69–93.

    Article  Google Scholar 

  • Hamilton, J., & Pestieau, P. (2005). Optimal income taxation and the ability distribution: Implications for migration equilibria. International Tax and Public Finance, 12, 29–45.

    Article  Google Scholar 

  • Intriligator, M. D. (1971). Mathematical optimization and economic theory. Englewood Cliffs: Prentice-Hall.

    Google Scholar 

  • McDaniel, S. A. (2003). Toward disentangling policy implications of economic and demographic changes in Canada’s aging population. Canadian Public Policy, 29, 491–509.

    Article  Google Scholar 

  • Meijdam, L., & Verbon, H. A. A. (1997). Aging and public pensions in an overlapping-generations model. Oxford Economic Papers, 49, 29–42.

    Article  Google Scholar 

  • Mirrlees, J. A. (1971). An exploration in the theory of optimum income taxation. Review of Economic Studies, 38, 175–208.

    Article  Google Scholar 

  • Myles, G. D. (1995). Public economics. Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • Ordover, J., & Phelps, E. (1979). The concept of optimal taxation in the overlapping-generations model of capital and wealth. Journal of Public Economics, 12, 1–26.

    Article  Google Scholar 

  • Pirttilä, J., & Tuomala, M. (2001). On optimal non-linear taxation and public good provision in an overlapping generations economy. Journal of Public Economics, 79, 485–501.

    Article  Google Scholar 

  • Simula, L. (2010). Optimal nonlinear income tax and nonlinear pricing: Optimality conditions and comparative static properties. Social Choice and Welfare, 35, 199–220.

    Article  Google Scholar 

  • Visco, I. (2001). The fiscal implications of ageing populations in OECD countries, Organization for Economic Cooperation and Development. Presented at the Oxford Centre on Population Ageing Pensions Symposium, June.

  • Weymark, J. A. (1986). A reduced-form optimal nonlinear income tax problem. Journal of Public Economics, 30, 199–217.

    Article  Google Scholar 

  • Weymark, J. A. (1987). Comparative static properties of optimal nonlinear income taxes. Econometrica, 55, 1165–1185.

    Article  Google Scholar 

Download references

Acknowledgements

This research was generously supported by the Canada Research Chairs Program. I am grateful to Frank Strain and John Weymark for helpful comments. Errors, omissions and views are entirely my own.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Craig Brett.

Appendix

Appendix

In order to carry out any comparative static exercise, it is first necessary to show that the problem at hand has a unique solution. Lemma A.1 establishes that this is so for the Steady-State Optimal Nonlinear Income Tax Problem.

Lemma A.1

The Steady-State Optimal Nonlinear Income Tax Problem has a unique solution.

Proof of Lemma A.1

Solving (14) for a 2 v(c 2,x 2) and substituting into (13) yields

$$\mathcal{W} = \lambda_1 a_1 v(c_1,x_1) - \lambda_1y_1 + \lambda_2\bigl[a_2 v(c_1,x_1)-y_1+y_2\bigr] -\lambda_2y_2. $$
(A.1)

Employing the normalization λ 1+λ 2=2 along with (A.1) yields

$$\mathcal{W} = \lambda_1 a_1 v(c_1,x_1)+ (1-\lambda_1)a_2 v(c_1,x_1) + a_2 v(c_1,x_1)- 2y_1. $$
(A.2)

Solving (14) for a 2 v(c 1,x 1) and substituting into the penultimate term in (A.2) yields,

$$\mathcal{W} = \bigl[\lambda_1 a_1 + (1-\lambda_1) a_2\bigr] v(c_1,x_1) + a_2 v(c_2,x_2) - y_1 -y_2. $$
(A.3)

Thus, the Steady-State Optimal Nonlinear Income Tax Problem is equivalent to maximizing the objective (A.3) subject to the constraint (11). The curvature and boundary conditions on v and f guarantee a unique solution for the vector (c 1,c 2,x 1,x 2,k,y 1+y 2). Using the binding self-selection constraint (14) and y=y 1+y 2, it is straightforward to compute unique solution values of y 1 and y 2 from the uniquely determined (c 1,c 2,x 1,x 2,k,y). □

Proof of Lemma 1

Rearranging (A.3) yields

$$\mathcal{W} = \bigl[ a_1 + (1-\lambda_1) (a_2-a_1)\bigr] v(c_1,x_1) + a_2 v(c_2,x_2) - y. $$
(A.4)

Substituting (17) and (18) into (A.4) yields (16). In so doing, one constraint appearing in the Steady-State Optimal Nonlinear Income Tax Problem has been substituted into its objective, and the variables y 1 and y 2 have been eliminated. However, the variable y is inserted and the constraint (11) remains. The lemma follows. □

Proof of Proposition 1

Part (i) follows directly from (3) and (22). Part (ii) follows from dividing (19) by (20) for individuals of each type. By (21),

$$\mu=\frac{1}{f_y}. $$
(A.5)

Part (iii) follows from substituting (A.5) and (18) into (19) for individuals of type 2 and rearranging.

Using (3) in conjunction with the definition of IMTR 1 given in (25) yields

$$\mathit{IMTR}_1= 1-\frac{1}{a_1 f_y v_c(\tilde{c}_1,\tilde{x}_1)}. $$
(A.6)

Substituting (A.5) and (19) into (A.6) yields

$$\mathit{IMTR}_1=1-\frac{1}{\frac{a_1}{\beta_1}} = \frac{a_1 -\beta_1}{a_1}. $$
(A.7)

Recalling the definition of β 1 from (17) and rearranging yields (25). □

Proposition A.2

The optimality conditions (11) and (19)(22) define a continuously differentiable solution function \(F \colon \mathbb{R}_{+} \to \mathbb{R}^{7}_{++}\) of the problem (16) with n\((\tilde{c}_{1},\tilde{x}_{1},\tilde{c}_{2},\tilde{x}_{2},\tilde{y},\tilde{k},\tilde{\mu})\). For all n∈ℝ+, the derivative DF of F at n is given by

$$DF(n)=\bigl(A^{-1}b\bigr)(n), $$
(A.8)

where

(A.9)

and

$$b(n) = \begin{bmatrix}0\\-(1+n)^{-2}\tilde{\mu}\\0\\-(1+n)^{-2}\tilde{\mu}\\0\\1\\\tilde{k}-(1+n)^{-2}\tilde{x}\end{bmatrix}, $$
(A.10)

and where all expressions on the right-hand sides of (A.9) and (A.10) are evaluated at the solution to (16).

Proof of Proposition A.2

By Lemma A.1, the first-order necessary conditions define a solution function. Differentiating the first-order conditions and the resource constraint yields

$$A[dc_1 \quad dx_1 \quad dc_2 \quad dx_2 \quad dy \quad dk \quad d\mu ]^\top = b \> dn, $$
(A.11)

where dependence on the parameter n is now expunged from the notation. The first zero in the final line of (A.9) follows from (22). In order to establish the Proposition, it suffices to show that the matrix A is invertible. To that end, introduce the partition

(A.12)

where H is the upper 6 × 6 block of A, p is a column of length 6 containing all but the last element of the seventh column of A, and the zero in (A.12) is a scalar.

The matrix H is block-diagonal. I now show that each of its blocks is invertible, so that H −1 exists.Footnote 13 Specifically,

(A.13)

where each block in the partition of H is 2 × 2 and

(A.14)

and

(A.15)

Strict concavity of v and f imply that Δ i >0, i=1,2,f. Indeed, the curvature properties imply that H is negative-definite.

It is straightforward to check thatFootnote 14

(A.16)

where

$$\theta= \frac{1}{p^\top H^{-1} p}. $$
(A.17)

Incidentally, because H is negative-definite, so is H −1; therefore, θ<0. □

Proof of Result 1

Using the bottom line of (A.16), (A.8) and (A.10) yields

(A.18)

Substituting (A.13)–(A.15) into (A.18) and performing the matrix multiplication gives (26).

Normality of x implies that the terms inside the summation sign on the right-hand side of (26) are negative. Linear homogeneity of f implies that f y is homogeneous of degree zero. Hence, by Euler’s Theorem

$$yf_{yy}+kf_{ky}=0.$$
(A.19)

But f yy <0, so f ky >0. Hence, the entire expression inside the square bracket is negative when x is normal. Because θ<0, the first term is positive. Clearly, (1+n)2 k>x is sufficient for the final term to be positive as well. □

Proof of Results 2–4

It is possible to use equations (A.13)–(A.16) to directly compute the results presented in Results 2–4. However, it is instructive to use a more heuristic solution method. The top six lines of (A.11) can be written

$$H\begin{bmatrix}dc_1 \\ dx_1 \\dc_2 \\dx_2 \\dy \\dk\end{bmatrix}= \begin{bmatrix}d\mu\\(1+n)^{-1} \,d\mu-(1+n)^{-2}\mu \,dn\\d\mu\\(1+n)^{-1} \,d\mu-(1+n)^{-2}\mu \,dn \\-f_y \,d\mu \\dn\end{bmatrix}. $$
(A.20)

Given the block-diagonal structure of H, (A.20) can be decomposed into the following three matrix equations:

(A.21)

Using (A.14) and (A.15) to compute the solutions to (A.21) gives

(A.22)

and

$$\begin{bmatrix} dy \\ dk \end{bmatrix}=\frac{1}{\varDelta _f}\begin{bmatrix}-f_{kk}f_y \,d\mu - \mu f_{ky} \,dn\\[6pt]f_{ky} f_y \,d\mu + \mu f_{yy} \,dn\end{bmatrix}. $$
(A.23)

Equations (27)–(30) follow from “dividing” the appropriate entries in (A.22) and (A.23) through by dn.

The final sentence of Result 2 is immediate. □

Proof of Corollary 1

When utility is of the form (31), (27) can be written

$$\frac{d\tilde{c}_i}{d n}=\frac{h^{\prime\prime}(\tilde{x}_i)}{\varDelta _i} \frac{ d\tilde{\mu}}{dn},\quad i=1,2. $$
(A.24)

Also, (31) and (A.14) imply

$$\varDelta _i= \beta_i g^{\prime\prime}(\tilde{c}_i) h^{\prime\prime}(\tilde{x}_i), \quad i=1,2. $$
(A.25)

Moreover, when (31) holds, the first-order condition (19) becomes

$$\beta_i g^\prime(\tilde{c}_i) =\tilde{\mu}, \quad i=1,2. $$
(A.26)

Substituting (A.25) and (A.26) into (A.24) and simplifying yields

$$\frac{d\tilde{c}_i}{d n} = \frac{d \tilde{\mu}}{dn}\frac{1}{\tilde{\mu}} \frac{g^\prime(\tilde{c}_i)}{g^{\prime\prime}(\tilde{c}_i)}, \quad i=1,2. $$
(A.27)

Thus,

$$\biggl\vert \frac{d \tilde{c}_1}{dn}\biggr\vert \ge \biggl\vert \frac{d \tilde{c}_2}{dn}\biggr\vert \quad \longleftrightarrow \quad \biggl\vert \frac{g^\prime(\tilde{c}_1)}{g^{\prime\prime}(\tilde{c}_1)} \biggr\vert \ge \biggl\vert \frac{g^\prime(\tilde{c}_2)}{g^{\prime\prime}(\tilde{c}_2)} \biggr\vert \quad \longleftrightarrow \quad -\frac{g^{\prime\prime}(\tilde{c}_1)}{g^{\prime}(\tilde{c}_1)} \le - \frac{g^{\prime\prime}(\tilde{c}_2)}{g^{\prime}(\tilde{c}_2)}. $$
(A.28)

Because β 2>β 1, it follows from (A.26) and strict concavity of g that \(\tilde{c}_{1} < \tilde{c_{2}}\). Thus, part (i) of the Corollary follows from (A.28).

The proof of part (ii) of the Corollary is analogous. □

Proof of Lemma 2

The steady-state resource constraint (11) and the self-selection constraint (14) imply

(A.29)

Solving the system (A.29) for x 1 and x 2 yields

(A.30)

Social welfare is given by

$$\mathcal{W} =\lambda_1\Biggl[g\biggl(c_1, \frac{y_1}{a_1}\biggr)+x_1\Biggr] + \lambda_2 \Biggl[g\biggl(c_2, \frac{y_2}{a_2}\biggr)+x_2\Biggr]. $$
(A.31)

Substituting (A.30) into (A.31) and simplifying yields

(A.32)

Applying the normalization λ 1+λ 2=2 to (A.32) yields the objective function in (33). □

Proof of Proposition 2

When preferences have the form (34), the objective function in (33) becomes

(A.33)

The first-order condition for the choice of c i is, therefore,

$$u^\prime(c_i) = 1+n. $$
(A.34)

The Proposition follows immediately from applying the Implicit Function Theorem to (A.34). □

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brett, C. The effects of population aging on optimal redistributive taxes in an overlapping generations model. Int Tax Public Finance 19, 777–799 (2012). https://doi.org/10.1007/s10797-011-9207-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10797-011-9207-7

Keywords

JEL Classification

Navigation