1 Introduction

Since the 1970s, the effective retirement age has declined in almost all Western countries, while at the same time life expectancy has increased substantially. These developments led to an increase in the average retirement period relative to the working period, thereby eroding the fiscal sustainability of pension schemes. To reverse this trend, in recent years more attention has been given to pension reforms that improve labour supply incentives and encourage people to work longer. Countries like the UK and Australia, for example, introduced a flexible retirement age and increased the reward to continue working. The advantage of this type of reforms is that it not only reduces the labour market distortions caused by incentives to retire early but can also increase the sustainability of pension systems.

A potential disadvantage is, however, that these flexibility reforms are typically implemented in a uniform way, i.e. applied to all participants in the same way, while individuals have heterogeneous characteristics (e.g., in terms of life expectancy or income level). Uniformly implemented reforms therefore probably have different welfare effects at the individual level and may affect certain types of individuals negatively.Footnote 1 Indeed, it is well known that pension schemes based on uniform policy rules contain large redistribution effects within and across generations, some intentional, and others unintentional (see, e.g. Börsch-Supan and Reil-Held 2001 and Bonenkamp 2009). For example, unfunded pension schemes, especially those of the Beveridgean type, often contain redistribution from high to low incomes. Apart from this, these pension schemes typically also redistribute from short-lived to long-lived agents because they are based on collective annuities which do not depend on individual life expectancy. This makes collective annuities subject to the objection that they lead to more regressive pension schemes because it is well known that average longevity tends to increase with income (see e.g. Pappas et al. 1993; Adams et al. 2003; Meara et al. 2008). Pension reforms that introduce more flexibility in pension take-up will affect these redistribution effects. It is therefore important to take into account the redistribution in existing pension schemes and the fact that individuals are heterogeneous when analysing the welfare effects of pension flexibility reforms.

This paper explores the redistribution and welfare effects of the introduction of a flexible starting date for pension benefits in the context of an unfunded pension scheme with an explicit redistribution motive. That means, we consider a change from a payout scheme in which benefits start at the fixed statutory retirement age to a scheme where benefits start at the flexible effective retirement age. This flexible pension take-up is combined with actuarial adjustments of pension benefits for early or late retirement. To analyse the economic implications of this reform, we use a two-period overlapping-generations model populated with agents who differ in ability and lifespan. It is assumed that the lifespan of an individual is positively linked to his productivity. The pay-as-you-go (PAYG) social security system is of the Beveridgean type and is characterized by lifetime annuities and proportional contributions. In this way, the pension scheme includes two types of intragenerational redistribution, from high-income earners to low-income earners and from short-lived to long-lived agents. Note that, in contrast to the former, the latter type of redistribution is regressive due to the positive link between productivity and longevity. The fact that individuals are heterogeneous implies that introducing pension flexibility will affect individuals differently.

In this paper, we take a positive perspective and observe that in many countries, PAYG pension schemes do redistribute in practice. That is, we follow the literature that takes the existence of a PAYG social security scheme as a redistribution device as given (see e.g. Galasso and Profeta 2002; Cremer and Pestieau 2003; Casamatta et al. 2005). Of course, there are many good reasons for redistributing income via pensions, for example, protection against myopia or the fact that individual differences in income could manifest themselves only later in the career. However, the normative question why pension schemes should be redistributive within cohorts remains outside the scope of this paper.

Implementing pension contracts with a variable starting date for benefits, as analysed in this paper, is important for various reasons. It helps individuals to adjust the timing of pension income according to their own preferences and circumstances. This is particularly relevant for people who have a preference to retire early but who are prevented to do that because of liquidity or borrowing constraints. Flexible pensions can also function as a hedge against all types of risks, like disability risks (Diamond and Mirrlees 1978), stock market risks (Bodie et al. 1992) or productivity risks (Pestieau and Possen 2010). This paper adds some other arguments. We will illustrate that flexible pensions can stimulate people to postpone retirement voluntary. In that case, flexible pension take-up may help to bear the increasing fiscal burden of ageing. We also show that flexible pension take-up could be used to reduce the element of regressive redistribution in social security schemes.

The main results are as follows. First, introducing a flexible pension take-up cannot be Pareto improving if the government conditions the adjustment factor of benefits on individual characteristics like lifespan. Individual actuarial adjustment eliminates the unintended redistribution from short-lived to long-lived agents. The low-skilled therefore benefit from this reform at the expense of the high-skilled. Second, introducing a flexible pension take-up can be Pareto improving if the actuarial adjustment of benefits occurs in a uniform way (i.e. based on the average lifespan). Uniform benefit adjustment leads to selection effects in the retirement decision which may reduce initial tax distortions. For the high-skilled individuals, the uniform reward rate for later retirement is too high from an actuarial point of view because they live longer, which reduces their implicit tax and stimulates them to continue working. If the contribution rate is sufficiently high, the low-skilled also gain because they receive higher pensions, enabled by the additional tax payments of the high-skilled. Third, combining uniform adjustment with actuarial non-neutrality to induce people to postpone retirement can further improve the reform, i.e. a Pareto improvement can be achieved at a lower contribution rate, or for a given contribution rate, the welfare effects are more positive for all individuals.

It is important to note that our benchmark PAYG scheme is of the Beveridgean type and characterized by inflexible pension take-up and lifetime annuities. Countries like the UK, the Netherlands and Denmark indeed follow this tradition. Other countries, like Germany, Italy and France have Bismarckian pension schemes where pension benefits are linked to former contributions. In general, Bismarckian pension systems still contain intragenerational redistribution from short- to long-lived agents but have considerably less redistribution from the rich to the poor. As a consequence, with this type of pension scheme, the labour market distortions caused by incentives to retire early, and therefore also the potential for welfare gains of introducing flexible retirement with an increased reward to continue working as studied in this paper, will be much smaller than with a Beveridgean pension system.

This paper is related to studies that analyse the interaction between pension schemes and retirement decisions (see, e.g. Hougaard Jensen et al. 2003) and to a growing literature that focuses on the role of alternative pension systems when income and lifespan are correlated (see, e.g. Borck 2007; Hachon 2008; Cremer et al. 2010). In addition, our paper is also related to Fisher and Keuschnigg (2010) and Jaag et al. (2010) who investigate the labour market impact of pension reforms towards more actuarial neutrality. Most of these aforementioned studies focus on pension reforms that strengthen the link between contributions and benefits. Our study, in contrast, deals with the implementation of a flexible pension take-up.

This paper is most closely related to Cremer and Pestieau (2003). They analyse the implementation of age-dependent tax rates in an economy with a redistributive PAYG pension scheme. This policy generates the same ‘double dividend’ as the flexibility reform of the pension scheme considered in this study: it not only generates additional revenues but also fosters redistribution from high to low incomes. An important difference with Cremer and Pestieau (2003) is that in our model, people have heterogeneous lifespans. Heterogeneous lifespans play a crucial role in our analysis as this leads endogenously to the ’right’ retirement incentives (i.e. the high-skilled will work longer and the low-skilled shorter) when a flexible retirement scheme with uniform actuarial adjustment is introduced. Cremer and Pestieau (2003) need age-dependent taxes to achieve a similar result. The flexible retirement reform we study differs in two important aspects from the introduction of age-dependent taxation. Firstly, in contrast to age-dependent taxation, it is directly targeted at the retirement distortion caused by the PAYG pension scheme. That is, it only affects the retirement decision (i.e. the extensive margin of labour supply), not the decision how many hours to work (the intensive margin).Footnote 2 Secondly, the introduction of flexible pension take-up as considered in this paper is more often observed in practice than age-dependent taxation.Footnote 3

Our main contribution in this respect is that we provide a rationale for the empirically observed introduction of pension flexibility with actuarial adjustment of benefits in a Beveridgean pension scheme. In particular, we elucidate why these pension flexibility reforms are typically implemented in a uniform way instead of making the adjustment of benefits dependent on individual characteristics; we show that the only way to induce a Pareto improvement is by adjusting the benefits in a uniform way, even though individuals are heterogeneous.

This paper is organized as follows. In Sect. 2, we introduce the benchmark model. This model contains a PAYG social security scheme with inflexible pension take-up and lifetime annuities. Section 3 analyses the redistribution and welfare effects of reforms aimed at increasing the flexibility of individual pension take-up. In Sect. 4, we elaborate on these flexibility reforms by introducing non-neutral actuarial adjustment of benefits. Section 5 concludes the paper.

2 The benchmark model

We consider a two-period overlapping-generations model of a small open economy populated with heterogeneous agents who differ in terms of ability and lifespan. Agents decide upon the amount of savings in the first period and upon the length of the working period in the second period of life. The individual ability level determines whether an agent supplies labour as a low-skilled worker or as a high-skilled worker. High-skilled workers earn a higher wage rate than low-skilled workers. The model includes a Beveridgean social security scheme which offers a lifetime annuity that starts paying out from the statutory retirement age until the end of life. Agents are allowed to continue their working life after the statutory retirement age or to advance retirement and stop working before the statutory retirement age. So the statutory retirement age is related to the date agents receiving their pension benefit, which is not necessarily equal to their effective retirement date.

2.1 Preferences

Preferences over first-period and second-period consumption are represented by the following utility function:

$$\begin{aligned} U(c,x) = u(c) + \pi u(x) \end{aligned}$$
(1)

with \(u'>0\) and \(u''<0\); \(c\) is first-period consumption; \(x\) is second-period consumption; and \(\pi \le 1\) is the length of the second period. To keep the analysis as simple as possible, we assume that the interest rate and the discount rate are zero.Footnote 4 Second-period consumption is defined net of the disutility of labour:

$$\begin{aligned} x = \frac{d}{\pi } - \frac{\gamma }{2} \left( \frac{z}{\pi }\right) ^2 \end{aligned}$$
(2)

where \(d\) is total consumption of goods when old yielding a consumption stream of \(d/\pi \), \(z\) denotes the working period, and \(\gamma \) is the preference parameter for leisure. Following Casamatta et al. (2005) and Cremer and Pestieau (2003), we assume a quadratic specification for the disutility of work. This specification makes the problem more tractable, but comes with the cost that there are no income effects in labour supply. Income effects in the retirement decision are found to be small compared to substitution effects, however, see, e.g. Krueger and Pischke (1992) or French (2005). Observe that the disutility of working is related to the fraction of the second period spent on working (i.e. \(z/\pi \)). This implies that for a given retirement age, an agent with a short lifespan experiences a higher disutility of work than an agent with a long lifespan because this short-lived agent works a relatively larger share of his remaining lifetime.

2.2 Innate ability and skill level

There are two levels of work skill, denoted by ’low’ (\(L\)) and ’high’ (\(H\)). Born low-skilled, an agent can acquire extra skills and become a high-skilled worker by investing \(1-a\) units of time in schooling in the first period. The rest of the time, \(a\), is devoted to working as a high-skilled worker.

The individual-specific parameter \(a\) reflects the ability of individuals to acquire high working skills. The higher is \(a\), the more able is the individual, and the less time a worker needs to become high-skilled for acquiring a work skill. The parameter \(a\) ranges between 0 and 1, and its cumulative distribution function is denoted by \(G(\cdot )\), i.e. \(G(a)\) is the number of individuals with an innate ability parameter below or equal to \(a\). We henceforth refer to an individual with an innate ability parameter of \(a\) as an \(a\)-individual. For the sake of simplicity, we normalize the total number of individuals born in each period to be one, i.e. \(G(1)=1\).

A high-skilled worker provides an effective labour supply of one unit per unit of working time, while a low-skilled worker provides only \(q<1\) units of effective labour for each unit of working time. This difference in effective labour supply also applies to the second period of life. Let \(w\) denote the wage rate per unit of effective labour, then the maximum amount of income agents can earn in the first period, denoted by \(W_y(a)\), is given by:

$$\begin{aligned} W_y(a) \equiv \left\{ \begin{array}{l@{\quad }l} qw &{} \hbox {for} \quad a \le a^* \\ aw &{} \hbox {for} \quad a \ge a^* \end{array} \right. \end{aligned}$$
(3)

where \(a^*\) is the cut-off ability level to become high-skilled. It is assumed that \(a^*\) is exogenous.Footnote 5 For the second period of life, the maximum labour income, \(W_o(a)\), equals:

$$\begin{aligned} W_o(a) \equiv \left\{ \begin{array}{l@{\quad }l} qw &{} \hbox {for} \quad a \le a^* \\ w &{} \hbox {for} \quad a \ge a^* \end{array} \right. \end{aligned}$$
(4)

2.3 Individual lifespan

Each individual lives completely the first period of life (with a length normalized to unity), but only a fraction \(\pi (a)\le 1\) of the second period. We assume that \(\pi '(a) \ge 0\): the higher the innate ability of an agent, the longer the length of life. As a consequence, our model contains a positive association between longevity and skill level. Since high-skilled agents earn a higher wage rate than low-skilled workers, the model is in line with the empirical evidence that income positively co-moves with life expectancy.Footnote 6

Whenever necessary to parameterize the function \(\pi (a)\), we will use the following specification:

$$\begin{aligned} \pi (a) = \bar{\pi }\left[ 1 + \lambda (a-\bar{a})\right] , \quad \lambda > 0 \end{aligned}$$
(5)

where \(\bar{a}\equiv \int _0^1 a \, \mathrm {d}G\) denotes the average ability level. This simple function has the following appealing properties. First, \(\bar{\pi }\) represents the average duration of the second phase of life. Second, there is a positive link between ability and the length of life as long as \(\lambda >0\). Indeed, \({{\mathrm{Cov}}}(\pi ,a) = \lambda \text {Var}(a)\ge 0\). Third, consistent with empirical findings (Pappas et al. 1993; Mackenbach et al. 2003; Meara et al. 2008), the relative differences in individual lifespans remain constant if the average lifespan increases. In absolute terms, this means that the socioeconomic gap in longevity gets larger if the average life span increases, i.e. \(\pi (a=1)-\pi (a=0)=\lambda \bar{\pi }\); the lifespan of more able individuals increases more when average longevity rises.

2.4 Consumption and retirement

An individual faces the following intertemporal budget constraint:

$$\begin{aligned} c + d = (1-\tau )W_y + (1-\tau ) z W_o + P \end{aligned}$$
(6)

where \(\tau \) is the social security contribution (tax) rate and \(P\) denotes total pension entitlements received during old age.Footnote 7

Maximizing lifetime utility (1) over \(c\), \(d\) and \(z\), subject to the lifetime budget constraint (6) yields the following first-order conditions:

$$\begin{aligned} u'(c)&= u'(x) \end{aligned}$$
(7)
$$\begin{aligned} (1-\tau )W_o&= \frac{\gamma z}{\pi } \end{aligned}$$
(8)

Expression (7) is the standard consumption Euler equation. Equation (8) is the optimality condition regarding retirement and states that the marginal benefit of working (net wage rate) should be equal to the marginal cost of working (disutility of labour). From these first-order conditions, we obtain the following expressions for \(c\), \(x\) and \(z\) for the benchmark model:

$$\begin{aligned} c&= \frac{1}{1+\pi }\left[ (1-\tau )W_y + \frac{(1-\tau )^2 W_o^2 \pi }{2\gamma }+P \right] \end{aligned}$$
(9)
$$\begin{aligned} x&= c \end{aligned}$$
(10)
$$\begin{aligned} z&= \frac{(1-\tau )W_o \pi }{\gamma } \end{aligned}$$
(11)

where \(P\) denotes total pension entitlements in the benchmark model. Note that the social security tax distorts the retirement decision: the larger the contribution rate \(\tau \), the earlier agents leave the labour market, i.e. the lower \(z\), because it reduces the net wage (and thus the price of leisure). Notice further that our disutility specification ensures that the retirement period is proportional to longevity, i.e. \(\pi -z = [1-(1-\tau )W_o/\gamma ]\pi \). Hence, a longer lifespan is split between later retirement and a longer retirement period. Low-skilled workers retire earlier than high-skilled workers for two reasons. First, since it is assumed that \(q<1\), low-skilled people have a lower wage rate (substitution effect). Second, low-skilled workers will generally have a shorter lifespan which induces them to leave the labour force earlier (disutility of labour effect).

2.5 Social security

The PAYG social security scheme is of the Beveridgean type. In the benchmark model, agents receive a flat pension benefit \(b\) per retirement period which starts at the statutory retirement age \(h\) and lasts until the end of the individual old-age period \(\pi \). Total pension entitlements \(P\) are then:Footnote 8

$$\begin{aligned} P = (\pi -h)b \end{aligned}$$
(12)

The fact that the pension benefit is flat but social security contributions \(\tau \) are proportional to the wage rate implies that the pension scheme redistributes income from high-income to low-income individuals. The pension scheme also redistributes from short-lived to long-lived individuals, however, as individuals receive the flat pension benefit until their death. The positive link between ability, wages and lifespan in our model then implies that there is also some redistribution from low incomes to high incomes, as the latter group typically has a longer lifespan.

A feasible social security pension scheme must satisfy the following resource constraint:Footnote 9

$$\begin{aligned} \int _0^1 P \, \mathrm {d}G = \tau qw \int _0^{a^*} (1+z_L) \, \mathrm {d}G + \tau w \int _{a^*}^1 (a+z_H) \, \mathrm {d}G \end{aligned}$$
(13)

Using Eqs. (5) and (12), we can rewrite this equation as:

$$\begin{aligned} b (\bar{\pi }-h) = \tau qw \int _0^{a^*} (1+z_L) \, \mathrm {d}G + \tau w \int _{a^*}^1 (a+z_H) \, \mathrm {d}G \end{aligned}$$
(14)

This condition states that the total amount of pension benefits paid out (left-hand side) equals the total amount of tax contributions received (right-hand side). The first term on the right-hand side is the tax payments of the low-skilled workers, and the second term is the payments of the high-skilled workers.

As a measure for redistribution, we calculate the net benefit of participating in the pension scheme. The net benefit is the difference between the total pension benefits received and tax contributions paid:

$$\begin{aligned} NB \equiv (\pi -h)b - \tau (W_y+z W_o) \end{aligned}$$
(15)

An agent is a net beneficiary if total pension benefits received exceed contributions paid (i.e. \(NB>0\)). Otherwise, the agent is a net contributor (i.e. \(NB<0\)). A priori it is not immediately clear whether the low-skilled agents are the net beneficiaries of this Beveridgean pension system. On the one hand, low-skilled agents benefit from this pension scheme as they have a lower wage rate and generally retire earlier than high-skilled agents. On the other hand, low-skilled agents also die earlier than high-skilled agents, which implies that low-skilled agents are negatively affected by the pension scheme.

Using the definition of net benefits, Eq. (15), the budget constraint of the pension scheme implies:

$$\begin{aligned} \int _0^{a^*} NB_L \, \mathrm {d}G + \int _{a^*}^1 NB_H \, \mathrm {d}G = 0 \end{aligned}$$
(16)

The net benefits of all (young) individuals are equal to zero, reflecting the zero-sum game nature of the pension scheme.Footnote 10

3 Pension flexibility reforms

In recent years, many countries have taken measures to increase work incentives and to stimulate people voluntarily to continue working. In this section, we consider the welfare and redistribution effects of a pension reform that allows for a flexible starting date of social security benefits, as recently implemented in e.g. the UK, Finland and Denmark. Introducing a variable starting date for benefits may help individuals to adjust the timing of pension income according to their own preferences. We will show that flexible pensions can also help to bear the costs of ageing or to reduce unintended transfers from short-lived to long-lived individuals.

In the benchmark model, we have assumed that social security benefits start at the statutory retirement date, irrespective of the individual’s effective retirement date. In this section, we impose that the benefits start at the time the individual actually leaves the labour market. If a person then retires later than the statutory retirement age, he receives an increment to his benefits for later retirement, and when this person retires earlier, he receives a decrement. The imposed coincidence of pension take-up and retirement is a realistic assumption because in practice flexible pension schemes often contain legal restrictions to continue work after a person has opted for benefits.Footnote 11 We will first discuss the actuarial adjustment of benefits in general. The specific cases of individual actuarial adjustment and uniform actuarial adjustment of benefits will be discussed in Sects. 3.1 and 3.2, respectively.

Actuarial adjustment of benefits

Suppose the government pays benefits \(p\) to an individual over his whole effective retirement period. Total pension entitlements are then equal to \(P=(\pi -z)p\). Pension earnings per retirement period \(p\) are given by:

$$\begin{aligned} p = m(z,\hat{\pi }) b \end{aligned}$$
(17)

where \(b\) is the reference flat pension benefit independent of contributions and labour history. The factor \(m(\cdot )\) is the actuarial adjustment factor which determines to what extent the reference benefit \(b\) will be adjusted when agents retire later or earlier than the statutory retirement age and is given by:

$$\begin{aligned} m(z,\hat{\pi }) = \frac{\bar{\pi }-h}{\hat{\pi }- z} \end{aligned}$$
(18)

where we impose \(\hat{\pi }- z>0\) to make sure that \(m(\cdot )>0\) to rule out negative pension benefits. The adjustment factor is equal to the ratio between the average retirement period and the individual retirement period measured by the reference lifespan parameter \(\hat{\pi }\) which will be specified below. At the individual level, actuarial non-neutrality arises when \(\hat{\pi }\) differs from \(\pi \). The function \(m(\cdot )\) is an increasing function in the individual retirement decision \(z\); when an agent decides to continue to work after the statutory retirement age, the pension benefit in the remaining retirement periods will be adjusted upwards.

We consider two scenarios for the lifespan to be used in the adjustment factor which differ with respect to the information set available to the government. In the first scenario, the government can observe individual longevity and uses adjustment factors based on individual lifespans (\(\hat{\pi }=\pi \)). The government can then get rid of the adverse redistribution from short- to long-lived individuals. The implication of this is, however, that the high-skilled will be harmed by this reform while the low-skilled gain, and a Pareto improvement is not possible. In the second scenario, we assume that the government applies a uniform actuarial adjustment factor, based on the average lifespan of the population (\(\hat{\pi }=\bar{\pi }\)). This uniform actuarial adjustment introduces selection effects in the retirement decision, long-lived agents have an incentive to postpone retirement, while short-lived agents have an incentive to advance retirement. We show that in this reform scenario, a Pareto improvement is possible.

3.1 Individual actuarial adjustment of benefits

To set the scene, we assume that the government can observe individual lifespans (or individual abilities)Footnote 12 and uses this information to assess the adjustment of benefits. This complete actuarial adjustment is a rather extreme position as it seems quite unrealistic that the government can observe individual lifespans. Moreover, it removes the raison d’être of our pension scheme as a redistribution device. Still, we think it is useful to present this case as a benchmark to demonstrate that it is impossible to generate a Pareto improvement with complete actuarial adjustment of benefits. It illustrates that a degree of incompleteness in the actuarial adjustment is necessary to generate a Pareto improvement, as in the case with uniform actuarial adjustment presented in Sect. 3.2. Moreover, we show in the online Appendix B that the same result applies in the more realistic case where the government cannot observe the individual lifespan, but only the skill level, i.e. the education level (which in general is correlated with individual lifespans, see e.g. van Kippersluis et al. 2011) and uses that information to adjust the benefits; also in that case, actuarial adjustment of benefits cannot result in a Pareto improvement.

3.1.1 Actuarial adjustment factor

With individual adjustment, \(\hat{\pi }=\pi \), the individual-specific adjustment factor \(m\) and the pension entitlements \(P\) become:

$$\begin{aligned} m&= \frac{\bar{\pi }-h}{\pi -z} \end{aligned}$$
(19)
$$\begin{aligned} P&= (\bar{\pi }-h)b \end{aligned}$$
(20)

Note from Eq. (19) that \(m=1\) for an agent with an average ability level (\(a=\bar{a}\)) who retires at the statutory retirement age \(h\). For this so-called average individual, the pension benefit per retirement period is equal to the reference benefit, i.e. \(p=b\). In case this person retires later than the statutory retirement age, then \(m>1\), implying that the per-period benefit is adjusted upwards, i.e. \(p>b\). On the other hand, when the person retires earlier than the statutory retirement age, we have \(m<1\) and \(p<b\).

The retirement decision is actuarially neutral because the effective retirement age has no effect on the total pension entitlements \(P\), i.e. \(\partial P/\partial z = 0\). Agents cannot increase their total pension entitlements by postponing or advancing retirement. Any individual, irrespective of lifespan, income or skill level, receives exactly the same amount of lifetime pension benefits.

3.1.2 Consumption and welfare effects

The retirement decisions are the same as in the benchmark social security model (\(z_ ben =z_ ind )\).Footnote 13 The aggregate budget constraint of the pension contract also does not change, implying that the pension benefit per retirement period stays the same as well (\(b_ ben =b_ ind )\). Only consumption changes:

$$\begin{aligned} c_ ind= & {} c_ ben +\frac{(\bar{\pi }-\pi )b}{1+\pi }\end{aligned}$$
(21)
$$\begin{aligned} x_ ind= & {} x_ ben +\frac{(\bar{\pi }-\pi )b}{1+\pi } \end{aligned}$$
(22)

With individual actuarial adjustment, the redistribution in the PAYG scheme related to differences in lifespan (i.e. from short-lived to long-lived agents) that is present in the benchmark model is removed, but the retirement decision is not changed. As a result, lifetime income, and therefore consumption, is higher for short-lived individuals (i.e. with a lifespan \(\pi \) below the average lifespan \(\bar{\pi }\)) and vice versa for long-lived individuals (\(\pi >\bar{\pi }\)). From this, we can immediately infer the following result:

Proposition 1

Introducing retirement flexibility using individual actuarial adjustment of pension benefits implies that the welfare of the short-lived agents (\(\pi <\bar{\pi }\)) increases while the welfare of the long-lived agents (\(\pi >\bar{\pi }\)) decreases. This reform therefore cannot be a Pareto improvement.

As the retirement decisions are the same as in the benchmark model, introducing flexible retirement with individual actuarial adjustment does not generate an efficiency gain, only pure redistribution. As a consequence, an improvement according to the Kaldor–Hicks criterion is not possible either.

3.2 Uniform actuarial adjustment of benefits

Individual lifespans are difficult to observe in practice. Therefore, real-world pension schemes with a flexible starting date for benefits always rely on uniform actuarial adjustment factors based on some average life expectancy index. In this section, we show that this uniform adjustment of benefits can increase welfare of all individuals, i.e. induce a Pareto improvement, although individuals are heterogeneous.

3.2.1 Actuarial adjustment factor

With uniform adjustment, the reference lifespan index is the same for each agent, \(\hat{\pi }=\bar{\pi }\), so the adjustment factor and pension entitlements are:

$$\begin{aligned} m&= \frac{\bar{\pi }-h}{\bar{\pi }-z} \end{aligned}$$
(23)
$$\begin{aligned} P&= \frac{(\pi -z)(\bar{\pi }-h)b}{\bar{\pi }-z} \end{aligned}$$
(24)

The actuarial adjustment factor \(m\) equals one for each individual who retires at the statutory retirement age, i.e. if \(z=h\), so that \(p=b\). Agents who retire later than \(h\) receive a higher benefit, \(p>b\), and agents who retire earlier receive less, \(p<b\).

From Eq. (24), we observe that, ceteris paribus, total pension entitlements of agents with long lifespans are higher than the entitlements of agents with short lifespans. This redistribution implies that the pension scheme is not actuarially neutral at the individual level. As the amount of pension entitlements depends on the individual retirement age, uniform actuarial adjustment introduces selection effects in the retirement decision. To show this, we derive from Eq. (24):

$$\begin{aligned} \varPsi (z) \equiv \frac{\partial P(z)}{\partial z} = \frac{(\pi -\bar{\pi })p}{\bar{\pi }-z} \end{aligned}$$
(25)

For agents with above-average lifespans (\(\pi >\bar{\pi }\)), \(\varPsi >0\), implying that these agents have an incentive to postpone retirement as this will increase their lifetime pension income. From an actuarial point of view, the conversion factor of these agents is too high. For short-lived people (with \(\pi <\bar{\pi }\)) it is just the opposite; for these agents, the conversion factor of continued activity is too low which stimulates early retirement. For these people, postponing retirement would simply mean that total pension entitlements decrease (\(\varPsi <0\)).

3.2.2 Consumption and retirement

With flexible pension take-up and uniform actuarial adjustment, the lifetime budget constraint of the \(a\)-individual is still equal to Eq. (6), but now \(P\) is defined as in Eq. (24). Only the first-order condition regarding retirement changes:

$$\begin{aligned} (1-\tau )W_o + \varPsi (z) = \frac{\gamma z}{\pi } \end{aligned}$$
(26)

with \(\varPsi (z)\) given by Eq. (25). Consumption and retirement are then equal to:Footnote 14

$$\begin{aligned} c_ uni&= c_ ben +\frac{1}{1+\pi }\left[ P_ uni -P_ ben -\frac{\left[ \varPsi (z_{uni})\right] ^2 \pi }{2\gamma }\right] \end{aligned}$$
(27)
$$\begin{aligned} z_ uni&= z_ ben +\frac{\varPsi (z_ uni ) \pi }{\gamma } \end{aligned}$$
(28)

Equation (28) shows that there is an extra distortion in retirement behaviour. Like before, we have that the contribution rate induces early retirement (through its impact on \(z_ ben \)). The redistribution effects, represented by \(\varPsi \), imply an additional distortion in the retirement decision. This redistribution distortion can either stimulate retirement or depress retirement, depending on the individual lifespan \(\pi \). For individuals with below-average lifespans (\(\pi <\bar{\pi }\)), \(\varPsi <0\), which implies that these people retire earlier as a result of uniform actuarial adjustment. If individuals have above-average lifespans (\(\pi >\bar{\pi }\)), then \(\varPsi >0\), and these people will postpone retirement.

Consumption can either be higher or lower compared to consumption in the benchmark case. The last term in Eq. (27) is negative and reflects the utility loss resulting from the redistribution distortion in the retirement decision. Of course, flexibility can also induce a utility gain because an agent can choose the retirement age which gives him the highest entitlements. This potential gain is captured by the term \(P_ uni -P_ ben \). Note from Eqs. (12) and (24) that total pension benefits are generally not the same in the benchmark scheme and in the flexibility reform with uniform adjustment.Footnote 15

3.2.3 Welfare effects

The welfare effects are not trivial because, compared to the benchmark model, uniform adjustment introduces another distortion in the retirement decision which can work into the opposite direction of the existing distortion related to the contribution tax. We will show that under certain conditions, this reform can lead to a Pareto improvement.

Suppose that the reform takes place unexpectedly. First we will analyse how this reform affects utility of the current old generation. In the benchmark, second-period consumption is equal to:

$$\begin{aligned} \pi x_ ben = s_ ben + \frac{(1-\tau )^2W_o^2\pi }{2\gamma }+P_ ben \end{aligned}$$
(29)

where savings are equal to \(s=(1-\tau )W_y-c\). After the reform, the first-order condition for the retirement decision of the old generation is given by Eq. (26). Using this condition, old-age consumption after the reform is:

$$\begin{aligned} \pi x_ uni = s_ ben + \frac{(1-\tau )^2W_o^2\pi }{2\gamma }+P_ uni -\frac{\left[ \varPsi (z_ uni )\right] ^2 \pi }{2\gamma } \end{aligned}$$
(30)

The old generation is not worse off after the reform when \(u(x_ uni )-u(x_ ben ) \ge 0\), implying:

$$\begin{aligned} \pi x_ uni -\pi x_ ben \ge 0 \, \Rightarrow \, P_ uni - P_ ben - \frac{\left[ \varPsi (z_ uni )\right] ^2 \pi }{2\gamma }\ge 0 \end{aligned}$$
(31)

The current young generation and future generations are better off if \(U(c_ uni ,x_ uni ) \ge U(c_ ben ,x_ ben )\) for each ability level, which implies, using Eq. (7), \(c_ uni \ge c_ ben \). From Eq. (27), we can see that the condition for young and future generations is exactly the same as that for the current old generation. This is due to the fact that there are no income effects in the retirement decision. Consequently, for a given ability level, the transition generation and all future young generations retire at the same age and thus have the same amount of lifetime income. Hence, when condition (31) is satisfied and is strictly positive for at least one \(a\)-individual, the reform is Pareto improving. To analyse the possibility of a Pareto improvement, we make the following assumption:

Assumption 1

The statutory retirement age is set equal to the retirement age of the individual with the average ability level, i.e. \(h=z(\bar{a})\).

This assumption implies that individuals with below-average life span have an incentive to advance retirement as from an actuarial point of view the adjustment factor of retirement postponement is too low for them. Therefore, for these people, retiring after the statutory retirement age is not in their interest, ceteris paribus, as it reduces pension entitlements compared to the benchmark. For individuals with above-average life span, exactly the opposite holds. These individuals have an incentive to postpone retirement because the actuarial adjustment factor is too high for them. Hence, retiring before the statutory retirement is not in their interest.

Suppose Assumption 1 is satisfied, we can then derive the following result:

Proposition 2

A pension reform from inflexible Beveridgean pensions towards flexible Beveridgean pensions with the same tax rate and uniform actuarial adjustment of pension benefits is a Pareto improvement if and only if \(\tau \ge \tau ^*\), with \(\tau ^*\) equal to:

$$\begin{aligned} \tau ^* = \frac{(\gamma -qw)\sqrt{\gamma -w}-(\gamma -w)\sqrt{\gamma -qw}}{w\sqrt{\gamma -qw}-qw\sqrt{\gamma -w}} \end{aligned}$$
(32)

Proof

See Appendix A.1 \(\square \)

The intuition for this result is as follows. High-skilled workers certainly gain from this reform because the adjustment factor is too high for them from an actuarial perspective because they live longer. This leads to a lower implicit tax on continued activity and thus later retirement. The welfare of low-skilled workers in principle declines because they are confronted with higher implicit taxation as their actuarial adjustment factor is too low. The only way to compensate for this loss is to give the low-skilled more social security benefits. If the contribution tax rate is sufficiently high, it is indeed possible that the continued activity of the more able generates enough resources to compensate the less able so that ultimately the welfare of all agents is higher.

Instead of keeping the tax rate constant as assumed in Proposition 2 and making everyone better of, the government may also use the additional resources generated by reducing the distortion of the retirement decision to lower the tax rate without making anyone worse of. In this way, introducing flexible retirement with uniform actuarial adjustment can be viewed as a reform to prevent the increasing fiscal burden of ageing. The crucial factor allowing for this is that the reform generates a double dividend: it not only generates additional revenues but also fosters redistribution from high to low incomes. Similar to Cremer and Pestieau (2003), this ‘double dividend’ hinges on two conditions. First, the retirement decision in the benchmark pension scheme needs to have a downward distortion, i.e. retirement is too early, and the removal of this distortion therefore brings additional resources. Second, the pension contract needs to be redistributive from rich to poor individuals so that most of the cost of the reform is borne by the high-income people.Footnote 16

In Fig. 1, we show a numerical illustration of the welfare (left graph) and redistribution effects (right graph) of a switch to a flexible scheme based on uniform actuarial adjustment. The underlying parameterization is as follows. The tax rate \(\tau \) is 0.3,Footnote 17 \(w=1\) and \(\gamma =2\). We further assume \(h=1/6\) and \(\bar{\pi }=0.7\), which implies an official retirement age of 65 and an average lifespan of 81 years.Footnote 18 The heterogeneity parameter \(\lambda \) is calibrated such that the difference between the lifespan of high-skilled and low-skilled agents is at most 3.5 years which is consistent with recent Dutch estimates, this gives \(\lambda = 1/6\). We interpret the high skill level as the highest attainable education levels in the Netherlands (i.e. higher vocational training and university) and the low skill level as the collective term of all remaining education levels. According to recent figures of Statistics Netherlands, about two-third of the Dutch population is low-skilled (\(a^*=2/3\)) and these people earn about 40 % less than high-skilled agents (\(q=0.6\)). Finally, we assume that ability \(a\) follows a uniform distribution,Footnote 19 i.e. \(G(a) = a\), and that the utility function is logarithmic, i.e. \(u(\cdot ) = \ln (\cdot )\).

Fig. 1
figure 1

Uniform adjustment: welfare and redistribution. a Welfare. b Net benefit. Notes: The welfare effects are measured in terms of consumption equivalent variation: the percentage of extra consumption an agent would require in the benchmark situation to be as well off as after the flexibility reform. The net benefit is the difference between the total pension benefits received and tax contributions paid, see Eq. (15). The graph shows the absolute change in the net benefit from the benchmark

Figure 1a shows that the welfare effects of introducing flexible retirement with uniform adjustment are positive for all high-ability agents. These agents benefit from a lower implicit tax on continued activity due to the attractive actuarial adjustment factor and therefore choose to work longer. With these parameter settings, however, the additional tax contributions are not sufficient to compensate all low-skilled agents for the higher implicit tax they are confronted with, although most of them experience an increase in the net benefit from the scheme (see Fig. 1b). To achieve a Pareto improvement, the contribution rate needs to be at least 40 %, that is, \(\tau ^* = 0.4\).

There are good reasons to argue that in practice the tax critical rate is lower than presumed in our analysis. First, as explained in Footnote 17, income redistribution from rich to poor runs through more channels than the pension scheme, like the tax system or public health care. Hence, when high-skilled agents are stimulated to work longer with a flexible pension take-up, the low-skilled may also be compensated through these other types of redistribution. Second, in reality the contribution tax is added to other sources of distortionary taxation. As the deadweight loss is roughly quadratic in the total tax rate, the marginal welfare improvement of introducing flexible retirement (and lowering implicit taxation) might be larger than our analysis suggests. In the next section, we show that a reduction in the tax critical rate can also be obtained by reformulating the pension reform to some extent, i.e. by setting the reward rate of retirement postponement above the actuarially neutral level.

4 Introducing actuarial non-neutrality

In recent years, an increasing number of countries introduced penalties and rewards for earlier and later retirement. To stimulate work continuation, the penalty rate is typically not as high as the reward rate, i.e. the adjustment is asymmetric. In the USA, for example, for each year of retirement before the statutory retirement age, the annual benefit is reduced by 6.75 %. The actuarial increment for those retiring after the statutory retirement age amounts to 8 %. In Japan, the difference is even larger, where the penalty rate of early retirement is 6 % per year while the reward rate of later retirement is 8.4 % (OECD 2011). In this final section, we therefore consider a pension flexibility reform where pension benefits are adjusted in an actuarially non-neutral way to induce people to postpone retirement. We show that under such a reform, a Pareto improvement can be achieved at a lower contribution rate or that for a given contribution rate, it leads to more positive welfare effects for all individuals.

4.1 Actuarial adjustment factor

To make our point as clear as possible, we abstract from lifespan heterogeneity in the analytical analysis. Hence, each agent, irrespective of his ability level, lives a fraction \(\pi \le 1\) of the second period. In the simulation graphs, however, we have heterogeneous lifespans. The actuarial adjustment factor is specified as follows:

$$\begin{aligned} m(z,\pi ) = \left( \frac{\pi -h}{\pi -z}\right) ^{\sigma }, \quad \sigma \ge 1 \end{aligned}$$
(33)

where the parameter \(\sigma \) governs the degree of actuarial non-neutrality of the adjustment factor, this is also shown in Fig. 2. In case \(\sigma =1\), the adjustment is completely actuarially neutral with respect to the retirement decision (see Sect. 3.1). For \(\sigma >1\), the adjustment factor is higher than the actuarially neutral level if agents retire later than the statutory retirement age (\(z>h\)). On the contrary, the adjustment factor is lower than the actuarially neutral level if agents retire earlier than the statutory retirement age (\(z<h\)). In other words, specification (33) rewards delaying retirement and discourages early retirement as long as \(\sigma > 1\).

Fig. 2
figure 2

Actuarial adjustment factor

Given Eq. (33), the pension entitlements \(P\) are equal to:

$$\begin{aligned} P = (\pi -h)^{\sigma }(\pi -z)^{1-\sigma }b \end{aligned}$$
(34)

Taking the derivative of \(P\) with respect to \(z\) gives:

$$\begin{aligned} \varPsi (z) \equiv \frac{\partial P(z)}{\partial z} = (\sigma -1)p \end{aligned}$$
(35)

Hence, if \(\sigma >1\) then \(\varPsi >0\), i.e. introducing actuarial non-neutrality gives all agents an incentive to continue working as this will increase pension entitlements.

4.2 Consumption and retirement

The consumption decision and retirement decision are equal to:Footnote 20

$$\begin{aligned} c_ nan&= c_ ben +\frac{1}{1+\pi }\left[ P_ nan -P_ ben -\frac{\left[ \varPsi (z_{nan})\right] ^2 \pi }{2\gamma }\right] \end{aligned}$$
(36)
$$\begin{aligned} z_ nan&= z_ ben +\frac{\varPsi (z_ nan )\pi }{\gamma } \end{aligned}$$
(37)

where \(P\) and \(\varPsi \) are defined by Eqs. (34) and (35), respectively. Taking the derivative of the retirement choice with respect to the parameter \(\sigma \) that governs the degree of actuarial non-neutrality gives (evaluated at \(\sigma = 1\)):

$$\begin{aligned} \left. \frac{\partial z}{\partial \sigma }\right| _{\sigma =1} = \frac{\pi p}{\gamma }>0 \end{aligned}$$
(38)

An increase in the parameter \(\sigma \) (starting from actuarial neutrality, i.e. \(\sigma =1\)) leads to later retirement. The introduction of this kind of non-neutrality in the retirement decision can undo (at least to some extent) the distortionary effect of the social security tax. This result is comparable with the situation in the flexibility reform with uniform actuarial adjustment and heterogeneous lifespans. With uniform actuarial adjustment, however, the pension scheme is still actuarially neutral on average: high-skilled workers (with a long lifespan) receive a subsidy on continuing work, whereas low-skilled workers (with a short lifespan) experience a tax on delaying retirement. The current reform is different because now the pension scheme subsidizes work continuation for all agents, irrespective of skill level.

4.3 Welfare effects

Introducing actuarial non-neutrality does not only stimulate labour supply, it also leads to a Pareto improvement if the tax rate is sufficiently high.

Proposition 3

With a given tax rate, introducing actuarial non-neutrality aimed at stimulating work effort makes high-skilled workers strictly better off. In addition, the reform is Pareto improving if and only if \(\tau >\hat{\tau }\), with:

$$\begin{aligned} \hat{\tau } = \frac{\left[ 1-G(a^*)\right] \ln \left( \frac{\pi -z_L}{\pi -z_H}\right) }{G(a^*)\frac{qw\pi }{\gamma (\pi -z_L)}+\left[ 1-G(a^*)\right] \frac{w\pi }{\gamma (\pi -z_H)}} \end{aligned}$$
(39)

This implicit equation has a unique solution.

Proof

See Appendix A.2 \(\square \)

The intuition for this result is similar as in the reform with uniform actuarial adjustment (see Sect. 3.2). The government can apply non-neutral actuarial conversion of benefits for late retirement as an instrument to increase the total efficiency of the economy. This subsidy reduces the existing labour supply distortion on the retirement decision related to the contribution tax rate. With actuarial non-neutrality, however, the reward rate of retirement postponement is relatively more attractive for agents who retire later (i.e. the high-skilled), as can also be seen from Fig. 2. Therefore, to ensure that the welfare of the low-skilled also improves, the contribution rate needs to be sufficiently high so that the additional tax payments of the high-skilled lead to higher pension benefits.

Figure 3 compares the welfare effects of a uniform adjustment under actuarial neutrality (dashed line) and actuarial non-neutrality (solid line). Contrary to the analytical exposition discussed above, this graph is based on heterogeneous lifespans (see also Eq. (5)). All parameter values are the same as those used in the previous graphs. As we have shown before, a contribution rate of 30 % is not sufficient to ensure that an actuarially neutral and a uniform adjustment of benefits is Pareto improving. Figure 3 shows, however, that when a uniform adjustment is combined with actuarial non-neutrality, this has strictly positive welfare effects for all individuals under a contribution rate of 30 %, i.e. the reform is Pareto improving. This implies that by introducing actuarial non-neutrality in the pension scheme, it is possible to achieve a Pareto improvement for a lower contribution tax rate. The reason for this result is that an actuarial non-neutral uniform adjustment gives more incentives for the high-skilled to retire later and that their labour supply in the second period will be higher than under the actuarially neutral reform; this will generate more resources to compensate the low-skilled.

Fig. 3
figure 3

Neutral versus non-neutral actuarial adjustment. Notes: The welfare effects are measured in terms of consumption equivalent variation: the percentage of extra consumption an agent would require in the benchmark situation to be as well off as after the flexibility reform. The actuarially non-neutral scenario is based on \(\sigma =1.05\)

5 Conclusion

In this paper, we have studied the intragenerational redistribution and welfare effects of a pension reform that introduces a flexible take-up of pension benefits. To analyse the economic implications of such a pension reform, we have developed a stylized two-period overlapping-generations model populated with heterogeneous agents who differ in ability and lifespan. The model includes a Beveridgean social security scheme with lifetime annuities. In this way, we take into account the empirically most important channels of intragenerational redistribution: income redistribution from rich to poor people and lifespan redistribution from short-lived to long-lived agents.

Our results suggest that introducing a flexible pension take-up with uniform adjustments can induce a Pareto improvement. This reform can collect additional resources without diminishing the welfare of low-skilled agents and increasing that of high-skilled agents. In that way, it can also help to bear the costs of ageing in a Beveridgean pension scheme. The selection effects of uniform actuarial adjustment increase the implicit tax of the low-skilled, but decrease the implicit tax of the high-skilled, who in turn decide to work longer and therefore pay more pension contributions. A necessary condition for such a Pareto improvement is that the contribution tax is sufficiently high so that the continued activity of the high-skilled generates enough tax revenues to compensate the low-skilled with higher benefits. Increasing the reward and penalty rates of later and earlier retirement in an actuarially non-neutral way can help to reduce this tax critical rate. This policy reduces the implicit tax not only of the high-skilled agents but also of the low-skilled, implying that the less-skilled agents need less compensation through the redistributive pension scheme.

In real-world pension schemes that have actuarial adjustment of pension entitlements, this adjustment is indeed independent of individual characteristics, like life expectancy or skill level. The results of this paper give a rationale for this kind of uniform flexibility reforms. In recent years, penalties and rewards for earlier or later retirement have increased in a number of countries (OECD 2011). However, in most countries, the implemented reductions in early pension benefits do still not fully correspond both to the lower amount of contributions paid by the worker and to the increase in the period over which the worker will receive pension payments (Queisser and Whitehouse 2006). This implies that there is still room to improve the pension systems by going into the direction of complete actuarial neutrality or by moving even beyond that level, as our analysis of non-actuarial neutral adjustment suggests.

Other important elements to which we have not paid attention, but that might be important when analysing pension flexibility, are the role of income effects in the retirement decision or social norms. Especially in the short run, flexibility in the pension age could lead to only small changes in retirement behaviour if agents are used to retire at some socially accepted retirement age. In the long run, however, norms may change and the effects described in this paper may still apply. To what extent these kinds of issues would affect our main results is left for future research.

Our paper, however, provides a rationale why countries with Beveridgean pension schemes should use uniform rules for the adjustment of pension benefits when they introduce flexible pension take-up even though people have different skill levels and life expectancies. It is sometimes argued that it would be preferable to base the actuarial adjustment factor on individual life expectancy or skill level. This paper shows that even in a very simple setting, the latter type of pension flexibility reform cannot be Pareto improving as some of the redistribution in the initial pension scheme (from the short- to the long-lived) is removed. It is therefore important to take all types of redistribution in the initial pension scheme into account when discussing the implementation of flexible pension take-up. Applying uniform actuarial adjustment, possibly combined with non-neutral elements to increase the incentives to postpone retirement, could increase the economic efficiency of the pension system. In that way, this reform generates extra resources to cope with the costs of ageing and makes some people better off while not hurting other people.