1 Introduction

The distributional impact of tax and transfer reforms plays a central role in policy debates. This is typically assessed by comparing the gains and losses across different income groups: reforms are classified as progressive if they result in a greater proportional gain (or a smaller proportional loss) for households with low incomes, and regressive if the converse is true.Footnote 1 However, such analyses suffer from an important limitation: the incomes used to rank households—and the changes in transfer entitlements or tax liabilities—are usually snapshot measures derived from cross-sectional data. These could provide a poor guide to the distributional implications of reforms because snapshot incomes are not always a good proxy for well-being. Individuals’ circumstances often vary a great deal across life, and resources can be transferred across periods of life through borrowing and saving. As a result, well-being may be better measured using longer-run income measures (see, for example, Aaberge and Mogstad 2015; Bönke et al. 2015).

In this paper, we contrast the snapshot and lifetime distributional impacts of two sets of reforms, one targeting the bottom of the distribution and the other targeting the top. At the bottom, we investigate reforms to the working-age benefit system that have played an important role in the policy response to widening inequality (Blundell 2006): an expansion in work-contingent transfers (like the EITC in the United States) and an increase to out-of-work benefits (like jobseekers benefit in most European economies). It has been well-established that work-contingent transfers are an extremely effective means of reducing poverty (especially among children: Sherman 2009; Joyce and Sibieta 2013) and raising the employment of lone parents (Hotz and Scholz, Hotz and Scholz; Brewer et al. 2006). Yet they are also seen as involving an equity-efficiency trade off, with Eissa and Hoynes (2011), for example, arguing that they raise a “glaring” distributional issue by transferring resources only to working families and “providing minimal support to the poorest families”, who for the most part are not in paid work.

A key contribution of our work is to show that while increasing out-of-work benefits looks far more progressive than equally costly increases to work-contingent benefits at a snapshot in time, increases to work-contingent benefits are—on average—just as effective as increases to out-of-work benefits at redistributing resources to the lifetime poor. The reason for this is that the lifetime poor spend most of their working lives in low-paid work, whereas many of those who benefit from increases to out-of-work benefits are experiencing temporary periods of joblessness.Footnote 2

Such patterns of mobility suggest that the perceived equity-efficiency trade-off that policymakers face in deciding between increasing work-contingent and out-of-work transfers is less stark than typically imagined. Alongside the well-documented fact that income is far more equally distributed on a lifetime than annual basis (e.g Blomquist 1981; Björklund 1993; Creedy 1999; van de Ven 2006; Kopczuk et al. 2010; Brewer et al. 2012), these patterns also raise the question as to whether personal income taxes—which are typically assessed on monthly or annual income—are effective at targeting those who are lifetime rich. Like Bengtsson et al. (2012) we find that they are, with increases to the main rates of income tax primarily affecting those with high levels of income over their lifecycle. This is because higher—though not necessarily the highest—earners tend to exhibit greater persistence in their incomes over time. As a result, annually assessed personal income taxes appear well-targeted at the top half of the lifetime income distribution. While our analysis uses UK data and the UK tax and benefit system, our results are likely to generalise to other countries with similar labour market structures.Footnote 3

To assess the distributional impact of reforms from a lifetime perspective we require longitudinal data which not only tracks individuals over the entirety of their adult lives, but which also includes all the necessary information for computing individuals’ tax liabilities and for determining their eligibility for different transfers. Unfortunately, even in the few countries where sufficiently long-running administrative or longitudinal survey data are available, they do not include all the information needed to calculate individuals’ disposable incomes under counterfactual tax and transfer systems. Administrative datasets that do contain such information typically are not available for sufficiently long periods of time to cover individuals for the whole of their lives.Footnote 4

Instead, we use statistical models of earnings, employment, health, housing tenure and family composition over the life-cycle to simulate lifetime profiles for individuals born in the ‘baby-boom’ cohort (1945–54). The approach we take deploys a method proposed in Levell and Shaw (2016), who compare the performance of alternative lifetime simulation methods. It involves first estimating transition equations for annual outcomes, conditioning on a rich set of demographics and outcomes for previous years using an 18-wave panel survey. The resulting estimates are then used to construct a simulated dataset for individuals that contains all the relevant characteristics for calculating tax and transfer payments. In order to ensure that our resulting simulated life-courses are representative of the experiences of our cohort of interest, estimates from the relatively short panel we use are combined with information from a much longer-running cross-sectional survey. Specifically, the transitions estimated using our panel data are adjusted to match cross-sectional averages for the baby-boom cohort at each age. This approach allows us to minimise the risks of conflating age, period and cohort effects that would otherwise occur when generalising processes estimated across different cohorts using a relatively short panel.

Our modelling approach incorporates insights from recent research on earnings dynamics, including the stylised facts that negative (positive) earnings shocks are more persistent for high (low) earners (Guvenen et al. 2015) and that the variance of shocks to income vary with age (Blundell 2014). Incorporating these more realistic income dynamics has been shown to have important implications for understanding the variance of lifetime earnings (Altonji et al. 2013), observed patterns of wealth inequality (Castaneda et al. 2003; De Nardi et al. 2016), and consumption decisions (Arellano et al. 2017).

Our approach also allows us to realistically model complex tax liabilities and benefit entitlements over entire life-courses, enabling us to characterise the lifetime distributional effects of different tax and transfer reforms. However, it has two key limitations. The first is that our analysis does not incorporate behavioural responses to tax reforms, as is common in cross-sectional analyses of tax reforms.Footnote 5 In our lifetime analysis we impose two counterfactual tax systems on individuals (a baseline tax system as if they were under the current tax and benefit system for the whole of their lives, and a reformed tax system) but do not model how these counterfactual systems might change individuals’ earnings and employment, partnering and fertility. This allows us to model an almost complete set of characteristics that are relevant for calculating tax and transfer payments, such as detailed family composition, joint employment and earnings in couples, health status, housing tenure and house value. Typically, a number of these variables are omitted from dynamic models that incorporate behavioural responses (usually in labour supply or consumption) in order to maintain tractability.Footnote 6

A second limitation concerns coverage of those with the very highest incomes. Those in the top 1% of the income distribution are often not captured well in household surveys (Atkinson et al. 2011; Burkhauser et al. 2018) but may have different earnings dynamics even when compared to those in neighbouring percentiles. For this reason, we do not examine reforms that are only likely to affect those at the very top of the income distribution (such as reforms to the top rate of income tax), and examine the distributional effects by cross-sectional and lifetime income deciles, rather than percentiles.

In contrasting the snapshot and lifetime distributional impact of these reforms, we add to a recent literature that has highlighted the importance of considering the long-run impacts of taxes and transfers. These include studies of their impact on individuals’ education (Blundell et al. 2016), employment (Hoynes et al. 2016), earnings (Dahl et al. 2014; Neumark and Shirley 2017) and health (Evans and Garthwaite 2014), as well as that of their children (Bastian and Michelmore 2018).

We also contribute to a literature examining the lifetime distributional impact of policy that has tended to focus on indirect taxes (Davies et al. 1984; Poterba 1989; Caspersen and Metcalf 1994; Lyon and Schwab 1995; Metcalf 1999). This showed that while such taxes on consumption look highly regressive when assessed against annual income, the same is not true when lifetime income or—as a proxy—annual consumption is used. Similarly, we show the importance of moving beyond an exclusively snapshot perspective when analysing direct tax and transfer reforms.

Our results suggest that policymakers seeking to redistribute resources to the long-run poor might favour doing so through increases to work-contingent benefits, given that these typically have less damaging effects on work incentives than out-of-work benefits. However, redistributing to those with few lifetime resources is by no means the only goal of policy. A growing literature has shown the consumption floor provided by benefits provides significant insurance value to individuals, both in retirement (O’Dea 2018) and working life (Landais and Spinnewijn 2019). In addition, credit market imperfections mean that redistribution towards periods of temporarily low current incomes can be particularly welfare-improving even if policymakers’ primary concern is the distribution of lifetime resources (Roantree and Shaw 2018). Rather, our results illustrate the importance of taking both a lifetime and snapshot perspective and of being clear about what one is trying to achieve: the alleviation of short-term hardship or the redistribution of lifetime resources.

The remainder of this paper is structured as follows. In the next section, we describe how we construct our simulated lifetime profiles, with further details provided in Appendices. Section 3 presents some descriptive statistics on lifetime incomes and how they compare to cross-sectional measures. This forms the backdrop to our results on the distributional effects of tax and transfer reforms on lifetime and cross-sectional income in Sect. 4. Section 5 concludes.

2 Methodology

In this section, we describe the data we use and how our lifetime processes are estimated and simulated for use in our tax policy analysis. Our aim throughout is to simulate life profiles that are representative of individuals born in the United Kingdom in the period 1945–54 (roughly corresponding to the ‘baby-boom’ cohort). Here we provide an overview of our method and how we validate it; for further details, see Appendix A and Levell and Shaw (2016), who compare our microsimulation approach to others used in the literature.

2.1 Data

We rely primarily on two datasets: the British Household Panel Survey (BHPS) and the Living Costs and Food Survey (LCFS).

The BHPS is a panel survey that ran for 18 waves from 1991 to 2008, collecting a wide range of demographic and socio-economic information. The survey followed individuals and their descendants over successive waves. The original sample comprised around 10,000 individuals in 5500 households and was nationally representative south of the Caledonian canal. Booster samples were introduced for Scotland and Wales in 1999. In each wave, the survey aimed to interview all individuals aged 16+ in each household, including children who reached adulthood after the survey began and adults who moved into households that were previously surveyed.

The Living Costs and Food Survey (LCFS) is the latest name for a long-running, annual (for most of its history), cross-sectional survey of household spending patterns in the United Kingdom. It was known as the Family Expenditure Survey (FES) between 1957 and 2001 and the Expenditure and Food Survey (EFS) between 2001 and 2008. For simplicity in what follows we shall refer to all these surveys simply as the LCFS. The LCFS collects data on household incomes from various sources over the past 12 months, employment, family characteristics (including years of education from 1978 onwards) and expenditures. We make use of the LCFS between 1968 and 2012.

In addition to these datasets, we also make use of the English Longitudinal Study of Ageing (ELSA) to impute private pension incomes to our simulated individuals. ELSA is a panel survey of the over 50s in England, with rich information on private pension wealth.

2.2 Overview of simulation approach

The first step in our approach is to estimate the conditional probabilities associated with different transitions at each age using the BHPS panel data. The processes we model are those that are central to determining taxes and benefits: mortality, partnering, separation, child arrival and departure, movements into and out of disability, movements in and out of employment, movements between full-time and part-time work, movements across ranks in the earnings and rent distributions, movements into and out of rented accommodation, and movements between local property tax bands (known in the United Kingdom as council tax). We do not model capital or asset incomes, or receipts of transfers and inheritances from other households. We also ignore benefits in kind, such as health and education spending. In most cases, we estimate transition probabilities using binary and multivariate logit models with a detailed set of covariates. A summary of the exact specifications we use in the estimation stage is set out in Appendix A.

Once we have obtained estimates, the next step is to use these to simulate a set of lifetime profiles. These simulations allow for correlations between outcomes in a sequential manner. First we determine whether or not the agent lives or dies in the period. We then assign births to individuals according to probabilities of child arrival that we have estimated, and determine whether children between ages 16 and 18 leave the household. Individuals then partner or separate. Childbirth is determined prior to partnering so that it will depend on lagged rather than current partner status (allowing for a nine month gestation period). We then determine whether or not individuals receive disability benefits, before assigning an employment status, and a location in the earnings distribution. We impose that all those who are disabled are unemployed. Finally we determine whether or not the individual is a renter, and the household’s council tax band, before incrementing individuals’ ages and repeating the process.

We start simulating from 1960 when the baby-boomers are in childhood. Initial conditions (education levels, likelihood of being a renter and so on) are set using data on the baby-boom cohort from the LCFS. We simulate 5000 lifecycles. Consumption and private pension profiles are imputed to individuals once the simulations are complete. Consumption is imputed on a year-by-year basis using regressions run on the cross-sectional LCFS. For private pensions we use pension incomes projected for future years that were calculated using the English Longitudinal Study of Ageing for real world members of the baby-boom cohort.Footnote 7 These are calculated for different possible retirement dates for each individual given observed pension wealth. The profiles are matched to our simulated individuals within cells defined by cohort, year, age and sex according to estimated ranks in the private pension wealth distribution.

For some of our variables, the process of simulation is quite straightforward. Probabilities are estimated and transitions then drawn from the relevant distribution. The only additional complication is that after estimating transition probabilities using the BHPS our results are scaled in order to match cross-sectional patterns of demographics and employment observed for our cohort of interest. We describe this scaling procedure in Sect. 2.4. Further details of the exact approaches we take are presented in Appendix A. The way we model earnings is a little more complicated however, and so we provide more detail here.

To model movements of individuals across the earnings distribution accurately, we must be careful to allow for key features of real-world transitions. A standard approach to modelling earnings dynamics would be to estimate an earnings process including both an individual-level fixed effect, an AR(1) error and a normally distributed innovation. However, such models are now known to have problems capturing key features of real-world earnings mobility, including aspects which are likely to be particularly important in understanding the lifetime impact of different tax and benefit reforms. For instance, these models assume that the persistence of earnings is independent of both age and individual earnings histories. Moreover, in such models, positive and negative shocks to earnings are equally likely, and equally persistent, regardless of individuals’ initial locations in the earnings distribution. Recent work using detailed administrative data (Guvenen et al. 2015) has shown that the distribution and persistence of shocks differs in important ways both over the life-cycle and according to current earnings.

The approach we take is to directly estimate transition matrices across earnings quantiles and part-time or full-time employment (for other examples see Buchinsky and Hunt 1999; Bowlus and Robin 2012). In particular we model individuals’ earnings ranks (relative positions in the distribution by age and year) in a three-step approach. First, we model movements into and out of employment. Second, for those employed, we then model movements across five earnings ‘bins’ (corresponding to quartiles of earnings if in full-time employment and a separate bin for part-time employment). Third, we predict individuals’ exact earnings ranks within each bin and ‘fill in’ earnings levels from the actual cross-sectional distributions of earnings observed for our particular birth cohorts in the LCFS. More details on the procedure is given in the following subsection. But the key points here are that we capture earnings and employment transitions in a highly flexible way and match cross-sectional distributions (means, variances and higher order moments) exactly by construction.

Modelling ranks in the way we do means that we assume that movements across the earnings distribution by age are the same across cohorts and periods. This assumption is not completely innocuous. One could for instance imagine that the degree of relative mobility has changed over time between cohorts or that recessions may differentially affect the positions of some individuals relative to others. However, if we had instead modelled earnings and rents levels, we would face the problem of having to disentangle age, period and cohort effects when predicting values from our panel data. Failure to specify the correct model in this regard would potentially severely bias the shape of our estimated age profiles. In addition, we would have faced the challenge of ensuring that our simulated individuals were subject to a realistic sequence of business cycles by for example modelling a separate process for period effects. Taking values from actual cross-sectional distributions as we do ensures that our simulated individuals automatically face a real-world process for aggregate shocks.

2.3 Details on modelling employment and earnings

In this subsection, we provide more detail on our approach to modelling employment and earnings. We use the following three-step parametric approach.

  1. 1.

    Determine employment status We estimate transition matrices for employment status separately for males and females using multinomial logit models and according to individuals’ employment status over the previous two waves. The probabilities making up this matrix are estimated through a set of logit models which include several lags of employment status (and interactions thereof) to help us match the high persistence of employment status observed in the data. We also estimate separate logits according to the individual’s employment in the previous two periods and by sex. These transition probabilities are scaled so as to match the observed unemployment rates at different ages for the baby-boom cohort in the cross-sectional LCFS data (as discussed below).

  2. 2.

    Place the individual in an earnings bin Once an individual’s employment status is determined, we then place the individual in one of five possible bins: in part-time work, or in full-time work and in one of four different earnings quartiles. Distinguishing between part- and full-time work is important in our case as it determines eligibility for receipt of tax credits in the United Kingdom. We assume that part-time work corresponds to 20 hours per week and full-time work to 40 hours. To determine which bin an individual should be placed in we estimate multinomial logits from each of the six possible prior states i (which include unemployment)

    $$\begin{aligned} {\mathbf{Pr}} (i,j|X_{it}) = \frac{\exp (X_{it}\beta ^{ij})}{\sum _{m=0}^{N}\exp (X_{it}\beta ^{im})} \end{aligned}$$
    (1)

    where \(X_{it}\) is a set of covariates which includes a cubic in age, education, a dummy for whether individuals have children or not, and a dummy for whether they have children under the age of five (and various interactions of these) their current earnings rank (entering linearly) as well as five lags of full-time and employment status and lagged earnings quartiles. Including these lags relaxes what would otherwise be a Markov assumption that next period’s transition depends only on current circumstances. Differences in the coefficients attached to these allow for differences in the persistence of earnings ranks across the distribution. We run logistic regressions separately for each initial bin, and separately for men and women.

  3. 3.

    Determine the individual’s precise earnings rank The results from these models can be used to estimate the probability of moving between unemployment, part-time and full-time work and the different earning quartiles. However, it does not place individuals precisely within these quartiles. One approach to deal with this is to match simulated individuals to real-world individuals who made the same employment and earnings transitions as they did and use these individuals’ new ranks to determine the simulated individual’s new locations (the approach adopted by Bowlus and Robin, 2012). We adopt an alternative parametric method. This involves predicting ranks using regressions of the following form

    $$\begin{aligned} \begin{aligned} \Phi ^{-1}(r_{ikt}) =&\sum _{\tau =1}^{4}\sum _j\delta _0^{\tau j}D^i_{Q_{t-\tau }=j} +\sum _{\tau =1}^{4}\sum _j\delta _1^{\tau j}D^i_{Q_{t-\tau }=j}.r_{ij,t-1} \\&+ \sum _{\tau =1}^{4}\sum _j\delta _2^{\tau j}D^i_{Q_{t-\tau }=j}.r_{ij,t-1}^2 + \dots \end{aligned} \end{aligned}$$
    (2)

    where \(r_{ikt}\) is the within-bin rank of individual i in period t with bin k. \(D^i_{Q_{t-\tau }=j}\) is a dummy which equals one if the individual was located in bin j in period \(t-1\). \(\Phi ^{-1}(\dots )\) is the inverse of the CDF of the normal distribution. These regressions are run separately for each destination bin, allowing us to capture the asymmetric nature of persistence over the income distribution. They are also run separately for males and females. Linearly predicting \(\Phi ^{-1}(r_{ikt})\) [and then feeding this prediction through \(\Phi (\dots )\)] ensures that the predicted within-bin rank always lies between 0 and 1. The polynomial of past ranks (up to a cubic) included in this regression is also interacted with a cubic in age in order to help us match the differing persistence of earnings over the life-cycle. In our simulations, we add a normally distributed noise term with the variance of residuals seen in the data to the linear prediction made using Eq. 2. The raw ranks that we predict for our simulated individuals need not have a uniform distribution. If this were not corrected before imputing earnings, the quantiles of earnings for our simulated individuals would differ from those in our cross-sectional data. To make the distribution uniform, we assign new ranks to individuals based on their relative positions within-sample as a final step. We see this additional step as analogous to our other cohort scaling adjustments. Younger individuals (who do not have all the lags needed to be included in the regression models mentioned so far) have their status determined by a multinomial logit (across our five possible bins and unemployment) and a within-bin regression involving only one lag.

Once earnings ranks have been predicted, we then impute actual earnings levels using the distributions observed in the LCFS. For years when our cohort is not observed in the LCFS (prior to 1968 and after 2012) we uprate or downrate earnings distributions for the relevant ages from different cohorts using historical data on earnings growth or forecasts taken from the UK Office for Budget Responsibility.

Our approach allows for the persistence of earnings to differ by age, and for transition probabilities to vary depending on individuals’ earnings histories. In Appendix A.1, we assess how well we capture patterns of mobility that we observe in the data compared to a more traditional earnings process. We find that our approach is much better at matching the tendency of individuals to remain at the same location in the earnings distribution as well as the greater persistence of earnings in the tails of the distribution (and particularly for high earners).Footnote 8

2.4 Scaling transition probabilities

Typically two different approaches are used to construct lifetime profiles from panels that cover a relatively short period of time. One is to estimate processes using a sample that pools together data on different cohorts (see, for example Falkingham and Harding 1996). Another is to adopt a splicing approach that joins individuals observed at different ages within the same short period together to form a complete lifecycle (Bovenberg et al. 2008).Footnote 9 The downside of both of these approaches is that processes for earlier ages will be estimated using data taken from cohorts that are younger than the baby-boom cohort and those for later in life from older cohorts. This creates difficulties in interpreting the resulting estimates, since the simulated life-cycles may not correspond to the actual experiences of any real-world cohort of individuals. In addition, the approach does not make use of the information we do have on the evolution of average employment, fertility and so on across years from longer-running cross-sectional surveys.

To make best use of the data we have available, we modify the simulation approach in the following way. We first estimate transition probabilities for each of the relevant variables using panel data as we described above, and then we adjust these probabilities so that the resulting cross-sectional averages for our simulated cohort match those in the cross-sectional data for the same cohort from 1960 onwards.Footnote 10

To be more precise we aim to find transition probabilities for, say, employment such that we move from the (observed) cross-sectional distribution given at time \(t-1\) to the cross-sectional distribution at time t. For a general binary outcome for some individual i in year t and group g, \(y_{it}^{g}\), let:

$$\begin{aligned} p_{it}^g&= {\text {Prob}}(y_{it}^g=1) \end{aligned}$$
(3)
$$\begin{aligned} \pi _{it}^{g,01}&= {\text {Prob}}(y_{it}^g=1 | y_{i,t-1}^g=0 ) \end{aligned}$$
(4)
$$\begin{aligned} \pi _{it}^{g,11}&= {\text {Prob}}(y_{it}^g=1 | y_{i,t-1}^g=1 ) \end{aligned}$$
(5)

Then by the law of total probability, we get the flow equation

$$\begin{aligned} p_{it}^g = \pi _{it}^{g,01}(1-p_{i,t-1}^g)+\pi _{it}^{g,11}*p_{i,t-1}^g \end{aligned}$$
(6)

It is clear that \(\pi _{it}^{g,01}\) and \(\pi _{it}^{g,11}\) cannot be uniquely identified with knowledge of \(p_{it}^g\) and \(p_{i,t-1}^g\) alone. This is because a variety of transition probabilities could in principle satisfy Eq. 6. Cross-sectional data alone is therefore not sufficient to identify these and panel data is required. However, the transition probabilities that we have estimated need not satisfy this expression - not least because they are partly based on data for different cohorts and years. Let these probabilities be denoted by \(\hat{\pi }_{it}^{g,01}\) and \(\hat{\pi }_{it}^{g,11}\). We proceed by choosing values for \(\pi _{it}^{g,01}\) and \(\pi _{it}^{g,11}\) so as to minimise the average distance to our estimated transition probabilities

$$\begin{aligned} \left( \frac{1}{N_g}\sum _{i}\pi _{it}^{g,01} - \frac{1}{N_g}\sum _{i}\hat{\pi }_{it}^{g,01} \right) ^2 + \left( \frac{1}{N_g}\sum _{i}\pi _{it}^{g,11} - \frac{1}{N_g}\sum _{i}\hat{\pi }_{it}^{g,11} \right) ^2 \end{aligned}$$
(7)

subject to satisfying Eq. 6. Here \(N_g\) is the number of individuals in group g. Since our probabilities are estimated using logit models, we can equivalently achieve this by choosing scalars \(z^{g,1}\) and \(z^{g,1}\) to minimise

$$\begin{aligned} \begin{aligned}&\left( \frac{1}{N_g}\sum _{i} \Lambda (X_{it}\beta ^{g,01} +z^{g,1}) - \frac{1}{N_g}\sum _{i} \Lambda (X_{it}\beta ^{g,01}) \right) ^2 \\&\quad + \left( \frac{1}{N_g}\sum _{i} \Lambda (X_{it}\beta ^{g,11} +z^{g,2}) - \frac{1}{N_g}\sum _{i} \Lambda (X_{it}\beta ^{g,11}) \right) ^2 \end{aligned} \end{aligned}$$
(8)

again subject to satisfying Eq. 6. \(\Lambda\) is the cdf of the logistic function and \(\beta ^{g,01}\) for example denotes the coefficients estimated for transitions from \(y_{i,t-1}^g=0\) to \(y_{it}^g=1\). Choosing terms to add to the index of individuals’ logistic functions rather than scaling overall probabilities ensures that all adjusted probabilities remain between zero and one.

The above scaling procedure is used to adjust transition probabilities for couple status, moving from being a renter to an owner (and vice versa) and employment. It was not found necessary to scale the probability of child arrival even though earlier cohorts tended to have more children. This is because the number of children among our simulated individuals tended to reach the right level once the probabilities for partnering and separation had been adjusted. We allow the scaling factors \(z^{g,1}\) and \(z^{g,1}\) we choose to differ across subgroups of the population g, allowing us for example to ensure that we match male and female employment rates implied by our repeated cross sections. The precise subgroups used differ depending on the variable we are considering and are given in Table 1.

Table 1 Cells within which transition probabilities are scaled

Scaling within subgroups means we will not necessarily exactly match the probability of being employed or in a couple in each year (since it is possible that the number of individuals with children differ from the numbers for the population for example). However, as we show in Appendix A, we match the cross-sectional probabilities extremely well.

We scale mortality rates for individuals using a simpler method. For this we take data from the UK Office for National Statistics Life Tables which provide projected mortality rates for men and women at different ages for different birth years. We then use the difference between these and average within-sample mortality rates for individuals in the BHPS to scale mortality rates predicted using a logit regression on income, disability benefit receipt, education and couple status.

2.5 Taxes and benefits

The rich set of outcomes we model gives us all the key variables we need to calculate tax and benefits for each of our simulated individuals. We assign tax and benefit payments to our simulated individuals using TAXBEN, a detailed tax and benefit microsimulation model for the United Kingdom (Waters 2017). We model all major benefit payments—both universal and means-tested—including unemployment insurance, payments to families with children, disability benefits and tax credits.Footnote 11 We assume that all those eligible for benefits take them up. The taxes included are income tax, payroll taxes (employee National Insurance contributions), VAT, excise and fuel duties, and local property taxes (council tax). We do not model the effects of capital taxes, inheritance duties or business taxes which are difficult to attribute to individual households (such as corporation tax). In addition, we are limited in our ability to accurately model taxes on the very highest (i.e. top 1% of) earners because they are not captured well in household surveys (Atkinson et al. 2011; Burkhauser et al. 2018). For this reason, although we incorporate the additional rate of income tax in our simulations, we do not examine reforms to this rate of tax. Finally, our approach does not account for any differences in behaviour that might result from our individuals being exposed to the tax systems we impose on them rather than the actual tax and benefit systems they faced.

2.6 Behavioural responses

Our microsimulation approach does not incorporate possible behavioural responses to tax and transfer reforms. While not without limitations, it allows us to model the impact of a much more complete set of characteristics relevant for calculating tax and transfer payments, including detailed family composition, joint employment and earnings in couples, health status, housing tenure and house value. Typically, a number of these variables are omitted from dynamic models that incorporate behavioural responses (usually in labour supply or consumption) in order to maintain tractability.Footnote 12 The approach we take allows us to model the effects of more realistic and intricate tax and transfer reforms than would otherwise be the case.

3 Simulated lifetime incomes

To provide some background for interpreting our subsequent results, this section presents descriptive statistics for our simulated lifetime profiles and how these compare to statistics from a synthetic 2015/16 cross-section that is also based on our simulated lifecycles. This cross-section describes what the 2015/16 population would look like if all cohorts were the same as the baby-boom cohort. As a result, any differences relative to the lifetime will be due to the lifetime perspective.Footnote 13

The course of incomes over the lifecycle is shown in Fig. 1. It plots mean annual earnings (not conditional on working) and mean private pensions (not conditional on receiving a pension) across life for men and women for our simulated profiles.Footnote 14 The figure shows that, on average, earnings rise for men until the late 40s and then decline steadily thereafter. For women, earnings flatten off during the late 20s, associated with taking time out of the labour market for child-rearing. They then rise again, reaching a peak at around age 50 before falling towards retirement. As earnings decline around retirement, individuals start receiving private pensions, though at a much lower level, on average, than earnings during working life.

Fig. 1
figure 1

Mean earnings and pensions by age and sex. Note: Series show mean earnings and private pensions across life. Values are expressed in real 2015 terms (deflated by the Retail Prices Index, or RPI). Earnings are zero for the unemployed and for those not in receipt of a private pension

In order to compare individuals with different lifetime incomes, we must first specify a way to convert income profiles across life into a single figure. One way to do this would be to set lifetime income equal to the present discounted sum of incomes across all ages. However, this definition could mean that those living long lives with low living standards in each period will appear to have higher incomes than those living short lives but with higher average incomes. Such outcomes are counterintuitive. Instead, we use a measure of the discounted average lifetime income

$$\begin{aligned} Y_i = \frac{1}{\sum _{a=16}^{A_i}(1+r_a)^{a-16}}.\sum _{a=16}^{A_i}\frac{y_a}{(1+r_a)^{a-16}} \end{aligned}$$
(9)

where a is age, \(r_a\) is the nominal interest rate (taken from the yields on UK government consols) and \(A_i\) is the maximum age reached by individual i. This converts lifetime income into an annualised figure. Incomes are equivalised using the modified OECD scale in each year prior to doing this. The discounted value of lifetime income \(Y_i\) is then inflated using the Retail Price Index (RPI) to 2015 terms to ensure comparability across cohorts.

Figure 2 gives an indication of how persistent net income is over time in our simulations at different points in the income distribution. This shows that those lifetime poor (rich) spend quite a substantial fraction of life outside the poorest (richest) cross-sectional decile; in other words, there is a substantial degree of mobility around the distribution across life. It also shows that mobility is non-symmetric. Persistence is greater at the top of the distribution than at the bottom: individuals in the richest lifetime decile spend more of their lives in the richest cross-sectional decile than individuals in the poorest lifetime decile spend in the poorest cross-sectional decile (34% compared to 21%).Footnote 15

Fig. 2
figure 2

Proportion of life spent in each cross-sectional decile, by lifetime net income decile. Note: The series show the proportion of life spent in each cross-section net income decile by individuals in the poorest/richest lifetime net income decile. Deciles are defined on equivalised net income ignoring indirect taxes (annualised net income for lifetime deciles)

Figure 3 shows how employment varies across the net income distribution. The cross-section series shows the proportion of working-age individuals who are employed, split by cross-sectional net income decile. The lifetime series shows the average fraction of working life that individuals are employed, split by annualised lifetime net income decile. From this graph, it is clear that relatively few individuals in the bottom cross-sectional decile are employed (22%), but from a lifetime perspective, individuals in the bottom lifetime decile are employed for the majority of working life (an average of 63%). This has implications for the relative impact of in-work and out-of work benefits on lifetime inequality, which we discuss below.

Fig. 3
figure 3

Employment among working age individuals by net income decile. Note: The cross-section series shows the fraction of working-age individuals in each net income decile who are employed (deciles are defined using the whole population, not just working-age individuals). The lifetime series shows the average fraction of working life that individuals are employed for. Working age is defined as under 63 for women and under 65 for men. Deciles are defined on equivalised net income ignoring indirect taxes (annualised net income for lifetime deciles)

Finally, Fig. 4 plots the distributions for gross lifetime and cross-sectional incomes. The cross-sectional distribution exhibits positive skew, with a long tail of individuals with high incomes, while the lifetime distribution is more symmetric. This reflects the impact of income mobility on the lifetime distribution—a point we also return to in what follows.

Fig. 4
figure 4

Distribution of gross income. Note: The series show the densities of gross equivalised household incomes over the lifetime and in a cross-section. Lifetime incomes are expressed in annualised terms and are discounted to the year when individuals turned 16 (see Eq. 9). We exclude the top 1% of incomes and those with zero incomes

4 Results

In this section, we contrast the snapshot and lifetime distributional impact of two sets of changes to the tax and transfer system. We first consider three reforms which have been seen as alternative ways of redistributing resources towards lower income families, with different trade-offs between equity and efficiency. We then consider changes to the basic and higher rates of income tax, key policy instruments for financing such increases in expenditure.

The analysis assumes throughout that - barring the reforms we consider - individuals face the 2015/16 UK tax and benefit system for the entirety of their lives. This is because we are primarily interested in the characteristics of given tax and benefit systems from a lifetime perspective rather than, say, the experiences of a particular cohort under the systems they were actually exposed to. We hold behaviour fixed following reforms, meaning that labour supply and other variables are assumed to be the same as they were for the baby-boom cohort under all the different tax systems we consider. To determine individual incomes, tax liabilities and benefit payments, we assume equal sharing of resources between members of couples. In order to apply a given tax and benefit system to data from earlier or later years, we uprate in line with average earnings growth. This brings us close to ensuring that the tax and benefit system raises the same revenue each year.Footnote 16

Several countries provide significant income top-ups to low-income in-work families through tax credits. For example in the United States there is the Earned Income Tax Credit (EITC), while in the United Kingdom these take the form of the Working Tax Credit (supplemented by Child Tax Credit for in and out-of-work families with children). Governments also attempt to help low-income households by removing low earning individuals from the direct tax net. Such reforms have the potential to provide help to low-income households while at the same time improving work incentives. However, in a cross-sectional analysis, they often appear less progressive than increases in out of work benefits, creating an equity-efficiency trade-off described for the EITC in Eissa and Hoynes (2011). How does a lifetime analysis affect this assessment?

To answer this question we consider three reforms relative to the 2015–16 tax and benefit system with similar cross-sectional revenue consequences (around £3 billion/$4.2 billion per year), namely:

  • an increase in out-of-work benefits: a 16.5% increase in the maximum income support, (income-based) jobseeker’s allowance, pension credit and (non-contributory) employment support allowance awards;

  • an increase in in-work (i.e. work-contingent) benefits: an 18% increase in the maximum working tax credit award;

  • an income tax cut: a 4% increase in the income tax personal allowance. This is the threshold below which income tax is not paid. In 2015/16 this stood at £10,600 (around $16,000).

Figure 5 shows the cross-sectional effect of the three reforms listed above.Footnote 17 Unsurprisingly, the most progressive reform is the increase to out-of-work benefits: gains are concentrated in the bottom two income deciles, with the largest average gain experienced by the lowest income decile (5.0%), in which there is a high share of non-working individuals. Next most progressive is the increase to working tax credit. Here, the bottom four deciles are gainers, with gains peaking at an average of 1.7% in decile two. The bottom decile gains by less because fewer individuals here are entitled to working tax credit (because they are not in work). The least progressive (indeed a regressive) reform is the income tax cut in the form of an increase in the personal allowance. Gains are concentrated among the upper half of the income distribution, reflecting the fact that the poorest adults have income that is too low to benefit from the giveaway, while dual-income couples—who tend to have higher family incomes—can benefit twice over.

Fig. 5
figure 5

Cross-sectional distributional impact of reforms. Note: Deciles are defined on the basis of cross-sectional equivalised net household income. The height of the bars is the gain or loss as a percentage of the relevant decile’s total net (unequivalised) household income. The ‘Out-of-work benefits’ series shows the effect of a 16.5% increase in maximum income support, (income-based) jobseeker’s allowance and (non-contributory) employment support allowance. The ‘In-work benefits’ series shows the effect of an 18% increase in maximum working tax credit. The ‘Income tax’ series shows the effect of a 4% increase in the income tax personal allowance. In all cases, the baseline tax and benefit system is the 2015/16 system

The pattern of gains over lifetime income is different, as shown in Fig. 6. In particular, increases in out-of-work and in-work benefits are strongly progressive, with very similar distributional patterns, while the income tax cut (personal allowance increase) is close to distributionally neutral. This stems from the pattern of worklessness over the lifecycle shown in Fig. 3. Although the poorest individuals at a point in time are frequently out of work, this is often a temporary state with many moving in and out of work over the course of their lifetimes. In addition, when in work, they are relatively likely to be in low-paid work and qualify for in-work support while also being liable for income tax.

Fig. 6
figure 6

Lifetime distributional impact of reforms. Note: Deciles are defined on the basis of average discounted equivalised disposable income, as described in the text. The height of the bars is the gain or loss as a percentage of the relevant decile’s total net (unequivalised) household income. To aid comparison, we have scaled gains proportionally such that the ‘All’ bars are the same across reforms. The ‘Out-of-work benefits’ series shows the effect of a 16.5% increase in maximum income support, (income-based) jobseeker’s allowance and (non-contributory) employment support allowance. The ‘In-work benefits’ series shows the effect of an 18% increase in maximum working tax credit. The ‘Income tax’ series shows the effect of a 4% increase in the income tax personal allowance. In all cases, the baseline tax and benefit system is the 2015/16 system. All individuals face the same system throughout life uprated in line with average earnings (AEI). To aid comparison, we have scaled gains proportionally such that the ‘All’ bars are the same across reforms

That increases to out-of-work and in-work benefits have such a similar distributional pattern is a particularly striking result. In-work benefits have the advantage that—in general—they have much less of a negative impact on work incentives at the extensive margin where labour supply responses may be particularly large (see Eissa and Liebman (1996) for a discussion in the context of the EITC). Out-of-work benefits reduce the net-financial gain to being in work, while in-work benefits in the United Kingdom are contingent on working a certain number of hours. Furthermore, as Blundell et al. (2016) highlight, the two types of benefits have different effects on incentives to accumulate human capital: while increasing out-of-work benefits provides a high level of insurance, they are associated with strong moral hazard effects and are less effective in improving overall welfare than in-work benefits.

Policymakers looking to target the lifetime poor might therefore favour doing so through in-work benefits. The disadvantage of such an approach is that it would do less to help the lifetime poor in the particular periods that they were not working, which could matter if they did not have access to savings or borrowing facilities. It would also do less to help the minority of the lifetime poor who do experience sustained periods without work. However, recent experience suggests that, at least in the UK, the lifetime poor among younger cohorts are so because of low levels of earnings, rather than long periods out of the labour market (Belfield et al. 2014).

Changes to income tax are among the primary policy instruments that policymakers have to finance increased expenditure. We therefore consider the snapshot and lifetime distributional impact of increases in both the higher rate of income tax—which in 2015/16 applied at a rate of 40% to individual incomes above £31,786 (roughly $48,000)—and the basic rate of 20%—applied to incomes between this and the personal allowance of £10,600 (around $16,000).Footnote 18

Figure 7 shows the distributional impact of a one percentage point increase in the higher rate of income tax, from both a cross-sectional and lifetime perspective. The reform is extremely progressive in the cross-section: the bottom four deciles are completely unaffected (because few individuals in these earn enough to pay the higher rate), and it is only the top two deciles that experience a hit to incomes of more than 0.1%, with losses peaking at 0.38% for the top decile. Over the lifetime, the reform remains strongly progressive, but there is slightly more of an impact further down the distribution. Those in the top four deciles experience a loss exceeding 0.1% but, as before, it is the very top decile that stands out, with a 0.23% fall. This reflects the greater persistence in earnings at the top of the distribution in our simulations (as we showed in Fig. 1). Thus, changes to the higher rate of income tax are reasonably effective at targeting the lifetime rich.

Fig. 7
figure 7

Distributional impact of a one percentage point increase in the higher rate of income tax. Note: Deciles are defined on the basis of equivalised net household income (cross-section income for the ‘Cross-section’ series and lifetime income for the ‘Lifetime’ series). The height of the bars is the gain or loss as a percentage of the relevant decile’s total net (unequivalised) household income (cross-section income for the ‘Cross-section’ series and lifetime income for the ‘Lifetime’ series). The baseline tax and benefit system is the 2015/16 system. For the ‘Lifetime’ series, all individuals face the same system throughout life uprated in line with average earnings (AEI)

For comparison, we also present the distributional impact of a one percentage point increase in the basic rate of income tax (Fig. 8). This shows that the cross-sectional impact is progressive: the bottom decile is largely unaffected because most individuals in this decile do not earn enough to pay income tax and the average loss peaks at 0.60% for the ninth decile. The top decile loses by slightly less because a smaller share of income for these individuals is subject to the basic rate. Over the lifetime, the impact remains progressive, but much less so because many more individuals will pay basic rate income tax at some point in life than in any one year. The bottom decile suffers a loss of 0.35%, rising to 0.53% for the top decile.

Fig. 8
figure 8

Distributional impact of a one percentage point increase in the basic rate of income tax. Note: Deciles are defined on the basis of equivalised net household income (cross-section income for the ‘Cross-section’ series and lifetime income for the ‘Lifetime’ series). The height of the bars is the gain or loss as a percentage of the relevant decile’s total net (unequivalised) household income (cross-section income for the ‘Cross-section’ series and lifetime income for the ‘Lifetime’ series). The baseline tax and benefit system is the 2015/16 system. For the ‘Lifetime’ series, all individuals face the same system throughout life uprated in line with average earnings (AEI)

5 Conclusion

Mobility in employment, earnings, housing tenure, health status and family composition means that in the longer-run, the distributional impact of tax and transfer reforms can look very different to those implied by snapshot analyses. This paper examines the lifetime distributional impact of two transfer reforms which have been the subject of intensive policy debate in many countries.

We find that increases in work-contingent benefits are just as effective as increases to out-of-work benefits at redistributing resources to the lifetime poor, with important implications for the perceived equity-efficiency trade-off between the two types of transfer. We also find, by contrast, that higher rates of tax on annually assessed income are an effective way of targeting the lifetime rich. These results are driven by differing patterns of mobility over the earnings distribution. Those at the bottom of the lifetime income distribution tend to see greater year-to-year variation in their earnings, as they move in and out of (low-paid work), whereas the earnings of those at the top of the distribution see greater persistence.

Our results illustrate the importance of moving beyond an exclusively snapshot perspective when analysing tax and transfer reforms. However, there remain good reasons to also consider cross-sectional impacts: in particular, the presence of credit market imperfections or significant uncertainty which inhibit consumption smoothing means that redistribution towards periods of temporarily low current incomes can be particularly valuable (O’Dea 2018; Landais and Spinnewijn 2019; Roantree and Shaw 2018). Rather, our results illustrate the importance of taking both a lifetime and snapshot perspective and of being clear about what one is trying to achieve: the alleviation of short-term hardship or the redistribution of lifetime resources.

Appendix A provides additional details on how we construct and validate our simulated lifetime profiles. This appendix draws partly on the description of the model in Levell and Shaw (2016) but also includes additional validation statistics. Details on how we model employment and earnings as well as how we calculate taxes and benefits are given in the main body of the text. In Appendix B, we describe how we implemented an alternative fixed effects earnings process to compare with our main results.