Frictions and taxpayer responses: evidence from bunching at personal tax thresholds

Adam, Stuart; Browne, James; Phillips, David; Roantree, Barra

doi:10.1007/s10797-020-09619-0


Open access
Published: 19 August 2020

Volume 28, pages 612–653, (2021)

International Tax and Public Finance


Stuart Adam²,
James Browne³,
David Phillips² &
…
Barra Roantree ORCID: orcid.org/0000-0002-8738-8225¹


Abstract

We exploit kinks and notches in the UK personal tax schedule over a 40-year period to investigate how taxpayers respond to income tax and social security contributions. At kinks, where the marginal rate rises, we find bunching by company owner-managers and the self-employed, but not those with only employment income. Responses to notches, where the average rate rises, provide compelling evidence that this is because most employees face substantial frictions: fewer than a quarter bunch even where doing so would increase both consumption and leisure. We develop a new approach for identifying selection in who responds and for decomposing responses into hours and wage components. We find that those employees who do bunch at notches are almost exclusively part-time workers, but tend to have lower wages and work more hours than those part-time workers who do not bunch.


1 Introduction

How individuals respond to personal income taxes is of enormous importance for public policy, with both the revenue yield from reforms and the efficiency costs of taxation highly dependent on the magnitude and nature of responses. Since Feldstein (1999) showed that the elasticity of taxable income (ETI) is—under certain conditions—a sufficient statistic for these efficiency costs, a large volume of work has sought to estimate this parameter.

This work typically adopts a panel regression approach (e.g. Auten and Carroll 1999; Gruber and Saez 2002; Kopczuk 2005; Adam et al. 2019). However, such estimates are sensitive to the precise specification used to control for mean reversion or secular trends in income growth, and require a great deal of variation in tax rates over time.^{Footnote 1} More recently, new ‘bunching’ estimators have been developed which attempt to estimate the ETI by exploiting the prediction of basic neoclassical labour supply models that individuals should bunch at upward kinks and notches in the tax schedule (where, respectively, the marginal and average rates rise at a threshold). While research adopting this design has found evidence of substantial bunching by those (such as the self-employed) with significant scope for income manipulation, tax planning and evasion, evidence of bunching by employees is limited and appears to imply very low elasticities (Kleven 2016).^{Footnote 2}

The primary contribution of this paper is to show that the vast majority of wage earners face substantial frictions to optimising their earnings, sufficiently large to prevent them from bunching at tax thresholds. We interpret ‘optimisation frictions’ broadly, to encompass not only adjustment costs arising from rigidities in contracts or pay structures and search and matching costs, but also inattention, lack of information, optimisation errors, etc. Since such frictions should at least partially abate over time,^{Footnote 3} it is important to distinguish whether small observed responses reflect frictions or underlying preferences; the unattenuated underlying elasticity will usually (though not always) be the most relevant parameter for long-run policy.

While previous research has suggested that frictions could play an important role in attenuating the responses of employees around tax thresholds, ours is the first to confirm and quantify that effect for a broad group of workers in an advanced economy. Chetty et al. (2011) and Chetty (2012) suggest that frictions could be important but do not actually estimate them. Gelber et al. (2020) estimate the costs of adjusting earnings in the USA but only for low-paid workers approaching retirement, and likewise Zaresani (2019) only for disability insurance recipients in Canada. Kleven and Waseem (2013) look at bunching for a group of relatively high-income workers in the formal sector in Pakistan, finding that around 90% of them do not optimise their earnings in response to a notch due to frictions. Applying this same approach, we show that in an advanced economy, most working-age employees at various points across the earnings distribution face frictions in excess of 2% of earnings, and that for most low-paid workers the frictions are much bigger than that. This is true even for part-time employees, who might be thought to have greater flexibility in their hours of work.

Using high-quality administrative and firm survey data for the UK, we first show that while company owner-managers and the self-employed bunch strongly at kinks in the income tax and social security contributions (SSC) schedule, employees do not. While similar patterns of bunching have previously been identified in the USA (Saez 2010; Mortenson and Whitten 2020), Denmark (Chetty et al. 2011; le Maire and Schjerning 2013), Sweden (Bastani and Selin 2014), the Netherlands (Bosch et al. 2019) and Ireland (Hargaden 2020), the peculiar tax schedule on personal income in the UK, which during the period we study contained notches as well as kinks, allows us to shed light on why.^{Footnote 4} By creating a strictly dominated region of earnings that no one should choose to locate in, regardless of how much they value consumption relative to leisure, notches provide a means of measuring the extent to which individuals are constrained from reducing their earnings by optimisation frictions (Kleven and Waseem 2013).

We find no bunching below, and no dip above, several notches located at various points across the earnings distribution, meaning that frictions must be sufficiently large to offset the substantial gains from bunching. While we do see some bunching by employees at a notch lower down the earnings distribution, consistent with an underlying taxable earnings elasticity unattenuated by frictions of around 0.10–0.20, most of those who would locate in the dominated region in the absence of the notch still do so in the presence of a notch. This provides compelling evidence that the reason employees do not bunch at kinks is because a large majority face substantial frictions. This is true even immediately above the notch, where the potential tax saving from bunching was at times as high as 9% of earnings for the employee and a further 10.45% for the employer.

A particularly novel feature of this paper is that the firm survey data we use contain information on employees’ hours of work in addition to their earnings. Exploiting this, we find that the bunching observed at a notch low down the earnings distribution is entirely driven by the behaviour of part-time workers, who make up the majority of individuals at this part of the earnings distribution. Bunching is negligible among full-time workers with similar levels of earnings (and correspondingly lower hourly wages), suggesting that they face much greater frictions than part-time workers. We make a methodological contribution to the bunching literature, showing how these responses can be decomposed into hours and hourly wage components, and that selection in who responds can be identified. Applying this approach, we find that those part-time employees who bunch are higher-hours, lower-wage types than those part-time employees who do not bunch.

We also observe interesting variation in responses across taxpayers. Like Kleven and Waseem (2013) and le Maire and Schjerning (2013), we find that company owner-managers and the self-employed are particularly responsive to taxes, reflecting their greater scope for income manipulation, tax planning and evasion. Moreover, in recent years company owner-managers have become much more responsive than the self-employed. Among low-paid employees, we find that those in the retail and hospitality sectors are much more likely to respond than those in the public sector, and—perhaps surprisingly—that there is little difference in the responses of men and women conditional on working part-time. This raises the question of whether well-documented differences in observed behavioural responses between groups (e.g. men and women, employees and the self-employed) are a result of heterogeneity in preferences or in frictions. If increases in the share of women working full-time rather than part-time mean that they face larger frictions to adjusting their hours of work, that might help to explain why Blau and Kahn (2007), among others, find a decline in the elasticity of labour supply for women over time.

The rest of this paper proceeds as follows: Sect. 2 describes our institutional setting and data. Section 3 provides graphical evidence of bunching at kinks and notches in the UK income tax and SSC schedules and of how it varies between groups. Section 4 uses the bunching estimators of Saez (2010) and Kleven and Waseem (2013) to estimate elasticities from the bunching observed at kinks and notches, respectively. Section 5 outlines our method for decomposing bunching responses and identifying selection. Section 6 concludes.

2 Institutions and data

2.1 Institutional setting

The UK has two main personal taxes on income: income tax, paid by individuals on their earned and unearned income, and National Insurance contributions (NICs), paid by employees and employers on earned income only.^{Footnote 5} Unusually by international standards, most employees in the UK have their exact tax liability deducted from earnings at source through a pay-as-you-earn (PAYE) system and do not have to submit a tax return.^{Footnote 6} In fiscal year 2015–2016, both income tax and NICs had piecewise linear schedules, applying above tax-free allowances at standard rates up to a common upper threshold of £42,385 per year and at different rates above that. However, their design has not always been so simple, and their structures over the years provide multiple kinks and notches that we exploit to investigate the responsiveness of taxpayers.

2.1.1 Income tax

From the start of the 1990s, the UK operated a relatively simple, annual system of income tax, applied at a starting, basic and higher rate to individual income above a tax-free personal allowance.^{Footnote 7} Table 1 shows these rates for the period of our analysis (1995–1996 to 2015–2016), along with the personal allowance and the thresholds above which the basic and higher rates applied. Different rates of income tax applied to savings and dividend income, as described in the note to Table 1.

Table 1 Income tax rates and thresholds for earned income.


In 2008–2009, the starting rate was abolished (except for savings income), leaving taxpayers facing either the basic rate (above the personal allowance) or the higher rate (above the higher-rate threshold) on their non-savings, non-dividend income. Subsequent reforms have complicated this rate structure for the c.2% of adults with income above £100,000: since 2010–2011, the personal allowance has been reduced by £1 for each £2 of income above this point, creating a band in which income tax liabilities rise by 60 pence for each additional pound of income (an effective 60% rate) until the allowance is exhausted and the rate falls back to 40%, while incomes above £150,000 have been subject to an additional rate (initially 50%, now 45%).

In summary, the UK income tax schedule contains a number of upwards kinks at which we would expect to see bunching, namely:

at the personal allowance, throughout
at the basic rate threshold, until 2007–2008
at the higher-rate threshold, throughout
at £100,000 and £150,000, since 2010–2011

In addition, since 2010–2011 there is a downward kink at around £115,000 (where the personal allowance is fully withdrawn and the marginal rate falls back from 60 to 40%), which should result in a dip in the distribution of taxable income analogous to the bunching expected at upwards kinks.
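To make the taper arithmetic concrete, the following is a minimal sketch of a stylised income tax schedule with a withdrawn personal allowance. The parameter values (a £10,600 allowance, a £31,785 basic rate band, and rates of 20%, 40% and 45%) are assumptions chosen to resemble 2015–2016 rather than values read from Table 1, and the function names are purely illustrative.

```python
def income_tax(income, allowance=10_600.0, basic_band=31_785.0,
               taper_start=100_000.0, additional_start=150_000.0,
               basic=0.20, higher=0.40, additional=0.45):
    """Annual income tax on earned income under an assumed, stylised schedule."""
    # The allowance is withdrawn at £1 for each £2 of income above taper_start.
    tapered = max(0.0, allowance - max(0.0, income - taper_start) / 2)
    taxable = max(0.0, income - tapered)
    in_basic = min(taxable, basic_band)
    in_higher = max(0.0, min(taxable, additional_start) - basic_band)
    in_additional = max(0.0, taxable - additional_start)
    return basic * in_basic + higher * in_higher + additional * in_additional


def marginal_rate(income, step=1.0):
    """Extra tax due on one more `step` of income (finite difference)."""
    return (income_tax(income + step) - income_tax(income)) / step
```

Under these assumed parameters, `marginal_rate` is 0.40 just below £100,000, 0.60 inside the taper band, and 0.40 again once the allowance is exhausted, which reproduces the upward kink followed by the downward kink described above. The exact income at which the allowance runs out depends on the assumed allowance.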

2.1.2 National insurance contributions

Between April 1975 and October 1985, once earnings exceeded a lower threshold called the Lower Earnings Limit (LEL), NICs were levied on the entirety of earnings at an employee and an employer rate up to a ceiling called the Upper Earnings Limit (UEL). This created a jump (notch) in NICs liabilities at the LEL and a strictly dominated range of earnings above. The solid black lines in Fig. 1 illustrate this schedule for both employee (panel A) and employer (panel B) contributions in the 1984–1985 tax year.
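As a minimal sketch of how such a schedule generates a notch, the function below charges nothing below the LEL and a flat rate on all earnings once the LEL is reached. The 9% employee rate is the pre-1985 figure quoted in this section; the weekly LEL of £34 is an illustrative assumption, the UEL ceiling is ignored, and the function names are hypothetical.

```python
def employee_nics(earnings, lel=34.0, rate=0.09):
    """Employee NICs under a notched schedule (UEL ceiling ignored).

    Nothing is due below the LEL; once the LEL is reached, `rate` is
    charged on ALL earnings, not only on the excess over the LEL.
    """
    return rate * earnings if earnings >= lel else 0.0


def net_earnings(earnings, lel=34.0, rate=0.09):
    """Earnings after employee NICs."""
    return earnings - employee_nics(earnings, lel, rate)
```

With these assumed values, liability jumps from zero to 9% of the LEL at the threshold, so someone earning slightly more than the LEL takes home less than someone earning slightly less.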

Reforms taking effect in October 1985 changed the schedule significantly. As shown by the dashed grey lines in Fig. 1, the notch in the employee and employer NICs schedules at the LEL was reduced in size (from 9% and 10.45%, respectively, to 5% apiece), while new notches above the LEL were introduced in both schedules: two in the employee schedule and three in the employer schedule. In addition, the cap on employer contributions at the UEL was abolished.

Another reform, in October 1989, further reduced the size of the notch at the LEL for employee contributions (to 2%) and eliminated the additional notches above the LEL in the employee NICs schedule. However, it left in place the additional notches in the employer NICs schedule, as shown by the dashed black lines in Fig. 1b.

The structure of NICs in place at the end of our period (shown by the solid grey line in Fig. 1) was arrived at through reforms that took effect between 1998 and 2003. This removed the remaining notches from both the employer and employee NICs schedules, replacing them with kinks at new Primary and Secondary Thresholds for employee and employer contributions, respectively. Employee NICs were also extended for the first time to earnings above the UEL, though at a very low rate (initially 1%, later 2%).

To summarise, the design of NICs creates incentives for taxpayers to bunch:

below a notch at the LEL from 1975–1976 to 1998–1999
below multiple notches above the LEL from 1985–1986 to 1998–1999
at kinks in the employee and employer schedule since 1999–2000

In addition, the zero and reduced rates that have applied above the UEL to employee contributions throughout this period and to employer contributions between 1975–1976 and 1984–1985 create downwards kinks at the UEL which should result in a dip in the distribution of earnings, analogous to the bunching expected at upwards kinks. Tables 6, 7 and 8 in Appendix B contain a full list of these thresholds, along with the size of the notch or kink.

National Insurance was originally envisaged as a ‘true’ social insurance scheme, with a broadly actuarial link between contributions paid and benefit entitlements for each individual. Insofar as there is—or, perhaps, is perceived to be—such a link, National Insurance may not have the same disincentive effects as a simple tax on earnings (Summers 1989). However, in practice the link between contributions and benefits—particularly at the margin—had already been significantly weakened by the 1970s, and had all but disappeared by 2015. For the most part, therefore, NICs acted as a straightforward tax on earnings.

There was one strongly contributory element to the National Insurance scheme. Until very recently, individuals contributing to a private pension could choose whether to ‘contract in’ or ‘contract out’ of the second pillar of the UK state pension system (initially the State Earnings-Related Pension Scheme, SERPS, and later the State Second Pension, S2P). Those contracting out were charged slightly lower rates of employee and employer NICs on earnings between the LEL and the UEL (or, since 2009, the Upper Accruals Point) in exchange for sacrificing future entitlement to SERPS/S2P.

For much of the period, our data do not record whether individuals were contracted in or out. However, the majority of people were contracted out, and the contracted-out rate is arguably a better measure of the ‘tax wedge’ associated with NICs even for those who were contracted in since the rate reduction was a roughly actuarially fair reflection of the forgone entitlements. We therefore use contracted-out NICs rates throughout.^{Footnote 8} In any case, this does not affect the size of the notch at the LEL, which is crucial for our estimation, since contracting out only reduced the marginal rate between the LEL and the UEL, not the rate charged on earnings below the LEL when the LEL was reached. The marginal rate above the notch plays only a secondary role in our analysis, in translating behavioural responses into elasticities in Sect. 4.^{Footnote 9}

Throughout our period, a much lower rate of employee NICs was available (in exchange for reduced benefit entitlement) to married women who had been claiming it almost continuously since May 1977. Since we cannot identify married women in our data, let alone those eligible for this option and taking it up, we ignore it. The requirement to remain married and in virtually continuous employment since 1977 meant that this affected a large fraction of employed women in the early years of our analysis but very few in later years: the number of women paying it fell from 4.2 million in 1978–1979 to 80,000 in 2000–2001 and 3000 in 2011–2012 (Thurley 2014). We note below where ignoring this reduced rate might significantly affect our results, and check sensitivity to this assumption.

2.2 Data

This paper uses both administrative and employer survey data: the Survey of Personal Incomes (SPI) and the New Earnings Survey Panel Dataset (NESPD).

When looking at income tax thresholds, we use data from the SPI, a stratified random sample drawn from income tax records held by HM Revenue and Customs (HMRC), that cover the tax years between 1995–1996 and 2013–2014.^{Footnote 10} The sample size increased steadily during that period, from under 60,000 individuals in 1995–1996 to over 700,000 by 2013–2014. Those with very high incomes are oversampled, while the data are not representative of those with incomes too low to pay income tax. For this reason, we do not use the SPI to examine bunching at the income tax personal allowance, from where the starting or basic rate of income tax begins to apply. Income is recorded annually and includes almost all sources that are liable for income tax. One exception is interest and investment income for starting or basic rate taxpayers who do not submit a self-assessment return. This is instead imputed in the SPI, based on administrative data on the aggregate amount and numbers receiving it and on survey evidence on the broad pattern across the income distribution. While the imputation will not allocate the income to individuals accurately (so there is some measurement error in our income variable), the imputed distribution should be a fair approximation, and suggests that savings and investment income amounts to less than £500 for over 90% of employees around the higher-rate threshold (roughly the same fraction as we observe for the self-employed, who must report such income on their tax return). This should not be enough to have a significant impact on our elasticity estimates (which are based on examining bunching in a range of £700 either side of the threshold) unless the employees who in reality bunch happen also to be the few with unusually high savings and investment income, which will not be imputed to them so their bunching will not be visible in our data.

To look at NICs we use the NESPD, a mandatory survey of employers that collects information on employees’ basic characteristics and earnings for a pay period each April, from 1975 to 2015.^{Footnote 11} The target sample frame is employees whose National Insurance number ends with a specific pair of digits.^{Footnote 12} In principle, this should deliver a 1% random sample of employees, but in practice it delivers around 0.7% due to non-response and the exclusion of non-civilian employees. At around 165,000 individuals per year, the NESPD contains a far larger sample than UK household surveys and does not suffer from the same degree of measurement error, as responses are provided by employers directly from or with reference to their payroll records.

As well as the length of its coverage, a key advantage of the NESPD is that the earnings measure corresponds closely to the tax base for NICs, most notably recording earnings in a single pay period (typically a week or month) rather than annually.^{Footnote 13} Unlike the SPI and other administrative datasets typically used to investigate bunching, the NESPD also records hours of work, which we exploit in Sect. 5.

A feature of the NESPD which complicates our analysis is that the pay period for which earnings are observed is close to the turn of the UK’s fiscal year on 6 April, when changes in NICs rates and thresholds usually take effect. For the vast majority of individuals, the earnings we observe will be subject to the NICs schedule of the fiscal year just beginning. But some years’ data contain individuals who face the schedule of the year just ending.^{Footnote 14} Bunching below the new year’s threshold may appear diffuse if these individuals bunch below the threshold from the fiscal year just ending.

In addition, the NESPD is potentially susceptible to under-sampling of employees earning below the LEL as employers are not obliged to operate PAYE on these jobs, the records of which are used to identify employees falling within the sampling frame of the survey.

There is evidence to suggest that this is not a major issue by the late 1990s:

First, if there were systematic under-sampling below the LEL, one would expect a corresponding upwards jump in the density at the LEL after 1999, when the NICs exemption threshold was raised above the LEL (removing the tax incentive to stay below the LEL but retaining it as the threshold for the mandatory reporting of earnings). We observe no such discontinuity.
Second, comparisons of the NESPD with the Family Resources Survey (FRS) and Labour Force Survey (LFS), which do not suffer from the same potential issue of under-sampling, suggest that by the late 1990s the proportion of employees with earnings below the LEL was only slightly higher in these household surveys than in the NESPD. Moreover, the proportion just above the LEL was also slightly higher in the FRS and LFS than in the NESPD, which cannot be explained by NESPD coverage issues but could be explained by oversampling of lower earners or underreporting of earnings in the FRS and LFS (see Bound et al. (2001) and references therein for a discussion of earnings underreporting and measurement error in household surveys).
Third, the UK Office for National Statistics (ONS) says in documentation accompanying the NESPD that a 2004 investigation showed ‘the impact of this under-coverage on [survey] estimates was very small’.^{Footnote 15}
Fourth, the ONS also reports that there was little change in the NESPD earnings distribution in 2014 when the PAYE sampling frame moved to ‘Real-Time Information’ and larger employers were required to include all of their employees, not just those above the LEL (Office for National Statistics 2017).

There is less evidence available for the earlier part of the period we analyse. If there was no significant under-sampling by the late 1990s, it seems unlikely that it was a major problem in the early/mid-1990s and then suddenly disappeared; but it may be more of an issue in the 1970s and 1980s. One reason under-sampling below the LEL is not a significant problem in recent years is that employers typically include all their employees on their PAYE scheme, even where it is not required; employers were less likely to do that before widespread computerisation, though anecdotal reports suggest that many still did so (it was often simpler to include them, not least because factors such as variable earnings, job changes and interactions with other income sources meant that they might well be obliged to include them at another time anyway). Comparison of the NESPD with the Family Expenditure Survey (FES) suggests that the issue of under-sampling was more significant in the 1980s than in subsequent decades, though more acute for the very lowest earners than for those just below the LEL (consistent with those quite near the LEL being more likely to need including for other reasons). Insofar as this is an issue, we may understate the extent of bunching at the LEL and we note where this may affect our results in the sections that follow.

3 Bunching at kinks and notches

Kinks are defined by a change in the marginal rate of tax at a threshold, such as in income tax schedules where income in higher bands is subject to higher rates of tax. Such tax schedules create a convex budget set that the neoclassical model of labour supply predicts should lead to bunching in the distribution of taxable income (Saez 2010). Optimisation errors, arising from an inability to perfectly control pre-tax income, may mean that this bunching appears as a diffuse mass around, rather than a spike at, the kink.^{Footnote 16}

Notches, where the average rather than marginal tax rate increases discontinuously at a threshold, should lead to bunching below, rather than at, the threshold (Kleven and Waseem 2013). This is because the notch creates a jump in tax liabilities, so those who would have located slightly above in the absence of the notch can now obtain a large tax advantage from a small relocation to below the threshold. Indeed, notches often create a dominated region of earnings that no one should locate in, regardless of how much they value consumption relative to leisure. Because both consumption and leisure can be increased by reducing earnings to just below the threshold, the only reason we should observe anyone in the dominated region is that they are subject to optimisation frictions that prevent them from adjusting their earnings.^{Footnote 17}
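The extent of the dominated region follows directly from the size of the notch. As a sketch under simplifying assumptions (a single average rate $t$ charged on all earnings once the threshold $n$ is reached, with employer contributions ignored; the symbols $t$ and $z^{D}$ are introduced here for illustration only), earnings $z \ge n$ leave less take-home pay than earnings just below $n$ whenever

$$z(1-t) < n \quad \Longleftrightarrow \quad n \le z < z^{D} \equiv \frac{n}{1-t}.$$

With $t = 0.09$, for instance, $z^{D} \approx 1.099\,n$, so the dominated region extends to roughly 9.9% above the threshold.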

Such frictions could take many forms. For example, they could reflect lack of understanding of the tax system; rigidities in contracts, pay structures or wage-setting/bargaining processes based on nominal wages; restricted hours choices available within a firm combined with frictions (such as search and matching costs or specific human capital) that make it costly to move jobs; or minimum wages or other institutional features that make reductions in earnings difficult or unattractive (such as mortgage offers or employer pension contributions being specified as percentages of nominal earnings).^{Footnote 18}

In the remainder of this section, we provide graphical evidence of the extent of bunching for different groups at kinks and notches in the UK income tax and NICs schedules. We first look at kinks in the income tax schedule using the SPI data.^{Footnote 19} Figure 2 shows the distribution of taxable income around the basic rate, higher rate, £100,000 and £150,000 thresholds, pooling observations from all waves of our data. There is little bunching at the relatively small kink created by the basic rate threshold. Bunching is pronounced, if diffuse, at the higher-rate threshold, where the net-of-tax rate (that is, one minus the tax rate) fell by between 20 and 25%. The final panel shows there is clear, but modest, bunching at the £100,000 and £150,000 thresholds.^{Footnote 20}
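As a rough check on the 20 to 25% range, write $\tau _l$ and $\tau _h$ for the marginal rates below and above the higher-rate threshold. Assuming a higher rate of 40% and a basic rate that fell from 25% to 20% over the period (these rates are an assumption here, not values read from Table 1), the proportional fall in the net-of-tax rate is

$$\frac{(1-\tau _l)-(1-\tau _h)}{1-\tau _l} = \frac{\tau _h-\tau _l}{1-\tau _l}: \qquad \frac{0.40-0.25}{0.75} = 20\%, \qquad \frac{0.40-0.20}{0.80} = 25\%.$$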

What bunching there is at each of these thresholds is mostly due to the behaviour of company owner-managers and (to a lesser extent) the self-employed, with little or no bunching among employees. Figure 3 shows this for the higher-rate threshold, where we saw the strongest bunching; Figs. 9 and 10 in Appendix B show the same results at other upwards income tax kinks. Similarly, Fig. 4 uses data from the NESPD which measures the employment earnings subject to NICs rather than total income for income tax purposes. It shows that we see substantial bunching by managers and senior officials—the standardised occupation category which includes most company owner-managers—at the NICs Secondary Threshold, but nothing for other employees who make up the vast majority of workers at this level of earnings.^{Footnote 21}

There are several possible reasons for the greater responsiveness of those running their own business. They may have more flexibility to adjust their work patterns and, more generally, to fine-tune their income and deductions in any given year, including (for example) by splitting income with a spouse or other family member. They may be more aware of financial planning and more likely to be receiving professional advice. They have more scope to misreport the level or timing of their income and deductions, since they are not subject to the same kind of third-party reporting that employees face on their salary.^{Footnote 22} And company owner-managers in particular can adjust the amount of profit they retain in the company in order to shift personal income across years (or ultimately take it as capital gains instead). Using newly linked administrative data on personal and corporate tax returns, Miller et al. (2019) show that the last of these possibilities is particularly important in explaining the high responsiveness of company owner-managers documented here.

The lack of bunching among employees might reflect a low underlying behavioural elasticity, or it might reflect frictions that attenuate the response. The patterns of bunching we observe at notches provide robust evidence that frictions play an important role. This is because (as discussed above) notches in the employee NICs schedule create a dominated region that no one should locate in, regardless of how much they value consumption relative to leisure, save for frictions that prevent them from adjusting their earnings. We first examine the notch at the LEL, where between 1975–1976 and 1998–1999 the average rate of employee NICs jumped from 0% to between 2 and 9% of earnings and the average rate of employer NICs jumped from 0% to between 3 and 10.45% of earnings.

Figure 5 shows the distribution of earnings (z) relative to this notch (n), pooling all years of NESPD over this period.^{Footnote 23} There is clear, though modest, bunching below the threshold and a dip above it. But there is also substantial mass visible just above the threshold, in the area that corresponds to the dominated region created by the notch in employee NICs. This provides compelling evidence that the majority of employees at even this low level of earnings are subject to frictions large enough to prevent them from bunching.^{Footnote 24} As Fig. 11 in Appendix B shows, there is no evidence of any bunching below the notches located higher up the earnings distribution or of any dip above them, even in the dominated region created where the average rates of employee and employer NICs each jumped by 2 percentage points between 1986–1987 and 1989–1990 (panels a and c). This suggests that virtually all employees at these higher levels of earnings faced frictions large enough to prevent bunching.
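Pooling years with different thresholds requires expressing each observation relative to the threshold it faced. The sketch below shows one way this could be done; it assumes earnings are normalised as the ratio $z/n - 1$ (the figure may instead use a difference in pounds), and the function name, bin width and window are illustrative choices rather than the settings used for Fig. 5.

```python
import numpy as np


def relative_histogram(earnings, thresholds, bin_width=0.05, window=0.5):
    """Count observations by earnings relative to the notch they faced.

    earnings, thresholds: equal-length arrays, one threshold per observation,
    so that years with different notch levels can be pooled.
    Returns (bin_centres, counts) for z/n - 1 within +/- window; observations
    outside the window are dropped.
    """
    z = np.asarray(earnings, dtype=float)
    n = np.asarray(thresholds, dtype=float)
    rel = z / n - 1.0
    n_bins = int(round(2 * window / bin_width))
    edges = np.linspace(-window, window, n_bins + 1)
    counts, _ = np.histogram(rel, bins=edges)
    centres = (edges[:-1] + edges[1:]) / 2
    return centres, counts
```

With the default settings a bin edge falls exactly at zero, so an observation exactly at the threshold is counted in the first bin above the notch, in line with liability starting once the threshold is reached.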

As with kinks, there is substantial heterogeneity in who bunches at the LEL. The first two panels of Fig. 6 show that while part-time employees bunch sharply below the LEL, there is a much less pronounced response among full-time employees. Interestingly, panels c and d show that there appears to be little difference in the bunching responses of women and men conditional upon working part-time (although there are far fewer men working part-time at this level of earnings than there are women). The final two panels show that there are also substantial differences in bunching across industries. Those working in the retail and hospitality sector, where working patterns are typically more flexible (e.g. shift work), bunch sharply below the LEL while there is no observable response among employees in the public sector.

While the presence or absence of bunching at kinks and notches has traditionally been seen as a complication in fitting structural models of labour supply to data (e.g. Burtless and Hausman 1978), recent work has instead viewed it as a potential source of variation that might be used to identify parameters summarising behavioural responses. In the next section, we use the bunching responses documented above to estimate the elasticity of taxable income (or earnings), applying bunching estimators developed by Saez (2010) for kinks and Kleven and Waseem (2013) for notches.

4 Estimating elasticities from bunching

Saez (2010) showed that the elasticity of taxable income can be inferred from the income response of the ‘marginal buncher’: the last person who, facing a convex budget set, chooses to locate at, rather than above, a kink k where the marginal tax rate rises from $\tau _l$ to $\tau _h$. As this response represents a move between two tangency points, by the definition of the ETI $e \equiv \frac{1-\tau _l}{\tau _h-\tau _l}\frac{\Delta z}{k}$ we have a relationship that depends only on known parameters of the tax schedule ($\tau _l$, $\tau _h$ and k) and the income response of the marginal buncher ($\Delta z$). This response can in turn be inferred from the amount of excess mass around the threshold $B \equiv \int _{z_l}^{z_u} (h_1(z)-h_0(z))dz$ (where $z_l$ and $z_u$ define the income range in which bunching is observed and $h_1(z)$ and $h_0(z)$ are the actual and counterfactual distributions of income) as $B = \int _{k}^{k+\Delta z} h_0(z)dz$ (Saez 2010). The crucial step is to estimate the counterfactual distribution of income that would exist in the absence of a kink, $h_0(z)$. We follow Chetty et al. (2013) and estimate $h_0(z)$ by fitting a flexible polynomial to the observed distribution of taxable income, excluding the area [$z_l, z_u$] around k.^{Footnote 25}
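The estimation steps described above can be sketched in code. This is a minimal illustration under stated assumptions, not the implementation used in the paper: the function name, the binned inputs and the synthetic data are hypothetical, excess mass is measured as actual minus fitted counts inside the excluded window, and the marginal buncher's response is recovered with the small-kink approximation $B \approx h_0(k)\Delta z$ rather than by integrating the counterfactual exactly.

```python
import numpy as np

def kink_bunching_elasticity(centers, counts, k, z_l, z_u, tau_l, tau_h, order=5):
    """Illustrative kink bunching estimator (hypothetical helper).

    centers, counts: equal-width income bins and the number of taxpayers in each.
    Returns (excess mass B, income response delta_z, elasticity e).
    """
    centers = np.asarray(centers, dtype=float)
    counts = np.asarray(counts, dtype=float)
    width = centers[1] - centers[0]

    # Fit the counterfactual on bins outside the excluded window [z_l, z_u].
    excluded = (centers >= z_l) & (centers <= z_u)
    x = (centers - k) / k  # centred and scaled for numerical stability
    coefs = np.polyfit(x[~excluded], counts[~excluded], order)
    h0 = np.polyval(coefs, x)

    # Excess mass: actual minus counterfactual counts inside the window.
    B = np.sum(counts[excluded] - h0[excluded])

    # Assumption: B is approximately h0(k) * delta_z (density per unit of income).
    h0_at_k = np.polyval(coefs, 0.0) / width
    delta_z = B / h0_at_k

    e = (1 - tau_l) / (tau_h - tau_l) * delta_z / k
    return B, delta_z, e

# Synthetic data: flat counterfactual of 1,000 per 100-unit bin, 500 bunchers at the kink.
centers = np.arange(30050.0, 50000.0, 100.0)
counts = np.full(centers.shape, 1000.0)
counts[np.argmin(np.abs(centers - 39950.0))] += 500.0

B, delta_z, e = kink_bunching_elasticity(
    centers, counts, k=40000.0, z_l=39500.0, z_u=40500.0, tau_l=0.2, tau_h=0.4
)
```

In practice the polynomial order and the excluded window are judgement calls, and standard errors would come from a bootstrap, which this sketch omits.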

When the kink is small, this approach identifies the compensated elasticity around the threshold, as the kink does not produce income effects over the bunching segment [$k, k+\Delta z$] (Saez 2010). At larger kinks, without making assumptions about the functional form of utility, the elasticity identified is instead a weighted average of the local compensated and uncompensated elasticities (Kleven 2016). In either case, the bunching estimator easily extends to accommodate heterogeneity in preferences, instead identifying the (compensated, or combined) local average elasticity at the threshold.^{Footnote 26}

We apply this estimator to the bunching already documented at the higher-rate, £100,000 and £150,000 thresholds, separately for employees, the self-employed and company owner-managers, who each face a different change in tax rates at these thresholds.^{Footnote 27} Figure 7 shows these estimates at the higher-rate threshold annually for each year covered by our SPI data, along with the bootstrapped 95% confidence interval. The estimates for employees are precise, and significantly different from zero in only 2 of the 16 years (1999–2000 and 2013–2014). On average, we estimate a higher ETI for company owner-managers (0.078) than the self-employed (0.046), though the estimates are imprecise in the 1990s, when sample sizes were smaller. As Fig. 7 shows, the relative responsiveness of the two groups seems to have changed over time: our central estimate for the self-employed declines from about 0.10 to almost 0, while that for company owner-managers does the reverse.^{Footnote 28} The reasons for this apparent gradual decline in the ETI among the self-employed and rise among company owner-managers would be an interesting topic for future research.

Table 2 shows estimates for the same groups at the £100,000 and £150,000 thresholds in 2010–2011 and 2013–2014, the 2 years covered by our SPI data in which these thresholds existed. Again, estimates are near zero for employees, and higher for company owner-managers than the self-employed (though both are less precisely estimated than at the higher-rate threshold because sample sizes are smaller).

Table 2 Estimate of the ETI at £100,000 & £150,000 income tax thresholds.

Taken together, results from the bunching estimator applied to income tax kinks show clear differences in elasticities, with company owner-managers and the self-employed more responsive than employees. Yet the estimated ETIs are smaller than estimates using different methodologies for high-income individuals in the UK (e.g. Brewer et al. 2010; Browne and Phillips 2017), and toward the lower end of the range estimated in the wider literature (Saez et al. 2012).

However, allowing for even relatively small optimisation frictions could reconcile these estimates with much larger underlying taxable income elasticities. Following Chetty (2012), we illustrate possible effects frictions may be having on our estimates, assuming a quasi-linear utility function and a fixed adjustment cost equal to 1% of utility. Our estimate of 0.204 for company owner-managers at the higher-rate threshold in 2013–14 is consistent with an elasticity unattenuated by frictions of up to 1.52, while even the smaller estimates for employees and the self-employed at this threshold are consistent with unattenuated elasticities well in excess of one.

If we are interested in obtaining an estimate of the elasticity of taxable income that can be used to predict the response of individuals in a different setting to that in which it was obtained, then it is important to distinguish whether limited bunching (and a correspondingly small estimated elasticity) is the result of low underlying responsiveness or non-trivial frictions. If frictions take the form of fixed costs then we would expect disproportionately stronger responses to bigger tax changes, and if frictions dissipate in the long run then long-run behavioural responses may be larger than short-run responses. And as well as improving our understanding of likely responses to tax changes, frictions may be a policy concern in their own right. Elasticities at least partly reflect utility-maximising behaviour by taxpayers given their preferences between (say) consumption and leisure;^{Footnote 29} whereas if frictions are preventing people from maximising their utility, leading to an inefficient allocation of resources, that might highlight potential gains from policy measures to reduce frictions, ranging from greater information provision to reforms to labour market institutions.

Kinks, unfortunately, provide no means of distinguishing high frictions from low underlying elasticities without variation in the size or location of the kink, and even then only with assumptions on the form frictions take or multiple changes in the size of kinks (Gelber et al. 2020). Notches, however, provide a more promising source of variation for distinguishing these explanations. This is because notches provide an additional empirical moment—the observed density in the strictly dominated region above the notch—that can be used to account for attenuation from frictions.

Kleven and Waseem (2013) show that the ratio of the empirical to the counterfactual density in the dominated region, ${\hat{a}}$, can be used to measure the share of individuals who do not respond to the notch because of frictions. This can then be used to scale up the estimated bunching below the notch (${\hat{b}}$, the difference between empirical and counterfactual distributions in the range $[z_l,n]$) to what it would be if no one were subject to frictions large enough to prevent them from bunching. The authors show that this scaled-up bunching mass, ${\tilde{b}} \equiv \frac{{\hat{b}}}{(1-{\hat{a}})}$, is related to the unattenuated earnings response of the marginal buncher, which can be estimated by calculating the earnings level at which the counterfactual mass between n and that point equals the scaled-up bunching mass.^{Footnote 30}
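The scaling step can be illustrated with a short sketch. It assumes binned counts, a notch that coincides with a bin edge, and linear interpolation of the counterfactual mass within bins; the function names and numbers are hypothetical and are not taken from the paper's data.

```python
import numpy as np

def scale_up_bunching(b_hat, actual_dominated, counterfactual_dominated):
    """Return (a_hat, b_tilde): the non-responding share and the scaled-up bunching mass."""
    a_hat = actual_dominated / counterfactual_dominated
    if not 0.0 <= a_hat < 1.0:
        raise ValueError("a_hat must lie in [0, 1) for the scaling to be defined")
    return a_hat, b_hat / (1.0 - a_hat)

def earnings_response(edges, h0_counts, n, mass):
    """Distance above the notch n at which cumulative counterfactual mass equals `mass`.

    edges: bin edges (one more than h0_counts); n is assumed to equal one of the edges.
    """
    edges = np.asarray(edges, dtype=float)
    h0 = np.asarray(h0_counts, dtype=float)
    start = int(np.searchsorted(edges, n))
    # Cumulative counterfactual mass at each edge from n upwards.
    cum = np.concatenate(([0.0], np.cumsum(h0[start:])))
    if mass > cum[-1]:
        raise ValueError("mass exceeds the counterfactual mass above the notch")
    return float(np.interp(mass, cum, edges[start:])) - n

# Hypothetical numbers: flat counterfactual of 100 per bin, notch at 4.
edges = np.arange(0.0, 11.0)
h0_counts = np.full(10, 100.0)
a_hat, b_tilde = scale_up_bunching(b_hat=50.0, actual_dominated=80.0,
                                   counterfactual_dominated=100.0)
dz_attenuated = earnings_response(edges, h0_counts, n=4.0, mass=50.0)
dz_unattenuated = earnings_response(edges, h0_counts, n=4.0, mass=b_tilde)
```

With 80% of individuals in the dominated region not responding, the bunching mass is scaled up fivefold, and under a flat counterfactual the implied earnings response scales by the same factor; with a sloping counterfactual the two factors would differ.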

There are then two ways of converting this unattenuated earnings response into a local average unattenuated elasticity. The first is to assume a functional form for utility and use the fact that by definition the marginal buncher is indifferent between locating at the notch n and at another higher point, $z^*$. This defines a relationship between known parameters of the tax schedule, the estimated unattenuated earnings response of the marginal buncher $\Delta {\hat{z}}^*$ and the local average (structural) earnings elasticity $e_s$, which can be solved numerically. The second approach is a reduced form one that approximates the effect of a jump in average tax rates as a large change in marginal tax rates. As a notch will induce a larger bunching response than the implicit kink would, this reduced form estimate will be upwardly biased by treating the bunching response as if it were generated by a kink.

As with kinks, the key empirical entity to be estimated is the counterfactual distribution of earnings around the notch. We follow Kleven and Waseem (2013) in estimating this by fitting a flexible polynomial to the number of individuals in small bins of earnings, excluding observations in the range $[z_l, n]$ below the notch that is obviously affected by bunching and an initially arbitrary range $[n, z_{u0}]$ above the notch. The polynomial is then repeatedly estimated, increasing the excluded area above the threshold until it reaches a point ($z_u$) where the estimated excess mass between the actual and counterfactual earnings distributions below the threshold, ${\hat{b}}$, equals the estimated missing mass between the actual and counterfactual distributions above the threshold, ${\hat{m}}$. The resulting estimate of the excess mass, ${\hat{b}}$, can then be filled in directly under the counterfactual to estimate the attenuated earnings response of the marginal buncher ($\Delta {\hat{z}}$), or scaled up to account for frictions and then filled in under the counterfactual to estimate the unattenuated earnings response $\Delta {\hat{z}}^*$ and ETI.^{Footnote 31}
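The iterative procedure can be sketched as follows. This is a simplified illustration rather than the paper's code: the function name, the stopping rule (the first upper bound at which missing mass reaches excess mass, within a small tolerance) and the synthetic data are assumptions.

```python
import numpy as np

def fit_notch_counterfactual(centers, counts, n, z_l, z_u0, order=5, tol=1e-6):
    """Widen the excluded range above the notch until missing mass matches excess mass.

    Returns (z_u, b_hat, m_hat, h0), where h0 is the fitted counterfactual count per bin.
    """
    centers = np.asarray(centers, dtype=float)
    counts = np.asarray(counts, dtype=float)
    x = centers - n  # centre on the notch for numerical stability
    below = (centers >= z_l) & (centers < n)

    for z_u in centers[centers >= z_u0]:
        above = (centers >= n) & (centers <= z_u)
        keep = ~(below | above)
        coefs = np.polyfit(x[keep], counts[keep], order)
        h0 = np.polyval(coefs, x)
        b_hat = np.sum(counts[below] - h0[below])   # excess mass below the notch
        m_hat = np.sum(h0[above] - counts[above])   # missing mass above the notch
        if m_hat >= b_hat * (1.0 - tol):
            return float(z_u), float(b_hat), float(m_hat), h0
    raise RuntimeError("missing mass never reached excess mass")

# Synthetic data: flat counterfactual of 1,000 per bin, notch at 20.
# 300 individuals move from the three bins above the notch to the bin just below it.
centers = np.arange(0.5, 40.0, 1.0)
counts = np.full(centers.shape, 1000.0)
counts[19] += 300.0
counts[20:23] -= 100.0

# A low polynomial order is enough here because the synthetic counterfactual is flat.
z_u, b_hat, m_hat, h0 = fit_notch_counterfactual(
    centers, counts, n=20.0, z_l=19.0, z_u0=20.5, order=2
)
```

With real data the two masses will rarely match exactly at a bin boundary, so the chosen $z_u$ depends on bin width and on the stopping rule adopted.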

We apply this estimator to the bunching observed at the Lower Earnings Limit for three sets of tax years where the jump in average employee NICs rates (and so the dominated region) is of the same size: 1983–1984 to 1985–1986 (9ppts), 1986–1987 to 1989–1990 (5ppts) and 1990–1991 to 1998–1999 (2ppts).^{Footnote 32} Table 3 shows that both the reduced form (${\hat{e}}_r$) and structural (${\hat{e}}_s$) unattenuated elasticity estimates are greater than zero, but modest. For example, the reduced form estimate is 0.238 for both the 1990–1991 to 1998–1999 and 1986–1987 to 1989–1990 periods, similar to the hours elasticities for low-paid married women over the same time period estimated by Blundell et al. (1998), and to the elasticity of earnings with respect to employee NICs estimated by Adam et al. (2019) using the same data as this paper but exploiting tax changes over time. Both the reduced form and structural estimates for the period 1983–1984 to 1985–1986 are substantially smaller. This may partly reflect the married women’s reduced rate described in Sect. 2.1, which at this time still affected a substantial number of women: if the bunching we observe was generated by a smaller notch than we are assuming, then the true elasticity will be somewhat higher than these estimates.^{Footnote 33} It may also reflect the potential under-sampling of those below the LEL in the NESPD discussed in Sect. 2.2, though as noted there this is more acute far below rather than just below the LEL.

The elasticities for the period 1990–1991 to 1998–1999 are less precisely estimated, as shown by the large bootstrapped standard errors.^{Footnote 34} In addition, all of our central estimates are quite sensitive to the exact specification of the fitted polynomial, the range of income over which the counterfactual is estimated, and the excluded range of earnings below the threshold.

The estimated share of individuals who do not respond to the notch because of frictions, ${\hat{a}}$, however, is quite precisely estimated and robust to specification. While these may also be affected by potential under-sampling below the LEL, our estimates suggest that around 75–90% of employees at this low level of earnings are subject to frictions sufficiently large to prevent them from bunching: equivalent to at least 9% of gross earnings in the 1983–84 to 1985–86 period, more than £400 per year in today’s prices. As a result, our estimate of the unattenuated percentage earnings response, $\frac{\Delta {\hat{z}}^*}{n}$, is between 1.5 and 2 times the attenuated one $\frac{\Delta {\hat{z}}}{n}$ obtained solely from the estimated bunching response below the threshold, ${\hat{b}}$.

Table 3 Estimates from bunching at the NICs Lower Earnings Limit.

These estimates provide compelling evidence of the important role that frictions can play in attenuating the earnings response of employees, and are reinforced by the complete absence of bunching at notches further up the earnings distribution (Fig. 11, discussed in Sect. 3).^{Footnote 35} Frictions of this magnitude could also play an important role in reconciling microeconomic and macroeconomic estimates of the compensated elasticity, as suggested by Chetty (2012). Macroeconomic estimates of this parameter are typically derived by calibrating macroeconomic models so that they match cross-country variation in aggregate hours of work, and in general are significantly larger than those obtained with microeconomic data exploiting policy reforms. Chetty suggests that allowing for frictions with utility costs equivalent to 1% of consumption can fully reconcile these differences. Our estimates imply that most low- and middle-earning employees face frictions that are even larger.

5 Bunching responses and hours of work

Unlike the administrative data used in most studies of bunching at tax thresholds, the NESPD contains information on hours of work as well as earnings for around 85% of employees sampled. In this section, we show how hours data can be used to reveal more about the nature of bunching responses.

Figure 8 plots mean log hours in each bin of log earnings around the LEL for the three periods 1983–1985, 1986–1989 and 1990–1998. Since $\ln (z) = \ln (h) + \ln (w)$, this decomposes log earnings at every level of earnings into mean log hours and mean log hourly wage components. We then fit a 7th-order polynomial to the data points excluding those in the interval $[z_l, z_u]$ affected by bunching.

This estimated polynomial allows us to interpolate counterfactual mean log hours in the interval $[z_l, z_u]$ affected by bunching: that is, to estimate what mean log hours (and, by subtraction, mean log hourly wages) at each earnings level would have been in the absence of behavioural responses to the notch. The slope of this counterfactual mean log hours function tells us, before bunching responses, to what extent as we move up the log earnings distribution people had higher earnings because they had higher hours rather than because they had higher wages. Across all three periods, the estimated slope is around 0.9 in the neighbourhood of the LEL: that is, around 90% of the extra (log) earnings of higher earners was because they worked longer hours and only 10% was because of higher (log) hourly wages.^{Footnote 36} This is precisely estimated and robust to the choice of polynomial order.
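A minimal sketch of this interpolation step follows. It assumes bin-level data, with log earnings relative to the notch measured in log units (not multiplied by 100) and synthetic values chosen so that the true slope is 0.9; the function and variable names are hypothetical.

```python
import numpy as np
from numpy.polynomial import Polynomial

def fit_counterfactual_log_hours(log_earn, mean_log_hours, z_l, z_u, order=7):
    """Fit a polynomial to mean log hours by log-earnings bin, excluding [z_l, z_u]."""
    x = np.asarray(log_earn, dtype=float)
    y = np.asarray(mean_log_hours, dtype=float)
    keep = (x < z_l) | (x > z_u)
    return Polynomial.fit(x[keep], y[keep], order)

# Synthetic data: mean log hours rise by 0.9 per unit of log earnings,
# with a distortion inside the window affected by bunching.
log_earn = np.linspace(-0.5, 0.5, 101)
mean_log_hours = 2.8 + 0.9 * log_earn
affected = (log_earn >= -0.05) & (log_earn <= 0.10)
mean_log_hours[affected] += 0.05

poly = fit_counterfactual_log_hours(log_earn, mean_log_hours, z_l=-0.05, z_u=0.10)
counterfactual = poly(log_earn[affected])  # interpolated mean log hours in the window
hours_slope = poly.deriv()(0.0)            # slope of mean log hours at the notch
wage_slope = 1.0 - hours_slope             # remainder attributed to log hourly wages
```

Because $\ln (z) = \ln (h) + \ln (w)$, the hours and wage slopes sum to one, so only the hours profile needs to be fitted.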

Under the assumption that those who do not bunch do not adjust their earnings at all in response to the notch, looking at actual and counterfactual mean log hours at different earnings levels, alongside the actual and estimated counterfactual earnings densities, can potentially tell us about the characteristics and behaviour of bunchers.^{Footnote 37}

5.1 Decomposition of earnings responses

First, we use our estimated counterfactual hours profiles to look at how much of the earnings response of bunchers was through reductions in hours of work rather than hourly wages. The new tax responsiveness literature emphasises that responses to taxation might take the form of reduced hourly wages, rather than reduced hours, because recorded wages capture margins of behaviour such as effort and income shifting. Moreover, the discrete jump in tax liability at notches (unlike at kinks) means that it can make sense to accept lower hourly wages to locate below the threshold even if there is no compensating utility gain in leisure, reduced effort or anything else.

Saez et al. (2012), among others, argue that the responsiveness of taxable income is driven less by hours of work than by factors such as tax evasion, avoidance and income shifting. This conclusion is based mainly on how the taxable income of high-income individuals responds to income tax changes. However, the potential tax-reducing responses to NICs are different from income tax because the tax bases differ, and low-paid workers may respond differently from high-income individuals.^{Footnote 38}

To investigate how employees responded to the notch at the LEL, we estimate how much of the total earnings reduction was accounted for by a reduction in hours. We do this by calculating the reduction in mean log hours as a fraction of the reduction in mean log earnings, using the (observed) actual and (estimated) counterfactual log hours and log earnings across the interval $[z_l, z_u]$ affected by bunching.^{Footnote 39} Formally, we estimate:

$$\begin{aligned} {\hat{h}} \equiv \underbrace{\left( \frac{ \sum _{j=z_l}^{z_u} {\tilde{h}}_{0j} f_{0j} }{\sum _{j=z_l}^{z_u}f_{0j} } - \frac{ \sum _{j=z_l}^{z_u} {\tilde{h}}_{1j} f_{1j} }{\sum _{j=z_l}^{z_u}f_{1j} } \right) }_{\equiv \Delta {\tilde{h}}} \div \underbrace{\left( \frac{ \sum _{j=z_l}^{z_u} {\tilde{z}}_{0j} f_{0j} }{\sum _{j=z_l}^{z_u} f_{0j} } - \frac{ \sum _{j=z_l}^{z_u} {\tilde{z}}_{1j} f_{1j} }{\sum _{j=z_l}^{z_u} f_{1j} } \right) }_{\equiv \Delta {\tilde{z}}} \end{aligned}$$

(1)

where ${\tilde{h}}_{1j}$ is the actual and ${\tilde{h}}_{0j}$ the counterfactual mean log hours in log earnings bin j; $f_{1j}$ is the actual and $f_{0j}$ the counterfactual number of individuals in log earnings bin j; and ${\tilde{z}}_{1j}$ is the actual and ${\tilde{z}}_{0j}$ the counterfactual mean log earnings in log earnings bin j.
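Equation (1) is a ratio of differences in count-weighted means, which can be sketched directly. The function name and the two-bin example are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import numpy as np

def hours_share_of_earnings_response(h0, h1, z0, z1, f0, f1):
    """Share of the mean log earnings reduction accounted for by mean log hours.

    h0, h1: counterfactual and actual mean log hours per log-earnings bin.
    z0, z1: counterfactual and actual mean log earnings per bin.
    f0, f1: counterfactual and actual numbers of individuals per bin.
    All arrays cover only the bins in the interval affected by bunching.
    """
    delta_h = np.average(h0, weights=f0) - np.average(h1, weights=f1)
    delta_z = np.average(z0, weights=f0) - np.average(z1, weights=f1)
    return delta_h / delta_z

# Two bins, one below and one above the notch; 50 people move from the upper to the lower bin.
f0 = np.array([100.0, 100.0])
f1 = np.array([150.0, 50.0])
z0 = np.array([-0.1, 0.1])
z1 = np.array([-0.1, 0.1])
h0 = np.array([2.70, 2.90])
h1 = np.array([2.72, 2.90])  # movers bring slightly higher hours into the lower bin

share = hours_share_of_earnings_response(h0, h1, z0, z1, f0, f1)
```

In this example mean log earnings fall by 0.05 and mean log hours by 0.035, so 70% of the earnings reduction comes through hours. Because the estimate is a ratio, a small and noisy denominator produces wide standard errors.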

Table 4 shows that the reduction in mean log earnings is estimated, with reasonable precision, at 0.010, 0.011 and 0.007, respectively, in the three periods 1983–1985, 1986–1989 and 1990–1998. As these are averages across both bunchers and non-bunchers, this implies that those who did bunch reduced their log earnings by 0.244, 0.169 and 0.144, respectively, in these three periods (around 20% on average).^{Footnote 40}

However, the reduction in mean log hours is less precisely estimated, and when combined with that for earnings, results in an estimated share of the total earnings reduction accounted for by a reduction in hours that is associated with very large standard errors. The approach we have developed for decomposing earnings responses into hours and hourly wage responses thus does not appear to have enough power to be informative in our application, though it may yield more interesting results in other cases.

Table 4 Estimated share of total earnings reduction through hours.

5.2 Selection of bunchers

Our approach also allows us to identify selection in who responds to the incentives created by the notch at the LEL. To do this, we compare the actual mean log hours of non-bunchers with the estimated counterfactual mean log hours in the region above the notch affected by bunching; that is, we compare ${\tilde{h}}_1$ to ${\tilde{h}}_0$ in the interval $[n, z_u]$. This tells us whether non-bunchers (and, by extension, whether bunchers) were on average high-hours, low-wage workers or low-hours, high-wage workers compared to others with the same counterfactual earnings. Specifically, we test the hypothesis:

$$\begin{aligned} \sum _{j=n}^{z_u} f_{0j} {\tilde{h}}_{1j} - \sum _{j=n}^{z_u} f_{0j} {\tilde{h}}_{0j} = 0 \end{aligned}$$

(2)

Table 5 shows these estimates of mean log hours, along with the differences between them and the associated standard errors.^{Footnote 41} These differences are negative throughout, and statistically significant for the periods 1986–1989 and 1990–1998, indicating that those who bunched at the LEL were higher-hours, lower-wage workers than those who did not. We show in Sect. 3 that bunching at the LEL was almost entirely driven by the responses of part-time workers, but this is not inconsistent: since about 90% of workers around the LEL were part-time, the implication of our results is that higher-hours, lower-wage types among part-time workers were more likely to bunch than lower-hours, higher-wage types.^{Footnote 42}

Given these estimates, and the counterfactual log earnings densities shown in Fig. 5, we can calculate how much higher bunchers’ mean log hours were than non-bunchers’: 0.129 higher in 1986–1989 and 0.124 higher in 1990–1998.^{Footnote 43} To give a sense of magnitudes, simply taking exponentials of these mean log hours suggests that bunchers would typically have worked around 19 hours per week in the absence of the notch, whereas non-bunchers with the same earnings typically worked around 17 hours per week.^{Footnote 44}
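As an illustrative check on these magnitudes, taking the rounded figure of 17 hours for non-bunchers as given, exponentiating the log gaps yields the implied ratio of typical hours and the corresponding level for bunchers:

$$\begin{aligned} e^{0.129} \approx 1.138, \quad 17 \times 1.138 \approx 19.3; \qquad e^{0.124} \approx 1.132, \quad 17 \times 1.132 \approx 19.2 \end{aligned}$$

Both periods are therefore consistent with bunchers working roughly two hours more per week than non-bunchers with the same counterfactual earnings.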

Table 5 Actual and counterfactual hours in region above LEL affected by bunching.

6 Conclusion

This paper has investigated behavioural responses to income tax and social security contributions, exploiting cross-sectional variation created by thresholds in UK tax schedules over a 40-year period. At thresholds where the marginal tax rate fell, we found no sign of a dip in the density of the earnings distribution. But there was clear, if modest, bunching at thresholds where the marginal rate rose, especially the higher-rate income tax threshold. We found that company owner-managers and the self-employed were the most responsive to kinks in the income tax schedule, reflecting their greater scope for income manipulation, tax planning and evasion. In recent years company owner-managers have been much more responsive than the self-employed, whereas if anything the opposite was true in the 1990s.

In contrast, employees did not respond at all to kinks in the tax schedule. Notches, where the average rate of tax changes, provide compelling evidence that the reason for this is that most low-paid employees faced substantial optimisation frictions: while there was some bunching by employees below the notch at the National Insurance Lower Earnings Limit in the 1980s and 1990s, only a minority responded in this way. In some years, this meant non-responders paid an additional 9% of total earnings in employee contributions and a further 10.45% in employer contributions. Frictions of this magnitude suggest that long-run responses, or responses to large reforms, could be larger than implied by elasticities estimated from short-term responses to small tax differentials, and could play an important role in reconciling micro- and macro-based estimates of labour supply elasticities, as suggested by Chetty (2012).

We found that there was substantial heterogeneity in which employees bunched at the LEL. Those employed in the hospitality or retail sectors were far more likely to respond than those working in less flexible sectors. Part-time workers were more likely to bunch than full-time workers, though those part-time workers who did bunch were typically higher-hours, lower-wage types than those who did not. Notably, we found little difference in the responses of women and men conditional upon working part-time. This raises the question whether well-documented differences in observed behavioural responses between groups (e.g. men and women, employees and the self-employed) are a result of heterogeneity in underlying preferences or in frictions faced. If increases in the share of women working full- rather than part-time mean that they face larger frictions to adjusting their hours of work, that might help to explain why Blau and Kahn (2007), among others, find a fall in the elasticity of labour supply for women over time. Deepening our understanding of the nature of the frictions we highlight in this paper is not only crucial for integrating disparate evidence on elasticities into a sophisticated understanding of taxpayer responses, but may also have important policy implications in its own right, since to the extent that frictions prevent people from maximising utility, addressing their underlying causes may provide significant welfare gains.

Notes

See Saez et al. (2012) for a critical review of this new tax responsiveness literature. Adam et al. (2019) exploit the panel dimension of the same data used in this paper to estimate earnings elasticities using a best practice version of this approach, benefiting from an unusually long time period with extensive policy variation.
For example, while Chetty et al. (2011) observe substantial bunching by the self-employed around income tax thresholds in Denmark, that by wage earners is much smaller and corresponds to an estimate of the ETI that is very close to 0.
While the absence of bunching can be persistent (Tazhitdinova 2015), there is evidence to suggest frictions somewhat fade over time (Paetzold 2019; Zaresani 2019; Gelber et al. 2020). Moreover, depending on the nature of frictions, responses to reforms can strengthen over time even while there remains little bunching (e.g. He et al. 2020).
Some papers have examined bunching at kinks in the UK in limited contexts. Britton and Gruber (2019) find little bunching by graduates at kinks in the UK’s income-contingent student loan repayment schedule, Miller et al. (2019) examine bunching by company owner-managers at the higher-rate income tax threshold, and Tazhitdinova (2015) documents similar patterns of bunching to us at some kinks in the UK income tax and SSC schedules in the 2000s. We look at both kinks and notches in the UK income tax and SSC schedules over a 40-year period, using data that are more closely aligned with the base each tax is levied on (see Sect. 2).
NICs are also paid by the self-employed, at significantly lower rates, but we do not analyse self-employed NICs in this paper.
HM Revenue and Customs estimates that for the 2012–2013 tax year, only 10.74 million out of 30 million income taxpayers had to fill in a self-assessment tax return: primarily the self-employed, those with significant unearned income and those with incomes over £100,000.
The vast majority of adults are entitled to this personal allowance, although higher allowances have at times existed for lone parents, older taxpayers and married couples. Our analysis takes account of these different allowances, but for convenience we refer throughout to ‘the’ personal allowance.
These rates are contained in Appendix Table 6. We use the contracted-out rate for those contributing to a defined-benefit pension: the contracted-out rate for those contributing to a defined-contribution pension (which was less common) varied by age.
Earning above the LEL conveys certain other entitlements. But we do not see any bunching above the LEL after 1999, when the LEL stopped affecting contributions but continued to affect entitlement. This suggests that people did not place a high value on (or did not understand) these entitlements and that our estimates of bunching below the LEL in earlier years should not be strongly affected by these weak contributory links.
Data for 2008–2009, 2011–2012 and 2012–2013 are currently unavailable.
Appendix A provides more information on this dataset and the features that are relevant to our analysis.
The use of a fixed pair of digits from 1975 onwards means that the NESPD constitutes a panel. While this paper’s focus is on the cross-sectional variation created by tax thresholds, Adam et al. (2019) estimate the elasticity of taxable earnings using the panel dimension of the NESPD and reforms to NICs.
See Appendix A for further details.
See Appendix A for further details.
See https://ec.europa.eu/eurostat/cache/metadata/EN/earn_ses2010_esqrs_uk.htm.
By the same reasoning, a downwards kink should result in a hole in the distribution of income, which may materialise as a less sharp dip if taxpayers are unable to perfectly control their pre-tax income.
Not all notches create dominated regions. In our setting, notches in the employee NICs schedule create a dominated region but notches in the employer NICs schedule do not: the structure of employer NICs—specifically, being levied (unlike employee NICs) on a base that excludes the tax itself—means that there is no range in which an increase in gross earnings (and labour cost) is associated with a reduction in consumption. We should still see bunching at such notches, since there is still a discrete tax advantage to be had from an infinitesimal reduction in earnings.
Note that locating in a dominated region requires an explanation beyond frictions in adjusting ‘real’ behaviour such as hours of work, since in the dominated region both employer and employee could gain financially from a reduction in gross earnings without any accompanying change in real behaviour. However, this would require co-operation by both employer and employee and might still be prevented by factors such as contractual rigidities or a binding minimum wage; in such cases, and for those above the dominated region, frictions in adjusting real behaviour can still help to explain a lack of behavioural response.
The SPI does not allow us to look at the personal allowance kink because it is not representative of individuals with income below this level, where we would expect bunching to occur.
We do not observe any hole or dip in the distribution of income at the downwards kink created by the withdrawal of the personal allowance, even among company owner-managers and the self-employed. We are not alone in this: as Kleven (2016) points out, ‘no research has found evidence of holes around nonconvex kink points’.
Again, we do not see any dip in the distribution at the downwards kink created by the UEL.
On the importance of third-party reporting, see Kreiner et al. (2016).
For the analysis of notches in the NICs schedule we normalise taxable earnings relative to the LEL by plotting the distribution in bins of $\ln (z/n) \times 100$ so that 0 represents the threshold in each year and positive (negative) numbers mean having earnings above (below) the threshold. This natural log normalisation makes little difference to the bunching estimates but allows us to decompose (log) earnings responses into additive (log) hours and (log) hourly wage components in Sect. 5. In addition, because employees tend to be paid in round number amounts, particularly multiples of £10 per week or £100 per year, we follow Kleven and Waseem (2013) in dropping all individuals with weeklyised or annualised earnings that take common round number values to avoid conflating bunching in response to a threshold with a spike in the distribution of earnings at such an amount.
In Sect. 4, we quantify the proportion of employees that are subject to frictions sufficient to prevent them from bunching.
Our main estimates use a fifth-order polynomial with an excluded range in 2015–2016 prices of £700 around the higher-rate kink, and £7,000 around the £100,000 and £150,000 kinks. Blomquist and Newey (2017) argue that the choice of these parameters constitutes an assumption about the form of preference heterogeneity, to which estimates may be sensitive. We therefore test the sensitivity of our estimates to varying these choices, and note below where they are sensitive.
Note that, in the presence of heterogeneity, k + $\Delta z$ is not the same as $z_u$: k + $\Delta z$ is the mean (elasticity) marginal buncher, used to identify the average elasticity, whereas $z_u$ is the highest (elasticity) marginal buncher, the top of the income range affected at all by bunching, which we want to exclude when estimating the no-bunching counterfactual income distribution.
Since the estimated elasticity depends on the size of the kink giving rise to bunching, we must estimate it separately for each group or period for which the size of the kink was different. We do not estimate ETIs at the NICs secondary threshold because the size of the kink changes over time and our annual sample sizes are too small to apply the estimator to a single year’s data for the only group we observe bunching at this threshold (managers and senior officials).
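To illustrate why the size of the kink matters, the sketch below uses the standard isoelastic, quasi-linear bunching relation from Saez (2010), $(k + \Delta z)/k = [(1 - t_0)/(1 - t_1)]^{e}$. Using this relation is an assumption made for illustration and may differ from the exact estimator applied in the paper; the function name and all numbers are hypothetical.

```python
import math

def kink_elasticity(delta_z, k, t0, t1):
    """Elasticity implied by the marginal buncher's response at a kink.

    delta_z: earnings response of the marginal buncher
    k:       earnings level at which the kink is located
    t0, t1:  marginal tax rates below and above the kink
    """
    return math.log(1.0 + delta_z / k) / math.log((1.0 - t0) / (1.0 - t1))

# The same hypothetical earnings response at two kinks of different size
small_kink = kink_elasticity(delta_z=1000.0, k=40000.0, t0=0.20, t1=0.40)
large_kink = kink_elasticity(delta_z=1000.0, k=40000.0, t0=0.20, t1=0.60)
```

Under this relation, an identical earnings response implies a smaller elasticity at the larger kink, which is why groups or periods facing kinks of different sizes cannot simply be pooled.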
The difference in elasticities between the two groups is not statistically significant at the 95% level in the 1990s, when sample sizes were smaller, but is highly significant from 2002.
Though even the ‘unattenuated’ ETIs we estimate below are not structural preference parameters: they are also functions of the tax base and enforcement, and may highlight the extent of avoidance and evasion opportunities.
This approach will in fact deliver a downwardly biased estimate of the unattenuated earnings response. This is because the utility gain from bunching declines with distance above the notch and so the share of the population who are constrained from bunching will be larger further above the notch. As our estimate ${\hat{a}}$ is derived from the dominated region immediately above the notch, it will therefore be downwardly biased.
Note that, as with kinks, in the presence of heterogeneous elasticities the earnings of the highest buncher ($z_u$, obtained by filling out ${\hat{b}}$ between the actual and counterfactual earnings distributions above the notch), which defines the top of the bunching region excluded when estimating the final polynomial, will be different from the earnings of the mean marginal buncher ($n + \Delta {\hat{z}}$, obtained by filling out ${\hat{b}}$ between the x-axis and the counterfactual earnings distributions above the notch).
While NICs are the only tax or benefit withdrawal rate that changes discontinuously at the LEL, an individual’s effective marginal tax rate below the LEL (t) depends on household characteristics not observed in the NESPD. To account for this, we use smaller-scale but richer household survey data along with TAXBEN, the Institute for Fiscal Studies’ tax and benefit microsimulation model for the UK, to calculate the average t faced by those below the threshold.
If we made the alternative (extreme) assumption that all individuals employed in this period were claiming the married women’s reduced rate, the elasticities implied by the bunching we observe would be 0.115 (reduced form estimate) or 0.095 (structural estimate) rather than 0.085 and 0.052, respectively.
Standard errors are calculated by drawing (with replacement) bootstrapped distributions of normalised earnings from the empirical one, and repeating the estimation procedure 1000 times.
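The resampling procedure can be sketched as follows. This is a generic bootstrap under stated assumptions, not the authors' code: `estimator` stands in for the full bunching estimation procedure, and the function name, seed and example data are hypothetical.

```python
import random
import statistics

def bootstrap_se(sample, estimator, reps=1000, seed=0):
    """Bootstrap standard error of estimator(sample).

    Draws `reps` resamples of the same size with replacement, re-applies
    the estimator to each, and returns the standard deviation of the results.
    """
    rng = random.Random(seed)
    n = len(sample)
    estimates = []
    for _ in range(reps):
        resample = [sample[rng.randrange(n)] for _ in range(n)]
        estimates.append(estimator(resample))
    return statistics.stdev(estimates)

# Hypothetical normalised earnings; the mean stands in for the real estimator
normalised_earnings = [float(i) for i in range(100)]
se_of_mean = bootstrap_se(normalised_earnings, statistics.mean)
```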
We cannot formally estimate ${\hat{a}}$ in these cases, since without any bunching we cannot identify an excluded region and estimate a counterfactual density. But the absence of any sign of a dip in the distribution in the dominated region above the notch implies that ${\hat{a}}$ is around 1, i.e. virtually everyone in the dominated region is subject to frictions bigger than the size of the notch.
The central estimates for the slope of the polynomial at the LEL are 0.88 for 1983–1985, 0.92 for 1986–1989 and 0.86 for 1990–1998. The standard errors are about 0.02 in each case and the confidence intervals all include 0.9.
Throughout this section, we restrict our analysis to the 85% of our sample for whom we observe hours of work as well as earnings. References to the actual and counterfactual earnings densities therefore refer to variants of those shown in Fig. 12 which are re-estimated for this restricted sample. In practice, these look very similar to those for the full sample.
To the extent that low-paid workers do respond through tax evasion, avoidance and income shifting, our analysis will capture this in the implied hourly log wage.
Actual and counterfactual mean log hours in each earnings bin are weighted by the corresponding (actual or counterfactual) density of earnings in each bin.
The earnings reduction among bunchers is given by dividing the change in overall mean log earnings by the fraction of the population in $[z_l, z_u]$ that bunch: e.g. for 1986–1989 $-0.010/0.065=0.169$.
In this case we weight both actual and counterfactual mean log hours in the different bins by the same, counterfactual, log earnings density, so that we are isolating whether bunchers’ log hours are different from non-bunchers’ in the same bin, not whether bunchers come disproportionately from low earnings bins within the interval.
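The difference between weighting each series by its own density and weighting both by the counterfactual density can be sketched as below. The helper name and all numbers are hypothetical, chosen only to show that the two weightings answer different questions.

```python
def weighted_mean(values, weights):
    """Density-weighted mean of per-bin values."""
    total = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total

# Hypothetical per-bin mean log hours and earnings densities for two bins
actual_hours = [3.0, 3.2]
counterfactual_hours = [3.1, 3.3]
actual_density = [0.6, 0.4]
counterfactual_density = [0.5, 0.5]

# Each series weighted by its own density: mixes within-bin differences
# with differences in where people are located across bins
own_weights_gap = (weighted_mean(actual_hours, actual_density)
                   - weighted_mean(counterfactual_hours, counterfactual_density))

# Both series weighted by the counterfactual density: isolates
# within-bin differences in log hours
common_weights_gap = (weighted_mean(actual_hours, counterfactual_density)
                      - weighted_mean(counterfactual_hours, counterfactual_density))
```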
Consistent with this, if we perform the kind of subgroup analysis shown in Sect. 3 separating out those part-time employees who worked under 16 h per week from those working 16 h or more, we see more pronounced bunching among those working 16 h or more.
This is obtained by dividing the change in mean log hours for everyone in $[n, z_u]$ by the fraction of the (counterfactual) population in that region who bunch: e.g. for 1986–89 $0.015/0.116=0.129$.
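The scaling step in this note is a simple ratio, sketched below with the 1986–89 figures quoted above. The function name is hypothetical, and the final conversion from a log change to a proportional change is added only for illustration.

```python
import math

def response_among_bunchers(change_in_mean, share_bunching):
    """Scale a change in the overall mean up to the subgroup that bunches."""
    return change_in_mean / share_bunching

# Figures quoted in the note for 1986-89
log_hours_response = response_among_bunchers(0.015, 0.116)

# Illustrative conversion of a log change into a proportional change
proportional_change = math.exp(log_hours_response) - 1.0
```

The ratio comes to approximately 0.129, matching the figure quoted in the note.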
The non-linearity of the log function means that these numbers are only illustrative, not averages.
This appendix draws heavily on the description in Adam et al. (2019).
The NESPD is in fact the result of joining together the old New Earnings Survey and the similar Annual Survey of Hours and Earnings which replaced it from 2004.
Source: Authors’ correspondence with the Office for National Statistics.
There are a number of more minor reasons that our sample may not be completely random, but we do not expect these to have a significant effect on the validity of results.
Source: Authors’ correspondence with the Office for National Statistics. It is not clear exactly what the ‘automated National Minimum Wage check’ entails, since we do observe people in our data receiving less than the national minimum wage. Nor is it clear what constitutes an ‘outlier’ for these purposes.
Except for employees earning less than £8,500 per year, for whom benefits in kind remained outside the scope of NICs—and indeed income tax—until April 2016.

References

Adam, S., Phillips, D., & Roantree, B. (2019). 35 years of reforms: A panel analysis of the incidence of, and employee and employer responses to, social security contributions in the UK. Journal of Public Economics, 171(March), 29–50.
Auten, G., & Carroll, R. (1999). The effect of income taxes on household income. Review of Economics and Statistics, 81(4), 681–693.
Bastani, S., & Selin, H. (2014). Bunching and non-bunching at kink points of the Swedish tax schedule. Journal of Public Economics, 109, 36–49.
Blau, F. D., & Kahn, L. M. (2007). Changes in the labor supply behavior of married women: 1980–2000. Journal of Labor Economics, 25, 393–438.
Blomquist, S., & Newey, W. (2017). The bunching estimator cannot identify the taxable income elasticity. NBER Working Paper 24136, National Bureau of Economic Research, December.
Blundell, R., Duncan, A., & Meghir, C. (1998). Estimating labor supply responses using tax reforms. Econometrica, 66(4), 827.
Bosch, N., Jongen, E., Leenders, W., & Möhlmann, J. (2019). Non-bunching at kinks and notches in cash transfers in the Netherlands. International Tax and Public Finance, 26(6), 1329–1352.
Bound, J., Brown, C., & Mathiowetz, N. (2001). Measurement error in survey data. In J. J. Heckman & E. Leamer (Eds.), Handbook of econometrics (Vol. 5, pp. 3705–3843). Elsevier.
Brewer, M., Saez, E., & Shephard, A. (2010). Means testing and tax rates on earnings. In J. Mirrlees, S. Adam, T. Besley, R. Blundell, S. Bond, R. Chote, M. Gammie, P. Johnson, G. Myles, & J. Poterba (Eds.), Dimensions of tax design: The Mirrlees review (pp. 90–173). Oxford: Oxford University Press.
Britton, J. W., & Gruber, J. (2019). Do income contingent student loan programs distort earnings? Evidence from the UK. Working Paper 25822, National Bureau of Economic Research, May.
Browne, J., & Phillips, D. (2017). Estimating the size and nature of responses to changes in income tax rates on top incomes in the UK: A panel analysis. IFS Working Paper WP17/13, Institute for Fiscal Studies, London.
Burtless, G., & Hausman, J. A. (1978). The effect of taxation on labor supply: Evaluating the Gary negative income tax experiment. Journal of Political Economy, 86(6), 1103–1130.
Chetty, R. (2012). Bounds on elasticities with optimization frictions: A synthesis of micro and macro evidence on labor supply. Econometrica, 80(3), 969–1018.
Chetty, R., Friedman, J. N., & Saez, E. (2013). Using differences in knowledge across neighborhoods to uncover the impacts of the EITC on earnings. American Economic Review, 103(7), 2683–2721.
Chetty, R., Friedman, J. N., Olsen, T., & Pistaferri, L. (2011). Adjustment costs, firm responses, and micro vs. macro labor supply elasticities: Evidence from Danish tax records. The Quarterly Journal of Economics, 126(2), 749–804.
Feldstein, M. (1999). Tax avoidance and the deadweight loss of the income tax. The Review of Economics and Statistics, 81(4), 674–680.
Gelber, A. M., Jones, D., & Sacks, D. W. (2020). Estimating adjustment frictions using nonlinear budget sets: Method and evidence from the earnings test. American Economic Journal: Applied Economics, 12(1), 1–31.
Gruber, J., & Saez, E. (2002). The elasticity of taxable income: Evidence and implications. Journal of Public Economics, 84(1), 1–32.
Hargaden, E. P. (2020). Taxpayer responses in good times and bad. Technical Report April
He, D., Peng, L., Wang, X. (2020). Understanding the elasticity of taxable income: A tale of two approaches. Mimeo.
Kleven, H. (2016). Bunching. Annual Review of Economics, 8, 435–464.
Kleven, H. J., & Waseem, M. (2013). Using notches to uncover optimization frictions and structural elasticities: Theory and evidence from Pakistan. The Quarterly Journal of Economics, 128(2), 669–723.
Kopczuk, W. (2005). Tax bases, tax rates and the elasticity of reported income. Journal of Public Economics, 89(11–12), 2093–2119.
Kreiner, C. T., Leth-Petersen, S., & Skov, P. E. (2016). Tax reforms and intertemporal shifting of wage income: Evidence from Danish monthly payroll records. American Economic Journal: Economic Policy, 8(3), 233–257.
le Maire, D., & Schjerning, B. (2013). Tax bunching, income shifting and self-employment. Journal of Public Economics, 107, 1–18.
Miller, H., Pope, T., & Smith, K. (2019). Intertemporal income shifting and the taxation of owner-managed businesses. IFS Working Paper WP19/25, Institute for Fiscal Studies
Mortenson, J., & Whitten, A. (2020). Bunching to maximize tax credits: Evidence from kinks in the US Tax Schedule. American Economic Journal: Economic Policy. (forthcoming).
Office for National Statistics. (2017). Annual Survey of Hours and Earnings: 2017 provisional and 2016 revised results. Office for National Statistics: Technical Report.
Paetzold, J. (2019). How do taxpayers respond to a large kink? Evidence on earnings and deduction behavior from Austria. International Tax and Public Finance, 26(1), 167–197.
Saez, E. (2010). Do taxpayers bunch at kink points? American Economic Journal: Economic Policy, 2(3), 180–212.
Saez, E., Slemrod, J., & Giertz, S. H. (2012). The elasticity of taxable income with respect to marginal tax rates: A critical review. Journal of Economic Literature, 50(1), 3–50.
Summers, L. H. (1989). Some simple economics of mandated benefits. The American Economic Review, 79(2), 177–183.
Tazhitdinova, A. (2015). Behavioral responses to payroll and income taxes in the UK. SSRN Working Paper 2689879, Social Science Research Network, Rochester, NY
Thurley, D. (2014). Married women and state pensions. Standard Note SN1910. London: House of Commons Library.
Zaresani, A. (2019). Adjustment costs and incentives to work: Evidence from a Disability Insurance Program. IZA Institute of Labor Economics: Technical Report.

Author information

Authors and Affiliations

Economic and Social Research Institute (ESRI) and Trinity College Dublin (TCD), Dublin, Ireland
Barra Roantree
Institute for Fiscal Studies (IFS), London, England
Stuart Adam & David Phillips
Tony Blair Institute for Global Change (TBI), London, England
James Browne

Corresponding author

Correspondence to Barra Roantree.

Ethics declarations

Conflict of interest

None.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The authors thank Richard Blundell, Raj Chetty, Eric French, Henrik Kleven, Guy Laroque, Ian Preston, and Emmanuel Saez for their helpful comments, along with colleagues at DIW Berlin, CPB Netherlands and IPP Paris for many helpful discussions throughout the course of the broader project of which this paper was a part. Browne’s work on the paper was conducted while he was at the IFS. The opinions expressed and arguments employed, as well as any errors or omissions, are those of the authors, who gratefully acknowledge funding from the European Research Council (ERC-2010-AdG-269440-WSCWTBDS), the ESRC Centre for the Microeconomic Analysis of Public Policy (ES/M010147/1), ESRC Research Grant ES/K006185/1 and the Nuffield Foundation (OPD/40517). The New Earnings Survey Panel Dataset is produced by the Office for National Statistics and supplied by the Secure Data Service at the UK Data Archive. The Survey of Personal Incomes is Crown Copyright material and has been used with the permission of the Controller of HMSO and the Queen’s Printer for Scotland.

Appendices

Data description: the NESPD

The target sample frame of the New Earnings Survey Panel Dataset (NESPD) is civilian employees in Great Britain whose National Insurance (NI) number ends with a specific pair of digits.^{Footnote 45},^{Footnote 46} Since the last digits of NI numbers are allocated randomly to all adults and the NESPD sample uses the same pair of digits each year, in principle this should deliver a random 1% panel sample of employees.

In practice, the NESPD is not quite a random 1% sample of employees. In fact, it includes around 0.7% of employees on average over the period (1% of employees in Britain would be around 235,000 per year, not the 165,000 we actually observe). The main reason for this is that, despite supposedly being mandatory, the survey suffers from significant non-response. The valid response rate fell from over 75% in the 1980s to around 60% by 2012.^{Footnote 47} Non-response reduces sample size and therefore the precision of our estimates, though as noted above our sample remains large. Non-random non-response is unlikely to be an issue for our approach, which relies on identifying bunching at thresholds: to cause a significant problem, response rates would need to differ between those just below and those just above the threshold, which we have no reason to suspect.^{Footnote 48}

Two other features of the sample frame are particularly relevant when looking at low earners. One is the potential under-sampling of those below the LEL discussed in Sect. 2.2. In addition, since 2005, employees have been removed from the dataset if their earnings were below £10,000 per year (£11,000 since 2009) and either (a) their job title was ‘Director’, (b) they had the same first initial and surname as the employer completing the survey, (c) they ‘fail the automated National Minimum Wage check’ or (d) their earnings were an outlier for their occupation.^{Footnote 49} This is an attempt to identify and remove company owner-managers who are manipulating their earnings—for example, taking dividends instead to reduce their tax liability—and are therefore perceived to be producing a distorted picture of the earnings distribution (though in practice these criteria may remove some other employees as well). However, for our purposes, such income shifting may be one of the kinds of response to taxation we might like to capture, and this procedure means that we may understate the extent of bunching by managers and senior officials at the Secondary Threshold shown in Fig. 4.

The main earnings variable recorded in the NESPD measures total cash earnings (including pay for overtime, shift premiums, commission, performance-related pay, etc.), excluding benefits in kind and employer pension contributions but without deducting employee pension contributions, relating to a particular pay period (typically a week or month, but in all cases converted to a weekly equivalent by the data provider). This corresponds closely to the tax base for NICs, which is levied on a very similar definition of earnings and is charged separately in each pay period. The only difference we are aware of relates to benefits in kind:

Some things we might think of as benefits in kind (broadly those that can be exchanged for cash or are equivalent to cash, such as goods or services bought by the employee but paid for by the employer) are treated like cash in tax law and subject to NICs in full. It is difficult to know whether employers are including those things when they provide the earnings measure in the NESPD; if they are not, then our earnings measure underestimates taxable earnings.
Other benefits in kind—the principal ones being company cars and fuel and private medical insurance—were not subject to NICs at all until the 1990s. But the NICs base was gradually broadened to bring more benefits in kind within the scope of employer NICs (employer NICs were applied to company cars and fuel from 1991, and to most other benefits in kind from 2000),^{Footnote 50} though these benefits in kind remain outside the scope of employee NICs. Thus, from 1991 our earnings measure will be a slight underestimate of low-paid workers’ earnings for employer NICs purposes (though not for employee NICs purposes).

Thus, we may slightly underestimate taxable earnings (or, for some benefits in kind in the latter half of our data, underestimate taxable earnings for employer NICs purposes but not for employee NICs purposes). However, the magnitude of any discrepancy is small and unlikely to have a significant impact on our findings: overall we consider the accuracy of earnings reported in our data to be a strength, not a weakness.

Changes in NICs rates and thresholds usually take effect at the start of the fiscal year. The NESPD collects information each year about earnings and hours of work in the particular pay period that includes the ‘survey reference date’, a specific date in April. The precise date varies from year to year, ranging from 4 April to 29 April. Hence the earnings level reported by the employer in the NESPD will refer to the pay period containing the survey reference date, but the applicable NICs rate will generally depend on whether the amount in question is paid before or after 6 April.

Earnings in respect of the pay period containing a particular date in April may be paid before or after 6 April, so we cannot be certain which fiscal year’s NICs schedule applies to the earnings in our data, and so what contribution cap applies. For example, if the employee’s pay period is the calendar month then the employer will record their April earnings in the survey; but if the employee is paid on the first day of each month then those April earnings will be subject to the NICs schedule for the old fiscal year (ending on 5 April), whereas if they are paid on the 15th day or the last day of each month then their April earnings will be subject to the NICs schedule for the new fiscal year (starting on 6 April). Similar ambiguities can arise for employees with other pay periods, depending on the relationship between the survey reference date, the lengths and dates of pay periods, and the point in the pay period at which earnings are actually paid.
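The timing rule in this paragraph can be made concrete with a small Python sketch. It assumes only what is stated above, namely that the schedule applying to a payment is that of the fiscal year (starting on 6 April) in which the payment date falls; the function name and the example year are hypothetical.

```python
from datetime import date

def nics_fiscal_year_start(payment_date):
    """Return the calendar year in which the applicable fiscal year begins.

    A fiscal year runs from 6 April to 5 April, so a payment made before
    6 April falls under the schedule of the fiscal year that began in the
    previous calendar year.
    """
    new_year_start = date(payment_date.year, 4, 6)
    if payment_date >= new_year_start:
        return payment_date.year
    return payment_date.year - 1

# Monthly-paid employees whose April earnings are recorded in the survey
paid_on_first = nics_fiscal_year_start(date(1995, 4, 1))      # old fiscal year
paid_on_fifteenth = nics_fiscal_year_start(date(1995, 4, 15)) # new fiscal year
paid_on_last_day = nics_fiscal_year_start(date(1995, 4, 30))  # new fiscal year
```

Because the NESPD records the pay period rather than the payment date, this function could not be applied to the data directly; it only illustrates the source of the ambiguity.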

For the large majority of observations in our dataset, the earnings we observe will be subject to the NICs schedule of the fiscal year just beginning, but this will not be the case for all observations (particularly in years when the survey reference date is near the start of April) and we cannot identify those for which it is not true. As the NICs thresholds of the fiscal year just beginning were usually above (and never below) those of the fiscal year just ending (due to the routine uprating of tax thresholds in line with inflation), this means that bunching may appear diffuse, with some individuals bunching below the lower threshold.

Additional figures and tables

See Figs. 9, 10, 11, 12 and Tables 6, 7, 8, 9.

Table 6 NICs Lower and Upper Earnings Limits, 1975–1976 to 1998–1999

Table 7 Notches above the LEL in the NICs schedule, 1986–1987 to 1998–1999.

Table 8 Kinks in the NICs schedule, 1999–2000 to 2015–2016.

Table 9 Estimates of the ETI at the income tax higher-rate threshold.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Adam, S., Browne, J., Phillips, D. et al. Frictions and taxpayer responses: evidence from bunching at personal tax thresholds. Int Tax Public Finance 28, 612–653 (2021). https://doi.org/10.1007/s10797-020-09619-0

Published: 19 August 2020
Issue Date: June 2021
DOI: https://doi.org/10.1007/s10797-020-09619-0

1 Introduction