1 Introduction

How individuals respond to personal income taxes is of enormous importance for public policy, with both the revenue yield from reforms and the efficiency costs of taxation highly dependent on the magnitude and nature of responses. Since Feldstein (1999) showed that the elasticity of taxable income (ETI) is—under certain conditions—a sufficient statistic for these efficiency costs, a large volume of work has sought to estimate this parameter.

This work typically adopts a panel regression approach (e.g. Auten and Carroll 1999; Gruber and Saez 2002; Kopczuk 2005; Adam et al. 2019). However, such estimates are sensitive to the precise specification used to control for mean reversion or secular trends in income growth, and require a great deal of variation in tax rates over time.Footnote 1 More recently, new ‘bunching’ estimators have been developed which attempt to estimate the ETI by exploiting the prediction of basic neoclassical labour supply models that individuals should bunch at upward kinks and notches in the tax schedule (where, respectively, the marginal and average rates rise at a threshold). While research adopting this design has found evidence of substantial bunching by those (such as the self-employed) with significant scope for income manipulation, tax planning and evasion, evidence of bunching by employees is limited and appears to imply very low elasticities (Kleven 2016).Footnote 2

The primary contribution of this paper is to show that the vast majority of wage earners face substantial frictions to optimising their earnings, sufficiently large to prevent them from bunching at tax thresholds. We interpret ‘optimisation frictions’ broadly, to encompass not only adjustment costs arising from rigidities in contracts or pay structures and search and matching costs, but also inattention, lack of information, optimisation errors, etc. Since such frictions should at least partially abate over time,Footnote 3 it is important to distinguish whether small observed responses reflect frictions or underlying preferences; the unattenuated underlying elasticity will usually (though not always) be the most relevant parameter for long-run policy.

While previous research has suggested that frictions could play an important role in attenuating the responses of employees around tax thresholds, ours is the first to confirm and quantify that effect for a broad group of workers in an advanced economy. Chetty et al. (2011) and Chetty (2012) suggest that frictions could be important but do not actually estimate them. Gelber et al. (2020) estimate the costs of adjusting earnings in the USA but only for low-paid workers approaching retirement, and likewise Zaresani (2019) only for disability insurance recipients in Canada. Kleven and Waseem (2013) look at bunching for a group of relatively high-income workers in the formal sector in Pakistan, finding that around 90% of them do not optimise their earnings in response to a notch due to frictions. Applying this same approach, we show that in an advanced economy, most working-age employees at various points across the earnings distribution face frictions in excess of 2% of earnings, and that for most low-paid workers the frictions are much bigger than that. This is true even for part-time employees, who might be thought to have greater flexibility in their hours of work.

Using high-quality administrative and firm survey data for the UK, we first show that while company owner-managers and the self-employed bunch strongly at kinks in the income tax and social security contributions (SSC) schedule, employees do not. While similar patterns of bunching have previously been identified in the USA (Saez 2010; Mortenson and Whitten 2020), Denmark (Chetty et al. 2011; le Maire and Schjerning 2013), Sweden (Bastani and Selin 2014), the Netherlands (Bosch et al. 2019) and Ireland (Hargaden 2020), the peculiar tax schedule on personal income in the UK, which during the period we study contained notches as well as kinks, allows us to shed light on why.Footnote 4 By creating a strictly dominated region of earnings that no one should choose to locate in, regardless of how much they value consumption relative to leisure, notches provide a means of measuring the extent to which individuals are constrained from reducing their earnings by optimisation frictions (Kleven and Waseem 2013).

We find no bunching below, and no dip above, several notches located at various points across the earnings distribution, meaning that frictions must be sufficiently large to offset the substantial gains from bunching. While we do see some bunching by employees at a notch lower down the earnings distribution, consistent with an underlying taxable earnings elasticity unattenuated by frictions of around 0.10–0.20, most of those who would locate in the dominated region in the absence of the notch still do so in the presence of a notch. This provides compelling evidence that the reason employees do not bunch at kinks is because a large majority face substantial frictions. This is true even immediately above the notch, where the potential tax saving from bunching was at times as high as 9% of earnings for the employee and a further 10.45% for the employer.

A particularly novel feature of this paper is that the firm survey data we use contain information on employees’ hours of work in addition to their earnings. Exploiting this, we find that the bunching observed at a notch low down the earnings distribution is entirely driven by the behaviour of part-time workers, who make up the majority of individuals at this part of the earnings distribution. Bunching is negligible among full-time workers with similar levels of earnings (and correspondingly lower hourly wages), suggesting that they face much greater frictions than part-time workers. We make a methodological contribution to the bunching literature, showing how these responses can be decomposed into hours and hourly wage components, and that selection in who responds can be identified. Applying this approach, we find that those part-time employees who bunch are higher-hours, lower-wage types than those part-time employees who do not bunch.

We also observe interesting variation in responses across taxpayers. Like Kleven and Waseem (2013) and le Maire and Schjerning (2013), we find that company owner-managers and the self-employed are particularly responsive to taxes, reflecting their greater scope for income manipulation, tax planning and evasion. Moreover, in recent years company owner-managers have become much more responsive than the self-employed. Among low-paid employees, we find that those in the retail and hospitality sectors are much more likely to respond than those in the public sector, and—perhaps surprisingly—that there is little difference in the responses of men and women conditional on working part-time. This raises the question of whether well-documented differences in observed behavioural responses between groups (e.g. men and women, employees and the self-employed) are a result of heterogeneity in preferences or in frictions. If increases in the share of women working full-time rather than part-time mean that they face larger frictions to adjusting their hours of work, that might help to explain why Blau and Kahn (2007), among others, find a decline in the elasticity of labour supply for women over time.

The rest of this paper proceeds as follows:  Sect. 2 describes our institutional setting and data. Section 3 provides graphical evidence of bunching at kinks and notches in the UK income tax and SSC schedules and of how it varies between groups. Section 4 uses the bunching estimators of Saez (2010) and Kleven and Waseem (2013) to estimate elasticities from the bunching observed at kinks and notches, respectively. Section 5 outlines our method for decomposing bunching responses and identifying selection. Section 6 concludes.

2 Institutions and data

2.1 Institutional setting

The UK has two main personal taxes on income: income tax, paid by individuals on their earned and unearned income, and National Insurance contributions (NICs), paid by employees and employers on earned income only.Footnote 5 Unusually by international standards, most employees in the UK have their exact tax liability deducted from earnings at source through a pay-as-you-earn (PAYE) system and do not have to submit a tax return.Footnote 6 In fiscal year 2015–2016, both income tax and NICs had piecewise linear schedules, applying above tax-free allowances at standard rates up to a common upper threshold of £42,380 per year and at different rates above that. However, their design has not always been so simple, and their structures over the years provide multiple kinks and notches that we exploit to investigate the responsiveness of taxpayers.

2.1.1 Income tax

From the start of the 1990s, the UK operated a relatively simple, annual system of income tax, applied at a starting, basic and higher rate to individual income above a tax-free personal allowance.Footnote 7 Table 1 shows these rates for the period of our analysis (1995–1996 to 2015–2016), along with the personal allowance and the thresholds above which the basic and higher rates applied. Different rates of income tax applied to savings and dividend income, as described in the note to Table 1.

Table 1 Income tax rates and thresholds for earned income.

In 2008–2009, the starting rate was abolished (except for savings income), leaving taxpayers facing either the basic rate (above the personal allowance) or the higher rate (above the higher-rate threshold) on their non-savings, non-dividend income. Subsequent reforms have complicated this rate structure for the c.2% of adults with income above £100,000: since 2010–2011, the personal allowance has been reduced by £1 for each £2 of income above this point, creating a band in which income tax liabilities rise by 60 pence for each additional pound of income (an effective 60% rate) until the allowance is exhausted and the rate falls back to 40%, while incomes above £150,000 have been subject to an additional rate (initially 50%, now 45%).

In summary, the UK income tax schedule contains a number of upwards kinks at which we would expect to see bunching, namely:

  • at the personal allowance, throughout

  • at the basic rate threshold, until 2007–2008

  • at the higher-rate threshold, throughout

  • at £100,000 and £150,000, since 2010–2011

In addition, since 2010–2011 there is a downward kink at around £115,000 (where the personal allowance is fully withdrawn and the marginal rate falls back from 60 to 40%), which should result in a dip in the distribution of taxable income analogous to the bunching expected at upwards kinks.

2.1.2 National insurance contributions

Between April 1975 and October 1985, once earnings exceeded a lower threshold called the Lower Earnings Limit (LEL), NICs were levied on the entirety of earnings at an employee and an employer rate up to a ceiling called the Upper Earnings Limit (UEL). This created a jump (notch) in NICs liabilities at the LEL and a strictly dominated range of earnings above. The solid black lines in Fig. 1 illustrate this schedule for both employee (panel A) and employer (panel B) contributions in the 1984–1985 tax year.

Fig. 1
figure 1

Source: Tolley’s National Insurance Contributions, various years

National Insurance contribution schedules (April 2015 prices). Note Previous years’ thresholds uprated to April 2015 prices using the retail prices index (RPI). Assumes employee contracted into State Earnings-Related Pension Scheme (SERPS) or State Second Pension (S2P). The 1984–1985 schedule excludes the 1% National Insurance surcharge abolished in September 1984.

Reforms taking effect in October 1985 changed the schedule significantly. As shown by the dashed grey lines in Fig. 1, the notch in the employee and employer NICs schedules at the LEL was reduced in size (from 9% and 10.45%, respectively, to 5% apiece), while new notches above the LEL were introduced in both schedules: two in the employee schedule and three in the employer schedule. In addition, the cap on employer contributions at the UEL was abolished.

Another reform, in October 1989, further reduced the size of the notch at the LEL for both employee and employer contributions (to 2%) and eliminated the additional notches above the LEL in the employee NICs schedule. However, it left in place the additional notches in the employer NICs schedule, as shown by the dashed black lines in Fig. 1b.

The structure of NICs in place at the end of our period (shown by the solid grey line in Fig. 1) was arrived at through reforms that took effect between 1998 and 2003. This removed the remaining notches from both the employer and employee NICs schedules, replacing them with kinks at new Primary and Secondary Thresholds for employee and employer contributions, respectively. Employee NICs were also extended for the first time to earnings above the UEL, though at a very low rate (initially 1%, later 2%).

To summarise, the design of NICs creates incentives for taxpayers to bunch:

  • below a notch at the LEL from 1975–1976 to 1998–1999

  • below multiple notches above the LEL from 1985–1986 to 1998–1999

  • at kinks in the employee and employer schedule since 1999–2000

In addition, the zero and reduced rates that have applied above the UEL to employee contributions throughout this period and to employer contributions between 1975–1976 and 1984–1985 create downwards kinks at the UEL which should result in a dip in the distribution of earnings, analogous to the bunching expected at upwards kinks. Tables 6, 7 and 8 in Appendix B contain a full list of these thresholds, along with the size of the notch or kink.

National Insurance was originally envisaged as a ‘true’ social insurance scheme, with a broadly actuarial link between contributions paid and benefit entitlements for each individual. Insofar as there is—or, perhaps, is perceived to be—such a link, National Insurance may not have the same disincentive effects as a simple tax on earnings (Summers 1989). However, in practice the link between contributions and benefits—particularly at the margin—had already been significantly weakened by the 1970s, and had all but disappeared by 2015. For the most part, therefore, NICs acted as a straightforward tax on earnings.

There was one strongly contributory element to the National Insurance scheme. Until very recently, individuals contributing to a private pension could choose whether to ‘contract in’ or ‘contract out’ of the second pillar of the UK state pension system (initially the State Earnings-Related Pension Scheme, SERPS, and later the State Second Pension, S2P). Those contracting out were charged slightly lower rates of employee and employer NICS on earnings between the LEL and the UEL (or, since 2009, the Upper Accruals Point) in exchange for sacrificing future entitlement to SERPS/S2P.

For much of the period, our data do not record whether individuals were contracted in or out. However, the majority of people were contracted out, and the contracted-out rate is arguably a better measure of the ‘tax wedge’ associated with NICs even for those who were contracted in since the rate reduction was a roughly actuarially fair reflection of the forgone entitlements. We therefore use contracted-out NICs rates throughout.Footnote 8 In any case, this does not affect the size of the notch at the LEL, which is crucial for our estimation, since contracting out only reduced the marginal rate between the LEL and the UEL, not the rate charged on earnings below the LEL when the LEL was reached. The marginal rate above the notch plays only a secondary role in our analysis, in translating behavioural responses into elasticities in Sect. 4.Footnote 9

Throughout our period, a much lower rate of employee NICs was available (in exchange for reduced benefit entitlement) to married women who had been claiming it almost continuously since May 1977. Since we cannot identify married women in our data, let alone those eligible for this option and taking it up, we ignore it. The requirement to remain married and in virtually continuous employment since 1977 meant that this affected a large fraction of employed women in the early years of our analysis but very few in later years: the number of women paying it fell from 4.2 million in 1978–1979 to 80,000 in 2000–2001 and 3000 in 2011–2012 (Thurley 2014). We note below where ignoring this reduced rate might significantly affect our results, and check sensitivity to this assumption.

2.2 Data

This paper uses both administrative and employer survey data: the Survey of Personal Incomes (SPI) and the New Earnings Survey Panel Dataset (NESPD).

When looking at income tax thresholds, we use data from the SPI, a stratified random sample drawn from income tax records held by HM Revenue and Customs (HMRC), that cover the tax years between 1995–1996 and 2013–2014.Footnote 10 The sample size increased steadily during that period, from under 60,000 individuals in 1995–1996 to over 700,000 by 2013–2014. Those with very high incomes are oversampled, while the data are not representative of those with incomes too low to pay income tax. For this reason, we do not use the SPI to examine bunching at the income tax personal allowance, from where the starting or basic rate of income tax begins to apply. Income is recorded annually and includes almost all sources that are liable for income tax. One exception is interest and investment income for starting or basic rate taxpayers who do not submit a self-assessment return. This is instead imputed in the SPI, based on administrative data on the aggregate amount and numbers receiving it and on survey evidence on the broad pattern across the income distribution. While the imputation will not allocate the income to individuals accurately-so there is some measurement error in our income variable-the imputed distribution should be a fair approximation, and suggests that savings and investment income amounts to less than £500 for over 90% of employees around the higher-rate threshold (roughly the same fraction as we observe for the self-employed, who must report such income on their tax return). This should not be enough to have a significant impact on our elasticity estimates (which are based on examining bunching in a range of £700 either side of the threshold) unless the employees who in reality bunch happen also to be the few with unusually high savings and investment income, which will not be imputed to them so their bunching will not be visible in our data.

To look at NICs we use the NESPD, a mandatory survey of employers that collects information on employees’ basic characteristics and earnings for a pay period each April, from 1975 to 2015.Footnote 11 The target sample frame is employees whose National Insurance number ends with a specific pair of digits.Footnote 12 In principle, this should deliver a 1% random sample of employees, but in practice it delivers around 0.7% due to non-response and the exclusion of non-civilian employees. At around 165,000 individuals per year, the NESPD contains a far larger sample than UK household surveys and does not suffer from the same degree of measurement error, as responses are provided by employers directly from or with reference to their payroll records.

As well as the length of its coverage, a key advantage of the NESPD is that the earnings measure corresponds closely to the tax base for NICs, most notably recording earnings in a single pay period (typically a week or month) rather than annually.Footnote 13 Unlike the SPI and other administrative datasets typically used to investigate bunching, the NESPD also records hours of work, which we exploit in Sect. 5.

A feature of the NESPD which complicates our analysis is that the pay period for which earnings are observed is close to the turn of the UK’s fiscal year on 6 April, when changes in NICs rates and thresholds usually take effect. For the vast majority of individuals, the earnings we observe will be subject to the NICs schedule of the fiscal year just beginning. But some years’ data contain individuals who face the schedule of the year just ending.Footnote 14 Bunching below the new year’s threshold may appear diffuse if these individuals bunch below the threshold from the fiscal year just ending.

In addition, the NESPD is potentially susceptible to under-sampling of employees earning below the LEL as employers are not obliged to operate PAYE on these jobs, the records of which are used to identify employees falling within the sampling frame of the survey.

There is evidence to suggest that this is not a major issue by the late 1990s:

  • First, if there were systematic under-sampling below the LEL, one would expect a corresponding upwards jump in the density at the LEL after 1999, when the NICs exemption threshold was raised above the LEL (removing the tax incentive to stay below the LEL but retaining it as the threshold for the mandatory reporting of earnings). We observe no such discontinuity.

  • Second, comparisons of the NESPD with the Family Resources Survey (FRS) and Labour Force Survey (LFS)-which do not suffer from the same potential issue of under-sampling-suggest that by the late 1990s the proportion of employees with earnings below the LEL was only slightly higher in these household surveys than in the NESPD. Moreover, the proportion just above the LEL was also slightly higher in the FRS and LFS than in the NESPD, which cannot be explained by NESPD coverage issues but could be explained by oversampling of lower earners or underreporting of earnings in the FRS and LFS (see Bound et al. (2001) and references therein for a discussion of earnings underreporting and measurement error in household surveys).

  • Third, the UK Office for National Statistics (ONS) says in documentation accompanying the NESPD that a 2004 investigation showed `the impact of this under-coverage on [survey] estimates was very small'.Footnote 15

  • Fourth, the ONS also reports that there was little change in the NESPD earnings distribution in 2014 when the PAYE sampling frame moved to ‘Real-Time Information’ and larger employers were required to include all of their employees, not just those above the LEL (Office for National Statistics 2017).

There is less evidence available for the earlier part of the period we analyse. If there was no significant under-sampling by the late 1990s, it seems unlikely that it was a major problem in the early/mid-1990s and then suddenly disappeared; but it may be more of an issue in the 1970s and 1980s. One reason under-sampling below the LEL is not a significant problem in recent years is that employers typically include all their employees on their PAYE scheme, even where it is not required; employers were less likely to do that before widespread computerisation, though anecdotal reports suggest that many still did so (it was often simpler to include them, not least because factors such as variable earnings, job changes and interactions with other income sources meant that they might well be obliged to include them at another time anyway). Comparison of the NESPD with the Family Expenditure Survey (FES) suggests that the issue of under-sampling was more significant in the 1980s than in subsequent decades, though more acute for the very lowest earners than for those just below the LEL (consistent with those quite near the LEL being more likely to need including for other reasons). Insofar as this is an issue, we may understate the extent of bunching at the LEL and we note where this may affect our results in the sections that follow.

3 Bunching at kinks and notches

Kinks are defined by a change in the marginal rate of tax at a threshold, such as in income tax schedules where income in higher bands is subject to higher rates of tax. Such tax schedules create a convex budget set that the neoclassical model of labour supply predicts should lead to bunching in the distribution of taxable income (Saez 2010). Optimisation errors, arising from an inability to perfectly control pre-tax income, may mean that this bunching appears as a diffuse mass around, rather than a spike at, the kink.Footnote 16

Notches—where the average rather than marginal tax rate increases discontinuously at a threshold—should lead to bunching below, rather than at, the threshold (Kleven and Waseem 2013). This is because the notch creates a jump in tax liabilities, so those who would have located slightly above in the absence of the notch can now obtain a large tax advantage from a small relocation to below the threshold. Indeed, notches often create a dominated region of earnings that no one should locate in, regardless of how much they value consumption relative to leisure. Because both consumption and leisure can be increased by reducing earnings below, the only reason we should observe anyone in the dominated region is because they are subject to optimisation frictions that prevent them from adjusting their earnings.Footnote 17

Such frictions could take many forms. For example, they could reflect lack of understanding of the tax system; rigidities in contracts, pay structures or wage-setting/bargaining processes based on nominal wages; restricted hours choices available within a firm combined with frictions (such as search and matching costs or specific human capital) that make it costly to move jobs; or minimum wages or other institutional features that make reductions in earnings difficult or unattractive (such as mortgage offers or employer pension contributions being specified as percentages of nominal earnings).Footnote 18

In the remainder of this section, we provide graphical evidence of the extent of bunching for different groups at kinks and notches in the UK income tax and NICs schedules. We first look at kinks in the income tax schedule using the SPI data.Footnote 19 Figure 2 shows the distribution of taxable income around the basic rate, higher rate, £100,000 and £150,000 thresholds, pooling observations from all waves of our data. There is little bunching at the relatively small kink created by the basic rate threshold. It is pronounced – if diffuse – at the higher-rate threshold, where the net-of-tax rate (that is, one minus the tax rate) fell by between 20 and 25%. The final panel shows there is clear, but modest, bunching at the £100,000 and £150,000 thresholds.Footnote 20

Fig. 2
figure 2

Source: Authors’ calculations using the Survey of Personal Incomes, 1995–1996 to 2013–2014

Bunching at upward kinks in the income tax schedule. Note Panels a and b show the distribution of annual taxable income in 2015–2016 prices relative to the basic and higher-rate thresholds, respectively, pooling all available years of data between 1995–1996 and 2007–2008 (panel a) or 2013–2014 (panel b). Panel c shows the distribution of taxable income in nominal terms with vertical lines indicating the £100,000 and £150,000 thresholds, pooling the 2010–2011 and 2013–2014 data.

What bunching there is at each of these thresholds is mostly due to the behaviour of company owner-managers and (to a lesser extent) the self-employed, with little or no bunching among employees. Figure 3 shows this for the higher-rate threshold, where we saw the strongest bunching; Figs. 9 and 10 in Appendix B show the same results at other upwards income tax kinks. Similarly, Fig. 4 uses data from the NESPD which measures the employment earnings subject to NICs rather than total income for income tax purposes. It shows that we see substantial bunching by managers and senior officials—the standardised occupation category which includes most company owner-managers—at the NICs Secondary Threshold, but nothing for other employees who make up the vast majority of workers at this level of earnings.Footnote 21

Fig. 3
figure 3

Source: Authors’ calculations using the Survey of Personal Incomes, 1995–1996 to 2013–2014

Bunching at the income tax higher-rate threshold, by taxpayer type. Note Employees (panel a) are defined as taxpayers whose total income is predominantly (more than 97.5%) derived from employment earnings. Company owner-managers (panel b) are defined as taxpayers who are directors of closely held companies. Self-employed (panel c) are defined as taxpayers whose total income is predominantly (more than 97.5%) derived from self-employment earnings. The other taxpayers group (panel d) contains all remaining taxpayers, and is mostly made up of those with income from a mixture of sources (e.g. earned and unearned income).

There are several possible reasons for the greater responsiveness of those running their own business. They may have more flexibility to adjust their work patterns and, more generally, to fine-tune their income and deductions in any given year, including (for example) by splitting income with a spouse or other family member. They may be more aware of financial planning and more likely to be receiving professional advice. They have more scope to misreport the level or timing of their income and deductions, since they are not subject to the same kind of third-party reporting that employees face on their salary.Footnote 22 And company owner-managers in particular can adjust the amount of profit they retain in the company in order to shift personal income across years (or ultimately take it as capital gains instead). Using newly linked administrative data on personal and corporate tax returns, Miller et al. (2019) show that the last of these possibilities is particularly important in explaining the high responsiveness of company owner-managers documented here.

Fig. 4
figure 4

Source: Authors’ calculations using the New Earnings Survey Panel Dataset, 2000–2015

Bunching at the NICs Secondary Threshold. Note Groups defined using the Standard Industrial Classification of economic activities (SIC00). Weekly taxable earnings relative to Secondary Threshold.

The lack of bunching among employees might reflect a low underlying behavioural elasticity, or it might reflect frictions that attenuate the response. The patterns of bunching we observe at notches provide robust evidence that frictions play an important role. This is because (as discussed above) notches in the employee NICs schedule create a dominated region that no one should locate in, regardless of how much they value consumption relative to leisure, save for frictions that prevent them from adjusting their earnings. We first examine the notch at the LEL, where between 1975–1976 and 1998–1999 the average rate of employee NICs jumped from 0% to between 2 and 9% of earnings and the average rate of employer NICs jumped from 0% to between 3 and 10.45% of earnings.

Figure 5 shows the distribution of earnings (z) relative to this notch (n), pooling all years of NESPD over this period.Footnote 23 There is clear, though modest, bunching below the threshold and a dip above it. But there is also substantial mass visible just above the threshold, in the area that corresponds to the dominated region created by the notch in employee NICs. This provides compelling evidence that the majority of employees at even this low level of earnings are subject to frictions large enough to prevent them from bunching.Footnote 24 As Fig. 11 in Appendix B shows, there is no evidence of any bunching below the notches located higher up the earnings distribution or of any dip above them, even in the dominated region created where the average rates of employee and employer NICs each jumped by 2 percentage points between 1986–1987 and 1989–1990 (panels a and c). This suggests that virtually all employees at these higher levels of earnings faced frictions large enough to prevent bunching.

Fig. 5
figure 5

Source: Authors’ calculations using the New Earnings Survey Panel Dataset, 1975–1998

Bunching at the NICs Lower Earnings Limit. Note Taxable earnings, z, are shown relative to the LEL by plotting the density of observations in bins of \(\ln (\frac{z}{\text {LEL}}) \times 100\) so that 0 represents the threshold in each year and 5, for example, means having earnings approximately 5% above the threshold. Excludes individuals with weeklyised or annualised earnings that take common round number values.

As with kinks, there is substantial heterogeneity in who bunches at the LEL. The first two panels of Fig. 6 show that while part-time employees bunch sharply below the LEL, there is a much less pronounced response among full-time employees. Interestingly, panels c and d show that there appears to be little difference in the bunching responses of women and men conditional upon working part-time (although there are far fewer men working part-time at this level of earnings than there are women). The final two panels show that there are also substantial differences in bunching across industries. Those working in the retail and hospitality sector, where working patterns are typically more flexible (e.g. shift work), bunch sharply below the LEL while there is no observable response among employees in the public sector.

While the presence or absence of bunching at kinks and notches has traditionally been seen as a complication in fitting structural models of labour supply to data (e.g. Burtless and Hausman 1978), recent work has instead viewed it as a potential source of variation that might be used to identify parameters summarising behavioural responses. In the next section, we use the bunching responses documented above to estimate the elasticity of taxable income (or earnings), applying bunching estimators developed by Saez (2010) for kinks and Kleven and Waseem (2013) for notches.

Fig. 6
figure 6

Source: Authors’ calculations using the New Earnings Survey Panel Dataset, 1975–1998

Bunching at the NICs Lower Earnings Limit: by subgroup . Note Taxable earnings, z, are shown relative to the LEL by plotting the density of observations in bins of \(\ln (\frac{z}{\text {LEL}}) \times 100\) so that 0 represents the threshold in each year and 5, for example, means having earnings approximately 5% above the threshold. Excludes individuals with weeklyised or annualised earnings that take common round number values.

4 Estimating elasticities from bunching

Saez (2010) showed that the elasticity of taxable income can be inferred from the income response of the ‘marginal buncher’: the last person who, facing a convex budget set, chooses to locate at, rather than above, a kink k where the marginal tax rate rises from \(\tau _l\) to \(\tau _h\). As this response represents a move between two tangency points, by the definition of the ETI \(e \equiv \frac{1-\tau _l}{\tau _h-\tau _l}\frac{\Delta z}{k}\) we have a relationship that depends only on known parameters of the tax schedule (\(\tau _l\), \(\tau _h\) and k) and the income response of the marginal buncher (\(\Delta z\)). This response can in turn be inferred from the amount of excess mass around the threshold \(B \equiv \int _{z_l}^{z_u} (h_0(z)-h_1(z))dz\) (where \(z_l\) and \(z_u\) define the income range in which bunching is observed and \(h_1(z)\) and \(h_0(z)\) are the actual and counterfactual distributions of income) as \(B = \int _{k}^{k+\Delta z} h_0(z)dz\) (Saez 2010). The crucial step is to estimate the counterfactual distribution of income that would exist in the absence of a kink, \(h_0(z)\). We follow Chetty et al. (2013) and estimate \(h_0(z)\) by fitting a flexible polynomial to the observed distribution of taxable income, excluding the area [\(z_l, z_u\)] around k.Footnote 25

When the kink is small, this approach identifies the compensated elasticity around the threshold, as the kink does not produce income effects over the bunching segment [\(k, k+\Delta z\)] (Saez 2010). At larger kinks, without making assumptions about the functional form of utility, the elasticity identified is instead a weighted average of the local compensated and uncompensated elasticities (Kleven 2016). In either case, the bunching estimator easily extends to accommodate heterogeneity in preferences, instead identifying the (compensated, or combined) local average elasticity at the threshold.Footnote 26

We apply this estimator to the bunching already documented at the higher rate, £100,000 and £150,000 thresholds, separately for employees, the self-employed and company owner-managers, who each face a different change in tax rates at these thresholds.Footnote 27 Figure 7 shows these estimates at the higher-rate threshold annually for each year covered by our SPI data, along with the bootstrapped 95% confidence interval. The estimates for employees are precisely estimated, and significantly different from zero in only 2 of the 16 years (1999–2000 and 2013–2014). On average, we estimate a higher ETI for company owner-managers (0.078) than the self-employed (0.046), though the coefficient estimates are imprecisely estimated in the 1990s when sample sizes were smaller. As Fig. 7 shows, the relative responsiveness of the two groups seems to have changed over time: our central estimate for the self-employed declines from about 0.10 to almost 0, while that for company owner-managers does the reverse.Footnote 28 The reasons for this apparent gradual decline in the ETI among the self-employed and rise among company owner-managers would be an interesting topic for future research.

Table 2 shows estimates for the same groups at the £100,000 and £150,000 thresholds in 2010–2011 and 2013–2014, the 2 years covered by our SPI data that these thresholds existed. Again, estimates are near zero for employees, and higher for company owner-managers than the self-employed (though both are less precisely estimated than at the higher-rate threshold because sample sizes are smaller).

Fig. 7
figure 7

Source: Authors’ calculations using the Survey of Personal Incomes, 1995–1996 to 2013–2014

Estimates of the ETI at the income tax higher-rate threshold. Note Figures show annual point estimates and 95% confidence intervals for the elasticity of taxable income, obtained by repeating estimation procedure on 500 bootstrap samples, drawn with replacement from the empirical distribution. Employees (panel a) are defined as taxpayers whose total income is predominantly (\(\ge 97.5\%\)) derived from employment earnings. Self-employed (panel b) are defined as taxpayers whose total income is predominantly (\(\ge 97.5\%\)) derived from self-employment earnings. Company owner-managers (panel c) are defined as taxpayers who are directors of closely held companies.

Table 2 Estimate of the ETI at £100,000 & £150,000 income tax thresholds.

Taken together, results from the bunching estimator applied to income tax kinks show clear differences in elasticities, with company owner-managers and the self-employed more responsive than employees. Yet the estimated ETIs are smaller than estimates using different methodologies for high-income individuals in the UK (e.g. Brewer et al. 2010; Browne and Phillips 2017), and toward the lower end of the range estimated in the wider literature (Saez et al. 2012).

However, allowing for even relatively small optimisation frictions could reconcile these estimates with much larger underlying taxable income elasticities. Following Chetty (2012), we illustrate possible effects frictions may be having on our estimates, assuming a quasi-linear utility function and a fixed adjustment cost equal to 1% of utility. Our estimate of 0.204 for company owner-mangers at the higher-rate threshold in 2013–14 is consistent with an elasticity unattenuated by frictions of up to 1.52, while even the smaller estimates for employees and the self-employed at this threshold are consistent with unattenuated elasticities well in excess of one.

If we are interested in obtaining an estimate of the elasticity of taxable income that can be used to predict the response of individuals in a different setting to that in which it was obtained, then it is important to distinguish whether limited bunching (and a correspondingly small estimated elasticity) is the result of low underlying responsiveness or non-trivial frictions. If frictions take the form of fixed costs then we would expect disproportionately stronger responses to bigger tax changes, and if frictions dissipate in the long run then long-run behavioural responses may be larger than short-run responses. And as well as improving our understanding of likely responses to tax changes, frictions may be a policy concern in their own right. Elasticities at least partly reflect utility-maximising behaviour by taxpayers given their preferences between (say) consumption and leisure;Footnote 29 whereas if frictions are preventing people from maximising their utility, leading to an inefficient allocation of resources, that might highlight potential gains from policy measures to reduce frictions, ranging from greater information provision to reforms to labour market institutions.

Kinks, unfortunately, provide no means of distinguishing high frictions from low underlying elasticities without variation in the size or location of the kink, and even then only with assumptions on the form frictions take or multiple changes in the size of kinks (Gelber et al. 2020). Notches, however, provide a more promising source of variation for distinguishing these explanations. This is because notches provide an additional empirical moment—the observed density in the strictly dominated region above the notch—that can be used to account for attenuation from frictions.

Kleven and Waseem (2013) show that the ratio of the empirical to the counterfactual density in the dominated region, \({\hat{a}}\), can be used to measure the share of individuals who do not respond to the notch because of frictions. This can then be used to scale up the estimated bunching below the notch (\({\hat{b}}\), the difference between empirical and counterfactual distributions in the range \([z_l,n]\)) to what it would be if no one were subject to frictions large enough to prevent them from bunching. The authors show that this scaled-up bunching mass, \({\tilde{b}} \equiv \frac{{\hat{b}}}{(1-{\hat{a}})}\), is related to the unattenuated earnings response of the marginal buncher, which can be estimated by calculating the earnings level at which the counterfactual mass between n and that point equals the scaled-up bunching mass.Footnote 30

There are then two ways of converting this unattenuated earnings response into a local average unattenuated elasticity. The first is to assume a functional form for utility and use the fact that by definition the marginal buncher is indifferent between locating at the notch n and at another higher point, \(z^*\). This defines a relationship between known parameters of the tax schedule, the estimated unattenuated earnings response of the marginal buncher \(\Delta {\hat{z}}^*\) and the local average (structural) earnings elasticity \(e_s\), which can be solved numerically. The second approach is a reduced form one that approximates the effect of a jump in average tax rates as a large change in marginal tax rates. As a notch will induce a larger bunching response than the implicit kink would, this reduced form estimate will be upwardly biased by treating the bunching response as if it were generated by a kink.

As with kinks, the key empirical entity to be estimated is the counterfactual distribution of earnings around the notch. We follow Kleven and Waseem (2013) in estimating this by fitting a flexible polynomial to the number of individuals in small bins of earnings, excluding observations in the range [\(z_l,n]\) below the notch that is obviously affected by bunching and an initially arbitrary range [\(n,z_{u0}\)] above the notch. The polynomial is then repeatedly estimated, increasing the excluded area above the threshold until it reaches a point (\(z_u\)) where the estimated excess mass between the actual and counterfactual earnings distributions below the threshold, \({\hat{b}}\), equals the estimated missing mass between the actual and counterfactual distributions above the threshold, \({\hat{m}}\). The resulting estimate of the excess mass, \({\hat{b}}\), can then be filled in directly under the counterfactual to estimate the attenuated earnings response of the marginal buncher (\(\Delta {\hat{z}}\)), or scaled up to account for frictions and then filled in under the counterfactual to estimate the unattenuated earnings response \(\Delta {\hat{z}}^*\) and ETI.Footnote 31

We apply this estimator to the bunching observed at the Lower Earnings Limit for three sets of tax years where the jump in average employee NICs rates (and so the dominated region) is of the same size: 1983–84 to 1985–86 (9% points), 1986–1987 to 1989–1990 (5ppts) and 1990–1991 to 1998–1999 (2ppts).Footnote 32 Table 3 shows that both the reduced form (\({\hat{e}}_r\)) and structural (\({\hat{e}}_s\)) unattenuated elasticity estimates are greater than zero, but modest. For example, the reduced form estimate is 0.238 for both the 1990–1991 to 1998–1999 and 1986–1987 to 1989–1990 periods, similar to the hours elasticities for low-paid married women over the same time period estimated by Blundell et al. (1998), and to the elasticity of earnings with respect to employee NICs estimated by Adam et al. (2019) using the same data as this paper but exploiting tax changes over time. Both the reduced form and structural estimates for the period 1983–1984 to 1985–1986 are substantially smaller. This may partly reflect the married women’s reduced rate described in Sect. 2.1, which at this time still affected a substantial number of women: if the bunching we observe was generated by a smaller notch than we are assuming, then the true elasticity will be somewhat higher than these estimates.Footnote 33 It may also reflect the potential under-sampling of those below the LEL in the NESPD discussed in Sect. 2.2, though as noted there this is more acute far below rather than just below the LEL.

The elasticities for the period 1990–1991 to 1998–1999 are less precisely estimated, as shown by the large bootstrapped standard errors.Footnote 34 In addition, all of our central estimates are quite sensitive to the exact specification of the fitted polynomial, the range of income over which the counterfactual is estimated, and the excluded range of earnings below the threshold.

The estimated share of individuals who do not respond to the notch because of frictions, \({\hat{a}}\), however, is quite precisely estimated and robust to specification. While these may also be affected by potential under-sampling below the LEL, our estimates suggest that around 75–90% of employees at this low level of earnings are subject to frictions sufficiently large to prevent them from bunching: equivalent to at least 9% of gross earnings in the 1983–84 to 1985–86 period, more than £400 per year in today’s prices. As a result, our estimate of the unattenuated percentage earnings response, \(\frac{\Delta {\hat{z}}^*}{n}\), is between 1.5 and 2 times the attenuated one \(\frac{\Delta {\hat{z}}}{n}\) obtained solely from the estimated bunching response below the threshold, \({\hat{b}}\).

Table 3 Estimates from bunching at the NICs Lower Earnings Limit.

These estimates provide compelling evidence of the important role that frictions can play in attenuating the earnings response of employees, and are reinforced by the complete absence of bunching at notches further up the earnings distribution (Fig. 11, discussed in Sect. 3).Footnote 35 Frictions of this magnitude could also play an important role in reconciling microeconomic and macroeconomic estimates of the compensated elasticity, as suggested by Chetty (2012). Macroeconomic estimates of this parameter are typically derived by calibrating macroeconomic models so that they match cross-country variation in aggregate hours of work, and in general are significantly larger than those obtained with microeconomic data exploiting policy reforms. Chetty suggests that allowing for frictions with utility costs equivalent to 1% of consumption can fully reconcile these differences. Our estimates imply that most low- and middle-earning employees face frictions that are even larger.

5 Bunching responses and hours of work

Unlike the administrative data used in most studies of bunching at tax thresholds, the NESPD contains information on hours of work as well as earnings for around 85% of employees sampled. In this section, we show how hours data can be used to reveal more about the nature of bunching responses.

Figure 8 plots mean log hours in each bin of log earnings around the LEL for the three periods 1983–1985, 1986–1989 and 1990–1998. Since \(\ln (z) = \ln (h) + \ln (w)\), this decomposes log earnings at every level of earnings into mean log hours and mean log hourly wage components. We then fit a 7th-order polynomial to the data points excluding those in the interval \([z_l, z_u]\) affected by bunching.

Fig. 8
figure 8

Source: Authors’ calculations using the New Earnings Survey Panel Dataset, 1983 to 1998

Decomposition of bunching responses at the NICs Lower Earnings Limit. Note The horizontal axis shows taxable earnings, z, relative to the LEL in terms of \(\ln (\frac{z}{\text {LEL}}) \times 100\) so that 0 (shown by the solid vertical line) represents the threshold in each year and 5, for example, means having earnings approximately 5% above the threshold. Overlaid are actual and estimated counterfactual mean log hours. Vertical dotted lines indicate the excluded areas below and above the threshold (\(z_l\) and \(z_u\), respectively), and the dashed vertical line indicates the top of the dominated region. Excludes individuals with annual or weekly-equivalent earnings that take common round number values, and those missing information for hours.

This estimated polynomial allows us to interpolate counterfactual mean log hours in the interval \([z_l, z_u]\) affected by bunching: that is, to estimate what mean log hours (and, by subtraction, mean log hourly wages) at each earnings level would have been in the absence of behavioural responses to the notch. The slope of this counterfactual mean log hours function tells us, before bunching responses, to what extent as we move up the log earnings distribution people had higher earnings because they had higher hours rather than because they had higher wages. Across all three periods, the estimated slope is around 0.9 in the neighbourhood of the LEL: that is, around 90% of the extra (log) earnings of higher earners was because they worked longer hours and only 10% was because of higher (log) hourly wages.Footnote 36 This is precisely estimated and robust to the choice of polynomial order.

Under the assumption that those who do not bunch do not adjust their earnings at all in response to the notch, looking at actual and counterfactual mean log hours at different earnings levels, alongside the actual and estimated counterfactual earnings densities, can potentially tell us about the characteristics and behaviour of bunchers.Footnote 37

5.1 Decomposition of earnings responses

First, we use our estimated counterfactual hours profiles to look at how much of the earnings response of bunchers was through reductions in hours of work rather than hourly wages. The new tax responsiveness literature emphasises that responses to taxation might take the form of reduced hourly wages, rather than reduced hours, because recorded wages capture margins of behaviour such as effort and income shifting. Moreover, the discrete jump in tax liability at notches (unlike at kinks) means that it can make sense to accept lower hourly wages to locate below the threshold even if there is no compensating utility gain in leisure, reduced effort or anything else.

Saez et al. (2012), among others, argue that the responsiveness of taxable income is driven less by hours of work than by factors such as tax evasion, avoidance and income shifting. This conclusion is based mainly on how the taxable income of high-income individuals respond to income tax changes. However, the potential tax-reducing responses to NICs are different from income tax because the tax bases differ, and low-paid workers may respond differently from high-income individuals.Footnote 38

To investigate how employees responded to the notch at the LEL, we estimate how much of the total earnings reduction was accounted for by a reduction in hours. We do this by calculating the reduction in mean log hours as a fraction of the reduction in mean log earnings, using the (observed) actual and (estimated) counterfactual log hours and log earnings across the interval \([z_l, z_u]\) affected by bunching.Footnote 39 Formally, we estimate:

$$\begin{aligned} {\hat{h}} \equiv \underbrace{\left( \frac{ \sum _{j=z_l}^{z_u} {\tilde{h}}_{0j} f_{0j} }{\sum _{j=z_l}^{z_u}f_{0j} } - \frac{ \sum _{j=z_l}^{z_u} {\tilde{h}}_{1j} f_{1j} }{\sum _{j=z_l}^{z_u}f_{1j} } \right) }_{\equiv \Delta {\tilde{h}}} \div \underbrace{\left( \frac{ \sum _{j=z_l}^{z_u} {\tilde{z}}_{0j} f_{0j} }{\sum _{j=z_l}^{z_u} f_{0j} } - \frac{ \sum _{j=z_l}^{z_u} {\tilde{z}}_{1j} f_{1j} }{\sum _{j=z_l}^{z_u} f_{1j} } \right) }_{\equiv \Delta {\tilde{z}}} \end{aligned}$$
(1)

where \({\tilde{h}}_{1j}\) is the actual and \({\tilde{h}}_{0j}\) the counterfactual mean log hours in log earnings bin j; \(f_{1j}\) is the actual and \(f_{0j}\) the counterfactual number of individuals in log earnings bin j; and \({\tilde{z}}_{1j}\) is the actual and \({\tilde{z}}_{0j}\) the counterfactual mean log earnings in log earnings bin j.

Table 4 shows that the reduction in mean log earnings is estimated, with reasonable precision, at 0.010, 0.011 and 0.007, respectively, in the three periods 1983–1985, 1986–1989 and 1990–1998. As these are averages across both bunchers and non-bunchers, this implies that those who did bunch reduced their log earnings by 0.244, 0.169 and 0.144, respectively, in these three periods (around 20% on average).Footnote 40

However, the reduction in mean log hours is less precisely estimated, and when combined with that for earnings, results in an estimated share of the total earnings reduction accounted for by a reduction in hours that is associated with very large standard errors. The approach we have developed for decomposing earnings responses into hours and hourly wage responses thus does not appear to have enough power to be informative in our application, though it may yield more interesting results in other cases.

Table 4 Estimated share of total earnings reduction through hours.

5.2 Selection of bunchers

Our approach also allows us to identify selection in who responds to the incentives created by the notch at the LEL. To do this, we compare the actual mean log hours of non-bunchers with the estimated counterfactual mean log hours in the region above the notch affected by bunching; that is, we compare \({\tilde{h}}_1\) to \({\tilde{h}}_0\) in the interval \([n, z_u]\). This tells us whether non-bunchers (and, by extension, whether bunchers) were on average high-hours, low-wage workers or low-hours, high-wage workers compared to others with the same counterfactual earnings. Specifically, we test the hypothesis:

$$\begin{aligned} \sum _{j=n}^{z_u} f_{0j} {\tilde{h}}_{1j} - \sum _{j=n}^{z_u} f_{0j} {\tilde{h}}_{0j} = 0 \end{aligned}$$
(2)

Table 5 shows these estimates of mean log hours, along with the differences between them and the associated standard errors.Footnote 41 These differences are negative throughout, and statistically significant for the periods 1986–1989 and 1990–1998, indicating that those who bunched at the LEL were higher-hours, lower-wage workers than those who did not. We show in Sect. 3 that bunching at the LEL was almost entirely driven by the responses of part-time workers, but this is not inconsistent: since about 90% of workers around the LEL were part-time, the implication of our results is that higher-hours, lower-wage types among part-time workers were more likely to bunch than lower-hours, higher-wage types.Footnote 42

Given these estimates, and the counterfactual log earnings densities shown in Fig. 5, we can calculate how much higher bunchers’ mean log hours were than non-bunchers’: 0.129 in 1986–1989, and 0.124 higher in 1990–1998.Footnote 43 To give a sense of magnitudes, simply taking exponentials of these mean log hours suggests that bunchers would typically have worked around 19 h per week in the absence of the notch, whereas non-bunchers with the same earnings typically worked around 17 h per week.Footnote 44

Table 5 Actual and counterfactual hours in region above LEL affected by bunching.

6 Conclusion

This paper has investigated behavioural responses to income tax and social security contributions, exploiting cross-sectional variation created by thresholds in UK tax schedules over a 40-year period. At thresholds where the marginal tax rate fell, we found no sign of a dip in the density of the earnings distribution. But there was clear, if modest, bunching at thresholds where the marginal rate rose, especially the higher-rate income tax threshold. We found that company owner-managers and the self-employed were the most responsive to kinks in the income tax schedule, reflecting their greater scope for income manipulation, tax planning and evasion. In recent years company owner-managers have been much more responsive than the self-employed, whereas if anything the opposite was true in the 1990s.

In contrast, employees did not respond at all to kinks in the tax schedule. Notches, where the average rate of tax changes, provide compelling evidence that the reason for this is that most low-paid employees faced substantial optimisation frictions: while there was some bunching by employees below the notch at the National Insurance Lower Earnings Limit in the 1980s and 1990s, only a minority responded in this way. In some years, this meant non-responders paid an additional 9% of total earnings in employee contributions and a further 10.45% in employer contributions. Frictions of this magnitude suggest that long-run responses, or responses to large reforms, could be larger than implied by elasticities estimated from short-term responses to small tax differentials, and could play an important role in reconciling micro- and macro-based estimates of labour supply elasticities, as suggested by Chetty (2012).

We found that there was substantial heterogeneity in which employees bunched at the LEL. Those employed in the hospitality or retail sectors were far more likely to respond than those working in less flexible sectors. Part-time workers were more likely to bunch than full-time workers, though those part-time workers who did bunch were typically higher-hours, lower-wage types than those who did not. Notably, we found little difference in the responses of women and men conditional upon working part-time. This raises the question whether well-documented differences in observed behavioural responses between groups (e.g. men and women, employees and the self-employed) are a result of heterogeneity in underlying preferences or in frictions faced. If increases in the share of women working full- rather than part-time mean that they face larger frictions to adjusting their hours of work, that might help to explain why Blau and Kahn (2007), among others, find a fall in the elasticity of labour supply for women over time. Deepening our understanding of the nature of the frictions we highlight in this paper is not only crucial for integrating disparate evidence on elasticities into a sophisticated understanding of taxpayer responses, but may also have important policy implications in its own right, since to the extent that frictions prevent people from maximising utility, addressing their underlying causes may provide significant welfare gains.