1 Introduction

The benefits, costs, and rationale of policies promoting high-growth firms (HGFs) have been strongly debated (see e.g., (Acs et al., 2009; Bos and Stam, 2014; Autio and Rannikko, 2016; Brown et al., 2017; Grover Goswami et al., 2019)). This debate has emphasized that the main benefit of having high-growth firms is their ability to create jobs (e.g., (Henrekson and Johansson, 2010)). At the same time, additional effects of having more HGFs, including their productivity effects, are less clear, even though productivity growth, if not the sole aim of a policy, is likely to be at least one of the policymakers’ objectives, and as such, should appear in well-designed cost-benefit analyses of entrepreneurship policies. As Brown et al. (2017) put it: “This myopic focus on HGF growth rates has also largely overlooked crucially important aspects of firm growth, notably firm profitability and sustainability and wider productivity.” While some influential papers mention increased productivity growth as an established effect of HGFs,Footnote 1 others are very skeptical.Footnote 2

This paper sheds light on these wider productivity effects by quantifying the contributions of HGFs to industry-level TFP and labor productivity growth based on detailed microdata from Hungary. The concept of productivity contribution we use, which follows Foster et al. (2008), captures both within-firm productivity growth and Schumpeterian reallocation effect, and, as a result, it is closer to the “wider productivity effects” than focusing only on the productivity growth of high-growth firms. While a number of studies have investigated either the productivity growth of HGFs or their initial productivity level, the main novelty of our paper is using a concept which captures both these channels with a focus on key questions from the HGF literature.Footnote 3Footnote 4 Importantly, it is not automatic that correlations between firm size and productivity growth will generalize to this level: as we show, the industry-level contribution combines a number of correlations, and the within and reallocation effects often have opposite signs. Studying which firm-level results persist at this aggregation level is an important contribution of our paper.

Besides deciding on whether to implement HGF promotion, policymakers also have a choice between promoting firms growing along different dimensions. We analyze the contribution of HGFs defined in different ways to identify any potential trade-off between productivity and employment contribution.Footnote 5 Indeed a large literature has demonstrated that different definitions capture different sets of firms with different average characteristics (Delmar, 2006; Shepherd and Wiklund, 2009). Our results contribute to this debate by demonstrating the presence of a trade-off between our measure of “wider productivity effects” and job creation. Furthermore, we show that definitions differ in the relative importance of the within-firm and Schumpeterian contribution. These results are mainly driven by the negative or weak relationship between employment growth and productivity growth during the high-growth period, also documented by Coad and Broekel (2012) and Guillamón et al. (2017). In particular we find, the contribution during the high-growth phase depends on the definition used, varies widely across industries and years and it is often negative. Negative contribution mainly comes from negative productivity growth during the high-growth phase, which is more frequent in the case of employment-based HGFs. As a result, the average contribution of sales-based HGFs is substantially higher than that of employment-based HGFs for the high-growth period of three years

HGF policies are also often motivated by longer-term effects of high-growth firms, therefore, it is crucial to understand the longer-term effects of HGFs. A number of results confirm that HGF status itself is not persistent (Coad, 2007; Daunfeldt et al., 2015; Hölzl, 2014; Erhardt, 2021), which “seriously challenges the notion that policymakers can target high-growth firms in order to promote future firm growth” (Daunfeldt et al., 2014). Still, it is theoretically possible that large size growth is preceded or followed by longer-term productivity growth if growth was enabled by innovation or if size growth is followed by organizational innovations (Greiner, 1989).Footnote 6 We follow cohorts of high growth firms to identify their contribution in the long run and show that HGFs mainly contribute positively to productivity growth during their high growth period.

Our third line of investigation focuses on the role of industry features in HGF productivity contribution. We find that specific characteristics of industry dynamics, captured by two key moments, substantially influence the contribution of HGFs to industry-level productivity growth. Stronger reallocation and a stronger correlation between productivity and size growth imply higher reallocation. Policies promoting industry dynamics in these respects may increase the benefits of entrepreneurship policy. These correlations, however, change quickly and may not be suitable for targeting. We show that more fundamental industry features—concentration, productivity dispersion, the share of young firms, and the technology level of the sector—cannot strongly predict the HGF contribution. We only find evidence for a positive relationship between the share of young firms and HGF contribution. These results are in line both with the low predictability of HGFs (see e.g., (Coad, 2007; Coad et al., 2014; Coad and Srhoj, 2019)) and the role of young firms in the HGF phenomenon (Navaretti et al., 2014; Coad et al., 2016; Coad, 2018).

This article contributes to four strands of the literature. First, the literature has documented a complex relationship between high growth and productivity at the firm level (Acs et al., 2009; Bianchini et al., 2017; Moschella et al., 2019; Arrighetti and Lasagni, 2013; Du and Temouri, 2015; Daunfeldt et al., 2014). As Grover Goswami et al. (2019) concludes, in general, there seems to be no strong connection between productivity and high-growth status. Our results reinforce the conclusion that high growth and productivity are interlinked in complex ways and sheds more light on this relationship by systematically reviewing how different HGF definitions are related to initial productivity level and productivity growth during the HGF phase.

Second, a large literature has documented how HGF definition matters (Delmar, 2006; Shepherd and Wiklund, 2009), as well as suggesting novel definitions (Acs et al., 2009; Moschella et al., 2019; Daunfeldt et al., 2014). In the current paper, we also emphasize the differences between the various HGF measures in the context of their contribution to aggregate productivity growth.

Third, while many papers have shown that HGFs create a large share of new jobs (see the meta-analysis of (Henrekson and Johansson, 2010)), there are only a few papers investigating the contribution of HGFs to aggregate productivity growth. Daunfeldt et al. (2014) shows that the within contribution of HGFs to aggregate economic growth, employment growth, sales growth, and productivity growth varies across the different HGF measures and can be even negative in some cases. Considering a 7-year period they find that the total productivity growth of employment-based HGFs is negative, while that of sales-based HGFs is positive but relatively low, about 7-8% of the aggregate productivity growth. In the current paper, we go deeper, and by extending the Foster et al. (2008) framework we also consider the reallocation contribution and show that it is a substantial part of HGFs’ contribution. Additionally, we also consider cross-industry heterogeneity in HGFs contribution to aggregate productivity and investigate the industry characteristics which influence the magnitude of this contribution. The paper of Haltiwanger et al. (2016) is the closest to ours, as beyond looking at HGFs contribution to employment and real output growth they decompose the contribution of HGFs to industry-level productivity growth, focusing on the role of reallocation. They find that both HGFs and rapidly declining firms have a considerable contribution to aggregate productivity growth through reallocation. In addition to these results, we also look at the factors leading to cross-industry differences in the between and within terms of HGFs’ contribution to industry-level productivity. Moreover, we explicitly focus on differences by HGF definition comparing OECD- and Birch-type measures as well. Lastly, we also provide a simple model to show that the observed differences can be explained by a few moments of industry dynamics.

Finally, we contribute to the literature on reallocation (Foster et al., 2008; Hsieh and Klenow, 2009; Bartelsman et al., 2013; Restuccia and Rogerson, 2017; Baqaee and Farhi, 2020). Some papers already looked at how specific firm groups, like foreign-owned (Balsvik and Haller, 2006; Harris and Moffat, 2013) or exporters (Gleeson and Ruane, 2009; Fuss and Theodorakopoulos, 2018) contribute to aggregate productivity via within-firm growth and reallocation. We focus on the role of HGFs in aggregate reallocation and link that contribution to parameters characterizing the overall strength of reallocation, which has already been found important for aggregate productivity growth via different channels (e.g., (Andrews et al., 2015; Andrews et al., 2016)).

In what follows we first introduce our data and the decomposition methodology in Section 2. Section 3 introduces the different HGF definitions we use and presents descriptive statistics to show the differences between them. Section 4 presents our findings on the contribution of HGFs to aggregate productivity growth and the impact of industry characteristics. Section 5 presents a simple model, and Section 6 concludes.

2 Data and methods

2.1 Hungarian firm-level data

Our main source of information is the database of the firm-level corporate income tax statements during the period 2001-2016 from the Hungarian National Tax Authority (NAV). The dataset has almost universal coverage as it includes all firms using double-entry bookkeeping. The sample covers more than 95% of employment and value added of the business sector and about 55% of the full economy in terms of GDP. The dataset includes the most important balance sheet items and information on a wide range of matters such as ownership, employment, industry at the NACE 2-digit code level, and the location of the headquarters. The Centre for Economic and Regional Studies (KRTK) has extensively cleaned and harmonized the data (see Appendix 1). Nominal variables are deflated by the appropriate 2-digit industry-level deflators from OECD STAN.Footnote 7

Given the scope of our analysis, we restrict the data in several ways. First, we exclude non-profit organizations. Second, we drop firms that operate either in agriculture or in the non-market service sectors of the economy. Third, we drop all firms that never had more than 10 employees, because standard HGF definitions require firms to have at least 10 employees.

When quantifying productivity, we mainly rely on TFP, estimated with the method proposed by Ackerberg et al. (2015). We present robustness checks by using labor productivity, calculated as log value added per employee. Given our focus on productivity, we drop firm-year observations where TFP (and labor productivity) is missing because the firm did not report material cost, or its employees or capital stock was reported to be zero. We use the same productivity measures as in Muraközy et al. (2018), and we describe the procedure in more detail in Appendix 2.

The number of observations is reported in Table 1. Column (1) shows the number of firms in our main sample, falling between 26,000-33,600 per year, and including firms that ever had at least 10 employees in our sample period, i.e., which have the (theoretic) potential to be defined as a HGF.Footnote 8 Column (2) shows that around 60-70% of these firms have at least 10 employees in the specific year. According to Column (3), 75-85% of the firms in Column (2) is observed 3 years prior to the focal year, showing that 15-25% of the firms in our sample has entered the sample in the preceding 3 years.Footnote 9 Similarly, Columns (4) and (5) include the number of firms that can be still observed 3 and 6 years after the focal year, respectively, showing that 60-66% of the firms having at least 10 employees in the specific year are observable even 6 years later.

Table 1 Number of firms by year

Table 2 provides some basic descriptive statistics by year from our main sample. The average number of employees is between 40-45, suggesting that the overwhelming majority of the firms in the sample are SMEs. 13-17% of the firms are foreign-owned and 30-40% of them export. In line with the lower entry rate after 2000 compared to the 1990s, which was the decade after the transition, the share of young firms (at most 5 years old) decreases gradually over time, from 42% in 2000 to only 11% in 2016. The share of OECD employment HGFs is around 3-6%, with low values around the Great Recession in 2008/2009. The share of OECD sales HGFs is about twice as large as that of employment HGFs, also having lower values around the Great Recession. Table 7 in the Appendix shows similar statistics by 2-digit industry.

Table 2 Descriptives by year

2.2 Decomposition

Our decomposition is based on Foster et al. (2008), who split aggregate productivity growth into within, between, cross and net entry terms.

The original decomposition starts with the change between \(t_0\) and t (in our empirical exercise \(t=t_0+3\)) in aggregate productivity (\(\Delta PROD_t\)):

$$\begin{aligned} \Delta PROD_t&= \underbrace{\sum _{i\in C} \theta _{i,t_0}\Delta prod_{i,t}}_{within} +\underbrace{\sum _{i \in C}(prod_{i,t_0}-PROD_{t_0})\Delta \theta _{i,t}}_{\text {between}} + \underbrace{\sum _{i \in C} \Delta prod_{i,t}\Delta \theta _{i,t}}_{\text {cross}} + \\&\underbrace{\sum _{i \in N} \theta _{i,t}(prod_{i,t}-PROD_{t_0})- \sum _{i \in X} \theta _{i,t_0}(prod_{i,t_0}-PROD_{t_0})}_{\text {net entry}} \end{aligned}$$

where \(\theta _{i,t}\) is the employment share of firm i in year t, \(prod_{i,t}\) and \(PROD_t\) are productivity measures at the firm and aggregate level, respectively. \(\Delta\) always denotes the change between \(t_0\) and t. C stands for continuing firms, N for new entrants and X for exiting firms.

The within term captures the sum of firm-level productivity changes for continuing firms, weighted by their initial employment share. This term is large if firms, especially large ones, increased their productivity quickly. The between term captures the main channel of reallocation by quantifying the extent to which initially more productive firms grew faster in terms of employment. The cross term captures whether firms increasing their employment share were also able to improve their productivity. The net entry term is positive if new entrants were more productive relative to exiting firms.

Importantly, the reallocation is additive, and all these terms are sums of firm-level moments. Therefore, we can further distinguish between the contribution of HGFs and other continuing firms (similarly to (Haltiwanger et al., 2016)).Footnote 10 The decomposition becomes:

$$\begin{aligned} \Delta PROD_t =&\underbrace{\sum _{i\in HGF} \theta _{i,t_0}\Delta prod_{i,t} + \sum _{i\in otherC} \theta _{i,t_0}\Delta prod_{i,t}}_{\text {within}} + \\&\underbrace{\sum _{i \in HGF}(prod_{i,t_0}-PROD_{t_0})\Delta \theta _{i,t}+ \sum _{i \in otherC}(prod_{i,t_0}-PROD_{t_0})\Delta \theta _{i,t}}_{\text {between}} + \\&\underbrace{\sum _{i \in HGF} \Delta prod_{i,t}\Delta \theta _{i,t}+\sum _{i \in otherC} \Delta prod_{i,t}\Delta \theta _{i,t}}_{cross}+ \\&\underbrace{\sum _{i \in N} \theta _{i,t}(prod_{i,t}-PROD_{t_0})- \sum _{i \in X} \theta _{i,t_0}(prod_{i,t_0}-PROD_{t_0})}_{\text {net entry}} \end{aligned}$$

The total contribution of HGFs will be the sum of the three HGF terms, i.e., \(\sum _{i\in HGF} \theta _{i,t_0}\Delta prod_{i,t}+\sum _{i \in HGF}(prod_{i,t_0}-PROD_{t_0})\Delta \theta _{i,t}+\sum _{i \in HGF} \Delta prod_{i,t}\Delta \theta _{i,t}\).

The terms in this formula suggest that HGF contribution is likely to be large in two cases. First, if high-growth firms increase their productivity during their high-growth phase, both the within and cross terms tend to be positive. The HGF within term captures whether HGFs increase their productivity. In the HGF cross term \(\Delta \theta _{i,t}\) is positive by definition for all HGFs, hence the sign of the cross term is primarily determined by the sign of productivity changes. Therefore, both terms are mainly driven by productivity growth during the high-growth phase, but the within term weights firm-level productivity changes with their initial size while the cross term weights them by their size growth. The second way HGFs can contribute positively is via reallocation. If high-growth firms are more productive initially, reallocation of resources to them will improve aggregate productivity. This channel is captured by the between term.

In our main exercise, we decompose productivity growth for 3-year periods starting in each year between 2001 and 2016 for each 2-digit industry.Footnote 11 Productivity decomposition is usually less noisy in such ‘medium-term’ periods, which may better reflect the timeline of economic processes like reallocation. 3-year periods also correspond to the time span of the standard OECD HGF definition. We present these decompositions separately for different cohorts of HGFs. We define the cohort of year t as the group of firms that were in their high-growth phase between t and \(t+3\).Footnote 12

3 Definitions

This section introduces the different HGF definitions we use and presents a few patterns. This helps us (i) to understand how the different types of HGFs contribute to aggregate productivity growth and (ii) to investigate the relevant differences between the firms captured by the different definitions. To present the patterns in a transparent way, the figures and tables in this section mainly focus on firms that were HGFs between 2013 and 2016 (the 2013 cohort), which is our last cohort. The patterns are similar for other cohorts with few exceptions which we always note.

3.1 HGF definitions

The literature provides multiple definitions for HGFs (OECD, 2010), differing across two key dimensions. First, any type of size change can be measured in absolute or relative terms. One class of definitions relies solely on relative growth, while definitions in a second class use a combination of relative and absolute growth. For simplicity, we refer to the former as the OECD (based on (OECD, 2010)), and the latter as the Birch (Birch, 1981) method. Second, firms’ performance can be assessed based on employment (more generally, input) or sales (output) dynamics.

Table 3 presents the typical definitions used in the literature. Within the relative definitions, one can distinguish between employment- (input) and sales- (output) based OECD definitions. The OECD definition requires a firm to grow by 20% on average per annum for three years. The Birch definition captures firms that are in the top 5 percentile based on an average of absolute and relative growth. Again, we will distinguish between labor and sales growth-based definitions. To make the results comparable, we will use the 3-year time frame in all cases.

Table 3 HGF definitions

Appendix 3 compares HGFs defined in different ways in several dimensions and documents many results which are consistent with the literature. Different definitions cover a partly overlapping but different set of firms, with significant differences in initial size and growth along the different dimensions ( e.g., (Delmar et al., 2003; Delmar, 2006; Shepherd and Wiklund, 2009; Acs et al., 2009; Arrighetti and Lasagni, 2013; Coad et al., 2014; Daunfeldt et al., 2014; Moreno and Coad, 2015; Du and Bonner, 2017; Grover Goswami et al., 2019)). Also consistent with earlier results, we find that HGF status is often a transitory phase in the firm’s history, even though there is some persistence. (see (Delmar et al., 2003; Coad, 2007; Acs et al., 2009; Daunfeldt et al., 2015; Lopez-Garcia and Puente, 2012; Hölzl, 2014; Daunfeldt and Halvarsson, 2015; Coad and Srhoj, 2019)). Finally, we find that firms are less likely to exit following a HGF phase compared to other firms (e.g., (Acs et al., 2009; Choi et al., 2017; Mohr et al., 2014)), suggesting that exits following the HGF phase are unlikely to affect our results about longer-term dynamics.

3.2 Size growth and productivity growth at the firm level

Table 4 provides an initial view of size and TFP of differently defined HGFs at the firm level, and Table 8 repeats the same exercise with labor productivity. The tables show HGF performance four years before (\(t-7\)), one year before (\(t-1\)), four years after (\(t+4\)), and seven years after (\(t+7\)) the beginning of the high-growth phase. The premia are expressed as the (unweighted) average number of employees and total factor productivity level of HGFs relative to that of the average firm (scaled to be 100%).Footnote 13

Table 4 Employment and TFP before and after the high-growth phase by HGF definition

Starting with the relative/absolute dichotomy, we find that the two OECD definitions identify firms that are initially (\(t-1\)) considerably smaller than the average firm. On average, these HGFs employ 25-27% fewer employees before their high-growth phase than the average firm, and 55-156% more right after that (in \(t+4\)). In contrast, using the Birch definition, HGFs are 122-160 percent larger than the average firm even before the high-growth phase, increasing their premium to 372-472 percent right after the high-growth phase.

Irrespective of the definition used, HGFs create a substantial number of jobs. As expected, average employment growth during the HGF period is somewhat higher under the employment-based than under the sales-based definition. Using employment-based definitions, relative to the average firm, the average HGF’s employment grows from \(t-1\) to \(t+4\) by 184 percentage points based on the OECD definition, and by 350 percentage points based on the Birch definition. Using sales-based definitions yields smaller results, but the magnitudes are still noteworthy: 80 and 212 percentage points according to the OECD and the Birch definition, respectively. The large growth of initially larger Birch HGFs hints at a larger potential contribution of these firms.

Regarding productivity, apart from the OECD sales definition, HGFs are initially (i.e., in \(t-1\)) 21-35% more productive than the average firm.Footnote 14 This suggests a potentially positive reallocation effect for these three groups of HGFs.

Importantly, during the high-growth phase (approximated by the change between \(t-1\) and \(t+4\)), sales-based HGFs experience a productivity increase (by 13-18 pp), while the productivity of input-based HGFs falls (by 20-23 pp). This suggests that the within contribution of output-based HGFs is likely to be positive, while that of input-based HGFs may be negative.

A key message of this table is that there seems to be some trade-off between job creation and productivity growth, but this is not very strong: employment-based HGFs create more jobs while sales-based HGFs increase their productivity more. However, the difference between the two groups in terms of productivity growth is much more characteristic than their difference in job creation. Sales-based HGFs increase their productivity substantially and create many jobs at the same time, while employment-based HGFs face declining productivity.

Table 4 also allows us to follow the firms before and after their high-growth phase. Compared to the developments during the high-growth period, the productivity and employment changes before and after that are relatively small. Still, the high-growth phase seems to be typically preceded by productivity growth (growing TFP between \(t-4\) and \(t-1\)) for all the four definitions, although to a different extent. On the contrary, the average productivity levels are very similar right after the end of the growth period (\(t+4\)) and three years afterward (\(t+7\)) and in some cases they become even lower. There is no further productivity increase after the end of the high-growth phase.

The fact that these pre- and post-trends in terms of both employment and productivity are relatively modest compared to changes during the high-growth period suggests that high growth tends to be a transitory phase in most firms’ life.Footnote 15 From an empirical point of view, this finding suggests that by focusing on the high-growth period one is likely to capture the bulk of HGFs’ productivity contribution. We use a more formal event study methodology in Section 4.2 to quantify HGFs’ contribution before, during, and after the high-growth period.

4 HGFs’ contribution to total productivity growth

While the previous section has provided descriptive evidence for HGFs’ productivity and employment growth patterns, here we go deeper, focusing explicitly on HGFs’ contribution to productivity growth, which is the main interest of this paper.

In the following, we rely on the decomposition methodology presented in Section 2.2. Our unit of observation is a cohort (c) in industry (j). We denote the base year of cohort c by \(t_0^c\) and firms are considered as part of the cohort if their annual growth rate between \(t_0^c\) and \(t_0^c+3\) was above the threshold prescribed by the relevant definition. The productivity contribution of cohort c in industry j between \(t_0^c\) and \(t_0^c+3\) will be denoted by \(cont_{jc}\). These objects, which are defined at the industry-cohort (year) level, will be our units of observation.

4.1 How much do HGFs contribute during their high-growth phase?

Let us start with the overall average of HGF contributions across industries and cohorts, \(cont_{jc}\) during their high-growth phase. Table 5 shows that the total HGF contribution differs strongly across definitions. The table shows the contribution in percentage points of the total industry-level productivity growth, i.e., OECD (emp) HGFs contributed 0.58 pp on average. In general, it is larger for sales-based than for employment-based definitions and it is also larger for Birch definitions than for OECD ones. Birch-sales HGFs contribute the most on average (3.93 percentage points), followed by OECD sales HGFs (2.94 pp.). The contribution of employment HGFs is substantially lower, with 0.58 pp. for the OECD and 1.05 pp. for the Birch definition. The difference between the Birch and OECD definitions is likely to result from the larger size of Birch firms as well as from the fact that it is a stricter HGF definition than the OECD one.

Table 5 Decomposing the HGF contribution to TFP growth

The productivity decomposition exercise reveals the source of these differences. First, the within contribution of the sales-based definitions is strongly positive while that of the employment-based definitions is small and rather negative. This comes from the fact that productivity typically increases (or is stable) for sales-based definitions while it falls for the employment-based definitions (see Table 4). The average between effect is positive for all the definitions, showing that HGFs are typically more productive than their peers. There is some variation, mainly resulting from the differences in initial productivity advantage and in the employment growth rate during the high-growth period. Finally, the cross contribution is negative for all definitions. This reflects that overall, there is a negative correlation between size and productivity growth, as we will discuss in the next section.

A policy-relevant insight from these patterns is that output HGFs, which contribute more than input HGFs, typically contribute via their within term, while their initial productivity level is not especially high (see Table 4). Therefore, HGF policies that only target firms with high initial productivity levels – in the implicit hope of a strong within contribution – will capture potential input HGFs and miss potential output HGFs. Similarly, HGF policies centered on promoting employment growth rather than output growth may capture firms that contribute less to productivity growth.

While these absolute numbers are of some interest, it is probably more relevant to ask whether HGFs contribute more than the average firm. In other words, whether the share of HGFs’ contribution is larger than their (initial) share in terms of inputs or outputs. Figure 1 illustrates this relationship at the industry level, with the average initial employment share of HGF cohorts on the horizontal axis and the average ratio of total HGF contribution to industry-level TFP growth on the vertical axis.Footnote 16

Fig. 1
figure 1

The initial employment share of HGFs and their TFP growth contribution as a share of industry-level TFP growth. This figure shows the relationship between the initial employment share of HGFs and their productivity contribution during their high-growth period. These numbers are averaged across cohorts for all 2-digit industries, and we omit year-industry combinations in which industry productivity growth was negative. The dashed line is the 45-degree line

Again, there is a quite obvious dichotomy between employment- and sales-based HGF definitions. Sales-based HGFs tend to contribute substantially compared to their initial share. The initial employment share of OECD (Birch) sales HGFs is 7.1% (18.3%) on average while their share in TFP growth is 24.7% (28.3%). In contrast, employment-based HGFs often contribute negatively and tend to be below the 45-degree line. The initial employment share of OECD (Birch) employment HGFs is 3.1% (14.7%) on average while their share in TFP growth is 0.2% (3.1%).Footnote 17

The average effects in Table 5 hide a large degree of heterogeneity. Figures 2 and 9 (the latter for labor productivity) focus on the heterogeneity of \(cont_{jc}\). Clearly, the HGF contribution is not necessarily positive for either of the definitions. The 10th percentile is negative for all four definitions. On the other extreme, in some industries a very substantial part of productivity growth results from the activities of HGFs. For example, in 20% of the industries with positive productivity growth, more than half of total productivity growth is contributed by sales-Birch HGFs.

Fig. 2
figure 2

Productivity contribution of HGFs by industry-cohort. This figure shows the kernel distribution of the productivity contribution of HGF cohorts at the 2-digit industry-cohort level between \(t_0\) and \(t_0+3\), winsorized at the 5th and 95th percentiles. The horizontal axis shows the HGF contribution in log percentage points during the 3-year period

The distributions provide a more complex picture than the means in Table 5. Within the OECD/Birch dimensions, the difference in means clearly transforms into stochastic dominance. The distribution of OECD (sales) firms’ contribution dominates that of OECD (emp) firms and the same is true for the Birch definitions. The variance of the Birch definitions is substantially higher than that of the OECD definitions, with both having thicker tails than the two OECD definitions. Therefore, while the mean and median of the OECD (sales) distribution are larger than that of the Birch (emp) distribution, the top percentiles of the Birch (emp) distribution are larger. Clearly, idiosyncratic characteristics of some of the larger firms captured by the Birch definitions can have very substantial effects on industry productivity, with a strong upside, as shown by the very thick upper tail of the Birch (sales) distribution. In contrast, the OECD (sales) distribution is less dispersed, with few negative and some decent-sized contributions.

This section has documented several patterns about HGFs’ contribution during their high-growth phase. However, they may also contribute in other periods. The next section investigates this topic with an event study approach.

4.2 The contribution of HGF cohorts over time

While the lack of average pre- and post-trends presented in Table 4 is suggestive of HGFs’ productivity and size growth being concentrated in the HGF phase, we use an event study methodology to investigate whether these findings can be generalized to the productivity contribution itself.Footnote 18

For the event study, we extend our previous notation as we need to follow each cohort (c) for a number of periods (p).Footnote 19 Here we denote contributions by \(cont_{cjp}\). To present the results, we run event study-type regressions at the industry-cohort-period level of the form:

$$\begin{aligned} cont_{cjp}=\sum _{\theta } \delta ^{\theta }_{cp} +\eta _j+\xi _p+\epsilon _{cjp} \end{aligned}$$
(1)

where c indexes cohorts, j industries and p periods. \(\delta ^{\theta }_{cp}\) are event time dummies taking the value of 1 when the \(t_p-t_0^c=\theta\), where \(t_p\) is the beginning of period p and \(t_0^c\) is the initial year for cohort c. The coefficients of these event time dummies show the difference between the actual contribution of the cohort relative to its contribution in the base period, \(t_0^c-6\) (which is the omitted period).Footnote 20\(\eta _j\) and \(\xi _p\) are 2-digit industry and period fixed effects, respectively. We weigh the regressions by the number of employees of the industry at the beginning of the period.

Figure 3 shows the results for TFP and Fig. 8 for labor productivity. The horizontal axis represents the event time, and the vertical axis shows the point estimates and the standard errors of the event time dummies. Zero is the contribution in the base period, starting at \(t_0-6\).Footnote 21 We can draw two main conclusions. First, relative to the omitted period, we find positive contributions during the high-growth period \([t_0;t_0+3]\)) for the sales-based definitions and negative contributions for the employment-based definitions. The additional positive contributions of the sales-based HGFs during the high-growth period are substantial: OECD (sales) HGFs contribute by 2.3 percentage points while Birch (sales) HGFs by 2.8 percentage points.Footnote 22 This pattern contrasts strongly with that of the employment HGFs, which report a negative contribution during their high-growth phase compared to their contribution in the base period.

Fig. 3
figure 3

Total TFP contribution: event study. This figure shows the results of an event study regression in which one observation is the productivity contribution of a cohort of HGFs in a 2-digit industry in different event-times before and after the high-growth period. HGFs are firms that are in their high-growth phase in the 3-year period starting with year t. The vertical axis shows the productivity contribution of HGFs in percentage points, normalized to zero two periods before the high-growth phase, i.e., in \([t-6,t-3]\)

Second, regarding timing, HGFs tend to contribute unusually during their high-growth period, between \(t_0\) and \(t_0+3\). We find evidence for a positive contribution before the high-growth phase for the Birch (sales) HGF, which is about half as large as the one observed during their high-growth phase.Footnote 23 This suggests that high sales growth of larger firms often takes place after a period when both productivity and size are growing. Importantly, in line with our earlier observations in Section 3.2, we do not find evidence for extraordinary contributions after the high-growth phase for any of the HGF definitions. This latter observation implies that the long-run productivity contribution of HGFs is likely to be similar to their short-term contribution.Footnote 24

4.3 Industry characteristics and HGF contribution

Motivated by the substantial heterogeneity across industries and periods, as documented by Fig. 2, this section investigates some of the industry characteristics which may influence the role HGFs play in productivity growth.Footnote 25 We use two approaches. First, based on our decomposition, we focus on the dynamic parameters which show up directly in the decomposition: the strength of reallocation and the correlation between size and productivity growth. The second approach, instead, focuses on more fundamental structural parameters, such as the technology of the industry, concentration and the share of young firms within the industry.

Dynamic correlations

Our first approach starts from the decomposition in Section 2.2. The first relevant correlation is the strength or efficiency of reallocation, \(\rho ^{real}\). This captures the extent to which more productive firms expand in terms of their employment or market share faster, represented by \(corr(prod_{it_0}, \Delta \theta _{it})\). Clearly, this determines how likely do HGFs come from more productive firms. This ‘selection effect’ has an impact on the between term, \(\sum _{i \in HGF}(prod_{i,t_0}-PROD_{t_0})\Delta \theta _{i,t}\), by showing the extent to which HGFs are initially more (or less) productive than the average firm. We expect a stronger reallocation to be positively associated with the between components and, in turn, with productivity growth.

A second key measure is the extent to which size growth is accompanied by productivity growth \(corr(\Delta prod_{it}, \Delta \theta _{it})\), which we call the cross-correlation, \(\rho ^{cross}\). This correlation captures a key property of industry dynamics: whether firms improving their productivity can acquire sufficient resources rapidly enough to also expand on the market in the same period typically considered in HGF definitions. The cross-correlation is the fundamental behind both the HGF within and cross terms. While the cross-correlation shows up directly in the cross term, it also affects the within term positively, since with a stronger cross-correlation HGFs are more likely to improve their productivity. Empirically, we define these correlations for industry j in year t based on levels in year t and changes between t and \(t+3\). Therefore, the efficiency of reallocation for firms i in industry j is \(\rho ^{real}_{jt}=corr(prod_{it}, \theta _{i,t+3}-\theta _{i,t})\). Similarly, the cross-correlation is \(\rho ^{cross}_{jt}=corr(prod_{it+3}-prod_{it}, \theta _{i,t+3}-\theta _{i,t})\). In our data, the reallocation correlation is positive, with an average value of 0.14. The cross-correlation is typically negative, with an average of -0.1, i.e., 3-year size growth is negatively correlated with productivity growth during the same period.

We run regressions to investigate how these correlation parameters are related to the industry-level HGF contribution. The dependent variable is the HGF contribution of cohort c in industry j during its high-growth period, between \(t_0^c\) and \(t_0^c+3\). The main independent variables are dummies showing the quartile (q) of the industry in the overall \(\rho ^{real}\) and \(\rho ^{cross}\) distributions while controlling for year fixed effects (\(\delta _t\)).Footnote 26 Therefore, we estimate the following equation:

$$\begin{aligned} cont_{jc}=\sum _q\beta ^{real}_q (\rho ^{real})^q_{jt_0^c}+\sum _q\beta ^{cross}_q (\rho ^{cross})^q_{jt_0^c}+\delta _t+\epsilon _{jc} \end{aligned}$$
(2)

We weight the regression by industry employment and calculate standard errors clustered at the industry level.Footnote 27

Regression results are presented in Fig. 4 and Panel A of Table 9. We find that HGF contribution increases in these industry dynamics parameters for all four definitions. Clearly, stronger industry dynamics both in terms of reallocation and cross-correlation are associated with higher TFP contributions. The difference in terms of contribution between the industries with the weakest and the strongest reallocation is between 1.7 and 3.4 percentage points, depending on the definition. This is quite large given that the average contribution is between 0.6-3.9 pp. for the various definitions (Table 5). The strength of reallocation matters more for the Birch definitions. The relationship between the cross-correlation and the HGF contribution is similar, with 2.8-7.1 pp. difference between industries with high vs. low cross-correlations.

Fig. 4
figure 4

The relationship between industry dynamics parameters and HGF contribution. This figure shows point estimates from Eq. 2, in which the dependent variable is the HGF contribution to productivity growth of a HGF cohort of an industry in a 3-year period, while the explanatory variables are quartiles of \(\rho ^{real}\) and \(\rho ^{cross}\). The different lines show the point estimates for the four different HGF definitions. Panel A presents the coefficients of the different quartile dummies for \(\rho ^{realloc}\), relative to the first quartile (base category). Panel B presents similar results for \(\rho ^{cross}\). The full circles show that the coefficient is significant at the 10% level. For example, the rightmost point corresponding to Birch (emp) HGFs in panel A shows that such HGFs in an industry which has a high (top quartile) \(\rho ^{realloc}\) contribute about 2.9 pp. more to industry-level TFP growth compared to an industry with a low (bottom quartile) \(\rho ^{realloc}\), and this difference is significant at the 5% level. The corresponding regression results are presented in Table 9, Panel A. Year dummies are included, and standard errors are clustered at the industry level

Panel B of Table 9 presents estimation results of Regression 2 using labor productivity to define the dependent variable. We find patterns similar to TFP, suggesting that these patterns are not sensitive to the definition of productivity.

Figure 10 decomposes the relationship found in Fig. 4 by re-running Regression 2 with HGF within, between and cross contributions as dependent variables. In line with the arguments stated at the beginning of this section, we find that \(\rho ^{real}\) mainly affects the between component by making more productive firms more likely to become HGFs. In contrast, \(\rho ^{cross}\) is related to the within and cross components.

Fundamental industry characteristics

While the previous results confirm that the reallocation and cross-correlations are significantly related to HGF contribution, these correlations are not easy to observe or interpret. Therefore, our second approach investigates the extent to which one is able to predict HGFs’ contribution to productivity growth based on more fundamental industry parameters.

We modify the basic specification in Eq. (2) including several basic industry features measured in \(t_0\) instead of reallocation and cross-correlation quartiles as explanatory variables. We capture market structure by concentration (C5) and by the standard deviation of TFP. We also include the share of young firms (with age of at most 5 years) which, as Panel B of Table 5 shows, tend to contribute substantially to productivity growth, accounting for 31-75% of the total HGF contribution.Footnote 28 Finally, we add the technology/knowledge intensity classification of the industry based on Eurostat’s categorization. The base category is primary industries, and we distinguish between high-tech manufacturing, low-tech manufacturing, knowledge-intensive services (KIS), and less knowledge-intensive services (LKIS).Footnote 29

The results of this exercise are reported in Table 6. Column (1) reports the baseline specification. We find that the HGF contribution is larger in industries with more young firms: a 10 pp. increase in the share of young firms is associated with a 1.5 pp. higher HGF contribution. Services and manufacturing industries also seem to differ to some extent, with knowledge-intensive services featuring a lower HGF contribution than other industries. Concentration does not seem to matter. Importantly, the explanatory power of the regression is quite moderate, at 15%, showing that these industry features have limited explanatory power in terms of HGF contribution to productivity growth.Footnote 30

Table 6 HGF contribution to TFP growth and industry characteristics

In column (2) we also include the TFP growth contribution of non-HGFs which is likely to capture industry-year level shocks to productivity, a potential omitted variable. The patterns remain similar, though the significance levels are somewhat smaller.

To sum up, we have found that stronger reallocation, measured by industry-level correlations, is positively related to the contribution of high-growth firms. These correlations, however, may be hard to observe for policymakers and may change frequently. Basic industry characteristics predict the HGF contribution only to a limited extent, but we have found evidence for the share of young firms being correlated with the HGF contribution.

5 A benchmark model for HGF contributions

After we have examined the patterns of HGF contributions to aggregate productivity, in this section we ask whether the patterns we find are “surprising” or can be predicted by a simple benchmark model. If the benchmark model of firm dynamics reproduces most findings, the simple framework can help economists or policymakers think about HGF contributions.Footnote 31

Our simple simulation starts from a normal joint distribution of productivity, size (input), productivity growth, and size (input) growth. This distribution is easy to be characterized by the correlations between these four variables, including the strength of reallocation and cross-correlations. HGFs are defined as firms growing strongly either in terms of their input use or in terms of output.

We find this approach useful for several reasons. First, we show that our observations, at least qualitatively, are indeed easily predicted by this benchmark. As an important consequence, the average HGF contribution is determined by overall industry dynamics to a large extent rather than by idiosyncratic dynamics of specific firms. Second, our framework supplies policymakers with a few intuitive and measurable concepts and parameters affecting the productivity contribution of HGFs, such as whether HGFs are defined based on their inputs or outputs, the strength of reallocation, and the cross-correlation in the industry. Taking into consideration these concepts can help in designing effective policies. Finally, as we illustrate through the example of SMEs in Appendix Section 2, the same logic can be applied to thinking about the productivity growth contributions of other groups of firms defined by some easily observable characteristics.

Section 5.1 describes our simple approach and Section 5.2 illustrates the difference between input- and output-based HGFs in this framework. We provide more details in 4, where Appendix Section 1 shows how the simulation reproduces the qualitative patterns found in Section 4.3 and Appendix Section 2 presents how the same simulation framework can be used to model the contribution of other groups of firms via the example of SMEs.

5.1 Method: simulation

Based on the decomposition framework, we simulate data in 4 dimensions: (ln) initial input level, (ln) initial productivity level, input growth, and productivity growth. The growth rates are interpreted as the 3-year growth rates used in the empirical exercise. We simulate a 4-dimensional normal distributionFootnote 32 for these variables with different correlation structures. Note that these include the two key correlations analyzed in Section 4.3: the strength of reallocation (the correlation between productivity level and subsequent input growth) and the cross-correlation (the correlation between input growth and productivity growth). As a baseline, we calibrate these correlations from the Hungarian data.Footnote 33 To keep things as simple as possible, we think of productivity as simply dividing output produced with the amount of inputs used. Therefore, \(output=input \times productivity\).Footnote 34 As a result, output growth can be written as: \(\Delta output=\Delta productivity + \Delta input\), where \(\Delta\) is interpreted as a (log) percentage change.

We calculate aggregate productivity growth on these simulated data and decompose it with the method described in Section 2.2. We define HGFs based on both input and output growth, as this has proved to be the key distinction in Section 3.1.Footnote 35 To make the results comparable across specifications, we define input HGFs as firms with the top 5% input growth. Output HGFs are the top 5% in terms of their \(\Delta productivity + \Delta input\) value. We simulate 20,000 firms each time.

5.2 Input and output HGFs

In this section we illustrate the differences between input and output HGFs predicted by our simple model in three steps and contrast them with our previous empirical results. First, related to the between term, we investigate how productivity is related to the probability of becoming a HGF. In our empirical exercise, we have seen that input-based HGFs are on average more productive at the beginning of their HGF period. Second, we show the difference between the two types of firms by presenting the joint distribution of size and productivity growth and the within contribution of the two groups. In the empirical part, we found that output-based HGFs are the ones that tend to increase their productivity throughout the HGF period. Third, we simulate the distribution of the two firm groups’ total contributions. In the previous empirical exercise, we found that output-based HGFs have a higher contribution on average.

We illustrate the difference between input- and output-based HGFs with the simple simulation described in the previous section. Let us start with reallocation. Figure 5 depicts the share of HGFs within 20 productivity quantiles using parameters partly calibrated from the Hungarian data.Footnote 36 The share of input-based HGFs is positively associated with productivity, mainly due to the positive reallocation correlation: more productive firms are more likely to increase their inputs. This implies that reallocation to input HGFs indeed contributes positively to industry-level productivity growth, represented by a positive reallocation term.

Fig. 5
figure 5

Input- and output-based HGFs: reallocation effect. This figure shows the simulated relationship between initial productivity level and the share of input- and output- based HGFs. We normalize each of the variables in such a way that their standard deviation is 1. The correlations are: \(\rho _{real}=0.1441\); \(\rho _{cross}=-0.1\); \(corr(\Delta prod, prod)=-0.41\); \(corr(prod, inp)=0.1418\); \(corr(\Delta inp, inp)=-0.1852\); \(corr(inp, \Delta prod)=0.0011\)

In contrast, and possibly more surprisingly, the share of output-based HGFs is decreasing in productivity. By our definition, output-based productivity growth can either result from rapid input growth and/or strong productivity growth. While more productive firms are more likely to increase their size (as witnessed by the positive reallocation correlation), less productive firms are more likely to experience productivity growth, as evidenced by the convergence correlation, \(corr(\Delta prod, prod)\), which has an average value of -0.41.Footnote 37 Given its larger absolute value, convergence correlation dominates the reallocation correlation in the relationship between productivity and the share of output HGFs, leading to a negative overall relationship.

These findings are clearly in line with the descriptive patterns reported in Table 4 for OECD HGFsFootnote 38: output HGFs tend to be less productive initially than input HGFs – therefore the within contribution of the former is likely to be smaller.

Figure 6 illustrates the within contributions of the simulated input- and output-based HGFs. The cluster of points represents firms in the simulated industry, each point showing a firm’s input growth (horizontal axis) and productivity growth (vertical axis) in a (3-year) period. The correlation between these two quantities, the cross-correlation, is slightly negative, calibrated from the Hungarian data. Input HGFs are to the right of a critical value (represented by the vertical dashed line), denoted by squares and triangles.

Fig. 6
figure 6

Input- and output-based HGFs: within effect. This figure shows the scatter plot of input and productivity growth of firms in the simulated data. Input HGFs are those with input growth above a threshold, denoted by squares and triangles. Output HGFs are denoted by diamonds and triangles, thus the triangles are both input and output HGFs. \(HGF_{out}\) represents the average productivity growth of output HGFs and \(HGF_{inp}\) is the average productivity growth of input HGFs. The correlations are: \(\rho _{real}=0.1441\); \(\rho _{cross}=-0.1\); \(corr(\Delta prod, prod) =-0.41\); \(corr(prod, inp)=0.1418\); \(corr(\Delta inp, inp)=-0.1852\); \(corr(inp, \Delta prod)=0.0011\). The mean of all variables is normalized to zero

As we have already mentioned, output growth can be expressed in the model as \(\Delta output=\Delta productivity + \Delta input\). Consequently, output HGFs are firms above a line where \(\Delta productivity + \Delta input>c\), with c being a critical value. Such firms are located above a specific line with a negative slope and are denoted by diamonds or triangles. Firms that are both input and output HGFs are denoted by triangles.

The figure illustrates some of our earlier points. First, the intersection between input and output HGFs is not especially large (as seen in Table 11). Input HGFs rapidly increase their size but their productivity growth is rarely exceptional. Most output HGFs qualify because of their productivity growth but many of them do not expand too much in terms of their size, as we have shown in Table 4.

These patterns clearly explain the differences between the within contributions of input and output HGFs. The average productivity growth of input HGFs (denoted by \(HGF_{inp}\) on the vertical axis) is slightly negative because of the negative cross-correlation, which implies a negative within component for these firms. In contrast, average productivity growth of output HGFs (denoted by \(HGF_{out}\) on the vertical axis) is positive, and hence, the within component is strongly positive in this group.

We have presented intuition for the difference between input- and output-based HGFs in terms of their within contributions. Now we continue with simulating the distribution of the total contributions (similar to Fig. 2). In this figure, the distribution represents industry-years with different parameters and with different random components. Therefore, in our simulation, we also vary industry parameters across observations. Figure 7 shows this distribution with industry correlation parameters (\(\rho _{real},\rho _{cross}, corr(\Delta prod, prod), corr(prod, inp), corr(\Delta inp, inp), corr(inp, \Delta prod)\)) chosen randomly from a uniform distribution.Footnote 39 This figure reproduces the main patterns of Fig. 2: first, the contribution of output-based HGFs clearly stochastically dominates that of input-based HGFs; second, the distribution corresponding to output-based HGFs is more asymmetric with a long right tail but very few negative contributions.

Fig. 7
figure 7

Simulated productivity contribution of input- and output-based HGFs across industries with different dynamics. This figure shows the kernel density of the simulated contributions of input- and output-based HGFs across industries with different parameter values. The correlations (\(\rho _{real}\), \(\rho _{cross}\), \(corr(\Delta prod, prod)\), corr(prodinp), \(corr(\Delta inp, inp)\), \(corr(inp, \Delta prod)\)) are chosen randomly and independently from a uniform distribution with a range \([-0.5,0.5]\). The standard deviations of the variables are also drawn randomly and independently from a uniform distribution with a range of [0, 1]. The average productivity growth is also uniformly random, [0, 0.2]. The contributions are winsorized at the 2nd and 98th percentiles

This section has shown that a simple model of firm input and productivity growth generates patterns that are qualitatively very similar to our empirical observations.

6 Conclusion

This paper investigates the role of HGFs in reallocation and industry-level productivity growth. We have shown that productivity contributions differ widely, ranging from a negative contribution in half of industry-period observations for the OECD (emp) definitions to HGFs accounting for more than half of the industry-level productivity growth in 20 percent of cases for the Birch (sales) definition. HGF definition clearly matters. The positive contribution is limited to the high-growth period or the one preceding it, but HGFs do not contribute positively to aggregate productivity afterwards. The size of their contribution is linked to reallocation but cannot be predicted very effectively with fundamental industry characteristics. Finally, a very simple simulation can reproduce most of these findings.

Our results provide a number of insights regarding high-growth firms. Many of the patterns documented at the firm level have fundamental consequences in terms of industry-level “wider productivity effects”. These links between the firm-level patterns and industry-level contributions are not automatic or trivial. Our measure captures both within-firm and reallocation effects, which often have opposite signs and therefore their net effect is not clear a priori.

One link between the firm-level results and industry-level contribution is the fact that employment growth and productivity growth are weakly or negatively correlated (Coad et al., 2014) and that initial productivity is not a strong predictor of high growth, probably because of inefficient reallocation (Grover Goswami et al., 2019) or weak selection in the framework of evolutionary theories (e.g. (Metcalfe, 1994)) are not too strong. This pattern implies two key trade-offs. First, as a result of the former correlation, the contribution of sales HGFs comes primarily from the within term, while that of the employment HGFs is dominated by the Schumpeterian effect. This fact, combined with the second correlation, which implies that the reallocation effect is generally weak, leads to much stronger productivity contribution by sales-based HGFs.

Another key link between the firm- and industry-level patterns is that the low persistence of firm-level HGF status seems to be associated with low persistence of the productivity contribution. Positive unusual contributions are concentrated at the high-growth period of the cohort. We find that size or productivity growth is typically not followed by further improvements in productivity at the cohort level. This finding is in line with the firm growth model of Penrose (1959), which emphasizes that size growth is often followed by reduced productivity and the cycle models (see e.g. (Greiner, 1989)), which predict a management crisis following growth periods.

A third link between firm- and industry-level patterns is that neither the HGF status nor the industry-level HGF contribution can be predicted with much confidence based on industry-level characteristics. Again, even though these are interlinked, this link is not automatic: even if the HGF status is not linked strongly to industry features, key industry correlations, such as the correlation between employment and productivity growth could be. This, however, is not the case, suggesting that targeting industries has limited rationale when productivity contribution is concerned.

We argue that these findings are of high policy relevance. The productivity evolution of high-growth firms is not only important because of HGFs’ direct contribution to productivity growth but also because more spillovers may be expected from more productive firms (e.g., (Stoyanov and Zubanov, 2012)). The insight that sometimes HGFs contribute negatively to productivity growth should warn policymakers that HGF promotion policies can have negative side effects on aggregate productivity, thus these effects should be quantified when designing and evaluating such a policy. The second conclusion on the importance of HGF definitions implies a key trade-off across definitions between productivity growth and job creation. A policy focusing on employment growth (i.e., input-based HGFs) may generate a small or negative productivity effect, while, according to our results, output-based HGFs create only somewhat fewer jobs but contribute to productivity growth substantially. Policies limited to firms that are very productive to start with may miss these firms. Our final insight emphasizes that HGFs are likely to have a higher contribution in environments that are in general conducive to efficient reallocation, while the low predictability of HGF contributions limits the extent to which industry-based targeting can be efficient.

Our research has two key limitations. First, the results are limited to one country, and investigating their external validity on microdata from other countries would be important. Such research would also help in identifying better how country and industry characteristics are related to the HGF contribution. Second, our method only considers the direct contribution of HGFs to productivity growth and ignores external effects. In this respect, it mainly complements recent research on such externalities (de Nicola et al., 2019; Du and Vanino, 2020).