1 Introduction

Productivity is viewed as the most important long-run driver of economic growth in both economic theory and empirical research. According to Klenow and Rodríguez-Clare (1997), total factor productivity (TFP) growth accounts for 90 % of the international variation in output growth. Easterly and Levine (2001) argue that the major empirical regularities of economic growth indicate an important role for the residual rather than for factor accumulation. Productivity growth is also regarded as an important underlying reason for China’s remarkable economic growth since 1978, in addition to other proximate determinates such as physical and human capital accumulation (see, for instance, Ding and Knight 2011).

In this paper, we explore the determinants of China’s productivity growth using a comprehensive firm-level dataset over the period 1998–2007 (see also Brandt et al. 2012). The novelty of this paper lies at least in the following four aspects. First, when estimating TFP, we include directly in our (production function) model the determinants of TFP for which we have data (such as firm ownership, export behavior, age, political affiliation, intangible assets, liquidity, and geographic location). This is important as omission of these variables would produce biased estimates of the production function, and thus biased estimates of TFP. It is also important since we are interested not just in obtaining estimates of TFP, but also in what drives TFP in China. Second, unlike most previous studies, which rely on the method of Olley and Pakes (1996) or Levinsohn and Petrin (2003) to construct TFP, we use a system Generalized Method of Moments (GMM) estimator (Blundell and Bond 1998). We believe it is important to use this approach as many studies have shown that firms have (unmeasured) productivity advantages that persist over time, which need to be captured. The system GMM estimator enables us to take these fixed effects into account. Thirdly, we estimate models for different industry sub-groups to allow for differences in technologies. In other words, we avoid assuming all firms operate using a standard technology across all sectors. Lastly, we adopt the Haltiwanger approach (Foster et al. 1998) to decompose measures of productivity growth into various components that represent the impact of resource reallocation across surviving firms and the impact on productivity of the entry and exit of firms.

Our results indicate increasing returns to scale in the majority of industries and a (usually large) positive time trend representing technical change. Younger firms and firms with no political affiliation are found to have higher TFP, whereas firms with state ownership are found to have lower TFP. There exists some heterogeneous evidence among industries on the effect of high political affiliation and private ownership on TFP. Neither exports nor R&D are found to be especially important drivers of TFP among Chinese industries. We find evidence of positive agglomeration spillovers (mitigated to some extent by negative ‘costs’ associated with the very largest urban areas). Firm fixed costs and liquidity are also important. Furthermore, the average TFP growth in Chinese industries is 9.6 % per annum during the period of 1998–2007. In line with Brandt et al. (2012), this rapid productivity growth is mainly driven by firm entry rather than reallocation among firms in operation throughout 1998–2007 (i.e. continuers). The sub-sector decomposition exercises show that the inter-firm resource reallocations are more prominent across industries than across provinces.

The structure of the paper is as follows. Section 2 describes our data and empirical methodology to construct TFP. Section 3 illustrates the results of our TFP estimation, while Sect. 4 conducts the productivity decomposition using the Haltiwanger approach in order to examine the determinants of TFP growth in China. Finally, the last section concludes the paper.

2 Existing studies for China

There have been a number of macro-level studies on China’s productivity and its determinants, which we only briefly cover since the emphasis in this paper is on micro-econometric approaches (Song et al. 2011, provides a more comprehensive review of recent work in this area). Among these, Bosworth and Collins (2008) use the growth accounting approach to examine the sources of growth for China over the period 1978–2004, finding that the average annual TFP growth is 3.6 % (rising from 3.0 % during the period of 1978–1993 to 6.1 % during the period of 1993–2004). Using two-digit industrial level data for the period of 1998–2003, Pandey and Dong (2009) present results from a production function based regression showing that ownership restructuring and the downsizing of firms accounted for about 30 % of TFP growth in China’s manufacturing industries. Song et al. (2011) compute TFP as a standard Solow residual of a one-sector aggregate production function, finding an average annual growth rate of 5.9 % over the period 1998–2005; about 70 % of this growth is due to reallocation from inefficient firms to more efficient ones in the manufacturing sector. In contrast, Chen et al. (2011) use the stochastic frontier approach to evaluate the productivity growth of 38 two-digit manufacturing industries in China; they find aggregate TFP growth was 6.7 % during the period of 1980–2008 and that factor reallocation from less productive to more productive sectors accounted for 42 % of TFP growth.

More recently, and in contrast to the approaches adopted by others, Scherngell et al. (2014) examine the role of knowledge capital in driving manufacturing TFP across 29 Chinese provinces over the period of 1988–2007. Based on a Spatial Durbin model, they find a significant effect of knowledge capital on regional TFP, which is based not only on within-region knowledge capital but also on inter-regional knowledge spillovers. They interpret the results as evidence for the transformation of China into a knowledge-based economy.

Turning to the micro-based evidence, Hsieh and Klenow (2009) examine the role of misallocation on aggregate productivity in China, claiming that distortions reduce China’s manufacturing productivity by 30–50 % relative to an optimal distribution of capital and labor across existing manufacturers for the period 1998–2005. However, it is Brandt et al. (2012) that has most relevance for the current paper. They present one of the most comprehensive and influential micro-level research on China’s manufacturing TFP, by examining three possible sources for aggregate TFP growth in the manufacturing sector, i.e. firm-level productivity growth, firm entry and exit, and resource reallocation from less to more productive continuing firms. Their preferred productivity growth estimates, based on the methods of Olley and Pakes (1996) and Ackerberg et al. (2006), average 7.96 % per annum for a valued-added production function, and 2.85 % on a gross output basis, for 1998–2007. These are among the highest compared to other countries. Net entry accounts for over two-thirds of total TFP growth over the entire sample period. Consistent with the results of Hsieh and Klenow (2009), they find that aggregate TFP growth in manufacturing industries remains constrained by limited efficiency-enhancing input reallocations across active firms.

More recent micro-level studies include Bas and Causa (2013), who examine the effect of trade and product market policies in upstream sectors on productivity in downstream firms. Using ORBIS firm-level data for 2001–08, and a value-added labor productivity measure, they obtain results suggesting that removing remaining restrictions in upstream industries can bring substantive productivity gains and benefit not only firms producing in these industries but also those that use inputs from these industries. Yu (2014) also investigates the effect of trade liberalization on firm productivity in China using two highly disaggregated datasets for 2000–06. While using both an Olley and Pakes (1996) and a system GMM approach, Yu (2014) notes that the Olley-Pakes approach assumes that capital is more actively responsive to unobserved productivity, which may not be suitable for China, a labor-abundant economy with low labor costs. Yu (2014) argues that both tariff reductions and processing trade can generate productivity gains for Chinese firms.

None of these micro-level studies use multiple covariates in their models to explain what determines TFP in China; most do not include firm-level fixed effects; and none cover the broad range of industries used in the present paper.Footnote 1 We build on this literature by taking all those elements into account.

3 Data and empirical methodology

3.1 Data and sample

We use data drawn from the annual accounting reports filed by industrial firms with the National Bureau of Statistics (NBS) over the period of 1998–2007. This dataset includes all SOEs and other types of enterprises with annual sales of five million yuan (about $817,000) or more. These firms operate in the manufacturing, utilities and mining sectors and are located in all 31 Chinese provinces or province-equivalent municipal cities. Since the NBS dataset does not include enterprises with annual sales below ¥5 million, it is important to note that about 80 % of all industrial firms are excluded from the sample. However, as shown in Brandt et al. (2012), using the full census of firms periodically carried out in China, the omitted firms only account for some 9.9 % of output in 2004, and 2.5 % of exports. Moreover, a comparison of 1995 NBS and Census data shows the NBS has a similar level of coverage, allowing Brandt et al. (2012) to state that “… the NBS decision rule on which firms to include in their annual sample is not introducing any systematic bias in our estimates”.

The dataset covers nearly 600 thousand firms, which corresponds to some 2.2 million firm-year observations.Footnote 2 Our sample is unbalanced, and its structure can be observed in Table 6 in “Appendix”. The number of observations ranges from a minimum of 148,000 firms in 1998 to a maximum of 331,000 firms in 2007. There is significant churning among firms during our sample period: only 7.2 % of firms (18.3 % of total firm-year observations) have the full 10-year accounting information, and about 38 % of firms (9.5 % of firm-year observations) only exist for one or 2 years and then exit.Footnote 3 Brandt et al. (2012) regard the active entry and exit of firms as the consequence of enterprise restructuring, which began in earnest in the mid-1990s.

Besides the standard income statement and balance sheet information, the NBS data contain a continuous measure of firms’ ownership, which is based on the fraction of paid-in-capital contributed by the following six different types of investors: the state; foreign investors (excluding those from Hong Kong, Macao, and Taiwan); investors from Hong Kong, Macao, and Taiwan;Footnote 4 collective investors;Footnote 5 legal entities or corporation investors;Footnote 6 and individual investors. We use this information to represent ownership, which is superior to the registration information of firms’ ownership, as the latter does not reflect the dynamic nature of firm ownership evolution over the sample period.

Another feature of the NBS dataset is the inclusion of an index on firms’ political affiliation. Political affiliation refers to the fact that firms are affiliated (have a lishu relationship) with the central, provincial, and local governments (Li 2004; Tan et al. 2007; Xia et al. 2009). A lishu relationship is associated with government support and subsidies. In particular, governments can grant firms affiliated with them benefits such as bank loans at better conditions, waivers of import tariffs, tax reductions and so on. On the other hand, firms with such political affiliation are more likely to engage in investment/production that does not aim to maximize firm value but to achieve objectives preferred by the government. We classify firms into three groups according to various degrees of political affiliation, i.e. high political affiliation (firms affiliated with central or provincial governments), medium political affiliation (firms affiliated with local governments such as city-, district-, county-, prefecture-, township- and village-level governments), and no political affiliation (those having no political affiliation). We believe that both the ownership and political affiliation information are important when examining firm productivity in the Chinese context, because such institutional factors have a significant impact on firms’ decision making and behavior in China.

3.2 Empirical methods for TFP estimation and approach used

The objective of productivity measurement is to identify output differences that cannot be explained by input differences (Van Biesebroeck 2007). There are some advantages of estimating TFP using micro-level data. For instance, Del Gatto et al. (2011) argue that although aggregate analysis plays an important role in cross-country comparative analysis, firm-level analysis enables the investigation of TFP patterns at a deeper level controlling for issues like non-competitive markets, increasing returns, and heterogeneous firms.

Previous studies (of China and elsewhere) have used different approaches to estimating TFP using micro-level panel data (see Sect. 2 above for details). Van Beveren (2012) reviews various problems of estimating firm-level TFP, such as the endogeneity of input choices (or simultaneity bias), the omitted variable bias (if data on physical inputs and output and their corresponding firm-level prices are unavailable), the sample selection bias (as a result of no allowance of firm entry and exit), and the multiple-product-firm problem (which arises when production technology differs across products produced by single firm). To tackle these problems, he compares the performance of various estimators including fixed effects, GMM, and some semi-parametric estimators like Olley and Pakes (1996) and Levinsohn and Petrin (2003), where investment and intermediate inputs are used to proxy for unobserved productivity, respectively. He argues that the preferred estimator depends on the extent to which the data satisfies the assumptions underlying the specific estimation algorithm. Van Biesebroeck (2007) compares the sensitivity of five widely used productivity measures (index numbers, data envelopment analysis, stochastic frontiers, GMM, and semi-parametric estimation) using simulated data. Despite the fact that each method has its own strengths and weakness, the system GMM estimator is viewed as the most robust technique when measurement errors and technological heterogeneity are present.

Here, we calculate TFP using a Cobb-Douglas log-linear production function approach including fixed effects. The inclusion of fixed effects is necessary as empirical evidence using firm-level panel data consistently shows that firms are heterogeneous (productivity distributions are significantly ‘spread’ out with large ‘tails’ of firms with low TFP), but more importantly that the distribution is persistent—firms typically spend long periods in the same part of the distribution (see, for instance, Bartelsman and Dhrymes 1998; Haskel 2000; and Martin 2008). Such persistence suggests that firms have ‘fixed’ characteristics (associated with access to different path dependent (in)tangible resources, managerial and other capabilities) that change little through time, and thus need to be modeled. In the light of these considerations, we estimate the following model:

$$y_{it} = \alpha_{i} + \alpha_{E} e_{it} + \alpha_{M} m_{it} + \alpha_{K} k_{it} + \alpha_{X} X_{it} + \alpha_{T} t + \varepsilon_{it}$$
(1)

where endogenous y, e, m and k refer respectively to the logarithms of real gross output, employment, intermediate inputs, and the capital stock in firm i at time t \((i = 1, \ldots , N; \quad t = 1, \ldots T\));Footnote 7 and X it is a vector of observed (proxy) variables determining TFP (see Sect. 3.3, below). We include firm characteristics such as firm age, political affiliation, firm ownership, export behavior, whether the firm engaged in R&D, financial variables, and geographic location into the vector X it (Table 1 provides a list of the variables used). Lastly, t is a time trend, measuring exogenous gains in TFP over time.

Table 1 Descriptive statistics for variables used in TFP estimation, China 1998–2007

We first estimate Eq. (1) for different industries, and obtain the values of the elasticities of output with respect to inputs (α E , α M , and α K ). TFP can then be calculated as the level of output that is not attributable to factor inputs (employment, intermediate inputs and capital).Footnote 8 In other words, productivity is due to efficiency levels and technical progress. Thus, such a measure of TFP can be expressed as:Footnote 9

$$\ln \widehat{TFP}_{it} = y_{it} - \hat{\alpha }_{E} e_{it} - \hat{\alpha }_{M} m_{it} - \hat{\alpha }_{K} k_{it} = \hat{\alpha }_{\iota } + \widehat{{\alpha_{X} }}X_{it} + \widehat{{\alpha_{T} }}t + \hat{\varepsilon }_{it}$$
(2)

Note, Eq. (2) is not a proper TFP index, because the measure of input growth (\(\hat{\alpha }_{E} e_{it} - \hat{\alpha }_{M} m_{it} - \hat{\alpha }_{K} k_{it} ,\) equivalent to the change in the denominator in Eq. 4) does not satisfy axiom X5 (proportionality) in O’Donnell (2015), except in the case of constant returns-to-scale. The solution is to restore proportionality by using a special case of the Färe and Primont (1995) input index. The measure of TFP becomes:

$$\ln \widehat{TFP}_{it}^{FP} = y_{it} - \frac{1}{{\hat{\alpha }_{E} + \hat{\alpha }_{M} + \hat{\alpha }_{K} }}\left( {\hat{\alpha }_{E} e_{it} + \hat{\alpha }_{M} m_{it} + \hat{\alpha }_{K} k_{it} } \right)$$
(2a)

An alternative approach, popular in the literature, is to estimate (1) without including X it on the right-hand-side of the equation, and then use (2) to obtain TFP, where X it is now part of the random error term (\(\hat{\varepsilon }_{it}\)). Typically, \(\ln \widehat{TFP}_{it}\) obtained from Eq. (2) is then regressed on X it to measure the determinants of TFP as part of a two-stage approach. Clearly, we would expect estimates of the elasticities of output (and thus \(\ln \widehat{TFP}_{it}\)) from this two-stage approach to be biased because of an omitted variable(s) problem.

A large class of models has been used to estimate the parameters of Eq. (1), using micro-level panel data. Here we concentrate on the two most popular in recent work. The first includes the Olley and Pakes (1996) and Levinsohn and Petrin (2003) approaches, which account for both endogeneity of inputs in the production function and selection bias due to firm entry and exit (which is likely to be correlated with productivity), by using two-stage procedures where unobserved TFP is ‘proxied’ by another state variable(s) such as investment or intermediate inputs. In essence, Olley and Pakes (OP) replace Eq. (1) with:

$$y_{it} = \beta_{0} + \beta_{E} e_{it} + \beta_{M} m_{it} + \beta_{K} k_{it} + h\left( {i_{it} ,k_{it} } \right) + \varepsilon_{it}$$
(1a)

where TFP is proxied by \(h\left( \cdot \right)\)—which itself is approximated by a higher-order polynomial in i it and k it —and i it is investment. Levinsohn and Petrin (LP) replace \(h\left( \cdot \right)\) with \(h\left( {m_{it} ,k_{it} } \right)\). Both approaches do not allow for fixed effects and make some strong assumptions when compared to the systems GMM approach (as discussed in Ackerberg et al. 2006).

We therefore use the other popular approach to micro-level estimation: that is, Eq. (1)—in dynamic form with additional lagged values of output and factor inputs—is estimated using the two-step XTABOND2 system GMM approach (Arellano and Bond 1991) implemented in STATA (this also involves correcting for any potential finite sample bias using Windmeijer’s 2005, approach). Thus Eq. (1) is estimated both in first-differences and in levels, allowing for fixed effects and tackling endogeneity of the right-hand-side variables (including the lagged dependent variable) and selection bias by using lagged values of the endogenous variables as instruments in the first differences equation, and first-differences of the same variables as instruments in the levels equation (Blundell and Bond 1998).Footnote 10 In this study, gross output, intermediate inputs, labour, and capital are treated as endogenous, as well as political affiliation, capital ownership, exporting, and R&D. Lastly, according to Arellano and Bond (1991), the presence of second-order autocorrelation implies that the estimates are inconsistent. Panel tests for autocorrelation are used to establish whether second-order correlation is an issue.

3.3 Justification for variables used as determinants of TFP (i.e., X it )

As stated in the introduction, inclusion of X it is important as omission of these variables would produce biased estimates of the production function, and thus biased estimates of TFP. Our choice of X it is in part determined by the information available to us in the NBS dataset, and by previous work that uses similar variables. A detailed justification for the majority of the variables used is available in Harris and Moffat (2015). Here we provide just an overview of key arguments.

Firms’ age is included to measure whether younger firms produce with greater efficiency and better technology than older plants (a vintage capital effect); or if productivity increases as the firm ages through learning-by-doing (e.g. Jovanovic and Nyarko 1996).

Our inclusion of the lishu relationship was discussed above, and also encompasses the likely impact of State ownership of capital. Other forms of ownership are less likely to be subject to the impact of political influence, while foreign-owned firms are expected to possess characteristics (e.g. specialised knowledge about production and better management or marketing capabilities) that give them a cost advantage over domestic firms (Hymer 1976). These firms are therefore expected to be more efficient. Conversely, cultural differences between the owners of the firm and the workforce may act to lower levels of TFP in foreign owned plants, especially in the immediate period after the establishment of new ‘greenfield’ operations, or the acquisition of an existing enterprise. Dunning (1988) suggests a lack of understanding of management and labour attitudes as one such disadvantage possessed by foreign owned firms in developed countries. There are likely to be even larger and more embedded issues for foreign-owned firms operating in China. In the long(er)-run, this problem should be overcome as the owners of the indigenous firm become more familiar with domestic working practices/institutional environment.

Exporting is expected to be associated with the need of such firms to achieve productivity gains prior to exporting, while there can also be ‘learning-by-exporting’ effects post-entry (Greenaway and Kneller 2007, provide a survey of this literature).

A measure of the concentration of output across firms, and therefore of market power, is usually included to take account of competition effects. Under the assumption that the elasticity of demand does not vary too greatly across firms in an industry, this is a valid measure of competition within an industry (see, for example, Cabral 2000). Intuitively, one would expect that greater competition will pressure firms into adopting new technologies and operating more efficiently (e.g. Nickell 1996; Meyer and Vickers 1997). However, it can also be argued—following Schumpeter (1943) and more recent endogenous growth theory models—that the level of competition may be inversely related to productivity if monopoly rents are required for management to invest in R&D which in turn leads to innovation and improvements in TFP (Dixit and Stiglitz 1977; Aghion et al. 2001; Aghion and Howitt 1992, 1999; Romer 1990; Grossman and Helpman 1991). It has also been shown that, under some conditions, increased competition can lower the expected income of managers and therefore their effort (Hermalin 1992). This reduced effort may be reflected in reductions in firm efficiency levels.

Agglomeration externalities are usually distinguished in the literature according to whether they are an intra- or inter-industry phenomenon. Intra-industry externalities are termed MAR (Marshall 1890; Arrow 1962; Romer 1986) or localisation externalities, while inter-industry externalities are termed Jacobian (Jacobs 1970, 1986) or urbanisation externalities. The mechanisms that give rise to agglomeration externalities can support both localisation and urbanisation externalities. We also include a dummy that takes on a value of 1 for those firms located in one of the top 200 cities (based on population size in year t), as an additional proxy for potential negative spillovers (i.e. ‘congestion’ costs) in large Chinese cities. For instance, firms may learn from other firms in the same industry and from firms in another industry.

When estimating models of TFP, internal and external knowledge creation is usually represented by both endogenous technical progress due to undertaking R&D in the firm, and by exogenous gains over time, as well as its obsolescence (as represented by the age of the plant—see above). R&D is expected to have an impact on TFP through two channels. Most obviously, performing R&D may generate process innovations that allow existing products to be produced with greater efficiency (through lower costs). It may also generate product innovations which will improve TFP if the new products are produced with greater efficiency or by using better technology than existing products (i.e. an outward shift of the firm’s production possibility frontier). The second channel is through the development of absorptive capacity (see Cohen and Levinthal 1989, and especially Zahra and George 2002, for a detailed discussion of the concept). Absorptive capacity permits the identification, assimilation and exploitation of innovations made by other firms and R&D actors, such as universities and research institutes, and is therefore also expected to lead to improvements in TFP. The notion of absorptive capacity is based on the observation that some knowledge is tacit and is difficult to acquire unless the firm is directly involved in R&D in the area. These two channels through which R&D may affect TFP reflect the two ‘faces’ of R&D (Griffith et al. 2004).

We use selling and distribution expenses as a percentage of sales as a proxy for managerial efficiency (i.e. marketing efficiency—see Lee and Rugman 2012), and/or corporate governance problems (as a proxy for discretionary spending and self-aggrandisement—or organizational slack, i.e. the risk of corporate governance problems). Furthermore Chen and Guariglia (2013) argue that the availability of liquid assets enhances the firms’ capacity to obtain cash at short notice, since liquid assets can be mobilized to raise cash for financing productive projects (it is also required to ensure that the firm has sufficient working capital to finance day-to-day operations). This suggests that liquidity has a fundamental role for financing those activities, which are likely to determine a shift in the efficiency frontier, or best practice technology, which thus impacts on TFP.

4 Results of TFP estimation

4.1 Descriptive statistics

Table 1 presents the summary statistics of variables used in Eq. (1). It is interesting to see that the majority of firms in the sample (52 %) have no political affiliations with any level of government, and the ratio of unaffiliated firms dramatically increases from 15.7 % in 1998 to 76 % in 2007. In 1998, more than 72 % of firms had medium-level political affiliation with local governments, but the ratio quickly declined to 21 % in 2007. The diminishing role of government in the Chinese industrial sector can be viewed as an outcome of China’s marketization reform starting from the late 1970s.

In terms of ownership, our sample is dominated by private firms, i.e. 38.4 % of firm-year observations are owned by individual investors, and 21.1 %, by corporation/legal entities investors over the period of 1998–2007. There is an interesting pattern of the evolution of ownership over the ten-year period. The proportion of state ownership in our sample declines dramatically, from 32.5 % in 1998 to 3.6 % in 2007. A similar pattern holds for collective firms, whose share declines from 29 to 4.6 %. In contrast, the share of individual investors climbs from 14.4 to 49.4 %, and the corresponding figures for corporation/legal entities investors are 12.3 % in 1998 and 27 % in 2007. The share of foreign investors and investors from Hong Kong, Macaw and Taiwan remains roughly stable at between 6 and 8 % respectively. Privatization of small state-owned enterprises (SOEs) and collective firms became significant after 1998 (Haggard and Huang 2008). Our dataset reflects the restructuring process involved in the shrinkage of the state and collective sectors and the expansion of the private sector.

Lastly, our data is dominated by non-exporters, i.e. 74.7 % of firms do not export over the sample period. The ratio remains stable between 1998 and 2007, but the volume of export sales increases. Finally, most firms in our sample (89 %) do not engage in R&D. There was also evidence of a significant trend in urbanization in China, i.e. 27 % firms located in the top 200 cities in 1998 whereas the corresponding figure rises to 87.5 in 2007.

4.2 Econometric results

We begin the discussion of our results with reference to the (only other comparable) study by Brandt et al. (2012)—see Sect. 2—noting their use of the Olley and Pakes (1996) and Ackerberg et al. (2006) approach. Thus unlike the present study, they do not allow for fixed effects and also do not include the vector X it in Eq. (2), and thus they measure TFP essentially as the time trend plus (assumed) random shocks to technology—i.e., overall, they make no attempt to explain the determinants of TFP.

The detailed results from estimating Eq. (1) for 26 two-digit industries/industry groups are presented in Tables 2.Footnote 11 , Footnote 12 Firstly as the diagnostics show, the estimates obtained are economically sensible, and pass various tests of the validity of the instruments used and tests for autocorrelation. All 26 models pass the Hansen test for over-identification at the 5 % level or better, suggesting the validity of the instrument set used.Footnote 13 With regard to tests for autocorrelation, all models show evidence of significant negative first-order serial correlation in differenced residuals, and none show evidence of second-order serial correlation in the differenced residuals (based on a 5 % significance level), suggesting the overall consistency of our estimates.

Table 2 Long-run two-step system-GMM production function (26 industries, China)

The elasticities of output with respect to intermediate input, labor and capital display significant heterogeneity among various industries, but they are all positive and significant.Footnote 14 It is interesting to note that our results based on the system GMM estimator show increasing returns-to-scale (RTS) for most industries (20 out of 26 industries) with an average sum of the output elasticities equal to 1.12.Footnote 15 By contrast, when using the approach of Levinsohn and Petrin (2003), we obtain decreasing returns to scale for 11 industries, with an average sum of the output elasticities equal to 0.51 (which is very low and, in our view, implausible). Besides, when the Levinsohn and Petrin (2003) method is adopted, we also find that the estimated elasticities with respect to output are generally much lower for capital and labor, but higher for intermediate inputs.Footnote 16 We believe our results obtained using system GMM (e.g. increasing returns) are more plausible for the rapidly growing Chinese economy (e.g., see the model and results presented in Song et al. 2011).

The time trend is included to account for (Hicks-neutral) technical change and to capture the impact on TFP of exogenous improvements in technology that are common to all firms. The coefficient on the time trend is positive and significant for all industries with two exceptions (insignificant for petroleum processing and for textiles). The time trend is more important in some industries such as non-metal and metal products, transport equipment, and machinery and equipment than in industries like petroleum processing, textiles, plastic and leather, where the former are likely to be more dynamic and closer to the technological frontier.

Firm age is found to affect TFP significantly and negatively for most industries. This is consistent with the belief that younger firms produce with greater efficiency and better technology than older firms. Obviously the hypothesis that productivity increases as the firm ages through learning-by-doing is not supported by our data for China.

The coefficients of political affiliation show some interesting patterns. Firms with no political affiliation are found to have significantly higher TFP in most industries, which is in line with the view that the use of the ‘invisible hand’ of the market rather than government intervention is conducive to firm productivity in general. On the other hand, the coefficients of firms with high political affiliation vary significantly across industries. In some highly monopolistic industries such as gas and water production, the TFP effect of high political affiliation is significantly positive. This is because the Chinese government keeps a tight control on such sectors with strategic importance, and firms with high political affiliations can therefore enjoy more benefits such as a larger market share and more access to bank credit which may help them to maintain higher productivity levels. On the contrary, in other more competitive markets such as metal and nonmetal products, machinery and equipment, timber, and furniture, the TFP effect of high political affiliation is significantly negative, indicating that government intervention may distort firms’ behavior and therefore reduce productivity.

In terms of ownership, we find that state ownership has a significant negative impact on TFP in general. This is consistent with the arguments that despite decades of economic reform, SOEs remain the least efficient group in the economy, with an average return on capital well below that in the private sector (Dougherty and Herd 2005; Ding et al. 2012). The effect of private ownership (individual investors and corporation/legal entities investors) on TFP is positive and significant for most industries, indicating that the private sector is the driving force of China’s productivity and economic growth. An exception is found for some monopolistic sectors like medical, electronic power, and water production, where private ownership is associated with a negative (although often not significant) impact on TFP. As discussed above, this is because SOEs remain dominant in energy, natural resources and a few strategic or monopolistic sectors that are controlled and protected by central and local governments.

In a large number of sectors, foreign ownership is associated with higher levels of TFP. This is in line with the view of Hymer (1976) that to make it worthwhile for a foreign firm to incur the costs of setting up or acquiring a plant in the domestic market, foreign firms must possess characteristics that give them a cost advantage over domestic firms. But in the short-run there may be ‘assimilation’ issues (see discussion earlier) while in the long-run, some of these advantages may dissipate as domestically owned firms learn to imitate the foreign firms as a result of knowledge spillovers, depending upon levels of absorptive capacity in domestic firms (Harris and Robinson 2003). This argument may help to understand the insignificant coefficient of foreign ownership in some industries.

As stated above, the existing literature shows that exporting firms have superior performance compared to their non-exporting counterparts in terms of productivity, employment, wages, skill- and capital-intensity. This can be explained either through ‘self-selection’ or ‘learning-by-exporting’ (see Bernard and Jensen 1999; Van Biesebroeck 2005; Bernard et al. 2007; De Loecker 2007). However, this view is not uniformly supported by our data, where exporters seem to have higher TFP in only 9 out of 26 sectors. There are at least three explanations for this seemingly surprising result, i.e. the processing trade argument;Footnote 17 the financial constraints argument; and the factor endowments argument. First, one feature of China’s trade pattern is the sheer magnitude of processing trade: about 60 % of Chinese exports are in the processing trade sector during the period 2000–06 (Wang and Yu 2011). There is a recent literature on the effect of different trade regimes (ordinary vs. processing trade) on firm performance in China (for instance, Yu 2014; Manova and Yu 2012). Dai et al. (2011) find direct evidence that processing exporters are 4–30 % less productive than non-exporters, and that excluding processing trade restores the traditional finding that exporters have a superior performance relative to non-exporters.Footnote 18

Secondly, according to Manova and Zhang (2009), Chinese firms may face high fixed trade costs, which raise the productivity cut-off for exporting and importing. Despite being productive, some small firms may be unable to raise sufficient external financing to engage in international trade. Feenstra et al. (2013) confirm the view that unless they are foreign-owned, exporting firms face an additional credit constraint in China. Hence, it is likely that some productive Chinese firms are excluded from exporting markets due to financial constraints.

Thirdly, Lu (2010) offers an alternative explanation on the weak performance of Chinese exporters from the factor endowment view. He argues that when countries differ in their factor endowment, sectors that are intensive in the locally abundant factor face higher competition in the domestic market than in foreign markets. Hence it is domestic rather than export markets that select the most productive firms. In a labor-abundant country like China, exporters are therefore likely to be less productive than non-exporters, especially in labor-intensive sectors.

Undertaking R&D is expected to have a positive impact on TFP through two channels discussed above (innovation and greater absorptive capacity). However, we find only limited supporting evidence for such hypotheses using Chinese NBS data: cet. par., R&D leads directly to significantly higher TFP in only 9 out of 26 industries and leads to significantly lower TFP in transport equipment (although as shown below in Fig. 6, overall firms engaging in R&D do have a productivity advantage over those that spend nothing on such intangible, knowledge-based assets). It is difficult to provide further details with regard to the results obtained, given that we are not aware of any studies that have looked at the impact of R&D for the disaggregated set of industries included in this study (and specially transport equipment). Others have positive results for sectors like electronics (e.g., Yang et al. 2013), or overall across all industries (e.g., Liao et al. 2012). This study also finds positive spillover effects (see below) through agglomeration and diversification, and these effects are likely to be (at least in part) associated with knowledge spillovers (and therefore linked to R&D).

In this study we proxy MAR-spillovers using an agglomeration variable, i.e. the logarithm of the percentage of 2-digit industry output located in each province in which the firm is located.Footnote 19 Jacobian spillovers, on the other hand, are proxied by a measure of diversification, i.e. the logarithm of the proportion of 3-digit industries (maximum 226) located in (208) city areas in which the firm is located.Footnote 20 In general, we find agglomeration spillovers are significant and positive (in 18 out of 26 industries), especially in non-metal products, other mining, medical, gas production, timber, furniture, petroleum processing, coal mining and electricity. Diversification spillovers are even stronger both in terms of the number of industries in which there is a significant positive effect (only the tobacco industry and electronic power has no such impact), and the strength of the relationship (in 9 industries a 10 % increase in diversification results in at least a 2 % increase in TFP). We also include a dummy that takes on a value of 1 for those firms located in one of the top 200 cities (based on population size), as an additional proxy for potential negative spillovers (i.e. ‘congestion’ costs) in large Chinese cities. We find that in 19 of the 26 industries covered, there exist such significant costs (ranging between 5 and 14 %).

Including the Herfindahl index, we find that higher competition within some industries leads to lower TFP in 9 industries (especially in textiles, cultural, and gas production), but higher TFP in the majority (14 out of 26 industries, especially food production, petroleum processing, coal mining, medical and nonmetal products). Our proxy for managerial fixed costs, i.e. selling and distribution expenses as a percentage of sales, shows that the higher these are, the lower is TFP in 23 out of 26 industries.

Finally, in line with Chen et al. (2011), we include two measures of liquidity in our equation. We measure liquidity as the difference between current assets and current liabilities.Footnote 21 We also include a dummy (coded 1) to distinguish those firms that have zero or negative liquidity. We find that firms that experiencing negative liquidity in any year are associated with a significantly lower TFP in almost every industry (usually the impact is to lower TFP by some 4–6 %). In addition, the higher the working capital, the higher TFP in all but one industry (the largest effect is in nonmetal products, where a 10 % increase in liquidity results in a 12 % higher level of TFP). These findings can be explained considering that firms with high (low) liquidity are less (more) likely to face financial constraints and have more (less) resources at hand to undertake productivity-enhancing activities (Chen and Guariglia 2013).

In brief, our production function estimation based on the system GMM estimator indicates increasing returns to scale in most industries and significant technical change. Younger firms and firms with no political affiliation are found to have higher TFP, and firms with state ownership are found to have lower TFP. There exists some heterogeneous evidence among industries on the effect of high political affiliation and private ownership (individual investors and corporation investors/legal entities), which seems plausible in the Chinese institutional context. Neither being an exporter nor investing in R&D are found to impact strongly on TFP among most industries, but there is evidence of positive agglomeration spillovers (counted to some extent by negative ‘costs’ associated with the very largest urban areas). Firm fixed costs and liquidity are also important, in ways consistent with prior expectations.

4.3 TFP distributions

We next present a graphical analysis highlighting TFP differences over time, across firms with different characteristics, and across provinces. To this end, we order the predicted TFP values (based on Eq. 2a) from highest-to-lowest. Figure 1 shows the productivity distribution for firms operating in 1998 and in 2007. We can see that the distribution of 2007 obviously dominates (i.e. lies to the right) that of 1998, showing direct evidence of productivity growth. A formal test that the 2007 distribution lies to the right (i.e. stochastically dominates) that of 1998 is provided by the Kolmogorov–Smirnov (KS) test, which indicates the maximum gap between the two distributions is nearly 0.17 and statistically significant at the 1 % level.

Fig. 1
figure 1

TFP distribution: 1998 versus 2007

Figure 2 compares the productivity distribution between firms with high political affiliation (those affiliated with central and provincial governments) and firms with lesser or no links. Consistent with the results in Table 2, firms with no political affiliation have a productivity distribution to the right of those with high political affiliation, indicating that the former dominates the latter in terms of its TFP distribution (a KS test shows that the maximum gap between the distribution for firms with no affiliation and firms with some affiliation has a value of 0.18,Footnote 22 and is significant at the 1 % level). Figure 3 compares the distribution of firms with more than 25 % of state ownership and those with less than 25 % of state ownership. We observe that the TFP distribution of those with less state ownership dominates that of firms with more state ownership with a highly significant KS value of 0.33. This is again consistent with our findings in Table 2.

Fig. 2
figure 2

TFP distribution: high political affiliation versus no political affiliation

Fig. 3
figure 3

TFP distribution: ownership comparison

Contrasting TFP distributions also shows significant heterogeneity among various geographic locations. For instance, Fig. 4 compares the TFP distribution of east coast Shanghai and that of an inner province, Guizhou. Not surprisingly, Shanghai has a productivity distribution to the right of Guizhou (the value of the KS test statistic is 0.41, significant at the 1 % level). Hence it is important to capture such geographic information when estimating TFP.

Fig. 4
figure 4

TFP distribution: geographic location comparison

Lastly, Fig. 5 shows some evidence of significant cross-over in terms of TFP distribution between exporters and non-exporters, at the top end of the productivity distribution, although, overall, firms engaged in exporting tend to have higher productivity. Similar results are found for firms undertaking R&D (Fig. 6), where the productivity advantage is much smaller (significant at the 1 % level, but with a KS statistic of 0.09). These findings are in line with the econometric results in Table 2.

Fig. 5
figure 5

TFP distribution: exporters versus non-exporters

Fig. 6
figure 6

TFP distribution: R&D versus no R&D

5 Productivity decomposition

5.1 The Haltiwanger (1997) approach and some baseline results

Having obtained firm-level estimates of TFP, we use the approach taken by Haltiwanger to decompose measures of productivity growth into various components that represent the impact of resource allocation across surviving firms, as well the impact on productivity of the entry and exit of firms.Footnote 23 The index of productivity in year t is defined as a geometrically weighted average of individual firm-level productivities. This index and its growth between t and t − k can therefore be written as follows:

$$lnP_{t}= \sum_{i} \theta_{it} lnP_{it} \quad {\Delta}lnP_{t}= lnP_{t}- lnP_{t - k}$$
(5)

where \(P\,{\text{measures productivity and }}\theta_{it}\) is the share of output for firm i in period t for the economy. The decomposition for a given economy is motivated by the fact that there are firms that continue in operation between t and t − k, new firms entering in period t, as well as firms exiting, which all contribute to productivity in t − k. Thus, productivity growth can be expressed as follows:

$$\begin{aligned} &\mathop{\Delta lnP_{t}= \mathop{\sum_{i} {\theta_{it - k} }}{ \Delta lnP_{it}}}\limits^{{{\text{Continuers:}}\,{\text{within-firm}}}}+ \mathop {\sum\limits_{i} {\left( {lnP_{it - k}- \ln \,P_{t - k} } \right)\Delta } \theta_{it} }\limits^{{{\text{Continuers:}}\,{\text{Between-firm}}}}+ \mathop {\sum\limits_{i} {\Delta lnP_{it} \Delta } \theta_{it} }\limits^{{{\text{Continuers:}}\,{\text{Cross-firm}}}}\hfill \\ &\qquad + \mathop {\sum\limits_{i} {\theta_{it} } \left( {\ln P_{it}- lnP_{t - k} } \right)}\limits^{{{\text{Entering}}\,{\text{firms}}}}- \mathop {\sum\limits_{i} {\theta_{it - k} } \left( {\ln P_{it}- lnP_{t - k} } \right)}\limits^{{{\text{Exiting}}\,{\text{firms}}}}\hfill \\ \end{aligned}$$
(6)

The first term shows the impact of resource shifts within firms that were open in both t and t − k to achieve higher (or lower) productivity, depending on how important such firms were in the base year (in terms of their shares of output in the economy). The second term also concerns continuing firms, and measures the impact of changing productivity shares across firms weighted by the firm’s ranking in the economy in the base year. The second term needs to be complemented with the third: the covariance effect that considers whether increases in productivity correspond (covary) with increasing market shares. Lastly, there are terms to denote the contributions of entrants and exiting firms, both measured with respect to the economy average in the base year. It should be noted that the last term is expected to be negative if exiting firms have lower productivity. Thus, this term is preceded by a negative sign to allow for a positive impact on productivity.Footnote 24

In summary, the Haltiwanger-type decomposition disaggregates changes in total productivity into those due to within-firm increases, those due to between-firm increases,Footnote 25 and the share of new entrants and those firms exiting production. Note that compared to standard indices, the more general Haltiwanger-type approach provides a holistic view of the interaction of firms, industries, and the aggregate economy, since firm entry and exit in markets inherently involves changes in market shares, and thus industrial restructuring. That is, when describing aggregate productivity growth, we need to include and measure the impact of such ‘churning’, as well as the impact on TFP of any intra-industry reallocations of resources.

The approach can also be used to show the relative contributions of various sub-groups (e.g. industry sectors) to overall TFP growth, as well as decomposing TFP growth into its sources (intra-, inter-resource reallocations, as well as the impact of entry and exit). However, if there are a large number of sub-groups being considered (and/or any have a non-equal share in total aggregate output), then the figures obtained can be somewhat difficult to interpret, i.e. they are determined not only by what has been happening to TFP in a sector but also how important the sector is to the economy, in terms of its share of total output. Therefore to help interpret such Haltiwanger results, we produce not only the figures obtained from the Haltiwanger decomposition (row 1 in Tables 4, 5) but also these figures weighted to take account of the relative size of each sector (rows 2–6 in Tables 4, 5).Footnote 26

We start by producing the overall results for all the industries that are covered in the NBS dataset. Table 3 shows that, based on the system GMM estimation, annual growth in TFP between 1998 and 2007 in Chinese industries was overall 10 %. As can be seen, this very large TFP growth was dominated by new firms entering post-1998 with relatively higher TFP. This finding is consistent with (although slightly stronger than) the conclusion by Brandt et al. (2012) that net entry accounts for over two thirds of total TFP growth. There was no increased productivity through the exit of firms with relatively lower TFP (indeed on average more productive firms closed); and improvements due to firms becoming themselves more productive over time were relatively small (although not small when compared with the UK). Reallocations of resources (through contraction and expansion of output shares in firms of different productivity levels) also contributed relatively little to raising aggregate TFP growth. The corresponding figures based on the Levinsohn and Petrin (2003) approach are much lower, e.g. 3.4 % for the annual TFP growth, but the main finding that firm entry contributes to TFP growth in China remains intact.

Table 3 Firm-level TFP growth (average % p.a.) in Chinese (1998–2007) and UK industry (1997–2008)

For comparison, we have included in Table 3 figures for UK manufacturing, which are based on the same type of approach as used here (both in terms of deriving estimates of TFP and the use of the Haltiwanger approach). The major difference between China and the UK is that TFP growth was on average 10 % per annum in China, versus 1.2 % per annum in the UK. The importance of entry dominates TFP growth in both countries (and in others, given the results reported for many countries in the last decade); while productivity improvements in firms open in both 1997 and 2008 (the continuers) made a relatively important contribution to UK manufacturing, but not to Chinese manufacturing. Contrary to China, firms closing in the UK had on average higher TFP, and thus their closure lowered aggregate TFP.

5.2 Results for the industry sub-sectors and provinces

Table 4 presents the industry-level results of productivity decomposition. Column (1) presents the actual growth in TFP obtained using Eq. (5). The industry sub-group figures sum to the total for all sectors. This does not take into account differences in the relative size of each sub-group (columns 7 and 8), and therefore column (2) is based on weighting column (1) by output shares in the base year. Columns (3)–(6) present the results obtained from applying the Haltiwanger-type decomposition (Eq. 5) and each row sums to the figures in column (2).

Table 4 Firm-level TFP growth (average per annum) in industry sub-sectors, 1998–2007, China

The results show that between 1998 and 2007, TFP increased by 10 % per annum on average across all sectors with, in terms of absolute contributions to TFP growth (column 1), petroleum processing contributing the most, followed by metal products, and machinery and equipment. Taking into account the relative size of each sector (column 2), water production is the largest contributor to TFP growth, followed by petroleum processing, and machinery and equipment. The largest decline in TFP (in both relative and absolute terms) is attributable to other manufacturing.

In terms of the decomposition of TFP growth, the figures in columns (3) to (6) show that, in general, ‘churning’ (firm entry and exit) dominates. Sectors that experienced rapid TFP growth are dominated by either the opening of more productive firms (e.g. and machinery and equipment, metal products, transport equipment, gas production, and measuring instruments); or the closing of less productive firms (e.g. water production, non-metal products, electronic power and heating, medical and other mining). For sectors characterized by relatively low (or even negative) TFP growth (i.e. the lower half of Table 4), the most important contributions are the closing of relatively high productivity firms and reallocations of output to less efficient firms operating throughout the period (i.e., the between-firm effect). In general the within-firm effect is relatively unimportant in explaining TFP growth at the industry-level.

Table 5 presents the results for provinces. In column (2), where the TFP growth is weighted by output shares in different provinces, it is interesting to see that several inner provinces such as Inner Mongolia, Xinjiang, Heilongjiang, Gansu and Qinghai are ranked in the top half of the table in terms of relative contributions to TFP growth. Their rapid TFP growth may reflect a catch-up effect among Chinese provinces, which is also experienced by provinces just inland from the east coast provinces (e.g. Jiangxi, Hunan, and Hebei). Others have found similar evidence (e.g. the recent studies by Andersson et al. 2013; Herrerias and Ordonez 2012), although it is difficult to make direct comparisons as these studies use different time periods, measures based on labour productivity (not TFP), and aggregate (not firm-level) data.

Table 5 Firm-level TFP growth (average per annum) in provinces, 1998–2007, China

6 Conclusion

Drawing on a large firm-level dataset, we examine TFP and its determinants in Chinese industries over the period of 1998–2007. When estimating TFP, we favor the system GMM estimator because of its ability to capture firm-level fixed effects and to deal with the endogeneity of regressors and potential mis-measurement. Besides the factor inputs, we include in the production function several China-specific variables such as firms’ political affiliation and ownership, along with some other general variables such as firm age, export behavior, intangible fixed assets (proxied by R&D), liquidity, and geographic location, in order to alleviate the omitted variable(s) problem. TFP is estimated separately for each industry to allow for heterogeneity in technology. A Haltiwanger approach is then adopted to decompose TFP growth into different components for both the economy and for various sub-sectors.

In brief, our production function estimation based on the system GMM estimator indicates increasing returns to scale in most industries and significant technical change. Younger firms and firms with no political affiliation are found to have higher TFP, and firms with state ownership are found to have lower TFP. There exists some heterogeneous evidence among industries on the effect of high political affiliation and private ownership (individual investors and corporation investors/legal entities), which seems plausible in the Chinese institutional context. Neither export behavior nor R&D are found to impact strongly on TFP among most industries, but there is evidence of positive agglomeration spillovers (countered to some extent by negative ‘costs’ associated with the very largest urban areas). Firm fixed costs and liquidity are also important. Our estimated average TFP growth in Chinese industries is 10 % per annum over the sample period. The Haltiwanger-type decomposition shows that this rapid productivity growth is mainly driven by firm entry rather than reallocation among existing firms. When an industry- and province-level decomposition is conducted, it appears that the positively contributing inter-firm resource reallocations are more prominent across industries than across provinces.