1 Introduction

Productivity is an important component of profitability. In fact, Total Factor Productivity (TFP) change, as the most encompassing measure of productivity change, is nothing but the “real” component of profitability change (see Balk, 2003). Productivity is therefore an important driver to changing standards of living. TFP growth is an index number aimed at capturing any output growth that is unexplained by input growth (e.g., Hulten, 2001). In the recent literature a lot of attention has been devoted to what has been aptly called theoretical productivity indices (see Russell, 2018). A theoretical productivity index is defined on the assumption that the technology is known and non-stochastic, but unspecified and thus most often approximated by a nonparametric multiple-input, multiple-output specification using some form of distance functions. The foundational concepts are on the one hand the Malmquist productivity index (initially developed by Caves et al. (1982)) and on the other hand the Hicks-Moorsteen productivity index (Bjurek, 1996). While the Malmquist productivity index is fundamentally a measure of the shift of the production frontier, the Hicks–Moorsteen productivity index is a ratio of an aggregate output index over an aggregate input index. Thus, the Malmquist productivity index measures local technical change (i.e., the local change of a production frontier) but in general not TFP change, while the Hicks-Moorsteen productivity index has a TFP interpretation.

In the last decades, awareness has developed that ignoring inefficiency may potentially bias productivity measures. Nishimizu and Page (1982) is probably the seminal article decomposing productivity into a technical change component and a technical efficiency change component. Caves et al. (1982) analyze the discrete time Malmquist productivity index using distance functions as general representations of technology. Caves et al. (1982) show that this Malmquist index happens to be related to the Törnqvist productivity index if a translog technology is assumed. However, the Törnqvist productivity index uses both price and quantity information, but it needs no knowledge on the technology. Färe et al. (1995) are the first to propose a procedure to estimate the distance functions in the Malmquist productivity index by exploiting their relation with the radial efficiency measures computed relative to nonparametric technologies, and also integrate the two-part (Nishimizu & Page, 1982) decomposition. Bjurek (1996) offers an alternative Hicks-Moorsteen TFP index that can be defined as the ratio of an aggregate Malmquist output- over an aggregate Malmquist input-index.

From a theoretical point of view, these Malmquist and Hicks-Moorsteen productivity indexes are known to be identical only under two very stringent conditions: (i) inverse homotheticity of the technology; and (ii) constant returns to scale (see Färe et al., 1996). Therefore, from an empirical point of view both indices are in general expected to differ, since these two conditions that need to hold for their equality are unlikely to be met in practice. Kerstens and Van de Woestyne (2014) empirically show that the Malmquist productivity index offers a poor approximation to the Hicks-Moorsteen TFP index in terms of the resulting distributions, and that for individual observations one may well even encounter conflicting evidence regarding the basic direction of productivity growth or decline.

A substantial part of the subsequent literature extends these two theoretical productivity indices to incorporate on the one hand the possibility of technological inefficiency (i.e., operation below the production frontier), and on the other hand decompositions into a variety of components of productivity change (e.g., efficiency change, scale effects, input- and output-mix effects). It is fair to say that most focus has been on decomposing the Malmquist productivity index: this has led to various controversies that have been summarised in the now somewhat dated survey by Zofío (2007). The Hicks-Moorsteen TFP index has long been thought not to be amenable to decomposition, but a recent proposal for a decomposition is found in Diewert and Fox (2017).

In the literature, more general primal productivity indicators have meanwhile been proposed. Chambers et al. (1996) introduce the Luenberger productivity indicator as a difference-based indicator of directional distance functions [Chambers (2002) provides the best background]. These directional distance functions generalize traditional distance functions by allowing for simultaneous input reductions and output expansions and these are dual to the profit function. Briec and Kerstens (2004) define a Luenberger-Hicks-Moorsteen TFP indicator using these same directional distance functions. Though not as popular as the Malmquist productivity index, the Luenberger productivity indicator has been rather widely used. The Luenberger–Hicks–Moorsteen TFP indicator is relatively speaking less employed. Luenberger output (or input) oriented productivity indicators and Luenberger–Hicks–Moorsteen productivity indicators coincide under similar demanding properties spelled out in Briec and Kerstens (2004). Kerstens et al. (2018) empirically document that the Luenberger productivity indicator provides a poor approximation to the Luenberger-Hicks-Moorsteen TFP indicator in terms of the resulting distributions, and that for individual observations one may obtain conflicting results with respect to the basic direction of productivity growth or decline.

In our contribution, we focus on one potentially neglected issue in the development of the Malmquist productivity index, namely that variations in capacity utilisation have so far largely been ignored. In traditional productivity decompositions–mainly based on parametric functional specifications–several proposals for incorporating measures of capacity utilisation have been available in the literature. Examples of such theoretical contributions include Hulten (1986), Morrison (1985) or Morrison Paul (1999) (see, e.g., Fousekis and Papakonstantinou (1997) for an empirical example). Since the basic Malmquist productivity index focuses on primal technologies, a seminal theoretical proposal to include an output-oriented plant capacity utilisation measure (proposed in Färe et al., 1989) within an output-oriented Malmquist productivity index is found in De Borger and Kerstens (2000).Footnote 1

For several decades the output-oriented plant capacity utilisation measure has been the only technical or engineering capacity notion available in the literature. However, recently two innovations have been proposed. First, Kerstens et al. (2019) criticize the traditional output-oriented plant capacity utilisation measure for not being attainable: it determines maximal outputs for potentially unlimited amounts of variable inputs, but it ignores the basic fact that the amounts of variable inputs needed to obtain these maximal outputs may well not be available at either the firm or the industry level. The same authors then go on to define an attainable output-oriented plant capacity utilisation measure: it modifies the basic output-oriented plant capacity utilisation measure by including an upper bound on the amount of available variable inputs. In empirical applications the problem is to determine a realistic upper bound on the amount of available variable inputs.

Second, an alternative input-oriented plant capacity utilisation measure has been introduced in Cesaroni et al. (2017). It is based on a pair of input-oriented efficiency measures using a nonparametric frontier framework, very much in line with the output-oriented plant capacity utilisation measure that is based on a couple of output-oriented efficiency measures. In a recent study, Kerstens and Shen (2021) use these plant capacity concepts to measure hospital capacities in the Hubei province in China during the outbreak of the COVID-19 pandemic. Using the medical literature indicating that mortality rates increase with high capacity utilization rates leads to the preliminary conclusion that this relatively new input-oriented plant capacity concept correlates best with mortality.

Therefore, this contribution sets itself three main goals. First, it develops a proper decomposition of the input-oriented Malmquist productivity index that is compatible with the new input-oriented plant capacity notion. This decomposition is distinct from the existing decomposition of the output-oriented Malmquist productivity index developed in De Borger and Kerstens (2000). In addition, the existing decomposition of the output-oriented Malmquist productivity index is extended by including the attainable output-oriented plant capacity utilisation measure.

Second, we are—to the best of our knowledge—the first empirical application of both these basic decompositions of the input-oriented and output-oriented Malmquist productivity indices using an empirical data set. In particular, we employ Chinese provincial data from tourism activities. For a lack of realistic upper bound on the amount of available variable inputs in our empirical application, we refrain from estimating the output-oriented Malmquist productivity index with the attainable output-oriented plant capacity utilisation measure.

Third, we are the first empirical application of both these basic decompositions of the Malmquist productivity index that test for the impact of the convexity assumption. While the impact of convexity is generally known to be considerable for technical efficiency measurement, its effect on the measurement of capacity utilisation and the computation of the Malmquist productivityindex remain under-explored. Cesaroni et al. (2017) show that convex and nonconvex input- and output-oriented plant capacity concepts differ significantly. Kerstens and Van de Woestyne (2014) are the only ones known to us that empirically illustrate that convex and nonconvex Malmquist productivity index results differ. Therefore, it is useful to see the impact of convexity on the above mentioned decompositions of the Malmquist productivity index.

This contribution is structured as follows. The next Sect. 2 defines the basic technologies, the Malmquist productivity indices, the necessary plant capacity concepts, as well as integration of these plant capacity concepts in the corresponding Malmquist productivity indices. Section 3 provides a succinct literature review about efficiency and productivity measurement in the tourism industry. The next Sect. 4 discusses the specification and the data employed. Empirical results are listed and discussed in Sect. 5. The empirical results contain the output- and the input oriented Malmquist productivity indices and their components, some Spearman rank correlations between both indices and their components, a t-test whether these indices differ from unity, and some robustness check under the form of a bootstrapping analysis. Section 6 concludes.

2 Technology, primal productivity indices, and plant capacity: definitions

We first introduce the assumptions on technology and the definitions of the required efficiency measures. Then, we define the Malmquist productivity indices (MPI) as well as the necessary plant capacity utilisation notions. The latter elements are then finally integrated into the components of the Malmquist productivity indices.

2.1 Technology and efficiency measures

This subsection introduces basic notation and defines the production technology. Assume that for periods \(t=1,...,T\), \(N-\)dimensional input vectors \(x^t \in \mathbb {R}_+^N\) are employed to produce \(M-\)dimensional output vectors \(y^t \in \mathbb {R}_+^M\). In each period t, the production possibility set or technology S is defined as follows: \(S^t = \{(x^t,y^t) | x^t \text { can produce at least } y^t\}\). A first alternative definition of technology \(S^t\) is the input set denoting all input vectors \(x^t\) capable of producing a given output vector \(y^t\): \(L^t(y^t)= \{x^t| (x^t,y^t) \in S^t\}\). A second alternative definition of technology \(S^t\) is the output set denoting all output vectors \(y^t\) that can be produced from a given input vector \(x^t\): \(P^t(x^t)=\{y^t|(x^t,y^t)\in S^t\}\).

The following standard assumptions are imposed on the technology \(S^t\):

  1. (T.1)

    Possibility of inaction and no free lunch, i.e., \((0,0) \in S^t\) and if \((0,y^t) \in S^t\), then \(y^t=0\).

  2. (T.2)

    \(S^t\) is a closed subset of \(\mathbb {R}_+^N \times \mathbb {R}_+^M\).

  3. (T.3)

    Strong input and output disposal, i.e., if \((x^t,y^t) \in S^t\) and \(({\bar{x}}^t,{\bar{y}}^t) \in \mathbb {R}_+^N \times \mathbb {R}_+^M\), then \(({\bar{x}}^t,- {\bar{y}}^t) \ge (x^t,-y^t) \Rightarrow ({\bar{x}}^t, {\bar{y}}^t) \in S^t\).

  4. (T.4)

    \(S^t\) is convex.

These traditional axioms on technology can be succinctly commented upon as follows (see, e.g., Hackman, 2008 for details). First, inaction is feasible, and there is no free lunch. Second, the technology is closed. Third, we impose free or strong disposal of both inputs and outputs in that inputs can be wasted and outputs can be discarded. Finally, technology is convex. In our empirical analysis later on these axioms are not always simultaneously maintained.Footnote 2 In particular, in the empirical analysis one key assumption distinguishing some of the technologies is convexity versus nonconvexity.

Turning to the definition of the input-and output-oriented efficiency measures needed to define Malmquist productivity index as well as the plant capacity notions, we start with the radial input efficiency measure that can be defined as follows:

$$\begin{aligned} DF^t_i(x^t,y^t)=\min \{\lambda \mid \lambda \ge 0, \lambda x^t \in L^t(y^t)\}. \end{aligned}$$
(1)

This radial input efficiency measure characterizes the input set \(L^t(y^t)\) completely. Its main properties are that it is smaller or equal to unity \((DF_i^t(x^t,y^t) \le 1)\), with efficient production on the boundary (isoquant) of \(L^t(y^t)\) represented by unity, and that it has a cost interpretation (see, e.g., Hackman, 2008).

The radial output efficiency measure can be defined as follows:

$$\begin{aligned} DF^t_o(x^t,y^t)=\max \{\theta \mid \theta \ge 0, \theta y^t \in P^t(x^t)\}. \end{aligned}$$
(2)

This radial output efficiency measure offers a complete characterization of the output set \( P^t(x^t)\). Its main properties are that it is larger than or equal to unity \((DF^t_o(x^t,y^t) \ge 1)\), with efficient production on the boundary (isoquant) of the output set \(P^t(x^t)\) represented by unity, and that this radial output efficiency measure has a revenue interpretation (e.g., Hackman, 2008).

In the short run, it is customary to distinguish between fixed and variable inputs. Thus, we can partition the input vector into a fixed and a variable part. In particular, we denote \(x^t=(x^t_f, x^t_v)\) with \(x^t_f\in \mathbb {R}_+^{N_f}\) and \(x^t_v\in \mathbb {R}_+^{N_v}\) such that \(N=N_f+N_v\). In an analogous way, a short-run technology \(S^t_f = \{(x^t_f,y^t) \in \mathbb {R}_+^{N_f} \times \mathbb {R}_+^M \mid \text { there exists some } x^t_v \text { such that } (x^t_f,x^t_v) \text { can produce at least } y^t \}\) and the corresponding short-run input set \(L^t_f(y^t)= \{x^t_f \in \mathbb {R}_+^{N_f} \mid (x^t_f,y^t) \in S^t_f\}\) and short-run output set \(P^t_f(x^t_f)=\{y^t \mid (x^t_f,y^t)\in S^t_f\}\) can be defined (see Cesaroni et al., 2019 for more details).

Denoting the radial output efficiency measure of the short-run output set \(P^t_f(x^t_f)\) by \(DF^t_o(x^t_f,y^t)\), this short-run output-oriented efficiency measure can be defined as follows:

$$\begin{aligned} DF_o^t(x^t_f,y^t)= \max \{\theta \mid \theta \ge 0, \theta y^t \in P^t_f(x^t_f)\}. \end{aligned}$$
(3)

The sub-vector input efficiency measure reducing only the variable inputs is defined:

$$\begin{aligned} DF^t_i(x^t_f,x^t_v,y^t)=\min \{\lambda \mid \lambda \ge 0, (x^t_f,\lambda x^t_v) \in L^t(y^t)\}. \end{aligned}$$
(4)

Finally, we need the following particular definition of technology: \(L^t(0) = \{x^t \mid (x^t,0) \in S^t\}\) is the input set with a zero level of outputs. The sub-vector input efficiency measure reducing variable inputs evaluated relative to this input set with zero outputs level is:

$$\begin{aligned} DF^t_i(x^t_f,x^t_v,0)=\min \{\lambda \mid \lambda \ge 0, (x^t_f,\lambda x^t_v) \in L^t(0)\}. \end{aligned}$$
(5)

Given data on K observations \((k= 1, \cdots , K)\) consisting of a vector of inputs and outputs \((x^t_k,y^t_k )\in \mathbb {R}_+^{N+M}\), a unified algebraic representation of convex and nonconvex nonparametric frontier technologies under the flexible or variable returns to scale assumption is possible as follows:

$$\begin{aligned} S^{t, \Gamma } = \left\{ (x^t,y^t) \,\mid \, x^t \ge \sum _{k=1}^K z_k x^t_k, y^t \le \sum _{k=1}^K z_k y^t_k, z=(z_1,\ldots ,z_k) \in \Gamma , \right\} , \end{aligned}$$
(6)

where

  1. (i)

    \(\displaystyle \Gamma \equiv \Gamma ^\text {C} = \left\{ z \, \mid \, \sum _{k=1}^K z_k = 1 \text { and } z_k \ge 0 \right\} \);

  2. (ii)

    \(\displaystyle \Gamma \equiv \Gamma ^\text {NC} = \left\{ z \, \mid \, \sum _{k=1}^K z_k = 1 \text { and } z_k \in \{0,1\} \right\} \).

The convexity axiom is represented by the activity vector z of real numbers summing to unity. This same sum constraint with each vector element being restricted to be a binary integer represents the nonconvexity axiom. The convex technology satisfies axioms (T.1) (except inaction) to (T.4), while the nonconvex technology complies with axioms (T.1–T.3). In the remainder, we condition the above notation of the efficiency measures relative to these nonparametric frontier technologies by distinguishing between convexity (convention C) and nonconvexity (convention NC).

Kerstens and Van de Woestyne (2014) empirically illustrate that to measure local technical change using a Malmquist productivity index one obtains the most precise results for variable returns to scale rather than for the often used constant returns to scale assumption. Furthermore, these authors show that convex and nonconvex Malmquist productivity results can differ substantially. Another pragmatic reason to opt for variable returns to scale is that some plant capacity notions may not be well defined under constant returns to scale.

2.2 Malmquist productivity indices: definitions

Using the output-oriented radial efficiency measures one can define the output-oriented Malmquist productivity index in base period t as follows:

$$\begin{aligned} M_o^t (x^t,y^t,x^{t+1},y^{t+1}) = \frac{DF_o^t(x^t,y^t)}{DF_o^t(x^{t+1},y^{t+1})}. \end{aligned}$$
(7)

Values of this base period t output-oriented Malmquist productivity index above (below) unity reveal productivity growth (decline).

Similarly, a base period \(t+1\) output-oriented Malmquist productivity index is defined:

$$\begin{aligned} M_o^{t+1}(x^t,y^t,x^{t+1},y^{t+1}) = \frac{DF_o^{t+1}(x^t,y^t)}{DF_o^{t+1}(x^{t+1},y^{t+1})}. \end{aligned}$$
(8)

Again, values of this base period \(t+1\) output-oriented Malmquist productivity index above (below) unity reveal productivity growth (decline).

To avoid an arbitrary selection among base years and inspired by Caves et al. (1982), Färe et al. (1995) define the output-oriented Malmquist productivity index as a geometric mean of a period t and a period \(t+1\) productivity index:

$$\begin{aligned} \begin{array}{ll} M_o^{t,t+1}(x^t,y^t,x^{t+1},y^{t+1}) &{}= \sqrt{M_o^{t}(x^t,y^t,x^{t+1},y^{t+1}) \cdot M_o^{t+1}(x^t,y^t,x^{t+1},y^{t+1})} \\ &{}= \sqrt{ \frac{DF_o^t(x^t,y^t)}{DF_o^t(x^{t+1},y^{t+1})}\cdot \frac{DF_o^{t+1}(x^t,y^t)}{DF_o^{t+1}(x^{t+1},y^{t+1})}}.\\ \end{array} \end{aligned}$$
(9)

The base period of this productivity index changes over time: it can be conceptualized as an index computed in a two year window sliding over the observations through time. Moreover, this geometric mean output-oriented Malmquist index (9) can be decomposed into two mutually exclusive components:

$$\begin{aligned} \begin{array}{ll} &{} M_o^{t,t+1} (x^t,y^t,x^{t+1},y^{t+1}) \\ &{}\quad = \underbrace{\frac{DF_o^{t}(x^t,y^t)}{DF_o^{t+1}(x^{t+1},y^{t+1})}}_{(i)} \underbrace{\sqrt{ \frac{DF_o^{t+1}(x^{t+1},y^{t+1})}{DF_o^t(x^{t+1},y^{t+1})}\cdot \frac{DF_o^{t+1}(x^t,y^t)}{DF_o^{t}(x^{t},y^{t})}}}_{(ii)}.\\ \end{array} \end{aligned}$$
(10)

The first component (i) measures the change in technical efficiency over time, while the second component (ii) is related to the shift of the frontier of the production technology (i.e., it captures technical change).

By analogy, an input-oriented Malmquist productivity index with base period t is defined as the ratio of two input efficiency measures as follows:

$$\begin{aligned} M_i^t (x^t,y^t,x^{t+1},y^{t+1}) = \frac{DF_i^t(x^t,y^t)}{DF_i^t(x^{t+1},y^{t+1})}. \end{aligned}$$
(11)

Values of this base period t input-oriented Malmquist productivity index below (above) unity reveal productivity growth (decline).

Similarly, an input-oriented Malmquist productivity index with base period \(t+1\) can similarly be defined as:

$$\begin{aligned} M_i^{t+1}(x^t,y^t,x^{t+1},y^{t+1}) = \frac{DF_i^{t+1}(x^t,y^t)}{DF_i^{t+1}(x^{t+1},y^{t+1})}. \end{aligned}$$
(12)

Again, values of this base period \(t+1\) input-oriented Malmquist productivity index below (above) unity reveal productivity growth (decline). Note that since the \(DF_i(x,y) \le 1\) and \(DF_o(x,y) \ge 1\), the interpretation of Eqs. (11) and (12) are inverse of the interpretation of Eqs. (7) and (8).

To avoid an arbitrary choice of base period, the input-oriented Malmquist productivity index is defined as a geometric mean of a period t and \(t+1\) productivity index:

$$\begin{aligned} \begin{array}{ll} M_i^{t,t+1}(x^t,y^t,x^{t+1},y^{t+1}) &{}= \sqrt{M_i^{t+1}(x^t,y^t,x^{t+1},y^{t+1}) \cdot M_i^{t+1}(x^t,y^t,x^{t+1},y^{t+1})} \\ &{}= \sqrt{ \frac{DF_i^t(x^t,y^t)}{DF_i^t(x^{t+1},y^{t+1})}\cdot \frac{DF_i^{t+1}(x^t,y^t)}{DF_i^{t+1}(x^{t+1},y^{t+1})}}.\\ \end{array} \end{aligned}$$
(13)

Note that when the geometric mean input-oriented Malmquist productivity index is larger (smaller) than unity, it points to a productivity growth (decline). Moreover, the Malmquist index (13) can be decomposed into two mutually exclusive components:

$$\begin{aligned} \begin{array}{ll} &{} M_i^{t,t+1} (x^t,y^t,x^{t+1},y^{t+1}) \\ &{}\quad =\underbrace{\frac{DF_i^{t}(x^t,y^t)}{DF_i^{t+1}(x^{t+1},y^{t+1})}}_{(i)} \underbrace{\sqrt{ \frac{DF_i^{t+1}(x^{t+1},y^{t+1})}{DF_i^t(x^{t+1},y^{t+1})}\cdot \frac{DF_i^{t+1}(x^t,y^t)}{DF_i^{t}(x^{t},y^{t})}}}_{(ii)}.\\ \end{array} \end{aligned}$$
(14)

The first component (i) measures the change in technical efficiency over time, while the second component (ii) is related to the shift of the frontier of the production technology (i.e., it captures technical change). Note that when this input-oriented Malmquist productivity index (14) is smaller (larger) than unity, it points to a productivity growth (decline). A similar interpretation applies to the separate components.

Following Ouellette and Vierstraete (2004), the sub-vector input-oriented Malmquist productivity index can now be defined as follows:

$$\begin{aligned} \begin{array}{ll} &{} M_i^{t,t+1} (x_f^t,x_v^t,y^t,x_f^{t+1},x_v^{t+1},y^{t+1}) \\ &{}=\frac{DF_i^{t}(x_f^t,x_v^t,y^t)}{DF_i^{t+1}(x_f^{t+1},x_v^{t+1},y^{t+1})} \sqrt{ \frac{DF_i^{t+1}(x_f^{t+1},x_v^{t+1},y^{t+1})}{DF_i^t(x_f^{t+1},x_v^{t+1},y^{t+1})}\cdot \frac{DF_i^{t+1}(x_f^t,x_v^t,y^t)}{DF_i^{t}(x_f^{t},x_v^{t},y^{t})}}.\\ \end{array} \end{aligned}$$
(15)

The interpretation of this sub-vector input-oriented Malmquist productivity index as well as its decomposition is exactly similar to the previous index (14).

Note that since the \(DF_i(x,y) \le 1\) and \(DF_o(x,y) \ge 1\), the interpretation of Eqs. (11) and (12) are inverse of the interpretation of Eqs. (7) and (8). Moreover, when the input-oriented Malmquist productivity index (15) is smaller (larger) than unity, it points to a productivity growth (decline) while the interpretation of the output-oriented Malmquist productivity index (10) is exactly the inverse.

2.3 Plant capacity utilisation: definitions

The informal definition of output-oriented plant capacity by (Johansen, 1968, p. 362) has been made operational by Färe et al. (1989) using a pair of output-oriented efficiency measures. We now recall the definition of their output-oriented plant capacity utilization (PCU). The output-oriented plant capacity utilization (\(PCU_o\)) in each period t is defined as:

$$\begin{aligned} PCU^t_{o}(x^t,x^t_f, y^t)=\frac{DF^t_o(x^t,y^t)}{DF^t_o(x^t_f,y^t)}, \end{aligned}$$
(16)

where \(DF^t_o(x^t,y^t)\) and \(DF^t_o(x^t_f,y^t)\) are output efficiency measures including respectively excluding the variable inputs as defined before in (2) and (3).

Since \(1 \le DF^t_o(x^t,y^t) \le DF^t_o(x^t_f,y^t)\), notice that \(0<PCU^t_{o}(x^t,x^t_f, y^t) \le 1\). Thus, output-oriented plant capacity utilization has an upper limit of unity. This output-oriented plant capacity utilisation compares the maximum amount of outputs with given inputs to the maximum amount of outputs in the sample with potentially unlimited amounts of variable inputs, whence it is smaller than unity. It answers the question how the current amount of efficient outputs relates to the maximal possible amounts of efficient outputs. Following the terminology introduced by Färe et al. (1989, 1994) one can distinguish between a so-called biased plant capacity measure \(DF^t_o(x^t_f,y^t)\) and an unbiased plant capacity measure \(PCU^t_{o}(x^t,x^t_f, y^t)\). Taking the ratio of efficiency measures eliminates any existing inefficiency and yields an in this sense cleaned concept of output-oriented plant capacity. This leads to the following output-oriented decomposition:

$$\begin{aligned} DF^t_o(x^t,y^t)=DF^t_o(x^t_f,y^t) \cdot PCU^t_{o}(x^t,x^t_f, y^t). \end{aligned}$$
(17)

Thus, the traditional output-oriented efficiency measure \(DF^t_o(x^t,y^t)\) can be decomposed into a biased plant capacity measure \(DF^t_o(x^t_f,y^t)\) and an unbiased plant capacity measure \(PCU^t_{o}(x^t,x^t_f, y^t)\).

Recently, Kerstens et al. (2019) have argued and empirically illustrated that the output-oriented plant capacity utilization \(PCU^t_{o}(x^t,x^t_f, y^t)\) may be unrealistic in that the amounts of variable inputs needed to reach the maximum capacity outputs may simply be unavailable at either the firm or the industry level. This is linked to what Johansen (1968) called the attainability issue. Hence, Kerstens et al. (2019) define a new attainable output-oriented plant capacity utilization at the firm level. We now recall the definition of their attainable output-oriented plant capacity utilization (APCU) at level \({\bar{\lambda }} \in \mathbb {R}_+\) in each period t as follows:

$$\begin{aligned} APCU^t_{o}(x^t,x^t_f, y^t,\bar{\lambda })=\frac{DF^t_o(x^t,y^t)}{ADF_o^t(x^t_f,y^t,\bar{\lambda })}, \end{aligned}$$
(18)

where the attainable output-oriented efficiency measure \(ADF_o^f\) at a certain level \({\bar{\lambda }} \in \mathbb {R}_+\) is defined by

$$\begin{aligned} ADF_o^t(x^t_f,y^t,\bar{\lambda })=\max \{\varphi \mid \varphi \ge 0, 0 \le \lambda \le \bar{\lambda }, \varphi y^t \in P^t(x^t_f,\lambda x^t_v)\}. \end{aligned}$$
(19)

Again, for \(\bar{\lambda }\ge 1\), since \(1 \le DF^t_o(x^t,y^t) \le ADF_o^t(x^t_f,y^t,\bar{\lambda })\), notice that \(0<APCU^t_{o}(x^t,x^t_f, y^t,\bar{\lambda }) \le 1\). Also, for \(\bar{\lambda } < 1\), since \(1 \le ADF_o^t(x^t_f,y^t,\bar{\lambda }) \le DF^t_o(x^t,y^t)\), notice that \(1 \le APCU^t_{o}(x^t,x^t_f, y^t,\bar{\lambda })\).

One can again distinguish between a so-called biased attainable plant capacity measure \(ADF_o^t(x^t_f,y^t,\bar{\lambda })\) and an unbiased attainable plant capacity measure \(APCU^t_{o}(x^t,x^t_f, y^t,\bar{\lambda })\), whereby the latter is cleaned from any eventual inefficiency. This leads to the following output-oriented decomposition:

$$\begin{aligned} DF^t_o(x^t,y^t)=ADF_o^t(x^t_f,y^t,\bar{\lambda }) \cdot APCU^t_{o}(x^t,x^t_f, y^t,\bar{\lambda }). \end{aligned}$$
(20)

Therefore, the traditional output-oriented efficiency measure \(DF^t_o(x^t,y^t)\) can be decomposed into a biased attainable plant capacity measure \(ADF_o^t(x^t_f,y^t,\bar{\lambda })\) and an unbiased attainable plant capacity measure \(APCU^t_{o}(x^t,x^t_f, y^t,\bar{\lambda })\). Furthermore, Kerstens et al. (2019) note that if expert opinion cannot determine a plausible value, then it may be better to opt for the next input-oriented plant capacity measure that does not suffer from the attainability issue.

Cesaroni et al. (2017) define a new input-oriented plant capacity measure using a pair of input-oriented efficiency measures. The input-oriented plant capacity utilization (\(PCU_i\)) in each period t is defined as:

$$\begin{aligned} PCU^t_{i}(x^t,x^t_f, y^t)=\frac{DF^t_i(x^t_f,x^t_v,y^t)}{DF^t_i(x^t_f,x^t_v,0)}, \end{aligned}$$
(21)

where \(DF^t_i(x^t_f,x^t_v,y^t)\) and \(DF^t_i(x^t_f,x^t_v,0)\) are both sub-vector input efficiency measures reducing only the variable inputs relative to the technology, whereby the latter efficiency measure is evaluated at a zero output level.

Since \(0<DF^t_i(x^t_f,x^t_v,0) \le DF^t_i(x^t_f,x^t_v,y^t)\), notice that \(PCU^t_{i}(x^t,x^t_f, y^t) \ge 1\). Thus, input-oriented plant capacity utilization has a lower limit of unity. This input-oriented plant capacity utilisation compares the minimum amount of variable inputs for given amounts of outputs with the minimum amount of variable inputs with output levels where production is initiated, whence it is larger than unity. It answers the question how the amount of variable inputs compatible with the initialisation of production must be scaled up to produce the current amount of outputs. Similar to the previous case, one can distinguish between a so-called biased plant capacity measure \(DF^t_i(x^t_f,x^t_v,0)\) and an unbiased plant capacity measure \(PCU^t_{i}(x^t,x^t_f, y^t)\), the latter being cleaned of any prevailing inefficiency. This leads to the following input-oriented decomposition:

$$\begin{aligned} DF^t_i(x^t_f,x^t_v,y^t)= DF^t_i(x^t_f,x^t_v,0) \cdot PCU^t_{i}(x^t,x^t_f, y^t). \end{aligned}$$
(22)

Thus, the traditional sub-vector input-oriented efficiency measure \(DF^t_i(x^t_f,x^t_v,y^t)\) is decomposed into a biased plant capacity measure \(DF^t_i(x^t_f,x^t_v,0)\) and an unbiased plant capacity measure \(PCU^t_{i}(x^t,x^t_f, y^t)\).

It is important to notice that output- and input-oriented plant capacity notions differ with respect to the concept of attainability. The more recent input-oriented plant capacity notion is always attainable in that one can always reduce the amount of variable inputs such that one reaches an input set with zero output level. Indeed, due to the axiom of inaction it is normally possible to reduce variable inputs to reach zero production levels. Inaction simply means that one can halt production. Producing a zero output need not imply that no inputs are used. An example of zero production with positive amounts of variable inputs are maintenance activities in large industrial plants that bring production to a halt.

2.4 Integration of plant capacity utilisation and malmquist productivity indices

Following De Borger and Kerstens (2000), starting from the basic decomposition of the output-oriented Malmquist productivity index (10) into technical efficiency change and technical change one can isolate changes in capacity utilisation from technical efficiency change in the first component. In particular, incorporating (10) and (17) we can straightforwardly decompose the technical efficiency change component of the Malmquist productivity index \(M_o^{t,t+1} (x^t,y^t,x^{t+1},y^{t+1})\) to obtain:

$$\begin{aligned} \begin{array}{ll} &{} M_o^{t,t+1} (x^t,y^t,x^{t+1},y^{t+1})\\ &{}= \underbrace{\frac{DF_o^{t}(x_f^t,y^t)}{DF_o^{t+1}(x_f^{t+1},y^{t+1})}}_{(i)}\cdot \underbrace{\frac{PCU_o^{t}(x^{t}, x_f^{t},y^{t})}{PCU_o^{t+1}(x^{t+1}, x_f^{t+1},y^{t+1})}}_{(ii)} \underbrace{\sqrt{ \frac{DF_o^{t+1}(x^{t+1},y^{t+1})}{DF_o^t(x^{t+1},y^{t+1})}\cdot \frac{DF_o^{t+1}(x^t,y^t)}{DF_o^{t}(x^{t},y^{t})}}}_{(iii)}.\\ \end{array}\nonumber \\ \end{aligned}$$
(23)

This expression (23) shows that productivity changes are the combined results of three separate phenomena. The first component (i) measures the change in technical efficiency assuming a constant degree of capacity utilization. Specifically, it evaluates the change in technical efficiency relative to a full capacity output technology between periods t and \(t+1\). The second component (ii) captures the change in the degree of plant capacity utilisation between t and \(t+1\) while holding the level of technical efficiency constant. The third component (iii) is the same as in (10) and reflects pure technical change. When any of the components is larger (smaller) than unity, this indicates an improvement (deterioration) in the corresponding component, except for the component indicating changes in plant capacity utilization. For the latter, a number smaller (larger) than unity indicates an improvement (deterioration). In other words, this decomposition of the Malmquist productivity index provides a straightforward procedure for relating productivity growth to the dynamics of capacity utilization.

Similarly, we can now present a new decomposition of the technical efficiency change component of the attainable output-oriented Malmquist productivity index \(M_i^{t,t+1} (x_f^t,x_v^t,y^t,x_f^{t+1},x_v^{t+1},y^{t+1})\) at level \(\bar{\lambda }\). By incorporating (10) and (20) as follows:

$$\begin{aligned} \begin{array}{ll} M_o^{t,t+1} (x^t,y^t,x^{t+1},y^{t+1})&{}=\underbrace{\frac{ADF_o^{t}(x_f^t,y^t,\bar{\lambda })}{ADF_o^{t+1}(x_f^{t+1},y^{t+1},\bar{\lambda })}}_{(i)} \cdot \underbrace{\frac{APCU_o^{t}(x^{t}, x_f^{t},y^{t},\bar{\lambda })}{APCU_o^{t+1}(x^{t+1}, x_f^{t+1},y^{t+1},\bar{\lambda })}}_{(ii)} \\ &{}\quad \underbrace{\sqrt{\frac{DF_o^{t+1}(x^{t+1},y^{t+1})}{DF_o^t(x^{t+1},y^{t+1})}\cdot \frac{DF_o^{t+1}(x^t,y^t)}{DF_o^{t}(x^{t},y^{t})}}}_{(iii)}. \end{array} \end{aligned}$$
(24)

This expression (24) shows that productivity changes are the combined results of three separate statements. The first part (i) measures the change in technical efficiency assuming a constant degree of attainable capacity utilization. Specifically, it evaluates the change in technical efficiency relative to a full attainable capacity output technology between periods t and \(t+1\). The second component (ii) captures the change in the degree of attainable plant capacity utilisation between t and \(t+1\) while holding the level of technical efficiency constant. The third component (iii) is the same as in (10) and (23), and reflects pure technical change. When any of these components is larger (smaller) than unity, this indicates an improvement (deterioration) in the corresponding component, except for the component indicating changes in plant capacity utilization. For the latter, a number smaller (larger) than unity indicates an improvement (deterioration). In other words, this decomposition of the Malmquist productivity index provides a straightforward procedure for relating productivity growth to the dynamics of capacity utilization.

By analogy, we can now present a new decomposition of the technical efficiency change component of the input-oriented Malmquist productivity index \(M_i^{t,t+1} (x_f^t,x_v^t,y^t,x_f^{t+1},x_v^{t+1},y^{t+1})\). By incorporating (15) and (22), one obtains:

$$\begin{aligned} \begin{array}{ll} &{} M_i^{t,t+1} (x_f^t,x_v^t,y^t,x_f^{t+1},x_v^{t+1},y^{t+1})=\underbrace{\frac{DF_i^{t}(x_f^t,x_v^t,0)}{DF_i^{t+1}(x_f^{t+1},x_v^{t+1},0)}}_{(i)} \\ &{}\cdot \underbrace{\frac{PCU_i^{t}(x^{t}, x_f^{t},y^{t})}{PCU_i^{t+1}(x^{t+1}, x_f^{t+1},y^{t+1})}}_{(ii)} \underbrace{\sqrt{ \frac{DF_i^{t+1}(x_f^{t+1},x_v^{t+1},y^{t+1})}{DF_i^t(x_f^{t+1},x_v^{t+1},y^{t+1})}\cdot \frac{DF_i^{t+1}(x_f^t,x_v^t,y^t)}{DF_i^{t}(x_f^{t},x_v^{t},y^{t})}}}_{(iii)}. \end{array} \end{aligned}$$
(25)

This expression (25) shows that productivity changes are the combined results of three separate phenomena. The first component (i) measures the change in technical efficiency assuming a constant degree of capacity utilization. Specifically, it evaluates the change in technical efficiency relative to a full capacity input technology between periods t and \(t+1\). The second component (ii) captures the change in the degree of input-oriented plant capacity utilisation between t and \(t+1\) while holding the level of technical efficiency constant. The third component (iii) is the same as in (15) and reflects pure technical change. When any of these components is smaller (larger) than unity, this indicates an improvement (deterioration) in the corresponding component, except for the component indicating changes in plant capacity utilization. For the latter, a number larger (smaller) than unity indicates an improvement (deterioration). In other words, this decomposition of the Malmquist productivity index provides a straightforward procedure for relating productivity growth to the dynamics of capacity utilization.

Note that for all these three Malmquist index decompositions (23), (24), and (25) are defined relative to variable returns to scale technologies, which limits the scope for further decompositions (see Zofío, 2007). Note furthermore that for these three Malmquist index decompositions (23), (24), and (25) there is always the possibility that the frontier change component is infeasible. The incidence of infeasibilities is determined by the empirical data configurations (see Kerstens & Van de Woestyne, 2014 for more details). Finally, to save space details on the mathematical programming problems needed to compute all components of these Malmquist index decompositions (23), (24), and (25) are available in Appendix A.

3 Efficiency and productivity in tourism: a succinct review

Tourism has become a major part of some countries economic activities. The notion of productivity is complex and multi-faceted to apply in the tourism sector with its mixture of complementary private and public sector activities (see, e.g., Ritchie & Crouch (2003) for a review). There is a rather substantial literature using traditional average practice specifications of technology and limiting itself to partial productivity indicators (for example, McMahon, 1994). Furthermore, a wide range of methodologies has been used to gauge productivity changes. The work by Blake et al. (2006) is one example that uses computable general equilibrium models to evaluate productivity change.

A lot of recent studies have opted for studying the efficiency and productivity based on best practice frontier technology specifications. While it is fair to say that the deterministic, nonparametric frontier methods (often denoted as Data Envelopment Analysis models) seem to be most popular in the tourism field at large, also stochastic frontier analysis is being used on a regular basis (e.g., Anderson et al., 1999), and even Bayesian approaches are occasionally employed (for instance, Assaf & Tsionas, 2018). Furthermore, for each of these basic frontier methods, a plethora of methodological refinements is available: for instance, the basic deterministic, nonparametric frontier methods have been extended into a metafrontier to envelop groups of frontiers in, e.g., Huang et al. (2013).

Most existing published efficiency studies in tourism have focused on privately owned facilities. Popular themes of study have been the efficiency of hotels (e.g., Barros et al., 2009), restaurants (for instance, Banker & Morey, 1986), and travel agencies (e.g., Sellers-Rubio & Nicolau-Gonzálbez, 2009), among others. Alternatively, some efficiency studies have attempted to evaluate the performance of public sector tourism infrastructures like museums (e.g., Mairesse & Vanden Eeckaut, 2002), national parks (for instance, Bosetti & Locatelli, 2006), or theaters (e.g., Last & Wetzel, 2010).

There are also proposals to analyse the efficiency and productivity in the tourism sector at an aggregate level (e.g., Peypoch & Solonandrasana, 2008). Furthermore, one can mention some other isolated attempts to judge certain aspects of tourism policies at the macro level. For example, Botti et al. (2012) develop a mean-variance portfolio approach to help destination management organizations minimize variance and maximize return of inbound tourism. In a similar vein, Botti et al. (2009) analyse the tourism destination competitiveness of French regions. For instance, Wober & Fesenmaier (2004) assess the efficiency of advertising budgets of state tourism offices in the United States. As a final example, Cracolici et al. (2008) evaluate 103 Italian regions for the single year 2001: the single output bed-nights relative to population is related to proxies for cultural and historical capital, human capital, and labour inputs.

Focusing on the hotel industry, perhaps the seminal article is Morey & Dittman (1995) who evaluate the performance of 54 hotels of a national chain in the USA. Since this classic article a wide variety of efficiency assessments have been made for hotels and hotel chains in a number of countries. Examples of more recent applications at the national or regional level include: Huang et al. (2012) for China; Zhang et al. (2016) for France; Bosetti et al. (2007) for Italy; Barros (2005) for Portugal; Assaf and Cvelbar (2011) for Slovenia; Devesa and Peñalver (2013) for Spain; Hathroubi et al. (2014) for Tunesia; Anderson et al. (1999) for the US; among others. To the best of our knowledge, Moriarty (2010) is the only article analysing output-oriented plant capacity, among others, for the New Zealand hotel sector at large.

Reviewing the literature, there are a rather limited number of studies focusing on a dynamic productivity analysis of hotels over a minimal time period. Since these studies are relevant for our own study, we succinctly summarise key research findings. Sun et al. (2015) evaluate an output-oriented MPI to Chinese regions from 2001 to 2009 and find positive productivity change driven by technological change and some regional heterogeneity. Barros et al. (2009) apply a Luenberger productivity indicator to 15 Portugese hotels for the 1998–2004 period and find an positive average productivity change that is mainly due to technological change.

4 Data and specification

Tourism industry has grown rapidly in recent years. It has even become one of the most crucial sectors in China. With the booming of tourism, a fierce competition has been imposed on the hospitality industry. Also, substantial investment have been made in the industry. For instance, total assets have increased from 653 billion RMB in 2008 to 1 215 billion RMB in 2016. However, the profit versus total asset rate has dropped from 20.75 to 17.35% between 2008 and 2016. Thus, operational efficiency seems to have become a major concern for the Chinese accommodation industry.

In the tourism literature, there is still some argument about whether star-rated hotels can be regarded as representative of the hospitality industry (see Núñez-Serrano et al., 2014). Hence, in this paper our models are applied to the Chinese accommodation industry above a minimal designed size, since this is the most comprehensive range of data available.Footnote 3 In the sequel, we first discuss the specification of inputs and outputs in the technology in more detail. Then, we present some descriptive statistics for the sample.

4.1 Specification: choice of inputs and outputs

One characteristic of the accommodation industry is the multitude of activities. The majority of hotels provides not only accommodation, but also other supplementary services, such as catering and entertainment. In our study, we consider that hotels propose three main services: (i) accommodation activity (rooms), (ii) food and beverage services (meals), and (iii) other services such as entertainment. Then, following past studies the revenues generated from each of these three activities are used to reflect the hotels’ profitability (e.g., Hu et al., 2010). As for the inputs, in total three variable inputs and one fixed input are considered. The three variable inputs are: (i) number of employees represents the indispensable core asset that make the hotels capable to offer all three services; (ii) current assets represent the hotel’s capacity to support its daily operation; and (iii) main business costs describe the hotel’s main expenses on its business activities. In addition, we follow economic tradition by considering as a single fixed input: (iv) total fixed assets reflect the hotel’s support to its development and future extension.

4.2 Descriptive statistics

To ensure the homogeneity of the hotel technology in this study, we have selected a sample of 31 provinces in mainland China with a period spanning from 2008 to 2016. As such, this represents a unique opportunity to evaluate the whole Chinese accommodation industry over a rather long period of time. To obtain the data for our inputs and outputs, we make use of a commercial database: the Wind Database. We have four inputs: (i) number of employees (in 10 000 persons); (ii) current assets (in CNY 100 million); (iii) main business cost (in CNY 100 million); and (iv) fixed assets (in CNY 100 million). Obviously, the first tree assets are variable inputs, while the fourth input is fixed following economic tradition. Note that also Barros et al. (2010) consider fixed assets as a quasi-fixed input. We also have three outputs: (v) revenues from meals (in CNY 100 million); (vi) revenues from rooms (in CNY 100 million); and (vii) other revenues (in CNY 100 million).

As an initial step, some descriptive statistics for inputs and outputs are presented in Table 1 to contextualize our analysis: trimmed mean, standard deviation (St. Dev.), and minimum (Min.) and maximum (Max.). One observes a rather wide range of variation, which is not uncommon for this aggregate level of analysis.

Table 1 Descriptive statistics for Chinese hotels (2008–2016)

To depict the evolution of the trimmed mean in Table 1 of all inputs and outputs over the different years, we use Fig. 1a and b that trace the inputs and outputs , respectively. Note that since the first input, i.e., number of employees (No. of Employees), is reported in terms of 10000 persons, it is plotted against the secondary axis on the right-hand side in Fig. 1a.

Fig. 1
figure 1

Inputs and outputs changes over different periods

Figure 1a depicts the average evolution of the inputs. Clearly, two input variables have increased substantially and in a monotonous way: the number of current assets and main costs have increased by 152 and 92.92% respectively. However, for the number of employees we notice that after an initial increase there is a substantial 6.8% drop. While for the fixed-assets, there is a one year substantial drop and then a continuous increase that almost compensates this initial drop. This reduction in overall fixed assets is due to a shift in investments towards high-end hotels in major tourism provinces such as Beijing, Guangdong, Jiangsu, Shanghai, Shandong and Zhejiang.Footnote 4 Looking at the evolution of the inputs over time in Fig. 1a, it is not straightforward to empirically determine the fixed nature of some of these inputs, partially because of the aggregate nature of the data.Footnote 5 All these numbers show that the Chinese accommodation industry has tried to rationalize its input usage, revealing that operational efficiency is clearly an objective for the policy makers involved.

Figure 1b shows the evolution of the three outputs over time. First, we observe that all three time series increase almost monotonously. Second, it is clearly visible that the shares of the room services and other services become relatively speaking more important. In fact, the revenue share of meals decreases slightly.

5 Empirical results

5.1 Results for output-oriented Malmquist productivity index

Table 2 reports the basic descriptive statistics for the components of the output-oriented MPI (23) from 2008 to 2016. In this table, the first eight columns list the results under C, while the last eight columns report the results under NC. The rows of Table 2 include four parts. In each part, the first line lists the number of feasible observations for the components of the output-oriented Malmquist productivity index, while the next four lines list descriptive statistics: geometric mean, standard deviation, minimum and maximum. Note that the use of a geometric mean ensures that the multiplicative decomposition holds true exactly. Part (i) reports basic descriptive statistics for the first component of (23), i.e., the component \(\frac{DF_o^{t}(x_f^t,y^t)}{DF_o^{t+1}(x_f^{t+1},y^{t+1})}\) which shows the change in technical efficiency (or rather, the change in the degree of biased plant capacity utilisation) between periods t and \(t+1\). To facilitate comparison between Part (ii) and other parts, we report the basic descriptive statistics for the inverse of the second component of (23), i.e., \(\frac{PCU_o^{t+1}(x^{t+1}, x_f^{t+1},y^{t+1})}{PCU_o^{t}(x^{t}, x_f^{t},y^{t})}\) that shows the change in the degree of unbiased plant capacity utilisation between periods \(t+1\) and t. Finally, part (iii) shows the third component of (23) that is related to the shift of the production frontier. Finally, the last part states the output-oriented MPI (23) as the product of its components. Thus, all components can be interpreted in the same way: a component larger than unity indicates growth, while a component smaller than unity indicates decline.

Table 2 Descriptive statistics for the output-oriented MPI (23) and its components

Analysing the results in Table 2, we can infer the following conclusions. First, on average the change in the degree of biased plant capacity utilisation (part i) is rather close to the degree of unbiased plant capacity utilisation (part (ii)) for all periods under C. These two components turn out to be identical under NC. This is due to the fact that the numerator of plant capacity utilisation is always unity for all observations under NC: \(DF^t_o(x^t,y^t)=1\). Given that the biased plant capacity utilisation measures \(DF^t_o(x^t_f,y^t) \le 1 \) are always smaller than unity, this leads to this particular result. Second, under C for the periods 2009–2010 and 2011–2012 the degree of biased and unbiased plant capacity utilisation improve. Under NC both the degree of biased and unbiased plant capacity utilisation improve in periods 2008–2009, 2009–2010 and 2012–2013. Third, for the average of the frontier change (part iii), we obtain a minimum amount in period 2012–2013 and a maximum amount in the period 2008–2009 under both C and NC. Also, the average of part (iii) is larger than the averages of parts (i) and (ii) for all periods, except for periods 2011–2012 and 2012–2013 under both C and NC and for period 2009–2010 under NC only.

Note that there are a few computational infeasibilities for the frontier change component: this problem is identical for C and NC, except for the years 2011–2012 and 2012–2013 where there is one more infeasibility under NC.

Table 3 reports the Spearman rank correlation coefficients for components of the output-oriented MPI (23). This table is structured as follows. First, components on the diagonal (in bold) depict the rank correlation between the C and NC cases. Second, the components under the diagonal show the rank correlation between NC components, and the components above the diagonal show the rank correlation between the C components.

Table 3 Spearman rank correlations for the output-oriented MPI (23) and its components

The following three conclusions emerge from studying Table 3. First, for the C results, one can observe that part (iii) and MPI have a very high rank correlation and part (i) and inverse of part (ii) have the highest rank correlation among all components of the output-oriented MPI. Second, for the NC results, part (iii) and MPI have a unity rank correlation while also part (i) and inverse of part (ii) have a unity rank correlation. Third, comparing C and NC results, the highest rank correlations are for MPI compared to part (iii), while parts (i) and (ii) correlate weakly.

5.2 Results for input-oriented Malmquist productivity index

Table 4 is structured in a way similar to Table 2. This table reports the basic descriptive statistics for components of the input-oriented MPI (25) from 2008 to 2016. Analogously to subsection 5.1, all components can now be interpreted in the same way: a component smaller than unity indicates growth, while a component larger than unity indicates decline.

Table 4 Descriptive statistics for the input-oriented MPI (25) and its components

Analysing the results in Table 4, one can draw the following conclusions. First, on average the change in the degree of biased plant capacity utilisation (part i) is almost close to the degree of unbiased plant capacity utilisation (part ii) for all periods under C while they are identical under NC. This is due to the fact that the numerator of input-oriented plant capacity utilisation is always unity for all observations under NC: \(DF^t_i(x^t_f,x^t_v,y^t)=1\). Given that the biased input-oriented plant capacity utilisation measures \(DF^t_i(x^t_f,x^t_v,0) \le 1 \) are always smaller than unity, this leads to this particular result. Second, only for the periods 2009–2010 and 2012–2013 the biased and unbiased capacity utilisation indices are larger than unity, indicating an improvement, while for all other periods these deteriorate under both C and NC. Third, the average frontier change (part iii) is minimal in period 2009–2010, improves till period 2012–2013, and then decreases. Also, the average frontier change is larger than the average changes in parts (i) and (ii) for all periods, except for periods 2009–2010 under C and NC.

Note that under NC the number of computational infeasibilities for the frontier change is much higher than under C. While the NC frontier technology leads to a closer fit with the data and results in a more precise measurement of local technical change, this precision comes at the cost of an increased possibility of infeasibilities (see also Kerstens & Van de Woestyne, 2014).

Table 5 reports the Spearman rank correlation coefficients for component of the input-oriented MPI (25). This table is structured in a similar way to Table 3. First, components on the diagonal (in bold) depict the rank correlation between the C and NC cases. Second, the components under the diagonal show the rank correlation between NC components, and the components above the diagonal show the rank correlation between the C components.

Table 5 Spearman rank correlations for the input-oriented MPI (25) and its components

The following three conclusions emerge from studying Table 5. First, for the C results, one can observe that part (iii) and MPI have a very high rank correlation and part (i) and inverse of part (ii) have the highest rank correlation among all components of the input-oriented MPI. Second, for the NC results, part (iii) and the input-oriented MPI have a unity rank correlation while also part (i) and inverse of part (ii) have a unity rank correlation. Third, comparing C and NC results, the highest rank correlations are for Part (ii) followed by part (iii) and then the other components.

5.3 Comparing output- and input-oriented Malmquist productivity indices and some further results

To compare output- and input-oriented Malmquist productivity indices, one can deduce the following conclusions. First, the output-oriented MPI moves inverse to the input-oriented MPI in all periods except for the two last ones under C. Thus, there is agreement on the same pattern of growth and decline, except for the two last periods under C. This inverse relationship is somewhat mitigated under NC: only in the 3 periods 2011–2012 till 2013–2014 this inverse relation holds true. Thus, there is less agreement on patterns of growth and decline under NC. Thus, overall output- and input-oriented MPI do not necessarily measure the same things. Second, the frontier change component (part iii) moves in an inverse way when comparing both MPI indices under C for almost all periods except the last one, while it moves in an inverse way only for the periods 2011–2012 till 2013–2014 under NC. Thus, there is less agreement on patterns of frontier change under NC. Overall, output- and input-oriented frontier change do not necessarily measure the same things all the time. Third, the plant capacity utilisation change (part ii) moves in an inverse way when comparing both MPI indices under C for the periods 2011–2012 and 2012–2013; while it moves in an inverse way only for the periods 2008–2009 under NC. Thus, there is less agreement on patterns of plant capacity utilisation change under NC. Thus, output- and input-oriented plant capacity utilisation change are not necessarily measuring things exactly the same all the time.

Table 6 Spearman rank correlations among components of output- and input-oriented MPI (23) and (25)

Table 6 reports the Spearman rank correlation coefficients among the components of the output- and input-oriented MPI under C and NC separately. To calculate this Spearman rank correlation coefficients, we ensure that all components of the input-oriented MPI (15) and output-oriented MPI (10) have the same interpretation. Therefore, we invert the second part of the output-oriented MPI (10) such that all output-oriented components have the same interpretation. Furthermore, we invert the input-oriented MPI (15) as well as its first and third components such that these are in line with the second component. Thus, all output- and input-oriented MPI and components now are interpreted as follows: when any of these components is larger (smaller) than unity, this indicates an improvement (deterioration) in the corresponding component.

The following two conclusions emerge from studying Table 6. First, for the C results, one can observe that the highest rank correlations are for output- and input-oriented MPI followed by part (iii). Second, for the NC results, all components of the output- and input-oriented MPI experience very low rank correlations.

In most of the literature computing Malmquist productivity indices, it is simply assumed that the observed components are somehow statistically significant. However, the hotel industry is a service sector and there are some a prior indications that the quaternary sector of the economy may suffer from Baumol’s cost disease: a rise of salaries in jobs that experience no or low increase of (labour) productivity. Furthermore, the hotel sector is rather sensitive to the business cycle. Finally, some hotel studies like Assaf & Barros (2011) do report overall close to stagnant productivity growth: for a sample of 31 hotel chains over the period 2006 to 2008 about half experience productivity decline, while the other half benefits from productivity growth. A glance at our own Tables 2 and 4 suffices to see that the MPI hovers very closely around unity.

Therefore, it can be meaningful to perform a t-test to evaluate whether the average MPI as well as its components are significantly different from unity or not. We report the corresponding p-values in Table 7. If the p-value is greater than 0.05, then it means that we cannot reject the null hypothesis that the population average equals unity at the 5% significance level. If the p-value is less than or equal to 0.05, then it means that we reject this same null hypothesis at the 5% significance level: the average MPI differs from unity.

Table 7 T-test whether MPI and its components differ from unity
Table 8 Spearman rank correlations among components of output- and input-oriented PCU and average occupancy rate

Based on the results of Table 7, we can infer two conclusions. First, for the convex input- and output-oriented MPI and their components, the p value of the t-test are less than 0.05, except for part (i). This indicates that under convexity for both input- and output-oriented MPIs, the average MPIs, and their parts (ii) and (iii) differ from unity. Thus, there is obvious improvement or deterioration in productivity. Second, for the nonconvex input- and output-oriented MPI and their components, the p value of the t-test is bigger than 0.05, except for part (ii) of the input-oriented MPI. This indicates that under nonconvexity for both input- and output-oriented MPIs, the average MPI, and their components do not differ from unity. Thus, there is no obvious improvement or deterioration in productivity. Overall, it is interesting to observe that the precise conclusion seems to be very much conditioned on the convexity axiom.

In a first attempt to provide some external validation for the different PCU notions used in this contribution, we compare these with a more traditional non-frontier capacity utilisation measure in the hotel industry. The only relevant information available to us is the average occupancy rate. This average occupancy rate equals the number of rooms (nights) actually sold divided by the number of rooms (nights) available within the relevant time period at the provincial level. Table 8 contains the Spearman rank correlations between the average occupancy rate and the levels of the output- and input-oriented PCU notions under both convexity and nonconvexity.

The Spearman rank correlations between output- and input-oriented PCUs on the one hand and average occupancy rate on the other hand for each separate year are reported in the first nine rows of Table 8. In the two last lines we report the Spearman rank correlations when we consider on the one hand all observations over all years listed above and on the other hand an intertemporal frontier (amalgamating all observations over all years in one frontier). While the year by year frontiers are needed for the MPI and its decomposition, the intertemporal frontier is solely computed for the purpose of comparing the PCUs with the average occupancy rate.

Several conclusions emerge from looking at Table 8. First, a bit surprisingly the input-oriented PCU notion correlates slightly better than the output-oriented PCU notion with average occupancy rate. Second, the NC results have slightly lower rank correlations than the C results. Third, when we consider the union of all year-by-year frontier estimates, there is a significant relation among all PCU notions with the average occupancy rate. However, one can observe that the input-oriented plant capacity notion correlates better than the output-oriented plant capacity notion with average occupancy rate under both C and NC cases. Finally, when we consider the intertemporal frontier, there is a significant relation between average occupancy rate with all PCU notions, except the convex output-oriented PCU notion. Thus, the results show that the input-oriented PCU notion, similar to the year-by-year frontiers, correlates better than the output-oriented PCU notion with the average occupancy rate under both C and NC cases. Overall, this indicates that these PCU notions do have some substantial relation with a more traditional non-frontier capacity utilisation measure in the hotel industry.

5.4 Robustness check

To investigate the sensitivity of our findings, we perform a bootstrap on the four possible cases (i.e., input- vs. output-oriented combined with convex vs nonconvex). For the convex cases, we follow the procedure described in Simar and Wilson (1999), but suitably adapted to our short-run sub-vector efficiency measures and the accompanying MPI components mentioned in (23) and (25). Since these components are new, no existing bootstrapping software can be employed. Therefore, the bootstrap routines have been developed from scratch in Python.

For the technical details about the original procedure, we refer the interested reader to Simar and Wilson (1999) for all details. But, bootstrapping according to these authors can-roughly speaking—be summarized as follows. Using the variation present in the efficiency measures derived from the available observations combined with a kernel estimator based on the bivariate normal density function, create perturbed inputs (outputs) for the input-oriented (output-oriented) case. This is done a number of B times: we consider the customary \(B=2000\) used in a lot of other bootstrap studies. Since the data at hand consists of panel data, there are serial correlations that must be preserved when creating these perturbations. For each of these perturbed inputs and outputs, sequences of the required efficiency measures and related MPI components are then computed (i.e., the bootstrap sequences). Finally, the median is taken of these B bootstrap sequences and a 95% confidence interval can be obtained by using the percentile method among others (i.e., by considering the interval with boundary points the minimum and maximum of the ordered bootstrap sequence after cutting off 2.5% at both sides). Since this process has to be repeated for all observations and all time periods, it is rather computationally challenging.

Table 9 MPI deviations and confidence intervals

For the nonconvex cases, to the best of our knowledge no satisfying bootstrap methods have been described in the literature. Only Jeong & Simar (2006) develop a method to bootstrap nonconvex efficiencies, but it requires a type of Delaunay triangulation (i.e., the multidimensional counterpart of linear interpolation). Given the complexity of the procedure and the fact that it convexifies a nonconvex technology to some extent, we opt not to go down this road. But, simply applying the same strategies developed for the convex cases effectively does not yield meaningful results in the nonconvex cases either. The main obstacle is the lack of variation in the efficiency measures obtained from the original observations. In fact, for some of the available years, all observations are (sub-vector) efficient which leads to no variations at all and consequently a vanishing variance-covariance matrix.Footnote 6

To overcome this issue for our sample, we have opted for a pragmatic approach and we use the variations obtained in the convex models to create perturbed inputs and outputs from which the corresponding nonconvex efficiency measures are derived: MPI components can then be obtained. In terms of comparing convex and nonconvex results, this pragmatic approach makes sense since the perturbations in the underlying inputs and outputs are kept very similar so that possible differences in results can be attributed solely to differences in technology.

Table 9 reports the difference (\(\Delta \)MPI) between the geometric mean of the MPI estimates reported in Tables 2 and 4 on the one hand and the geometric mean of the MPI bootstrap estimates over all observations on the other hand. Table 9 also reports the average lower (MPI\(_{lb}\)) and upper (MPI\(_{ub}\)) bounds of the 95% confidence intervals obtained from the bootstrap results. For the other components, the results are presented in Table 11 in Appendix C.

For the output-oriented model under convexity, the bootstrapped MPI values are larger than the original MPI values with nearly all deviations being negative. For all other models, we observe an opposite result with most deviations being positive. This implies that the tourism productivity index as calculated by traditional method tends to be overestimated in most cases while the output-oriented model underestimates tourism performance under the convexity assumption. Based on the confidence intervals, it is apparent that the deviations between the upper and lower bounds are relatively small: this highlights the small scope for change in China’s regional tourism efficiency. Only the change in productivity in 2009–2010 is significantly different from unity under all assumptions. More precisely, for the output-oriented MPIs the improvement or deterioration in productivity is confirmed between 2008 and 2012 under the convexity assumption, but the same conclusion is supported only for 2008–2009 under the nonconvexity assumption. For the input-oriented MPIs, there is an obvious productivity deterioration between 2009–2010, but the improvement in MPI lasts only between 2012 and 2014 under the convexity assumption. By contrast, we observe an input-oriented MPI improvement between 2014 and 2015 under the nonconvexity assumption. The conclusions from these bootstrapping results confirm our main findings from the t-test in Table 7. The statistical significance of the Malmquist productivity indices varies according to our assumptions on the convexity axiom. The results show that the impact of the convexity assumption in performance measurement is non-negligible in empirical applications.

6 Conclusions

Starting from the seminal theoretical proposal to include an output-oriented plant capacity utilisation notion within an output-oriented MPI (De Borger and Kerstens, 2000), this contribution has made two new proposals: a first is to include an attainable output-oriented plant capacity utilisation concept within the output-oriented MPI, and a second is to integrate an input-oriented plant capacity utilisation measure within the input-oriented MPI.

Our empirical application on a balanced panel of Chinese hotels has served to empirically illustrate the above extended decompositions of the MPI. The final comparison of output- and input-oriented MPI has shown that there is some overall agreement on the same patterns of growth and decline, but that there also exist some substantial exceptions The same conclusions were found for the frontier change component (part iii), and for the plant capacity utilisation change (part ii). Overall, output- and input-oriented MPI as well as their decomposition partially measure similar things, but these MPI and components also measure things differently in their own right.

Being among the first in the tourism literature to test for the significance of the observed patterns of the MPI, we interestingly observe that the change in MPI and its components is mainly significantly different from unity under convexity, while it is mainly not significantly different from unity under nonconvexity. Furthermore, to the best of our knowledge we offer the first perspective on some external validation for the different PCU notions by comparing these with the average occupancy rate. Our empirical results reveal that the input-oriented PCU notion correlates better than the output-oriented PCU notion with the average occupancy rate under both C and NC. Thus, these PCU notions are somehow related to this more traditional non-frontier capacity utilisation measure in the hotel industry.

Avenues for eventual future research include the following. First, one could try to combine a graph-based Malmquist productivity index (see Zofío & Lovell, 2001) with a graph-based plant capacity notion (see Kerstens et al., 2020). Furthermore, it may be attractive to try to develop suitable plant capacity indicators that could be used to extend the existing decompositions of the Luenberger productivity indicator. In a similar vein, it may be useful to search how to include plant capacity components into the Hicks-Moorsteen TFP index and the Luenberger-Hicks-Moorsteen TFP indicator. Also an external validation for the different plant capacity notions discussed in this contribution remains a hot topic: our aggregate level provincial results should ideally be complemented with firm level results. As to the choice of fixed inputs, while we relied on economic tradition, from an empirical point of view it could be worthwhile to see how the number of employees as a second fixed input would affect our empirical results. Finally, one may consider a second stage regression of MPI change and its components on a series of explanatory variables when these would be available, or using lagged values to forecast future MPI values.