Introduction

The relations between efficient vectors of inputs and outputs for a given technology picture production functions, Shephard (1970), the characteristics of which predominantly are determined by their substitutionalities, and their returns to scale (RTS). The homogeneity of such a production function depends on the change, i.e., reaction of outputs with radial changes of inputs. Production theory distinguishes three forms of such reactions:

  • constant RTS,

  • decreasing RTS, and

  • increasing RTS.

There is a great variety of methods to estimate such functional dependencies, cf. Coelli et al. (2005).

Data envelopment analysis (DEA) as a nonparametric approach permits the approximation of the efficient boundary of technologies. This approximation takes place via gathered data of inputs and outputs for the so-called decision-making units (DMUs), that is, classical DEA; numerous theoretical papers and applications prove its value, see, for instance, Bashiri et al. (2013), Shokrollahpour et al. (2016) and Ziari (2016) or for a overwhelming survey Emrouznejad and Yang (2017).

Already in their pioneering work, the authors in Banker et al. (1984) also tackled the problem of the DMUs’ RTS. They proved the sign of a variable u in the so-called multiplier form of input-oriented DEA to indicate the RTS situation—rather than the RTS measure—of a DMU, and they restricted their analyses to efficient rather than inefficient units, only. Roughly speaking, we have constant/non-decreasing/non- increasing RTS iff optimal \(u =\)/\(\geqq\)/\(\leqq 0\). These results were generalized in Banker and Thrall (1992) or Sahoo et al. (2016) for efficient production points with variable u’s, i.e., non-unique RTS situations. The authors in Førsund (1996), Førsund and Hjalmarsson (2004), Førsund et al. (2007), and Fukuyama (2000) advanced the RTS context by models of neoclassical production theory and developed equations of scale elasticity for efficient and non-efficient DMUs; an overview and a discussion regarding the concept of RTS are provided in Tone and Sahoo (2003) or Jahanshahloo and Soleimani-Damaneh (2004). In the present paper, we give an elegant and comprehensible proof and an easy interpretation of such scale elasticity, for efficient and non-efficient DMUs. Generally speaking, there are several ways for classifying a DMU’s RTS situation in DEA—either envelopment or multiplier form driven ways. However, the RTS measure is based on an optimal solution of the multiplier form; in this sense, the above-mentioned authors provide equivalent representations of the respective formula for calculating such scale elasticities.

A DMU which is informed about its relative inefficiency wants to react by input reduction, output increase, or both: Classical DEA theory recommends to proceed against the boundary of technology; for some activity planning procedures cf. Du et al. (2010), Du and Liang (2012), Homayounfar et al. (2014), Zhang et al. (2015) and Tohidi and Khodadadi (2013). DEA software supports such instructions and helps the user to take respective actions. In all these calculations, scale elasticities are not taken into account. However, the additional knowledge of scale elasticities should encourage the DMUs also to make use of scaling effects. Consequently, if a DMU wants to maintain its BCC efficiency, then for increasing RTS it should upsize its production as any increasing inputs result in disproportionally higher increase of outputs. Decreasing RTS rather recommends downsizing due to lower disproportionality. Therefore, each DMU pursues two objectives: improvement of efficiency and upsizing or downsizing production. Mostly if not always the way to technology boundary is hard to realize. Labour law or social restrictions might forbid such rigid alterations. Is there a more convenient way towards the right production size and efficiency? It is, and the hitherto necessary information is available even for interior points of technology.

For the combination of efficiency and scaling improvement, we develop an (input-oriented) interactive algorithm, thus harmonizing scientific findings and managerial interests of the DMU under consideration. Stepwise it improves its efficiency and its scale size and hence runs through a path from its original activity towards mpss, if possible. The DMU’s awareness of inefficiency sometimes demands unrealistic input reduction. Rather it—the DMU—might communicate its disposition for a more reasonable reduction and the algorithm should respect this information.

Not always realizing the path from the actual activity of the DMU to most productive scale size (mpss) is an easy job. An impressive example for a vulnerable theatre scenery illustrates this issue.

The paper is organized as follows. In the second section, we present preliminaries of DEA. In the next section, two forms of activity changes are given, one maintaining BCC efficiency and one maintaining CCR efficiency. The following section is dedicated to situations of non-unique RTS. The next section provides the central topic of this paper: an interactive and iterative algorithm towards mpss. In the following section, an empirical example illustrates the new method. For 30 German theatres, efficiencies and RTS are calculated, and for a selected theatre, the proposed method is outlined. The last section contains a short resume and delineates prospects of further research.

Preliminaries

Koopmans’s activity analyses are the roots of DEA, cf. Koopmans (1951). Activities are processes which transform objects in other objects. If such objects are material or immaterial goods, such a process by definition is a production process, cf. Frisch (1965), p. 3. The set of all such processes is the production possibility set or technology \(\text{T}\), for short. The activity over a certain time period transforms the input \(\mathbf{x} \in \mathbb {R}^M_+\) into the output \(\mathbf{y} \in \mathbb {R}^S_+\) and thus characterizes the performance of a DMU, such as a project, a corporation, or even a non-profit utility. Once a technology is determined, DEA theory allows for the efficiency measurement of any activity. We restrict our attention to input-oriented efficiency measures and omit output orientation, cf. Banker et al. (1984), however. Partial inefficiencies which might occur with radial input reduction will not be considered here, either; the reader is referred to Charnes and Cooper (1984) or again Banker et al. (1984).

The authors in Charnes et al. (1978), following Debreu (1951), and Farell (1957) developed linear optimization problems (LOPs) measuring efficiency.

For each DMU, k solve

$$\begin{aligned}&\min \theta _k \nonumber \\ {\text{s.t.:}}\,&\theta _k \mathbf{x}_k - \sum \nolimits _j \lambda _{kj} \mathbf{x}_j \geqq \mathbf{0}\nonumber \\&\sum \nolimits _j \lambda _{kj} \mathbf{y}_j \geqq \mathbf{y}_k \nonumber \\ \,\text{with}&\,\lambda _{kj} \geqq 0 \ \forall j, \ \theta _k \ \text{free} \nonumber \\ {\text{and}}&\quad \quad {\hbox{---}} \quad \quad \quad {\text{for}} \quad {\text{constant returns to scale (CRS)}} \end{aligned}$$
(1.1)
$$\begin{aligned}&\quad\,\, \sum \nolimits _j \lambda _{kj} \ = \ 1 \quad {\text{for}} \quad {\text{variable returns to scale (VRS).}} \end{aligned}$$
(1.2)

(1.1) and (1.2) are called envelopment form of DEA. If the optimal \(\theta _k\) is equal to 1 DMU, k is efficient and inefficient, otherwise.

The dual problems of (1.1) and (1.2) are (2.1) and (2.2), again LOPs.

$$\begin{aligned}&\max \ g_k = \mathbf{u}_k^{\mathsf {T}} \mathbf{y}_k + {u}_k \nonumber \\ {\text{s.t.:}}\,&\mathbf{v}_k^{\mathsf {T}} \mathbf{x}_k = 1 \nonumber \\&\mathbf{u}_k^{\mathsf {T}}{} \mathbf{y}_j + {u}_k - \mathbf{v}_k^{\mathsf {T}}{} \mathbf{x}_j \leqq 0 \quad \forall j \nonumber \\&\mathbf{u}_k, \mathbf{v}_k \geqq \mathbf{0} \nonumber \\ {\text{and}}\,&{u}_k \ = \ 0 \quad \quad {\text{for CRS}} \end{aligned}$$
(2.1)
$$\begin{aligned}&\quad\quad{u}_k \ {} \text{free} \quad \quad \, \, {}{\text{for VRS.}} \end{aligned}$$
(2.2)

Such weights \(\mathbf{v}_k\) and \(\mathbf{u}_k\) of inputs and outputs often are called virtual prices. Neither are they preassigned nor market-based, they just are a suitable means for each DMU k to stress its own efficiency. This attitude is called ’self-appraisal’. Already in their pioneering work, Banker et al. (1984) demonstrate that for an efficient activity \((\mathbf{x}_k,\mathbf{y}_k)\), the optimal value of \({u}_k\) in (2.2) indicates constant, non-decreasing, or non-increasing RTS, though the authors refer to the sign of the respective variable \({u}_k\), only, and do not fully exploit the information from an optimal solution \(\mathbf{u}_k^{*}, \ \mathbf{v}_k^{*}, \ {u}_k^{*}, \ g_k^{*}\). More on that in the next section.

In the DEA literature, models (1.1), (2.1) and (1.2), (2.2) often are named by the acronyms CCR and BCC, due to their creators Charnes, Cooper, Rhodes and Banker, Charnes, Cooper, respectively. Whenever convenient, we follow such practice.

Activity changes

Activity change under constant BCC efficiency

Let \(\mathbf{u}_k^{*}, \ \mathbf{v}_k^{*}, \ {u}_k^{*}, \ g_k^{*}\) be an optimal solution of (2.2). Then, we have

$$\begin{aligned} \frac{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k+{{u}}_k^{*}}{\mathbf{v}_k^{*\mathsf {T}}{} \mathbf{x}_k} \ = \ g_k^{*}, \end{aligned}$$

and we assume the optimal solution to be unique, first, cf. Banker et al. (1984), p. 1086. Multiplying by the denominator and reordering terms yield the following equation:

$$\begin{aligned} {\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}_k}+{{u}_k^{*}}-{g_k^{*}}\; {\mathbf{v}_k^{*\mathsf {T}}{\mathbf{x}_k}} = 0. \end{aligned}$$
(3)

Theorem 1

For the equation \({\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}_k}+{{u}_k^{*}}-{g_k^{*}}\; {\mathbf{v}_k^{*\mathsf {T}}{\mathbf{x}_k}} = 0\) and a radial change \(\mathbf{x}_k \rightarrow {(1+ \delta )}{} \mathbf{x}_k\) and a radial change \(\mathbf{y}_k \rightarrow {(1+ \varepsilon _k)}{} \mathbf{y}_k\), the equation \({\mathbf{u}_k^{*\mathsf {T}}}{(1+ \varepsilon _k)}{\mathbf{y}_k}+{{u}_k^{*}}-{g_k^{*}}\; {\mathbf{v}_k^{*\mathsf {T}}}\) \({(1+ \delta )}\) \({\mathbf{x}_k} = 0\) holds iff

$$\begin{aligned} \varepsilon _k = \delta \ \frac{ \mathbf{u}_k^{*\mathsf {T}}\mathbf{y}_k+{{u}_k^{*}}}{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k}. \end{aligned}$$
(4)

Proof

$$\begin{aligned}&\quad\quad\,{\mathbf{u}_k^{*\mathsf {T}}}{(1+ \varepsilon _k)}{\mathbf{y}_k} + {{u}_k^{*}} - {g_k^{*}}\; {\mathbf{v}_k^{*\mathsf {T}}}{(1+ \delta )}{\mathbf{x}_k} = 0 \\&\Longleftrightarrow {(1+ \delta )} \underbrace{[{\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}_k} + {{u}_k^{*}} - {g_k^{*}}\; {\mathbf{v}_k^{*\mathsf {T}}}{\mathbf{x}_k}]}_{= \ 0} + {\mathbf{u}_k^{*\mathsf {T}}}{(1+ \varepsilon _k)}{\mathbf{y}_k} = {{\delta } \ {{u}_k^{*}} +{\mathbf{u}_k^{*\mathsf {T}}}{{(1+ \delta )}{} \mathbf{y}_k}} \\&\Longleftrightarrow \quad {\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}_k}+ {\mathbf{u}_k^{*\mathsf {T}}}{\varepsilon _k}{\mathbf{y}_k} = \phantom {0} {\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}_k}+{\delta } \begin{pmatrix} {u}_k^{*}+\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k \end{pmatrix} \\&\Longleftrightarrow \quad \delta \ \frac{ \mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k+{{u}_k^{*}}}{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k} = \varepsilon _k. \\ \end{aligned}$$

\(\square\)

Theorem 1 determines the necessary radial change of output with a radial change of input so as to maintain efficiency \(g_k^{*}\) constant. Equation (4) provides the respective relation between input/output changes. Please note that

$$\begin{aligned} \varepsilon _k = \delta \ \frac{ \mathbf{u}_k^{*\mathsf {T}}\mathbf{y}_k+{{u}_k^{*}}}{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k} = \delta \ \frac{g_k^{*}}{g_k^{*}-{{u}_k^{*}}}. \end{aligned}$$

Therefore, the comprehensible result of theorem 1 corresponds to the well-known scale elasticity measure of Førsund et al. (2007). From this equation and from Eq. (4), it is obvious which role \(u_k^*\) plays in this context.

Conclusion 1

  • \(|\varepsilon _k|= |\delta |\ \, {\text{for}} \ {u}_k^{*} = 0 ; \ \mathbf{y} \, \mathrm{changes \ radially \ to \ the \ same \ amount \ as} \ \mathbf{x}\) does \(\Longleftrightarrow\) constant RTS

    \(|\varepsilon _k|> |\delta |\ \mathrm{{{for}}} \ {u}_k^{*} > 0 ; \ \mathbf{y} \ \mathrm{changes \ radially \ to \ a \ greater \ amount \ than} \ \mathbf{x}\) does \(\Longleftrightarrow\) increasing RTS

    \(|\varepsilon _k|< |\delta |\ \mathrm{{{for}}} \ {u}_k^{*} < 0 ; \ \mathbf{y} \ \mathrm{changes \ radially \ to \ a \ smaller \ amount \ than }\ \mathbf{x}\) does \(\Longleftrightarrow\) decreasing RTS.

  • All statements are valid for an efficient (\(g_k^{*} = 1\)) as well as for an inefficient (\(g_k^{*} < 1\)) DMU k.

Perhaps, the second bullet point needs some attention. Equation (3) is valid at (in-)efficiency level \(g_k^{*}\); execution of input projection \(\overline{\mathbf{x}}_k =\,\! g_k^{*}\,{\mathbf{x}_k}\) results in

$$\begin{aligned} {\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}_k}+{{u}_k^{*}}-{\mathbf{v}_k^{*\mathsf {T}} \overline{\mathbf{x}}_k} = 0.\quad\quad (3^\prime ) \end{aligned}$$

RTS for this equation obviously is the same as for (3). If \(g_k^{*}<1\), (3) characterizes RTS (directly) for an interior point of technology, see also Dellnitz (2016), whereas \((3^\prime )\) determines the same RTS for the respective boundary point; see also, e.g., Fukuyama (2000). In the remainder of this contribution, we make use of this fact: moving in the interior of technology is possible without projection on the boundary!

The measure of RTS for an activity \((\mathbf{x}_k, \mathbf{y}_k)\) is \(\frac{ \mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k+{{u}_k^{*}}}{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k}\), rather than \({u}_k^{*}\). This measure is not only a function of \({u}_k^{*}\), but also of prices \(\mathbf{u}_k^{*}\) and outputs \(\mathbf{y}_k\). For equivalent equations, confer again Førsund et al. (2007). The authors in Podinovski et al. (2009) grabbed this question again and simplified mathematical derivations for either case, envelopment, and multiplier form, the so-called direct and indirect approach.

Next, we study Eq. (3) again, but with varying inputs and outputs rather than the fix activity of \(\text{DMU}\) k. The respective hyperplane in \(\mathbb {R}^{M+S}\) reads

$$\begin{aligned} {\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}}+{{u}_k^{*}}-{g_k^{*}}\, {\mathbf{v}_k^{*\mathsf {T}}{\mathbf{x}}} = 0. \end{aligned}$$
(5)

We call (5) \(\text{DMU}\) k’s hyperplane at (in-)efficiency level \(g_k^{*}\). The following generalization of theorem 1 is straightforward:

Corollary 1

For the equation \({\mathbf{u}_k^{*\mathsf {T}}}{\mathbf{y}}+{{u}_k^{*}}-{g_k^{*}}\, {\mathbf{v}_k^{*\mathsf {T}}{\mathbf{x}}} = 0\) and a radial change \(\mathbf{x} \rightarrow {(1+ \delta )}{} \mathbf{x}\) and a radial change \(\mathbf{y} \rightarrow {(1+ \varepsilon _k)}{} \mathbf{y}\), the equation \({\mathbf{u}_k^{*\mathsf {T}}}{(1+ \varepsilon _k)}{\mathbf{y}}+{{u}_k^{*}}-{g_k^{*}}\, {\mathbf{v}_k^{*\mathsf {T}}}\) \({(1+ \delta )}\) \({\mathbf{x}} = 0\) holds iff

$$\begin{aligned} \varepsilon _k = \delta \ \frac{ \mathbf{u}_k^{*\mathsf {T}}\mathbf{y}+{{u}_k^{*}}}{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}}. \end{aligned}$$
(6)

Moving on such a hyperplane means moving under constant BCC efficiency. As soon as input projection of the virtual activity \((\mathbf{x}, \mathbf{y})\) falls on another facet of technology than that of \((\mathbf{x}_k, \mathbf{y}_k)\), efficiency will change, of course. More on that in Sect. “Improving scale size and efficiency: an interactive approach” or for calculating efficiency stability regions, see, for example, Zamani and Borzouei (2016).

Activity change under constant CCR efficiency

Starting from (1.2), we now study simultaneous activity change for DMU k from \(\mathbf{x}_k\) to \(\mathbf{x}_k/r\) and from \(\mathbf{y}_k\) to \(\mathbf{y}_k/r\). This transformation and multiplication by \(r>0\) results in the following equation:

$$\begin{aligned}&\min \theta _k \nonumber \\ \text{s.t.:}\,&\theta _k \mathbf{x}_k - \sum _j \nu _{kj} \cdot r \cdot \mathbf{x}_j \geqq \mathbf{0} \nonumber \\&\sum _j \nu _{kj} \cdot r \cdot \mathbf{y}_j \geqq \mathbf{y}_k \nonumber \\&\sum _j \nu _{kj} = 1\nonumber \\&\nu _{kj} \geqq 0 \ \forall j, \ \theta _k \ \text{free}, \end{aligned}$$
(7)

and with \(\lambda ^{\prime }_{kj} = \nu _{kj} \cdot r\), we get the linear program (8) with parameter r on the right-hand side:

$$\begin{aligned}&\min \theta _k \nonumber \\ \text{s.t.:}&\,\theta _k \mathbf{x}_k - \sum _j \lambda ^{\prime }_{kj} \cdot \mathbf{x}_j \geqq \mathbf{0} \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \nonumber \\&\sum _j \lambda ^{\prime }_{kj} \cdot \mathbf{y}_j \geqq \mathbf{y}_k \nonumber \\&\sum _j \lambda ^{\prime }_{kj} = r\nonumber \\&\lambda ^{\prime }_{kj} \geqq 0 \ \forall j, \ \theta _k \ \text{free}. \end{aligned}$$
(8)

Interesting enough, the parametrization of activity \((\mathbf{x}_k, \mathbf{y}_k)\) by \((\mathbf{x}_k/r, \mathbf{y}_k/r)\) is equivalent to reciprocal sensitivity analysis of the right-hand side of the convexity restriction in (1.2); let \([r^{-},r^{+}]\) be the stability region. Activity change beyond these limits would make the input projection of \((\mathbf{x}_k/r, \mathbf{y}_k/r)\) fall off the actual facet of technology. Running on a trajectory \((\mathbf{x}_k, \mathbf{y}_k)/r\) lets DMU k’s CCR efficiency unchanged, of course. Note that \(r > 1\) implies in downsizing and \(r < 1\) in upsizing of DMU k’s activity.

In this section, two forms of activity changes were proposed:

  • maintaining BCC efficiency in Sect. “Activity change under constant BCC efficiency” and

  • maintaining CCR efficiency in Sect. “Activity change under constant CCR efficiency”.

How to combine these transformations for creating a path towards mpss, i.e., improving BCC efficiency and CCR efficiency or productivity, respectively, is topic of the algorithmic interactive approach in Sect. “Improving scale size and efficiency: an interactive approach”.

Before doing so, however, we need some statements on non-unique RTS. Sometimes a DMU must decide under aggravated conditions whether it should expand or reduce activity. In those cases, non-unique RTS play a decisive role.

Non-unique RTS

The authors in Banker and Thrall (1992), Golany and Yu (1997) study the consequences of a non-unique solution \({\mathbf{u}_k^{*}}\), \({\mathbf{v}_k^{*}}\), \({{u}_k^{*}}\), \({g_k^{*}}\) of problem (2.2), and specially focus on the non-uniqueness of \({{u}_k^{*}}\). The former authors seek for geometrical characterizations of efficiency points and then calculate intervals for \({{u}_k^{*}}\), cf. Banker and Thrall (1992), p. 82 equations (7) and (8). We follow their reasoning and generalize. Solve again

$$\begin{aligned}&{{g}}_k^{*} = \max g_k = \mathbf{u}_k^{\mathsf {T}} \mathbf{y}_k + {u}_k\\ {\text{s.t.:}}\,&\mathbf{v}_k^{\mathsf {T}} \mathbf{x}_k = 1 \\&\mathbf{u}_k^{\mathsf {T}}{} \mathbf{y}_j + {u}_k - \mathbf{v}_k^{\mathsf {T}}{} \mathbf{x}_j \leqq 0\quad \forall j \\&\mathbf{u}_k, \mathbf{v}_k \geqq \mathbf{0} \quad {\text{and}} \ {u}_k \ {\text{free.}} \end{aligned}$$

Apply \(g_k^{*}\) and solve (9), cf. Mardani Shahrbabak and Noura (2011):

$$\begin{aligned}&{{u}}_k^{+}=\sup \ {{u}}_k \ {\text{and}} \ {{u}}_k^{-\phantom{.}} = \inf \ {{u}}_k \nonumber \\ \text{s.t.:}\,&\mathbf{v}_k^{\mathsf {T}} \mathbf{x}_k = 1 \nonumber \\&\mathbf{u}_k^{\mathsf{T}}{} \mathbf{y}_j + {u}_k - \mathbf{v}_k^{\mathsf {T}}{} \mathbf{x}_j \leqq 0\quad \forall j \quad \nonumber \\&\mathbf{u}_k^{\mathsf {T}} \mathbf{y}_k + {u}_k = {{g}}_k^{*} \nonumber \\&\mathbf{u}_k, \mathbf{v}_k \geqq \mathbf{0} \quad {\text{and}} \ {u}_k \ {\text{free.}} \end{aligned}$$
(9)

Let \({\mathbf{u}_{k}^{+}}\), \(\mathbf{v}_{k}^{+}\), \({{u}_{k}^{+}}\) and \({\mathbf{u}_{k}^{-\phantom{.}}}\), \(\mathbf{v}_{k}^{-\phantom{.}}\), \({{u}_{k}^{-\phantom{.}}}\) be the corresponding optimal solutions of (9). Equation (4) of Corollary 1 allows for the following generalization:

$$\begin{aligned} \varepsilon _k^{+} = \delta \ \frac{\mathbf{u}_{k}^{+\mathsf {T}}\mathbf{y}_k+{{u}_{k}^{+}}}{\mathbf{u}_{k}^{+\mathsf {T}}{} \mathbf{y}_k} \ \text{and} \ \varepsilon _k^{-\phantom{.}} = \delta \ \frac{\mathbf{u}_{k}^{-\mathsf {T}}\mathbf{y}_k+{{u}}_{k}^{-\phantom{.}}}{\mathbf{u}_{k}^{-\mathsf {T}}{} \mathbf{y}_k}. \end{aligned}$$
(10)

The numerator in (10) in each case equals \(g_k^{*}\), but is composed differently. The respective changes from (\(\mathbf{x}_k, \mathbf{y}_k\)) to \((1+\delta ) \mathbf{x}_k\) and \((1+ \varepsilon _k^{\pm })\mathbf{y}_k\) must be feasible in \(\text{T}\), of course.

Conclusion 2

If \(0 < {u}_{k}^{-\phantom{.}} \leqq {u}_{k} \leqq {u}_{k}^{+}\), then for all \({u}_k\) increasing RTS prevail at (\(\mathbf{x}_k, \mathbf{y}_k\) ); if \({u}_{k}^{-\phantom{.}} \leqq {u}_{k} \leqq {u}_k^{+}<\) 0, then decreasing RTS. The case \({u}_{k}^{-\phantom{.}} \leqq 0 \leqq {u}_{k}^{+}\) yields decreasing, increasing and specially constant RTS. This property remains valid for all activities \((\mathbf{x}, \mathbf{y}) \in \text{T}\) located on the hyperplane at efficiency level \(g_k^{*}\) like in (5).

Example

For the activities of 7 DMUs, we have the data in Table 1. Table 2 contains the optimal weights, efficiencies, and RTS of the DMUs; the shaded area in Fig. 1 shows the respective technology. In addition, Fig. 1 illustrates non-unique RTS.

Table 1 Inputs/outputs of seven DMUs
Table 2 Efficiency, optimal weights, and RTS of seven DMUs
Fig. 1
figure 1

Family of hyperplanes for DMU 1 and hyperplane of DMU 6 at efficiency \(\frac{3}{4}\)

It shows that DMU 1 under model (2.2) performs with efficiency \(g_1^{*} = 1\). Problem (9) yields \({u}^{{-\phantom{.}}}_1 = \frac{7}{8}\) and \({u}^{+}_1 = 1\). For the equation \(1 = \frac{\mathbf{u}_{1} \cdot {1}+{{u}_{1}}}{\mathbf{v}_{1} \cdot {2}}\) or \({\mathbf{u}_{1} \cdot {1}+{{u}_{1}}} - {\mathbf{v}_{1} \cdot {2}} = 0\), respectively, we consider the solutions

$$\begin{aligned}&{u}_{1} = \frac{7}{8},\quad \mathbf{u}_{1} = \frac{1}{8},\quad \mathbf{v}_{1} = \frac{1}{2}; \\&{u}_{1} = 1,\quad \, \, \mathbf{u}_{1} = 0,\quad \, \, \mathbf{v}_{1} = \frac{1}{2}; \\&{u}_{1} = 0,\quad \, \, \mathbf{u}_{1} = 1,\quad \, \, \mathbf{v}_{1} = \frac{1}{2}. \end{aligned}$$

The second case is degenerated and the third one violates the inequalities in (9) for DMU 2 to DMU 5. The only hyperplane which permits activity changes, feasible in \(\text{T}\), is the one with \({u}_{1} = \frac{7}{8}\). It contains DMU 2, ditto with \({g}^*_{2} = 1\).

DMU 6 under model (2.2) performs with efficiency \(g_6^{*} = \frac{3}{4}\). Problem (9) yields a unique \({u}^-_{6} = {u}^+_{6} = \frac{7}{12}\). The dashed line captioned with \({u}_{6} = \frac{7}{12}\) in Fig. 1 shows the hyperplane of DMU 6 at efficiency level \(g_6^{*}\). From equation \({g}^*_{6} = \frac{\mathbf{u}_{6} \cdot {2}+{{u}_{6}}}{\mathbf{v}_{6} \cdot {3}}\) or \({\mathbf{u}_{6} \cdot {2}+{{u}_{6}}} - \frac{3}{4} \cdot {\mathbf{v}_{6} \cdot {3}} = 0\), respectively, we get: \(\mathbf{u}_{6} = \frac{1}{12}\), \(\mathbf{v}_{6} = \frac{1}{3}\). The hyperplane of DMU 6 which is illustrated in Fig. 1 is the equation \({\frac{1}{12}\mathbf{y} + \frac{7}{12} - \frac{3}{4} \cdot \frac{1}{3}{} \mathbf{x}} = 0\). Note that the slope of this equation is 3—whereas with \(0.1 \cdot \frac{1/12 \cdot {2}+7/12}{1/12 \cdot {2}} = 0.45\), the RTS is 45% for a 10% increase. DMU 6 operates under strictly increasing RTS. \(\diamond\)

Improving scale size and efficiency: an interactive approach

From Eqs. (2.2) and (4), DMU k calculates its relative efficiency and its RTS. Even if the respective activity is an interior point of technology, the DMU knows the exact rate of radial output/input change to keep BCC efficiency constant, cf. Eq. (3\(^{\prime }\)) and the subsequent text. But what must be done to improve productivity and simultaneously make the right scaling decision? After all, Eq. (4) is a recommendation of upsizing or downsizing production, see Conclusion 1. However, how far should this production sizing go. Banker (1984) formulated the concept of mpss activity. An activity has most productive scale size, when CCR and BCC efficiency coincide and are equal to 1. The author showed a way for each DMU how to reach this goal in one step without taking interactive communication with a DMU into account.

Here, the situation is different: How can even an inefficient DMU find its way stepwise to mpss avoiding this lack? Before giving a general answer to this question, we study the example again.

Example

(continued).

First, we focus on DMU 5. It might pursue two objectives

  • downsize activities due to decreasing RTS; as \({u}^+_5 = {u}^{-\phantom{.}}_5 = -\frac{11}{16} < 0\),

  • improve efficiency from actual \({{g}}_5^{*} = \frac{13}{16}\).

The two arrows in Fig. 2 give an idea of possible activity changes.

Fig. 2
figure 2

Upsizing and downsizing in the BCC technology of six DMUs

If DMU 5 runs on the dotted line, it realizes downsizing on its (in-)efficiency hyperplane, i.e., with constant BCC efficiency. It should stop when its input projection falls on another facet of technology and check its situation with respect to new weights.

So far downsizing; and next the goal efficiency improvement. We propose an interactive step. DMU 5 must find out its input reduction potential: What is the minimal input portion just to meet the benchmark \(\tilde{\mathbf{y}}_5\)? Choose a number \(g_5^* \leqq \text{f}_5 \leqq 1\), such that \(\hat{\mathbf{x}}_5 = \tilde{\mathbf{x}}_5 \cdot \text{f}_5\) suffices to produce \(\tilde{\mathbf{y}}_5\). Make (\(\hat{\mathbf{x}}_5, \hat{\mathbf{y}}_5 = \tilde{\mathbf{y}}_5\)) the new activity. For \(\text{f}_5 = g_5^*\), DMU 5 becomes efficient, and for \(\text{f}_5 = 1\), it has no input reduction potential at all. For \(g_5^*< \text{f}_5 < 1\), it remains inefficient, but improves efficiency from \(g_5^*\) to \(\frac{g_5^*}{\text{f}_5}\). In Fig. 2, \(\text{f}_5 = g_5^*\) even makes it a mpss.

Now, consider DMU 6. A first iteration step concerning upsizing yields input \(\tilde{\mathbf{x}}_6 = \frac{10}{3}\) and output \(\tilde{\mathbf{y}}_6 = 3\). For an exemplary demonstration, we assume a radial input reduction factor \(\text{f}_6 = \frac{17}{20}\) resulting in \(\hat{\mathbf{x}}_6 = \tilde{\mathbf{x}}_6 \cdot \frac{17}{20} = \frac{17}{6}\) and \(\hat{\mathbf{y}}_6 = \tilde{\mathbf{y}}_6 = 3\). This activity is a start point for a second iteration. A second upsizing results in \(\tilde{\mathbf{x}}_6 = \frac{17}{3}\) and \(\tilde{\mathbf{y}}_6 = 7\), and a second reduction step with \(\text{f}_6 = \frac{15}{17}\) yields (\(\hat{\mathbf{x}}_6 = 5, \hat{\mathbf{y}}_6\) = 7). In addition, this makes DMU 6 mpss.

Whether or not DMU 6 finally reaches mpss obviously depends on its readiness for respective input reductions. In this case, the iterative procedure results in scale efficiency 1 and even mpss.\(\diamond\)

Running on BCC efficiency hyperplanes has two flaws:

  1. 1.

    \(\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k\) might be 0 and this makes \(\frac{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k + {u}^{*}_k}{\mathbf{u}_k^{*\mathsf {T}}{} \mathbf{y}_k}\) indefinite, cf. Theorem 1. Consider DMU 7: Its input projection falls on the vertical facet with \(\mathbf{u}^{*}_7 = 0\). Consequently, RTS becomes indefinite, and hence, Eq. (6) is undefined. In such cases, the ideal path on BCC-(in-)efficiency hyperplanes, as was demonstrated for DMUs 5 and 6, is blocked.

  2. 2.

    Assume that DMU 7—besides the above-mentioned problem—succeeded in realizing activity (3, 1), see Fig. 1. Its efficiency remains \(\frac{2}{3}\), but now, Eq. (6) permits different weight systems for this efficiency. Which of these weight systems DMU 7 should select for further activity change remains an open question. In addition, this problem worsens, the more BCC efficiency hyperplanes pass through (3, 1). For high-dimensional DEA, this causes a severe problem.

We overcome these flaws by a three-step procedure.

  1. 1.

    Solve (1.2) with optimal value of objective function \(\theta _k^{*}\).

  2. 2.

    Parameterize the 1 in the convexity restriction of (1.2). Let \([r^{-},r^{+}]\) be the range of this sensitivity analysis. Make \((\hat{\mathbf{x}}_k, \hat{\mathbf{y}}_k)^{\pm } = (\mathbf{x}_k, \mathbf{y}_k)/r^{\pm }\) like in (8) and respective comments. Mind the fact that the scaling direction, \(r^{-}\) or \(r^{+}\), depends on the RTS, including \(\infty\) and 0, cf. Table 2.

  3. 3.

    Solve (1.2) for \((\hat{\mathbf{x}}_k, \hat{\mathbf{y}}_k)\) with optimal efficiency \(\hat{\theta^*_k }\). Make \((\tilde{\mathbf{x}}_k, \tilde{\mathbf{y}}_k) = (\hat{\mathbf{x}}_k \cdot {\hat{\theta^*_k }}/{{\theta _k^{*}}}, \hat{\mathbf{y}}_k)\). This step traces \((\hat{\mathbf{x}}_k, \hat{\mathbf{y}}_k)\) back to BCC-(in-)efficiency hyperplane of DMU k at level \({\theta }_k^{*}\)!

The following algorithm formalizes our explanations so far; for the stop criterion, we need both, the optimal CCR and BCC efficiency: \(g_k^{**}\) and \(g_k^{*}\).

figure a

To make the steps of the algorithm more transparent, we illustrate the first two iteration steps of DMU 7 in Fig. 3. DMU 7 is of particular interest, because its input projection has an infinite scale elasticity. Therefore, we apply an activity change under constant CCR efficiency like in Sect. “Activity change under constant CCR efficiency” (step 12). Then, step 14 traces back this activity to the prior BCC efficiency. Step 15 reflects the DMU’s disposition for input reduction. The continuing process of activity changes as indicated in Fig. 3 follows the same reasoning.

Fig. 3
figure 3

Two steps of the proposed algorithm

This algorithm is only one possible stepwise approximation of mpss, of course. So the order of scale-sizing and efficiency improvement might change, so up- and downsizing might be decomposed in smaller layers, etc. However, each specification of such an algorithm must comprise

  • up- or downsizing with constant efficiency,

  • interactive efficiency improvement,

to pursuit the DMU’s two objectives.

So far the algorithm. How a DMU in the medium or in the long term can realize such alterations of its activities is due to its economical environment and its change management. These questions are beyond of the scope of this paper.

Interactive improvement of activities in theatres

A rich theatre scenery is considered the basis of a broad cultural supply for the society, worldwide. The German theatre landscape follows this tradition. Therefore, in the season 2013/14, 142 public theatres attracted 21 million visitors offering more than 67,000 events.

Nevertheless, in times of budgets in deficit in almost all countries, the cost performance ratio of such activities is a major challenge for local authorities and politicians. There are some studies about this question, in which DEA plays a decisive role, cf. Tobias (2003), Marco-Serrano (2006), Kleine and Hoffmann (2013). What are desired outputs of a theatre, what the respective inputs, and for which homogenous subgroup of all theatres does such an efficiency measurement make sense. For our purpose, we consider two inputs:

  • number of seats

  • personnel expenses (million euro)

and three outputs: number of

  • events

  • productions

  • visitors

as the database for DEA among 30 so-called three division theatres which offer drama, music, and dance performances. In Table 3, we show inputs and outputs for the season 2013/14—data from the German Stage Association, Deutscher Bühnenverein (2013)—together with respective CCR and BCC efficiencies plus RTS characteristics—IRS, CRS, and DRS—for these 30 theatres. All findings are results of standard DEA software. Note that each theatre is labelled by its city code.

Table 3 Inputs/outputs, efficiencies, and RTS of 30 theatres

We then choose a special member of the group, namely the theatre of the town of Oldenburg and submit it to the process explained in foregoing sections. Theatre OL starts with 2258 seats, 20.646 mio. personnel expenses, and realizes 734, 54, and 179,742 units of the three outputs. Its relative CCR efficiency is 0.789 and its BCC efficiency amounts to 0.816. For the optimal price system, we have \(u_{\text{OL}}^\pm = -0.07\), thus plighting under-proportional loss of outputs when reducing inputs. Furthermore, we presume an input reduction factor \(f_{\text{OL}} = 0.98\). With these parameters, Oldenburg theatre from planning period to planning period would run through input/output sceneries, and CCR-, BCC-, and scale efficiencies (SE) like in Table 4 (rounded values).

Table 4 Interactive iterations for DMU OL

From this table, we learn that the reduction process of activities for Oldenburg’s public would be very painful. First, reducing activities stepwise by \(r^+ = 1.0703\), 1.0773, 1.0856, and 1.0956 plus input reduction of \((1-0.98) \times 100 = 2\)% in each step means an irresponsible sellout of cultural quality in town. And such a sellout very likely will cause an angry protest in Oldenburg’s population. Even worse, this reduction does not make OL theatre efficient at all. All together, this strenuous effort over 4 planning periods results in 88.45% CCR and BCC efficiency and a 100% scale efficiency. However, mpss is still far away....

Data envelopment analysis is a suitable instrument for a DMU to detect its weaknesses and its improvement potentials. Whether or not a DMU can realize such findings depends on the surrounding conditions and the DMU’s change management, however. Whether or not a theatre like the one in Oldenburg will follow recommendations to reduce activities is by no means a DEA question but rather a political issue. We hope that the audience in this town will have many years of vivid sensations with its theatre.

Conclusion and the road ahead

In this contribution, radial returns to scale are measured for all DMUs along their respective (in-)efficiency hyperplanes. This measure involves not only the variable u but also outputs and their corresponding virtual prices. The measure is valid for efficient and inefficient activities and can be applied even to cases with non-unique u’s. In other words: even for an interior point of the technology, unique and non-unique returns are measurable without any projection upon the technology boundary. This measure for each DMU is a handy instrument to evaluate consequences of radial upsizing and downsizing. Each time, such upsizing or downsizing is realized, the DMU might again check its efficiency and its returns to scale and take action to improve its situation. In this paper, we propose an algorithm to support DMUs in finding an economically reliable path towards mpss. This algorithm interactively communicates with the decision-maker to avoid unrealistic steps of activity improvements. In an application, such steps might be modified due to environmental conditions. These modifications are a worthwhile focus for future research.

Cross-efficiencies are considered an interesting approach to evaluate DMUs’ efficiencies from the point of view of other DMUs. Whenever a supervising institution dismisses self-appraisal as a valid concept, crosswise evaluations might help to find a peer, a weight system acceptable for all DMUs. Earlier and recent DEA literature reports on such peer-appraisal concepts confer, e.g., Doyle and Green (1994), Rödder and Reucher (2012). Once such a concept is accepted and a peer is selected, his weight system not only appraises all DMUs efficiencies but becomes the transfer price system of the whole group. Do there exist cross-RTS similar to cross-efficiencies and which consequences do such cross-RTS have upon a DMUs scale-sizing. Such questions could be the issue of further research.