1 Introduction

Data envelopment analysis (DEA) is a non-parameter technique for measuring the relative efficiency of a set of peer decision-making units (DMUs) with multiple inputs and outputs (Charnes et al, 1978). A weakness of traditional DEA models is that it assumes that all the inputs and outputs are non-negative. However, negative values, especially those of outputs, could exist in many situations. For example, the expected return is generally treated as an output measure for estimating the efficiency of mutual fund, which might be negative for some mutual funds. Likewise, the profit, which is generally chosen as an output for measuring the efficiency of projects, might be negative for some projects. Hence, it is necessary to improve the DEA model to expand its application.

There are several approaches to deal with negative data in DEA models. The simplest method is to treat negative inputs (outputs) as outputs (inputs). If inputs (outputs) are all non-positive, their absolute values can be treated as non-negative outputs (inputs), so that non-positive inputs (outputs) will decrease (increase) when those corresponding non-negative outputs (inputs) expand (Scheel, 2001). However, this method is not applicable if there is some input or output with both positive and negative values. Another approach to handle negative data is to utilize the “translation invariance” property. A DEA model is translation invariant if the translated input–output data yield the same results as the original data. The variable returns to scale (VRS) additive DEA models (Charnes et al, 1984) are translation invariant (Ali and Seiford, 1990; Lovell and Pastor, 1995; Pastor, 1996), but they yield the "furthest" target on the production frontier for inefficient DMUs (Portela et al, 2004) and cannot provide any measure of efficiency. The output-oriented BCC model (Banker et al, 1984) is input translation invariant, and the input-oriented BCC model is output translation invariant. These two kinds of BCC models cannot be applied to the situation where negative values exist in both inputs and outputs. Based on a modified directional distance function (DDF) (Chambers et al, 1996), Portela et al (2004) develop a range directional measure (RDM) model, which can deal with inputs and/or outputs taking positive values for some DMUs and negative values for the others. However, the RDM model may be unbounded when the evaluated DMU has the maximum values for all the outputs and the minimum values for all the inputs (Cheng et al, 2013). Inspired by the RDM model, Sharp et al (2007) introduce a modified slack-based measure, which can deal with negative inputs and/or negative inputs. Emrouznejad et al (2010) propose a semi-oriented radial measure (SORM) to handle negative input–output data. Kerstens and Woestyne (2011) recommend a generalized Farrell proportional distance function that handles negative data and maintains a proportional interpretation under mild conditions. Cheng et al (2013) find that the SORM model might lead to worse target inputs or outputs than the original ones for inefficient DMUs. They develop a variant of the traditional radial model where original values are replaced with their absolute values. Some imprecisions in Cheng et al (2013) are corrected by Kerstens and Woestyne (2014).

A limitation of the above DEA models is that they cannot further discriminate efficient DMUs, all of which have an efficiency score of unity. Andersen and Petersen (1993) develop a super-efficiency DEA model, see also Banker et al (1989), which can rank efficient DMUs. The input-oriented (output-oriented) super-efficiency DEA model excludes the DMU under evaluation from the reference set so that efficient DMUs may have efficiency scores larger (smaller) than or equal to one. The original super-efficiency DEA model is introduced under the condition of constant returns to scale (CRS) and is feasible if all inputs and outputs of DMUs are positive. However, the infeasibility issue might occur in VRS super-efficiency DEA models (Seiford and Zhu, 1999).

Many modified VRS radial super-efficiency DEA models (Chen, 2005; Ray, 2008; Cook et al, 2009; Lee et al, 2011) have been proposed to address the infeasibility issue. Among them, the VRS Nerlove–Luenberger super-efficiency DEA model (Ray, 2008) is based on the DDF and is very often feasible under the non-negative data set. However, this model fails in two exceptions (see Ray, 2008 for details). By choosing proper directions, Chen et al (2013) and Lin and Chen (2015) propose two DDF-based VRS super-efficiency DEA models to eliminate the infeasibility in two exceptions. The model in Chen et al (2013) may be infeasible if zero data exist in outputs (Lin and Chen, 2015). All these modified super-efficiency DEA models are proposed for the non-negative data. In the situations where there exist negative inputs or outputs, the infeasibility issue still exists. Based on the RDM model (Portela et al, 2004), Hadi-Vencheh and Esmaeilzadeh (2013) propose two super-efficiency models in the presence of negative data, called the super RDM+ model and the super RDM—model, respectively. Both models can rank efficient DMUs, but they are still infeasible in some cases. We will illustrate this point in Section 2.

In this paper, we propose a novel DDF-based VRS radial super-efficiency DEA model which is feasible and is able to handle negative data. There are at least five contributions in this article.

  1. 1.

    By choosing a proper direction for the DDF, we propose an alternative VRS radial super-efficiency DEA model. The proposed model not only successfully addresses the infeasibility problem in VRS radial super-efficiency DEA models, but also extends the application of the super-efficiency measure to negative data.

  2. 2.

    The proposed model projects each DMU onto the super-efficiency frontier along a suitable direction and provides improved targets for inefficient DMUs.

  3. 3.

    The proposed model yields a bounded measure of super-efficiency.

  4. 4.

    In the situation where outputs are all non-negative, the proposed model always generates reference points with non-negative outputs.

  5. 5.

    The proposed model is monotonous, units invariant and output translation invariant.

The rest of the paper is organized as follows. Section 2 presents the DDF and some existing directions, whose limitations are illustrated through a numerical example. Section 3 proposes a modified DDF, and based on it, we develop a new VRS radial super-efficiency DEA model capable of dealing with negative data, whose useful properties are also investigated in this section. In Section 4, the proposed model is applied to the numerical example in Section 2 and a data set from the literature, respectively, in order to demonstrate its properties and merits. Conclusions are presented in the last section.

2 DDF-based super-efficiency and directions

Assume that there are n DMUs, each DMU has m inputs and s outputs, and each of inputs and outputs has at least one nonzero value. For each DMU \(j \ (j=1,\ldots ,n)\), let \(x_{ij}\ (i=1,\ldots ,m)\) denote the ith input and \(y_{rj}\ (r=1,\ldots ,s)\) denote the rth output. Under the standard assumptions of convexity and free disposability of inputs and outputs (Chen et al, 2013), the production possibility set (PPS) for a target DMU \(p\ (p\in \{1,\ldots ,n\})\) with respect to super-efficiency is spanned by \((x_{ij},y_{rj}),\) \(j=1,\ldots ,n,\ j\ne p,\) as follows.

$$ \begin{aligned} & T_{p} = \left\{ {(x_{i} ,y_{r} )|x_{i} \ge \sum\limits_{{j = 1,j \ne p}}^{n} {\lambda _{j} x_{{ij}} } ,\quad i = 1, \ldots ,m;} \right. \\ & \quad y_{r} \le \sum\limits_{{j = 1,j \ne p}}^{n} {\lambda _{j} y_{{rj}} } ,\quad r = 1, \ldots ,s; \\ & \quad \left. {\sum\limits_{{j = 1,j \ne p}}^{n} {\lambda _{j} } = 1,\quad \lambda _{j} \ge 0,\quad j = 1, \ldots ,n,\quad j \ne p} \right\} \\ \end{aligned} $$

Choosing a direction vector \((g^{x},g^{y})\), the directional distance function (DDF) for DMUp with respect to \(T_{p}\) is defined as:

$$ D(x_{ip},y_{rp};g^{x},g^{y})=\max \beta _{p}{:}(x_{ip}-\beta _{p} g^{x},y_{rp}+\beta _{p} g^{y}) \in T_{p}.$$

Then, the following general DDF-based super-efficiency DEA model can be established.

$$ \max\quad \beta _{p} $$
(1)
$$ \mathrm{s.t.}\quad \sum _{j=1, j\ne p}^{n}\lambda _{j}x_{ij} \le x_{ip}-\beta _{p} g^{x},\quad i=1,\ldots ,m{,}$$
(2)
$$\sum _{j=1, j\ne p}^{n}\lambda _{j}y_{rj}\ge y_{rp}+\beta _{p} g^{y},\quad r=1,\ldots ,s, $$
(3)
$$\sum _{j=1, j\ne p}^{n}\lambda _{j}=1,\ \lambda _{j}\ge 0,\quad j=1,\ldots ,n,\ j\ne p{.}$$
(4)

Denote the optimum value of model (1)–(4) as \(\beta _{p}^{o}\). The super-efficiency score of the evaluated DMUp can be determined as \(1-\beta _{p}^{o}\) (Ray, 2008). The smaller the value of \(\beta _{p}^{o}\), the more efficient the DMUp. For any efficient DMUp, \(1-\beta _{p}^{o}\) is no less than 1.0.

The direction vector \((g^{x},g^{y})\) should be non-negative and nonzero and can be chosen in an arbitrary way (Chen et al, 2013; Ray, 2008). Briec and Kerstens (2009a) point out that model (1)–(4) cannot guarantee the feasibility if the direction is a constant vector and the output direction vector is nonzero. Hence, \(g^{x}\) and \(g^{y}\) are often considered as the function of \(x_{ip}\) and \(y_{rp}\). If all input and output data are non-negative, the standard DDF for the DMUp is adopted by choosing \((x_{ip},y_{rp})\) as \((g^{x},g^{y})\) (Chambers et al, 1996), and the VRS Nerlove–Luenberger super-efficiency DEA model (Ray, 2008) (called the NL model for short) is obtained. The NL model is very often feasible for non-negative data, but it fails in the following two exceptions (Ray, 2008):

  1. 1.

    When a super-efficiency score is greater than 2.0, the NL model will yield a reference point with negative outputs. In applications where the outputs should be non-negative, such as the performance evaluation of airlines (Ray, 2008), a reference point with negative outputs results in a conceptual problem.

  2. 2.

    If a zero input exists in the evaluated DMU and all other DMUs in the reference set have positive values in that input, the NL model becomes infeasible.

In the context of the definition of a Luenberger productivity indicator, Briec and Kerstens (2009b) avoid reference points with negative outputs by adding a constraint that the projected output should remain positive. However, this method still cannot solve the infeasibility issue in the second exception. To eliminate the infeasibility in both the exceptions, Lin and Chen (2015) recently have put forward a modified DDF-based super-efficiency DEA model (called the LC model for short) by choosing \((x_{ip}+{\max}_{\mathop{j=1,\ldots ,n}\limits_{j\ne p}} \{x_{ij}\},y_{rp})\) as \((g^{x},g^{y})\). The LC model successfully addresses the infeasibility issue in conventional VRS radial super-efficiency DEA models and the NL model under non-negative data.

In the presence of negative data, both the NL and LC models might be infeasible. This is because their related direction vectors, \((x_{ip},y_{rp})\) and \((x_{ip}+{\max}_{\mathop{j=1,\ldots ,n}\limits _{j\ne p}}\{x_{ij}\},y_{rp})\), might be negative, which could guide the DMUp to be further away from the super-efficiency frontier and thus lead to infeasibility. To illustrate this problem, we consider a simple example with six DMUs. Each DMU has three inputs and two outputs. The concrete data set of this example is shown in columns 2 to 6 of Table 1. Input X3 and output Y1 are negative for DMU 2 and DMU 1, respectively. The super-efficiency results yielded by the NL model and the LC model are shown in columns 7 and 8 of Table 1, respectively. Obviously, both the NL model and the LC model become infeasible for some DMUs.

Table 1 Input–output data and evaluation results

In order to rank DMUs in the presence of negative data, Hadi-Vencheh and Esmaeilzadeh (2013) propose the super RDM+ model and the super RDM− model, respectively. Let

$$ P_{ip}^{-}=x_{ip}-\min \limits _{j}\{x_{ij}\},\ P_{rp}^{+}=\max \limits _{j}\{y_{ij}\}-y_{rp}$$
(5)

(Portela et al, 2004). By employing \((P_{ip}^{-},P_{rp}^{+})\) and \((\frac{1}{P_{ip}^{-}},\frac{1}{P_{rp}^{+}})\) as the direction vector \((g^{x},g^{y})\) in model (1)–(4), respectively, the dual of the super RDM+ model and the dual of the super RDM− model can be obtained. If there exists some i or r such that \( P_{ip}^{-}=0\) or \(P_{rp}^{+}=0\), the direction \((\frac{1}{P_{ip}^{-}},\frac{1}{P_{rp}^{+}})\) would be meaningless. To solve this problem, we can use 0 to replace \(\frac{1}{P_{ip}^{-}}\) and \(\frac{1}{P_{rp}^{+}}\), as Portela et al (2004), Hadi-Vencheh and Esmaeilzadeh (2013) proposed. However, these two models still fail in some cases. Let us consider DMU 6 in the above example. According to (5), we can find \(P_{16}^{-}=9,\ P_{26}^{-}=0,\ P_{36}^{-}=2, P_{16}^{+}=0, P_{26}^{+}=3\). Then for both the dual super RDM+ model and the dual super RDM− model, the constraint (3) with respect to the output Y1 of DMU 6 is expressed as

$$ -8\lambda _{1}+4\lambda _{3}+\lambda _{4}+3\lambda _{5}\ge 6+0\cdot \beta =6{.} $$
(6)

According to (4), we have \(-8\lambda _{1}+4\lambda _{3}+\lambda _{4}+3\lambda _{5}\le 4\). This contradicts with (6). Therefore, both the dual super RDM+ model and the dual super RDM− model are infeasible for DMU 6. According to the duality theory of linear programs, the super RDM+ and the super RDM− models are infeasible or have an unbounded optimal value for DMU 6. Actually, these two models fail for DMUs 2 and 6, as one can see from the last two columns of Table 1.

3 Modified DDF-based super-efficiency model

Considering that negative values might exist in the input–output data, we need to choose a new direction vector which is always non-negative and nonzero, independent of inputs and outputs being non-negative or not. To this end, we introduce the following constants:

$$ a_{i}= k\cdot \max \limits _{j=1,\ldots ,n}\{|x_{ij}|\},\quad i=1,\ldots ,m,$$
(7)
$$b_{r}= \min \limits _{j=1,\ldots ,n}\{y_{rj}\},\quad r=1,\ldots ,s,$$
(8)

where k is a constant satisfying \(k\ge 3\). Clearly, we have \(x_{i{p}}+a_{i}>0\) and \(y_{r{p}}-b_{r}\ge 0\) for all \(i=1,\ldots ,m,\) and \(r=1,\ldots ,s,\) respectively. Therefore, we can choose \((x_{i{p}}+a_{i}, y_{r{p}}-b_{r})\) as \((g^{x}, g^{y})\) and then obtain the following VRS radial super-efficiency DEA model:

$$ \max \quad \beta _{p} $$
(9)
$$ \mathrm{s.t.} \quad \sum _{j=1, j\ne p}^{n}\lambda _{j}x_{ij} \le (1-\beta _{p})x_{ip}-a_{i}\beta _{p},\quad i=1,\ldots ,m, $$
(10)
$$ \sum _{j=1, j\ne p}^{n}\lambda _{j}y_{rj}\ge (1+\beta _{p})y_{rp}-b_{r}\beta _{p},\quad r=1,\ldots ,s, $$
(11)
$$ \sum _{j=1, j\ne p}^{n}\lambda _{j}=1,\ \lambda _{j}\ge 0,\; j=1,\ldots ,n,\ j\ne p{.} $$
(12)

For the sake of distinction, we denote the optimum value of model (9)–(12) as \(\beta ^{*}_{p}\). The following proposition shows the feasibility of model (9)–(12).

Proposition 1

Model (9)–(12) is feasible, and we have \(0\le \beta _{p}^{*}<1\) for \((x_{ip},y_{rp})\in T_{p}\) and \(-1\le \beta _{p}^{*}<0\) for \((x_{ip},y_{rp})\notin T_{p}\).

Proof

Since \(x_{i{p}}+a_{i}>0\), constraints (10) can be rewritten as

$$\beta _{p}\le \frac{x_{ip}-\sum^{n}_{\mathop{{j=1}}\limits_{\scriptstyle j\ne p}}\lambda_{j}x_{ij}}{x_{ip}+a_{i}},\ i=1,\ldots ,m{.} $$
(13)

For each \(p\in \{1,\ldots ,n\}\), let \(J_{p}=\{r|y_{rp}-b_{r}>0,r=1,\ldots ,s\}\) and \(O_{p}=\{r|y_{rp}-b_{r}=0,r=1,\ldots ,s\}\). As \(y_{rp}-b_{r}\ge 0\), we have \(J_{p}\bigcup O_{p}=\{1,\ldots ,s\}\).

According to (8), (11) and (12), we have

$$ \sum _{j=1,j\ne p}^{n}\lambda _{j}y_{rj}\ge \mathop {\mathop {\min }\limits _{j=1,\ldots ,n }}\limits _{ j\ne p} \{y_{rj}\} \ge \min \limits _{j=1,\ldots ,n} \{y_{rj}\}= b_{r}=y_{rp},\ r\in O_{p}{.} $$
(14)

It is easy to see from (14) that the inequalities in (11) naturally hold for all \(r\in O_{p}\). Hence, constraints (11) are equivalent to

$$ \beta _{p}\le \frac{{\sum}^{n}_{\mathop{{j=1}}\limits _{\scriptstyle j\ne p}} \lambda _{j}y_{rj}-y_{rp}}{y_{rb}-b_{r}},\ r\in J_{p}{.} $$
(15)

With (13) and (15), we consider the following two complementary cases for each DMU \(p\in \{1,\ldots ,n\}\).

Case I \((x_{ip},y_{rp})\in T_{p}\): In this case, we have \(x_{ip}\ge {\sum}^{n}_{\mathop{{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}x_{ij}\) and \(y_{rp}\le {\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}y_{rj}\) for \(i=1,\ldots ,m\) and \(r=1,\ldots ,s\), respectively. So,

$$\begin{aligned} \frac{x_{ip}-{\sum }^{n}_{\mathop {{j=1 }}\limits _{\scriptstyle j\ne p}}\lambda _{j}x_{ij}}{x_{ip}+a_{i}}\ge 0,\ i=1,\ldots ,m, \end{aligned}$$
(16)

and

$$\begin{aligned} \frac{{\sum}^{n}_{\mathop{{j=1}}\limits_{\scriptstyle j\ne p}}\lambda _{j}y_{rj}-y_{rp}}{y_{rb}-b_{r}}\ge 0,\ r\in J_{p}{.} \end{aligned}$$
(17)

Inequalities (13), (15), (16) and (17) mean that \(\beta _{p}=0\) is a feasible solution of model (9)–(12). Then as the optimum value of problem (9)–(12), \(\beta _{p}^{*}\ge 0\) always holds for \( p=1,\ldots ,n.\) In addition, we have from (7), (12) and (13) that

$$\begin{aligned} \beta _{p}& \le \frac{x_{ip}-\sum \nolimits _{j=1,j\ne p}^{n}\lambda _{j}x_{ij}}{x_{ip}+a_{i}} \le \frac{x_{ip}+{\max}_{\mathop {j=1,\ldots ,n}\limits _{j\ne p}}\{|x_{ij}|\}}{x_{ip}+a_{i}}\\ & \le \frac{x_{ip}+\max \nolimits _{j=1,\ldots ,n}\{|x_{ij}|\}}{x_{ip}+a_{i}}< 1. \end{aligned}$$

Thus in this case, we have \(0\le \beta _{p}^{*}<1\).

Case II \((x_{ip},y_{rp})\notin T_{p}\): We know from (14) that the r satisfying \({\sum}^{n}_{\mathop{{j=1 }}\limits_{\scriptstyle j\ne p}}\lambda _{j}y_{rj}< y_{rp}\) does not belong to \(O_{p}\). Hence in this case, there must exist some \(i\in \{1,\ldots ,m\}\) such that \({\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}x_{ij}> x_{ip}\) or some \(r\in J_{p}\) such that \({\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}y_{rj}< y_{rp}\). We can then easily see from (13) and (15) that model (9)–(12) is still feasible in this case and its optimal value, \(\beta ^{*}_{p}\), is negative. Indeed, since \(k\ge 3\), we know from (7) that

$$\begin{aligned} \max _{j=1,\ldots ,n}\{|x_{ij}|\}\le 2x_{ip}+a_{i}. \end{aligned}$$
(18)

From (12) and (18), we have

$$\begin{aligned} & \frac{x_{ip}-{\sum}^{n}_{\mathop {{j=1}}\limits_{\scriptstyle j\ne p}}\lambda _{j}x_{ij}}{x_{ip}+a_{i}}\ge \frac{x_{ip}- \max \nolimits _{j=1,\ldots ,n}\{|x_{ij}|\}}{x_{ip}+a_{i}}\\ & \quad \ge \frac{x_{ip}-2x_{ip}-a_{i}}{x_{ip}+a_{i}}=-1,\ i=1,\ldots ,m{.} \end{aligned}$$
(19)

On the other hand, we obtain from (8) and (12) that

$$\begin{aligned} & \frac{{\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}y_{rj}-y_{rp}}{y_{rb}-b_{r}}\ge \frac{\min \nolimits _{j=1,\ldots ,n}\{y_{rp}\}-y_{rp}}{y_{rp}-b_{r}}\\ & \quad =\frac{b_{r}-y_{rp}}{y_{rp}-b_{r}}=-1,\, r\in J_{p}{.} \end{aligned}$$
(20)

Since we maximize \(\beta _{p}\) in model (9)–(12), \(\beta ^{*}_{p}\ge -1,\ p=1,\ldots ,n,\) always holds due to (13), (15), (19) and (20). Therefore, \(-1\le \beta ^{*}_{p}<0\) for \((x_{ip},y_{rp})\notin T_{p}\). \(\square \)

According to Proposition 1, no matter whether inputs and outputs are non-negative or not, for each DMU whose input–output bundle belongs to \(T_{p}\), model (9)–(12) expands its outputs and reduces its inputs simultaneously to reach the super-efficiency frontier formed by the rest of DMUs; for each DMU whose input–output bundle does not belong to \(T_{p}\), model (9)–(12) reduces (expands) at least one of its outputs (inputs) to reach the super-efficiency frontier formed by the remaining DMUs.

Inspired by the criterion for identifying efficiency in conventional DEA models such as the CCR model (Charnes et al, 1978), we judge whether the evaluated DMU is efficient or not under model (9)–(12) by the following criterion: The DMU p is inefficient if \(\beta _{p}^{*}>0\) or \(\beta _{p}^{*}=0\) with positive slacks yielded by model (9)–(12); otherwise, it is efficient. Let \(\lambda _{j}^{*}\) denote the optimal maximum slack solution (Cooper et al, 2007) of model (9)–(12). Then, the inputs and outputs of the projection on the super-efficiency frontier with respect to the DMUp can be expressed as

$$\begin{aligned} \hat{x}_{ip}= & {} \sum _{j=1, j\ne p}^{n}\lambda _{j}^{*}x_{ij},\ i=1,\ldots ,m, \end{aligned}$$
(21)
$$\begin{aligned} \hat{y}_{rp}= & {} \sum _{j=1, j\ne p}^{n}\lambda _{j}^{*}y_{rj},\ r=1,\ldots ,s. \end{aligned}$$
(22)

Like traditional DEA models, model (9)–(12) might provide multiple projections for the evaluated DMU because there might exist multiple optimal solutions for \(\lambda _{j}^{*}\). From Proposition 1 and constraints (10) and (11), we know that model (9)–(12) projects the DMUp onto the super-efficiency frontier formed by the rest of DMUs along the direction \((x_{ip}+a_{i},y_{rp}-b_{r})\), without the actual data transformation. Concretely, for inefficient DMUs, it preserves the proportionate improvement property of the traditional DEA model and any projection it provides can be seen as an improved target for the evaluated DMU since \(\hat{x}_{ip}={\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}^{*}x_{ij}\le x_{ip}-\beta _{p}^{*}(x_{ip}+a_{i})\le x_{ip}, \ \hat{y}_{rp}={\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}^{*}y_{rj}\ge y_{rp}+\beta _{p}^{*}(y_{rp}-b_{r})\ge y_{rp},\forall i, r,\) hold; for efficient DMUs, at least one of the outputs (or inputs) of projections is not larger (or not smaller) than that of the corresponding DMU due to the positive \(\beta _{p}^{*}\) or \(\beta _{p}^{*}=0\) with all zero slacks. Thus, model (9)–(12) generalizes current VRS radial super-efficiency DEA models suitable for non-negative data to the situation with partially or fully negative data.

Similarly to the determination of the super-efficiency score in Ray (2008), the DDF-based super-efficiency score for DMUp is determined by \(1-\beta ^{*}_{p}\) under model (9)–(12). From Proposition 1, we have the following corollary about the boundedness of the super-efficiency score determined by model (9)–(12).

Corollary 1

\((1-\beta ^{*}_{p})\in (0,2]\).

We know from Ray (2008) that under model (9)–(12), the outputs of the reference point for DMUp are

$$\begin{aligned} \tilde{y}_{rp}=(1+\beta ^{*}_{p})y_{rp}-b_{r}\beta ^{*}_{p},\ r=1,\ldots ,s. \end{aligned}$$
(23)

According to Proposition 1, we have

$$\begin{aligned} \tilde{y}_{rp} & = y_{rp}+\beta ^{*}_{p}(y_{rp}-b_{r})\ge y_{rp},\ (x_{ip},y_{rp})\in T_{p},\\ \tilde{y}_{rp}& \ge y_{rp}-(y_{rp}-b_{r})=b_{r},(x_{ip},y_{rp})\notin T_{p}. \end{aligned}$$

Therefore, we have the following conclusion for \(\tilde{y}_{rp}\):

Corollary 2

For the data set with non-negative outputs, \(\tilde{y}_{rp}\ge 0\) holds for any DMU \(p,\ p \in \{1,\ldots ,n\}\).

Corollary 2 shows that the conceptual problem described in Ray (2008) does not occur under our model. From Proposition 1 and Corollary 2, we know that the proposed model eliminates the infeasibility issue of the conventional VRS super-efficiency DEA model and the NL model and meanwhile extends the applicability of the VRS radial super-efficiency DEA model to the situation with negative input–output data.

Furthermore, our model has the following three useful properties.

(i) Monotonicity

Suppose that the inputs of DMUp are reduced to \(x_{ip}-\Delta x_{ip}\) and the outputs of DMUp are increased to \(y_{rp}+\Delta y_{rp}\), here \(\Delta x_{ip}\ge 0\) and \(\Delta y_{rp}\ge 0\) for all \(i=1,\ldots ,m,\) and \(r=1,\ldots ,s,\) respectively. Notice that here the input (output) data of the DMUp are decreased (increased). According to the principle, we introduced the constants \(a_{i}\) and \(b_{r}\) in (7) and (8), and these two constants should be determined by considering all possible values of inputs and outputs. Therefore, \(a_{i}\) and \(b_{r}\) in this situation should be determined by

$$ a_{i}= k\cdot \max \{|x_{ij}|,\ \forall j,\ |x_{ip}-\Delta x_{ip}|\},\ i=1,\ldots ,m,$$
(24)
$$ b_{r}= \min \{y_{rj},\ \forall j, y_{rp}+\Delta y_{rp}\}=\min \limits _{j=1,\ldots ,n}\{y_{rj}\},\ r=1,\ldots ,s{.} $$
(25)

Equality (25) holds due to the non-negativity of \(\Delta y_{rp}\). With \(a_{i}\) and \(b_{r}\) in (24) and (25), we have the following conclusion:

Proposition 2

The optimal value of model (9)–(12) does not increase if inputs (outputs) of the DMU p are reduced (increased).

Proof

After the specified input reduction and output expansion, the direction vector becomes \((x_{ip}-\Delta x_{ip}+a_{i},y_{rp}+\Delta y_{r}-b_{r})\). With \(a_{i}\) and \(b_{r}\) in (24) and (25), we have \(x_{ip}-\Delta x_{ip}+a_{i}>0,\ i=1,\ldots ,m,\) and \(y_{rp}+\Delta y_{r}-b_{r}\ge 0, \ r=1,\ldots ,s.\) Then, the corresponding model (9)–(12) for the DMUp should be rewritten as

$$ \max\quad \beta _{p}$$
(26)
$$\mathrm{s.t.}\quad \sum _{j=1,j\ne p}^{n}\lambda _{j}x_{ij} \le (1-\beta _{p})(x_{ip}-\Delta x_{ip})-a_{i}\beta _{p},\ i=1,\ldots ,m,$$
(27)
$$\begin{aligned}&\sum _{j=1, j\ne p}^{n}\lambda _{j}y_{rj}\ge (1+\beta _{p})(y_{rp}+\Delta y_{rp})-b_{r}\beta _{p},\ r=1,\ldots ,s, \end{aligned}$$
(28)
$$\begin{aligned}&\sum _{j=1, j\ne p}^{n}\lambda _{j}=1,\ \lambda _{j}\ge 0,\;j=1,\ldots ,n,\ j\ne p{.} \end{aligned}$$
(29)

Denote the optimal solution of model (26)–(29) as \((\beta _{p}^{'},\lambda _{j}^{'})\). By using the same derivation as that for (13) and (15), constraints (27) and (28) are equivalent to

$$\begin{aligned} \beta _{p}\le \frac{x_{ip}-\Delta x_{ip}- {\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}x_{ij}}{x_{ip}-\Delta x_{ip}+a_{i}},\ i=1,\ldots ,m, \end{aligned}$$
(30)

and

$$\begin{aligned} \beta _{p}\le \frac{{\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}y_{rj}-y_{rp}-\Delta y_{rp}}{y_{rb}+\Delta y_{rp}-b_{r}}, \forall r\in J'_{p}, \end{aligned}$$
(31)

respectively, where \(J'_{p}=\{r|y_{rp}+\Delta y_{rp}-b_{r}>0,r=1,\ldots ,s\}\). We have from (29) and (24) that

$$\begin{aligned} \frac{x_{ip}-\Delta x_{ip}-\sum \nolimits _{j=1,j\ne p}^{n}\lambda _{j}x_{ij}}{x_{ip}-\Delta x_{ip}+a_{i}}\le \frac{x_{ip}-\Delta x_{ip}+{\max}_{\mathop{j=1,\ldots ,n,}\limits_{\scriptstyle j\ne p}}\{|x_{ij}|\}}{x_{ip}-\Delta x_{ip}+a_{i}}<1. \end{aligned}$$
(32)

Therefore, we have \(\beta _{p}^{'}<1\). Since \(k\ge 3\), we know from (24) that

$$\begin{aligned} & \mathop {\mathop {\max }\limits _{j=1,\ldots ,n,}}\limits _{ j\ne p}\{|x_{ij}|\}\le \max \{|x_{ij}|,\ \forall j, \ |x_{ip}-\Delta x_{ip}|\}\\ & \quad \le 2(x_{ip}-\Delta x_{ip})+a_{i}. \end{aligned}$$

As \(x_{ip}-\Delta x_{ip}+a_{i}>0,\ i=1,\ldots ,m\), we have

$$\begin{aligned}& \frac{x_{ip}-\Delta x_{ip}- {\sum}^{n}_{\mathop {{j=1}}\limits _{\scriptstyle j\ne p}}\lambda _{j}x_{ij}}{x_{ip}-\Delta x_{ip}+a_{i}}\ge \frac{x_{ip}-\Delta x_{ip}-{\max}_{\mathop{j=1,\ldots ,n,}\limits_{\scriptstyle j\ne p}}\{|x_{ij}|\}}{x_{ip}-\Delta x_{ip}+a_{i}}\ge \\ & \ \ \ \ \frac{x_{ip}-\Delta x_{ip}-2(x_{ip}-\Delta x_{ip})-a_{i}}{x_{ip}-\Delta x_{ip}+a_{i}}=-1,\ i=1,\ldots ,m{.} \end{aligned}$$
(33)

On the other hand, we have from (31) that

$$\begin{aligned} \frac{{\sum }^{n}_{\mathop {{j=1 }}\limits _{\scriptstyle j\ne p}}\lambda _{j}y_{rj}-y_{rp}-\Delta y_{rp}}{y_{rp}+\Delta y_{rp}-b_{r}}\ge \frac{\min \nolimits _{j=1,\ldots ,n}\{y_{rp}\}-y_{rp}-\Delta y_{rp}}{y_{rp}+\Delta y_{rp}-b_{r}}= -1, \forall r\in J'_{p}. \end{aligned}$$
(34)

From (30), (31), (33) and (34), we have \(\beta '_{p}\ge -1\) for \(p=1,\ldots ,n,\) since \(\beta _{p}\) is maximized in model (26)–(29). Then, \(1-\beta '_{p}\) and \(1+\beta '_{p}\) are non-negative, and we have

$$\begin{aligned} & \sum \limits _{j=1,j\ne p}^{n}\lambda '_{j}x_{ij}\le (1-\beta '_{p})(x_{ip}-\Delta x_{ip})-a_{i}\beta '_{p}\\ & \quad \le (1-\beta '_{p})x_{ip}-a_{i}\beta '_{p},\ i=1,\ldots ,m{.} \end{aligned}$$
(35)
$$\begin{aligned} & \sum \limits _{j=1,j\ne p}^{n}\lambda '_{j}y_{rj}\ge (1+\beta '_{p})(y_{rp}+\Delta y_{rp})-b_{r}\beta '_{p}\\ & \quad \ge (1+\beta '_{p})y_{rp}-b_{r}\beta '_{p},\ r=1,\ldots ,s{.} \end{aligned}$$
(36)

As chosen constants, the values of \(a_{i}\) and \(b_{r}\) in model (26)–(29) should be the same as those in model (9)–(12). Thus according to (35) and (36), \((\lambda _{j}^{'},\beta _{p}^{'})\) is a feasible solution vector for the model (9)–(12). Therefore, \(\beta _{p}^{*}\ge \beta _{p}^{'}\) holds since we maximize \(\beta _{p}\) in model (9)–(12). \(\square \)

(ii) Units invariance

Assume that the \(i_{0}\)th \((i_{0}\in \{1,\ldots ,m\})\) input and the \(r_{0}\)th \((r_{0}\in \{1,\ldots ,s\})\) output of all DMUs are scaled by the factors \(c,\ c>0 \ \mathrm{and} \ c\ne 1, \) and \(d,\ d>0 \ \mathrm{and} \ d\ne 1\), respectively. After this scaling, \(a_{i_{0}}\) and \(b_{r_{0}}\) should be determined by \(k\cdot \max \nolimits _{j=1,\ldots ,n}\{|c\cdot x_{i_{0}j}|\}=c\cdot k\cdot \max \nolimits _{j=1,\ldots ,n}\{|x_{i_{0}j}|\}\) and \(\min \nolimits _{j=1,\ldots ,n}\{d\cdot y_{r_{0}j}\}=d\cdot \min \nolimits _{j=1,\ldots ,n}\{y_{r_{0}j}\}\), respectively, according to (7) and (8). Thus, the corresponding constraints in (10) and (11) become

$$\begin{aligned} & \sum \limits _{j=1,j\ne p}^{n}\lambda _{j}\cdot c\cdot x_{i_{0}j}\le (1-\beta _{p})\cdot c\cdot x_{i_{0}p}-c\cdot k \\ & \quad \cdot \max \limits _{j=1,\ldots ,n}\{|x_{i_{0}j}|\}\cdot \beta _{p},\ i=1,\ldots ,m,\\ & \sum \limits _{j=1,j\ne p}^{n}\lambda _{j}\cdot d\cdot y_{r_{0}j}\ge (1+\beta _{p})\cdot d \\ & \quad \cdot y_{r_{0}p}-d\cdot \min \limits _{j=1,\ldots ,n}\{y_{r_{0}j}\}\cdot \beta _{p},\ r=1,\ldots ,s. \end{aligned}$$

These constraints are equivalent to the original constraints in (10) and (11), respectively. Therefore, the optimal solution of our model does not change after the above scaling. This means that our model is unit invariant.

(iii) Translation invariance

\(\bullet \) Output translation Assume that all the outputs of DMUs are translated to \({y}'_{rj}={y}_{r,j}+\tau _{r},\ r=1,\ldots ,s,\ j=1,\ldots ,n,\) where \(\tau _{r},\ r=1,\ldots ,s,\) are arbitrary constants. After this translation, \(b_{r}\) should be calculated by

$$\begin{aligned} b_{r} & =\min \limits _{j=1,\ldots ,n}\{{y}'_{rj}\}=\min \limits _{j=1,\ldots ,n}\{{y}_{r,j}+\tau _{r}\}=\min \limits _{j=1,\ldots ,n}\{{y}_{r,j}\}\\ & \quad +\tau _{r}, \ r=1,\ldots ,s, \end{aligned}$$
(37)

according to (8). Therefore, the constraints in (11) become

$$\begin{aligned} & \sum \limits _{j=1,j\ne p}^{n}\lambda _{j}(y_{rj}+\tau _{r})\ge y_{rp}+\tau _{r}+\beta _{p}\left(y_{rp}+\tau _{r} \right. \\ & \quad \left. -\min \limits _{j=1,\ldots ,n}\{{y}_{r,j}\}-\tau _{r}\right),\ r=1,\ldots ,s, \end{aligned}$$

which is equivalent to (11) due to (12). Hence, model (9)–(12) is output translation invariant.

\(\bullet \) Input translation If all the inputs of DMUs are translated to \({x}'_{ij}={x}_{i,j}+\eta _{i},\ i=1,\ldots ,m,\ j=1,\ldots ,n,\) where \(\eta _{i},\ i=1,\ldots ,m,\) are arbitrary constants. After this translation, \(a_{i}\) should be calculated by

$$\begin{aligned} a_{i}=k\cdot \max \limits _{j=1,\ldots ,n}\{|{x}'_{ij}|\}=k\cdot \max \limits _{j=1,\ldots ,n}\{|{x}_{i,j}+\eta _{i}|\},\ i=1,\ldots ,m{.} \end{aligned}$$
(38)

With (38), the constraints in (10) become

$$\begin{aligned} & \sum \limits _{j=1,j\ne p}^{n}\lambda _{j}(x_{ij}+\eta _{i}) \le x_{ip}+\eta _{i} -\beta _{p}\left(x_{ip}+\eta _{i} \right. \\ & \left. \quad +k\cdot \max \limits _{j=1,\ldots ,n}\{|{x}_{i,j}+\eta _{i}|\}\right),\ i=1,\ldots ,m. \end{aligned}$$

These constraints are not equivalent to the original constraints in (10).

To ensure the input translation invariance, we should translate \(a_{i}\) by the opposite amount of the corresponding input translation, ie,

$$\begin{aligned} a_{i}=k\cdot \max \limits _{j=1,\ldots ,n}\{|{x}_{ij}|\}-\eta _{i},\ i=1,\ldots ,m{.} \end{aligned}$$
(39)

With (39), the constraints in (10) become

$$\begin{aligned} & \sum \limits _{j=1,j\ne p}^{n}\lambda _{j}(x_{ij}+\eta _{i}) \le x_{ip}+\eta _{i} -\beta _{p}\left(x_{ip}+\eta _{i}\right. \\ & \quad \left. +k\cdot \max \limits _{j=1,\ldots ,n}\{|{x}_{i,j}|\}-\eta _{i}\right),\ i=1,\ldots ,m, \end{aligned}$$

which are the same as those original constraints in (10) due to (12). Because of the adjustment in (39), we call the above input translation invariance with parameter adjustment the generalized input translation invariance for distinction. Although our model does not satisfy the traditional input translation invariance, it is generalized input translation invariant.

4 Numerical examples

In this section, two numerical examples are used to show the applicability and merits of the proposed model.

Example 1

To show the properties of the proposed model and meanwhile to compare it with the NL model (Ray, 2008), the LC model (Lin and Chen, 2015), the super RDM+ model and the super RDM− model (Hadi-Vencheh and Esmaeilzadeh, 2013), we apply the proposed model to the data set in Table 1, situated in Section 2 above. In this paper, we set \(k=3\). Then according to (7) and (8), we have \(a_{1}=27\), \(a_{2}=3\), \(a_{2}=6\), \(b_{1}=-8\) and \(b_{2}=0\) for the data set in Table 1. The proposed model is feasible for each of six DMUs. The resulting optimal value is shown in the second column of Table 2. Note that for DMU 4, some resulting slacks are not equal to zero. Thus, except DMU 4, all other DMUs are efficient. According to the super-efficiency scores, these six DMUs are ranked as column 4 of Table 2 shows. The last five columns of Table 2 show the inputs and outputs of the projection of each DMU on the super-efficiency frontier. For this data set (as well as the data set in Example 2), our model yields a unique projection for each DMU. It is easy to see from the projection results that: Inefficient DMU 4 should generate lower target inputs and higher target outputs than the original values in order to reach the super-efficiency frontier; other DMUs should reduce (expand) at least one of its outputs (inputs) to reach the super-efficiency frontier.

To examine the monotonicity of the proposed model, we assume that the first input of DMU 1 is decreased from 1 to −99 and its first output is increased from −8 to 192 in the following way: \(x_{1,1}=1-l\), and \(y_{1,1}=-8+2\cdot l\), here l increases from 0 to 100 with the step size being 1. As the values of \(x_{1,1}\) and \(y_{1,1}\) are changed in this situation, then according to (24) and (25), we have

$$\begin{aligned} a_{1} &= 3\cdot \max \{|1-l|,0,4,4,2,9,\ l=0,\ldots ,100\}=297,\\ a_{2} & = 3\cdot \max \{0,1,0,0,0,0\}=3,\\ a_{3} &= 3\cdot \max \{0,|-2|,0,0,0,0\}=6,\\ b_{1}&= \min \{-8,0,4,1,3,6\}=-8,\\ b_{2}& =\min \{0,3,0,0,0,0\}=0, \end{aligned}$$

for the changing data set. When l increases from 0 to 100, the proposed model is always feasible and the optimal value \(\beta _{1}^{*}\) decreases gradually from −0.00329 to −0.93000. Figure 1 plots the change of \(\beta _{1}^{*}\) with respect to l. It is easy to see from Figure 1 that \(\beta _{1}^{*}\) monotonically decreases with the increase of l. This confirms the theoretical analysis about the monotonicity of model (9)–(12).

Table 2 Results for six DMUs
Figure 1
figure 1

The change of the optimal value of model (9)–(12) for DMU 1

For the data set in Table 1, we assume that all the inputs are scaled down by 0.001 and two outputs are scaled down by 0.01. Then according to (7) and (8), we have \(a_{1}=0.027, a_{2}=0.003, a_{3}=0.006\), \(b_{1}=-0.08\) and \(b_{2}=0\) for the scaled data. By solving problem (9)–(12) with the scaled data, we find that all the resulting super-efficiency scores are still equal to those in column 3 of Table 2. This confirms the theoretical result about the unit invariance of the proposed model.

Assume the input and output values of all DMUs are translated to

$$\begin{aligned} x'_{1j} & =x_{1j};\ x'_{2j}=x_{2j};\; x'_{3j} =x_{3j}+4;\\ y'_{1j} & =y_{1j}+9,\ y'_{2j}=y_{2j},\ j=1,\ldots ,6{.} \end{aligned}$$
(40)

Then according to (39) and (37), we have

$$\begin{aligned} a_{1}=27, \ a_{2}=3, \ a_{3}=6-4=2, \ b_{1}=-8+9=1, \ b_{2}=0. \end{aligned}$$
(41)

With \(a_{i}\) and \(b_{r}\) in (41), we solve problem (9)–(12) under the translated data and find that all the resulting super-efficiency scores are still equal to those in column 3 of Table 2. This confirms the usual output translation invariance and the generalized input translation invariance of the proposed model.

It is worth noting that by the translation in (40), the input–output values of every DMU become non-negative. The first input is zero for DMU 2, and it is positive for other DMUs. By solving the NL model under these translated data, we find that it is infeasible for DMU 2 and the resulting super-efficiency score for DMU 1 is 2.5, which results in a negative referenced output for DMU 1. Therefore, the infeasibility issue in two exceptions of the NL model, mentioned in Section 2, occurs under the translated data. In contrast, the proposed model is feasible under the translated data for all DMUs, and we have from (23) that the outputs of referenced points for individual DMUs are \( \tilde{y'}_{11}=11.680 ,\tilde{y'}_{12}=10.183,\tilde{y'}_{13}=12.932,\tilde{y'}_{14}=10.903,\tilde{y'}_{15}=11.532 ,\tilde{y'}_{16}=13.000, \tilde{y'}_{21}=0.320 ,\tilde{y'}_{22}=0.000,\tilde{y'}_{23}=0.000,\tilde{y'}_{24}=0.000,\tilde{y'}_{25}=0.383,\tilde{y'}_{26}=0.000, \) which are non-negative. This confirms the conclusion of Corollary 2. From Proposition 1, Corollary 2 and the above analysis, we can safely say that for non-negative data, the proposed model is feasible and ensures the non-negativity of the referenced outputs for all DMUs. So, for the non-negative data set, the infeasibility issue of the NL model (Ray, 2008) does not occur for model (9)–(12).

Example 2

In order to further show the practicability of our model, we consider the data set in Sharp et al (2007). This data set has 13 DMUs with two inputs and three outputs. The detailed data are shown in columns 2–6 of Table 3. It is easy to see from Table 3 that just one input (cost) and one output (saleable) are non-negative and other data are non-positive. For this example, we have \(a_{1}=32.4,\ a_{2}=6.96\), \(b_{1}=0.49, \ b_{2}= -1.42\) and \(b_{3}=-3.79\) according to (7) and (8). By solving the proposed model for all DMUs, we obtain the resulting super-efficiency scores, which are shown in column 7 of Table 3.

It is easy to see that the proposed model is feasible for all DMUs. Compared with DEA models handling negative data (Portela et al, 2004; Sharp et al, 2007; Emrouznejad et al, 2010; Cheng et al, 2013), an advantage of the proposed model is that it can differentiate the performance of efficient DMUs. This superiority comes from an inherent characteristic of super-efficiency DEA models. According to the resulting super-efficiency scores, all DMUs are ranked as column 8 of Table 3 shows.

Table 4 shows the target input–output values of inefficient DMUs, determined by the proposed model. It is obvious that under the proposed model, each inefficient DMU should reduce its inputs and expand its outputs in order to tend to the super-efficiency frontier. Therefore, the proposed model can provide improved target inputs and outputs for all inefficient DMUs.

Table 3 Input–output data and super-efficiency results
Table 4 Improved targets for inefficient DMUs

From the theoretical analyses and the above two examples, we can conclude that the proposed model can deal with the data set with negative values and can provide improved targets for inefficient DMUs. In addition, our model is unit invariant, output translation invariant, generalized input translation invariant and monotonous. For the non-negative data set, the proposed model fully eliminates the infeasibility issue of the NL model. Therefore, the proposed model successfully addresses the infeasibility problem occurring in conventional VRS radial super-efficiency DEA models and the NL model. More importantly, different from current DEA models handling negative data, the proposed model can rank efficient DMUs.

5 Conclusions

Super-efficiency model in the presence of negative data is a rather neglected issue in the DEA field. The existing super-efficiency models capable of handling negative data might be infeasible in some cases. By choosing appropriate direction variables in the DDF, this paper develops a DDF-based VRS radial super-efficiency DEA model for dealing with negative data. Compared with existing related models, the proposed model is feasible no matter whether the input–output data are non-negative or not, and meanwhile, it can rank all DMUs. It can project each DMU onto the super-efficiency frontier along a suitable direction and can provide improved targets for inefficient DMUs. It possesses good properties such as monotonicity, units invariance, output translation invariance and generalized input translation invariance. Moreover, it successfully eliminates the infeasibility issue occurring in two exceptions of the NL model. In summary, the proposed model not only overcomes the infeasibility issue of VRS super-efficiency DEA models, but also extends current VRS radial super-efficiency DEA models suitable for non-negative data to the situation with partially or fully negative data.

The new super-efficiency DEA model is developed by utilizing the directional distance function, and the resulting super-efficiency scores are not equal to those yielded by traditional radial super-efficiency DEA models if the latter models are feasible and the input–output data are non-negative. As for the future research, we will propose an alternative radial super-efficiency DEA model which not only overcomes the above drawback, but also keeps all advantages of the proposed super-efficiency DEA model.