Skip to main content

Cross redundancy and sensitivity in DEA models

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Data envelopment analysis (DEA) measures the efficiency of each decision making unit (DMU) by maximizing the ratio of virtual output to virtual input with the constraint that the ratio does not exceed one for each DMU. In the case that one output variable has a linear dependence (conic dependence, to be precise) with the other output variables, it can be hypothesized that the addition or deletion of such an output variable would not change the efficiency estimates. This is also the case for input variables. However, in the case that a certain set of input and output variables is linearly dependent, the effect of such a dependency on DEA is not clear. In this paper, we call such a dependency a cross redundancy and examine the effect of a cross redundancy on DEA. We prove that the addition or deletion of a cross-redundant variable does not affect the efficiency estimates yielded by the CCR or BCC models. Furthermore, we present a sensitivity analysis to examine the effect of an imperfect cross redundancy on DEA by using accounting data obtained from United States exchange-listed companies.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2

Notes

  1. Nunamaker (1985), however, points out that even the addition of a highly correlated variable may alter the resulting DEA efficiency estimates substantially.

  2. Principal component analysis is based on the covariance matrix between variables.

  3. If there are no restrictions on the sign of a i , such a dependence is called a linear dependence. Readers who are not familiar with the term “conic combination” are referred to Barvinok (2002, p.65) and Boyd and Vandenberghe (2004, p.24) for examples.

  4. In the jargon of statistics, if there is a cross redundancy between input and output variables, then the absolute value of the first canonical correlation equals one.

  5. Dulá (1997) examined relationships between DEA and the theory of redundancy in linear systems, but he studied the redundancy of a set of DMUs with respect to the other DMUs. The current study focuses on the redundancy of variables.

  6. Some prior studies, including Smith (1990), Thore et al. (1993), and Halkos and Salamouris (2004), used DEA to examine a firm’s performance by using accounting data and including net income as one of the output variables.

  7. The current study focuses on the CCR and BCC models. For the output-oriented BCC models, to be precise, inputs are not restricted to be nonnegative. But, for the output-oriented CCR models, inputs are restricted to be nonnegative [refer to page 102 in Cooper et al. (2000)]. Hence, to accommodate both output-oriented CCR and BCC models, we consider only nonnegative inputs and outputs when we deal with output-oriented models. Also, practically, negative inputs are not useful.

  8. We analyze the output-oriented models separately because we believe that the proof of Theorem 2 is not easily deduced from the proof of Theorem 1.

  9. For sensitivity analyses and simulation studies of DEA models, refer to Banker et al. (1996a, 2004), Charnes et al. (1985), Cooper et al. (2001), and Seiford and Zhu (1998) for examples.

  10. http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/.

  11. For a detailed explanation of the selection of input variables, readers are referred to Section 3 in Demerjian et al. (2006).

  12. The item number represents the numeric code used in the Compustat database.

  13. This is a slight abuse of notations in the sense that we are using subindices, not superindices, to enumerate input or output variables. But, using these vector notations simplifies the presentation of our results.

  14. In this study, we use an absolute efficiency change rather than a percentage efficiency change to measure sensitivity. Our rationale for this is as follows. Suppose there are only two DMUs and efficiency scores change from 0.1 to 0.2 for one DMU and from 0.9 to 1.0 for the other DMU after the inclusion of additional output variables. If we use a mean absolute efficiency change as a summary measure, we will report a value of 0.1. If we use a mean percentage efficiency change as a summary measure, we will report a value of 55.6%. Which one between 0.1 and 55.6% is more informative as a single summary measure? Although answers would depend on analysts’ subjective tastes, our opinion is that, if we use a percentage change, then summary measures usually employed, such as the arithmetic or geometric mean, would put disproportionately larger weights on efficiency changes from lower efficiency scores than on those from higher efficiency scores. For this reason, we believe that it would be more difficult to interpret summary measures based on percentage change than those based on absolute change. Also, we suspect that this would be one main reason that several prior studies (e.g., Banker et al. 1993, 1996a, and 2004) use an absolute change (the mean absolute deviation, to be precise) as a measure of sensitivity.

  15. Note, however, that R 2 is not exactly equal to the degree of cross redundancy between input and output variables. In Equation (2), we restricted the sign of coefficients a r and b i to be nonnegative, but in the regression of net income on sales and other inputs, the coefficients are not restricted in sign.

  16. As a perturbation scheme, we can also consider an additive perturbation scheme: y ε3  = y 3 + ε. In this case, to make ρ = corr(y 3y ε3 ), we need to set \(\sigma^{2}_{\epsilon} = (1 - {\rho ^{2}}) {\sigma^{2}_{y_{3}}}/ \rho^2\). For industry #33, \({\sigma^{2}_{y_{3}}}\) is greater than twenty million. Hence, even if ρ2 = 0.99, we must have σ 2ε  ≥ 200000. However, for a number of DMUs in this industry, the value of y 3 is close to zero. Therefore, the additive perturbation scheme, y ε3  = y 3 + ε, results in a change in the sign of y 3 for a large number of DMUs, which alters the DEA results significantly. Thus, in the current study, we believe that use of the multiplicative perturbation scheme is appropriate.

  17. We repeated our simulation study using data from two other industries, industry #31 (Utilities) and industry #43 (Restaurants, Hotels, and Motels). The results from these industries are much better than the results in Table 2. Here, the term “better” means that the effect of a perturbation on DEA is smaller.

  18. For a sample from any distribution, a larger the sample size results in a larger maximum difference. Hence, a maximum difference increase is expected as the sample size increases.

References

  • Andersen P, Petersen NC (1993) A procedure for ranking efficient units in data envelopment analysis. Manag Sci 39:1261–1264

    Article  Google Scholar 

  • Banker RD (1993) Maximum likelihood, consistency and data envelopment analysis: a statistical foundation. Manag Sci 39:1265–1273

    Article  Google Scholar 

  • Banker RD (1996) Hypothesis tests using data envelopment analysis. J Prod Anal 7:139–159

    Article  Google Scholar 

  • Banker RD, Bardhan I, Cooper WW (1996) A note on returns to scale in DEA. Eur J Oper Res 88:583–585

    Article  Google Scholar 

  • Banker RD, Chang H, Cooper WW (1996a) Simulation studies of efficiency, returns to scale and misspecification with nonlinear functions in DEA. Ann Oper Res 66:233–253

    Google Scholar 

  • Banker RD, Chang H, Cooper WW (1996b) Equivalence and implementation of alternative methods for determining returns to scale in data envelopment analysis. Eur J Oper Res 89:473–481

    Article  Google Scholar 

  • Banker RD, Chang H, Cooper WW (2004) A simulation study of DEA and parametric frontier models in the presence of heteroscedasticity. Eur J Oper Res 153:624–640

    Article  Google Scholar 

  • Banker RD, Charnes A, Cooper WW (1984) Some models for estimating technical and scale inefficiencies in data envelopment analysis. Manag Sci 30:1078–1092

    Article  Google Scholar 

  • Banker RD, Gadh M, Gorr WL (1993) A Monte Carlo comparison of two production frontier estimation metods: corrected ordinary least squares and data envelopment analysis. Eur J Oper Res 67:332–343

    Article  Google Scholar 

  • Banker RD, Thrall RM (1992) Estimation of returns to scale using data envelopment analysis. Eur J Oper Res 62:74–84

    Article  Google Scholar 

  • Barvinok A (2002) A course in convexity. American Mathematical Society, USA

    Google Scholar 

  • Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, Cambridge

    Google Scholar 

  • Caves DW, Christensen LR, Diewert WE (1982) The economic theory of index numbers and the measurement of input, output and productivity. Econometrica 50:1393–1414

    Article  Google Scholar 

  • Charnes A, Cooper WW, Lewin AY, Morey RC, Rousseau J (1985) Sensitivity and stability analysis in DEA. Ann Oper Res 2:139–156

    Article  Google Scholar 

  • Charnes A, Cooper WW, Rhodes E (1978) Measuring the efficiency of decision making units. Eur J Oper Res 2:429–444

    Article  Google Scholar 

  • Cook WD, Kress M, Seiford LM (1992) Prioritization models for frontier decision making units in DEA. Eur J Oper Res 59:319–323

    Article  Google Scholar 

  • Cooper WW, Seiford LM, Tone K (2000) Data envelopment analysis: a comprehensive text with models, applications, references and DEA-solver software. Kluwer, Massachusetts

    Google Scholar 

  • Cooper WW, Li S, Seiford LM, Tone K, Thrall RM, Zhu J (2001) Sensitivity and stability analysis in DEA: some recent developments. J Prod Anal 15:217–246

    Article  Google Scholar 

  • Cooper WW, Seiford LM, Zhu J (2004) Handbook on data envelopment analysis. Kluwer, Massachusetts

    Google Scholar 

  • Demerjian PR, Lev BI, McVay SE (2006) Managerial ability and earnings quality. American accounting association 2006 annual meeting paper. Avaiable at http://pages.stern.nyu.edu/∼blev

  • Demerjian PR, Lev BI, McVay SE (2009) Quantifying managerial ability: A new measure and validity tests. AAA 2009 financial accounting and reporting section (FARS) paper; AAA 2009 Management Accounting Section (MAS) Meeting Paper. Available at SSRN: http://ssrn.com/abstract=1266974

  • Dulá JH (1997) Equivalences between data envelopment analysis and the theory of redundancy in linear systems. Eur J Oper Res 101:51–64

    Article  Google Scholar 

  • Färe R, Grosskopf S, Lindgren B, Roos P (1992) Productivity changes in Swedish pharmacies 1980–1989: a non-parametric malmquist approach. J Prod Anal 3:85–101

    Article  Google Scholar 

  • Färe R, Grosskopf S, Norris M, Zhang Z (1994) Productivity growth, technical progress, and efficiency change in industrialized countries. Am Econ Rev 84:66–83

    Google Scholar 

  • Fried HO, Lovell CAK, Schmidt SS, Yaisawarng S (2002) Accounting for environmental effects and statistical noise in data envelopment analysis. J Prod Anal 17:157–174

    Article  Google Scholar 

  • Golany B, Roll Y (1989) An application procedure for DEA. OMEGA-Int J Manag Sci 17:237–270

    Article  Google Scholar 

  • Halkos GE, Salamouris DS (2004) Efficiency measurement of the Greek commercial banks with the use of financial ratios: a data envelopment analysis approach. Manag Accoun Res 15:201–224

    Article  Google Scholar 

  • Jenkins L, Anderson M (2003) A multivariate statistical approach to reducing the number of variables in data envelopment analysis. Eur J Oper Res 147:51–61

    Article  Google Scholar 

  • Kittelsen SAC (1993) Stepwise DEA: Choosing variables for measuring technical efficiency in Norwegian electricity distribution. Memorandum No. 06/93, Department of Economics, University of Oslo, Norway

  • Leverty J, Grace M (2004) Dupes or incompetents? An examination of management’s impact on property-liability insurer distress. Working Paper, Georgia State University, USA

  • Lewin AY, Morey RC, Cook TJ (1982) Evaluating the administrative efficiency of courts. OMEGA-Int J Manag Sci 10:401–411

    Article  Google Scholar 

  • Nunamaker TR (1985) Using data envelopment analysis to measure the efficiency of non-profit organizations: a critical evaluation. Manag Decision Econ 6:50–58

    Article  Google Scholar 

  • Norman M, Stoker B (1991) Data envelopment analysis: the assessment of performance. Wiley, New York

    Google Scholar 

  • Pastor JT, Ruiz JL, Sirvent I (2002) A statistical test for nested radial DEA models. Oper Res 50:728–735

    Article  Google Scholar 

  • Seiford LM, Zhu J (1998) Sensitivity analysis of DEA models for simultaneous changes in all the data. J Oper Res Soc 49:1601–1071

    Google Scholar 

  • Sengupta JK (1990) Tests of efficiency in data envelopment analysis. Compu Oper Res 17:123–132

    Article  Google Scholar 

  • Smith P (1990) Data envelopment analysis applied to financial statements. OMEGA-Int J Manag Sci 18:131–138

    Article  Google Scholar 

  • Smith P (1997) Model misspecification in data envelopment analysis. Ann Oper Res 73:233–252

    Article  Google Scholar 

  • Tavares G (2002) A bibliography of data envelopment analysis. RUTCOR Research Report, RRR-01-02, Rutgers University, USA

  • Thore S, Kozmetsky G, Phillips F (1994) DEA of financial statments data: The US computer industry. J Prod Anal 5:229–248

    Article  Google Scholar 

  • Thompson RG, Singleton FD, Thrall RM, Smith BA (1986) Comparative site evaluations for locating a high-energy physics lab in Texas. Interfaces 16:35–49

    Article  Google Scholar 

  • Ueda T, Hoshiai Y (1997) Application of principal component analysis for parsimonious summarization of DEA inputs and/or outputs. J Oper Res Soc Jpn 40:466–478

    Google Scholar 

  • Wagner JM, Shimshak DG (2007) Stepwise selection of variables in data envelopment analysis: procedures and managerial perspectives. Eur J Oper Res 180:57–67

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kyuwan Choi.

Appendix

Appendix

In this appendix, we provide the rationale behind Equation (5). For convenience of notations, we set Y = y 3. Suppose that Y and ε are independent and that ε follows a normal distribution N(0, σ 2ε ). Let

$$ Y^\epsilon = Y +\epsilon Y = (1+\epsilon)Y. $$

Then we have

$$ \begin{aligned} &Cov(\epsilon Y,Y)\\ &\quad =E [(\epsilon Y-E[\epsilon Y] ) (Y-E[Y] ) ]=E [\epsilon Y (Y-E[Y] ) ]\\ &\quad =E [\epsilon ] \cdot E [ Y (Y-E[Y]) ] = 0, \end{aligned} $$

where we used the independence between Y and ε. Using CovYY) = 0, we also have

$$ \begin{aligned} &Var(Y^\epsilon ) \\ &\quad =Var((1+\epsilon) Y )=Cov(Y+\epsilon Y,Y+ \epsilon Y)\\ &\quad =Var(Y)+Var(\epsilon Y)=Var(Y)+E[(\epsilon Y-E[\epsilon Y] )^2 ]\\ &\quad =Var(Y)+E[\epsilon^2 Y^2 ]\\ &\quad =Var(Y)+\sigma_\epsilon^2 (Var(Y)+E[Y]^2). \end{aligned} $$

Hence, to make ρ = corr(YY ε), we must set

$$ \sigma_\epsilon^2=\frac{1-\rho^2}{\rho^2} \times \frac{\sigma_Y^2}{\sigma_Y^2+E[Y]^2}. $$

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Lee, K., Choi, K. Cross redundancy and sensitivity in DEA models. J Prod Anal 34, 151–165 (2010). https://doi.org/10.1007/s11123-009-0166-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11123-009-0166-2

Keywords

  • Data envelopment analysis (DEA)
  • Efficiency
  • Cross redundancy
  • Sensitivity analysis
  • Simulation
  • Accounting data

JEL classification

  • C67
  • D20
  • M11