Abstract
Data envelopment analysis (DEA) measures the efficiency of each decision making unit (DMU) by maximizing the ratio of virtual output to virtual input with the constraint that the ratio does not exceed one for each DMU. In the case that one output variable has a linear dependence (conic dependence, to be precise) with the other output variables, it can be hypothesized that the addition or deletion of such an output variable would not change the efficiency estimates. This is also the case for input variables. However, in the case that a certain set of input and output variables is linearly dependent, the effect of such a dependency on DEA is not clear. In this paper, we call such a dependency a cross redundancy and examine the effect of a cross redundancy on DEA. We prove that the addition or deletion of a cross-redundant variable does not affect the efficiency estimates yielded by the CCR or BCC models. Furthermore, we present a sensitivity analysis to examine the effect of an imperfect cross redundancy on DEA by using accounting data obtained from United States exchange-listed companies.
This is a preview of subscription content, access via your institution.



Notes
Nunamaker (1985), however, points out that even the addition of a highly correlated variable may alter the resulting DEA efficiency estimates substantially.
Principal component analysis is based on the covariance matrix between variables.
In the jargon of statistics, if there is a cross redundancy between input and output variables, then the absolute value of the first canonical correlation equals one.
Dulá (1997) examined relationships between DEA and the theory of redundancy in linear systems, but he studied the redundancy of a set of DMUs with respect to the other DMUs. The current study focuses on the redundancy of variables.
The current study focuses on the CCR and BCC models. For the output-oriented BCC models, to be precise, inputs are not restricted to be nonnegative. But, for the output-oriented CCR models, inputs are restricted to be nonnegative [refer to page 102 in Cooper et al. (2000)]. Hence, to accommodate both output-oriented CCR and BCC models, we consider only nonnegative inputs and outputs when we deal with output-oriented models. Also, practically, negative inputs are not useful.
We analyze the output-oriented models separately because we believe that the proof of Theorem 2 is not easily deduced from the proof of Theorem 1.
The item number represents the numeric code used in the Compustat database.
This is a slight abuse of notations in the sense that we are using subindices, not superindices, to enumerate input or output variables. But, using these vector notations simplifies the presentation of our results.
In this study, we use an absolute efficiency change rather than a percentage efficiency change to measure sensitivity. Our rationale for this is as follows. Suppose there are only two DMUs and efficiency scores change from 0.1 to 0.2 for one DMU and from 0.9 to 1.0 for the other DMU after the inclusion of additional output variables. If we use a mean absolute efficiency change as a summary measure, we will report a value of 0.1. If we use a mean percentage efficiency change as a summary measure, we will report a value of 55.6%. Which one between 0.1 and 55.6% is more informative as a single summary measure? Although answers would depend on analysts’ subjective tastes, our opinion is that, if we use a percentage change, then summary measures usually employed, such as the arithmetic or geometric mean, would put disproportionately larger weights on efficiency changes from lower efficiency scores than on those from higher efficiency scores. For this reason, we believe that it would be more difficult to interpret summary measures based on percentage change than those based on absolute change. Also, we suspect that this would be one main reason that several prior studies (e.g., Banker et al. 1993, 1996a, and 2004) use an absolute change (the mean absolute deviation, to be precise) as a measure of sensitivity.
Note, however, that R 2 is not exactly equal to the degree of cross redundancy between input and output variables. In Equation (2), we restricted the sign of coefficients a r and b i to be nonnegative, but in the regression of net income on sales and other inputs, the coefficients are not restricted in sign.
As a perturbation scheme, we can also consider an additive perturbation scheme: y ε3 = y 3 + ε. In this case, to make ρ = corr(y 3, y ε3 ), we need to set \(\sigma^{2}_{\epsilon} = (1 - {\rho ^{2}}) {\sigma^{2}_{y_{3}}}/ \rho^2\). For industry #33, \({\sigma^{2}_{y_{3}}}\) is greater than twenty million. Hence, even if ρ2 = 0.99, we must have σ 2ε ≥ 200000. However, for a number of DMUs in this industry, the value of y 3 is close to zero. Therefore, the additive perturbation scheme, y ε3 = y 3 + ε, results in a change in the sign of y 3 for a large number of DMUs, which alters the DEA results significantly. Thus, in the current study, we believe that use of the multiplicative perturbation scheme is appropriate.
We repeated our simulation study using data from two other industries, industry #31 (Utilities) and industry #43 (Restaurants, Hotels, and Motels). The results from these industries are much better than the results in Table 2. Here, the term “better” means that the effect of a perturbation on DEA is smaller.
For a sample from any distribution, a larger the sample size results in a larger maximum difference. Hence, a maximum difference increase is expected as the sample size increases.
References
Andersen P, Petersen NC (1993) A procedure for ranking efficient units in data envelopment analysis. Manag Sci 39:1261–1264
Banker RD (1993) Maximum likelihood, consistency and data envelopment analysis: a statistical foundation. Manag Sci 39:1265–1273
Banker RD (1996) Hypothesis tests using data envelopment analysis. J Prod Anal 7:139–159
Banker RD, Bardhan I, Cooper WW (1996) A note on returns to scale in DEA. Eur J Oper Res 88:583–585
Banker RD, Chang H, Cooper WW (1996a) Simulation studies of efficiency, returns to scale and misspecification with nonlinear functions in DEA. Ann Oper Res 66:233–253
Banker RD, Chang H, Cooper WW (1996b) Equivalence and implementation of alternative methods for determining returns to scale in data envelopment analysis. Eur J Oper Res 89:473–481
Banker RD, Chang H, Cooper WW (2004) A simulation study of DEA and parametric frontier models in the presence of heteroscedasticity. Eur J Oper Res 153:624–640
Banker RD, Charnes A, Cooper WW (1984) Some models for estimating technical and scale inefficiencies in data envelopment analysis. Manag Sci 30:1078–1092
Banker RD, Gadh M, Gorr WL (1993) A Monte Carlo comparison of two production frontier estimation metods: corrected ordinary least squares and data envelopment analysis. Eur J Oper Res 67:332–343
Banker RD, Thrall RM (1992) Estimation of returns to scale using data envelopment analysis. Eur J Oper Res 62:74–84
Barvinok A (2002) A course in convexity. American Mathematical Society, USA
Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, Cambridge
Caves DW, Christensen LR, Diewert WE (1982) The economic theory of index numbers and the measurement of input, output and productivity. Econometrica 50:1393–1414
Charnes A, Cooper WW, Lewin AY, Morey RC, Rousseau J (1985) Sensitivity and stability analysis in DEA. Ann Oper Res 2:139–156
Charnes A, Cooper WW, Rhodes E (1978) Measuring the efficiency of decision making units. Eur J Oper Res 2:429–444
Cook WD, Kress M, Seiford LM (1992) Prioritization models for frontier decision making units in DEA. Eur J Oper Res 59:319–323
Cooper WW, Seiford LM, Tone K (2000) Data envelopment analysis: a comprehensive text with models, applications, references and DEA-solver software. Kluwer, Massachusetts
Cooper WW, Li S, Seiford LM, Tone K, Thrall RM, Zhu J (2001) Sensitivity and stability analysis in DEA: some recent developments. J Prod Anal 15:217–246
Cooper WW, Seiford LM, Zhu J (2004) Handbook on data envelopment analysis. Kluwer, Massachusetts
Demerjian PR, Lev BI, McVay SE (2006) Managerial ability and earnings quality. American accounting association 2006 annual meeting paper. Avaiable at http://pages.stern.nyu.edu/∼blev
Demerjian PR, Lev BI, McVay SE (2009) Quantifying managerial ability: A new measure and validity tests. AAA 2009 financial accounting and reporting section (FARS) paper; AAA 2009 Management Accounting Section (MAS) Meeting Paper. Available at SSRN: http://ssrn.com/abstract=1266974
Dulá JH (1997) Equivalences between data envelopment analysis and the theory of redundancy in linear systems. Eur J Oper Res 101:51–64
Färe R, Grosskopf S, Lindgren B, Roos P (1992) Productivity changes in Swedish pharmacies 1980–1989: a non-parametric malmquist approach. J Prod Anal 3:85–101
Färe R, Grosskopf S, Norris M, Zhang Z (1994) Productivity growth, technical progress, and efficiency change in industrialized countries. Am Econ Rev 84:66–83
Fried HO, Lovell CAK, Schmidt SS, Yaisawarng S (2002) Accounting for environmental effects and statistical noise in data envelopment analysis. J Prod Anal 17:157–174
Golany B, Roll Y (1989) An application procedure for DEA. OMEGA-Int J Manag Sci 17:237–270
Halkos GE, Salamouris DS (2004) Efficiency measurement of the Greek commercial banks with the use of financial ratios: a data envelopment analysis approach. Manag Accoun Res 15:201–224
Jenkins L, Anderson M (2003) A multivariate statistical approach to reducing the number of variables in data envelopment analysis. Eur J Oper Res 147:51–61
Kittelsen SAC (1993) Stepwise DEA: Choosing variables for measuring technical efficiency in Norwegian electricity distribution. Memorandum No. 06/93, Department of Economics, University of Oslo, Norway
Leverty J, Grace M (2004) Dupes or incompetents? An examination of management’s impact on property-liability insurer distress. Working Paper, Georgia State University, USA
Lewin AY, Morey RC, Cook TJ (1982) Evaluating the administrative efficiency of courts. OMEGA-Int J Manag Sci 10:401–411
Nunamaker TR (1985) Using data envelopment analysis to measure the efficiency of non-profit organizations: a critical evaluation. Manag Decision Econ 6:50–58
Norman M, Stoker B (1991) Data envelopment analysis: the assessment of performance. Wiley, New York
Pastor JT, Ruiz JL, Sirvent I (2002) A statistical test for nested radial DEA models. Oper Res 50:728–735
Seiford LM, Zhu J (1998) Sensitivity analysis of DEA models for simultaneous changes in all the data. J Oper Res Soc 49:1601–1071
Sengupta JK (1990) Tests of efficiency in data envelopment analysis. Compu Oper Res 17:123–132
Smith P (1990) Data envelopment analysis applied to financial statements. OMEGA-Int J Manag Sci 18:131–138
Smith P (1997) Model misspecification in data envelopment analysis. Ann Oper Res 73:233–252
Tavares G (2002) A bibliography of data envelopment analysis. RUTCOR Research Report, RRR-01-02, Rutgers University, USA
Thore S, Kozmetsky G, Phillips F (1994) DEA of financial statments data: The US computer industry. J Prod Anal 5:229–248
Thompson RG, Singleton FD, Thrall RM, Smith BA (1986) Comparative site evaluations for locating a high-energy physics lab in Texas. Interfaces 16:35–49
Ueda T, Hoshiai Y (1997) Application of principal component analysis for parsimonious summarization of DEA inputs and/or outputs. J Oper Res Soc Jpn 40:466–478
Wagner JM, Shimshak DG (2007) Stepwise selection of variables in data envelopment analysis: procedures and managerial perspectives. Eur J Oper Res 180:57–67
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
In this appendix, we provide the rationale behind Equation (5). For convenience of notations, we set Y = y 3. Suppose that Y and ε are independent and that ε follows a normal distribution N(0, σ 2ε ). Let
Then we have
where we used the independence between Y and ε. Using Cov(εY, Y) = 0, we also have
Hence, to make ρ = corr(Y, Y ε), we must set
Rights and permissions
About this article
Cite this article
Lee, K., Choi, K. Cross redundancy and sensitivity in DEA models. J Prod Anal 34, 151–165 (2010). https://doi.org/10.1007/s11123-009-0166-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11123-009-0166-2
Keywords
- Data envelopment analysis (DEA)
- Efficiency
- Cross redundancy
- Sensitivity analysis
- Simulation
- Accounting data
JEL classification
- C67
- D20
- M11