1 Introduction

Supplier evaluation is one of the most important tasks of supplier management, and the literature specifically addresses the issue of supplier selection. To structure the related knowledge, a large number of literature reviews have been published. The large-scale review of Wetzstein et al. (2018) based on a co-citation network analysis classifies the topics of the related papers into one of the following clusters: (1) the conceptual foundation of the field; (2) modeling of the procurement environment; (3) handling group decision making and imprecise input data; (4) computational research; (5) green/sustainable research; and, (6) risk-based supplier selection. As the literature reviews reveal, one important stream of publications has focused on methodologies for supporting supplier evaluation. In this regard, AHP and DEA have been identified as the most common approaches (e.g. Ho et al. 2010; Agarwal et al. 2011; Chai et al. 2013).

These methods generally aim at assisting with the selection of the most appropriate, best-performing suppliers through assessing their performance and competencies. It is obviously important that any method that is used can take into account broad decision-making criteria, thereby mapping the actual preferences of the decision-maker.

Negative numbers may occur among supplier evaluation data (e.g. losses in a profit statement) or may be the result of data transformation. Such cases cannot be handled by CCR and BCC data envelopment models (Cook and Seiford 2009) because input and output data cannot be transformed together so that data translation can be applied. Therefore, it is almost exclusively additive DEA models that can be used to deal with such problems. The present paper presents an example of supplier evaluation in such a case. After an efficiency evaluation, we construct a ranking. To do this, we use cross-efficiencies (Doyle and Green 1994) obtained from an additive model. Aggregate efficiencies help rank suppliers in descending order of efficiency.

The paper is organized as follows. The next section summarizes the findings of literature and underlines the relevance of the examples we later present. The third section of the paper describes a DEA framework for supplier evaluation. Data translation is used to filter out negative data in management and environmental criteria, leading to an additive model. The reciprocal DEA model is then presented and compared to the additive DEA models. The model enables the analysis of cross-efficiency (CE) which is used on suppliers. Section 5 presents some conclusions about this work.

2 Literature review

The performance of suppliers has a fundamental influence on the performance of buying companies (Janda and Seshadri 2001; Hartmann et al. 2012; Foerstl et al. 2013; Tchokogué et al. 2017). For this reason, it is important to understand what “best corporate performance” means. The literature includes a number of studies that have investigated which criteria are important from a business point of view, of which sustainability criteria are increasingly emphasized. Supplier evaluation criteria should be those that are relevant to the specific business situation, and the ideal supplier should perform well in relation to each of these. Most methods therefore compare performance indicators and create rankings.

Over the years, several techniques have been developed to evaluate suppliers. Analytic hierarchy process (AHP), analytic network process (ANP), linear programming (LP), mathematical programming, multi-objective programming, data envelopment analysis (DEA), neural networks (NN), case-based reasoning (CBR), and fuzzy set theory (FST) have all been applied in the literature (Chai et al. 2013; Govindan et al. 2015). DEA is one of the most frequently used of these models, being a widely recognized approach for evaluating the efficiencies of decision-making units (DMUs). Because of its ease of use and successful application, DEA has gained much attention and is in widespread use by business and academic researchers.

One of the central problems identified in the literature on DEA is the nature of the data. In the context of supplier evaluation, several papers have been published which address the issue of performance-type data related to DEA. It is certainly an important issue that such methods are capable of comparing performance independent of the units in which the input and output variables are defined (Lovell and Pastor 1995). Traditionally, many supplier evaluation models have been based on cardinal data, with less emphasis on ordinal data. However, emphasis has shifted to the simultaneous consideration of cardinal and ordinal data in the supplier selection process (Saen 2007; Ebrahimi et al. 2018). The DEA base model is primarily designed to handle positive numbers (Pastor and Ruiz 2007), although in some cases negative numbers should be incorporated into the evaluation dataset (Izadikhah and Saen, 2016). Supplier-related decisions often rely on personal judgment, as some criteria reflect expert opinions. Thus, using DEA for an evaluation requires making use of imprecise data (Toloo et al. 2018).

The problem primarily stems from the nature of the data that is incorporated. However, the different nature of supplier–buyer relationships and different supplier strategies can lead to a situation and supplier data structure that can ultimately distort evaluations (Kleinsorge et al. 1992; Bruno et al. 2012; Prajogo et al. 2012).

The concept of game theory is also widely referred to in the purchasing literature (Bai and Sarkis 2016; Ji et al. 2015; Mohammaditabar et al. 2016). An interesting consideration is that suppliers strive both to win business, and to maximize their own profits. In competition with other suppliers, this may lead to the development of tactics whereby suppliers promise (potentially unrealistically) high levels of performance for each metric, which are ultimately significantly lower than expected. In some cases, such strategies may be screened with appropriate specifications, or an appropriate pre-qualification system (de Boer et al. 2001; Hong et al. 2005; Sen et al. 2010; Dobos and Vörösmarty 2019b). However, supplier optimization tactics are not always easy to identify a priori (e.g. in the case of a new purchase) and are thus difficult to prepare for when determining the evaluation process. For this reason, it is of great importance that the supplier evaluation method includes an analysis of supplier parameters as a whole.

There are a number of data problems involved in supplier evaluation that the literature proposes solutions to. If negative data exist for both inputs and outputs, the additive DEA model is recommended, although its use in purchasing is sporadic. However, the supplier evaluation problem highlights the issue of the need to manage outliers in supplier evaluation, which can be achieved through the use of cross-efficiencies. This also allows practitioners to apply additive model cross-efficiency to the ranking problem.

3 DEA framework for supplier evaluation with cross-efficiency

The following supplier selection model is formulated as a decision-making problem. Suppliers are evaluated along management and environmental criteria (Dobos and Vörösmarty 2014, 2019a, b; Vörösmarty and Dobos 2019). The management criteria are the usual supplier evaluation criteria, such as trustworthiness, purchasing price, lead time, and quality of products supplied, etc. We assume that environmental criteria are the outputs of the examined model. A very common method is used to investigate the effects of environmental issues on supplier assessment.

Input and output data must be transformed in DEA models if the input data is not minimized, and the same applies to output data if it is not maximized. In previous papers we explained how this can be done by using reciprocal values (Dobos and Vörösmarty 2019a; Vörösmarty and Dobos 2019). However, it is also possible to choose to scale your variables. This solution was chosen in Dobos and Vörösmarty (2019b). A reciprocal DEA model is presented to illustrate the calculation of cross-efficiency.

In the second model, we have chosen to transform the data into negative values. However, this solution involves making the data translation invariant (Cook and Seiford 2009; Neralić and Wendell 2019). This can be done by adding a positive number to criteria containing negative values, which will make the values of the criterion non-negative. A positive number is used to create new variables of the “lack of” type. These values represent the optimum numbers for the criterion under consideration. The additive model that we present is translation invariant for all data, both input and output.

3.1 The application of a DEA model with reciprocal data in supplier selection

Let us assume that the purchaser evaluates p suppliers. The number of traditional management criteria is n, and the number of environmental criteria is m. The evaluation of supplier i is defined with vectors (xi, yi), where vector xi is the value of the management (input) criteria, and vector yi is the environmental (output) criteria. The input and output vectors of suppliers can be summarized in matrices X and Y, where matrix X = [x1, x2,…, xn] and matrix Y = [y1, y2,…, ym]. It is assumed that the matrices are reciprocally transformed if this is necessary.

DEA is a general framework for evaluating suppliers in the field of materials and supply management in the absence of criteria weights. The application of DEA is based on the categories “inputs,” “outputs,” and efficiencies. The basic method was initiated by Charnes et al. (1978) to determine the efficiency of decision making units (DMU). The model offered by the former is a hyperbolic programming model under linear conditions. The existence of a general solution to such kinds of models was first investigated by Martos (1964), who examined the problem as a special case of a linear programming model. The aim of the DEA model is to construct weights for management (input) and environmental (output) criteria for which the weights are vectors v and u, respectively. The basic DEA model for the first supplier is thus as follows:

$$ {\mathbf{u}} \cdot {\mathbf{y}}_{1} \to { \hbox{max} } $$
(1)
$$ \begin{aligned} & {\text{s}} . {\text{t}} .\\ & {\mathbf{v}} \cdot {\mathbf{x}}_{1} = \, 1, \\ \end{aligned} $$
(2)
$$ - \, {\mathbf{v}} \cdot {\mathbf{X}} + {\mathbf{u}} \cdot {\mathbf{Y}} \le 0, $$
(3)
$$ {\mathbf{u}} \ge {\mathbf{0}},{\mathbf{v}} \ge {\mathbf{0}}. $$
(4)

Model types (1)–(4) can be solved with commercial software such as Microsoft Excel Solver. Throughout the paper we apply this software to construct our numerical examples (see Table 1). Our example fulfils the general rule for the number of decision-making units with regard to valid outcomes. The number of suppliers is equal to 15—i.e. r = max{m·n; 3·(m + n)}, where r is the number of suppliers and numbers m = 2, and n = 3 are the number of outputs and inputs (Cooper et al. 2001).

Table 1 Solution of the DEA model with reciprocal data for the first supplier

The cross-efficiencies of the model (1)–(4) can be calculated for all j models, where value j is the number of a decision-making unit (DMU). Let us assume that the optimal weights (u1; v1) of problems (1)–(4) are known. Then the cross-efficiencies for these weights and Supplier 1 are

$$ E_{j1} = {\mathbf{u}}^{1} \cdot {\mathbf{y}}_{j} /{\mathbf{v}}^{1} \cdot {\mathbf{x}}_{j} ;\quad j = 1,2, \ldots ,r. $$
(5)

The efficiency of self-appraisal, is equal to Ejs = Ejj (Doyle and Green 1994). Peer-appraisal for the first DMU is calculated as the average of the cross-efficiencies:

$$ E_{1p} = \frac{{\mathop{\mathop{\sum}\nolimits_{i = 1}}\limits_ {i \ne 1} }^{p} E_{ji} }{p - 1},\quad j = \, 1,2, \ldots ,r. $$
(6)

We hereby complete the introduction to the basic DEA model. The self-appraisal and peer-appraisal indicators for Supplier 1 are shown in Eqs. (5) and (6).

3.2 The application of an additive DEA model in supplier selection

After introducing “lack of” type variables, the input and output vectors of the suppliers can be summarized in matrices X′ and Y′. Let us formulate the additive DEA model in the next format, assuming that we are examining the efficiency of the first decision-making unit:

$$ - \varepsilon \cdot {\mathbf{1}} \cdot {\mathbf{s}}^{ - } - \varepsilon \cdot {\mathbf{1}} \cdot {\mathbf{s}}^{ + } \to { \hbox{min} } $$
(7)
$$ \begin{aligned} & {\text{s}} . {\text{t}} .\\ & - \, {\mathbf{X^{\prime}}} \cdot {\varvec{\uplambda}} - {\mathbf{s}}^{ - } = - {\mathbf{x}}_{1} \\ \end{aligned} $$
(8)
$$ {\mathbf{Y^{\prime}}} \cdot {\varvec{\uplambda}} - {\mathbf{s}}^{ + } = {\mathbf{y}}_{1} $$
(9)
$$ {\mathbf{1}} \cdot {\varvec{\uplambda}} = 1 $$
(10)
$$ {\varvec{\uplambda}} \ge {\mathbf{0}},\;{\mathbf{s}}^{ - } \ge {\mathbf{0}},\;{\mathbf{s}}^{ + } \ge {\mathbf{0}}. $$
(11)

Model (7)–(11) is the basic dual model for the additive DEA method which can be reformulated in a primal model in the following form

$$ - \, {\mathbf{v}} \cdot {\mathbf{x^{\prime}}}_{1} + {\mathbf{u}} \cdot {\mathbf{y^{\prime}}}_{1} + {\text{u}}_{1} \to { \hbox{max} } $$
(12)
$$ \begin{aligned} & {\text{s}} . {\text{t}} .\\ & - \, {\mathbf{v}}\cdot{\mathbf{X^{\prime}}} + {\mathbf{u}}\cdot{\mathbf{Y^{\prime}}} + {\text{u}}_{1} \cdot {\mathbf{1}} \le {\mathbf{0}} \\ \end{aligned} $$
(13)
$$ {\mathbf{u}} \ge \varepsilon \cdot {\mathbf{1}},{\mathbf{v}} \ge \varepsilon \cdot {\mathbf{1}}. $$
(14)

The self-appraisal and peer-appraisal indexes of the additive DEA model can be determined in a similar way to that of Eqs. (5) and (6).

4 Numerical examples

Examples of numerical data are presented in “Appendix 1”. Based on these, we first determine the values of self-appraisal and peer-appraisal with the data obtained by reciprocal transformation. Then we transform the data with the additive model. Section 4.3 compares the results of self- and peer-appraisal indicators for reciprocal and additive models.

4.1 A DEA model with reciprocal data for supplier selection

Let us transform the data in “Appendix 1” into a form that better fits maximization criterion; i.e., gives higher values than those of a less good evaluation. If a better evaluation produces a higher value, then the evaluation of that criterion shall not be changed. (This is the case, for example, for reusability, lead time, and price.) If a better criterion is awarded a lower value, then we use the inverse (i.e. reciprocals of the data). The data used for this type of analysis are shown in “Appendix 2”.

The linear programming model gives the following weights for solving the problem for the first supplier. For problems (1)–(4), the optimal weights are given in Table 1. The DEA efficiency measures for the first supplier are shown in the first column of Table 2. The other columns in the matrix present the relevant DEA efficiency of the supplier and the cross-efficiencies.

Table 2 DEA-efficiencies and Cross-efficiencies for DEA model with reciprocal data

The most DEA efficient and cross-efficient decision-making units with maximal values of one are suppliers 7, 8, and 15. The first supplier in our case has an efficiency score (i.e. self-appraisal) of 0.951, which is relatively high.

In this numerical example, two sets of criteria were formulated: management as input (traditional purchasing) criteria, and environmental, or output criteria.

The weights vector suggests that the weight of all classical purchasing criteria should be incorporated into the evaluation of suppliers. The criterion of reusability received a weight in the analysis, but the criterion of CO2 emissions was not evaluated in this model.

The efficiency of each supplier is the solution of problem (1)–(4) by optimizing the values for their own criteria. In the present case we have to solve 15 linear programming problems. When solving certain problems, cross-efficiencies are incidental; that is, with optimal weights the efficiency of other suppliers can be determined.

Table 2 contains the DEA efficiencies and cross-efficiencies for all suppliers. The diagonal of the matrix shows the DEA efficiencies, which are colored gray. The white elements of the vertical columns include the 14 cross-efficiencies with the optimum weights for the particular supplier.

Table 5 includes the self- and peer-appraisal values of suppliers. Suppliers 2, 4, 6, 7, 8, 9, 10, 11, 13, and 15 have the highest efficiency, i.e., one. These suppliers are also Pareto efficient, meaning there is no other supplier that outperforms them. However, the remaining five suppliers are inefficient. The cross-efficiencies of suppliers 7, and 15 are around 0.8, or greater. As the purpose of supplier evaluation is to help with selecting a supplier, one should be chosen that clearly has high cross-efficiency and DEA efficiency. In this case, Supplier 10 is efficient, but in terms of peer-appraisal it is weak.

This completes the analysis.

4.2 An additive DEA model in supplier selection

Let us now transform the data from “Appendix 1” for the additive model. If a better evaluation has a higher value, than we will not change the evaluation of that criterion. If a lower value for a criterion is better, then we will reduce the value of the maximum available quality (i.e. 100 percent) by the supplier’s value for quality, while for CO2 emissions 40 g/t is the least best value, representing the highest CO2 emission of any supplier in this sector. These two upper bounds can be interpreted as “lack of” parameters. Quality has an upper bound of 100 percent, and we assume an “industrial worst” technology with 40 g/t CO2 emissions. The data used for the analysis are shown in “Appendix 3”.

The linear programming model (12)–(14) offers the following weights for solving the problem for the first supplier. These are presented in Table 3. The ε value for this model is 10−5. The DEA efficiency measures are shown in the first column of Table 4. The other columns in the matrix present the relevant DEA efficiency of the supplier and the cross-efficiencies as well.

Table 3 Solution of the DEA model for the first supplier
Table 4 DEA-efficiencies and cross-efficiencies for additive DEA model

The weights vector suggests that the weight of all classical purchasing features should be incorporated into the evaluation of suppliers. Reusability and CO2 emissions receive a weight in the analysis as well. The weighting of the environmental criteria is exactly the minimum possible (i.e. 10−5). This means that these weights are effective in the sense that they take on the lowest possible value.

The most efficient decision-making units with values of over 0.9 are suppliers 4, 6, 7, 8 and 15. The first supplier in our case has an efficiency score of 0.823, which is relatively high.

Analysis from the literature suggests that overall performance can be mapped according to vendor evaluation criteria, such that each of the evaluation criteria is an important company parameter. It is often unacceptable for a company to have a supplier who only fulfils a single criterion well (even if they are extremely good in this regard). According to the literature, such a situation is the result of a utility-maximizing supplier.

The analysis of additive model is complete.

4.3 Comparison of self- and peer-appraisal indicators for reciprocal and additive DEA models

DEA efficiency is relatively easy to interpret. However, there are several interpretations of cross-efficiency (Doyle and Green 1994) One interpretation is that DEA efficiency can involve self-appraisal, meaning that the supplier tries to maximize their own efficiency. Cross-efficiencies, on the other hand, develop through peer-appraisal. Of course, cross-efficiency is always less than DEA efficiency. The method of calculation of peer-appraisal indexes is shown in (6). Table 5 summarizes the self-appraisal and peer-appraisal of reciprocal and additive models.

Table 5 Efficiency measures of the suppliers

Reciprocal and additive models are compared using a correlation coefficient. First, we discuss the relationship between self-appraisal and peer-appraisal in reciprocal and additive models. In the reciprocal model, the correlation between the two efficiency measures is 0.479. This suggests that there is only a moderate relationship between the two metrics, so DEA efficiency reassesses the value obtained for efficiency using peer-appraisal. In the additive model, the correlation is 0.776, which indicates strong correlation. This shows that the self-appraisal and peer-appraisal metrics are similar.

The correlation coefficient between the two models’ self-appraisal indices is 0.698. This suggests that if there is a difference between the two models, it is irrelevant. The correlation between the two peer-appraisal metrics is 0.789. Interestingly, the relationship between peer-appraisal metrics is stronger than that between self-appraisal metrics.

In summarizing the results, we can state that the results obtained by the two data processing methods give almost similar results. Only a significant difference between the two efficiency indicators for the reciprocal model can be identified. This also means that the results are less dependent on data transformation.

5 Conclusion

In this paper, a cross-efficiency DEA model was defined. The proposed model addresses two problems. First, we provide an example of supplier evaluation using negative data. In classical DEA models, input and output data cannot be transformed together, so data translation must be undertaken. Therefore, additive DEA models are almost exclusively used to deal with such problems.

Second, the method should respond to modifications in suppliers’ offers in the case that a buyer favors overall good performance. To filter suppliers with data outliers in supplier evaluation, it is important to distinguish between self-appraisal and peer-appraisal. Self-appraisal metrics cover DEA efficiency, while peer-appraisal indicators cover cross-efficiency. In traditional DEA models it is possible to filter out efficient suppliers based on self-appraisal. Moreover, by applying the concept of cross-efficiency it is possible to differentiate between generally good performance and partially good performance with the help of peer-appraisal.

Finally, reciprocal and additive model self- and peer-appraisal indicators were compared. The correlation analysis showed a very strong linear relationship in terms of the efficiency of the two DEA models in our numerical example. The conditions under which the efficiency measures applied in the two models differ should be further investigated.

Cross-efficiency is rarely used in an additive model. In additive DEA models, efficiency can be greater than one. The numerical example presented here indicates that all self-appraisal DEA and aggregate peer-appraisal cross-efficiency values can be between zero and one.

The role of weights may also be the subject of future study.