1 Introduction

Productivity measurement and efficiency in particular is a common term among the discipline of production economics. Each organization, firm, enterprise, business bank, can extract the efficiency of its branches, units if the latter are described by inputs and outputs. Efficiency is generally defined as the ratio of the amount of output produced given the unit’s inputs or resources. Thus, in order to measure the efficiency of a unit, certain inputs and outputs of a unit should be provided a priori. Deploying Data Envelopment Analysis (DEA) technique, the efficiency of each unit can be calculated (Charnes et al. 1981, 1985).

In the past decades there have been advances in Integer Programming (IP); one of which is Mixed Logical Linear Programming (MLLP) or Mixed Integer Linear (MILP) Programming models, where binary variables provide information regarding the installation of a plant or the selection of a route or a procedure, depending on the nature of the model (Hooker and Osorio 1999). This “family” of problems is generally solved with IP techniques, one of which is Branch and Bound (B&B) (Ross and Soland 1975). Based on this approach, a tree representation of the problem is provided; given a minimization direction to the problem (a transformation can be deployed if there is a maximization direction in the objective function) each node is examined separately for feasibility and for objective value improvement.

The optimal design of supply chain network problem is an extension of a transportation problem. Based on this type of problem, multiple echelons are considered representing the different stages of transportation or manufacturing process that products are subject to. The aim of these type of problems includes the minimization of total operational cost, lead time, stock out instances, or the maximization of profit, or customer satisfaction that the firm or enterprise will gain. Assuming that a supply chain is designed with respect to warehouse selection and installation, then each potentially installed warehouse has inputs and outputs which are schematically presented in Fig. 1.

Fig. 1
figure 1

Inputs and outputs of a warehouse

Due to the existence of binary variables B&B solution approach can be easily implemented providing solutions that are subjected to a single criterion. However, as can be seen in Fig. 1, considering each potentially installed warehouse as an entity then another conflicting objective is added to the problem. Based on this new objective facilities are not selected only based on cost minimization (or profit maximization) but also on whether these solutions are efficient. If more than one sites are selected then a possible approach is to add a single sourcing constraint which will select a single facility that reduces greatly the overall cost, but it may also cause the problem to become infeasible. In this case, other methodologies must be employed in order to provide solution to the problem.

The proposed algorithm is formulated in order to reduce the number of facilities (warehouses) in a supply network design problem. When designing the supply chain network, the Decision Maker (DM) seeks for less warehouses (facilities in general) as possible in order to reduce cost. This is obtained through economies of scale, however, reduction in facilities leads to less customers’ satisfaction. Generally the models that are used in order to design supply chain network, use single sourcing constraint in order to reduce the number of facilities by imposing the model to select one facility to accommodate a cluster of demand zones. This constraint is hard and often leads to infeasibility. In case of infeasibility, the model has to be solved without single sourcing constraint by letting the model to select as much facilities (warehouses) as needed in order to satisfy demand of the customers. Also, the cost is significantly increasing, depending on the demand pattern. On the contrary, the presented algorithm selects the warehouses based on their efficiency, leading to better results, both by reducing cost and by increasing efficiency. The algorithm filters all the optimal results derived from the initial MILP model and iteratively selects those warehouses that are fully technically efficient. Some of the methods that are employed for solving supply chain network design problems, in case of infeasibility, are Lagrangean Relaxation and Benders Decomposition, however, these methods use relaxation of “hard” constraints in order to solve the problem. The presented algorithm solves the problem based on two dimensions: cost and efficiency. Additional constraints remove inefficient solutions. In case that this new problem is now infeasible, different thresholds of efficiency are considered in the constraint, where the lower level of that percentage is left to DM to decide.

In this work, which is an extension of the work of Grigoroudis et al. (2014), a Branch and Efficiency (B&E) algorithm is proposed for the optimal design of supply chain networks. Through an iterative procedure, an initial vector of solutions is provided along with the inputs and outputs of each solution. Each solution is filtered in the following stages through constraints (efficiency cuts). The algorithm stops if the number of the non-zero solutions of the final vector is less than a certain pre-determined level, or there is no change in objective function’s value. An approach that is integrating DEA technique in the selection of solutions, providing a Multi-Objective Programming model, as solutions are not only subjected to constraints to the problem, but also to the “efficiency cuts” that are posed by B&E approach, has not yet been proposed in the supply chain literature.

2 Literature review

In the recent years supply chain design literature has expanded to consider the rapidly changing economic environment in which a supply chain network should be designed. There has been proposed a plethora of mathematical programming models that have been applied to supply chain network design problems of which Mixed Integer Linear Programming (MILP) and Mixed Integer Non Linear Programming (MINLP) models have been widely used, providing generic frameworks for managerial use.

Optimal supply chain network design models are divided into two categories; the steady state and the multi-period ones. In the first case, time is absent from the analysis, and this type of formulation provides average levels of decisions, while in the multi-period models, the decisions are made with respect to the planning horizon.

The key point in modeling supply chain networks is the demand uncertainty. A number of studies have captured the stochasticity with distribution functions that best describe demand.

Tsiakis et al. (2001) proposed a steady state model for the optimal supply chain network design, with decisions that regard the installation of facilities (distribution centers and warehouses). Demand uncertainty is modeled through different demand and capacity parameter scenarios.

The optimal design of supply chain networks has been also proposed by Petridis (2015). Demand stochasticity is proposed by Normal Distribution, while probabilistic constraint are integrated in a single framework for the optimal supply chain network providing decisions about the occurrence of stock out instances.

In their work, Rodriguez et al. (2014) employed a multi-period model for the optimal design of supply chain taking into consideration demand uncertainty providing a production plan that integrates tactical and strategic decisions.

Supply chain management in global scale integrates decisions that take into account the factors that contribute to the sustainability of a supply chain network. A holistic approach towards the optimization of sustainability subject to the economic, ecological and social objectives has been proposed in the work of Kannegiesser and Günther (2014).

In the context of supply chain network design model, DEA formulations have been proposed in order to assess the efficiency of supply chain networks (Network DEA, Two stage DEA formulation etc). The following works demonstrate the use of DEA to supply chain area. However, most applications of DEA to supply chain systems is performed in an entirely different context to the one presented in this paper.

A non linear programming model has been proposed by Liang et al. (2006) for the evaluation of supply chain efficiency under intermediated performance evaluation.

A multi-phase supply chain network design model has been proposed by Talluri and Baker (2002) utilizing DEA technique along with a game theoretic approach for pairwise evaluation of performance, designing the supply chain and providing optimal routing decisions.

Besides the evaluation of supply chain efficiency, sub operations conducted among the nodes of supply chain is also of major importance. In their work, Cheung and Hausman (2000), propose an exact measure efficiency measurement of (QR) policy of a two-echelon, multiple retailers system. Evaluation of supply chain performance has been proposed in the work of Forker et al. (1997) where through the combination of non linear DEA and regression analysis, Total Quality Management measures were provided.

Frota Neto et al. (2008) applied DEA and utilized DEA technique’s ability for efficiency extraction integrating into a unified framework with a multi-objective optimization model.

In their work Chen and Yan (2011) proposed a special network DEA approach in order to provide exact modeling with respect to the internal interactions of the supply chain. Similar works have been also proposed by Prieto and Zofío (2007), Huang et al. (2010) and Färe and Grosskopf (2000) proposed Network DEA models that can eventually be applied to the supply chain network evaluation framework.

Yang et al. (2011), have proposed an exact production possibility set for evaluating the performance different forms of supply chain models, while a game theoretic DEA model has been proposed by Chen et al. (2006), for analyzing the efficiency game between two supply chain parts. The model is proposed to explain bargaining supplier and manufacturer’s behavior for decision process. The internal supply chain performance, has been examined by Wong and Wong (2007) using technical and cost efficiency models. Using this DEA to measure the internal operations, inefficiencies in supply chain operations can be identified.

Generally, DEA technique has been used in order to provide efficiency of whole supply chain network system or to measure the performance of specific critical sub-systems, like suppliers, plants etc. Yet, the data for the application of DEA technique are provided a-priori, while the results are not taken into account during the optimization technique.

In this paper a DEA based algorithm is proposed for the optimal design of supply chain networks design. The algorithm utilizes the properties of DEA technique to provide productivity scores based on multiple inputs and outputs for each examined unit. It is assumed that in the proposed algorithm, except for the maximization of profit or revenue (minimization of cost), there is another objective based on which the selection of solutions is conducted, the maximization of selected solutions. The proposed algorithm is called Branch and Efficiency (B&E) as in each iteration the algorithm adds “efficiency cuts”, which are constraints to filter only the feasible and efficient solutions to be accepted. None of the existing papers in the literature have proposed such an algorithm until now.

3 B&E algorithm

3.1 Introduction to B&E

In general, a Mixed Integer Linear Programming (MILP) model is expressed with mathematical formulation as in (1) whereas \(\mathbf{x}\) represents the vector of continuous variables while \(\mathbf{y}\) the vector of binary variables. In MILP formulation (1), \(\mathbf{c}^{\mathbf{T}}\) and \(\mathbf{d}^{\mathbf{T}}\) are the coefficients of continuous and binary variables correspondingly. Finally, \(\mathbf{f}\) represents the vector of right-hand side of constraints while \(\mathbf{x}^{\mathbf{L}}\) and \(\mathbf{x}^{U}\) are vectors representing the upper and lower bounds placed on non-negative variables \(\mathbf{x}\).

$$\begin{aligned}&P_0 \min \mathbf{c}^{\mathbf{T}}\cdot \mathbf{x}+\mathbf{d}^{\mathbf{T}}\cdot \mathbf{y}\nonumber \\&s.t.\nonumber \\&\mathbf{A}\cdot \mathbf{x}+\mathbf{B}\cdot \mathbf{y}\le \mathbf{f}\nonumber \\&\mathbf{x}^{\mathbf{L}}\cdot \mathbf{y}\le \mathbf{x}\le \mathbf{x}^{\mathbf{U}}\cdot \mathbf{y}\nonumber \\&\mathbf{x}\ge \mathbf{0},\quad \mathbf{y}\in \left\{ {0, 1} \right\} \end{aligned}$$
(1)

Continuous variables represent upstream or downstream flows towards the different nodes of the supply chain while binary variables represent the decision of connections or the selected facilities. Setting in advance what are the characteristics that the DM would like to measure in each of the selected binary variables (facilities), inputs and outputs should be provided. Assuming that \(\mathbf{h}\) and \(\mathbf{k}\) are sub-matrices of \(\mathbf{x}\) and \(\mathbf{y}\), that contain nonzero continuous or integer solutions that will be introduced as inputs and outputs correspondingly. The following Linear Programming (LP) model represents an output oriented DEA model in a matrix form. The sub-matrices \(\mathbf{h}\) and \(\mathbf{k}\) of \(\mathbf{x}\) and \(\mathbf{y}\) are used in order to extract the efficiency for each binary solution set to 1.

$$\begin{aligned}&\mathbf{DEA} \max \mathbf{z}\nonumber \\&s.t.\nonumber \\&{\varvec{\Lambda }}^{\mathbf{T}}\cdot \mathbf{h}\le {\mathbf{h}'}\nonumber \\&{\varvec{\Lambda }}^{\mathbf{T}}\cdot \mathbf{k}\ge {\mathbf{k}'}\cdot \mathbf{z}\nonumber \\&{\varvec{\Lambda }} \ge \mathbf{0} \end{aligned}$$
(2)

The aim of this approach is the minimization of total cost by selecting the decisions that concentrate maximum efficiency. For this purpose, a new MILP model is formulated with constraints with respect to efficiency filtering of solutions. In the following MILP model, efficiency cut constraints are introduced for the filtering of solutions that concentrate highest efficiency. Binary variables \(\mathbf{y}\) are now replaced by binary variables \({\varvec{\upxi }}\) that are used to select the efficient solutions.

$$\begin{aligned}&P^{\varepsilon }_0 \min \mathbf{c}^{\mathbf{T}}\cdot \mathbf{x}+\mathbf{d}^{\mathbf{T}}\cdot {\varvec{\upxi }}\nonumber \\&s.t.\nonumber \\&\mathbf{A}\cdot \mathbf{x}+\mathbf{B}\cdot {\varvec{\upxi }} \le \mathbf{f}\nonumber \\&\mathbf{x}^{\mathbf{L}}\cdot {\varvec{\upxi }} \le \mathbf{x}\le \mathbf{x}^{\mathbf{U}}\cdot {\varvec{\upxi }}\nonumber \\&\mathbf{E}\cdot {\varvec{\upxi }} \ge {\varvec{\alpha }}\nonumber \\&\mathbf{x}\ge \mathbf{0},\quad {\varvec{\upxi }}\in \left\{ {0, 1} \right\} \end{aligned}$$
(3)

As it can be seen in (3), the formulation is the same as in (1) but due to the fact that only the efficient solutions are selected with addition of efficiency constraint \(\mathbf{E}\cdot {\varvec{\upxi }} \ge {\varvec{\alpha }}\). In formulation (3), \({\varvec{\alpha }}\) is a vector of efficiency where the solutions with efficiency score greater or equal than this threshold, are selected with variables \({\varvec{\upxi }}\). The solution approach is shown in Fig. 2. Initially the iterations counter \((\varepsilon )\) is set to 0 while the set of selected solutions N is empty. Solving MILP model (1) an upper bound for the objective value is obtained; this value will be used as an indicator for the next steps of the B&E algorithm. The solutions provided by (1) will be used as inputs and outputs for efficiency evaluation via DEA technique. The inputs and outputs must be determined in advance by the DM so that would characterize each potential’s solution efficiency. It must be noted that if the number of DMUs is less than an acceptable threshold \(\left( T \right) \), where DEA cannot provide reliable results, B&E algorithm stops. Here it is assumed that \(T=\max \left\{ {m\cdot n, 3\cdot \left( {m+n} \right) } \right\} \). This can be modeled with the cardinality of the positive solutions of problem \(P_0\) and is defined as \(S=\left\{ {{X^{*}\in F}{\Big /}{X^{*}>0}} \right\} \). This means that each problem is unique and a special customization should be performed in advance.

After efficiency calculation, Technical Efficiency (TE) is calculated as an efficiency measure, and is defined as the reciprocal of \(\varphi \) (Andersen and Petersen 1993). Additional constraints are introduced in order to allow only solutions that gather efficiency equal to 1. Correspondingly, new binary variables \(({\varvec{\upxi }})\) are introduced replacing those that concerned the selection of a solution. These variables are triggered only if the solution is efficient.

In case where optimal solutions for binary variables \(({\varvec{\upxi }}^{*})\) yield infeasibility, the range of efficiency becomes wider in order to incorporate the minimum number of solutions that make the problem to be solved to optimality and reduce the objective function value. The algorithm stops when the number of DMUs drops below a certain level or there is no change in the objective function value.

Fig. 2
figure 2

The proposed B&E algorithm in a flowchart presentation

Throughout this procedure, the initially empty set of solutions N is filled with solutions that satisfy the conditions of iterative set \(J^{\gamma }\).

3.2 Notation

Index

i :

Plant

j :

Warehouse

k :

Customer

\(\gamma \) :

Iteration

Parameters

\(P_i^U\) :

Upper bound of produced quantities at plant i

\(P_i^L\) :

Lower bound of produced quantities at plant i

\(Q_{ij}^U\) :

Upper bound of transported quantities from plant i to warehouse j

\(Q_{jk}^U\) :

Upper bound of transported quantities from warehouse j customer k

\(W_j^U\) :

Upper capacity of warehouse j

\(\beta _j\) :

Coefficient relating quantity at capacity at warehouse j

\(I_j\) :

Inventory level stored warehouse j

Monetary parameters

\(c_i^P\) :

Production cost at plant i

\(c_{ij}^V\) :

Unit transportation cost of products transported from plant i to warehouse j

\(c_{ij}^F\) :

Rout transportation cost of products transported from plant i to warehouse j

\(c_{jk}^V\) :

Unit transportation cost of products transported from warehouse j to customer k

\(c_{jk}^F\) :

Route transportation cost of products transported from warehouse j to customer k

\(c_k^{{ PEN}}\) :

Penalty cost assigned to uncovered demand of customer k

\(F_j\) :

Installation cost of warehouse j

Continuous variables

\(P_i\) :

Production quantity at plant i

\(Q_{ij}\) :

Transported quantity from plant i to warehouse j

\(Q_{jk}\) :

Transported quantity from warehouse j to customer k

\(W_j\) :

Capacity of warehouse j

\(\lambda _j\) :

Lambda (peers)

\(g_k\) :

Variable modeling deficit in demand of customer k

\({ TC}\) :

Total cost

\(\varphi _j\) :

Efficiency of each

Binary variables

\(X_{ij}\) :

1 if the connection between plant i and warehouse j exists, 0 otherwise

\(X_{jk}\) :

1 if the connection between warehouse j exists and customer k exists, 0 otherwise

\(Y_j\) :

1 if warehouse j will be installed, 0 otherwise

\(\xi _j\) :

1 if warehouse j will be installed under efficiency level a %, 0 otherwise

3.3 Introduction through Supply Chain Network Design (SCDN) problem

In order to demostrate the applicability of the B&E algorithm, an introduction to the proposed mathematical programming algorithm is provided, via an application of a simple SCDN problem.

In Fig. 3, the network of the supply chain is presented. In the supply chain network proposed here it is assumed that only a single product is manufactured, stored and transported throughout the channels of the network. The application of B&E algorithm is intensively demonstrated in a simple SCDN model, so that it can be analytically described, however, the algorithm can be applied in any SCDN formulation independently of whether it is theoretical or real.

The supply chain network that is examined in this paper, consists of three nodes; the plants, the warehouses and customers. In the first link of the supply chain, the plants which manufacture the product for the customers, are assumed to be already located. These produced quantities are subjected to certain constraints. Once the products are produced, they are eventually stored in warehouses (the location of which is to be determined). From the warehouses, the quantities are delivered to the final link of the chain (the customer), according to corresponding demand.

Fig. 3
figure 3

The examined supply chain network

In Fig. 3, the binary variables that correspond to the supply chains arcs (connections) and nodes (warehouses) are shown. The present model can be extended in order to take into account, more than one products, while there are many stages and echelons, the problem is extended to a multi-product, multi-stage multi-echelon SCDN problem.

The levels of decision the supply chain may provide are the following:

  1. a)

    The quantities produced, transported and stored

  2. b)

    Capacity of the installed facilities and their location.

3.4 SCDN model

Due to the presence of binary variables, the following is a Mixed Integer Linear Programming (MILP) model with objective function (4) and constraints (5)–(15) and models a typical supply chain. Objective function (4) represent overall Total Cost which consists of the production cost (a), variable transportation cost (b), fixed transportation cost (c) among nodes and finally the installation cost (d). The last term represents penalty cost of uncovered demand.

$$\begin{aligned} P_0 :\min \, { TC}&=\overbrace{\sum _i {c_i^P \cdot P_i} }^{\left( a \right) }+\overbrace{\sum _i {\sum _j {c_{ij}^V} } \cdot Q_{ij}}^{\left( b \right) }+\overbrace{\sum _i {\sum _j {c_{ij}^F} } \cdot X_{ij}}^{\left( c \right) }+\overbrace{\sum _j {\sum _k {c_{jk}^V} } \cdot Q_{jk}}^{\left( b \right) }\nonumber \\&\qquad +\overbrace{\sum _j {\sum _k {c_{jk}^F} } \cdot X_{jk}}^{\left( c \right) }+\overbrace{\sum _j {F_j} \cdot Y_j}^{\left( d \right) }+\sum _k {c_k^{{ PEN}} \cdot g_k}\end{aligned}$$
(4)
$$\begin{aligned}&s.t.\nonumber \\&P_i =\sum _j {Q_{ij}},\quad \forall \, i \end{aligned}$$
(5)
$$\begin{aligned}&P_i \le P_i^U,\quad \forall \, i \end{aligned}$$
(6)
$$\begin{aligned}&P_i \ge P_i^L,\quad \forall \, i\end{aligned}$$
(7)
$$\begin{aligned}&\sum _i {Q_{ij}} =\sum _k {Q_{jk}},\quad \forall \, j\end{aligned}$$
(8)
$$\begin{aligned}&Q_{ij} \le Q_{ij}^U \cdot X_{ij},\quad \forall \, i, j\end{aligned}$$
(9)
$$\begin{aligned}&Q_{jk} \le Q_{jk}^U \cdot X_{jk},\quad \forall \, j, k\end{aligned}$$
(10)
$$\begin{aligned}&X_{ij} \le Y_j,\quad \forall \, i, j\end{aligned}$$
(11)
$$\begin{aligned}&X_{jk} \le Y_j,\quad \forall \, j, k\end{aligned}$$
(12)
$$\begin{aligned}&W_j \ge \beta _j \cdot \left( {\sum _i {Q_{ij}} +I_j} \right) , \quad \forall \, j\end{aligned}$$
(13)
$$\begin{aligned}&W_j \le W_j^U \cdot Y_j,\quad \forall \, j\end{aligned}$$
(14)
$$\begin{aligned}&\sum _j {Q_{jk}} +g_k =d_k,\quad \forall \, k\end{aligned}$$
(15)
$$\begin{aligned}&Y_j, X_{ij}, X_{jk} \in \left\{ {0, 1} \right\} \nonumber \\&P_i, Q_{ij}, Q_{jk}, W_j, g_k \ge 0,\quad \forall \, i, j, k \end{aligned}$$
(16)

In the above problem, constraints (6), (7) suggest that the quantities produced in plant i should not exceed and upper \((P_i^U)\) and lower \((P_i^L)\) bound production correspondingly. Constraint (5) is a mass balance constraint representing that product flow from warehouse j to customer k should be equal. Constraint (15) models the quantity that ends to the final node, the customer, and is assumed to cover demand of customer k. An additional variable is added in order to provide the magnitude of any shortfalls in demand. The introduction of slack variable \(g_k\) in constraint (15), guarantees that the problem is not infeasible, in case that the demand is high and cannot be covered by supply. The following term \(\sum \limits _k {c_k^{{ PEN}} \cdot g_k} \) is introduced in objective function (4) (Erdirik-Dogan and Grossmann 2008).

Besides constraints that regard to continuous variables, there are also logical constraints that are introduced in order to model the logical conditions. Constraints (9) and (10) suggest that the transported quantities from plant i to warehouse j and from warehouse j to customer k are upper bounded if—f the corresponding connection exists. Constraints (11) and (12) are introduced for the supply chain design network. In the two constraints is stated that if warehouse j is installed then the corresponding connection from warehouse j to plant i and from warehouse j to customer k exists. Constraint (13) states that warehouse’s capacity should be more than product’s quantities that will be transported to warehouse j plus the inventory level stored at warehouse j. The aforementioned quantities are multiplied by coefficient \(\beta _j\) expressing the amount of warehousing capacity required to hold a unit amount of the examined product at warehouse j. Constraint (14) is introduced to model the upper bound of warehouse capacity j.

Non-negativity constraints are imposed to production, transportation and warehousing variables as seen in (16).

4 B&E formulation

In this section the B&E algorithm will be deployed on the SCDN model that was previously described. Based on this approach, each node acts as an unit being described by inputs and outputs. The efficiency of each unit is extracted based on their level of inputs and outputs. Thus, the incurring efficiency can be measured with DEA. The data that will be fed to DEA are acquired by solving the initial problem, while the inputs and outputs are a-priori determined. In Fig. 4, the inputs and outputs of the warehouse facility is provided. If it assumed that a warehouse was a branch of an enterprise, then the most productive one would be the one that would minimize its operational cost and would provide more services.

Integrating DEA model into the SCND model, non-linearity terms will arise that may lead to local optima. In order to obtain global optima and to maintain the linearity of the model and extract the efficiency of each potential solution, B&E algorithm is applied.

Fig. 4
figure 4

Inputs and outputs of warehouse node

4.1 Data

The data used for the efficiency extraction are generally provided in advance after statistical, qualitative or techno economic analysis. As mentioned in the previous section and can be seen in B&E flowchart, DEA technique is applied for the caclulation of DMUs efficiency. Here DMUs are considered to be the potentially installed warehouses. The data that are provided to DEA technique, are not externally provided, but come within the solution of the problem. The only parameter that should be predetermined is the inputs and outputs that the DM considers that capture the productivity of each warehouse.

In this work, an output oriented DEA model is applied in order to extract the efficiency of each DMU (warehouse) with inputs and outputs as seen in Table 1. The choice of these data to serve as inputs and outputs are done on the basis of selecting the warehouses that can provide the maximum services at the minimum cost. In Table 1 the inputs and the outputs that will be provided to DEA technique are presented. The inputs of the study are: (a) the cost which corresponds to the transportation of quantities from plant to warehouse \(\left( {C_j^{1, V}} \right) \) and from warehouse to customer \(\left( {C_j^{2, V}} \right) \), (b) the fixed transportation cost for routes done from plant to warehouse \(\left( {C_j^{1, F}} \right) \) and from warehouse to customer \(\left( {C_j^{2, F}} \right) \) and c) the installation cost of warehouse \(j\, \left( {C_j^{IN}} \right) \). The outputs of the study are depicted in order to capture the magnitude of service level each warehouse can provide. Thus as outputs are considered (a) the total quantity that a warehouse can send to each customer \((TQ_j)\) and (b) the out coming connections, that is, how many customers a warehouse is connected to and therefore can serve \((OC_j)\). Thus the higher the efficiency the highest the service level that each warehouse can provide to each customer.

Table 1 Inputs and Outputs of the proposed supply chain

4.2 Measuring DMUs efficiency

As in this model, the efficiency is derived through solutions of a MILP model, inputs and outputs must be selected so that the efficiency of each warehouse is captured. As it can be seen from Table 1, incoming quantities have not been taken into account because due to the mass balance constraint, incoming quantities are equal to out coming quantities of a node.

In the following LP model, output oriented DEA model is presented. The data provided for the evaluation of efficiency of each warehouse are the solutions from the initial MILP model \(P_0\).

$$\begin{aligned}&{ LP} \max \varphi \nonumber \\&\quad s.t.\nonumber \\&\quad \sum _j {\lambda _j \cdot C_j^{1, V}} \le C_o^{1, V}\nonumber \\&\quad \sum _j {\lambda _j \cdot C_j^{2, V}} \le C_o^{2, V}\nonumber \\&\quad \sum _j {\lambda _j \cdot C_j^{1, F}} \le C_o^{1, F}\nonumber \\&\quad \sum _j {\lambda _j \cdot C_j^{2, F}} \le C_o^{2, F}\nonumber \\&\quad \sum _j {\lambda _j \cdot F_j} \le F_o\nonumber \\&\quad \sum _j {\lambda _j \cdot TQ_j} \ge TQ_o \cdot \varphi \nonumber \\&\quad \sum _j {\lambda _j \cdot OC_j} \ge OC_o \cdot \varphi \nonumber \\&\quad \lambda _j \ge 0,\quad \forall \, j\nonumber \\&\quad \varphi \, { free} \end{aligned}$$
(17)

The extraction of each DMU’s efficiency is preformed after solving the LP model (17) for each of the examined DMUs. In order to provide conclusions for the efficiency of each DMU, as \(\varphi \) is a free variable and can obtain any value, Technical Efficiency (TE) index is used. This index is defined as the reciprocal of \(\varphi \) (18) and is bounded in the range \(\left[ {0, 1} \right] \).

$$\begin{aligned} { TE}=1/\varphi \end{aligned}$$
(18)

The next step in B&E algorithm is to add those DMUs with \({ TE}=1\) in \(J^{\gamma }\) set, where all efficient solutions are stored. In order to investigate which DMUs hold an efficiency of 1, the next binary variable is introduced:

$$\begin{aligned} \xi _j ={\left\{ \begin{array}{ll} 1,\quad { TE}_j \ge \alpha \\ 0,\quad o/w\\ \end{array}\right. }\quad \forall \, j \end{aligned}$$
(19)

As it can be seen in (19), binary variable gets the value of 1, if \({ TE}\) is greater than or equal to an efficiency threshold. Initially this value is 1, but if the problem yields infeasibility then this value is reduced. Binary variable is triggered when TE of a warehouse is greater or equal to a, using the following constraint.

$$\begin{aligned} { TE}_j \ge a\cdot \xi _j,\quad \forall \, j \end{aligned}$$
(20)

The set of efficient solutions at each iteration of the algorithm is defined as follows:

$$\begin{aligned} J^{\gamma } =\left\{ {{ ORD}(j): { TE}_j \ge a\cdot \xi _j} \right\} \end{aligned}$$
(21)

Assuming that N is the set of all non-zero solutions after \(\varepsilon \) iterations of the algorithm, then DMUs with technical efficiency less than a will be included in the following set:

$$\begin{aligned} {\bar{E}}=N/J^{\gamma } \end{aligned}$$
(22)

In the previous context, the set of inefficient solutions is defined as the set subtraction of the set of all solutions after \(\varepsilon \) iterations of the algorithm minus the set of inefficient solutions.

4.3 Solving problem \(P_0^\gamma \)

In order to design the supply chain network, using binary variables for efficient solutions, constraints that are introduced for the design of the network (logical) are reformulated. A DMU is selected if—f satisfies constraint (20). Thus, the binary variable that corresponds to the selection of warehouses \(\left( {Y_j} \right) \) is replaced with binary variable \(\xi _j\). Constraints (11)–(14) are reformulated as follows:

$$\begin{aligned} X_{ij}\le & {} \xi _j,\quad \forall \, i, j \end{aligned}$$
(23)
$$\begin{aligned} X_{jk}\le & {} \xi _j,\quad \forall \, j, k \end{aligned}$$
(24)
$$\begin{aligned} W_j\ge & {} \beta _j \cdot \left( {\sum _i {Q_{ij}} +\xi _j \cdot I_j} \right) ,\quad \forall \, j \end{aligned}$$
(25)
$$\begin{aligned} W_j\le & {} W_j^U \cdot \xi _j,\quad \forall \, j \end{aligned}$$
(26)

In constraints (23), (24) and (26), binary variable \(Y_j\) is replaced with variable \(\xi _j\) and as it can be seen (25) has been modified introducing it to the inventory parameter in order to avoid any infeasibilities that may occur. As \(Q_{ij}\) is controlled by binary variables that correspond to connections, namely \(X_{ij}\), through constraint (9) and binary variables are connected with constraint (11), such that if a warehouse is installed then the corresponding connection is created, thus \(Q_{ij}\) has an indirect relation with the variable that correspond to quantities transferred. For this reason, binary variable is also introduced to the lower bound and especially on the inventory parameter which is not controlled by any logical or continuous variable.

Except for the supply chain network and logical constraints, the objective function of the problem is modified as follows:

$$\begin{aligned} \min \, { TC}= & {} \sum _i {c_i^P \cdot P_i} +\sum _i {\sum _j {c_{ij}^V} } \cdot Q_{ij} +\sum _i {\sum _j {c_{ij}^F} } \cdot X_{ij} +\sum _j {\sum _k {c_{jk}^V} } \cdot Q_{jk}\nonumber \\&+\sum _j {\sum _k {c_{jk}^F} } \cdot X_{jk} +\sum _j {F_j} \cdot \xi _j +\sum _k {c_k^{{ PEN}} \cdot g_k} \end{aligned}$$
(27)

In objective function (27), the installation cost is now computed upon the product of installation cost and binary variable \(\xi \). The procedure is graphically represented in the following figure (Fig. 5). In Fig. 5, the reduction of cost is performed through consecutive efficiency cuts. The proposed B&E algorithm is not the logic cut type technique. At the beginning the value of the objective function is higher than in any of the iterations of the algorithm. The basis for this argument is that the binary variable that corresponds to supply chain network design is replaced by a binary variable that measures efficiency of solutions and is subjected to an additional constraint, the efficiency cut. Thus in each iteration through the selection of reduced number of facilities, the overall cost is also minimized. The thick black line indicates the point where the data are insufficient for the application of DEA technique.

Fig. 5
figure 5

Minimization of cost through iterative efficiency cuts

The new formulated problem \(P_0^\gamma \) is described by objective function (27) and constraints (5)–(10), (23)–(26), (15) and (16). The initial value of the objective function will be greater than the value of any consecutive iteration. This is attributed to the reduction of solutions (by zeroing) for certain indices as stated in (28).

$$\begin{aligned}&\sum _i {\sum _{j\in {\bar{E}}} {c_{ij}^V} } \cdot Q_{ij} +\sum _i {\sum _{j\in {\bar{E}}} {c_{ij}^F} } \cdot X_{ij} +\sum _{j\in {\bar{E}}} {\sum _k {c_{jk}^V} } \cdot Q_{jk}\nonumber \\&\quad +\sum _{j\in {\bar{E}}} {\sum _k {c_{jk}^F} } \cdot X_{jk} +\sum _{j\in {\bar{E}}} {F_j} \cdot \xi _j =0 \end{aligned}$$
(28)

For non-efficient solutions, when \(\xi _{j\in {\bar{E}}} =0\) the decisions that concern to the installation of a facility are also 0. The aforementioned state is derived from the following inequalities.

$$\begin{aligned} X_{ij}\le & {} 0\Leftrightarrow \nonumber \\ X_{ij}= & {} 0 \quad \forall \, i,\quad j\in {\bar{E}} \end{aligned}$$
(29)
$$\begin{aligned} X_{jk}\le & {} 0\Leftrightarrow \nonumber \\ X_{jk}= & {} 0 \quad \forall \, j\in {\bar{E}},\quad k \end{aligned}$$
(30)

Finally, the variables that regard to the quantities become also 0 when a solution is not efficient. Given the fact that the system should be in a balanced form, quantities are channeled through the selected facilities, which are the efficient ones. The procedure stops at the point where objective function value has no improvement in two consecutive iterations or when the number of DMUs in each iteration is less than the minimum amount of DMUs needed for the functionality of DEA.

5 Results

5.1 Description of case study

In this section, the applicability of the model is presented through a case study. The proposed B&E algorithm can be applied in any supply chain, reducing overall cost. In the supply chain network presented in the previous section, a single homogeneous product is manufactured and transported throughout the links of the supply chain.

In the following table (Table 2) the data of the case study are presented. Simulated data have been used drawn from uniform distribution which is considered as fair due to the fact that all observation have the same probability of occurrence. The unit production and transportation cost is measured in relative money units/units (r.m.u/u) while parameters with respect to capacity and demand are measured in units (u).

Table 2 Data of the proposed case study

5.2 Numerical results

In this section the application of B&E is demonstrated through the results of the case study. The case study was model and solved in GAMS optimization software using CPLEX as LP and MIP solver on an Intel Pentium, 2.3 GHz, 2 GB RAM laptop computer. Even if the instance is medium to large with \(\left| I \right| =50\) plants, \(\left| J \right| =50\) plants and \(\left| K \right| =50\) customers, the problem was solved in 10 CPU seconds to optimality.

5.2.1 Initialization

First MILP model \(P_0\) is solved to optimality providing decisions about the number of warehouses. A summary of the data (inputs and outputs) that are derived after solving MILP model \(P_0\) are shown in Fig. 6. Regarding the outputs, the average value of the total quantity \((TQ_j)\) (Fig. 6a) that is sent from the selected warehouses to the customers is 5000 r.m.u while the average value of the total outgoing connections is \(10\, (OC_j)\) (Fig. 6b). Regarding the inputs, the average value of the variable transportation cost from plant to warehouse (Fig. 6c) \((C_j^{1, V})\) is almost 800 r.m.u and almost the same conclusions can be extracted for variable transportation cost from warehouse to customer \((C_j^{2, V})\) (Fig. 6d). Finally, the average values of fixed transportation cost from plant to warehouse \((C_j^{1, F})\) (Fig. 6e) and fixed transportation cost from warehouse to customer \((C_j^{2, F})\) (Fig. 6f) are 12,500 and 10,000 r.m.u correspondingly.

Fig. 6
figure 6

Boxplot for outputs: a total quantity b outgoing connections and inputs: c variable transportation cost from plant to warehouse, d variable transportation cost from warehouse to customer, e fixed transportation cost from plant to warehouse, f fixed transportation cost from warehouse to customer, after solving \(P_0\)

Table 3 Technical efficiency for all selected DMUs (warehouses)

After solving model \(P_0\) 50 warehouses are selected providing a total cost of 27,352,117.18  r.m.u. The number of warehouses (viz. the decision variables that correspond to the installation of a warehouse) is larger than the minimum DMU requirements of \(T=\max \left\{ {5\cdot 2, 3\cdot \left( {5+2} \right) } \right\} =21\) needed in order for the DEA results to have validity, so the procedure continues.

5.2.2 DEA

Each DMU’s (warehouse) efficiency is evaluated through LP model (17), with the use of pre-formulated inputs and outputs. The inputs and the outputs (as have been defined in Table 1) are calculated from the optimal solutions of \(P_0 (Q_{ij}^{*}, Q_{jk}^{*}, X_{ij}^{*}, X_{jk}^{*}, Y_j^{*})\) as can be seen in Fig. 4. Solving LP model (17), the efficiency of each warehouse is extracted and the initial set of solutions is the following:

$$\begin{aligned} N=\left\{ {1, 2, 3, \ldots , 50} \right\} \end{aligned}$$
(31)

The Technical Efficiency of each of the selected DMUs (warehouses) is presented in the next table (Table 3) and the efficient warehouses are shown with bold. As it can be seen in Table 3, the majority of DMUs selected has a technical efficiency of 1 and are most likely to remain in N. Yet, as the problem has indirect two objectives, namely the maximization of efficiency of the final supply chain network and the minimization of cost, if a warehouse may have a large cost comparing to the other, it may be excluded.

Table 4 Results from application of B&E algorithm to larger instances

5.2.3 Efficiency cuts

As aim of the proposed algorithm is to reduce the number of facilities in order to reduce overall cost but at the same time increase the service level of the customers, the efficiency cuts are applied to the second MILP model \(P_0^{\gamma =1}\). Model \(P_0^{\gamma =1}\) presented in Sect. 4.3 (\(\gamma =1\) denotes first iteration) receives the initial solutions from model \(P_0\). After solving \(P_0^{\gamma =1}\) the number of selected facilities has dropped from 50 to 13 which are the following: 1, 5, 6, 7, 17, 23, 27, 31, 34, 43, 44, 45, and 46. It must be noticed that from all DMUs, the model found and excluded those with Technical Efficiency equal to 1. The problem was solved to optimality and without any infeasibility occurrence. The results of model \(P_0^{\gamma =1}\) can be easily confirmed from Table 3 as all the selected facilities have a Technical Efficiency equal to 1; if any infeasibility instance would occur, based on the flowchart of B&E algorithm (Fig. 2), wider Technical Efficiency bounds would be used. The cost after the first iteration of B&E algorithm is 27,059,894.34 r.m.u. As the number of facilities that are selected from \(P_0^{\gamma =1} \) is less than the threshold set for DEA functionality (terminating criterion \(\left| {N^{\gamma =1}} \right| <21\)), the algorithm stops. The optimal solutions are the one derived from \(P_0^{\gamma =1}\). The number of selected facilities that are eventually selected are 13 (from 50 that have been selected after solving \(P_0)\) leading to cost reduction from 27,352,117.18 r.m.u. to 27,059,894.34 r.m.u.

5.2.4 Larger instances

In this section the application of the algorithm to larger instances is demonstrated. In Table 4 the instance characteristics (size of the problem), the results (objective function value and number of selected facilities) and the iterations for final solutions are presented.

As it can be seen from Table 4, in the proposed problems initially all the warehouses are selected. The initial cost is considered as an upper bound for the iterations of B&E algorithm and at each iteration overall cost decreases. In all instances, the algorithm terminated after the first iteration \((\gamma =1)\) either because the number of facilities is less than the threshold or because after selecting the efficient warehouses of first iteration, the MILP model \(P_0^\gamma \) was infeasible.

6 Conclusions

Efficiency measurement is applied to firms or to units when the data (inputs and outputs) are known a priori. Using DEA technique the efficiency is extracted for each DMU under examination. Considering each solution as a DMU, it is possible to evaluate the solutions efficiency using endogenous data. When modeling the supply chain network design, single sourcing constraints are used in order to provide better results in terms of minimizing the overall cost (objective function value) and economies of scale through facilities concentration. However, this type of constraint is a “hard” constraint and may eventually leads to infeasibility.

In this paper a Branch and Efficiency (B&E) algorithm has been deployed for the optimal design of supply chain networks. The algorithm integrates DEA technique in the design of supply chain network, through an iterative process and takes into advantage the strengths of DEA to provide efficiency scores for multiple inputs and outputs. Up to now, DEA technique has been applied in order to measure performance of a system based on exogenous data. The proposed B&E algorithm takes into the data provided within the optimization procedure in each iteration. Based on this approach, the problem is initially solved providing non-zero solutions for the initialization of the algorithm. The initialized values are fed to DEA to measure the efficiency of the unit under examination (in this case warehouses). Through the addition of efficiency cuts, the algorithm selects only the efficient solutions, which minimize overall cost (or maximize profit/revenue).

For illustrative purposes, a two-stage supply chain model is proposed. Production units form the first link of this supply chain network while customer’s site form the last link, both of which are assumed to be already installed. The model is designed in such a way so as to provide decisions about the potential installation of warehouses. Setting a-priori the characteristics that could capture the efficiency of each facility (warehouse), the data are provided to the algorithm. The proposed algorithm has two characteristics; heuristic and evaluative. The first comes from the fact that even if initial solutions are provided during the initialization process, the algorithm searches among the “efficient neighborhoods” and would accept or exclude solutions based on efficiency cuts, after the evaluation process. The algorithm is generic and can be applied in any type of supply chain, regardless the level of complexity, making it a valuable tool for long and short term managerial decisions.