Introduction

The facility location–allocation (FLA) decision is often considered the most important factor leading to the success of a private- or public-sector organization. Daskin (2013) emphasizes the importance of facility location problems by asserting in his recent book that in short, the success or failure of both private and public-sector facilities depends in part on the locations chosen for those facilities. Since the design of efficient supply chain networks starts from efficient FLA decisions (see Olivares-Benitez et al. 2012), the FLA models have been widely used in practical life as well as in many academic disciplines. Consequently, the topic of FLA problem has received considerable attention in the literature.

The basic issues of FLA are where to locate and how to size facilities, and how to meet demands from the facilities. FLA decisions inherently consist of two kinds of decision plans. One is a strategic decision plan on the facility location, while the other one is an operational decision plan on the facility allocation. The goal of FLA decision is to locate the facilities in a way that satisfies/covers the demand points most efficiently. In fact, various types of FLA models have been developed to answer questions such as how many facilities to locate, where to locate facilities, and how to distribute the items to the customers concerning different location criteria.

Many references cited in Farahani et al. (2010) demonstrate that the FLA problems are inherently multi-objective, where those objectives sometimes conflict with each other in nature (see Lee et al. 1981). The traditional FLA models deal with the objective of cost minimization, whereas demand-oriented objectives focus on measuring the ‘proximity or closeness’ of the facilities. The profit maximization objective may be achieved by either cost minimization or maximization of demand satisfied/covered, or both. They (2010) emphasize the importance of multi-objective facility location–allocation (MOFLA) problems after observing the substantial growth of the literature on MOFLA problems. Thus, the growing attention and interest in these problems are due to the recognition of the need to consider more objectives/criteria to achieve closer solutions to reality.

Multi-objective programming (MOP) technique provides an analytical framework where a variety of objectives can be focused on simultaneously so that a decision maker can use to provide optimal solutions. But most of the MOP techniques require decision-makers’ judgment to provide weights to the deviational variables in the objective function to appropriately reflect the importance and desirability of deviations from the various target values. As the number of performance measures increases, solving the MOP model will yield a great number of alternative options. The reason is that each different weight factor set for performance measures may generate a different option. As Ragsdale (2015) states, no standard procedure is available to assign values to the weight factors in a way that guarantees the decision-makers find the most desirable solution. Evaluating alternatives generated by solving the MOP model can be viewed as a multiple-criteria decision-making (MCDM) problem, requiring a systematic solution evaluation system.

In this paper, we utilize data envelopment analysis (DEA) technique for such a systematic solution evaluation system. DEA yields relative efficiency of comparable units, which are called decision-making units (DMUs) in DEA parlance, that employ multiple outputs and inputs. To denote the relative efficiency for each DMU, DEA produces an efficiency score that is defined as the ratio of the sum of weighted outputs to the sum of weighted inputs.

The objective of this paper is to present and demonstrate how to combine DEA and MOP techniques for the efficient FLA decisions and patterns to help practitioners as well as decision-makers who are responsible for the strategic and operational decision plans. To combine DEA and MOP, we classify all performance measures into inputs or outputs to formulate the FLA problem as an MOP model. Then we solve the model for various values of weights given to the performance measures. Considering each generated alternative option for a given set of weight as a DMU, we evaluate all alternative options by utilizing DEA technique to find the efficiency of each alternative option and identify the most efficient FLA schemes. In this way, decision-makers evaluate and identify efficient and robust FLA decisions without any subjective judgment. Furthermore, once decision-makers identify efficient FLA patterns through the proposed procedure, they can modify their operational decisions without sacrificing the efficiency heavily under unexpected disruptions. We demonstrate our procedure through a case study.

This paper is organized as follows. After the literature review of FLA with DEA, we provide a brief introduction to general FLA models with MOP with the minimax objective approach and DEA. The proposed method of combining an MOFLA model and DEA is discussed. Next, we demonstrate our proposed method by MOFLA formulation and DEA evaluation through a case study using actual data in South Carolina, followed by conclusions.

Literature review

The FLA problem is widely used in practical life. The pioneering work of Weber (1929) on the location theory prompted the formulation of various mathematical programming models. After Cooper (1963) initially proposes an FLA problem by presenting a heuristic method, Hakimi (1964, 1965) applies the FLA problem to network design as a powerful tool. Several heuristic methods have been developed for the traditional FLA problem. Recently, Mesa et al. (2017) applied Cuckoo search via Levy flights (CS-LF) to uncapacitated FLA problem, claiming that applying CS-LF yields better facility locations compared to particle swarm optimization and other existing algorithms. Traditionally, minimizing cost/time/distance has been the primal objective. Some authors (Askin et al. 2014; Manatkar et al. 2016) consider also maintaining desired service level in addition to reducing the costs. Recently, contrary to the most single-period FLA problem, Manzini and Gebennini (2008) and Manatkar et al. (2016) apply mixed integer programming optimization models to design and manage multi-period, multi-stage and multi-commodity FLA problem. Hotelling (1929) introduces an influential competitive facility location model where each of two players selects a location in a linear segment, and a continuum of uniformly-distributed consumers along that segment select the closest facility. Hakimi (1983), Aboolian et al. (2007), Zhang et al. (2016a, b), and Bagherinejad and Niknam (2017) are a few of them who study and develop various algorithms and procedures for the competitive facility location problems. Balcik and Beamon (2008) consider facility location decisions for a humanitarian relief chain responding to quick-on-set disasters. Habib et al. (2016) conduct an up-to-date survey of mathematical models developed in humanitarian supply chain area and highlight the potential research areas which require the attention of the researchers. Recently, Fereiduni and Shahanaghi (2017) present a robust network design model for humanitarian logistics which will assist in location and allocation decision for multiple disaster periods. They (2017) also use the Monte Carlo simulation for generating related random numbers and different scenarios and utilize the p-robust approach to formulate the new network.

As shown in Farahani et al. (2010), various researchers have worked on multi-objective/criteria facility location–allocation (MOFLA) problems, such as Blake and Carter (2002), Drezner et al. (2006), Yang et al. (2007), Ho et al. (2008), Farahani et al. (2010), Hong et al. (2015), Farahani et al. (2015), Fang and Li (2015), and Manatkar et al. (2016). Khalili-Damghani et al. (2015) propose a bi-objective mixed integer mathematical programming for locations of warehouses and routing of vehicles to reduce the total cost of the supply chain and to balance the workload of distribution centers while the due dates of delivery of perishable product are met, concurrently. To solve their model, they use an evolutionary algorithm called Non-Dominated Sorting Generic Algorithm-II. Oddoye et al. (2009), Beheshtifar and Alimoahmmadi (2015), Zhang et al. (2016a, b) and Ahmadi-Javid et al. (2017) apply MOFLA models to solving health-care related facility location–allocation problems.

DEA has been a widely used mathematical programming technique that evaluates the performance of a set of homogenous DMUs. Hafezalkotob et al. (2015) propose a robust DEA (RDEA) to investigate the efficiencies of DMU where there are discrete input and output data. Khalili-Damghani et al. (2016), also pointing out that the observed values of the input and output data in real-world problems are sometimes imprecise or vague, present a comprehensive fuzzy DEA framework for solving performance evaluation problems with coexisting desirable input and undesirable output data in the presence of simultaneous input–output projection. In conventional DEA, DMUs are represented as black boxes where only the initial inputs and final outputs are considered to measure their efficiency, neglecting intervening processes, i.e., different series or parallel function. A new DEA model called network DEA (NDEA) accounts for divisional efficiencies as well as the overall efficiency (see Cook et al. 2010). Saniee-Monfared and Safi (2013) propose a set of performance indicators to enable efficiency-analysis of academic activities and apply NDEA structure to account for sub-functional efficiencies such as teaching quality, research productivity, as well as overall efficiency. Tavana et al. (2016) apply NDEA to evaluate the performance of three-level supply chains and shows the applicability and efficiency of the proposed framework, insisting that the proposed method can be easily implemented in any multi-level supply chain.

Thomas et al. (2002) propose the combined obnoxious-facility location/DEA model, assuming that the number of facilities to be opened is predetermined. Klimberg and Ratick (2008) develop and test location modeling formulations by utilizing DEA to find optimal and efficient FLA patterns. Following Klimberg and Ratick (2008), Fang and Li (2015) propose a combined DEA and goal programming (GP) approach. But these two approaches assume both inputs and outputs are given to model DEA for the efficient FLA decision, whereas our paper generates all inputs and outputs from solving the MOP model. Klimberg et al. (2011) extend Klimberg and Ratick (2008) to consider the more realistic situation in which the units of products from the facilities are decision variables. Another example using the combining DEA and MOP approach is found in Ghoushchi et al. (2017) who develop the DEA model using imprecise data based on GP to evaluate and select sustainable suppliers in the supply chain.

This paper is motivated by Klimberg and Ratick (2008), Klimberg et al. (2011) and Fang and Li (2015). All of them develop and test location modeling formulations by utilizing DEA to find optimal and efficient FLA patterns. Klimberg and Ratick (2008) and Klimberg et al. (2011) postulate that locating facilities at different potential sites may affect the performance of the facility’s ability to transform inputs into usable outputs. Their model formulation simultaneously considers the interaction of spatial efficiencies of different location patterns through the use of least cost objective and the facility efficiencies at those sites through the use of DEA objective. Their models (2008 and 2011) require the huge amount of the predetermined input and output data and consequently the huge number of the constraints for their combined location and simultaneous DEA model, as the numbers of facilities and their potential sites increase. In addition to those huge data and constraints required by their models, it would be not only difficult to quantify all inputs and outputs for a facility to be located to cover the allocated sites, but also very subjective for a decision maker to decide these magnitudes of such inputs and outputs. In addition, they apply the results obtained by DEA as one goal/objective for their MOP models. That is not the way that DEA has been developed and applied in many researches on the topic of DEA. For example, suppose that one of the performance measures is the total cost. The total cost should be used as an input to apply DEA method. In other words, the decision of locating a facility at a site incurs a part of the total cost and will affect the relative efficiency of FLA scheme when DEA is applied. That is the way that DEA has been invented and applied to find the efficiency of DMUs.

In this paper, we propose a more reasonable and practical approach to finding efficient FLA decisions by generating the inputs and outputs directly through formulating and solving the MOP model and by applying DEA for those inputs and outputs generated by the MOP model. No literature on our innovative approach to FLA decisions has been known.

FLA model with MOP with minimax objective approach

The following nomenclature is used:

Sets:

M :

Index set of potential facility sites (j =1, 2, …, M and m =1, 2…M).

Parameters:

\(b_{j}\) :

Minimum number of sites that facility j can cover

\(B_{j}\) :

Maximum number of sites that facility j can cover

\(c_{jm}\) :

Cost of shipping one unit of demand per mile from facility j to site\demand point m

\(C^{\text{max} }\) :

Maximum number of facilities can be built

\(CAP_{j}^{\text{max} }\) :

Capacity of facility located at site j

\(d_{jm}\) :

Distance between facility located at site j and site m

\(D_{m}\) :

Demand of site m

\(h_{j}\) :

Holding cost per item per time unit at facility located at site j

\(F_{j}\) :

Fixed cost for constructing and operating facility located at site j

\(L_{j}\) :

Replenishment lead time for facility located at site j

\(S_{j}\) :

Ordering cost for facility at site j to place an order

β :

Desired service level

\(\sigma_{m}\) :

Standard deviation of demand at site m

Decision variables:

\(C_{j}\) :

Binary variable deciding whether a facility is located at site j

\(y_{jm}\) :

Binary variable deciding whether site m is covered by facility located at site j

In above nomenclature, we assume \(a_{jm}\) and \(d_{jm}\) equal to zero if j = m.

Let I and O denote the index set of performance metrics for inputs (i = 1, 2, …, p) and outputs (\(r\) = 1, 2, …, s). Let the nonnegative deviation variables, (\(\omega_{1}^{ + } ,\) \(\omega_{2}^{ + } ,\)\(\omega_{p}^{ + }\)) and (\(\omega_{1}^{ - } ,\) \(\omega_{2}^{ - } ,\)\(\omega_{s}^{ - }\)), denote the amounts by which each value of performance metrics deviates from the minimum and maximum values, respectively, which are called overachievement and underachievement deviation n variables. Then, the deviation variables are expressed as

$$\omega_{i}^{ + } = X_{i}^{ + } - TGV_{i}^{ + }$$

and

$$\omega_{r}^{ - } = TGV_{r}^{ - } - X_{r}^{ - } ,$$

where \(TGV_{i}^{ + }\) and \(TGV_{r}^{ - }\) represent the target value of performance metric, \(I^{ + }\) and \(O^{ - }\), respectively. Now, the minimax objective can be expressed as

$${\text{Minimize}}\;{\text{the}}\;{\text{maximum}}\;{\text{of}}\;\left\{ {\alpha_{1}^{ + } \frac{{\omega_{1}^{ + } }}{{TGV_{1}^{ + } }}, \ldots , \alpha_{p}^{ + } \frac{{\omega_{p}^{ + } }}{{TGV_{p}^{ + } }},\alpha_{1}^{ - } \frac{{\omega_{1}^{ - } }}{{TGV_{1}^{ - } }}, \ldots , \alpha_{s}^{ - } \frac{{\omega_{s}^{ - } }}{{TGV_{s}^{ - } }} } \right\},$$

where \(\alpha_{i}^{ + }\) and \(\alpha_{r}^{ - }\) are relative importance weights attached to the overachievement and underachievement deviation variables and the sum of all weights equals one for the purpose of analysis. Now, set Q equal to the maximum variable, such as

$$Q \, = { \text{max} } \left\{ {\alpha_{1}^{ + } \frac{{\omega_{1}^{ + } }}{{TGV_{1}^{ + } }}, \ldots , \alpha_{p}^{ + } \frac{{\omega_{p}^{ + } }}{{TGV_{p}^{ + } }},\alpha_{1}^{ - } \frac{{\omega_{1}^{ - } }}{{TGV_{1}^{ - } }}, \ldots , \alpha_{s}^{ - } \frac{{\omega_{s}^{ - } }}{{TGV_{s}^{ - } }} } \right\}.$$
(1)

The formulation of MOFLA model with MOP with the minimax objective is given as follows:

$${\text{Minimize}}\;Q\;{\text{in}}\;\left( 1 \right)$$
(2)

subject to

$$\alpha_{1}^{ + } \frac{{\omega_{1}^{ + } }}{{TGV_{1}^{ + } }} \le Q, \ldots , \alpha_{p}^{ + } \frac{{\omega_{p}^{ + } }}{{TGV_{p}^{ + } }} \le Q, \alpha_{1}^{ - } \frac{{\omega_{1}^{ - } }}{{TGV_{1}^{ - } }} \le Q, \ldots , \alpha_{s}^{ - } \frac{{\omega_{s}^{ - } }}{{TGV_{s}^{ - } }} \le Q,$$
(3)
$$X_{1}^{ + } - \omega_{1}^{ + } = TGV_{1}^{ + } , \ldots , X_{p}^{ + } - \omega_{p}^{ + } = TGV_{p}^{ + } , X_{1}^{ - } + \omega_{1}^{ - } = TGV_{1}^{ - } , \ldots X_{s}^{ - } + \omega_{s}^{ - } = TGV_{s}^{ - } ,$$
(4)
$$\mathop \sum \limits_{j \in M}^{{}} y_{jm} = 1, \quad \forall m \in M$$
(5)
$$\mathop \sum \limits_{j \in M}^{{}} C_{j} \le C^{\text{max} } ,$$
(6)
$$y_{jm} \le C_{j} , \quad \forall j\;{\text{and}}\;\forall m \in M$$
(7)
$$C_{j} \cdot b_{j} \le \mathop \sum \limits_{m \in M}^{{}} y_{jm} \le C_{j} \cdot B_{j} ,\quad \forall j \in M$$
(8)
$$\mathop \sum \limits_{m \in M}^{{}} D_{m} y_{jm} \le CAP_{j}^{\text{max} } , \quad \forall j \in M.$$
(9)

Constraints (5) make certain that each site is covered by a facility. Constraints (6) define the maximum number of facilities to be built. Constraints (7) ensure that each site can only be covered by a selected facility. Constraints (8) make sure that the facility j must cover at most \(B_{j}\) and at least \(b_{j}\) sites. Constraints (9) show the capacity of the facility j.

Data envelopment analysis (DEA) models

Among many performance evaluation methods, data envelopment analysis (DEA) has been widely used to evaluate the relative efficiency of a set of peer organizations called decision-making units (DMUs) that have multiple inputs and outputs. The main reason might be that DEA models need not recourse to the exact behavior function of those organizations regarding the transformation of multiple inputs to outputs. DEA defines relative efficiency as the ratio of the sum of weighted outputs to the sum of weighted inputs. The mathematical model of DEA (see Cooper et al. 2011) may be stated as:

$${\text{max}} \varphi = \mathop \sum \limits_{r = 1}^{s} u_{r} O_{ro} ,$$
(10)

subject to

$$\mathop \sum \limits_{i = 1}^{p} v_{i} I_{io} = 1,$$
(11)
$$\begin{aligned} & \mathop \sum \limits_{r = 1}^{s} u_{r} O_{r\delta } - \mathop \sum \limits_{i = 1}^{p} v_{i} I_{i\delta } \le 0,\quad \delta = 1, \ldots , n, \\ & u_{r} , v_{i} \ge \varepsilon , r = 1 \ldots , s;\;i = 1 \ldots , p \\ \end{aligned}$$
(12)

where n number of DMUs being compared in the DEA analysis, \(\varphi\) efficiency rating of the DMUo being evaluated by DEA, Or amount of output r generated by DMU, Ii amount of input i used by DMU, p number of inputs used by the DMUs, s number of outputs generated by the DMUs, ur coefficient or weight assigned by DEA to output r, vi coefficient or weight assigned by DEA to input i, ε non-Archimedean.

Let φ* denote the optimal value of the objective function corresponding to the optimal solution (u*, v*). DMUo is said to be efficient if φ* = 1. DEA models can be either input-oriented or output-oriented, depending upon the rationale for conducting DEA. The model given by (10)–(12) is called an input-oriented CCR model, which was initially proposed by Charnes et al. (1978), and φ* is called constant returns to scale (CRS) efficient score (ES).

Now, the dual program of (10)–(12), which is called input-oriented envelopment DEA model (e-DEA), is formulated:

$${\text{min}} \theta - \varepsilon \left\{ {\mathop \sum \limits_{i = 1}^{p} \xi_{i}^{ - } + \mathop \sum \limits_{r = 1}^{s} \xi_{r}^{ + } } \right\}$$
(13)

subject to

$$\mathop \sum \limits_{\delta = 1}^{n} \lambda_{\delta } I_{i\delta } - \theta I_{io} + \xi_{i}^{ - } = 0,\quad i = 1, \ldots , p,$$
(14)
$$\begin{aligned} & \mathop \sum \limits_{\delta = 1}^{n} \lambda_{\delta } O_{r\delta } - O_{ro} - \xi_{r}^{ + } = 0, \quad r = 1, \ldots , s, \\ & \xi_{i}^{ - } , \xi_{r}^{ + } , \lambda_{\delta } \ge 0, \quad \delta = 1, \ldots n. \\ \end{aligned}$$
(15)

In the above dual model given by (13)–(15), \(\theta\) is the efficient score (ES) and DMUo is said to be efficient if \(\theta^{*} = 1\) and \(\lambda_{\delta }\) is the dual variable, used to indicate benchmark information. \(\xi_{r}^{ + }\) and \(\xi_{i}^{ - }\) are slack variables used to calculate the target input and output variables for an inefficient DMU. For \(\theta^{*} = 1\), the performance of DMUo is fully efficient if and only if all slacks \(\xi_{r}^{ + *} = \xi_{i}^{ - *} = 0\), otherwise is weakly efficient. See Cooper et al. (2011) for details. If the performance of a DMU is weakly efficient, it implies that the DMU is on the best-practice frontier, but the performance can be still improved by reducing input(s) or increasing output(s). Thus, weakly efficient DMUS are not classified as really efficient. By solving the above model, we can obtain the efficiency score of each DMU and check if the performance of the target DMU is fully efficient or weakly efficient.

Performance evaluation or measurement often depends upon by the context. One could ask “what is the relative attractiveness of a particular DMU when compared to others?” Following this vein, Seiford and Zhu (2003) propose the stratification/context-dependent DEA method to measure the attractiveness score and progress of DMUs with respect to a given evaluation context. For this, they stratify DMUs into different efficiency levels. Let \(J^{1} = \left\{ {DMU_{\delta } ,\delta = 1, 2, \ldots , n} \right\}\) be the whole set of n number of DMUs and iteratively define \(J^{\ell + 1} = J^{\ell } - E^{\ell } ,\) until \(J^{\ell + 1}\) becomes null. \(E^{\ell }\) consists of all the efficient DMUs on the \(\ell\)th level, that is, \(E^{\ell } = \left\{ {DMU_{k} \in J^{\ell } |\theta^{*} \left( {\ell ,\text{ }k} \right) = 1} \right\},\) and \(\theta^{*} \left( {\ell ,\text{ }k} \right)\) is the optimal value to the following CRS model when \(DMU_{k}\) is under evaluation.

$$\theta^{*} \left( {\ell ,\text{ }k} \right) = \mathop {\hbox{min} }\limits_{{\lambda_{\delta } ,\theta \left( {\ell ,\text{ }k} \right)}} \theta \left( {\ell ,\text{ }k} \right)$$
(16)

Subject to

$$\mathop \sum \limits_{{\delta \in F\left( {J^{\ell } } \right)}}^{K} \lambda_{\delta } I_{i\delta } - \theta \left( {\ell , k} \right)I_{ik} \le 0,\quad i = 1, \ldots , p,$$
(17)
$$\begin{aligned} & \mathop \sum \limits_{{\delta \in F\left( {J^{\ell } } \right)}}^{K} \lambda_{\delta } O_{r\delta } - O_{rk} \ge 0, \quad r = 1, \ldots , s, \\ & \lambda_{\delta } \ge 0, \delta = 1, \ldots n, \\ \end{aligned}$$
(18)

where \(\delta \in F\left( {J^{\ell } } \right)\) means \(DMU_{\delta} \epsilon J^{\ell}\), i.e., \(F\left( \cdot \right)\) represents the correspondence from a DMU set to the corresponding subscript index set. In fact, all DMUs in \(E^{\ell }\) are equivalent from the traditional DEA perspective. The DEA stratification model given by (16)–(18) partitions the set of DMUs into different frontier levels characterized by \(E^{\ell }\). The attractiveness score for each DMU in the \(\ell\)th level (\(E^{\ell }\)) is computed against DMUs in the \(\left( {\ell + 1} \right)\)th and lower levels as the evaluation context (see Zhu 2014). For an example, to find an attractiveness score for \(DMU_{q} = \left( {I_{q} , O_{q} } \right)\) from a specific level, \(E^{{\ell_{o} }} , \ell_{o} \in \left\{ {1, \;2,\, \ldots \,L - 1} \right\}\), we solve the following model:

$$H_{q}^{*} \left( d \right) = \hbox{min} H_{q} \left( d \right),\quad d = 1, \ldots L - \ell_{o}$$
(19)

Subject to

$$\mathop \sum \limits_{{\delta \in F\left( {E^{{\ell_{o} + d}} } \right)}}^{K} \lambda_{\delta } I_{\delta } - H_{q} \left( d \right)I_{q} \le 0$$
(20)
$$\begin{aligned} & \mathop \sum \limits_{{\delta \in F\left( {E^{{\ell_{o} + d}} } \right)}}^{K} \lambda_{\delta } O_{\delta } - O_{q} \ge 0, \\ & \lambda_{\delta } \ge 0,\quad \delta \in F\left( {E^{{\ell_{o} + d}} } \right). \\ \end{aligned}$$
(21)

\(H_{q}^{*} \left( d \right)\) is called d-degree attractiveness of \(DMU_{q}\) from a specific level \(E^{{\ell_{o} }} .\) In this way, the stratification/context- dependent DEA can have more discriminating power on each stratification level. In this paper, we adopt the DEA-based stratification concept for the FLA problem. First, we stratify DMUs into efficiency level, such as \(E^{1} ,E^{2} , E^{3} , \ldots E^{L}\), using (16)–(18). Then we compute the attractiveness score (AS) for each DMU in the first level \(E^{1}\) against the DMUs in each lower level, such as \(E^{2} , E^{3} , \ldots E^{L}\), using (19)–(21) after setting \(\ell_{o}\) = 1. Then, we compute the average attractiveness score (AAS) for \(DMU_{q}\) in \(E^{1}\), \(AAS_{q}\), which is defined as

$$AAS_{q} = \frac{{\mathop \sum \nolimits_{d = 1}^{L - 1} H_{q}^{*} \left( d \right)}}{L - 1}.$$
(22)

Note that all DMUs in the first level \(E^{1}\) are same in terms of efficiency score φ* = 1 in (10) or \(\theta^{*} = 1\) in (13). We can identify the most efficient DMUs which have the highest value of AAS.

Now, we propose the following formal procedure of combining DEA and MOP model for the efficient FLA decision:

Step 1 (MOFLA formulation and pre-stratification)

  1. (1)

    Define objectives/goals for performance measures (PMs) to be considered. Then, classify PMs into p inputs and r outputs.

  2. (2)

    Formulate MOFLA problem as MOP model shown in Eqs. (1)–(9).

  3. (3)

    Set the value of weight for each PM, where each weight changes between 0 and 1 with an increment of Δ, where 0 ≤ Δ ≤ 1.

  4. (4)

    For each set of weights, solve the MOP model and call each solution as \(DMU_{\delta } , \quad \delta \, = 1, \, 2, \, \ldots , \, n\).

Step 2 (Stratification DEA)

  1. (1)

    Set \(\ell\) = 1 and construct \(E^{\ell }\) by evaluating each DMU in \(J^{\ell }\).

  2. (2)

    Set \(J^{\ell + 1} = J^{\ell } - E^{\ell } .\) If \(J^{\ell + 1} = \emptyset\), go to (3) after setting \(\ell = L.\) Otherwise, repeat (2) after setting \(\ell\) = \(\ell\) + 1.

  3. (3)

    Setting \(\ell_{o}\) = 1, compute \(H_{q}^{*} \left( d \right), d = 1, 2, \ldots , L - 1\), and compute \(AAS_{q}\), given in (22), for \(DMU_{q}\) in \(J^{{\ell_{o} }}\).

Step 3 Rank the DMUs in \(J^{{\ell_{o} }}\) based on the value of AAS, from highest to lowest.

Case study in South Carolina

Historic flooding tore through South Carolina (SC) in October 2015 when numerous rivers burst their banks, washing away roads, bridges, vehicles, and homes. Hundreds of people required rescue and the state’s emergency management department urged everyone in the state not to travel. The Federal Emergency Management Agency (FEMA) opened disaster recovery centers (DRCs) in several SC counties to help SC flood survivors. We use the problem of locating DRCs in SC as our case study. We assume that each DRC follows an (s, Q) policy to maintain its inventory and carries a safety stock to maintain the desired service level of β. To follow Step 1 in the formal procedure, we define objectives for PMs for our case study.

Objective 1: Minimize the total logistics cost (TLC)

Minimizing TLC has been the traditional objective of most FLA models. Given this problem setting, the TLC consists of the fixed cost of locating DRCs, the transportation/shipping cost from DRCs to the demand points, cycle stock cost, and safe stock cost to maintain the desired service level of β, and is given in Eq. (23):

$$\begin{aligned} {\text{TLC}}\; = & \;{\text{fixed cost}}\;{\text{for}}\;{\text{facilities}}\; + \;{\text{shipping cost}}\; + \;{\text{cycle}}\;{\text{stock}}\;\text{cost}\; + \;{\text{safe}}\;{\text{stock cost}} \\ = & \mathop \sum \limits_{j \in M}^{{}} f_{j} C_{j} + \mathop \sum \limits_{j \in M}^{{}} \mathop \sum \limits_{m \in M}^{{}} D_{m} d_{jm} y_{jm} c_{jm} \\ & \quad + \mathop \sum \limits_{j \in M}^{{}} C_{j} \left[ {\sqrt {2S_{j} h_{j} \mathop \sum \limits_{m \in M}^{{}} D_{m} y_{jm} } + h_{j} z_{\beta } \sqrt {\mathop \sum \limits_{m \in M}^{{}} L_{j} \sigma_{m}^{2} y_{jm} } } \right]. \\ \end{aligned}$$
(23)

Objective 2: Minimize the maximum coverage distance (MCD)

Ideally, each DRC should be located near the affected sites or demand points. Our assumption is that the shorter distance is, the shorter delivery time is. The second PM is the maximum coverage distance (MCD) to be minimized so that each demand point is covered by one of the DRCs within the endogenously determined distance. In other words, the objective for the second PM is equivalent to attempting to minimize the longest delivery distance between DRCs and the affected sites. If MCD is too large, it will cause inefficiency to the resulting FLA scheme, since it will take an excessive time for a DRC to deliver to a remotely located site. Now, MCD is given by

$$MCD = \hbox{max} \left\{ {d_{jm} y_{jm} } \right\}, \quad \forall j\;{\text{and}}\;m.$$
(24)

Objective 3: Minimize the maximum demand-weighted coverage distance (MDWCD)

MCD does not consider demand associated with each site but the longest distance only. Thus, it would be important to consider not only MCD but also the demand-weighted distance. The next PM is the maximum demand-weighted coverage distance (MDWCD) to be minimized, which is given by

$$MDWCD = \hbox{max} \left\{ {D_{m} d_{jm} y_{jm} } \right\}, \quad \forall i, j,\;{\text{and}}\;m.$$
(25)

Objective 4: Maximize the covered demands in case of emergency (CDE)

Deckle et al. (2005) studied the problem of minimizing the total number of DRCs in Alachua County, Florida, subject to each county resident being close to a DRC must be less than a given threshold. In fact, each location should be within a certain distance of the nearest DRCs to be served in case of emergency. In addition, there may be some environmental constraints or difficulties such as road damage and weather issues, which may limit the maximum coverage distance given by MCD in (24). Thus, the maximum effective coverage distance (MECD), denoted by \(D^{c}\), may be shorter than MCD. However, while minimizing MCD, it is desirable to maximize the covered demands within \(D^{c}\). The fourth PM is the covered demands in case of emergency (CDE) to be maximized, which is expressed as

$$CDE = \mathop \sum \limits_{m \in M} \mathop \sum \limits_{j \in J} D_{m} \gamma_{jm} y_{jm} ,$$
(26)

where binary parameters, \(\gamma_{jm}\), are

$$\gamma_{jm} = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {{\text{if}}\;d_{jm} \le D^{c} } \hfill \\ {0,} \hfill & {{\text{otherwise}} .} \hfill \\ \end{array} } \right.$$
(27)

Objective 5: Maximize the covered demands in case of emergency (ENDS)

To enhance DRC’s resilience, it would be important to locate DRCs at the safest areas if possible, so that the chances of DRCs’ being disrupted are minimized. We assume that if a DRC is disrupted, it can’t handle the supplies being delivered to the affected site. Now, our fifth PM is the expected number of non-disrupted supplies (ENDS) to be maximized, which is given by

$$ENDS = \mathop \sum \limits_{j \in M}^{{}} C_{j} \left( {1 - p_{j} } \right)\mathop \sum \limits_{m \in M}^{{}} \left( {y_{jm} D_{m} } \right),$$
(28)

where \(p_{j}\) denotes the risk probability of DRCj’s being disrupted. Note that both \(C_{j}\) and \(y_{jm}\) in (28) are decision variable, thus Eq. (28) is no more a linear combination. To linearize it, we define \(Z_{jm} = C_{j} *y_{jm}\) and rewrite (28) as

$$ENDS = \mathop \sum \limits_{j \in M}^{{}} \left( {1 - p_{j} } \right)\mathop \sum \limits_{m \in M}^{{}} D_{m} Z_{jm} ,$$
(29)

where

$$\hbox{max} \left\{ {0, C_{j} + y_{jm} - 1} \right\} \le Z_{jm} \le \frac{{C_{j} + y_{jm} }}{2}.$$

Now, among the above five PMs, the first three PMs, TLC, MCD, and MDWCD, to be minimized would be classified as inputs, \(I = \left\{ {TLC,\; MCD,\;MDWCD} \right\}\), whereas the other two, CDE and ENDS, to be maximized as outputs, \(O = \left\{ {CDE, ENDS} \right\}.\) Let the nonnegative deviation variables, \(\left\{ {\omega_{TLC}^{ + } , \omega_{MCD}^{ + } , \omega_{MDWCD}^{ + } , \omega_{CDE}^{ - } , \omega_{ENDS}^{ - } } \right\}\) denote the amounts by which each value of TLC, MCD, MDWCD, CDE, and ENDS deviates from each target value, the minimax variable from Eq. (1), Q, is expressed as

$$Q \, = \hbox{max} \left\{ {\alpha_{1}^{ + } \frac{{\omega_{TLC}^{ + } }}{{TLC_{\text{min} } }}, \alpha_{2}^{ + } \frac{{\omega_{MCD}^{ + } }}{{MCD_{\text{min} } }}, \alpha_{3}^{ + } \frac{{\omega_{MDWCD}^{ + } }}{{MDWCD_{\text{min} } }},\alpha_{1}^{ - } \frac{{\omega_{CDE}^{ - } }}{{CDE_{\text{max} } }}, \alpha_{2}^{ - } \frac{{\omega_{ENDS}^{ - } }}{{ENDS_{\text{max} } }}} \right\}.$$
(30)

Then, we formulate a multi-objective nonlinear programming (MONLP) model with the minimax objective as follows:

$$\hbox{min} \;Q\;{\text{in}} \;\left( {30} \right)$$
(31)

subject to

$$\alpha_{1}^{ + } \frac{{\omega_{TLC}^{ + } }}{{TLC_{\text{min} } }} \le Q,$$
(32)
$$\alpha_{2}^{ + } \frac{{\omega_{MCD}^{ + } }}{{MCD_{\text{min} } }} \le Q,$$
(33)
$$\alpha_{3}^{ + } \frac{{\omega_{MDWCD}^{ + } }}{{MDWCD_{\text{min} } }} \le Q,$$
(34)
$$\alpha_{1}^{ - } \frac{{\omega_{CDE}^{ - } }}{{CDE_{\text{max} } }} \le Q,$$
(35)
$$\alpha_{2}^{ - } \frac{{\omega_{ENDS}^{ - } }}{{ENDS_{\text{max} } }} \le Q,$$
(36)
$$TLC\;{\text{in}}\;\left( {23} \right) - \omega_{TLC}^{ + } = TLC_{\text{min} } ,$$
(37)
$$MCD\;{\text{in}}\;\left( {24} \right) - \omega_{MCD}^{ + } = MCD_{\text{min} } ,$$
(38)
$$MDWCD\;{\text{in}}\;\left( {25} \right) - \omega_{MDWCD}^{ + } = MDWCD_{\text{min} } ,$$
(39)
$$CDE\;{\text{in}}\;\left( {26} \right) + \omega_{CDE}^{ - } = CDE_{\text{max} } ,$$
(40)
$$ENDS\;{\text{in}}\;\left( {29} \right) + \omega_{ENDS}^{ - } = ENDS_{\text{max} } ,$$
(41)
$$\mathop \sum \limits_{g = 1}^{3} \alpha_{g}^{ + } + \mathop \sum \limits_{g = 1}^{2} \alpha_{g}^{ - } = 1, \;{\text{and}}$$
(42)
$${\text{Constraints}}\;\left( 5 \right) - \left( 9 \right).$$

This competes (1) and (2) in Step 1.

Solving MONLP and applying DEA

As Hong et al. (2015) use major disaster declaration records in South Carolina (SC) from FEMA database, we use the same data. Forty-six counties in SC are clustered based on proximity and populations into twenty counties. Then, one city from each clustered county based on a centroid approach is chosen, assuming that all population within the clustered county exists in that city. The distance between these cities is considered to be the distance between counties. We assume that when a major disaster is declared, the DRC in that county can’t function due to the damaged facility and supply items and closed or unsafe roads and highways. The FEMA database provides a list of counties where a major disaster was declared. Based on the historical record available in the FEMA database and the assumption, the risk probability for each site (a county or a clustered county) is calculated in Table 1. For example, the probability that a facility in Walterboro will suffer from a major disaster and will be shut down is 25%. For the case study, we hypothetically pre-determine and list the input parameters in Table 2.

Table 1 Data for locations of DRCs
Table 2 Input data used for the case study

As shown in the MONLP model given by Eqs. (30)–(42), it is necessary to find the target value for each PM. These values can be obtained by setting the corresponding weight equal to 1 and solving the model. For example, setting α = (0, 1, 0, 0, 0) and solving the MONLP model yield the target value of MCD, \(MCD_{\text{min} } .\) Now, we solve and summarize the target values of five PMs, \(TLC_{\text{min} }\),\(MCD_{\text{min} } , MDWCD_{\text{min} }\), \(CDE_{\text{max} }\), and \(ENDS_{\text{max} }\) in Table 3. Using these target values in Table 3, the MONLP model is solved for various values of α, where each weight changes between 0 and 1 with an increment of 0.1. There are 1001 configurations arising out of the combinations of the setting of α under the condition given in Eq. (42). After solving the model, we reduce 1001 configurations into 542 consolidated configurations, based upon the values of the five performance measures. To apply DEA, we consider each of 542 configurations as a DMU. In fact, each configuration or DMU denotes an FLA scheme.

Table 3 The target values of five performance measures

There are two important issues in carrying out an efficiency study using DEA. One is the homogeneity of DMUs, and the other one is the isotonicity property. Since all DMUs are generated by the same MONLP model under different weights, the homogeneity of DMUs is satisfied to generate meaningful DEA results. DEA requires that there be an isotonic (order preserving) relation between inputs and outputs, i.e., an increase in any input should not result in a decrease in any output (see Golany and Roll 1989). We use the correlation analysis between all inputs and outputs to test this relationship. We find that the correlation coefficients (ρ) between TLC versus CDE and between MDWCD and CDE are negative, such as ρ(TLC vs. CDE) = − 0.6623 and ρ(MDWCD vs. CDE) = − 0.5467, whereas all other correlation coefficients are positive. Negative correlation coefficients would indicate that CDE should not be treated as an output to apply DEA. As Charnes et al. (1984) invert the values of some factors before they are entered into the analysis, we define nCDE as the non-covered demand in case of emergency, which is obtained by subtracting CDE from the total demand of 4,496 K as seen from Table 1. Before applying DEA, we delete CDE from the set of outputs and add nCDE as a new input, which results in \(I = \left\{ {TLC, MCD, MDWCD, nCDE} \right\}\) and \(O = \left\{ {ENDS} \right\}\). Now, the correlation coefficient between nCDE and ENDS is positive, ρ(nCDE vs. ENDS) = 0.1844. In fact, all correlation coefficients between all inputs and an output become positive, which implies that the isotonicity property would not be violated to implement DEA for the case study. This completes (3) and (4) in Step 1.

Now, using the stratification DEA method explained in Step 2, we stratify 542 DMUs into 24 levels, \(E^{1} , E^{2} , \ldots E^{24}\), and identify DMUs in each level. To rank DMUs in Level 1, \(E^{1} ,\) we compute 23 attractiveness scores and then the AAS given in Eq. (22). By excluding the weakly efficient DMUs in \(E^{1}\), we find that there are thirty-one (31) fully efficient DMUs (FEDMUs). Based upon the AASs, we rank all 31 FEDMUs. This completes Step 3. Each DMU represents the optimal location of DRCs and their allocation scheme. In Table 4, we present all FEDMUs in \(E^{1}\), as well as the least efficient DMUs in the last two levels, \(E^{23}\) and \(E^{24}\). We also present the combination of weights, the value of each PM, CRS efficient score (ES), and AAS and its ranking. For example, we see that DMU4 generated by solving MONLP with either the combination of weights, (0.0, 0.0, 0.0, 0.7, 0.3) or (0.0, 0.0, 0.0, 0.6, 0.4), yields ($36,842 K, 267 miles, 44507 K miles, 3721 K, 1420 K) as the optimal values of TLC, MCD, MDWCD, nCDE, and ENDS. It obtains a complete and fully efficient score of 1.000, yields an AAS of 1.2587 with its ranking of 18. As shown in Table 4, all of FEDMUs in \(E^{1}\) possess a perfect ES of 1.000, whereas DMU10 in the last level, \(E^{24}\), is considered as the least efficient with ES of 0.5970. For the purpose of comparison with inefficient DMUs, the two DMUs in \(E^{23}\), DMU251 and DMU277, are presented too. It appears that DMU494 in \(E^{1}\) which has the highest AAS of 1.4545, yields the most efficient FLA scheme.

Table 4 DMUs in the first level and last two levels

In Table 5, we present the optimal DRC location–allocation schemes for top six DMUs in \(E^{1}\) and three DMUs in \(E^{23}\) and \(E^{24}\). All of the FLA schemes in Table 5 are depicted in Figs. 1 and 2. A notable observation from Table 5 and Fig. 1 is that all six efficient DMUs in Level 1 select {Charleston, Conway, Greenville, Lexington, Rock Hill} as the optimal locations for DRCs, and their allocations show very similar patterns with just a few exceptions. For example, DRC Lexington covers {Aiken, Columbia, Sumter} in DMU494 and DMU463, {Aiken, Columbia, Orangeburg} in DMU96 and DMU82, {Aiken, Columbia, Orangeburg, Sumter} in DMU539, and {Aiken, Columbia} in DMU97. DRC Rock Hill covers {Sumter} in DMU90. We can observe the only difference in allocation for the two most efficient DMUs, DMU494 and DMU539, such that {Orangeburg} will be covered by DRC Charleston in DMU494 but by DRC Lexington in DMU539. Also, from Table 4, we see that DMU494 yields higher values of input, TLC, as well as of output, ENDS, than those of DMU539, while both DMUs yield the same values of other performance measures, MCD, MDWCD, and, nCDE. Note that the optimal location–allocation schemes of DRCs for the three DMUs in the two lowest levels, \(E^{23}\) and \(E^{24}\), are quite different from the schemes for the four efficient DMUs in \(E^{1}\). All of them commonly select {Beaufort, Greenwood, Greenville} as the optimal locations for DRCs and their allocations are quite different, as shown in Fig. 2. From Table 4, we observe that their input values, as well as outputs, are higher than the DMUs in \(E^{1}\).

Table 5 DRC location and allocation for some efficient and inefficient DMUs
Fig. 1
figure 1

Efficient facility location–allocation schemes for DMU494, DMU539, DMU463, DMU97, DMU96, and DMU82

Fig. 2
figure 2

Inefficient facility location–allocation schemes for DMU251, DMU277, and DMU10

In Fig. 1, we observe that Hampton is covered by DRC Greenville in the scheme of DMU463, while it is covered by DRC Charleston in other efficient DMUs. Hampton is 190 miles away from Greenville. From Table 4, we see that MCD for DMU463 is 190 miles, but MWDCD is the target (minimum) value of 13,150. The reason is that since Hampton has a small demand of 33 K from Table 1, a relatively high value of MCD does not affect MWDCD. The two DRC locations, Lexington and Charleston, have higher risk probabilities, 0.313 and 0.25, respectively, than Greenville with 0.125. In addition, Hampton is 84.25 and 78 miles away from Lexington and Charleston, which exceed the maximum effective coverage distance, \(D_{c}\), of 35 miles. Thus, without sacrificing MWDCD and CDE, DRC Greenville for DMU463 covers Hampton to enhance ENDS at the sacrifice of MCD and TLC. Note that finding the location of DRC is a strategic decision plan. In an operational level, if neither DRC Charleston nor DRC Lexington is disrupted, it would enhance efficiency to let either Charleston or Lexington, instead of DRC Greenville, cover Hampton if it has enough capacity. In this way, practitioners and decision-makers can make flexible operational decisions once they have efficient and robust FLA schemes on their hands.

Figure 2 depicts the schemes of DMU251, DMU277, and DMU10, which generally yield high values of inputs and an output, as seen from Table 4. Note that the most inefficient scheme DMU10 with ES of 0.5937, which is generated with α = (0, 0, 0, 0, 1), dominates DMU251 and DMU277 regarding all inputs and an output. On the contrary to other schemes, the optimal number of DRCs generated by DMU10 is only four and all four DRCs, {Anderson, Beaufort, Greenwood, Greenville}, are located at the sites whose risk probabilities are the lowest ones, such as 0.063, 0.125, 0.125, and 0.125. In other words, DMU10 will guarantee the maximum expected demand covered/satisfied under the risk of disruptions, at the sacrifice of all input performance measures. If satisfying demand under the high risk of disruptions has the highest priority, decision-makers will adopt the scheme of DMU10. Decision-makers may want to consider DMU251 generated with α = (0.1, 0, 0, 0, 0.9) rather than DMU10, since they can observe that switching from DMU10 to DMU251 would reduce ENDS by 133 K (= 4027–3894 K) (3.3%), but all other inputs are significantly saved: TLC, MCD, MDWCD, and nCDE are reduced by $14,034 K (29%), 68 miles (31%), 28282 K miles (45%), 636 K (19%), respectively.

To identify robust locations of DRCs, we list, in Table 6, the locations for DRCs for all DMUs in \(E^{1}\) and list the DRC location sets with the frequencies (≥ 2) in descending order in Table 7. In Table 8, we also find the frequency of each DRC location and list the locations, frequencies along with the corresponding percentages in descending order. As shown in Tables 6 and 7, a set of five sites for the location of DRCs, {Charleston, Conway, Greenville, Lexington, Rock Hill}, is selected 15 times out of 31 cases. As stated before and shown in Fig. 1, the six most efficient DMUs select these DRC locations. As expected, these five locations are ranked from 1st to 5th in terms of the frequency of DRC location as shown in Table 8. Identifying the robust locations of DRCs will be important for decision-makers to decide an alternative location when the risk of facility disruptions is very high. For example, from Tables 6 to 7, {Beaufort} can replace {Conway} or {Rock Hill} as a DRC location if either {Conway} or {Rock Hill} is disrupted. Similarly, {Aiken, Columbia} can replace {Lexington, Rock Hill} which is under the high risk of disruptions. In addition, decision-makers can exclude some sites from the candidate of DRC location. For example, the seven sites, {Bennettsville, Georgetown, McCormick, Moncks Corner, Orangeburg, Spartanburg, Sumter}, are never selected as a DRC location.

Table 6 Locations selected for DRCs for all DMUs in Level 1
Table 7 Frequency of robust DRC locations for the DMU in Level 1
Table 8 Frequency of locations selected for DRCs for the DMUs in Level 1

Conclusions

In this study, we propose an innovative procedure of deciding efficient facility location–allocation (FLA) by combining data envelopment analysis (DEA) methodology with multi-objective programming model. We accomplish by first using a multiple objective nonlinear programming (MONLP) with the minimax objective approach, which would yield more balanced options, to generate all inputs and outputs for each configuration arising out of the combinations of the weight factor α. Each location–allocation scheme generated by solving MONLP for a given set of α is treated as a decision-making unit (DMU). For the generated DMUs, we apply DEA to identify efficient location–allocation schemes, and then we use the context-dependent/stratification DEA to rank the efficient schemes. The contribution of this paper is that our procedure generates inputs and outputs directly from the multi-objective mathematical model, so that DEA would be more realistically applied to identify the efficient schemes. As stated before, the literature available so far has assumed that all inputs and outputs for locating a facility to a site are given or known unrealistically to apply DEA for their models.

Through case study using actual major disaster declaration records in South Carolina, we demonstrate the applicability of our procedure and observe that our combined approach with MONLP model and DEA for the FLA problem performs well. The MONLP model generates various FLA schemes based upon the weights given to each performance measure which is classified as an input or output. Out of those FLA schemes, DEA identifies efficient schemes, and then context-dependent DEA ranks those efficient schemes and allows us to identify the most efficient FLA schemes. Our new approach to the MOFLA problems would provide many insights to academians as well as practitioners and researchers.

In this paper, we assume that if a facility becomes unavailable or shut down due to disruptions, all the demand for the sites assigned to the disrupted facility is lost. For future research, it would be interesting to consider the backup supply from the undisrupted facility, so that the demand for some sites would be satisfied.