1 Introduction

The urban and rural transportation systems, as the lifeline of urban and rural development, link all aspects of production, exchange, distribution, and consumption. With rapid developments of e‒business and urbanization in developing countries such as China, logistics and shipping demands between urban and rural regions have been growing fast, thanks to continuously improved transportation infrastructures. According to the annual report of agricultural products e‒commerce development (ARAPED, 2022), the average annual growth rate of the Chinese rural online retail sales in the past seven years (2015‒2021) reached 28.7% (from 3.5 to 20.5 trillion RMB), as shown in Fig. 1.Footnote 1 Both the nationwide and rural online retail sales had achieved rapid growth in the past years despite the COVID‒19 pandemic. Online retail sales in rural regions account for a significant proportion of the nation’s total online retail sales. To accommodate the rapidly growing logistics demands between urban and rural regions, the supporting logistics network needs to be upgraded for efficient urban‒rural cargo flow.

Fig. 1
figure 1

Nationwide vs. rural online retail sales in China during 2015‒2021

In this regard, the Chinese government is speeding up the integration of rural e‒commerce and express delivery systems to better link production and consumption between urban and rural areas. According to the statistics recently issued by the State of Post Bureau of China, express delivery services now cover almost all towns in China, with over 37 billion express packages shipped to and from China’s rural regions in 2021. The associated exchanges of industrial and agricultural products between urban and rural areas reached 1.85 trillion RMB (or US$291 billion). However, there are still some prominent problems in urban and rural deliveries, such as the “last‒mile” problem and insufficient village logistics service points. Coordinated development between urban and rural logistics systems is a promising solution to these issues so as to realize rural revitalization and common prosperity.

In the past decades, the urban logistics industry received considerable investments and attention in many countries and areas around the world (Lagorio et al., 2016; Savelsbergh and Van Woensel, 2016; Buyukozkan & Ilicak, 2022). However, little attention was paid to the rural logistics industry. This may be attributed to the fact that the demand of rural logistics is more dispersed and often less‒developed than that of urban logistics. Moreover, the design and operation of urban and rural networks are usually implemented separately. As a result, the logistics costs for shipments between urban and rural regions are usually high, and the associated logistics efficiency and inter‒regional connectivity tend to be low. In order to address these issues, the Chinese government has recently implemented the program of “Integrated Urban‒Rural Transportation System Development” (MOT, 2017). So far, a total of 113 counties have been selected as the first two batches of pilot areas in this program.Footnote 2 This provides a favorable condition for the development of an integrated hierarchical urban‒rural logistics network linking urban, town and village areas, on a basis of hub‒and‒spoke (H&S) network configuration. For example, with this program, Jiangling county’s government in Hubei province of central China has built 2 urban hubs, 4 town hubs, and 65 village hubs to provide the logistics services to a total of 8 towns and 107 villages in that county.Footnote 3 In general, there are two types of nodes in a typical H&S network: hub node and spoke node. The location of a hub largely determines the express delivery cargo distribution and the connections between the hub and spoke nodes (Shang et al., 2021; Zhong et al., 2018). In a typical urban‒rural logistics network, there are three types of hubs and four kinds of hierarchical connections. The three hub types include urban logistics hubs, town logistics hubs and village logistics hubs. Accordingly, there are four connection layers, namely the top layer connecting urban logistics hubs, the second layer connecting town logistics hubs and urban logistics hubs, the third layer connecting village logistics hubs and town logistics hubs, and the bottom layer connecting spokes and hubs (urban, town or village hubs).

With the rapid development of rural e‒commerce activities, the rural logistics demand has been growing rapidly in recent years. However, logistics demand usually fluctuates by the time of day, day of week, and seasons due to factors such as economic growth, population change, information technology improvement, weather, and various disruptions such as the COVID‒19 pandemic, the effects of which cannot be accurately predicted. The uncertainty in the logistics demand could significantly affect the service price and cost, and the efficiency of the logistics network. Apparently, a frequent adjustment of the urban‒rural logistics network is infeasible, partly due to the high cost that it can cause (e.g., it can be very costly to set up a new hub and the feeder networks even if it is possible to re‒deploy equipment to another location). Therefore, it is important to optimize the design of the integrated urban‒rural logistics network in the presence of uncertain demand.

In light of the above research needs, this paper addresses the challenge of integrated design of the urban‒rural logistics networks under demand uncertainty. The main contributions of this study are two‒fold. First, a hierarchical hub location model is proposed for the design of the urban‒rural logistics networks, which minimizes the expected total system cost by determining the locations, number, and capacities of urban, town and village hubs. Different from the traditional urban logistics network design, the model proposed in this paper is an integrated optimization of the urban and rural logistics networks, in which the interactions among “urban‒town‒village” logistics hubs, and among hub‒spoke connections in the urban‒rural logistics system are considered. The effects of hub capacity constraints for each hub type are also incorporated, together with the effects of logistics demand uncertainty using a demand scenario‒based approach. Second, a demand scenariobased branchand‒Benderscut algorithm is developed to solve the proposed model. A case study of Jiangling urban‒rural region in Hubei region of China is carried out for illustration purpose. The results generated by the proposed algorithm are compared with those obtained by GUROBI solver and the practical scheme being implemented in the region.

The remainder of this paper is organized as follows. In the next section, some related literature is reviewed. Section 3 formulates the model for the integrated design of the urban‒rural logistics network. Section 4 presents the branch‒and‒Benders‒cut algorithm. In Sect. 5, a case study is provided to illustrate the application of the proposed model and solution algorithm. Section 6 concludes this paper and provides suggestions for further studies.

2 Literature review

The integration of urban‒rural transportation networks plays a crucial role in the new urbanization and rural revitalization in China, and has attracted widespread attention in the last decade. For example, the studies of Yu et al. (2013), Zhong et al. (2018) and Lei et al. (2022) addressed the public transit hub location problems between urban and rural regions. Wang and Sun (2016) analyzed the relation of transportation infrastructure and rural development in China, and found that investment in transport infrastructure has a positive impact on China’s rural development. Qu et al. (2022) pointed out that low‒cost, accessible transportation systems connecting rural and urban areas are crucial for creating an efficient rural‒urban partnership. However, these studies mainly focused on public passenger transport systems, whereas little attention has been paid to the design of urban‒rural logistic networks.

The design of urban‒rural logistics network can be considered as an extension of common hub location problems, also referred to as H&S network design problems in the literature. Among the hub location problems, the p‒hub median problem and p‒hub center problem are the most prevalent specifications (Campbell, 1994; Yaman, 2011). Since the pioneering work of O'kelly (1986) in modeling of the p‒hub median problem, the hub location problems have been well studied in the literature (Contreras et al., 2012; Contreras and O’Kelly, 2019; Alumur et al., 2021). For the convenience of readers, we have summarized in Table 1 some related studies about the hierarchical hub location problems, in terms of network structure, decision variables, uncertainty source and solution method.

Table 1 Summary of relevant studies on network design

It can be observed in Table 1 that the previous studies have mainly focused on hierarchical hub network (without direct spoke‒spoke links) and hybrid hub network (allowing direct spoke‒spoke links). Zhao et al. (2016) and Yao et al. (2019) proposed a hybrid H&S express network decision model and found that compared with the pure H&S structure, the hybrid H&S network can contribute to cost reduction, and at the same time can decrease detour and increase the timeliness and service level. The p‒median problem in the hierarchical hub location models aims to determine the hub locations in a k‒level network to minimize the total cost (Wang et al., 2020; Zhong et al., 2018). Shang et al. (2021) proposed a model for the design of a hierarchical urban‒rural public passenger transport hub network. In addition to the difference in network structures, some studies have also considered other constraints. For example, Dukkanci and Kara (2017) considered travel time bounds in the design of a hierarchical H&S network. Alumur et al. (2018) presented a hub location model considering the service time limit and congestion at hubs. Wang et al. (2021) proposed a hub location model of agricultural product transportation network for exploring the impact of delivery time limit on the hub location. Kaveh et al. (2021) proposed a hub location model with an objective function of minimizing the total transportation and waiting time of the network. A limitation of these studies is the assumption of no direct link between spokes. Note that direct transportation may be more economical than connection via hubs when the logistics demand between the two spokes is sufficiently large. Therefore, in this paper a hierarchical H&S network allowing direct connections between spokes (i.e., a hierarchical hybrid H&S network) is analyzed.

Table 1 also shows that the existing studies on the hub location problem mainly considered the number and location of hubs as decision variables, whereas the capacity of hub has usually been ignored or pre-given. In general, hub capacity should meet the logistic demand for that hub, and thus it is necessary to consider hub capacity constraints in the model (Fotuhi & Huynh, 2018; Mišković et al., 2017). On the other hand, the capacities of hubs generally have a significant effect on the hub locations and flow allocation, and thus the locations and capacities of hubs should be jointly determined. In this regard, Correia and Captivo (2003), Correia et al. (2010) and Irawan and Jones (2019) addressed capacity‒constrained hub location problems with multiple capacity levels, in which hub capacity is a decision variable. Yet, they did not propose an effective exact algorithm to solve the capacitated hub location problem, and the uncertainty is not incorporated.

In addition, there are some studies that considered uncertain hub location problems. In reality, hub location is generally a long‒term decision, and the uncertainty in demand side and/or supply side may lead the current location decisions to be inefficient for future logistics demand. However, frequent changes of hub location are unrealistic due to substantial costs involved. The uncertain facility location problems can be classified into the stochastic facility location problems and the robust facility location problems (Soyster, 1973; Shang et al., 2021). Both can capture the effects of uncertainty via various possible scenarios. The scenario probability is known in the stochastic optimization models. However, such a probability may be unknown in the robust optimization models. Robust location models are usually suitable for the situations in which it is difficult to collect data, or there is a lack of data about the probability distribution of the uncertain parameters (Shavarani et al., 2019; Zokaee et al., 2017).

With the literature review above, it is clear that the existing relevant studies mainly focused on the passenger public transport network and/or the urban transport network, with little attention paid to the modelling of comprehensive urban and rural logistics network. The integrated design of the urban and rural logistics networks are especially important for the mobility of human and goods between the two types of regions, the development of rural economy, and the reduction in the gap between urban and rural region development. Moreover, most studies only considered the locations of hubs, but not the hub capacity as decision variable, which can be an important issue in the design of urban‒rural hierarchical network.

3 Model formulation

3.1 Problem description

This paper aims to contribute to the integrated design of urban and rural logistics networks under demand uncertainty. The locations, number and capacities of the hubs in the urban and rural logistics networks will be optimized to minimize the expected total system cost. For illustration purpose, Fig. 2 shows a typical hierarchical urban‒rural logistics network system with 19 hierarchical hubs (including 3 urban hubs, 6 town hubs and 10 village hubs) and 11 spoke nodes (i.e., small villages in rural region). Different from the traditional urban logistics network design, this paper focuses on an integrated optimization of the urban and rural logistics networks, in which the interactions among hierarchical “urban‒town‒village” logistics hubs are incorporated. The locations and capacities of multi‒type hubs are thus endogenously determined by the model proposed in this paper. The hub capacity constraints for each hub type, which require that the logistics demand of each hub does not exceed its capacity, are also considered. In addition, the effects of logistics demand uncertainty are also modeled through using a demand scenario‒based approach.

Fig. 2
figure 2

A typical hierarchical urban‒rural logistics network

To facilitate the presentation of the essential ideas without losing generality, the following basic assumptions are made in this paper.

A1 The urban hub network is a complete network, with each pair of urban hubs connected directly. This assumption has been widely adopted in the literature, e.g., see Zhong et al. (2018).

A2 Town hubs connect to their associated urban hub, and the town hubs with the same urban hub connect with each other directly (e.g., see Alumur et al., 2012).

A3 Each village hub is only assigned to a town hub, and each town hub is only assigned to an urban hub (e.g., see Shang et al., 2021).

A4 Each spoke (i.e., non‒hub) node connects to one hub only. Direct connections between spoke nodes are not allowed (e.g., see Yaman, 2011).

3.2 Definitions of parameters and variables

(i) Sets

\(N\): Set of all nodes.

\(P\): Set of candidate urban hub locations, \(P \subseteq N\).

\(R\): Set of candidate village hub locations, \(R \subseteq N\).

\(Q\): Set of candidate town hub locations, \(Q \subseteq N,Q \cap R = \emptyset\).

\(H\): Set of all candidate hub locations, \(H = P \cup R \cup Q\).

\(Q_{k}\): Set of alternative capacity levels for urban hub \(k \in P\).

\(Z_{m}\): Set of alternative capacity levels for candidate town hub \(m \in Q\).

\(T_{e}\): Set of alternative capacity levels for candidate village hub \(e \in R\).

\(S\): Set of all stochastic demand scenarios, \(s \in S\).

(ii) Parameters

\(p_{s}\): Probability of demand scenario s, with \(\sum\nolimits_{s \in S} {p_{s} } = 1\).

\(w_{ij}^{s}\): Logistics demand between origin‒destination (O‒D) pair ij under demand scenario s.

\(D_{ij}\): Distance between O‒D pair ij.

\(\Lambda\): Transportation cost per unit of cargo per km.

\(c_{ij}\): Transportation cost between O‒D pair ij per unit of cargo, with \(c_{ij} = \Lambda D_{ij}\).

\(\alpha_{1}\): Discount coefficient of transportation cost per unit of cargo on connections between urban hubs.

\(\alpha_{2}\): Discount coefficient of transportation cost per unit of cargo on connections between town hubs and between urban hubs and town hubs.

\(\alpha_{3}\): Discount coefficient of transportation cost per unit of cargo on connections between village hubs and town hubs.

\(C_{kq}\): Construction cost of candidate village hub k with capacity level \(q \in Q_{k}\).

\(C_{mz}\): Construction cost of candidate town hub m with capacity level \(z \in Z_{m}\).

\(C_{et}\): Construction cost of candidate village hub e with capacity level \(t \in T_{e}\).

\(M_{kq}^{{}}\): Capacity of candidate village hub k with capacity level \(q \in Q_{k}\).

\(M_{mz}\): Capacity of candidate town hub m with capacity level \(z \in Z_{m}\).

\(M_{et}\): Capacity of candidate village hub e with capacity level \(t \in T_{e}\).

\(P_{k}^{\prime }\): Minimum number of urban hubs to be constructed.

\(P_{k}\): Maximum number of urban hubs to be constructed.

\(P_{m}^{\prime }\): Minimum number of town hubs to be constructed.

\(P_{m}\): Maximum number of town hubs to be constructed.

\(P_{e}^{\prime }\): Minimum number of village hubs to be constructed.

\(P_{e}\): Maximum number of village hubs to be constructed.

\(\omega\): Penalty coefficient.

(iii) Decision variables

\(h_{kq}\): 0‒1 variable, equal to 1 if urban hub is located at node k with capacity level q, and 0 otherwise.

\(d_{ez}\): 0‒1 variable, equal to 1 if town hub is located at node e with capacity level z, and 0 otherwise.

\(b_{mt}\): 0‒1 variable, equal to 1 if village hub is located at node m with capacity level t, and 0 otherwise.

\(x_{ie}\): 0‒1 variable, equal to 1 if node i is assigned to hub e, and 0 otherwise; if node e is a village hub, then \(x_{ee} = 1\).

\(u_{vm}\): 0‒1 variable, equal to 1 if village hub v is assigned to town hub m, and 0 otherwise; if node m is a village hub, then \(u_{mm} = 1\).

\(y_{mk}^{{}}\): 0‒1 variable, equal to 1 if town hub m is assigned to urban hub k, and 0 otherwise; if node k is a urban hub, then \(y_{kk} = 1\).

\(z_{mn}\): 0‒1 variable, equal to 1 if there is a connection between town hub m and non‒village hub n, and 0 otherwise.

\(a_{vm}^{is}\): Logistics volume from village hub v to town hub m which originates at node i under demand scenario s.

\(g_{mn}^{is}\): Logistics volume from town hub m to non‒village hub n which originates at node i under demand scenario s.

\(f_{kl}^{is}\): Logistics volume from urban hub k to town hub l which originates at node i under demand scenario s.

\(\eta_{{s{\text{e}}}}\): Logistics volume exceeding capacity of village hub e under demand scenario s.

\(\eta_{sm}\): Logistics volume exceeding capacity of town hub m under demand scenario s.

\(\eta_{sk}\): Logistics volume exceeding capacity of urban hub k under demand scenario s.

3.3 The model

Based on the assumptions and notations presented in the previous sections, the stochastic model for the urban‒rural logistics hub network design is presented as follows:

$$ \begin{aligned} \min F = & \sum\limits_{{z \in Z_{m} }} {\sum\limits_{m \in Q} {C_{mz} } d_{mz} } + \sum\limits_{{t \in T_{e} }} {\sum\limits_{e \in R} {C_{et} } b_{et} } + \sum\limits_{{q \in Q_{k} }} {\sum\limits_{k \in P} {C_{kq} } h_{kq} }\\ &+ \sum\limits_{s \in S} {p_{s} \sum\limits_{i \in N} {\sum\limits_{j \in N} {\left( {w_{ij}^{s} + w_{ji}^{s} } \right)} } \sum\limits_{e \in H} {c_{ie} } x_{ie} } \\ & + \sum\limits_{s \in S} {p_{s} \sum\limits_{i \in N} {\sum\limits_{k \in P} {\sum\limits_{{l \in P\backslash \{ k\} }} {\alpha_{1} } } } c_{kl} f_{kl}^{is} } + \sum\limits_{s \in S} {p_{s} } \sum\limits_{i \in N} {\sum\limits_{m \in Q} {\sum\limits_{{k \in P \cup Q\backslash \{ m\} }} {\alpha_{2} } } } c_{mk} g_{mk}^{is}\\ & + \sum\limits_{s \in S} {p_{s} } \sum\limits_{i \in N} {\sum\limits_{v \in R} {\sum\limits_{{m \in Q\backslash \{ v\} }} {\alpha_{3} } } } c_{vm} a_{vm}^{is} \\ & + \omega_{1} \sum\limits_{s \in S} {p_{s} } \sum\limits_{e \in R} {\eta_{se} } + \omega_{2} \sum\limits_{s \in S} {p_{s} } \sum\limits_{m \in Q} {\eta_{sm} } + \omega_{3} \sum\limits_{s \in S} {p_{s} } \sum\limits_{k \in P} {\eta_{sk} } \, \\ \end{aligned} $$
(1)
$$ {\text{s.t.}}\quad \sum\limits_{e \in H} {x_{ie} } = 1,\quad \forall i \in N $$
(2)
$$ x_{ie} \le x_{ee} ,\forall i \in N,e \in H $$
(3)
$$ \sum\limits_{m \in Q} {u_{vm} } = x_{vv} ,\forall v \in R,t \in T $$
(4)
$$ u_{vm} \le u_{mm} ,\forall m \in Q,\forall v \in R,z \in Z $$
(5)
$$ \sum\limits_{k \in P} {y_{mk} } = u_{mm} ,\forall m \in Q,z \in Z $$
(6)
$$ y_{mk} \le y_{kk} ,\forall m \in Q,k \in P $$
(7)
$$ \sum\limits_{{t \in T_{e} }} {b_{vt} } \le 1,\forall v \in R $$
(8)
$$ \sum\limits_{{z \in Z_{m} }} {d_{mz} } \le 1,\forall m \in Q $$
(9)
$$ \sum\limits_{{q \in Q_{k} }} {h_{kq} } \le 1,\forall k \in P $$
(10)
$$ z_{mn} \ge y_{mk} + y_{nk} - 1,\forall m \in Q,n \in Q\backslash \{ m\} ,k \in P,k \ne m,k \ne n $$
(11)
$$ 2z_{mn} \le y_{mk} + y_{nk} ,\forall m \in Q,n \in Q\backslash \{ m\} ,k \in P,k \ne m $$
(12)
$$ z_{mn} = e_{nm} ,\forall m \in Q,n \in Q\backslash \{ m\} $$
(13)
$$ z_{ik} \ge y_{ik} ,\forall i \in P \cup Q,k \in P\backslash \{ i\} ,s \in S $$
(14)
$$ \sum\limits_{{m \in Q\backslash \{ j\} }} {a_{jm}^{is} } - \sum\limits_{{m \in Q\backslash \{ j\} }} {a_{mj}^{is} } = \sum\limits_{r \in N} {w_{ir}^{s} } \left( {x_{ij} - x_{rj} } \right),\forall i \in N,j \in R,s \in S $$
(15)
$$ \begin{aligned}&\sum\limits_{{n \in P \cup Q\backslash \{ j\} }} {g_{mn}^{is} } + \sum\limits_{j \in R} {a_{mj}^{is} } - \sum\limits_{j \in R} {a_{jm}^{is} } - \sum\limits_{{n \in P \cup Q\backslash \{ j\} }} {g_{nm}^{is} }\\ &\qquad = \sum\limits_{r \in N} {w_{ir}^{s} } \left( {x_{im} - x_{rm} } \right),\forall i \in N,m \in Q,s \in S \end{aligned}$$
(16)
$$ \begin{aligned} & \sum\limits_{{l \in Q\backslash \{ j\} }} {g_{jl}^{is} } + \sum\limits_{{k \in P\backslash \{ j\} }} {f_{jk}^{is} } - \sum\limits_{{k \in P\backslash \{ j\} }} {f_{kj}^{is} } - \sum\limits_{{l \in Q\backslash \{ j\} }} {g_{lj}^{is} } \\ &\qquad= \sum\limits_{r \in N} {w_{ir}^{s} } \left( {x_{ij} - x_{rj} } \right),\forall i \in N,j \in P,s \in S \end{aligned} $$
(17)
$$ \sum\limits_{i \in N} {\sum\limits_{n \in Q} {a_{ne}^{is} } } + \sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } x_{ie} \le \sum\limits_{{t \in T_{e} }} {M_{et} b_{et} } + \eta_{se} ,\forall e \in R,s \in S_{e} $$
(18)
$$ \sum\limits_{i \in N} {\sum\limits_{v \in R} {a_{vn}^{is} } } + \sum\limits_{i \in N} {\sum\limits_{m \in P \cup Q} {g_{mn}^{is} } } + \sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } x_{in} \le \sum\limits_{{z \in Z_{m} }} {M_{nz} d_{nz} } + \eta_{sn} ,\forall n \in Q,s \in S $$
(19)
$$ \sum\limits_{i \in N} {\sum\limits_{m \in Q} {g_{mk}^{is} } } + \sum\limits_{i \in N} {\sum\limits_{l \in P} {f_{lk}^{is} } } + \sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } x_{ik} \le \sum\limits_{{q \in Q_{k} }} {M_{kq} h_{kq} } + \eta_{sk} ,\forall k \in P,s \in S $$
(20)
$$ P_{m}^{\prime } \le \sum\limits_{{z \in Z_{m} }} {\sum\limits_{m \in Q} {d_{mz} } } \le P_{m} $$
(21)
$$ P_{e}^{\prime } \le \sum\limits_{{t \in T_{e} }} {\sum\limits_{e \in R} {b_{et} } } \le P_{e} $$
(22)
$$ P_{k}^{\prime } \le \sum\limits_{{q \in Q_{k} }} {\sum\limits_{k \in P} {h_{kq} } } \le P_{k} $$
(23)
$$ g_{jl}^{is} \le z_{jl}^{{}} \sum\limits_{r \in N} {w_{jr}^{s} } ,\forall i \in N,j \in Q,l \in P \cup Q\backslash \{ j\} ,s \in S $$
(24)
$$ b_{et} ,d_{nz} ,h_{kq} ,x_{ie} ,u_{vm} ,y_{mk} ,z_{mn} \in \{ 0,1\} ,\forall i \in N,e \in H,v \in R,m \in H,k \in P,n \in H\backslash \{ m\} $$
(25)
$$ a_{vm}^{is} ,g_{mn}^{is} ,f_{kl}^{is} ,\eta_{se} ,\eta_{sn} ,\eta_{sk} \ge 0,\forall i \in N,m \in H,k \in P,n \in H\backslash \{ m\} ,l \in P\backslash \{ k\} ,s \in S $$
(26)

The objective function Eq. (1) represents the total cost of the urban‒rural logistics system under all the demand scenarios. The cost includes hub construction costs, transportation costs and penalty costs. The first three terms are the construction costs of village hubs, town hubs and urban hubs, respectively. The fourth term is the transportation costs between the no‒hub nodes and hubs. The fifth term is the transportation costs between the urban hubs. The sixth term is the transportation costs between the town hubs and between the urban hubs and the town hubs. The seventh term is the transportation costs between the town hubs and the village hubs. If the demand of a hub exceeds its capacity, then a penalty is incurred. The final three terms are the penalty costs due to violations of the capacity constraints of urban, town, and village hubs, respectively.

Constraints (2) and (3) indicate that each non‒hub node is assigned to one hub. Constraints (4) and (5) indicate that each village hub is assigned to one town hub. Constraints (6) and (7) indicate that each town hub is assigned to one urban hub. Constraints (8), (9) and (10) ensure that at most one capacity level is attached to a hub. Constraints (11), (12) and (13) indicate that if town hubs i and j are assigned to the same urban hub, then a connection is formed between i and j. Otherwise, i and j are not connected. Constraint (14) represents the connection between i and k if town hub i is assigned to urban hub k. Constraint (15) indicates the flow conservations at village hubs. Constraint (16) indicates the flow conservations at town hubs. Constraint (17) indicates the flow conservations at urban hubs. Constraints (18), (19) and (20) are the capacity constraints of the village hubs, town hubs and urban hubs, respectively. Constraints (21), (22) and (23) are the boundary constraints of total number of selected hubs. Constraint (24) indicates that if there is no connection between two hubs, then the logistics quantity between them is 0. Constraint (25) is a 0‒1 integer constraint. Constraint (26) is nonnegative constraint.

4 Solution algorithm

Benders decomposition algorithm is a typical method for solving the complex mixed integer programming problem with integer variables and coupling constraints (Tang et al., 2013; Taherkhani et al., 2020; Jenabi et al., 2022). As suggested by Contreras et al. (2012), Benders decomposition algorithm has a strong ability to solve large‒scale instances of the hub location problem. In each iteration of Benders decomposition, an integer master problem must be solved and the computational time of this master problem generally increases with the number of added cuts (Rei, et al., 2009). Since Benders cuts can be generated from any master problem solution and not just from an optimal integer solution, Naoum-sawaya and Elhedhli (2013) explored the use of Benders cuts in a branch‒and‒cut framework, called the Branch‒and‒Benders‒Cut (BBC) algorithm. In this paper, we use the BBC algorithm to solve the model proposed in the previous section. A multi-cut strategy is employed to improve the convergence and efficiency of the basic BBC algorithm by considering the property of multiple scenarios of the urban-rural logistics hierarchical hub location problem.

Within the standard branch‒and‒cut framework, the BBC algorithm proceeds by solving the Linear Relaxation of Master Problem (denoted as LRMP) at each node of the branch‒and‒bound tree, and valid Benders cuts are added to the model of LRMP in each iteration. The process iterates until the gap between the lower bound and the upper bound is sufficiently small, thus yielding the global optimal solution. The framework for the algorithm is formulated as follows.

Fixing the values of the binary variables \(h_{kq} ,d_{ez} ,b_{et} ,x_{ie} ,u_{vm} ,y_{mk}\), and \(z_{mn}\) (e.g., denoted as \(\overline{h}_{kq}\), \(\overline{d}_{ez}\), \(\overline{b}_{et}\), \(\overline{x}_{ie}\), \(\overline{u}_{vm}\), \(\overline{y}_{mk}\), \(\overline{z}_{mn}\), each of which is 0 or 1), we can obtain the Benders sub‒problems, represented by SP, below:

$$ \begin{gathered} \min \, \sum\limits_{i \in N} {\sum\limits_{k \in P} {\sum\limits_{{l \in P\backslash \{ k\} }} {\alpha_{1} } } } c_{kl} f_{kl}^{is} + \sum\limits_{i \in N} {\sum\limits_{m \in Q} {\sum\limits_{{k \in P \cup Q\backslash \{ m\} }} {\alpha_{2} } } } c_{mk} g_{mk}^{is} + \sum\limits_{i \in N} {\sum\limits_{v \in R} {\sum\limits_{{m \in Q\backslash \{ v\} }} {\alpha_{3} } } } c_{vm} a_{vm}^{is} \hfill \\ + \omega_{1} \sum\limits_{e \in R} {\eta_{se} } + \omega_{2} \sum\limits_{n \in Q} {\eta_{sn} } + \omega_{3} \sum\limits_{k \in P} {\eta_{sk} } \hfill \\ \end{gathered} $$
(27)
$$ {\text{s.t.}}\quad \sum\limits_{{m \in Q\backslash \{ j\} }} {a_{jm}^{is} } - \sum\limits_{{m \in Q\backslash \{ j\} }} {a_{mj}^{is} } = \sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right),\forall i \in N,j \in R,s \in S $$
(28)
$$ \begin{aligned} &\sum\limits_{{n \in P \cup Q\backslash \{ j\} }} {g_{mn}^{is} } + \sum\limits_{j \in R} {a_{mj}^{is} } - \sum\limits_{j \in R} {a_{jm}^{is} } - \sum\limits_{{n \in P \cup Q\backslash \{ j\} }} {g_{nm}^{is} }\\ &\qquad = \sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{im} - \overline{x}_{rm} } \right),\forall i \in N,m \in Q,s \in S\end{aligned} $$
(29)
$$ \begin{aligned} &\sum\limits_{{l \in Q\backslash \{ j\} }} {g_{jl}^{is} } + \sum\limits_{{k \in P\backslash \{ j\} }} {f_{jk}^{is} } - \sum\limits_{{k \in P\backslash \{ j\} }} {f_{kj}^{is} } - \sum\limits_{{l \in Q\backslash \{ j\} }} {g_{lj}^{is} } \\ &\qquad = \sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right),\forall i \in N,j \in P,s \in S\end{aligned} $$
(30)
$$ \eta_{se} - \sum\limits_{i \in N} {\sum\limits_{n \in Q} {a_{ne}^{is} } } \ge \sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ie} - \sum\limits_{{t \in T_{e} }} {M_{et} \overline{b}_{et} } ,\forall e \in R,e \notin Q,s \in S $$
(31)
$$ \eta_{sn} - \sum\limits_{i \in N} {\sum\limits_{v \in R} {a_{vn}^{is} } } - \sum\limits_{i \in N} {\sum\limits_{m \in P \cup Q} {g_{mn}^{is} } } \ge \sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{in} - \sum\limits_{{z \in Z_{m} }} {M_{en} \overline{d}_{ez} } ,\forall n \in Q,s \in S $$
(32)
$$ \eta_{sk} - \sum\limits_{i \in N} {\sum\limits_{m \in Q} {g_{mk}^{is} } } - \sum\limits_{i \in N} {\sum\limits_{l \in P} {f_{lk}^{is} } } \ge \sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ik} - \sum\limits_{{q \in Q_{k} }} {M_{kq} \overline{h}_{kq} } ,\forall k \in P,s \in S,q \in Q_{k} $$
(33)
$$ g_{jl}^{is} \le \overline{z}_{jl} \sum\limits_{r \in N} {w_{jr}^{s} } ,\forall i \in N,j \in Q,l \in P \cup Q\backslash \{ j\} ,s \in S $$
(34)
$$ a_{vm}^{is} ,g_{mn}^{is} ,f_{kl}^{is} ,\eta_{s} \ge 0,\forall i \in N,m \in H,k \in P,n \in H\backslash \{ m\} ,l \in P\backslash \{ k\} ,s \in S $$
(35)

The dual form of the above SP model, represented by DSP, can be formulated as

$$ \begin{gathered} \max \, \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\varepsilon_{m}^{s} + \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{im} - \overline{x}_{rm} } \right)} \right]\phi_{m}^{s} + \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\gamma_{k}^{s} \hfill \\ + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ie} - \sum\limits_{{t \in T_{e} }} {M_{et} \overline{b}_{et} } } \right)\iota_{e}^{s} + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{in} - \sum\limits_{{z \in Z_{m} }} {M_{en} \overline{d}_{ez} } } \right)\kappa_{n}^{s} \hfill \\ + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ik} - \sum\limits_{{q \in Q_{k} }} {M_{kq} \overline{h}_{kq} } } \right)\lambda_{k}^{s} + \overline{z}_{jl} \sum\limits_{r \in N} {w_{jr}^{s} } \nu_{jl}^{s} \hfill \\ \end{gathered} $$
(36)
$$ {\text{s}}.{\text{t}}.\quad \gamma_{l}^{is} - \gamma_{k}^{is} - 1 \le \alpha_{{1}} c_{kl} ,\forall i \in N,k \in P,l \in P,s \in S $$
(37)
$$ \phi_{k}^{is} - \phi_{m}^{is} + \gamma_{j}^{is} - \gamma_{m}^{is} + \kappa_{m}^{is} + \nu_{mk}^{is} \le \alpha_{2} c_{mk} ,\forall i \in N,m \in Q,k \in P \cup Q,j \in P,s \in S $$
(38)
$$ \varepsilon_{v}^{is} - \varepsilon_{m}^{is} + \phi_{m}^{is} - \phi_{v}^{is} - \iota_{v}^{is} - \kappa_{m}^{is} \le \alpha_{3} c_{vm} ,\forall i \in N,v \in R,m \in Q,s \in S $$
(39)
$$ \iota_{e}^{is} \le \omega_{1} ,\forall i \in N,e \in R,s \in S $$
(40)
$$ \kappa_{n}^{is} \le \omega_{2} ,\forall i \in N,n \in Q,s \in S $$
(41)
$$ \lambda_{k}^{is} \le \omega_{3} ,\forall i \in N,k \in P,s \in S $$
(42)
$$ \iota ,\kappa ,\lambda ,\nu \le 0 $$
(43)

where \(\varepsilon_{m}^{is}\), \(\phi_{m}^{is}\), \(\gamma_{k}^{is}\), \(\iota_{e}^{is}\), \(\kappa_{n}^{is}\), \(\lambda_{k}^{is}\) and \(\nu_{jl}^{is}\) are the dual variables of constraints (27)‒(35), respectively.

With the introduction of the auxiliary variable \(\zeta\), the Benders master problem, represented by MP, can then be formulated as

$$ \begin{aligned}& \min \sum\limits_{{z \in Z_{m} }} {\sum\limits_{m \in Q} {C_{mz} } d_{mz} } + \sum\limits_{{t \in T_{e} }} {\sum\limits_{e \in R} {C_{et} } b_{et} } + \sum\limits_{{q \in Q_{k} }} {\sum\limits_{k \in P} {C_{kq} } h_{kq} }\\ &\qquad+\sum\limits_{s \in S} {p_{s} } \sum\limits_{i \in N} {\sum\limits_{j \in N} {\left( {w_{ij}^{s} + w_{ji}^{s} } \right)} } \sum\limits_{e \in H} {c_{ie} } x_{ie} + \zeta \end{aligned} $$
(44)
$$ {\text{s}}.{\text{t}}.{\text{ Constraints }}\left( {2} \right) - \left( {{14}} \right),{\text{ and }}\left( {{21}} \right) - \left( {{26}} \right) $$
(45)
$$ 0 \ge \sum\limits_{s \in S} {p_{s} } \left\{ \begin{aligned}& \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\varepsilon_{m}^{is} + \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{im} - \overline{x}_{rm} } \right)} \right]\phi_{m}^{is} \\ &\qquad+ \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\gamma_{k}^{is} + \overline{z}_{jl} \sum\limits_{r \in N} {w_{jr}^{s} } \nu_{jl}^{is} \hfill \\ &\qquad+ \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ie} - M_{et} \overline{b}_{et} } \right)\iota_{e}^{is} + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{in} - M_{nz}^{{}} \overline{d}_{ez} } \right)\kappa_{n}^{is}\\ &\qquad + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ik} - M_{kq} \overline{h}_{kq} } \right)\lambda_{k}^{is} \hfill \\ \end{aligned} \right\} $$
(46)
$$ \zeta \ge \sum\limits_{s \in S} {p_{s} } \left\{ \begin{aligned}& \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\varepsilon_{m}^{is} + \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{im} - \overline{x}_{rm} } \right)} \right]\phi_{m}^{is} \\ &\qquad + \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\gamma_{k}^{is} + \overline{z}_{jl} \sum\limits_{r \in N} {w_{jr}^{s} } \nu_{jl}^{is} \hfill \\ &\qquad + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ie} - M_{et} \overline{b}_{et} } \right)\iota_{e}^{is} + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{in} - M_{nz} \overline{d}_{ez} } \right)\kappa_{n}^{is}\\ &\qquad + \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ik} - M_{kq} \overline{h}_{kq} } \right)\lambda_{k}^{is} \hfill \\ \end{aligned} \right\} $$
(47)

Once the first‒stage variables are fixed, the sub‒problems can be solved independently (Birge & Louveaux, 2011). To obtain more information from each sub‒problem and improve the lower bound of MP more effectively, the multi‒cut strategy is applied to accelerate Benders decomposition (Rahmaniani et al., 2018). The MP can be stated as

$$ \begin{aligned}&\min \sum\limits_{{z \in Z_{m} }} {\sum\limits_{m \in Q} {C_{mz} } d_{mz} } + \sum\limits_{{t \in T_{e} }} {\sum\limits_{e \in R} {C_{et} } b_{et} } + \sum\limits_{{q \in Q_{k} }} {\sum\limits_{k \in P} {C_{kq} } h_{kq}}\\ &\qquad +\sum\limits_{s \in S} {p_{s} } \sum\limits_{i \in N} {\sum\limits_{j \in N} {\left( {w_{ij}^{s} + w_{ji}^{s} } \right)} } \sum\limits_{e \in H} {c_{ie} } x_{ie} + \sum\limits_{s \in S} {p_{s} } \zeta_{s}\end{aligned} $$
(48)
$$ {\text{s.t.}}\;{\text{Constraints}}\;\left( {45} \right) - \left( {46} \right) $$
(49)
$$\begin{aligned}&\zeta_{s} \ge \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\varepsilon_{m}^{is} + \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{im} - \overline{x}_{rm} } \right)} \right]\phi_{m}^{is}\\ &\qquad + \left[ {\sum\limits_{r \in N} {w_{ir}^{s} } \left( {\overline{x}_{ij} - \overline{x}_{rj} } \right)} \right]\gamma_{k}^{is} + \overline{z}_{jl} \sum\limits_{r \in N} {w_{jr}^{s} } \nu_{jl}^{is}+ \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ie} - M_{et} \overline{b}_{et} } \right)\iota_{e}^{is} \hfill \\ & \qquad+ \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{in} - M_{nz} \overline{d}_{ez} } \right)\kappa_{n}^{is}+ \left( {\sum\limits_{i \in N} {\sum\limits_{j \in N} {w_{ij}^{s} } } \overline{x}_{ik} - M_{kq} \overline{h}_{kq} } \right)\lambda_{k}^{is} \hfill \\ \end{aligned} $$
(50)

The step‒by‒step procedure of the BBC algorithm is as follows.

figure a

5 Case study

5.1 Data input

In this section, we apply the proposed model and solution algorithm to the urban‒rural express logistics system of the Jiangling county in Hubei province of Central China. The reasons for choosing such a county for the case study are as follows. First, the Jiangling county has been selected as a delivery logistics pilot county of Hubei Province, and the Jiangling government has recently launched the construction of a three‒tiered rural logistics network of county, town and village levels. Second, we have detailed data of this county, such as socio‒economic, geographical, demographic and logistics data, thus that our case study can be constructed for realistic scenarios.

The total revenue of the courier industry of the Jiangling region in 2021 hit 29.064 million RMB, with a total of 2.5956 million parcels delivered. This region has an area of 1048.74 square kilometers and a population size of 278,200 as of 2021. It consists of 8 towns (i.e., Baimasi town, Haoxue town, Zishi town, Xionghe town, Shagang town, Puji town, Qishi town and Majiazhai town) and 107 villages, as shown in Fig. 3. According to the data provided by the Jiangling county government, there are currently two urban hubs (No. 1 and 2), 4 town hubs (No. 8, 9, 10, 12), and 65 village hubs (No. 15, 18, 19, 21, 23, 24, 25, 26, 27, 28, 30, 31, 33, 34, 35, 38, 39, 41, 42, 45, 46, 48, 50, 52, 56, 57, 60, 62, 64, 65, 66, 67, 68, 69, 70, 71, 73, 74, 75, 76, 77, 78, 79, 82, 85, 86, 87, 88, 90, 91, 92, 93, 94, 96, 97, 98, 99, 103, 106, 108, 110, 111, 112, 114, 120). The aim of the numerical example is to illustrate the optimal solutions of the locations, number, and capacities of the county, town and village hubs for this region, and to judge the gap between the present hub scheme and the optimal scheme. To do so, we consider all of the 107 villages as the candidate village hubs, 10 nodes (No. 5, 6, 7, 8, 9, 10, 11, 12) as the candidate town hubs and 4 nodes (No. 1, 2, 3, 4) as the candidate urban hubs. The stochastic logistics demand \(q_{ij}\) between any O‒D pair ij is assumed to follow a truncated normal distribution with \(TN\left( {\overline{q}_{ij} ,\sigma_{ij} } \right)\), where \(\overline{q}_{ij}\) and \(\sigma_{ij}\) are the mean and standard variance of the stochastic demand \(q_{ij}\). The data for the mean \(\overline{q}_{ij}\) and variance \(\sigma_{ij}\) of the O‒D demand are not shown here for saving space, together with the geographical locations of towns and villages. However, these data are available from the authors on request. In this example, the average transportation cost, \(\Lambda\), per express package per km is RMB0.2/km (Lian, 2019). The discount factors \(\alpha_{1}\), \(\alpha_{2}\) and \(\alpha_{3}\) for the average transportation cost per package of inter‒hub connections are 0.7, 0.8 and 0.9, respectively (Shang et al., 2021; Zhong et al., 2018). The values of the penalty coefficients \(\omega_{{1}}\), \(\omega_{{2}}\), and \(\omega_{{3}}\) are large enough (e.g., 1000) such that the capacity constraints can be satisfied when the algorithm iterations terminate.

Fig. 3
figure 3

Geographical distributions of urban, towns and villages in Jiangling region, China

According to the suggestions of Jiangling county government, only three possible capacity levels are considered for the urban, town and village hubs, as shown in Table 2. The construction cost for each hub capacity level is also shown in Table 2. In order to consider the effects of stochastic demand, Monte Carlo simulations are implemented to generate a set of demand scenarios, leading to a scenario probability distribution \(\left\{ {p_{s} } \right\}\) with \(\sum\nolimits_{s \in S} {p_{s} } = 1\). The proposed solution algorithm is carried out on a personal computer (Dell, AMD Ryzen 7 5800H 3.20 GHz CPU, RAM 16.0 GB). The solutions obtained are compared with those by the software of GUROBI solver.

Table 2 Input data of capacities and construction costs for candidate hubs

5.2 Discussion of results

5.2.1 Comparison of solution performances of different solvers

We first look at the performance of the solution of the BBC algorithm proposed in this paper and the GUROBI solver under different demand scenario sizes. For all demand scenario sizes, the algorithmic iterations terminate when the relative gap of the upper and lower bounds of the objective function values reaches 0.1%. In order to control the computational time, we set the upper bound of the computational time as 36,000 s. Once this upper bound is reached, the iterations terminate automatically. Table 3 shows the solution performances of the BBC algorithm and the GUROBI solver when the demand scenario size changes from 5 to 80. It can be seen that the GUROBI solver can find the global optimal solution for small‒sized scenarios only (i.e., 5, 10, 20). As the scenario number increases to 30, the computational time exceeds 36,000 s, and the iterations stop. As a result, the gap of the upper and lower bounds of the iterative solution is 12.51%. It can also be seen that the BBC algorithm proposed in this paper can obtain the global optimal solution under all the scenarios, and the solution speed outperforms the GUROBI solver, in terms of the computational time. In addition, as the scenario exceeds 70, the expected system cost stabilizes at a level of about RMB1.4259 × 107. In the following analysis, a demand scenario size of 70 is adopted.

Table 3 Solution performance of BBC algorithm and GUROBI solver

Figure 4 shows the change of the relative gap of the upper and lower bounds of the objective function values during the course of iterations with CPU time for the BBC algorithm and GUROBI solver when the demand scenario size is 70. It can be observed that the GUROBI solver cannot converge within a specified time threshold of 36,000 s, whereas the BBC algorithm can rapidly converge within about 12,000 s. Therefore, the BBC algorithm proposed in this paper is promising in solving the design models of large‒scale urban‒rural logistics networks.

Fig. 4
figure 4

The convergences of the proposed BBC algorithm and the GUROBI solver

5.2.2 Comparison of optimal solution and existing scheme

As previously stated, the Jiangling county government is currently adopting 2 nodes (No. 1, 2) as the urban hubs, 4 nodes (No. 8, 9, 10, 12) as the town hubs and 65 villages as the village hubs. We now identify the difference of the present scheme and the optimal solution generated by the proposed BBC algorithm. Table 4 shows the locations and number of the urban, town and village hubs with the optimal solution and the present scheme. It can be seen that the BBC algorithm leads to the optimal solution of 91 hubs, with an expected total cost of RMB1.4259 × 107. The 91 hubs include 3 urban hubs (No. 1, 2, 4), 6 town hubs (No. 5, 8, 9, 10, 12, 14) and 82 village hubs (No. 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 33, 34, 35, 38, 39, 41, 40, 42, 43, 45, 46, 49, 50, 52, 53, 56, 57, 60, 61, 62, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 82, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 100, 102, 103, 105, 108, 109, 110, 111, 112, 114, 115, 117, 118, 119, 120). However, the current implemented scheme generates an expected total cost of RMB1.6471 × 107, which is 15.69% higher than that of the optimal scheme.

Table 4 Locations and the number of urban, town and village hubs under different schemes

5.2.3 Comparison of solutions of stochastic model and deterministic model

Finally, we look at the difference in the solutions of stochastic model and deterministic model. The deterministic model refers to the situation of \(\sigma_{ij} = 0\) in the truncated normal distribution \(TN\left( {\overline{q}_{ij} ,\sigma_{ij} } \right)\). Table 5 shows the solutions of the stochastic model and the deterministic model. It can be seen that the optimal number and locations of the hubs significantly change across the two models. Specifically, compared with the stochastic model, the deterministic model adds nodes 95 and 101 as the village hubs, but removes the village hubs of No. 5, 30, 39, 40, 72, 85, 89 and 90. As a result, the expected total cost of the deterministic model decreases by 5.28%. This means that the deterministic model will cause an underestimation of the total system cost, compared with the stochastic model.

Table 5 Comparison of the results of stochastic model and deterministic model

6 Conclusion and further studies

In this paper, a hierarchical hub location model is proposed to minimize the expected total system cost for the integrated design of urban and rural logistics networks under demand uncertainty. The locations, number, and capacities of urban‒town‒village hubs are simultaneously optimized and the effects of logistics demand uncertainty are incorporated. A demand scenario‒based branch‒and‒Benders‒cut algorithm is presented to solve the proposed model. A case study of Jiangling urban‒rural region in Hubei province of China is implemented, together with a comparison of the results generated by the proposed algorithm and those obtained with GUROBI solver and the present scheme in that region.

The results showed that ignoring the effects of demand uncertainty could lead to a significant decision bias. Therefore, there is indeed a need for the planner to take into account the effects of future demand fluctuations in the urban‒rural logistics network design. The proposed methodology in this paper outperforms the GUROBI solver in terms of the size of problem solved and the computational time. It can also significantly improve the efficiency of the urban‒rural logistics system in terms of the expected total system cost. The model and solution algorithm proposed in this paper can thus serve as a useful tool for future design of urban‒rural logistics networks and for the evaluation of urban‒rural transportation and logistics policies.

Although the numerical results presented in this paper are consistent with reality, some real‒world features are not fully captured, which may be explicitly considered in further studies. First, only logistics service is considered, and the passenger transportation service has been ignored. In reality, both human and goods movements are served, sometimes simultaneously through the urban‒rural transportation network. It is thus necessary to design an integrated urban‒rural transportation network for promoting the human and goods mobility. Second, only demand uncertainty is considered. In reality, uncertainty may also come from the supply side. There is thus a need to jointly consider the uncertainty effects from the demand and supply sides. Third, only travel distance is considered as an indicator of transport cost. It will be more reasonable to consider both travel time and travel distance in the transport cost computation, which is left for a future study.