1 Introduction

Inventory routing problem (IRP) is a classic combinatorial optimization problem that integrates critical supply chain activities, through intertemporal coordination of tactical and operational decisions (Manousakis et al. 2021). Several research contributions have addressed operational research approaches for the efficient solution of the IRP and its variants (Hu et al. 2018). Most of these scientific works have focused exclusively on the optimization of economic criteria.

The increased environmental concerns as well as the fact that supply chain activities emit pollutants lead to simultaneous consideration of economic and environmental decisions (Tirkolaee et al. 2023). To this end, several research contributions have proposed green IRP variants (Cheng et al. 2016, 2017). Typically, environmental decisions are related to fuel or energy consumption, emissions taxation policies, adoption of alternative fuels, and environmentally friendly vehicles (Cheng et al. 2017).

However, modern supply chain practices focus on the adoption of holistic sustainable approaches (Weber et al. 2023). A sustainable supply chain (SSC) relies on the integrated management of economic, environmental, and societal criteria (Martins and Pato 2019; Karakostas et al. 2020; Lotfi et al. 2023; Hashim et al. 2023; Shahsavani and Goli 2023). Working hours constitute a critical criterion for social sustainability (Resat and Unsal 2019). In the case of truck drivers, the European Union (EU) has published specific regulations that provide rules on driving and working times, breaks, and rest periods (Sartori et al. 2022). Respecting these rules can ensure the safety of drivers and their overall satisfaction with their job (Guo et al. 2022).

More specifically, the EU Directive 2002/15/EC (European Commission 2002) and the EU Regulation no. 561/2006 (European Commission 2006) stipulates that:

  • The accumulated driving time between two break periods must not exceed the limit of four and a half hours.

  • The duration of a route must not exceed the limit of nine hours.

  • The accumulated working time of the truck drivers must not exceed the limit of six hours.

  • Breaks should last 45 min.

This work introduces a novel sustainable supply chain network optimization problem that integrates strategic, tactical, and operational decisions. More specifically, a fleet size and mix pollution multi-period, multi-commodity IRP is proposed with further consideration of EU drivers’ working hours regulations as well as speed limits. Strategic decisions are related to the composition of a heterogeneous fleet of vehicles. Tactical decisions are related to inventory control and replenishment rates, and operational decisions consider the delivery schedules in each time period. According to sustainability criteria, the proposed optimization problem considers economic criteria, such as the costs of the supply chain system, environmental criteria, such as fuel consumption and CO\(_2\) emissions, and social criteria, such as work hours and breaks for drivers.

The key novelty of this work is the introduction of a novel integrated mixed integer linear programming (MILP) model that can assist logistic managers in small and medium enterprises in their effort to make optimized planning decisions (Caceres-Cruz et al. 2014). Such enterprises typically have to serve a limited number of customers regularly over a period of time. Therefore, an integrated optimization tool is required to address classic hard logistics optimization decisions under the effect of strict EU regulations. Computational analyses were performed to justify the significant impact of such regulations in the configuration of an optimal distribution system.

2 Literature review

This section is divided into three parts. The first provides a brief overview of research contributions on IRPs, while the second focuses on research contributions on the truck driver scheduling problem, and the third one focuses on the exploration of recent research works conducted on the optimization of sustainable supply chain networks.

2.1 Recent contributions on IRP

Cheng et al. (2017) studied a green IRP with multiple time periods, a single type of product, and fleet scheduling decisions. The authors developed a MILP model to formulate the problem under consideration. To address environmental-oriented decisions, they adopted a comprehensive fuel consumption model that was integrated into the MILP formulation developed. According to their computational analysis, the benefits of adopting a comprehensive objective function, as well as a mixed fleet of vehicles, were illustrated in terms of the total cost of the system and the environmental impacts. Soysal et al. (2018) delved into the potential benefits of employing a horizontal collaboration approach within the context of a green IRP featuring multiple periods and various types of food products subject to demand uncertainty. The authors constructed a chance-constrained programming model to mathematically formulate the considered problem. From an environmental standpoint, they took into account factors such as fuel consumption, pollutant emissions, and food waste. Their numerical analyses revealed that horizontal collaboration could result in significant savings of 17% in costs and a substantial reduction of 29% in emissions.

Micheli and Mantella (2018) developed a base case model to study economic and environmental criteria through an IRP with multiple time periods, multiple products, and a heterogeneous fleet of vehicles under uncertainty of demand. The authors investigated the impact of different carbon emissions control policies on several key performance indicators. Their numerical analyses showed that the use of heterogeneous vehicles can also lead to less driving time than adopting a homogeneous fleet. Alinaghian et al. (2021) investigated a green IRP with hard time windows. The authors examined a multi-period single-commodity supply chain system featuring a heterogeneous fleet of vehicles under the maximum stock inventory policy. They formulated the problem as a MILP model and introduced an augmented tabu search and differential evolution heuristic algorithm for solving multiple instances of the problem. Additionally, to address routing decisions effectively, they implemented supplementary heuristic approaches, including an improved Clarke–Wright algorithm, an enhanced push-forward insertion heuristic method, and a speed optimization heuristic algorithm.

A multi-period IRP system within the cold supply chain for food distribution was investigated by Wei et al. (2019). The authors categorized customers into two groups based on their proximity to the depot. Customers close to the depot were served by self-owned vehicles, while the other group was served by outsourced vehicles. Additionally, the authors assumed shorter route time limits for self-owned vehicles compared to outsourced vehicles. The problem was formulated as an MILP model. Small-sized problem instances were addressed using the CPLEX commercial solver, and larger cases were tackled using a genetic algorithm (GA) developed by the authors. Schenekemberg et al. (2020) examined a multi-period IRP involving homogeneous fleet management decisions, such as fleet rentals and vehicle cleaning. A branch-and-cut algorithm was employed for small-sized problems, while larger instances were addressed by a matheuristic based on adaptive large neighborhood search. Coelho et al. (2020) explored an extension of the multi-depot IRP, considering a multi-commodity supply chain system with heterogeneous vehicles and a limit on route duration.

Manousakis et al. (2021) proposed an improved branch and cut algorithm for the solution of a multi-period two-commodity flow IRP, under the consideration of a limited homogeneous fleet of vehicles. Mahjoob et al. (2021) developed a modified adaptive GA for the solution of a multi-period, multi-product IRP, with a heterogeneous fleet of vehicles. Soysal et al. (2021) studied a closed-loop, multi-period IRP, under the consideration of a mixed fleet of electric and conventional vehicles. The authors developed an MILP and solved small-sized problem instances using a commercial solver, while a fix and optimize algorithm was developed for the solution of larger problem cases. The authors conducted several numerical analyses to extract useful managerial insights. Neves-Moreira et al. (2022) studied a multi-commodity IRP with pickups and deliveries under the effect of both deterministic and uncertain demand.

2.2 Contributions on the truck drivers scheduling problem

Kok et al. (2011) studied the vehicle departure time optimization problem (VDO), as a post-processing approach to the well-known vehicle routing problem with time windows, considering real-life conditions, such as driving hours regulations. The authors formulated the problem as an MILP model and developed an insertion construction heuristic for its efficient solution. Goel (2012) developed an MILP for a variant of the TDSP, considering rest periods at customer locations or at suitable rest areas. The objective of the problem was the minimization of the duration of the route with respect to the relative regulations. The solution of the problem was carried out using a dynamic programming approach.

Goel and Vidal (2014) studied a problem with scheduling for truck drivers with time windows taking into account the hours of service regulations of drivers. More specifically, the authors investigated the impact of such regulations from the United States, EU, Canada, and Australia on transportation costs. They developed a hybrid metaheuristic optimization method to solve the problem under consideration. According to their findings, EU regulations were found to be associated with the highest safety rates, while Canadian truck driver rules led to more economically efficient solutions. Rincon-Garcia et al. (2020) developed a metaheuristic algorithm based on a large neighborhood search for the solution of time-dependent VRP with time windows taking into account both the regulations of EU truck drivers’ working hours and the regulations of the UK road transport working time. The proposed solution method produced solutions with \(19\%\) fewer vehicles, \(17.7\%\) less traveled distance, and \(4.4\%\) less duration of the route. Sartori et al. (2022) introduced the truck driver scheduling problem with interdependent routes, considering EU regulations. The authors introduced a set of constraints into an MILP to produce feasible schedules, as well as a label propagation algorithm. According to their findings, the proposed solution method can produce valid schedules for problem cases with several drivers, and a large number of interdependent routes.

2.3 Recent contributions on the optimization of sustainable supply chains

A multi-objective optimization problem for designing a sustainable closed-loop supply chain network in the aluminum industry was investigated by Pahlevan et al. (2021). The authors took into account the life cycle of products to calculate the environmental impacts of the proposed network. The problem was formulated mathematically as a multi-objective MILP, and three novel multi-objective metaheuristic methods were developed for its efficient solution. Pervin et al. (2023) devised a sustainable inventory model considering the composite demand of products with a fixed lifespan, time-dependent holding costs, and warehouse carbon emissions. Meanwhile, Ghosh et al. (2023) investigated a multi-objective sustainable waste management problem framed as a solid transportation problem with three objectives. The first objective pertains to the total system profit, representing the economic aspect of sustainability. The second objective, total elapsed time, encompasses transportation, loading, and unloading times, reflecting the social aspect of sustainability. The third objective addresses the environmental aspect by focusing on total carbon emissions. To address the presented problem, the authors proposed two solution methods-one relying on neutrosophic linear programming and the other employing the \(\epsilon\)-constraint method.

In a recent study, Tirkolaee et al. (2023) delved into the domain of municipal solid waste management by investigating a sustainable periodic capacitated arc routing problem. This problem encompasses the simultaneous optimization of multiple conflicting objectives, namely total cost minimization, total pollution reduction, and maximum job opportunity utilization. To address this complex optimization challenge, the authors devised a novel multi-objective MILP model that effectively captures the intricate interrelationships between these objectives. Additionally, they proposed two robust multi-objective Pareto-based metaheuristic algorithms tailored to efficiently navigate the intricate solution space and generate a diverse range of non-dominated solutions, catering to various stakeholder preferences and decision-making scenarios. A recent study proposed a comprehensive framework for designing a sustainable supply chain that incorporates the Internet of Things (IoT) to achieve cost reduction, carbon emission minimization, and social impact maximization through job creation (Goli et al. 2023). The proposed supply chain model encompasses both strategic and operational aspects, including supplier selection, facility location, transportation mode optimization, and product distribution level determination. To attain these objectives, the authors developed a multi-objective MILP model, which was subsequently transformed into a single-objective model using goal programming (GP). In their study, Goli et al. (2023) delved into the design of a sustainable five echelons canola oil-based biodiesel supply chain network, while considering the inherent uncertainties in supply and demand. They employed a mixed integer non-linear programming (MINLP) model to address the strategic and tactical-level decisions, aiming to minimize total system costs and carbon emissions while simultaneously maximizing the social impact through job creation. Barman et al. (2023) explored a supply chain inventory management problem that encompasses both synchronous and asynchronous rework of defective products, alongside the consideration of three established carbon emission policies.

Giri et al. (2023) examined the optimization of an electric sustainable supply chain network, incorporating economic, environmental, and social considerations. They devised a multi-objective MILP model to address these multifaceted objectives. Additionally, they introduced a multi-choice conic goal programming solution method integrated with a utility function to effectively tackle the complex problem at hand. Ala et al. (2024) devised a novel fuzzy multi-objective optimization model to investigate a sustainable healthcare supply chain network problem, encompassing multiple echelons, multiple products, and a fuzzy approach to address the uncertainty associated with a subset of critical model parameters. They introduced three multi-objective metaheuristic algorithms, along with the well-established \(\epsilon\)-Constraint exact method, to effectively tackle the complex problem at hand. Das et al. (2024) meticulously delved into the optimization of a sustainable two-stage solid logistics network by formulating a multi-objective multi-facility location-allocation problem. The authors effectively addressed uncertainties by employing triangular type-2 neutrosophic numbers and employing a novel ranking approach. To optimize three conflicting objectives, they employed the \(\epsilon\)-Constraint approach and demonstrated its effectiveness through numerical examples. They also conducted a comparative study against other Pareto-based multi-objective approaches.

3 Problem statement and mathematical formulation

This work introduces the fleet size and mix pollution inventory routing problem (FSMP-IRP), a multi-period, multi-commodity optimization challenge that incorporates EU truck drivers’ working hours regulations and speed limits. The problem is formulated on a complete graph \(G = \{V, E\}\), where \(V\) denotes the set of nodes comprising geographically dispersed customers \(I = \{1, \ldots , |I |\}\), and an additional node \(|I |+ 1\) represents the depot. The set of edges is defined as \(E = \{(i, j): i, j \in V, i \ne j\}\). The supply chain system encompasses the distribution of multiple product types \(P = \{1, \ldots , |P |\}\) across a finite time horizon \(T = \{1, \ldots , |T |\}\). The distribution task involves a fleet of heterogeneous vehicles \(K = \{1, \ldots , |K |\}\), which includes light-duty trucks \(K_{L} = \{1, \ldots , |K_{L} |\}\), medium-duty trucks \(K_{M} = \{|K_{L} |+ 1, \ldots , |K_{M} |\}\), and heavy-duty trucks \(K_{H} = \{|K_{M} |+ 1, \ldots , |K |\}\). Each selected vehicle can traverse between two nodes at a specified speed level chosen from a set of available speed levels \(L = \{1, \ldots , |L |\}\).

Each customer exhibits a period-dependent demand for each type of product in every time period, denoted as \(d_{ipt}\). Additionally, it is assumed that distinct product types share identical storage conditions. Consequently, a generalized holding cost, \(h_{i}\), is assigned to each customer. Every vehicle is characterized by a space capacity, \(SC_{k}\), and a payload capacity, \(PC_{k}\). Specific units of space within each type of vehicle are required for each unit of product type, indicated by \(sr_{pk}\). Moreover, each unit of product type p is associated with a specific weight, \(PW_{p}\). Similarly, \(ST_{pk}\) represents the time necessary to unload each unit of product type p from vehicle k.

Tables 1 and 2 provide the common and specific parameters of the vehicle types and their corresponding values, respectively.

Table 1 Common vehicle parameters
Table 2 Vehicle-specific parameters (Cheng et al. 2017)

The parameters, which represent the working and driving hours limits of truck drivers, according to EU regulations, are summarized in Table 3.

Table 3 Limits of working and driving hours

The decision variables required to mathematically formulate the problem under investigation are provided in Table 4.

Table 4 Decision variables of the model

The objective of the problem under consideration is to minimize the total cost of the supply chain system, consisting of general routing costs, inventory costs, fuel consumption costs, CO\(_2\) emissions taxation costs, vehicles’ usage costs, and drivers’ wages costs. Based on fuel consumption, the Comprehensive Modal Emission Model (CMEM) of Barth et al. (2005) and Barth and Boriboonsomsin (2009) has been adopted. CMEM has been successfully applied in several supply chain network optimization studies (Koç et al. 2016; Dukkanci et al. 2019; Karakostas et al. 2020). The following formulas are provided to simplify the mathematical expression of fuel consumption:

  • \(\lambda = \frac{\epsilon }{HVDF \cdot \psi }\),

  • \(\gamma _{k} = \frac{1}{1000 \cdot VDTE \cdot \eta }\),

  • \(\alpha = \tau + g \cdot CR \cdot \sin {\theta } + g \cdot CR \cdot \cos {\theta }\),

  • \(\beta _{k} = 0.5 \cdot CAD \cdot \rho \cdot FSA_{k}\).

To this end, a mixed integer linear programming model is proposed as follows:

$$\begin{aligned} \begin{array}{l} \min \sum \limits _{i \in I} {h_i} \cdot \sum \limits _{p \in P} \sum \limits _{t \in H} {\left( {\frac{1}{2} \cdot {d_{ipt}} + \sum \limits _{h \in T,h < t} {{w_{ipth}} \cdot \left( {t - h} \right) + \sum \limits _{h \in T,h > t} {{w_{ipth}} \cdot \left( {t - h + |T|} \right) } } } \right) } \\ ~~ + \sum \limits _{i \in V} {\sum \limits _{j \in V} {\sum \limits _{k \in K} {\sum \limits _{t \in T} {{c_{ij} \cdot x_{ijkt}}} } } }\\ ~~ + \sum \limits _{i \in V} {\sum \limits _{j \in V} {\sum \limits _{k \in K} {\sum \limits _{l \in L} {\sum \limits _{t \in T} {fd} \cdot \frac{\left( z_{ijklt} \cdot ~c_{ij} \right) }{{s_{l}} } } } } } \\ ~~ + \sum \limits _{k \in K} { \sum \limits _{t \in T} { vs_{kt} \cdot VUC_{k} } } \\ ~~ + \sum \limits _{i \in V} \sum \limits _{j \in V} \sum \limits _{k \in K} \sum \limits _{t \in T} { \bigg \{ \lambda \left( f_{c} + \left( f_{e}\sigma \right) \right) \Bigg ( \sum \limits _{l \in L} { \frac{\left( z_{ijktl} \cdot ~EFF_{k} \cdot ~ES_{k} \cdot ~ED_{k} \cdot ~c_{ij} \right) }{s_{l}} } ~~ + \bigg ( \alpha \cdot \gamma _{k} } \cdot \left( CW_{k} \cdot ~x_{ijkt} + a_{ijkt} \right) \cdot ~c_{ij} \bigg ) + \left( \beta _{k} \cdot ~\gamma _{k} \cdot \sum \limits _{l \in L} \left( s_{l}^2 \cdot ~ z_{ijktl} \right) \right) \Bigg ) \bigg \} \end{array} \end{aligned}$$
(1)

subject to

$$\begin{aligned}{} & {} \sum \limits _{l \in L} {z_{ijktl}} = x_{ijkt}, \quad \forall i,j \in V, ~\forall k \in K, \quad ~\forall t \in T \end{aligned}$$
(2)
$$\begin{aligned}{} & {} \sum \limits _{l \in L} {x_{iikt}} = 0, \quad \forall i \in V, ~\forall k \in K, \quad ~\forall t \in T \end{aligned}$$
(3)
$$\begin{aligned}{} & {} \sum \limits _{j \in V} {\sum \limits _{k \in K} {{x_{ijkt}}} } \le 1, \quad \forall t \in T,\ \quad \forall i \in I \end{aligned}$$
(4)
$$\begin{aligned}{} & {} \sum \limits _{j \in V} {\sum \limits _{k \in K} {{x_{jikt}}} } \le 1, \quad \forall t \in T,\ \,\,\forall i \in I \end{aligned}$$
(5)
$$\begin{aligned}{} & {} \sum \limits _{j \in V} {{x_{ijkt}}} - \sum \limits _{j \in V} {{x_{jikt}}} = 0, \quad \forall i \in V, ~\forall k \in K, \quad ~\forall t \in T \end{aligned}$$
(6)
$$\begin{aligned}{} & {} \sum \limits _{i \in I} { { x_{i(n+1)kt} } } \le 1, \quad \forall ~ k \in K, \quad \forall ~ t \in T \end{aligned}$$
(7)
$$\begin{aligned}{} & {} U_{ikt} - U_{jkt} + \left( |I| \cdot \sum \limits _{l \in L} {x_{ijkt}} \right) \le |I| - 1, \quad \quad \forall ~ i,j \in I, \forall ~ k \in K, \forall ~ t \in T \end{aligned}$$
(8)
$$\begin{aligned}{} & {} \sum \limits _{i \in V} {a_{ijkt}} - \sum \limits _{i \in V} {a_{jikt}} = \sum \limits _{p \in P} q_{jpkt} \cdot PW_{p}, \quad \forall j \in I, ~\forall k \in K, \quad ~\forall t \in T \end{aligned}$$
(9)
$$\begin{aligned}{} & {} a_{ijkt} \le x_{ijkt} \cdot PC_{k}, \quad \forall i,j \in V, ~ \forall k \in K, ~ \forall t \in T \end{aligned}$$
(10)
$$\begin{aligned}{} & {} \sum \limits _{i \in I} \sum \limits _{p \in P} { q_{ipkt} \cdot sr_{pk} \le SC_{k}, \quad \forall ~k \in K, \quad \forall ~ p \in P } \end{aligned}$$
(11)
$$\begin{aligned}{} & {} \sum \limits _{i \in I} \sum \limits _{p \in P} { q_{ipkt} \cdot PW_{p} \le PC_{k}, \quad \forall ~k \in K, \quad \forall ~ p \in P } \end{aligned}$$
(12)
$$\begin{aligned}{} & {} \sum \limits _{h \in T} { w_{ipth} = d_{ipt}, \quad \forall i \in I, ~ \forall p \in P, ~ \forall t \in T } \end{aligned}$$
(13)
$$\begin{aligned}{} & {} \sum \limits _{t \in T} { w_{ipth} } = \sum \limits _{k \in K} { q_{ipkh} }, \quad \forall i \in I, ~ \forall p \in P, ~ \forall h \in T \end{aligned}$$
(14)
$$\begin{aligned}{} & {} w_{ipth} \le d_{ipt}, \quad \forall i \in I, ~ \forall p \in P, ~ \forall t,h \in T \end{aligned}$$
(15)
$$\begin{aligned}{} & {} q_{ipkt} \le M_{1} \cdot \sum \limits _{j \in V} { x_{ijkt} }, \quad \forall i \in I, ~ \forall p \in P, ~ \forall k \in K, ~ \forall t \in T \end{aligned}$$
(16)
$$\begin{aligned}{} & {} \sum \limits _{j \in V} {x_{ijkt} } \le M_{1} \cdot q_{ipkt}, \quad \forall i \in I, ~ \forall p \in P, ~ \forall k \in K, ~ \forall t \in T \end{aligned}$$
(17)
$$\begin{aligned}{} & {} vs_{kt} \le \sum \limits _{i \in V} \sum \limits _{j \in V} { x_{ijkt} }, \quad \forall k \in K, ~ \forall t \in T \end{aligned}$$
(18)
$$\begin{aligned}{} & {} x_{ijkt} \le vs_{kt}, \quad \forall i,j \in V, i \ne j, ~ \forall k \in K, ~ \forall t \in T \end{aligned}$$
(19)
$$\begin{aligned}{} & {} \sum \limits _{l \in L, l > 3} {z_{ijklt}} = 0, \quad \forall ~ i,j \in I, \forall ~ k \in K, \forall ~ t \in T, ~ c_{ij} \in \left[ 0, \frac{{\overline{c}}}{3}\right) \end{aligned}$$
(20)
$$\begin{aligned}{} & {} \sum \limits _{l \in L, l > 8} {z_{ijklt}} = 0, \quad \forall ~ i,j \in I, \forall ~ k \in K, \forall ~ t \in T, ~ c_{ij} \in \left[ \frac{{\overline{c}}}{3}, \frac{{\overline{c}}}{2}\right] \end{aligned}$$
(21)
$$\begin{aligned} \small{} & {} \begin{aligned} ADT_{ktj} \ge&\,~ADT_{kti} + \left( \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} } \right) - ( BW_{kti} \cdot DMD ) -\left[ \left( 1 - \sum \limits _{l \in L} z_{ijklt} \right) \cdot MRD \right] , \\&\forall i,j \in V, ~i \ne j, ~i \ne |I |+ 1, ~ \forall k \in K, ~ \forall t \in T \end{aligned} \end{aligned}$$
(22)
$$\begin{aligned} \small{} & {} \begin{aligned} ADT_{ktj} \ge&\,\left( \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} }\right) - ( BW_{kti} \cdot DMD ) - \left[ \left( 1 - \sum \limits _{l \in L} z_{ijklt}\right) \cdot MRD \right] , \\&\forall i = |I |+ 1, ~ j \in I, ~ \forall k \in K, ~ \forall t \in T \end{aligned} \end{aligned}$$
(23)
$$\begin{aligned}{} & {} ADT_{ktj} \ge \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} }, \quad \forall i,j \in V, i \ne j, ~ \forall k \in K, ~ \forall t \in T \end{aligned}$$
(24)
$$\begin{aligned} \small{} & {} \begin{aligned} AWT_{ktj} \ge&\,~AWT_{kti} + \left( \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} } \right) + \left( \sum \limits _{p \in P} { q_{jpkt} \cdot ST_{pk} }\right) - ( BW_{kti} \cdot WMD ) \\&\quad -\left[ \left( 1 - \sum \limits _{l \in L} z_{ijklt}\right) \cdot MRD\right] , \\ {}&\quad \forall i,j \in V, i \ne j, ~ i \ne |I |+ 1, ~ \forall k \in K, ~ \forall t \in T \end{aligned} \end{aligned}$$
(25)
$$\begin{aligned} \small{} & {} \begin{aligned} AWT_{ktj} \ge&\,~\left( \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} }\right) + \left( \sum \limits _{p \in P} { q_{jpkt} \cdot ST_{pk} } \right) - ( BW_{kti} \cdot WMD ) \\&-\left[ \left( 1 - \sum \limits _{l \in L} z_{ijklt} \right) \cdot MRD \right] , \\&\quad \forall i = |I |+ 1, ~ \forall j \in I, ~ \forall k \in K, ~ \forall t \in T \end{aligned} \end{aligned}$$
(26)
$$\begin{aligned} \small{} & {} \begin{aligned} AWT_{ktj} \ge&\,~\left( \sum \limits _{i \in V} \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} } \right) + \left( \sum \limits _{p \in P} { q_{jpkt} \cdot ST_{pk} }\right) \\&\quad -\left[ \left( 1 - \sum \limits _{l \in L} z_{ijklt} \right) \cdot MRD \right] , \\ {}&\quad \forall j \in V, ~ \forall k \in K, ~ \forall t \in T \end{aligned} \end{aligned}$$
(27)
$$\begin{aligned} \small{} & {} \begin{aligned} AT_{ktj} \ge&\,~AT_{kti} + \left( \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} } \right) +\left( \sum \limits _{p \in P} { q_{ipkt} \cdot ST_{pk} }\right) + ( BW_{kti} \cdot MBD ) \\&\quad -\left[ \left( 1 - \sum \limits _{l \in L} z_{ijklt} \right) \cdot (2 \cdot MRD) \right] , \\ {}&\quad \forall i, j \in V, i \ne j, i \ne |I |+ 1, ~ \forall k \in K, ~ \forall t \in T \end{aligned} \end{aligned}$$
(28)
$$\begin{aligned} \small{} & {} \begin{aligned} AT_{ktj} \ge&\,~\left( \sum \limits _{l \in L} { z_{ijklt} \cdot \frac{c_{ij}}{s_{l}} } \right) + ( BW_{kti} \cdot MBD ) - \left[ \left( 1 - \sum \limits _{l \in L} z_{ijklt}\right) \cdot (2 \cdot MRD)\right] , \\ {}&\quad i = |I |+ 1, ~ \forall j \in I, ~ \forall k \in K, ~ \forall t \in T \end{aligned} \end{aligned}$$
(29)
$$\begin{aligned}{} & {} AT_{kti} \le \sum \limits _{j \in V} \sum \limits _{l \in L} z_{jiklt} \cdot MRD, \quad \forall i \in V, ~ \forall k \in K, ~ \forall t \in T \end{aligned}$$
(30)
$$\begin{aligned}{} & {} \sum \limits _{i \in V} BW_{kti} \le M_{2} \cdot vs_{kt}, \quad \forall k \in K, ~ \forall t \in T \end{aligned}$$
(31)
$$\begin{aligned}{} & {} BW_{kti} \le \sum \limits _{j \in V, j \ne i} \sum \limits _{l \in L} z_{jiklt}, \quad \forall i \in V, ~ \forall k \in K, ~ \forall t \in T \end{aligned}$$
(32)

To clarify, the parameter \(M_{1}\) is set equal to the total demand of the customers for each type of product in all time periods, while the parameter \(M_2\) is set equal to the number of customers.

The objective function (1) refers to the minimization of the total cost of the supply chain system, which consists of inventory costs (average holding costs and penalty costs of deferred deliveries), general routing costs, driver wages costs, vehicles’ usage costs, fuel consumption costs, and carbon emission taxation costs. Constraints (2) indicate that the movement from a node to another with a selected vehicle in a specific time period will be performed with at most one specific speed level. Constraints (3) eliminate the one-node cycles in the supply chain network. Constraints (4) and (5) guarantee that a customer is serviced by at most one vehicle in each time period. Constraints (6) establish the balance between the interior and exterior flows of vehicles in each node in each time period. Each vehicle can perform at most one route in each time period, as imposed by constraints (7). The Miller-Tucker-Zemlin subtour elimination constraints are formulated as presented in Equations (8). Constraints (9) impose the balance between the interior and exterior flows of product load in each customer over each period of time. Constraints (10) guarantee that the product load in the vehicle moving between two nodes of the network cannot exceed the payload capacity of this vehicle. Constraints (11) impose that the space requirements of the products delivered to a customer in a specific time period by a selected vehicle cannot exceed its space capacity. Similarly, Constraints (12) guarantee that the weight of products transported by a selected vehicle in a specific time period cannot exceed its payload capacity. Constraints (13) denote that the deferred scheduled deliveries to a specific customer for a specific period of time must be equal to the demand of this customer. Moreover, constraints (14) impose that the deferred scheduled deliveries to a specific customer for a specific time period must be equal to the actual deliveries in this time period. Constraints (15) impose that a deferred scheduled delivery to a customer cannot exceed the corresponding demand of this customer. Constraints (16) and (17) guarantee that a vehicle will deliver to a customer if and only if this vehicle is scheduled to visit this customer. Constraints (18) and (19) ensure that a vehicle can perform a route in a specific time period if and only if it is selected in this time period. Constraints (20) and (21) guarantee the abidance to the speed limit regulations.

The following constraints refer to EU regulations on the working and driving times of truck drivers. Constraints (22), (23), and (24) focus on the accumulative driving time. In general, the accumulative driving time for a specific vehicle in a specific time period for each customer must be greater than the accumulative driving time of the previous customer plus the driving time to reach the current customer minus the duration of a possible break. Constraints (25), (26), and (27) refer to accumulative working time. The accumulative working time consists of both driving and servicing (products unloading) times. In general, the accumulative working time after the service of a specific customer in a specific time period must be greater than the accumulative working time after the service of the previous customer plus the driving time to reach the current one, plus the necessary time to service the current customer minus the duration of any possible break. Constraints (28), (29), and (30) address the feasibility of the arrival time to each node in each time period. It should be clarified that the constraints (30) guarantee that the overall duration of a route will not exceed the corresponding limit. Finally, constraints (31) and (32) impose that only a driver of a selected vehicle can perform a break in a specific time period.

4 Computational analyses

4.1 Computing environment and problem instances

The proposed MILP model was developed using the Gurobi Python interface. The commercial solver ran on a laptop PC (Windows 10 Home 64-bit), with an Intel Core i7-9750 H CPU at 2.6 GHz and 16 GB RAM. The execution time limit was set to two hours. To avoid potential “Out-Of-Memory" errors, the parameter “NodeFileStart" was set equal to 0.5 to utilize disk space for the storage of nodes. To validate the proposed model and perform the necessary computational analyses, 15 problem instances were randomly generated. The new benchmark set is publicly available at https://sites.uom.gr/sifalera/benchmarks.html.

The name of each problem case has the form “X-Y-Z”, where “X” denotes the number of customers, “Y” is the number of product types, and “Z” denotes the number of time periods. The first line in each file provides these three numbers (Customers, Types of Products, and Time Periods). The second line provides the coordinates of the location of the depot. The next “X” lines contain the coordinates of the location of the customers and their corresponding holding costs. Customer and depot locations’ coordinates were randomly generated following a uniform distribution in the range [0,80].

The following “Customers \(\cdot\) Periods” lines present the period-dependent demand for each customer for each type of product. The period-dependent demand of each customer for each type of product was generated following the normal distribution. For each demand value, the mean and standard deviation of the normal distribution were also randomly produced following the uniform distribution in the ranges of [5,15] and [0,5], respectively. The next line in each instance file provides the space required per type of product to load it into a truck. These values were randomly generated following the uniform distribution in the range of [1,3].

The space capacity of each type of truck is presented in the next line. Space capacity data of light-duty trucks were generated following the uniform distribution in the range of \([\text {MaxDemand} \cdot \text {MaxSpace}, \frac{\text {TotalDemand} \cdot \text {MaxSpace}}{2 \cdot \text {Periods}}]\) , where “MaxDemand” denotes the maximum value of demand produced, “MaxSpace” represents the maximum value of space required for the loading of one unit of product under the consideration of all product types, and “TotalDemand” \(= \sum \nolimits _{i \in I} \sum \nolimits _{p \in P} \sum \nolimits _{t \in T} d_{ipt}\). The space capacity of medium-duty trucks is equal to 1.2 times the space capacity of light-duty trucks, while the space capacity of heavy-duty trucks is 1.44 times greater than the space capacity of light-duty trucks.

The following \(|P |\) lines provide the time required to unload each unit of each type of product from each type of vehicle. Initially, for each type of product, a random number was produced from the uniform distribution on the interval [1,3]. The generated number was transformed into seconds by multiplying by 60. This product was set as the unloading time of the selected type of product in the case of light-duty trucks. In the case of medium-duty trucks, the previously produced time is increased by 1.05 times, while in case of heavy-duty trucks, it is increased by 1.08 times.

Based on the weight of each type of product, the corresponding data are provided in the next line of each instance file. The product weight for a specific product type was generated from the uniform distribution in the interval [1.5,5]. The last line in an instance file presents the payload capacity of each vehicle type. The payload capacity of light-duty trucks was randomly generated from the uniform distribution on the interval \([\text {MaxDemand} \cdot \text {AvgProductWeight}, (\frac{\text {TotalDemand}}{2 \cdot \text {Periods}}) \cdot \text {AvgProductWeight}]\), where “AvgProductWeight” is equal to the average unit weight of all product types. Similarly to the space capacity, the payload capacities of medium and heavy trucks are increased by 1.2 and 1.44 times the payload capacity of light-duty trucks, respectively.

4.2 Computational results

Table 5 provides the total and individual costs according to the best-found solution for each instance.

Table 6 provides the fleet composition as it has been scheduled in the best-found solutions for each problem case.

Table 5 Summary of total and individual costs
Table 6 Fleet schedule based on the solutions of basic model

An important decision in designing the considered supply chain system involves scheduling drivers’ breaks. In this context, we present an illustrative example of such schedules for the specific problem instance “8-3-3". Three breaks are scheduled, one for each time period. To elaborate, the driver of the heavy-duty truck is planned to take a break after visiting “Customer_4" in the first time period, another break is scheduled in the second time period after visiting “Customer_2", and in the third time period, the driver of a heavy-duty truck will take a break after visiting “Customer_8". The break schedule is summarized as follows:

  • 8-3-3:

    • First break:   (Heavy-duty, Period_1, Customer_4)

    • Second break: (Heavy-duty, Period_2, Customer_2)

    • Third break:   (Heavy-duty, Period_3, Customer_8)

4.3 Managerial insights

This section is dedicated to the computational sensitivity analysis of the potential impact on the total cost and structure of the supply chain system, due to possible fluctuations in key model parameters. The flow of these sensitivity analyses is summarized in Figure 1.

Fig. 1
figure 1

The sequence of different sensitivity analyses

4.3.1 Impact of CO\(_{2}\) taxation cost fluctuations

Carbon emissions taxation constitutes a key policy for the reduction of carbon emissions (Zhou et al. 2021; Xu et al. 2022). Thus, it is crucial to investigate the potential impact of CO\(_{2}\) taxation cost fluctuations on the emissions produced by the supply chain system, as well as on its overall structure. To this end, four CO\(_{2}\) taxation policies were examined. The first policy considers the absence of any taxation on CO\(_{2}\) emissions. The second policy considers an increase of 10% on the initial tax, while the third and fourth refer to the application of increased taxation rates by 15% and 20%, respectively. Figure 2 illustrates the trade-offs between different CO\(_{2}\) taxation policies and the produced emissions.

Fig. 2
figure 2

Impact of CO\(_{2}\) taxation policies on carbon emissions

The examination of various CO\(_{2}\) taxation levels and their impact on average CO\(_{2}\) emissions provides insightful findings. Firstly, it is evident that the introduction of a CO\(_{2}\) taxation policy plays a pivotal role in reducing emissions. The average CO\(_{2}\) emissions are observed to be at 947.97 kg, when no taxation is applied. Introducing an initial taxation level results in a slight reduction in average CO\(_{2}\) emissions to 937.96 kg, indicating a potential influence of taxation on emission levels. Further adjustments, such as a 10% increase in taxation, lead to a decrease in average CO\(_{2}\) emissions to 934.72 kg. However, the efficacy of carbon emission management measures diminishes when transitioning to more stringent taxation policies. This deterioration may be attributed to disparities in fleet composition as well as routing patterns, during specific time periods, possibly influenced by strict EU driving and working regulations. Figure 3 illustrates the slight differences in the average cost of the fleet composition under the effect of each CO\(_{2}\) taxation policy, while Fig. 4 presents the impact of different CO\(_{2}\) taxation policies on routing costs.

Fig. 3
figure 3

The relationship between different CO\(_{2}\) taxation policies and average vehicles usage costs

Fig. 4
figure 4

The relationship between different CO\(_{2}\) taxation policies and average routing costs

From a total cost perspective, no significant effects of different taxation policies have also been observed. Table 7 summarizes the average total and individual costs of the supply chain system under the effect of different CO\(_{2}\) taxation policies.

Table 7 Cost comparisons between different CO\(_{2}\) taxation policies

Table 8 focuses on two problem instances and provides fleet requirements per time period, aiming to better understand the impact of different carbon taxation policies on fleet composition.

Table 8 Differences in fleet composition and routing schedules based on CO\(_{2}\) taxation policies

4.3.2 Impact of fuel price fluctuations

Fuel price fluctuations can significantly affect traffic flows (Zhang and Burke 2020). Therefore, investigating the potential impact of fuel price fluctuations on the supply chain system under consideration constitutes an interesting research objective. Figures 56, and 7 illustrate how changes in fuel price affect average fuel consumption, fleet composition, and average transportation cost.

Fig. 5
figure 5

The impact of fuel price fluctuations on the average fuel consumption

Fig. 6
figure 6

The impact of fuel price fluctuations on the average vehicles usage cost

Fig. 7
figure 7

The impact of fuel price fluctuations on the average routing cost

According to the numerical analysis, there is a slight relationship between fluctuation in fuel prices and average fuel consumption. It has been observed that as the price of the fuel increases, the average fuel consumption gradually decreases. To handle such fluctuations, the solver attempted to provide a different fleet composition and delivery schedules. More specifically, an increase of 10% in fuel prices leads to fewer vehicles being used compared to the basic case, while a more severe increase of 20% leads to the utilization of more vehicles, but to more efficient routing patterns. However, a slight increase in fuel consumption was observed when the fuel price increased by 15%. This unanticipated outcome can possibly be caused by the effect of both strict EU regulations and the consideration of speed limits. To this end, the solver produces an inefficient routing scheme to produce a time-feasible solution that fully satisfies customers’ demands.

Table 9 provides the average total and individual costs of the supply chain system under the effect of different fuel price fluctuations.

Table 9 Cost comparisons under the effect of different fuel price fluctuations

4.3.3 Impact of holding cost fluctuations

Recent studies have underlined the important role of holding cost changes in a supply chain system (Hu et al. 2018). To this end, the potential impact of holding cost fluctuations on the supply chain system under consideration has been set under investigation. More specifically, two cases have been considered. According to the first, an increase in holding costs of 10% was applied. Similarly, an increase of 15% was considered in the second case. Table 10 provides the average total and individual costs under the effect of fluctuation of the holding cost.

Table 10 Cost comparisons under the effect of holding cost fluctuations

The numerical analysis revealed the significant impact of holding cost fluctuations on the average overall cost of the supply chain system. There is an obvious strong impact on average inventory cost (increased by 15% and 22% respectively) as well as on routing cost (increased by 5% and 1.5% respectively).

Figures 89, and 10 illustrate the replenishment plan for all types of product in all time periods for the third customer of the problem instance “6-4-5”, under the effect of holding cost fluctuations.

Fig. 8
figure 8

Replenishment plan for the third customer in the basic case of instance “6-4-5”

Fig. 9
figure 9

Replenishment plan for the third customer under the effect of the first holding cost fluctuation of instance “6-4-5"

Fig. 10
figure 10

Replenishment plan for the third customer under the effect of the second holding cost fluctuation of instance “6-4-5"

5 Conclusions

Sustainability has emerged as a cornerstone of competitive advantage in today’s dynamic and multifaceted business landscape. To address this prevailing challenge, this study has proposed a novel sustainable supply chain network optimization problem that simultaneously considers strategic, tactical, and operational decisions, encompassing economic, environmental, and social aspects, while adhering to stringent EU regulations on truck drivers’ working hours and driving schedules. To effectively address this intricate problem, an MILP model has been developed and implemented using the Gurobi-Python interface. Moreover, artificial problem instances were generated based on established guidelines from the open literature and solved using the Gurobi solver, yielding valuable insights into the problem’s behavior.

This study has investigated the impact of various external factors, including carbon emission taxation policies, fuel price fluctuations, and holding cost variations, on a sustainable supply chain network optimization problem. The findings provide valuable insights into the trade-offs between economic, environmental, and social objectives. The results indicate that carbon emission taxation policies play a pivotal role in reducing CO\(_{2}\) emissions. However, the efficacy of these policies diminishes when transitioning to more stringent taxation policies due to the associated increase in fuel costs. This highlights the need for careful consideration of the balance between emission reduction and economic viability when implementing carbon emission taxation policies. Fuel price fluctuations have a significant impact on average fuel consumption and fleet composition. Our study shows that as fuel prices increase, the solver attempts to provide a different fleet composition and delivery schedules to minimize fuel costs. This adaptation is crucial for maintaining operational efficiency and reducing fuel-related emissions. However, a slight increase in fuel consumption was observed when the fuel price increased by 15\(\%\) due to the combined effect of strict EU regulations and speed limits. This underscores the importance of considering these regulatory and infrastructural factors when analyzing the impact of fuel price fluctuations. Holding cost variations have a significant impact on the overall cost of the supply chain system. The sensitivity analysis of holding cost fluctuations reveals a significant impact on both the total cost and inventory management of the supply chain system. The increase in holding costs leads to higher inventory levels, which translates to increased inventory costs and higher total transportation costs. To mitigate these cost increases, the solver adjusts the replenishment plan by delivering more frequently to customers, leading to shorter replenishment cycles and lower inventory levels. In summary, the sensitivity analysis highlights the importance of considering holding costs when optimizing supply chain operations. Holding costs can significantly impact the overall cost structure and inventory management practices. The proposed MILP model provides valuable insights into the trade-offs between inventory management, transportation costs, and total cost of the supply chain system under varying holding cost scenarios.

While the FSMP-IRP model presented in the research paper represents a significant advancement in supply chain optimization, it is important to acknowledge certain limitations that can be addressed in future research:

Model Complexity: The model’s comprehensiveness in addressing multiple objective function components and incorporating various constraints leads to its increased computational complexity. Further optimization techniques may be needed to enhance the computational efficiency of the model, especially for large-scale problem instances. Developing an efficient heuristic solution method will be an interesting research direction, as it can obtain solutions for larger problem cases in a short computational time.

Uncertainty and Adaptive Problem Features: The model assumes deterministic demand patterns and does not explicitly consider stochastic factors or disruptions that may arise during operations. Incorporating probabilistic modeling and robust optimization techniques can improve the model’s resilience to uncertainties and disruptions. Moreover, from a problem perspective, the consideration of product-dependent holding costs and emission-based tax rates will be an interesting research direction.

Social Responsibility Considerations: While the model incorporates social responsibility aspects, such as driver working hours, further research can delve into more nuanced social factors, such as fair labor practices and ethical sourcing. It would be of great research interest to explore human resource management decisions related to the staffing of truck drivers and warehouse personnel within the current iteration of the problem (Graczyk-Kucharska et al. 2020).

Sustainability Impact Evaluation: The model estimates CO\(_2\) emissions and assesses the environmental impact of supply chain operations. However, a more comprehensive evaluation of the overall sustainability impact of the model would enhance its applicability in sustainable supply chain management.