1 Introduction

In 2020, more than two billion people purchased goods online (Statista 2020). Exhibiting a global annual growth rate of 8.1%, e-commerce is expected to account for 21.8% of all sales by 2024 (Statista 2020). This growth poses major challenges to all stakeholders in last-mile logistics, particularly in the parcel delivery sector.

Despite handling an ever-increasing parcel volume, delivery service providers struggle to make their businesses more profitable. In the highly competitive market environment, the revenue per parcel is decreasing (PricewaterhouseCoopers 2018). The principal reason for this is a lack of efficiency on the final leg of the delivery process, with approximately 50% of a parcel’s delivery cost being attributed to the “last mile” (McKinsey & Company 2016). Further aggravating the need for innovations is the issue of failed deliveries, i.e., the problem that customers are not at home to accept a parcel, which necessitates additional delivery attempts or leaves the customers unsatisfied and hunting for their parcels. Last, in the face of mounting environmental concerns from both the public and regulators, delivery service providers strive to contribute less to pollution and urban congestion.

Both technological (e.g., delivery drones, delivery robots) and infrastructural (e.g., cargo bike delivery, crowdshipping) solutions have been proposed in theory and practice to address the last-mile challenge (cf. Boysen et al. 2020; Savelsbergh and van Woensel 2016). Some of these concepts still seem futuristic, while others are already in use. Among the most applied practices is out-of-home delivery (OOHD). In this infrastructural concept, the delivery service provider drops off parcels at pickup stations where the customers collect them later, i.e., the customers perform the last leg of the delivery process themselves. Typical pickup stations are post offices, gas stations, and small businesses. In the literature, pickup stations are also referred to as PUDOs, transshipment facilities, delivery options, shared delivery locations, or common delivery locations. While the usage of pickup stations varies by country and is most often still only an auxiliary practice, parcel service providers across Europe have been substantially expanding their networks in the past five years (Last Mile Experts 2021).

Compared to home delivery (HD), OOHD offers important advantages to service providers. First, dropping off multiple parcels at a pickup station significantly reduces the number of locations the drivers need to visit. By consolidating orders, the same number of drivers can serve more customers, thereby lowering transportation costs. Furthermore, the consolidation of customer orders helps reduce the service provider’s environmental impact since the drivers travel less distance and fewer delivery vehicles are needed. Second, OOHD is not dependent on customers’ presence at specific points in time. The handover process is decoupled through the usage of pickup stations: drivers no longer meet customers. Instead, they visit pickup stations, where the customers retrieve the parcels when it is convenient for them. Third, for some customers, both HD and OOHD might be possible. In such cases, the service provider obtains additional (location) flexibility in the planning process.

For these reasons, delivery service providers seek to promote OOHD by introducing new delivery products in addition to the long-established HD product. However, “outsourcing the last mile” to customers also comes with new challenges. Service providers must account for customers’ preferences. As most customers prefer home delivery, simply offering an OOHD product will not result in the desired shift toward this product. Generally, incentivizing customers to buy an OOHD product poses challenges (PricewaterhouseCoopers 2018; DHL 2018). Therefore, providers must actively steer customers toward selecting OOHD products. Possible avenues are charging additional fees to HD customers, rewarding OOHD customers with discounts, or branding OOHD as eco-friendly. Moreover, current delivery networks and their processes are often not aligned with an OOHD strategy. With the adoption of pickup stations, drivers can drop off sizable parts of the entire vehicle load at only a few delivery locations. This results in tours with high truck loads and short durations, especially in densely populated delivery regions close to the depot. To make full use of the vehicles’ capacity and the drivers’ working hours, multiple trips per vehicle/driver are needed.

In this paper, we address these challenges and contribute to the literature as follows: First, we develop a framework for evaluating and analyzing last-mile business models based on a real-world case study. The framework enables a data-driven approach to determine the potential of delivery networks that rely heavily on OOHD in the face of uncertainty. Second, we introduce a VRP with delivery options and multiple trips (MT-VRPDO). In the MT-VRPDO, every customer must be visited by the service provider’s fleet of delivery vehicles. In contrast to most other VRPs, there might be multiple possible delivery locations for at least some customers. Each site is associated with location- and customer-specific costs. The vehicles are subject to capacity and time limitations and may each complete multiple trips, with every trip starting and ending at the depot. The service provider chooses the delivery locations, determines route plans, and assigns trips to vehicles to minimize the total operational costs. These encompass routing costs, fixed vehicle costs, and costs depending on the selected delivery locations. Third, we present a solution approach based on adaptive large neighborhood search (ALNS). We utilize both well-known components from the literature adapted to this context and new techniques explicitly developed for large problem sizes. Finally, we show how to apply this approach in a real-world setting to instances with more than 1000 customers and how to derive managerial insights concerning the OOHD products’ design and pricing using the proposed framework.

The paper is organized as follows. Section 2 introduces the case of GLS, a major European parcel delivery service provider, and demonstrates a data-driven framework for decision support in last-mile delivery. Section 3 gives an overview of the related vehicle routing literature and empirical studies in the context of last-mile delivery with pickup stations. Section 4 provides a mixed-integer programming (MIP) formulation of the multitrip vehicle routing problem with delivery options (MT-VRPDO). Section 5 presents the solution approach. Section 6 provides decision support based on the results of the case study. Section 7 concludes the paper with some final remarks.

2 Data-driven decision support for delivery options in last-mile delivery

This section outlines a data-driven approach to support the decision making of a major European parcel delivery service provider when introducing new business models. Section 2.1 examines the service provider’s challenges in detail, introducing different types of possible OOHD products and showcasing the interplay between the products’ design, pricing, customer behavior, and vehicle routing. Section 2.2 offers a more abstract view of the service provider’s decisions and presents a framework for the data-driven evaluation of delivery options in last-mile delivery.

2.1 Real-world case study

The GLS Group is a European parcel delivery service provider operating in 41 countries. In fiscal year 20/21, the company transported 840 million parcels. The German division of GLS (hereafter referred to as the service provider) currently offers HD as the standard shipping product and markets OOHD at roughly the same price. However, a study by PricewaterhouseCoopers (2018) states that half of the German customers in the business-to-consumer (B2C) segment are willing to accept OOHD to close-by pickup stations as the standard shipping product. The study also reports an average willingness-to-pay for HD of 2.77 € and suggests that offering OOHD as the standard for this segment and surcharging for HD entails considerable potential benefits.

The service provider has already established a network of pickup stations and intends to further tap into this potential. They envision offering a premium and a basic product for B2C customers. The premium product corresponds to HD, i.e., the current standard product, while the yet-to-be-introduced basic product enables OOHD. The service provider is still undecided about the exact nature of the basic (OOHD) product and the possible pricing of the HD product. Specifically, they seek to answer three major questions: “What is the cost-saving potential of different OOHD specifications?”, “How will certain combinations of OOHD products and surcharges for the HD product shape demand?”, and, ultimately, “What is the most promising combination of an OOHD product and the HD product’s surcharge?”.

2.1.1 OOHD product design and routing

The first question refers to the eligibility of different delivery locations. There are multiple conceivable specifications of the OOHD product that differ, e.g., in the number of eligible pickup stations and the inclusion or exclusion of home delivery. The service provider has identified four promising basic products and seeks to determine which to offer alongside HD, the existing home delivery product. Two of the proposed basic products (PU1 and PUX) limit the delivery locations to pickup stations, i.e., they exclude home delivery. The other two basic products (FLEX1 and FLEXX) allow delivery both at a home location and at pickup stations. In the case of PU1 and FLEX1, the pickup station is exactly specified, whereas in the case of PUX and FLEXX, multiple pickup stations may be feasible. Table 1 summarizes the possible delivery locations associated with different products.

Table 1 Products and delivery locations

By purchasing either the premium product, i.e., HD, or choosing the basic product on offer, the customers select which locations are eligible for their orders. While purchasers of HD and PU1 specify the delivery location exactly, purchasers of PUX, FLEX1, or FLEXX cede the decision on the final delivery location to the service provider. The service provider wants to exploit the potential flexibilities resulting from the location choice during the construction of route plans.

Figure 1 illustrates the effect of different products on the route plan. The first panel shows a depot, four customers, and two pickup stations. The depot (\(0)\) is marked by a square, the customers’ home locations (\(1\)\(4\)) are represented by small circles, and the pickup stations (\(5\)\(6\)) are represented by triangles. The large, dashed circles depict the ranges of the pickup stations, i.e., the distance that is acceptable for OOHD. The other three panels show a variety of route plans. In the second (third/fourth) panel, all customers choose HD (PU1/FLEX1). Bold arrows represent the routes of delivery vehicles, and dotted lines represent deliveries to pickup stations. Product selection profoundly impacts the route plan, as exemplified by customer 3. When customer 3 chooses PU1 or FLEX1 and thereby declares location 6 eligible, the resulting route is much shorter.

Fig. 1
figure 1

Different route plans

2.1.2 Pricing and customer choice behavior

The second question addresses the pricing of the HD product and the customers’ reaction to this pricing. The service provider is interested in generating additional revenue (from the sale of the premium product) resulting from a multiplication of the surcharge and the number of customers that select the HD product. Clearly, the proportion of customers that opts for the HD product is closely connected to the amount of the surcharge: the higher the surcharge is, the more customers will select OOHD instead. Consequently, it is unclear whether it is more profitable to set a high surcharge (with fewer customers choosing HD) or to steer more customers toward selecting the HD product by charging less.

2.1.3 Data-driven decision support

The last question requires joint consideration of the above effects for decision support. As discussed, the provider wants to select an OOHD product and set a surcharge for the premium HD product. These two problems can hardly be considered separately, as they are tightly interlaced. The price of the HD product influences the customer choice behavior and thereby the proportion of customers selecting OOHD, which in turn impacts the routing problem (the feasible delivery locations for a customer). At the same time, customers’ willingness-to-pay for the HD product depends on the “desirability” of the OOHD product on offer. For example, customers might opt for HD if the OOHD product is PUX and their orders can be delivered to any pickup station in their vicinity but opt for OOHD if the OOHD is PU1 and the order will be delivered to a specific pickup station.

To make a data-driven decision, the service provider must examine different combinations of OOHD products and HD surcharges. On the one hand, they want to gain insights into the potential effects of the four basic products on the vehicle routing problem and to assess the efficiency of the current pickup station network. This includes measurement of the environmental impact of OOHD, quantification of the value of customer flexibility, and an investigation into the importance of multitrip routing (cf. Sect. 6.2). On the other hand, they need to compute revenues and profits to decide which product-surcharge combinations are favorable (cf. Sect. 6.3).

2.2 An evaluation framework for delivery options

The service provider faces an integrated pricing and routing problem. Choosing an OOHD product and setting a surcharge for HD are planning tasks on a strategic or tactical level, whereas the construction of route plans is executed daily, i.e., on the operational level. As discussed above, the tasks are highly intertwined because the pricing and product selection decisions shape the resulting routing problem. Thus, the problem should ideally be handled in an integrated fashion, i.e., in a single optimization model. However, such an approach is unsuitable in this context for two reasons.

First, the service provider’s data basis is insufficient for such calculations. While historic customer location data are available and potential network configurations (e.g., the locations of pickup stations) are known, the provider has no data on customer choice behavior, such as customer preferences or price sensitivity. Hence, for a given surcharge, they cannot determine which B2C customers will select the basic OOHD product and which will select the premium HD product. If historic customer choice data were available, this could be accomplished by estimating a customer choice model (Baur et al. 2014; Berbeglia et al. 2022).

Second, solving the integrated problem is computationally intractable. The service provider does not want to base the selection of an OOHD product and the pricing of the HD product on the customers of a single day. Therefore, stochastic variations of the set of customers must be considered, making it exceptionally challenging to obtain accurate results for real-world problem sizes.

For the above reasons, we propose a framework that separates decisions on the strategic and tactical levels from those on the operational level. The basic idea is to leave decisions on the upper levels to the provider's management and generate the data necessary to make these decisions by solving a series of carefully chosen vehicle routing problems. This approach is summarized in our framework for a data-driven evaluation. Figure 2 depicts the framework with oval shapes representing managerial tasks and rectangular shapes depicting tasks centered on the extraction and processing of data. The figure can be read chronologically from top to bottom.

Fig. 2
figure 2

Data-driven evaluation framework

First, the provider specifies the logistics system and associated delivery processes. For example, they determine whether capacity restrictions at pickup stations are relevant and whether multiple tours per vehicle and shift are possible. Different vehicle routing problems may result from these specifications, and suitable solution algorithms must be deployed. To address the problem of missing data, the provider defines a set of scenarios. A scenario, in this context, is determined by three characteristics: an OOHD product, a surcharge for the HD product, and a proportion of customers choosing the OOHD product. The scenarios should represent a systematic enumeration of possible choice behaviors, i.e., reactions to every product-surcharge combination. The breadth of scenarios to be evaluated may be reduced depending on the provider’s knowledge of the customers’ perception of the products. For example, it is reasonable to assume that the number of customers selecting the HD product decreases as its surcharge increases.

A sufficient number of instances is generated for each scenario to account for stochastic variation. They differ in the sets of customers selecting OOHD/HD, whereas the proportion of customers is the same for all instances of a scenario. Following a data-driven approach, the provider uses the data available on past customers to make the instances as realistic as possible, e.g., by reflecting that customers located closer to pickup stations are more likely to select an OOHD product. They can uniquely adjust the sampling process to their knowledge and the available data.

The service provider can then tackle the underlying vehicle routing problem (cf. Sect. 4) for each instance of every scenario by applying a solution algorithm (cf. Sect. 5). Once all instances have been solved, evaluations and comparisons of the scenarios can be performed (cf. Sect. 6).

3 Literature review

The multitrip vehicle routing problem with delivery options combines aspects of a wide range of vehicle routing problems. To the best of our knowledge, the use of multiple trips has not been addressed in the context of vehicle routing problems with alternative delivery locations for each customer (order). In this section, we review the optimization problems most closely related to the MT-VRPDO and the methods used to solve them. We divide the routing-related literature into two parts (Sects. 3.1 and 3.2). Section 3.1 discusses problems that explicitly consider OOHD, i.e., the utilization of pickup stations and parcel lockers. Section 3.2 provides a brief overview of some broader problem classes related to the problem at hand, including vehicle routing problems with multiple trips. Furthermore, Sect. 3.3 notes some of the empirical studies conducted in the context of OOHD. Finally, Sect. 3.4 links the routing problem to the research area of demand management.

3.1 Vehicle routing with OOHD

Vehicle routing problems with OOHD have been discussed only recently. The key aspect of these problems is that orders are not necessarily delivered to a customer’s home location but may be dropped off at a pickup station or a parcel locker. Since the concepts of pickup stations and parcel lockers are largely interchangeable for routing purposes, we refer to both types of delivery locations as pickup stations and use the term parcel locker only when the distinction is necessary.

Naturally, not every assignment of a customer’s order to a pickup station is reasonable. In all contributions of the literature, the set of possible delivery locations is limited. Most commonly, the proximity of the pickup station to the customer’s home location determines feasibility: delivery to a pickup station is feasible if the distance between the customer’s home location and the pickup station is below a threshold.

Beyond this preselection of delivery locations, three different ways exist in the literature to handle the assignment of customer orders to delivery locations:

  • Location choice with additional consideration of customer preferences: The service provider chooses a delivery location among the set of predetermined delivery locations. The choice is restricted through additional constraints or directly affects the objective function value.

  • Free location choice: The service provider chooses a delivery location among the set of predetermined delivery locations.

  • Ex ante location: The exact delivery location of every order, be it HD or OOHD, is known ex ante; thus, there is no location choice component.

We present the exponents of these three approaches, starting with the authors who study location choice with additional consideration of customer preferences.

3.1.1 Location choice with customer preferences

In the models of this group, each order is associated with a set of possible delivery locations. This set is determined before the optimization process and may include both home locations and pickup stations.

Most customers prefer some delivery locations over others, e.g., their home location over a pickup station. There are two prevalent methods to handle customer preferences directly within the optimization problem: the preferences are represented by either additional constraints or additional objective function components. Dumez et al. (2021a) describe the vehicle routing problem with delivery options (VRPDO), where each delivery option is a different delivery location with a time window. Furthermore, a customer-specific preference level is associated with every location. Customers rank all delivery locations for their orders. Constraints ensure overall customer satisfaction, e.g., “at least 50% of the orders are delivered to the most preferred delivery locations; at least 75% of the orders are delivered to one of the two most preferred delivery locations.” The authors propose a heuristic that periodically solves a set partitioning problem on tours generated through large neighborhood search (LNS). Tilk et al. (2021) similarly construct the solution space. Similar to the VRPDO, their model features preference level, location capacity, and time window constraints. The authors develop a branch-price-and-cut algorithm and present a labeling algorithm to solve the pricing subproblem.

The optimization problem described by Mancini and Gansterer (2021) also considers different delivery locations and customer time windows. However, instead of modeling customer acceptance of pickup stations through preference levels, the choice of delivery location is reflected in the objective function, i.e., customers receive discounts (compensation) if they are not served at home. The proposed solution method combines iterated local search (ILS) and a matheuristic that fixes the variables associated with the choice of delivery location. Grabenschweiger et al. (2021) investigate a similar problem with location-specific compensation, focusing on different parcel locker sizes. Within their adaptive large neighborhood search (ALNS) application, they heuristically solve a bin-packing problem to assign orders to parcel lockers.

The following publications do not mention discounts explicitly. Nevertheless, the objective functions feature location-specific costs, thereby enabling a seamless integration of customer preferences. Baldacci et al. (2017) define the vehicle routing problem with transshipment facilities (VRPTF) as an edge-flow-based MIP, strengthen the corresponding LP relaxation, derive bounds through dual ascent heuristics, and solve it to optimality using a branch-cut-and-price algorithm. Friedrich and Elbert (2022) extend their model with time windows and develop a heuristic based on ALNS and variable neighborhood descent (VND). Sitek and Wikarek (2019) formulate a binary linear problem that penalizes assignments to pickup stations in the objective function. They solve the problem by first applying different criteria to assign customers to pickup stations and then considering a traveling salesman problem. Additionally, they present an exact approach to obtain optimal solutions for very small instances.

3.1.2 Free location choice

Other works incorporate a location choice between home locations and pickup stations but do not consider customer preferences in the model beyond the initial preselection of feasible locations. In these contributions, the focus is often on another aspect of the optimization problem, e.g., a two-echelon structure. Zhou et al. (2018) formulate the multidepot two-echelon vehicle routing problem with delivery options. Customers are served from a satellite depot on the second level at either a home location or a pickup station. In contrast to other models, the authors allow for split deliveries at pickup stations. The solution approach uses hybrid genetic search (HGS) with adaptive diversity control (Vidal et al. 2012). Enthoven et al. (2020) also propose a two-echelon distribution scheme and consider the case where orders are fulfilled either through cargo bikes on the second echelon or through trucks delivering to pickup stations on the first level. The problem is solved heuristically using an LNS-based approach.

Dragomir et al. (2022) address a pickup-and-delivery problem wherein the service provider chooses both the pickup and the delivery location. Whereas most locations are only eligible during a given time window, locker boxes function as time-independent delivery alternatives. The authors employ a multi-start ALNS heuristic to assign delivery requests to vehicles and variable neighborhood descent (VND) to determine the locations and the sequence of nodes visited by each vehicle. The problem formulation by Orenstein et al. (2019) permits deliveries to alternative locations as well as the nondelivery of a parcel. While nondeliveries incur penalties in the objective function, there is no distinction regarding location cost or customer preference between the different feasible delivery locations associated with every order. The authors present a multiperiod simulation study and show the value of customer flexibility by solving the problem heuristically using the savings algorithm, the petal algorithm, and tabu search (TS). Yu et al. (2022) integrate not only alternative locations but also the employment and compensation of occasional drivers, a practice commonly referred to as “crowdshipping,” into their model. Customers belong to different types that signify the feasibility of the different delivery modes. Their solution approach is based on ALNS. Zhou et al. (2016) study a location-routing variant of a VRPDO by including the decision of whether to open a pickup station. Customers may be served at preselected pickup stations without incurring further costs. Furthermore, their model includes a failure probability for home deliveries. The problem is solved with HGS combining a genetic algorithm and local search components.

3.1.3 Ex-ante location

In the third group of papers, the service provider does not decide on the delivery location. All delivery locations are fixed before optimization; i.e., there is only one feasible delivery location for each order. Some orders are assigned to home locations, while others are assigned to pickup stations. As a result, there is no location choice, and the optimization focuses on routing aspects. These contributions highlight the benefits of consolidating orders at pickup stations rather than the advantages of location flexibility.

Employing both ruin-and-recreate and genetic search (GS) paradigms, Heshmati et al. (2019) find that widespread use of pickup stations significantly improves many KPIs, including costs, CO2 emissions, and the number of vehicles used. In a simulation study based on the Belgian city of Antwerp, Arnold et al. (2018) find that including pickup stations in the delivery network is profitable “even at high pickup costs” compared to traditional home delivery, especially when failed home deliveries are taken into account. The authors first cluster the customers and then apply the savings algorithm to estimate delivery costs.

Zhang and Lee (2016) are among the first to incorporate parcel lockers and multiple depots into the vehicle routing problem. Using ant colony optimization (ACO), they show that both total delivery duration and the distance traveled are significantly reduced when the adoption rate of OOHD increases. He et al. (2019) obtain similar results by applying their two-stage matheuristic to a closely related problem with a single depot while also providing insights into a dynamic variant.

3.1.4 Overview

Table 2 provides an overview of the literature on vehicle routing problems with OOHD. The second column indicates whether the service provider chooses the final delivery location. The third column denotes how customer preferences are considered beyond the initial preselection of feasible delivery locations. P (preference) denotes approaches defining preference levels and ensuring an overall service level through constraints. D (discount) represents problem formulations where the objective function is directly influenced by the location choice, as customers receive a discount if they are served at a pickup station. (D) signifies works that exhibit such location-dependent objective function components without labeling them as discounts or compensation to the customer. C (computational study) denotes papers that extensively investigate the influence of customer preferences in the computational study. Columns 4 and 5 show whether further constraints concerning the capacity limits of pickup stations and time windows are included. The sixth column depicts the makeup of the objective function, where T denotes transportation costs (arc-based), F denotes fixed vehicle costs, L denotes location-specific costs, and O denotes other components. Note that both fixed vehicle costs and the emission costs regarded by Heshmati et al. (2019) and Arnold et al. (2018) may be attributed to single arcs and may thus be included in the transportation costs T. The final column shows the proposed solution method.

Table 2 Overview of literature dealing with OOHD

3.2 Related routing problems

The defining properties of the MT-VRPDO are the choice of delivery location, the location-specific delivery costs, and the multiple uses of vehicles. To the best of our knowledge, these properties have not been examined in combination. When viewed individually, they are not unique to the MT-VRPDO and have been discussed in the context of various other vehicle routing problems. We will now briefly outline further related routing problems that address these aspects without explicitly representing the application area of pickup stations.

3.2.1 Location choice

In the generalized vehicle routing problem (GVRP) introduced by Ghiani and Improta (2000), all locations are parts of mutually exclusive clusters. The GVRP is concerned with finding the shortest routes visiting all clusters exactly once. In the context of parcel delivery, each cluster represents a customer with a set of feasible delivery locations. The GVRP is seldom solved directly but instead serves as a modeling tool for a wide range of vehicle routing problems, in which the set of locations to be visited is not predetermined (cf., e.g., Baldacci et al. 2010). However, in many of these problems, the clusters are not necessarily mutually exclusive, e.g., multiple customers may have the same feasible pickup station. Among these problems are the GVRPTW (cf., e.g., Yuan et al. 2021; Dumez et al. 2021b), the VRPDO, and the vehicle routing problem with roaming delivery locations (VRPRDL).

The VRPRDL is based on the concept of trunk delivery, i.e., a technology granting the delivery service provider short-term access to the trunk of a customer’s car. Associated with every order are multiple locations and corresponding time windows that designate delivery opportunities. This problem was first laid out by Reyes et al. (2017), who present a solution method based on the ruin-and-recreate paradigm. Ozbaygin et al. (2017) provide a branch-and-bound algorithm for the VRPRDL and incorporate the possibility of home delivery during the entire service horizon.

Jungwirth et al. (2022) formulate an optimization problem that applies to the area of hospital scheduling. It features flexible service locations, time windows, time-dependent location capacities, and location- and customer-specific cost components (compensations). They provide a solution approach that utilizes ALNS, guided local search, and a backtracking mechanism to revise prior decisions in the construction phase. Furthermore, they show its benefits compared to manual planning.

3.2.2 Location-specific costs

Due to location-specific costs, e.g., compensation paid to pickup station owners, the MT-VRPDO is also closely related to optimization problems belonging to the class of vehicle routing problems with profits (VRPPs). In these problems, there is no obligation to visit all locations. Instead, locations are associated with profits, and the decision maker selects the most profitable locations with respect to routing constraints, such as vehicle capacity and maximum route duration. Hence, the choice of location often also implies which orders are accepted and which are rejected. Archetti et al. (2014) present a review of VRPPs.

Among VRPPs, the vehicle routing problem with private fleet and common carrier (VRPPC) is the most similar to the MT-VRPDO. First described as the “truckload and less-than-truckload problem” by Chu (2005), the VRPPC allows outsourcing orders. It is concerned with minimizing the total cost, consisting of fixed vehicle costs, travel costs, and fees paid to subcontractors (“common carriers”). Usually, every vertex is assumed to be associated with a specific fee. The VRPPC is a special case of the MT-VRPDO, in which every vehicle can perform only one trip and the subcontractor locations (~ pickup stations) are all identical to the depot. Similar to the MT-VRPDO, the VRPPC generalizes the GVRP. Similarly, in the vehicle routing problem with occasional drivers (VRPOD, Archetti et al. 2016), private individuals are utilized and compensated for fulfilling delivery requests, a concept also known as “crowdshipping.”

3.2.3 Multitrip

Finally, in the MT-VRPDO, a vehicle may complete multiple trips, a case first described by Fleischmann (1990). Cattaruzza et al. (2016) provide a comprehensive overview of mathematical formulations, as well as exact and heuristic solution approaches. To address large instances, both population-based (cf., e.g., Cattaruzza et al. 2014) and ALNS-based algorithms (cf., e.g., François et al. 2019) have been proposed. Azi et al. (2010) combine two characteristics of the MT-VRPDO by including both multiple trips and location-specific cost components. In contrast to the MT-VRPDO and in line with many other VRPPs, each order is associated with only one feasible delivery location and can be rejected. Moreover, the authors propose a branch-and-price algorithm. In further work, the same authors investigate the problem in a dynamic environment (Azi et al. 2012) and present an ALNS heuristic to efficiently solve larger instances (Azi et al. 2014).

3.3 Empirical studies

In addition to the routing literature listed above, OOHD concepts have been widely studied on a more aggregated level without explicit routing components. Morganti et al. (2014b) present the alternative delivery modes used in Germany and France, and Morganti et al. (2014a) document the massive growth of pickup station and locker networks in both countries. Durand and Gonzalez-Feliu (2012) compare four last-mile delivery scenarios and find that consolidation in pickup stations leads to significant savings. Edwards et al. (2010) calculate CO2 emissions in the context of failed delivery attempts and discuss critical success factors for employing a network of pickup stations. Song et al. (2009) also investigate delivery failures and compute considerable potential cost and emissions savings. Lemke et al. (2016) study the usability of parcel lockers in Poland from a customer perspective and contend that their popularity will increase primarily due to environmental aspects. Rohmer and Gendron (2020) provide an overview of different pickup facilities and the corresponding optimization problems and solution methods. Janjevic et al. (2019), Faugère and Montreuil (2020), and Liu et al. (2019) discuss the design of delivery networks concerning the inclusion of pickup stations. Using discrete choice modeling to integrate customer behavior, Lin et al. (2022) aim to optimize parcel locker networks with regard to expected profit from the perspective of a parcel locker company. Vakulenko et al. (2018) conduct a focus group study with an emphasis on customer value. Kawa (2020) presents results from interviews conducted with online retailers and finds that OOHD positively impacts customer satisfaction.

3.4 Demand management

Steering customers to select profitable products is a challenge in the research area of demand management. The incorporation of demand management strategies into vehicle routing issues is mostly examined in dynamic and stochastic environments (cf. Fleckenstein et al. 2022; Snoeck et al. 2020). With application areas such as attended home delivery, same-day delivery, and ridesharing, the research focus has also been on vehicle routing problems heavily constrained by time windows. By offering delivery time windows of different lengths at different (static) prices, a concept known as differentiated pricing (Agatz et al. 2013), service providers can exploit heterogeneities in customer flexibility. Gönsch (2020) surveys incompletely specified products (ICSPs, flexible products). For ICSPs, not all specifics of the product are disclosed to the customer at the moment of purchase. Instead, the seller benefits from an added level of planning flexibility. Customers benefit from the lower price of the ICSP in comparison to a completely specified product. The challenge is to design an ICSP that generates considerable planning flexibility without requiring excessive customer flexibility. Strauss et al. (2021) investigate flexible products in the context of home delivery. Customers may choose products that include multiple possible delivery time windows without specifying the actual time of delivery upon purchase. The actual time window is not revealed to the customers of the flexible product until all customers of a planning period have made their product selection. In this manner, the service provider obtains routing flexibility. Moreover, prices are set dynamically, such that customers are incentivized to select profitable products.

4 The multitrip VRP with delivery options

This section presents a MILP formulation of the multitrip vehicle routing problem with delivery options (MT-VRPDO). Section 4.1 introduces the terminology and the notation; Sect. 4.2 presents the MILP formulation.

4.1 Terminology

Let \(\mathcal{G}=\left(\mathcal{N},\mathcal{A}\right)\) be a complete directed graph, where the vertex set \(\mathcal{N}\) consists of a set of delivery locations and the depot location (\(0\)). The set of delivery locations includes both home locations and pickup stations. Each arc \(\left(i,j\right)\in \mathcal{A}\) is associated with a driving time \({t}_{ij}\) and a cost coefficient \({c}_{ij}\), which depends on the driving time and the driving distance, i.e., it includes fuel consumption costs and the driver’s hourly wages.

Let \(\mathcal{K}=\left\{1,\dots ,k,\dots ,K\right\}\) be the set of customers. Each customer has an order of weight \({d}_{k}\) and a consideration set \({\mathcal{I}}_{k}\) of possible delivery locations, where each location corresponds to a different delivery option. The makeup of the set \({\mathcal{I}}_{k}\) is determined according to customer \(k\)’s product choice (cf. Table 1).

Every delivery location \(i\) has a capacity \({\kappa }_{i}\). The set \({\mathcal{K}}_{i}=\left\{k\in \mathcal{K}:i\in {\mathcal{I}}_{k}\right\}\) denotes the set of customers whose orders may be dropped off at location \(i\). Visiting customer \(k\) at location \(i\) incurs location-specific cost \({g}_{ki}\), which may include compensation paid to pickup station owners and customers. Note that \({g}_{ki}\) may be negative if customers are charged an extra fee for certain delivery locations, e.g., their home location.

The parcel service provider utilizes a homogeneous fleet, where each vehicle has capacity \(Q\). For every vehicle used, a fixed cost \(F\) arises. Additionally, there is a time limit \({T}_{\max}\) bounding the time on the road for each driver (= vehicle).

A vehicle may complete multiple trips within the time limit, i.e., a vehicle may return to the depot to be reloaded for a second trip. Reloading at the depot takes \(r\) time units. Split deliveries are not feasible.

The MT-VRPDO is composed of three closely intertwined decisions. With the aim of minimizing the total costs, the decision maker assigns customers to delivery locations, plans routes, and groups routes for individual drivers.

4.2 Model

The MT-VRPDO can be described as a mixed-integer linear program. The model is based on the MT-VRP formulation by Karaoğlan (2015) and the VRPDO by Dumez et al. (2021a) and adapted to the setting described above. All parameters and decision variables are summarized in Table 3.

$${\text{Min}}\mathop \sum \limits_{{\left( {i,j} \right) \in {\mathcal{A}}}} c_{ij} \cdot x_{ij} + F \cdot \left( {\mathop \sum \limits_{{j \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}}} x_{0j} - \mathop \sum \limits_{{\left( {i,j} \right) \in {\mathcal{A}}:i,j \ne 0}} z_{ij} } \right) + \mathop \sum \limits_{{k \in {\mathcal{K}}}} \mathop \sum \limits_{{i \in {\mathcal{I}}_{k} }} g_{ki} \cdot y_{ki}$$
(1)
$${\text{s}}{\text{. t}}{.}\;\mathop \sum \limits_{{j \in {\mathcal{N}}}} x_{ij} = \mathop \sum \limits_{{j \in {\mathcal{N}}}} x_{ji} \;\;\;\;\;\forall { }i \in {\mathcal{N}}$$
(2)
$$\mathop \sum \limits_{{j \in {\mathcal{N}}}} x_{ij} \le 1\;\;\;\;\;\forall { }i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(3)
$$\mathop \sum \limits_{{i \in {\mathcal{I}}_{k} }} y_{ki} = 1\;\;\;\;\;\;\forall { }k \in {\mathcal{K}}$$
(4)
$$y_{ki} \le \mathop \sum \limits_{{j \in {\mathcal{N}}}} x_{ji} \;\;\;\;\;\;\forall k \in {\mathcal{K}},i \in {\mathcal{I}}_{k}$$
(5)
$$T_{i} + s_{i} + t_{ij} \le \left( {1 - x_{ij} } \right)T_{{{\text{max}}}} + T_{j} \;\;\;\;\;\forall { }\left( {i,j} \right) \in {\mathcal{A}}:j \ne 0$$
(6)
$$T_{i} + s_{i} + t_{i0} \le T_{{{\text{max}}}} \;\;\;\;\;\forall { }i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(7)
$$\mathop \sum \limits_{{j \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}}} z_{ij} \le x_{i0} \;\;\;\;\;\forall { }i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(8)
$$\mathop \sum \limits_{{i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}}} z_{ij} \le x_{0j} \;\;\;\;\;\forall { }j \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(9)
$$T_{i} + s_{i} + t_{i0} + r + t_{0j} \le \left( {1 - z_{ij} } \right)T_{\max } + T_{j} \;\;\;\;\;\forall { }\left( {i,j} \right) \in {\mathcal{A}}:i,j \ne 0$$
(10)
$$\psi_{i} = \mathop \sum \limits_{{k \in {\mathcal{K}}_{i} }} d_{k} \cdot y_{ki} \;\;\;\forall { }i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(11)
$$u_{i} + \psi_{j} \le \left( {1 - x_{ij} } \right)Q + u_{j} \;\;\;\;\forall { }\left( {i,j} \right) \in {\mathcal{A}}:i,j \ne 0$$
(12)
$$\psi_{i} \le u_{i} \le Q\;\;\;\forall { }i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(13)
$$\psi_{i} \le \kappa_{i} \;\;\;\;\forall { }i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(14)
$$s_{i} = v_{i} + \mathop \sum \limits_{{k \in {\mathcal{K}}_{i} }} w_{ki} \cdot y_{ki} \;\;\;\forall { }i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(15)
$$x_{ij} \in \left\{ {0,1} \right\}\;\;\;\;\forall \left( {i,j} \right) \in {\mathcal{A}}$$
(16)
$$y_{ki} \in \left\{ {0,1} \right\}\;\;\;\forall k \in {\mathcal{K}},i \in {\mathcal{I}}_{k}$$
(17)
$$z_{ij} \in \left\{ {0,1} \right\}\;\;\;\;\forall \left( {i,j} \right) \in {\mathcal{A}}:i,j \ne 0$$
(18)
$$T_{i} \ge 0\;\;\;\forall i \in {\mathcal{N}}$$
(19)
$$u_{i} \ge 0\;\;\;\forall i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(20)
$$\psi_{i} \ge 0\;\;\;\forall i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(21)
$$s_{i} \ge 0\;\;\;\forall i \in {\mathcal{N} {\setminus} }\left\{ 0 \right\}$$
(22)
Table 3 Decision variables and parameters

The objective function (1) minimizes the total cost, which consists of arc-based transportation costs, fixed vehicle costs, and location-specific delivery costs. Since vehicles may be used on multiple trips, whereas fixed costs are incurred only once per vehicle, the fixed costs cannot simply be incorporated into the transportation cost matrix. \({\sum }_{j\in \mathcal{N}{\setminus} \left\{0\right\} }{x}_{0j}-{\sum }_{\left(i,j\right)\in \mathcal{A}:i,j\ne 0}{z}_{ij}\) is the number of vehicles used, which is calculated by subtracting the number of follow-up trips from the total number of vehicles leaving the depot.

Constraints (2) are flow conservation constraints. Constraints (3) ensure that every nondepot vertex is visited at most once, while constraints (4) guarantee that every customer is served at a feasible location. Constraints (5) state that delivery location \(i\) can be selected for customer \(k\) only if a vehicle visits location \(i\). Constraints (6) through (10) ensure adherence to the time limit and prevent subtours. Constraints (6) set the points of visit within a trip, and constraints (7) make sure that there is sufficient time for the vehicle to return to the depot after visiting the last customer on a trip. Constraints (8) through (10) incorporate the possibility of multiple trips. The decision variable \({z}_{ij}\) indicates that a trip starting with vertex \(j\) and another trip ending with vertex \(i\) are carried out consecutively by the same vehicle (cf. Karaoğlan 2015). It may be set to \(1\) under the following three conditions: vertex \(i\) is the last customer of a trip (i.e., \({x}_{i0}=1\)), vertex \(j\) is the first vertex of a trip (i.e., \({x}_{0j}=1\)), and the time limit is not exceeded. Constraints (11) through (14) guarantee compliance with the capacity limits of both vehicles and pickup stations. Constraint (15) defines the service time.

5 Solution method

MT-VRPDO instances of realistic size cannot be solved exactly within a reasonable timeframe. Therefore, we employ a matheuristic that combines an adaptive large neighborhood search (ALNS) with the periodical solution of traveling salesman problems (TSP) and bin-packing problems (BPP). ALNS was introduced by Ropke and Pisinger (2006) and relies on the iterative application of the destroy-and-repair principle (also known as ruin and recreate (Shaw 1998)). As it is a concept widely used to address large-scale combinatorial optimization problems, we limit the discussion in this chapter to the adaptations and mechanisms we introduced instead of describing the entire solution procedure in detail.

The newly proposed algorithmic features are motivated by either analyzing the optimization problem (problem-oriented) or the real-world application (application-oriented). Problem-oriented mechanisms rely on properties of the MT-VRPDO. For example, the above-mentioned TSP component is based on the property that no (implicit) precedence relationships between vertices on the same trip exist, e.g., due to the presence of time windows. In many other last-mile optimization problems, such a component could lead to infeasible solutions (cf. Sect. 3.1). Application-oriented mechanisms factor in the characteristics of real-world instances and the strategic nature of the decisions to be made. For example, the sweep destroy operator presented in Sect. 5.1 exploits the spatial distribution of customers common parcel delivery networks.

The remainder of this section is structured as follows. Sections 5.1 and 5.2 list the utilized destroy-and-repair operators, respectively. Section 5.3 presents selected algorithmic features, which were implemented to either accelerate the solution process or enhance the likelihood of finding very good solutions. A pseudocode overview of the solution procedure is presented in Appendix A.

5.1 Destroy operators

We apply eight different destroy operators. The first group of operators was developed to specifically tackle this routing problem and its large instances. To the best of our knowledge, these operators have not been previously proposed:

  • Sweep: Customers are deleted if their home locations are in a specified range of polar angles corresponding to a share of \(\delta\)% of all customers. The polar angles are calculated from the location coordinates with the depot at the origin. In the chosen sector, both customers assigned to home locations and customers assigned to pickup stations are removed from the solution. The intuition behind this application-oriented operator is that customers in the “border” region of two or more trips have a higher probability of being in the “wrong” trip. By removing all customers of a certain sector, capacity and time slack is induced in the trips that originally visited customers in the sector. This process facilitates the insertion of the deleted customers. The value of \(\delta\) is chosen such that a sector typically includes the locations of two to four trips.

  • Pickup station: A pickup station is selected at random. All customers that may be assigned to this pickup station are removed from the solution, i.e., if pickup station \(i\) is selected, all customers in \({\mathcal{K}}_{i}=\left\{k\in \mathcal{K}:i\in {\mathcal{I}}_{k}\right\}\) are removed. Breaking up the assignment of customers to delivery locations through this problem-oriented operator can alter the solution substantially, as solutions vary considerably depending on which customers are served at a pickup station.

  • Related trips: A random trip is selected. A second trip is chosen whose average polar coordinate of the visited locations is closest to the average polar coordinate of the first trip. All customers on both trips are removed from the solution. The intention is that the new solution resulting from the repair operator necessitates fewer trips. It is thus especially useful for multitrip vehicle routing problems (problem-oriented).

Figure 3 demonstrates how the novel destroy operators work. The first panel shows the original solution. Black circles represent the home locations of customers who must be visited at their home location. White circles represent the home locations of customers whose orders may be dropped off at the pickup station (triangle) or at the home location. The dotted lines signify that a customer’s order is dropped off at the pickup station in the current solution. The other three panels (from left to right) show incomplete solutions after applying sweep destroy, pickup station destroy, and related trips destroy.

Fig. 3
figure 3

Destroy operators

The second group of operators are adaptations and slight variations of operators commonly used in a variety of vehicle routing problems (see Appendix B for a detailed description). These are random destroy (Ropke and Pisinger 2006), shortest trip destroy (Nagata and Bräysy 2009), arc cost-based destroy (Ropke and Pisinger 2006), arc width-based destroy (Arnold et al. 2019), and string destroy (Christiaens and Vanden Berghe 2020).

5.2 Repair operators

After the application of a destroy operator, a (reduced) set of trips that meet all trip feasibility requirements remains. To restore the feasibility of the overall solution, the algorithm must reinclude the removed customers. Whenever a customer \(k\) is to be reincluded, all possible delivery locations \(i\in {\mathcal{I}}_{k}\) are examined. \(k\) must be assigned to a delivery location, and possibly, the delivery location must be (re)inserted into a trip. We define a reinclusion point as a tuple consisting of a location, a trip, and a position in that trip. We use the term best (reinclusion) point to refer to the reinclusion point leading to the lowest reinclusion costs.

In each iteration, the algorithm chooses one of ten different operators to repair the solution. Nine of these repair operators are extensions of parallel insertion (cf. Solomon 1987), and one is based on regret insertion (cf. Potvin and Rousseau 1993). All ten operators include an assignment decision on top of the routing-focused insertion decision. The nine operators based on parallel insertion are detailed below. They differ in the order in which they reinclude the customers and in the strategy that determines the reinclusion point. Every combination of reinclusion order and reinclusion strategy constitutes a different repair operator.

Prior to the reinclusion of the first customer, the removed customers are sorted in order of nonascending demand, nonascending distance to the depot, or randomly. In real-world instances, there often are large differences between the demand sizes of different customers. As customers with high demand are harder to incorporate into trips close to the capacity limit than are customers with lower demand, it is advantageous to reinclude them early during the repair process. Similarly, customers with a large distance to the depot are more likely to cause a significant detour and an exceedance of the time limit. Hence, an early reinclusion of such customers is preferable. Additionally, the early reinclusion of high-demand and remote customers warranted by this application-oriented mechanism helps in “shaping” a trip. In contrast, customers with low demand in the vicinity of the depot are easily shifted from one trip to another.

Three reinclusion strategies (best point, best point with blinks, pickup station-based) are utilized. In the best point insertion strategy, the insertion of the current customer is performed at the reinclusion point of the least cost increase. In the blinks variant, every reinclusion point is skipped with a (small) blinking probability (cf. Christiaens and Vanden Berghe 2020) to randomize the reinclusion outcome and in the hope of finding better reinclusion points at a later stage of the repair process. In the proposed station variant of parallel insertion, the customer is assigned to a pickup station regardless of whether this corresponds to the best reinclusion point (i.e., the least cost increase) as long as the reinclusion is feasible. While moving a single customer to a pickup station is often detrimental to the objective value, shifting many customers to a pickup station simultaneously (as proposed through this problem-oriented approach) facilitates the exploration of the solution space.

5.3 Enhancements to the solution procedure

In this section, we specify some performance enhancements of the algorithm. They are primarily motivated by the application, as they demonstrate approaches to handling large, real-world instances in the context of strategic decision making. First, we propose pruning the search space to handle large instances more efficiently. Second, we describe the circumstances under which we allow temporary infeasibilities and present a look-ahead mechanism. Both measures aim to prevent the introduction of additional vehicles during the search process, since this is usually tantamount to large increases in the overall costs. Third, we outline an intensification routine. Last, we illustrate how solutions of certain instances are utilized to warm-start the solution of other instances.

5.3.1 Pruning the search space

As described in Sect. 5.2, there are numerous possibilities to reinclude a customer. When addressing large instances, it is, however, not practicable to exhaustively search for the best reinclusion point of customer \(k\), as this would require checking the value of an assignment to each delivery location in the set \({\mathcal{I}}_{k}\) and an insertion at any position of any trip. This can easily yield thousands of possible reinclusion points for just a single customer and thus thousands of time-consuming feasibility and profitability checks.

We propose limiting those reinclusions that necessitate an insertion (and not just an assignment) to the \(\theta\) trips nearest to location \(i\in {\mathcal{I}}_{k}\). Nearness is calculated as the minimal Manhattan distance to a location on the trip with a heavy penalty for a foreseeable violation of trip capacity. The nearest trips are calculated every \(\eta\) iterations for the upcoming \(\eta\) customers to be reincluded and all pickup stations that are not yet visited in the solution.

5.3.2 Temporary infeasibility

When no feasible reinclusion point is found during the reinclusion loop of customer \(k\), a new trip may be initialized. However, initializing a new trip tends to generate solutions that are neither cost-efficient nor provide a promising start solution for the following ALNS iterations. Therefore, the initialization of a new trip depends on the “degree” of the solution’s incompleteness at this stage of the reinclusion process:

  • If there are more than \(\zeta\) (randomized threshold value) customers not included in the solution, a new trip to one of the delivery locations \(i\in {\mathcal{I}}_{k}\) is added since it is likely that the remaining noninserted customers will not fit into the existing trips anyway.

  • If the number of nonincluded customers is smaller than \(\zeta\), \(k\) is not reincluded and is instead appended to a list of infeasible customers, \(infCusts\).

After all customers have been considered for reinclusion, a second-level process is launched if some customers cannot be feasibly reincluded, i.e., if \(infCusts\) is nonempty. All customers in \(infCusts\) are then reincluded at their best reinclusion points. This results in infeasibilities (exceeded time and/or capacity limits).

To restore feasibility, (different) customers are removed from the infeasible trips. Customers are deleted from either end of the trip until the resulting incomplete solution meets all feasibility requirements. These customers are often close to the depot and are thus more likely to fit into other trips. The deleted customers are then reincluded in the solution with a repair operator.

5.3.3 Look-ahead mechanism

Another way to prevent an excessive number of vehicles is to proactively reduce the number of customers added to the list \(infCusts\). We propose a look-ahead mechanism that is invoked during the repair procedure whenever a customer is assigned to a pickup station \(i\) not yet visited in the solution. After \(i\) is inserted into one of the trips, a removed customer \(k\) who can be served at only this pickup station (\({\mathcal{I}}_{k}=\left\{i\right\}\)) is immediately reincluded (i.e., assigned to \(i\)).

5.3.4 Intensification routine

Once no improvements have been found in a predetermined number of ALNS iterations, we start an intensification routine to reignite and perturb the search process. The 2-opt*-neighborhood (Potvin and Rousseau 1995), which recombines two trips such that the beginnings of the trips are interchanged, is especially suitable in this application context for several reasons. First, its size is relatively small, making it possible to search the neighborhood exhaustively. Second, multiple trips are involved in every move, which increases the chance that an improvement will lead to further improvements during the ALNS or TSP phase. Last, large parts of the input solution remain untouched by the 2-opt*-moves, such that the likelihood of spending considerable computational effort exploring unpromising areas of the solution space is slim. Once the 2-opt*-neighborhood is searched exhaustively, the best move is performed. The procedure is repeated until no more improving 2-opt*-moves can be found. Appendix A details a proposed reduction of the 2-opt*-neighborhood that does not exclude any feasible solutions.

5.3.5 Warm starts

Finally, we make use of the similarity of different MT-VRPDO instances. For two MT-VRPDO instances, \(\alpha\) and \(\beta\), that differ in only the sets of feasible delivery locations \({\mathcal{I}}_{k}\), every solution to \(\alpha\) is feasible for \(\beta ,\) if \({\mathcal{I}}_{k}^{\alpha }\subseteq {\mathcal{I}}_{k}^{\beta } \forall k\in \mathcal{K}\). In such cases, the location assignments included in the solution to instance \(\alpha\) are necessarily feasible location assignments for instance \(\beta\). Thus, the best solution to \(\alpha\) is used to warm-start the solution process for \(\beta\). As solving the MT-VRPDO is primarily useful for the evaluation of closely related scenarios (cf. Sect. 6, Appendix A), warm-starting provides another opportunity to reduce the computational effort.

6 Data-driven decision support

In this section, we evaluate and compare the different scenarios of our case study. Section 6.1 describes the data basis and characterizes scenarios and instances. Section 6.2 examines the cost-saving potentials of OOHD, the effect of offering different products, and the importance of multitrip routing. Furthermore, we examine the environmental impact of OOHD. Section 6.3 expands this perspective by means of a demand management component. Through the simulation of scenarios reflecting different customer choice behaviors and the solution of the corresponding MT-VRPDOs, we derive managerial insights into the profitability of different OOHD products and prices for the HD product.

The solution procedure is implemented in Python. We used Gurobi 9.1.2 to solve the TSPs and BPPs. The experiments were performed on an Intel(R) Xeon(R) Gold 5218 CPU @ 2.30 GHz with a time limit of 500 s per instance. For further parameter settings pertaining to the solution procedure, we refer to Appendix B. To keep the focus on data-driven decision support, we omit a discussion of the performance of the proposed solution procedure here. Appendix C compares the performance to the application of Gurobi 9.1.2 leaning on small artificial instances. Appendix D demonstrates how the selected problem-oriented and application-oriented algorithmic features work together to enhance the results on the real-world data set.

6.1 Data, scenarios, and instances

We conduct a computational study on location and demand data from GLS. The data of a typical urban and a typical rural delivery region are provided. In the urban region, there are \(18\) pickup stations and a set of \(892\) customers. In the rural region, there are \(8\) pickup stations and \(1101\) customers. Distances \({d}_{ij}\) [km] and transportation times \({t}_{ij}\) between the locations are calculated based on their positions, local average speeds, and location service times. Following GLS’s experience, the service time is \(100\) seconds for home locations and \(200\) seconds for pickup stations. These times are independent of the location load, as they are composed of parking and handing over parcels. Therefore, they are not treated as variables, i.e., \({s}_{i}={v}_{i}\). The transportation cost \({c}_{ij}\) equals \(0.37\frac{\EUR}{km}\cdot {d}_{ij}+10\frac{\EUR}{h}\cdot ({t}_{ij}+{s}_{i})\), where the first component reflects fuel costs and the second reflects time-dependent driver salary.

In the current pickup station network, the maximum distance of a customer to the closest pickup station corresponds to a travel time of \(24 (36)\) minutes in the urban (rural) region. The range of the pickup stations, accordingly, is set to \(24 (36)\) minutes; i.e., customers consider only those pickup stations that are reachable within this maximum time. Note that the actual pickup time is usually much shorter, e.g., when the customers visit the pickup station on their way to work. In the regions considered, the capacity of delivery locations is not a limiting factor, i.e., \({\kappa }_{i}>Q \forall i\in \mathcal{N}{\setminus} \left\{0\right\}\). Each vehicle of the service provider’s fleet has a capacity of \(Q=1150\ \text{kg}\). The driver time limit \({T}_{\max}\) is \(7.5\) hours. The reload time \(r\) at the depot is \(20\) minutes. The location-specific costs \({g}_{ki}\) represent the pickup station owner compensation of \(0.50\, \EUR\) per customer. The fixed cost per vehicle used is \(100\, \EUR\).

There are two customer types: B2C and B2B customers. In the urban (rural) region, approximately 43% (56%) of customers are B2C customers. All B2B customers may only be served at home. Every B2C customer selects one of two products: a premium or a basic OOHD product. The premium product is guaranteed home delivery (HD), and the basic product enables OOHD. We examine four different products that incorporate the possibility of OOHD (cf. Table 1). Two of the products (PU1 and PUX) ensure that customers are visited at a pickup station. The other two products (FLEX1 and FLEXX) are more flexible and allow for HD and OOHD. For PUX and FLEXX, multiple pickup stations may be feasible, while for PU1 and FLEX1, the pickup station is exactly specified.

We investigate scenarios that differ in the OOHD product on offer and in the percentage of B2C customers that select the OOHD product. Note that we refer to this product as the OOHD product and to customers who select it as OOHD customers, even though some may be visited at home in the case of FLEX1 and FLEXX. The uncertainty about customer choice behavior is reflected in the percentage of B2C customers that select the OOHD product, which is varied from 0 to 100% in 10% increments. In total, there are \(4\cdot 11=44\) scenarios.

For each region, we generate thirty random sets of customers to reflect stochastic fluctuations. Any customer is present with a probability of 95% in each customer set. On average, an urban instance has 845 customers (370 B2C customers), and a rural instance has 1045 customers (587 B2C customers). The OOHD customers are chosen with probabilities inversely proportional to the distance to the closest pickup station. Any OOHD customer included in percentage \(x\) is also selected for percentage \(y\) if \(x<y\) (cf. Appendix E for results for instances with fewer customers).

In summary, each instance \(\alpha\) is distinguished by four characteristics: region (\({R}_{\alpha }\in \left\{\text{urban, rural}\right\}\)), customer set (\({S}_{\alpha }\in \left\{\mathrm{1,2},\ldots , 30\right\}\)), OOHD product (\({O}_{\alpha }\in \left\{\text{PU1, PUX, FLEX1, FLEXX}\right\}\)), and OOHD percentage (\({P}_{\alpha }\in \left\{{0\%, 10\%, 20\%,\ldots, 100\%}\right\}\)). We examine a total of \(2 \left(R\right)\cdot 30 \left(S\right)\cdot 4 \left(O\right)\cdot 11 \left(P\right)=2640\) MT-VRPDO instances.

6.2 Cost and environmental impact of out-of-home delivery products

In this section, we quantify the impacts of different OOHD products on the service provider’s overall costs and eco-friendliness. In particular, we elaborate on the following main insights:

  • OOHD products display substantial cost-saving potential across all product combinations and delivery regions, as the decrease in routing cost outweighs the compensations paid to pickup station owners.

  • The savings in rural delivery regions are higher than in urban ones.

  • OOHD products enable significant reductions of the service provider’s carbon footprint as measured by the number of vehicles used and the number of kilometers driven.

  • Multitrip routing is crucial for reaping the benefits of OOHD in urban delivery regions.

  • OOHD products that also allow for HD are especially conducive to efficient vehicle routing, as a considerable share of flexible OOHD customers are visited at home.

6.2.1 Total costs and cost structure

Table 4 (urban region) and Table 5 (rural region) summarize the total costs resulting from different OOHD products. These include transportation cost, fixed vehicle cost, and compensation paid to pickup station owners. Each row represents the percentage of B2C customers that select the OOHD product. Each block of two columns represents the examined product combination (HD & PU1, HD & PUX, …), with one column showing the total cost and one column showing the cost reduction compared to serving all customers at home (row 0%). All values are averages over thirty instances.

Table 4 Total cost in urban region (in €)
Table 5 Total cost in rural region (in €)

Across all product combinations and regions, we observe significant cost savings when the number of OOHD customers increases: the cost savings average between 3.6% (urban region, PU1) and 4.4% (rural region, FLEXX) for every 100 B2C customers selecting the OOHD product instead of the HD product (Calculation example for urban region, PU1: \(\frac{Total\, cost\, 0\% - Total\, cost\, 100\%}{Total\, cost\, 0\%}\cdot \frac{100}{number\, of\, private\, customers}=\frac{1762-1530}{1762}\cdot \frac{100}{370}=3.6\%\)). Generally, the savings are larger for rural instances than urban instances, as the geographical demand distribution is much denser in the urban region without OOHD. Therefore, the consolidation provided by OOHD is more beneficial in the rural region.

As mentioned in Sect. 5.3 and detailed in Appendix D, some instances are quite similar and differ in only the sets \({\mathcal{I}}_{k}\). Tables 4 and 5 show some of the effects of this property. First, the costs for product FLEXX are strictly smaller than the costs for any other product when the OOHD percentage is the same. Second, under the same conditions, the costs for FLEX1 and PUX are strictly smaller than the costs for PU1. Third, the costs in columns FLEX1 and FLEXX are nonincreasing when the percentage of OOHD customers increases. This third property is not present in the PU1 and PUX columns because the home location does not remain feasible for the customers switching to the OOHD product. Thus, the solution to an instance with a lower OOHD percentage is generally not feasible for an instance with a higher OOHD percentage for these product combinations.

When comparing costs between the different product types, it becomes evident that the flexibility to choose between OOHD and HD (PU1 vs. FLEX1; PUX vs. FLEXX) has more value for the service provider than does the flexibility to choose among a larger number of pickup stations (PU1 vs. PUXX; FLEX1 vs. FLEXX; cf. Tables 4 and 5). The product FLEX1 combines a relatively low level of customer flexibility (discomfort) with a marked increase in location flexibility for the service provider. Consequently, the service provider can let the customers choose the pickup station, i.e., offer FLEX1 instead of FLEXX, without forfeiting much cost-saving potential.

Figure 4 shows how the total cost is composed of fixed vehicle cost, transportation cost, and compensation paid to pickup station owners. The transportation cost is further split into time-dependent (driver salary) and distance-dependent (fuel) costs. Displayed are the averages with OOHD product FLEX1 (cf. Tables 4 and 5). The benefits of OOHD on fixed vehicle cost and transportation cost distinctly outweigh the increase in compensation paid.

Fig. 4
figure 4

Cost structure (FLEX1)

6.2.2 Environmental impact and multitrip routing

Figure 5 depicts the development of vehicle kilometers driven and the number of vehicles used. As regulators and customers are becoming more sensitive to environmental issues, these measures directly linked to CO2 emissions are gaining importance. Heavy usage of pickup stations can significantly reduce the carbon footprint of the service provider. The effect is more pronounced in the rural region, since the detour incurred by a single customer is typically much larger. Whether a higher number of OOHDs has positive environmental impacts overall strongly depends on the customers’ pickup behavior. For instance, it makes a considerable difference if the customers use dedicated motorized trips to collect the parcels or pick them up on their way to/from work.

Fig. 5
figure 5

Environmental impact (FLEX1)

Multitrip routing enables the provider to exchange time capacity for vehicle capacity. While the driver’s time limit is usually the most restricting factor in the rural region, vehicle capacities are the scarce resource in the densely populated urban region. Thus, the multiple use of vehicles is especially worthwhile in cities and suburbs (close to the depot). Figure 6 shows the differences in the usage of capacity between urban and rural instances when FLEX1 is offered. The median usage is reported to avoid outliers caused by multiple trips per vehicle.

Fig. 6
figure 6

Median usage of driver time limit and vehicle capacity (FLEX1)

When no OOHD occurs in the rural region, there is a considerable discrepancy between vehicle capacity usage and driving time usage. With more OOHD, this gap diminishes. Because driving time is not as scarce a resource in the urban region, reloading is crucial to use both vehicle capacity and the drivers’ working hours efficiently. When reloading is not possible, the driving time limit is not exhausted (Fig. 6). Therefore, more vehicles and drivers are needed, leading to a cost increase (Fig. 7).

Fig. 7
figure 7

Impact of reloading in the urban region

6.2.3 Customer flexibility

Figure 8 shows the percentage of customers for whom delivery occurs at a pickup station after selecting a flexible OOHD product. The percentage is at times as small as 50%, i.e., only 50% of the orders that could be delivered to a pickup station are dropped off there. The number is especially low in the densely populated urban region and when the overall OOHD percentage is small. In such cases, the detour induced by visiting a customer at home is more likely to be short and thus less costly than paying a compensation to the pickup station owner.

Fig. 8
figure 8

Percentage of flexible customers served at pickup stations

However, this does not imply that the delivery service provider should stop expanding their pickup station network. For OOHD to work, it is crucial that customers are satisfied with the pickup experience, which depends mostly on the distance to the pickup station. In urban (rural) instances, where 100% of B2C customers choose the OOHD product, the average pickup distance from the customer’s home location is as high as 13 (26) minutes in the current pickup station infrastructure. Since a reasonable pickup distance is indispensable in making OOHD work, this highlights the need for a further increase in the number of pickup stations.

6.3 Controlling customer purchase behavior

In this section, we examine the effect of charging different prices for the HD product with the purpose of controlling customer choice behavior by examining different scenarios as described in Sect. 2.2.

For simplicity, we focus on HD and just one of the OOHD products, namely FLEX1 (delivery to home location or closest pickup station) in this discussion. The results for the other product combinations can be found in Appendix F. Based on an average willingness-to-pay for HD of 2.77 € (PricewaterhouseCoopers 2018), we vary the surcharge for HD from 0.00 € to 5.00 € in 0.50 € increments. The percentage of B2C customers selecting OOHD (for each level of surcharge) is varied from 0 to 100%. This leads to 121 different scenarios.

Tables 6 and 7 show the difference between the additional revenues generated by selling HD to some of the B2C customers and the total cost. As the amount of the surcharge does not influence the solution of the routing problem, the entries of Tables 6 and 7 can be computed without solving further MT-VRPDO instances.

Table 6 Additional revenues—total cost (urban region, FLEX1)
Table 7 Additional revenues—total cost (rural region, FLEX1)

The 0.00 € columns correspond to the FLEX1 columns in Tables 4 and 5. The values in all other columns are calculated by adding the additional revenue, i.e., the number of customers choosing HD multiplied with the surcharge. For example, the entry \(-1429\) (Table 6, 60%, 1 €) results from the calculation \(-1577\, \EUR+148\cdot 1\, \EUR\), where \(1577\, \EUR\) is the total cost (cf. Table 4, FLEX1, 60%), \(148\) is the number of HD customers (40% of all B2C customers), and \(1\, \EUR\) is the surcharge per HD customer. Note that the values in the tables do not represent the overall profit, as they do not include the basic revenues generated by the standard delivery fee.

The tables display the trade-off between the generation of additional HD revenue and the utilization of OOHD cost-saving potential. When looking at a single column from top to bottom, two effects take hold. First, as the percentage of FLEX1 customers increases, the total cost shrinks. Second, the number of HD customers, i.e., the number of B2C customers paying a surcharge, also decreases. For small surcharges, the cost-saving component outweighs the “lost” revenue. In contrast, for high surcharges, the extra revenue from HD customers outweighs the extra (transportation) cost for serving these customers at home. In column 1.50 € of Table 7 (rural region), the values are all roughly equal, signifying that the total profit is largely independent of the number of customers selecting each product. For the urban instance, this surcharge is 0.80 €.

The results of Tables 6 and 7 represent a breadth of scenarios and thereby reflect the uncertainty regarding customer choice behavior. In a next step, the service provider can gain insights into profitable pricing schemes by evaluating possible price-response functions for the HD product. Basically, such a function specifies the number of B2C customers choosing the HD product depending on the surcharge. Since in our application no demand data is available, the function could, e.g., be estimated by making reasonable assumptions and by discussions with managers using judgmental forecasting.

The fields in bold print in Table 6 highlight an exemplary price-response function, i.e., a presumed mapping of each surcharge to a percentage of B2C customers selecting OOHD. It can be read as follows: if the surcharge is 0.00 €, all customers select HD; if the surcharge is 0.50 €, 10% of customers select FLEX1 and 90% select HD. This continues until all customers select FLEX1 when the surcharge is 5.00 €. Clearly, the relationship between the FLEX1 percentage and surcharge need not be linear (cf. Table 7 for a different price-response function in the rural region).

Figure 9 shows the difference between the additional revenues and the total cost as a function of price (values correspond to the bold print in Tables 6 and 7). For the price-response function assumed, the data suggest an optimal surcharge in the range of 3.00 € in the urban region and 3.50 € in the rural one. The example also shows that setting different pricing in different regions may be beneficial to the service provider. The regional disparity arises for two reasons. First, the price-response function assumed here implies a higher willingness-to-pay (for HD) for customers in the rural region, as reflected by the lower percentage of customers choosing OOHD at 3.50 €. Second, the detour for visiting a customer at home is usually longer in rural regions, causing HD to be profitable only at higher surcharges.

Fig. 9
figure 9

Impact of surcharge

7 Summary and outlook

In this paper, we introduced a framework for the data-driven evaluation of business models in last-mile logistics based on a real-world case study. We presented a realistic application and a new VRP variant (MT-VRPDO) in which the delivery service provider actively chooses the delivery location. Furthermore, we demonstrated how the introduction of OOHD products can pave the way to a more profitable, more efficient, and more eco-friendly future of last-mile delivery services.

The scenario-based approach of the framework and an enhanced ALNS procedure enable the evaluation of network structures and support tactical and strategic decision making under uncertainty. We focused on a delivery network with pickup stations. However, the same framework and a slightly adjusted optimization model can be used to assess a multitude of delivery network infrastructures, delivery modes, and product assortments. These include but are not limited to shipping through subcontractors or crowdshippers, the usage of multiple echelons, the deferral of packages to other delivery days, and the utilization of other alternative delivery locations, e.g., workplaces or cars (cf. Chu 2005; Archetti et al. 2016; Zhou et al. 2018; Laganà et al. 2021; Reyes et al. 2017).

In addition to the variety of application areas, many other interesting research questions related to the MT-VRPDO remain to be investigated. First, it might be sensible to offer more than two products, necessitating a closer look at the underlying problem of assortment optimization (cf. Mackert 2019, for an application to attended home delivery). Moreover, it might be beneficial to take a closer look at the compensation paid to pickup station owners and efficient configurations of pickup station networks, which are still in their infancy in many countries (Last Mile Experts 2021). Furthermore, this paper may underestimate the value of OOHD by disregarding failed deliveries that almost exclusively occur with HD. It is also worthwhile to study a dynamic version of the MT-VRPDO and use demand management techniques (Agatz et al. 2013; Klein et al. 2019) to steer customers into selecting profitable products in real time. Last, the multitude of possible application areas warrants a further algorithmic examination, which was out of the scope of this paper.