# The probabilistic traveling salesman problem with time windows

- First Online:

- Received:
- Accepted:

DOI: 10.1007/s13676-013-0018-0

- Cite this article as:
- Voccia, S.A., Campbell, A.M. & Thomas, B.W. EURO J Transp Logist (2013) 2: 89. doi:10.1007/s13676-013-0018-0

## Abstract

With time-definite services occupying a large part of the delivery business, the explicit consideration of time windows into a route design has the potential to reduce transportation costs and the penalty costs associated with late deliveries. In this paper, we incorporate time windows into a priori routes by introducing the probabilistic traveling salesman problem with time windows (PTSPTW). The PTSPTW is an extension of the well-known probabilistic traveling salesman problem, where in addition to stochastic customer presence, each customer has an associated time window during which deliveries must be made. We present a recourse model and a variable neighborhood search with variable neighborhood descent algorithm to solve problem instances. We also present computational experiments that demonstrate the value of incorporating stochasticity into the problem.

### Keywords

Traveling salesman problem Probabilistic Time windows## Introduction

Time-definite delivery plays a crucial role in the shipping industry. Now more than ever, businesses are operating with lean production strategies and just-in-time inventories. With this trend, freight is shipped in smaller lot sizes and the predictability of arrival times is critical (Schultz 2008). Randy Guidry, communications coordinator for Averitt Express, notes “Every year, more and more of our customers are requesting appointments within a delivery window” (Terreri 2011). In 2010, time-definite, day definite and same day delivery services accounted for 51.4 % of the United States shipping industry’s total market value (Datamonitor 2010b). In the European market, these segments accounted for 62.3 % in 2010, a 10.6 % increase from 2009 (Datamonitor 2010a, 2009).

With time-definite services occupying a large part of the delivery business, the explicit consideration of time windows into a route design has the potential to reduce transportation costs and the penalty costs associated with late deliveries. Currently, many companies employ a pre-planned, or a priori, route which identifies an ordering of all possible customers that a particular driver may need to visit. The driver then skips those customers on the route who do not require a delivery on that day. A priori routes can be implemented with relative ease and are an alternative to the high cost of re-optimization. In addition, a priori routes offer both drivers and customers consistency and help to improve driver efficiency as the driver becomes familiar with the route. Although businesses place high importance on delivery time windows, the consideration of time windows in a priori routing has received little attention in the literature. Additionally, a priori policies can be important subproblems in dynamic routing problems (Goodson et al. 2012; Manni 2009). For these reasons, we examine the impact of time windows on a priori route design.

We incorporate time windows into a priori routes by introducing the probabilistic traveling salesman problem with time windows (PTSPTW). While methods have been developed to address such elements as stochastic travel or service times within a time window constrained environment, the literature does not address how to deal with stochastic customer presence within this same environment. For this reason, we maintain the presence of the customer as the only stochastic element in our model.

The PTSPTW can be considered an extension of the probabilistic traveling salesman problem (PTSP). The PTSP is the problem of finding a minimum expected cost a priori tour through a set of customers \(N = \{i \mid 1, \ldots, n\}\) with probabilities \(P = \{p_i \mid 1, \ldots, n\}\) of requiring service on any given day. The travel time between any two customers *i* and *j* is given by *d*_{ij}, where *d*_{ij} = *d*_{ji}. These travel times also serve as the cost to traverse each arc. In the PTSPTW, also associated with each customer \(i \in N\) is a time window with an earliest time of service *e*_{i} and latest time of service *l*_{i}, where service must begin at or after *e*_{i} but should not begin later than *l*_{i}. If arrival at customer *i* occurs prior to *e*_{i}, then service is postponed until *e*_{i}.

Time windows present particularly challenging modeling issues when considered in conjunction with stochastic customer presence. This paper’s primary contribution is the presentation of a recourse model that incorporates time windows into a routing problem with stochastic customer presence. As in previous work on the probabilistic traveling salesman problem with deadlines (PTSPD), our recourse model accounts for late deadline violations by penalizing any violations in the objective function. Unlike the PTSPD however, our model for the PTSPTW also accounts for early arrivals. This is an extension that requires nontrivial calculations to find the probability of early arrivals at each customer. A secondary contribution of this paper is the results from computational experiments that identify the circumstances under which PTSPTW solutions differ from the solutions for the TSPTW. The TSPTW is the deterministic analog of the PTSPTW. Ultimately, this knowledge can help practitioners to determine when the added computational burden of incorporating stochasticity into the model is justified.

This paper is organized as follows: In "Literature review", we survey the related literature. In "Recourse Model" and "Solution Approach", we present a recourse model and an algorithm to solve problem instances. We describe our experimental design in "Experimental Design" and present results of computational experiments in "Results". "Conclusions" summarizes our work and offers directions for future research.

## Literature review

The PTSPTW is related to both the PTSP and stochastic vehicle routing problems with time constraints. Jaillet (1988) introduces the PTSP. Campbell and Thomas (2008) introduce and propose models for the PTSPD, and Campbell and Thomas (2009) offer tractable ways to solve the PTSPD. These papers on the PTSPD summarize related literature prior to 2008.

A number of recent papers have been published on the PTSP and related problems. A subset of these papers focus on solving the PTSP through local search methods (Marinakis and Marinaki 2010; Balaprakash et al. 2009; Marinakis and Marinaki 2009; Liu 2008; Marinakis et al. 2008). Birattari et al. (2008) and Weyland et al. (2009) use local search methods but also employ sampling techniques for the evaluation of the objective function. Mohan et al. (2008) and Chen et al. (2009) introduce arc-routing variants of the PTSP.

Weyland et al. (2011) introduce a new method for reducing the computational complexity of the objective function of the PTSPD. An approximation for the objective function based on Monte Carlo Sampling is combined with a quasi-parallel evaluation of the samples. The authors show that heuristics which use this new method outperform previous approaches for the PTSPD both in terms of computational time and solution value. Weyland et al. (2012) show that the evaluation of the objective function of the PTSPD is #P-hard. The complexity class #P captures the set of problems where one is not only interested in whether a solution exists, but also the number of solutions that exist (Weyland et al. 2012).

While our problem relates to the PTSP in that it deals with stochastic customer presence, problems in the literature can also be characterized by other stochastic elements such as stochastic demands or travel times. The common feature among these different stochastic elements is that they all lead to stochastic arrival times. The remainder of this section focuses on a variety of stochastic problems with time constraints.

Jaillet et al. (2012) seek to minimize a lateness index for single-vehicle problems with stochastic travel times and deadlines. However, this objective is not appropriate when a payment is made based on late delivery, as is the situation in our problem.

Chang et al. (2009) developed a heuristic for a version of the time-dependent, stochastic traveling salesman problem with time windows (STSPTW) where travel and service times are stochastic. The authors use an *n*-path relaxation of a deterministic traveling salesman problem (TSP) and a convolution-propagation approach to approximate arrival times at each customer.

A related problem to the TSP is the orienteering problem (OP). In the orienteering problem, a profit is associated with each customer. The objective is to construct a tour that maximizes the total profit while observing a specified time limit. Campbell et al. (2011) and Tang and Miller-Hooks (2005) introduce stochastic versions of the OP where travel and service times are random. Campbell et al. formulate a recourse model and present a variable neighborhood search to solve instances of the orienteering problem. Tang and Miller-Hooks formulate their problem as a chance-constrained stochastic program and develop an exact method that solves small to medium-size problems as well as a heuristic method for larger instances.

Another related routing problem is the stochastic vehicle routing problem (SVRP) where customers have demands and vehicles have capacity constraints. Several recent papers highlight the SVRP with time windows (SVRPTW). Lei et al. (2011) model a version of the SVRPTW with stochastic demands as a stochastic program with recourse and propose an adaptive large neighborhood search heuristic. Li et al. (2010) introduce a problem with stochastic travel and service times and present two different model formulations. The first model is a chance constrained programming model where the objective is to minimize transportation cost when some constraints hold under a certain confidence level. The second model is a stochastic programming model with recourse where the objective is to minimize the expected transportation cost of a priori solutions. A tabu search-based method is used to solve both models.

Erera et al. (2009) present a stochastic dynamic vehicle routing problem with time windows (SDVRPTW). The SDVRPTW is a problem in which customers are dynamically inserted into routes. The problem is similar to ours in that customer presence is stochastic and each customer has an associated time window. However, Erera et al. maintain time window feasibility and use a paired-vehicle recourse action while we penalize late time window violations and use a single-vehicle recourse action.

## Recourse model

In this section, we present a recourse model for the PTSPTW. A recourse model is characterized by two stages. In stage one, an a priori solution is determined. Then the random variables are realized and a recourse action is applied to the original solution. In this case, the a priori solution is a route that identifies the order in which customers will be visited. After information becomes available about which customers need to be visited, the recourse action is applied. This recourse action consists of adjusting the route so that customers who do not need to be visited are skipped, but the remaining customers are still visited in the a priori route order.

In our model, service at each customer cannot begin before *e*_{i}. An early arrival requires the driver to wait until *e*_{i}. However, the vehicle is permitted to visit a customer after *l*_{i} has passed, but a penalty is incurred for doing so. We utilize a per-unit-time penalty charge for arriving after the close of the time window. The penalty is represented by λ. The per-unit-time charge represents cases where the delivery company is charged per unit time of lateness. For instance, FedEx Custom Critical refunds varying percentages of the cost of a shipment based on how late the shipment is delivered (FedEx 2005). For additional examples, see Charnisirisakskul et al.(2004) and Slotnick and Sobel (2005). Thus, the objective of the PTSPTW, formulated as a recourse model, is to identify an a priori route where the sum of the expected travel costs between customers and the expected penalty term is minimum.

In the case of our recourse model, we let τ define an order, or tour, in which the customers \(i=1,\ldots, n\) are to be visited. We assume that the customers are indexed according to their position in τ. We assume a tour always begins and ends at a fixed depot, and the depot is indexed as *i* = 0. We assume integer travel times and deadlines and that all tours start at time *t* = 0. Our recourse model provides a formula for evaluating the cost of a given tour τ.

*i*,

*j*) depends on customers

*i*and

*j*being realized and no customers

*k*, \(k = i+1, \ldots, j-1,\) being realized.

*t*. To begin, let the random variable

*A*

_{i}be a random variable representing the time of arrival at the customer in position

*i*in τ. We assume that arrival at a customer

*i*cannot be early or late if customer

*i*is not realized. Hence, we are left to compute \(P(A_i = t \mid X_i =1)\), where

*t*represents the time of arrival at customer

*i*. For notational convenience, let \(g(i, t) = P(A_i = t \mid X_i = 1)\). We also let \(G(i, t) = P(A_{i} \leq t \mid X_{i} = 1)\), which can be computed as

*G*(

*i*,

*t*) = ∑

_{k=0}

^{t}

*g*(

*i*,

*k*) =

*G*(

*i*,

*t*− 1) +

*g*(

*i*,

*t*). We assume that arrival at the depot occurs at time

*t*= 0. Therefore,

*G*(0,

*t*) = 1 for all values of

*t*. Because there is no time window or penalty associated with the depot, it is not necessary to consider the probability that the arrival at the depot on the return trip occurs at any particular time.

To account for the fact that arrival at customer *i* depends on the departure times at previous customers, we let *h*(*i*, *t*) be the probability of departing from customer *i* at time *t*. We can then compute the *g* and *h* values in an iterative, recursive fashion. Note that arrival and departure times for a customer differ when waiting for the opening of the time window occurs. Before describing the computation procedure in Algorithm 1, we first introduce bounds on the earliest and latest times that a customer can be visited.

*i*can be limited based on the location of customer

*i*in the tour. The earliest arrival time at customer

*i*, expressed as

*T*

_{i}

^{min}, occurs when all customers prior to

*i*are not realized. Thus,

*T*

_{i}

^{min}=

*d*

_{0,i}for all

*i*. The latest possible arrival time at customer

*i*is computed by assuming that every customer prior to

*i*is realized. If arrival occurs prior to the start of the time window for any customer prior to

*i*, then this additional waiting time must be accounted for in the computation. Given

*T*

_{1}

^{max}=

*d*

_{0,1}, we can compute the other values recursively:

*t*outside of the range of

*T*

_{i}

^{min}to

*T*

_{i}

^{max}result in

*g*and

*h*values of zero unless

*T*

_{i}

^{max}<

*e*

_{i}, in which case

*h*(

*i*,

*e*

_{i}) has a value of one.Open image in new window

We now clarify the initialization procedure for Algorithm 1. The values *g*(*i*, *t*), *h*(*i*, *t*), and *G*(*i*, *t*) are set to zero for all combinations of *i* and *t*, where \(i=0,1, \dots, n\) and \(t=0, 1, \dots, T_n^{max}\), unless explicitly noted. The values *g*(0,0) and *h*(0,0) are set to one to represent the probabilities of both arriving and departing from the depot at time zero. Also, *h*(*i*, *e*_{i}) is set to one if *T*_{i}^{max} < *e*_{i}. Finally, *G*(0,*t*) is set to one for all values of *t*.

Using the previously described bounds and initialization, Algorithm 1 recursively computes *h* and *g* values. Recall that *g*(*i*, *t*) represents the probability of arriving at customer *i* at a particular time *t*, given that customer *i* is realized. Therefore, the algorithm considers the probability that previous customers are realized, as well as the probability of departing at a particular time from these previous customers. Thus \(g(i,t)=\sum\nolimits_{v=0}^{i-1} p_v h(v,t-d_{v,i}) \prod\nolimits_{k=v+1}^{i-1} (1-p_k)\), and is computed recursively in Algorithm 1. In words, *p*_{v}*h*(*v*, *t* − *d*_{v,i}) represents the probability that customer *v* is realized times the probability that departure from customer *v* occurs at the current time minus the travel time from customer *v* to customer *i*. This departure time implies that customers *v* and *i* are realized but no customers \(k, k = v+1, \ldots, i-1,\) are realized. The probability that these customers are not realized is represented by the term \(\prod\nolimits_{k=v+1}^{i-1} (1-p_k)\) in the formula. Note also that when *t* < *d*_{v,i}, *t* − *d*_{v,i} is negative, and thus represents an impossible situation when considering the original assumptions of the problem. Thus *h*(*v*, *t* − *d*_{v,i}) is set to zero when *t* < *d*_{v,i}.

*G*(

*i*,

*t*) is the cumulative function for

*g*(

*i*,

*t*),

*G*(

*i*,

*t*) can be computed recursively using the formula

*G*(

*i*,

*t*− 1) +

*g*(

*i*,

*t*). We are left to compute

*h*(

*i*,

*t*), which depends on three situations. Either

- 1.
time

*t*is before the start of the time window, - 2.
time

*t*is equivalent to the start of the time window, or - 3.
time

*t*is after the start of the time window.

*i*at time

*t*is zero. That is,

*h*(

*i*,

*t*) = 0. If the second situation occurs, then the probability of departing from customer

*i*at time

*t*is equivalent to the probability of arriving at customer

*i*at or before time

*t*. Therefore,

*h*(

*i*,

*t*) =

*G*(

*i*,

*t*). If the third situation occurs,

*h*(

*i*,

*t*) =

*g*(

*i*,

*t*). This completes the description of Algorithm 1.

*O*(

*n*

*T*

_{n}

^{max}) time, while the body of the algorithm is computed in

*O*(

*n*

^{2}max

_{i}{

*T*

_{i}

^{max}−

*T*

_{i}

^{min}}) time. Without restrictions on distances, this is not necessarily polynomial in

*n*. Once the

*g*and

*h*values are known, the per-unit-time penalty portion of the objective can be computed in

*O*(

*n*max

_{i}{

*T*

_{i}

^{max}−

*l*

_{i}}) time, and the distance portion of the objective function can be computed in

*O*(

*n*

^{2}) time. The entire objective calculation is dominated by the

*g*and

*h*computations, and thus takes

*O*(

*n*

^{2}max

_{i}{

*T*

_{i}

^{max}−

*T*

_{i}

^{min}}) time. In addition, Weyland et al.’s (2012) characterization of the PTSPD objective function as #P-hard can be trivially extended to the PTSPTW.

## Solution approach

Our solution approach is motivated by the recent work of da Silva and Urrutia (2010) on the TSPTW, the deterministic version of the problem discussed in this paper. Da Silva and Urrutia combine a Variable Neighborhood Search (VNS) with a Variable Neighborhood Decent (VND) as a local search. The algorithm yields high quality solutions for the data sets from which those used in this paper are derived and achieves best-known results on other benchmark data sets. Thus, we chose to implement a VNS/VND heuristic as well.

In broad terms, the VNS we use can be described as a systematic change of neighborhoods during a two-phase procedure. The first phase is a perturbation of the current best solution, while the second phase is a descent to find a new local minimum. The purpose of the perturbation is to move the solution out of a local minimum. Note that we do not need the initial construction phase implemented by da Silva and Urrutia (2010) because we penalize time window violations, thus making all solutions feasible.

We now describe the algorithm by beginning with a description of the VNS. As input, the VNS takes an initial seed tour *x*, a value *k*_{max} that controls the maximum level of perturbation in the Shake function, and an objective function \(f(\cdot)\) that maps a tour to the set of positive real numbers. Specifically, \(f(\cdot)\) is the objective function presented in Eq. 4. After initializing *k* to 1 and *improvement* to true, the algorithm perturbs the current solution *x* by calling the Shake function. The perturbed solution, \(x^\prime\), is then passed to the VND to find a local minimum *x*^{*}. If the objective value of *x*^{*} is less than the objective value of the current solution *x*, then *x* is set to *x*^{*}, *k* is reset to 1, and the procedure repeats. If, on the other hand, the objective value of *x*^{*} is not less than the objective value of *x* and *k* does not equal *k*_{max}, then *k* is incremented by 1 and the procedure repeats. When *k* reaches *k*_{max}, then no improving solution has been found within the last *k*_{max} iterations. The parameter *improvement* is set to false, the current solution *x* is set to *best*, and the algorithm terminates returning *best*, the best tour found. The pseudo code for the VNS algorithm is presented in Algorithm 2.Open image in new window

We now describe the Shake function that is used to perturb the current best solution. As input, the Shake function requires a tour *x* and a neighborhood specification *k*. We denote a set of neighborhoods for the Shake function with \(N_k^{Shake}=\{1, \dots, k_{max}\}\) and with *N*_{k}^{Shake}(*x*) the set of solutions in the *k*^{th} neighborhood. Specifically, *N*_{k}^{Shake}(*x*) contains all tours that differ from *x* by a combination of *k* 1-shift moves. A 1-shift move removes a single customer from a tour and reinserts the customer in a new location on the tour. We assume the solutions from *N*_{k}^{Shake}(*x*) are ordered such that \(\{x^1,\dots,x^{|N_k^{Shake}(x)|}\}\). The perturbation selects a random tour \(x^w \in N_k^{Shake}(x)\), sets \(x^\prime\) to *x*^{w}, and outputs the perturbed solution \(x^\prime\). The pseudo code for the algorithm is presented in Algorithm 3.Open image in new window

In the algorithm presented by da Silva and Urrutia, *k*_{max} is set to 30. However, such large neighborhoods lead to many iterations of the VNS and VND. Because the computational complexity of the PTSPTW objective calculation is greater than that of the TSPTW objective calculation, a large number of iterations causes the running time to be very large. Additionally, during preliminary experiments, we found that the algorithm often converged to the same solution for values of *k*_{max} set to 30 and set to 5. For these reasons, we set *k*_{max} to 5.

Hansen et al. (2010) propose an implementation of VNS/VND for combinatorial optimization problems which differs from the implementation proposed by da Silva and Urrutia in the procedure for changing neighborhoods within the VND. In preliminary experiments, we found that both algorithms generally converged to the same solutions, but the algorithm proposed by Hansen et al. converged faster. For this reason, we base our VND implementation on the algorithm proposed by Hansen et al.

The VND is used to optimize the recently perturbed solution. The input for the VND includes a tour *x*, a value *j*_{max}, and the previously described objective function \(f(\cdot)\). We denote a set of neighborhoods for the VND with \(N_j^{VND}=\{1, \dots, j_{max}\}\) and with *N*_{j}^{VND}(*x*) the set of solutions in the *j*^{th} neighborhood. The value *j*_{max} represents the last neighborhood that the VND iterates through. We set *j* to 1 to specify the the previously discussed 1-shift neighborhood and we set j to 2 to specify the 2-Opt neighborhood. A 2-Opt move deletes two edges from a tour so that the tour is broken into two paths, then reconnects the paths in the only other possible way. Because we specify two neighborhoods for the VND, we set *j*_{max} to 2. Our choices of the 1-shift and 2-Opt neighborhoods are also motivated by the work of da Silva and Urrutia and the descriptions of the neighborhoods can be found therein.

After initializing *j* to 1 and *improvement* to true, a locally optimal solution \(x^\prime\) is found by iterating through the entire neighborhood *N*_{j}^{VND}. If the objective value of \(x^\prime\) is less than the objective value of the current solution *x*, then *x* is set to *x*^{*}, *j* is reset to 1, and the procedure repeats. If, on the other hand, the objective value of \(x^\prime\) is not less than the objective value of *x* and *j* does not equal *j*_{max}, then *j* is incremented by 1 and the procedure repeats. When *j* reaches *j*_{max}, then no improving solution has been found in either neighborhood. The parameter *improvement* is set to false, the current solution *x* is set to *x*^{*}, and the algorithm terminates returning *x*^{*}, the best locally optimal tour found. The pseudo code for the VND algorithm is presented in Algorithm 4.Open image in new window

## Experimental design

Next, we investigate what instance characteristics result in different tours when customer presence is modeled stochastically instead of deterministically. To address this question, we focus on the effects of customer realization probabilities, time window widths, per-unit-time penalties, and the number of customers.

The data sets we use are a subset of the TSPTW instances first proposed by Dumas et al. (1995). We use the 20-, 40- and 60-customer instances with time-window widths of 20, 60, and 100 U. These instances are labeled “Feasible” in the results tables. We also generate new data sets from each of the existing instances. These new instances differ in the starting and ending times of the time windows, and represent the situations where feasible solutions with respect to time windows are unlikely to exist if all customers are realized. These situations are likely to occur as the demand for time-definite services grows. Our results, which are presented in the next section, indicate that large cost savings are possible when these situations are modeled stochastically compared to deterministically. We denote the new early and late deadlines as \(e_i^\prime\) and \(l_i^\prime\), respectively. In general, we set \(l_i^\prime\) equal to the early deadline. Then we set \(e_i^\prime\) equal to \(l_i^\prime\) minus the width of the corresponding time window, unless that time is negative in which case \(e_i^\prime\) is set to zero. As an example, consider an instance with a time window width of 20 where *e*_{i} = 15 and *l*_{i} = 35. Then \(l_i^\prime=15\), and since 15–20 = −5, set \(e_i^\prime=0\). Sometimes, however, \(l_i^\prime\) is equal to zero. In this case, we instead let \(l_i^\prime\) equal the late deadline and \(e_i^\prime\) equal \(l_i^\prime\) minus the width of the corresponding time window. Suppose we again have a time window width of 20. If *e*_{i} = 0 and *l*_{i} = 20, then since \(l_i^\prime\) would equal 0, we let \(l_i^\prime=20\) and \(e_i^\prime=0\). These instances will be referred to in the tables by the label “Infeasible."

Similar to the design of the PTSPD experiments in Campbell and Thomas (2008) (see Campbell and Thomas (2006) for PTSPD benchmark data sets), we consider four different probability settings for each instance. Three of these settings are homogeneous with customer presence probabilities set to 0.1, 0.5, and 0.9, respectively. These instances are referred to as 0.1, 0.5, and 0.9 in the results tables. The probabilities represent the likelihood that each customer will be realized. A probability of 0.1 may be suitable to represent a home or small business because packages are unlikely to be delivered every day. Probabilities of 0.5 may be suitable to represent small or medium businesses that receive packages a few days a week, while probabilities of 0.9 may be suitable to represent medium or large businesses that receive packages almost every day. The fourth probability setting is heterogeneous, where the probability of each customer is randomly assigned a probability of either 0.1 or 1. This case represents the situation in which both small and large businesses are served by the same vehicle. This data set will be referred to in the tables by the label “Mixed."

For each instance with 20, 40 and 60 customers, we consider two different per-unit-time penalties. As was done in the PTSPD work of Campbell and Thomas (2008), one penalty is set to 5, while in the other case the penalty is set to 50. These penalties represent small and large costs of failing to satisfy customer time windows. In total, we solve 720 different instances.

Because our goal is to determine differences between TSPTW and PTSPTW solutions, for feasible instances we seed the VNS/VND heuristic described in "Solution Approach" with the best-known TSPTW solution [see da Silva and Urrutia (2010)] and solve the corresponding PTSPTW instance with homogeneous and mixed probability settings. For the infeasible instances, we construct corresponding TSPTW solutions to use as seed solutions for the VNS/VND heuristic. To construct these TSPTW solutions, we assign the same per-unit-time penalties as the corresponding PTSPTW set, and set all probabilities to one. We run the VNS/VND heuristic 10 times using the corresponding best-known TSPTW solution with feasible time windows, converted to infeasible time windows, as a starting solution. We select the best TSPTW solution out of the 10 runs for each instance to use as the seed solutions for the VNS/VND algorithm to solve the corresponding PTSPTW infeasible instances with homogeneous and mixed probability settings.

In evaluating our computational results, it is necessary to be able to compare two solutions. One useful metric of comparison is the Hamming distance introduced by Bierwirth et al. (1996) for the Job Shop Scheduling problem. Ehmke et al. (2012) use this measure with vehicle routing problems. In terms of tours, the Hamming distance is the ratio of the precedence relationships of customers being preserved between two tours. In order to calculate the Hamming distance, a tour must be converted to a bit-string representation. A tour with *n* customers has a bit-string representation of \({{n^2-n}\over {2}}\) bits. If customer *i* is located before customer *j* on a tour, then the bit is set to one. On the other hand, if customer *j* is located prior to *i*, then the bit is set to zero. The normalized Hamming distance for two tours represented as bit-strings *x* and *y* is calculated by \(d_{x,y}={{1}\over {l}}\sum_{k=1}^lxor(x_k,y_k)\), where *l* is the length of the bit-strings. The “exclusive-or" operator is represented by \(xor(\cdot )\) and results in one if the bits *x*_{k} and *y*_{k} differ, and zero otherwise. In this paper, a Hamming distance of zero indicates two identical tours, while a Hamming distance of one indicates that no customers share the same precedence relationship between two tours. A Hamming distance of one occurs only in the case where one tour is the exact inverse of the other tour.

Summary of feasible experiments

Probability | AVG. \(\Updelta\) (%) | AVG. Ham. | AVG. Time | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|

0.1 | 0.5 | 0.9 | Mixed | 0.1 | 0.5 | 0.9 | Mixed | 0.1 | 0.5 | 0.9 | Mixed | ||

| Penalty = 50 | ||||||||||||

| 0.03 | 0.04 | NC | 0.02 | 0.01 | 0.00 | NC | 0.00 | 6.09 | 5.36 | 5.45 | 5.07 | |

| 0.50 | 2.00 | 0.46 | 0.47 | 0.05 | 0.03 | 0.01 | 0.02 | 6.08 | 5.99 | 5.79 | 4.79 | |

| 0.46 | 0.20 | 0.06 | 0.98 | 0.10 | 0.01 | 0.01 | 0.05 | 6.31 | 4.66 | 4.81 | 4.88 | |

| Penalty = 50 | ||||||||||||

| 0.03 | NC | NC | 0.01 | 0.01 | NC | NC | 0.00 | 5.84 | 5.12 | 5.45 | 4.89 | |

| 0.40 | 1.01 | 0.13 | 0.42 | 0.04 | 0.02 | 0.01 | 0.01 | 6.28 | 6.15 | 5.40 | 4.80 | |

| 0.31 | 0.11 | 0.02 | 0.97 | 0.09 | 0.01 | 0.01 | 0.04 | 6.68 | 5.00 | 4.89 | 4.94 | |

| Penalty = 5 | ||||||||||||

| 0.12 | 0.30 | 0.03 | 0.57 | 0.01 | 0.00 | 0.00 | 0.00 | 156.08 | 138.82 | 132.23 | 120.88 | |

| 0.58 | 0.96 | 0.01 | 2.65 | 0.03 | 0.01 | 0.00 | 0.02 | 167.98 | 124.56 | 113.12 | 120.56 | |

| 1.73 | 1.13 | 0.00 | 4.98 | 0.05 | 0.02 | 0.00 | 0.04 | 164.72 | 124.49 | 103.83 | 153.78 | |

| Penalty = 50 | ||||||||||||

| 0.11 | 0.20 | 0.04 | 0.37 | 0.01 | 0.00 | 0.00 | 0.00 | 154.54 | 131.82 | 135.48 | 127.85 | |

| 0.50 | 0.69 | 0.01 | 2.62 | 0.03 | 0.01 | 0.00 | 0.02 | 170.77 | 120.24 | 115.17 | 120.41 | |

| 1.43 | 1.11 | 0.00 | 3.87 | 0.05 | 0.02 | 0.00 | 0.03 | 169.63 | 120.33 | 104.94 | 133.84 | |

| Penalty = 5 | ||||||||||||

| 0.19 | 0.62 | 0.06 | 2.65 | 0.01 | 0.00 | 0.00 | 0.01 | 823.93 | 705.98 | 668.30 | 960.30 | |

| 0.59 | 0.96 | 0.78 | 3.65 | 0.02 | 0.01 | 0.00 | 0.01 | 932.91 | 716.23 | 677.40 | 850.16 | |

| 1.52 | 1.88 | 0.17 | 4.54 | 0.04 | 0.02 | 0.00 | 0.03 | 1,013.68 | 783.61 | 632.49 | 970.53 | |

| Penalty = 50 | ||||||||||||

| 0.16 | 0.33 | 0.00 | 1.61 | 0.01 | 0.00 | 0.00 | 0.00 | 990.25 | 827.87 | 819.30 | 936.87 | |

| 0.45 | 0.45 | 0.03 | 2.76 | 0.02 | 0.00 | 0.00 | 0.01 | 1,192.77 | 868.17 | 766.85 | 810.23 | |

| 1.31 | 1.31 | 0.17 | 4.41 | 0.04 | 0.02 | 0.00 | 0.02 | 1,303.77 | 937.40 | 741.76 | 989.51 |

Summary of infeasible experiments

Probability | AVG. \(\Updelta\) (%) | AVG. Ham. | AVG. Time | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|

0.1 | 0.5 | 0.9 | Mixed | 0.1 | 0.5 | 0.9 | Mixed | 0.1 | 0.5 | 0.9 | Mixed | ||

| Penalty = 5 | ||||||||||||

| 2.93 | 0.60 | NC | 1.63 | 0.01 | 0.00 | NC | 0.00 | 6.02 | 5.38 | 5.23 | 4.95 | |

| 3.82 | 3.40 | 0.04 | 4.98 | 0.07 | 0.06 | 0.00 | 0.04 | 6.02 | 5.24 | 4.36 | 4.95 | |

| 5.46 | 6.00 | 0.51 | 22.58 | 0.13 | 0.06 | 0.01 | 0.17 | 6.78 | 4.53 | 3.85 | 6.24 | |

| Penalty = 50 | ||||||||||||

| 13.03 | 2.20 | NC | 5.52 | 0.02 | 0.01 | NC | 0.00 | 5.99 | 5.61 | 5.06 | 5.51 | |

| 21.30 | 8.93 | 2.74 | 6.61 | 0.08 | 0.05 | 0.01 | 0.04 | 6.70 | 5.09 | 4.63 | 5.26 | |

| 28.98 | 13.89 | 1.74 | 39.60 | 0.17 | 0.08 | 0.01 | 0.17 | 6.93 | 5.06 | 3.99 | 6.13 | |

| Penalty = 5 | ||||||||||||

| 0.80 | 3.93 | 1.75 | 3.42 | 0.02 | 0.01 | 0.00 | 0.01 | 158.54 | 129.41 | 120.97 | 126.99 | |

| 4.53 | 7.16 | 0.20 | 3.16 | 0.05 | 0.02 | 0.00 | 0.02 | 165.50 | 110.67 | 94.76 | 109.27 | |

| 13.16 | 14.14 | 1.53 | 24.99 | 0.08 | 0.04 | 0.01 | 0.11 | 163.05 | 113.88 | 91.78 | 157.81 | |

| Penalty = 50 | ||||||||||||

| 5.54 | 12.77 | 3.58 | 8.54 | 0.01 | 0.01 | 0.00 | 0.00 | 155.76 | 133.02 | 123.90 | 157.76 | |

| 24.45 | 12.77 | 0.30 | 3.71 | 0.06 | 0.01 | 0.00 | 0.02 | 196.67 | 110.97 | 95.29 | 143.38 | |

| 25.51 | 20.95 | 0.44 | 3.94 | 0.09 | 0.04 | 0.01 | 0.03 | 182.84 | 119.47 | 91.93 | 167.07 | |

| Penalty=5 | ||||||||||||

| 0.66 | 1.12 | 0.05 | 4.79 | 0.01 | 0.00 | 0.00 | 0.01 | 833.44 | 705.66 | 686.59 | 1,157.12 | |

| 18.94 | 23.46 | 12.22 | 37.42 | 0.05 | 0.03 | 0.04 | 0.07 | 1,073.15 | 825.71 | 843.04 | 1,397.34 | |

| 6.67 | 6.87 | 0.72 | 22.46 | 0.06 | 0.02 | 0.00 | 0.10 | 1,098.85 | 703.37 | 561.38 | 1,362.16 | |

| Penalty = 50 | ||||||||||||

| 7.44 | 6.02 | 0.02 | 14.31 | 0.01 | 0.00 | 0.00 | 0.01 | 999.66 | 832.76 | 769.01 | 1,153.81 | |

| 40.86 | 33.69 | 18.68 | 54.01 | 0.04 | 0.02 | 0.03 | 0.07 | 1,307.59 | 930.00 | 890.20 | 1,259.74 | |

| 48.12 | 36.69 | 4.04 | 33.28 | 0.06 | 0.02 | 0.01 | 0.11 | 1,655.80 | 876.45 | 671.45 | 1,968.37 |

## Results

Out of all the probability settings, the mixed setting displays the largest differences from the TSPTW solutions for cost. This is because of the PTSPTW’s ability to prioritize customers with larger probabilities in the tour. As noted earlier, however, the mixed setting does not display the largest differences in structure. This indicates that even small changes in tour structure can produce large changes in cost.

It is intuitive that time windows drive the tour construction for the TSPTW. Our results support the idea that time windows are also a significant contributor to tour construction for the PTSPTW. In a majority of cases, instances with time window widths of 100 have greater percentage of changes in objective value and also greater Hamming distances than instances with time window widths of 20. This is because larger time windows allow greater flexibility in the construction of the tours.

Per-unit-time penalty increasing from 5 to 50 affects the solution tours for all the instances as expected. For the feasible instances, the average percentage of change in objective value decreases as the penalty increases. Because these are feasible instances, it is possible to serve many customers within their respective time windows. It is likely that incurring a large penalty cost for missing a deadline is more costly than traveling a longer distance to arrive within the respective time window. Thus, larger penalties push the solutions toward the optimal TSPTW solutions. All of the Hamming distances either decrease or do not demonstrate change within two significant digits.

For infeasible instances, the opposite trend occurs. The average percentage of change in objective value generally increases as the penalty increases from 5 to 50. By construction of these instances, it is unlikely that a tour exists such that the driver will be able to serve all customers within their respective time windows. Therefore, penalty charges are unavoidable. As the charge increases, the cost of the tour increases. Any percentage of change in objective value that occurs at a low penalty level is multiplied at the higher level.

An increase in the number of customers also causes an increase in the average percentage of change in objective value for both feasible and infeasible instances. Hamming distance, on the other hand, tends to decrease and is most evident with time window widths of 100. It is possible that because of the larger number of customers on the tour, more precedence relationships are preserved even when customer positions differ and cause changes in the objective value.

Lastly, it is worthwhile to mention two factors contributing to increased runtimes. The most noticeable factor is problem size. As the number of customers increases, so does the runtime. This is due to the fact that as the solution space grows, the number of times that the objective value is calculated increases. The second factor is the use of small or mixed probabilities. Instances with these two probability types deviate more from their TSPTW solutions than instances with larger probabilities. This implies that the increased runtime is caused by more local search moves.

## Conclusions

Large probabilities of customer presence have less impact on the solution than small probabilities.

When customers have a combination of high and low probabilities, solutions are significantly impacted by modeling customers stochastically. This is because of the stochastic model’s ability to prioritize customers with larger probabilities in the solution.

Large time window widths have a greater impact on the solution than small time window widths when customers are modeled stochastically. An explanation is that as time window widths increase, the algorithm has more flexibility for placing customers on the tour.

Solutions to the PTSPTW come with large computation times due to the complexity of the objective calculation. Factors that contribute to larger runtimes are large problem sizes and problems with low or mixed probabilities.

An additional recommendation for future work is to solve larger problem instances. The key challenge with larger instances is the computational complexity of the objective function. Therefore, the solution method will likely need to incorporate sampling methods to evaluate the objective. Investigation of these ideas will help further our understanding of the advantages and disadvantages of incorporating stochasticity into the model.