1 Introduction

Traffic congestion has a hefty toll on the society and economy. In one estimate, the cost of congestion in 2010 in the urban areas of the USA amounted to USD 101 billion [1, 2]. A recent study shows that the avoidable cost of congestion for the Australian capital cities is estimated to be around AUD16.5 billion for the 2015 financial year, which is a significant increase from about AUD12.8 billion for 2010 [3]. It seems there is no ceiling cap for such costs. For instance, traffic congestion could cost the UK economy more than £300 billion over the next 16 years which represents a 63% increase in annual costs by 2030 [4]. Building more roads should not be the put forward as the first solution to the congestion for the following reasons: (1) More roads encourages more trips; something is known as induced demand [5]. (2) More roads may adversely cause more congestion; the phenomenon is known as Braess’ paradox [6, 7]. (3) Roads’ constructions are capital intensive and time-consuming [8, 9].

By casting the transport in a supply–demand context, the congestion occurs when the demand outweighs the supply. Hence, the solution to the congestion can be attained either by adding to the supply or by lowering the demand. In the supply side (more roads), “Building more roads to prevent congestion is like a fat man loosening his belt to prevent obesity” (Lewis Mumford 1955). In the demand side, congestion pricing, originally proposed in the economic literature [10], has emerged as a significant leverage to bring balance back to the supply–demand equation. Economists, who believe in the power of a free market, seek solutions in the cost and pricing mechanism when they see any supply–demand imbalance [11, 12].

A traditional approach is to flag some (highly congested) roads as “toll roads” aiming to redistribute the traffic load evenly across the network including less congested roads. Labeling road with a price tag does not fall delicately in the mind of the people. Furthermore, equity (in terms of who has to pay and how much) is the main challenge in such scheme. To this end, some scholars introduced some sort of tradable credits which will be discussed in depth in the next section.

In the same spirit, we propose an unorthodox scheme in which some roads are tolled and some other (underutilized) roads are designated as subsidy roads. The aim is to encourage commuters to avoid highly congested roads and see alternative roads in the periphery area by means of credit incentive (subsidy).

The remainder of the article is organized as follows. The relevant studies in the literature are reviewed in Sect. 2. The concepts and underlining mathematical features of the methodology are elaborated in Sect. 3. Numerical results are presented in Sect. 4 followed by conclusion remarks in Sect. 5.

2 Literature review

This section presents a comprehensive review through which we cover a variety of subjects pertaining to the toll pricing, credit-based schemes, system optimal, etc.

2.1 Actual cost of congestion and system optimal

What we pay for transport is fuel cost, fare/tolls and value of travel time while we ignore other hidden costs imposed upon the society such as environmental degradation—namely externality cost. Hence, if we pay the actual cost of what we are consuming (the product here is the mobility), the market will correct itself and no such imbalance would occur. As the result, the question now is what is the actual cost of using a road, so that if it is charged as a toll the congestion would disappear? The pioneers on this idea are [13,14,15], and the following excerpt summarizes their take on the matter.

Although many traffic flow patterns could exist, it is widely believed that people seek to minimize their individual travel cost/time which results in a user equilibrium (UE) traffic pattern. Unfortunately, the UE is not the best pattern as far as the total costs of the transport system (the entire road network) are concerned. The “best” traffic pattern is the one at which the costs of the entire transport system are minimized, referred to as the system optimal (SO). The traffic congestion can be defined and formulated as “minimization of the cost of the system” that is to forcing the traffic pattern from UE to SO. To do so, congestion pricing is a plausible approach. It is mathematically proved [16] that the price tags of the roads are their respective marginal costs. Accordingly, the UE turns to the SO. The marginal cost can represent some externality costs such as pollutions and inequity [17]. From a mathematical point of view, it is easy to identify the marginal costs of the roads [16].

2.2 First-best congestions pricing versus second-best and equity concerns

In reality, it is not possible to communicate the marginal costs of all roads of the network with the drivers (i.e., tolling all roads) and expect to see a full SO traffic flow (known as first-best congestion pricing [18]). Rather, a few roads, presumably the highly congested ones, are selected as tolled roads and some fairly consistent toll rates are assigned to them (known as second-best congestion pricing [19]. Examples are the imposition of tolls on the roads forming cordon lines encompassing a central business district (CBD) or on the roads forming some screen lines which bisect the road network. One of the main challenges in the pricing schemes is the equity [20]. Equity concern is associated with the distribution of costs and benefits among the members of the society. A policy can be called equitable if it meets a normative standard of fairness (so equity means fairness). Following is one of the typical complaints: “I am living just one block after the cordon line in the CBD and my office is in the outer CBD. Beside the fact that you put my house inside the cordon line, I have no business in the CBD, so it is totally unfair to tax me.” In the USA, many congestion pricing proposals have been rejected based on worries that they are inequitable [21].

The equity concerns arise when people feel what they are paying is more than what they are getting. The inequity makes the concept of congestion pricing hard sell to both public and traffic authorities which is also known as acceptability issue, [2, 22,23,24]).

2.3 Credit-based scheme, an answer to equity concerns

One answer to the equity issue raised in the above typical complaint can be set out as follows: The authorities would issue a limited number of cordon-passing credits, at a reasonable price, to the people working or living in the CBD. Commuters who receive the credits can then sell them in a free market to other frequent commuters. Such a scheme is called “tradable credit scheme (TCS).” Although the TCS has been extensively studied in the past 20 years, there is no real implementation in the real world [25]. The TCS is hampered by three crucial questions: who are eligible to receive the initial credit?; what should be the initial price?; and, what would be the trading mechanism?

Given the above-mentioned complexities involved with the TCS, recently, the inclusion of “subsidy” in the congestion pricing has been introduced; that is, instead of traversing the main (tolled) roads, commuters can choose some other roads (un-tolled) slightly away from the main (tolled) roads. In addition, by traveling on these un-tolled roads commuters are given credits; so they are called subsidized roads.Footnote 1 The concept is called “toll-and-subsidy scheme (TSS).” In contrast to the TCS, there is no provision for the credit trading, nor does it need to carefully pick up the eligible drivers which both are advantageous.Footnote 2 There are also a number of other advantages associated with such schemes: (1) The equity and acceptability features are highly upheld. (2) It would strongly rally the public behind the scheme since it cannot be considered as a different form of taxation. (3) The scheme can be used to promote the public transport mode. (4) Districts in the vicinity of subsidized roads are expected to receive added values which can be then exploited by zoning authorities in the land-use planning initiatives (a practice called value capture [26]).

2.4 TSS and capacitated traffic assignment

Although finding the locations of the tolled roads can also be cast in a mathematical programing framework, past experiences have shown that it is usually a decision to be made by the authorities considering many technical, societal as well as political factors. One practical, intuitive and less computationally intensive approach is via traffic control policies, which are discussed in the following exposition. In principle, congestion pricing aims to keep the traffic volumes under some certain levels. These levels could be simply set to prevent traffic jam on some specific part of the cities, such as downtowns or the CBD. Traffic authorities have a fairly good understanding of the acceptable level of traffic loads in major roads or total incoming traffic in the CBD. Or in a more insightful way, the acceptable level of the incoming traffic to the CBD can be found from the macroscopic fundamental diagram [27]. Therefore, the location of the roads to be tolled and accepted level of traffic loads on them are assumed known and exogenously given in the TSS.

The only remaining issue is the pricing regime. The TSS’ pricing problem is articulated as follows: Given a set of roads with capped traffic volumes to be either tolled or subsidized, which one must be tolled, which one must be subsidized and how much?

The past studies have mathematically proved that this problem always has a solution [28]. The main focus of the current research is to present a methodology to reach a TSS tailored to large-sized networks. Since the congestion pricing is subject to a set of capped traffic volumes, we first transform the problem to a conventional “capacitated traffic assignment problem (CTAP).” We then interpret the values of the Lagrange multipliers of the capacity constraints as the toll/subsidy values.Footnote 3 To solve the resulting CTAP, we relax the capacity constraints by adding their Lagrange multiplier values to the travel time of the respective roads, viz. inflated travel time [29]. By doing so, the CTAP is transformed into a simple (un-capacitated) traffic assignment problem (TAP) which is easier to solve. The solution algorithm is designed as an iterative process: The travel times are increased to the level at which the traffic volumes do not exceed the roads’ capacities.

We present the TSS in the context of congestion pricing and then the tradable schemes. The interested reader for a comprehensive review of the former subject can consult with [18, 30, 31] and for the latter with [1, 20, 24, 32,33,34,35].

2.5 Literature on TSS

This section is dedicated to a comprehensive review on the subject of the TSS which sometimes referred to as toll-cum-rebate or price-rewarding schemes as well.

Bernstein [36] proposed a time-varying pricing scheme that includes a toll and a subsidy on the same routes. Commuters are charged if they arrive at the peak hour and are subsidized if they choose off-peak times. Over a simple network, it was demonstrated that the scheme could reduce the equilibrium cost. Adler and Cetin [37] discussed a redistribution scheme in which, revenue collected from a desirable route is transferred to drivers on a less desirable route. In contrast to the SO traffic pattern, they showed the redistribution scheme renders almost identical results. Guo and Yang [38] investigated the existence of Pareto-improvingFootnote 4 TSS considering multiple users. In the presence of the restricting Pareto-improving condition, they show that the TSS problem always has a solution if a significant gap exists (The gap is the room for improvement and it refers to the difference in total travel time between the UE and SO traffic patterns). Even in a more restrictive scheme where no subsidy exists, the Pareto-improving solution exists [39, 40]. Furthermore, in a fashion reminiscent of the TSS, the revenue from tolling highway roads is used to subsidize the fare adjustments on transit lines. In a controlled environment, Ben-Elia and Ettema [41] investigated the impact of a reward scheme on the behavior of commuters in the Netherlands. The results suggest that the reward scheme can be certainly effective in the short run, while it has yet to be investigated in the long run. It is worth noting that, recently, the idea of using rewards to change commuters’ behavior has been implemented in the Netherlands—though in limited scale—as part of a program called Spitsmijden [41]. Using a bilevel Stackelberg game approach, Maillé and Stier-Moses [42] investigated a mechanism based on a pure subsidy (without any toll) that aims at reducing congestion in urban networks. They concluded that the subsidies can significantly lower the social cost and part of the budget for rebates may come from the savings that arise from the most efficient use of the system.

In conjunction with the TSS, some scholars seek the revenue-neutral property; that is, the total toll revenue collected is set equal to the total amount of subsidies paid (essentially, the average commuter pays nothing [43]). As the result, the authorities can attain an optimized system by merely redistributing the revenue without engaging in financial transfer to or from the commuters. The scheme is also called Robin Hood toll scheme [44], which can be characterized by the following simple equation: toll–subsidy = 0.

By relaxing the non-negativity of the toll values [44] introduced the idea of the valid toll set. Based on their work, by establishing a number of linear programs one can obtain optimal TSSs for various objectives and considerations. Nevertheless, the computational time is found to be of highest significance. Our method differs in the fact that the traffic volumes are bounded from the above and the computational expenses is of no concern.

Compared to the tradable schemes (TCS), the Robin Hood method tries to obviate the “trade” part, which comes at a restrictive cost compared to what our method (TSS) offers. In other words, in our approach, we relax the equality sign in the equation; that is, the sum of the toll minus subsidy could be positive or negative (while in Robin Hood, this must equate to zero). This is aligned with what we are after (i.e., certain roads associated with certain target volumes to be achieved either by toll or subsidy); if we enforce Robin Hood method (adding additional constraints), the problem may become infeasible.

Regardless of the challenging task of the toll redistribution, it is highly appealing to merge the two ideas of revenue-neutral and Pareto-improving TSS, where everyone is a winner. For a single origin–destination (O–D) pair, Eliasson [45] showed that compared to the do-nothing scenario, in a revenue-neutral TSS, where the total travel time is reduced, no commuter is found to be worse off. Liu, Guo and Yang [46] and Nie and Liu [47] examined the existence of Pareto-improving and revenue-neutral pricing scheme in a simple bimodal network consisting of road and a parallel transit line. Results provided by Liu, Guo and Yang [48] suggests that the condition for every user being better off is weaker than the condition for total system travel time reduction. In other words, any revenue-neutral pricing scheme is Pareto-improving as long as it reduces the total system travel time. The results provided by Nie and Liu [47] imply that the revenue-neutral property is a restricting condition which may compromise the existence of a solution. In other words, external subsidies may be required to make every commuter happy (Pareto-improving). Xiao and Zhang [48] showed that on a one-origin or one-destination network, a Pareto-improving, system-optimal and revenue-neutral TSS always exists. It turns out that such a scheme may not always exist for a multiorigin network; hence, they sought the maximum possible revenue collected by the TSS problem subject to the Pareto-improving constrains. Given the results of the above studies, it is safe to say that maintaining both the Pareto-improving and revenue-neutral properties is difficult; possibly the best bet is to drop the revenue-neutral in order to secure a solution. Chen and Yang [28] proved that in the context of nonnegative toll schemes and/or toll-cum-rebate schemes for Pareto-efficient control, the TSS always has a solution. Considering that tolls and subsidies are represented by positive and negative values, then, a negative cycle is a distinct closed path along which the sum of the road travel costs is negative. If a negative cycle exists, commuters can make money merely by traversing along the negative cycle. Chen and Yang [28] proved that the TSS—as we set out here—has always a solution, if, loosely speaking, the amount of toll is higher than that of subsidy.

3 The mathematical model

As noted early, the problem to be undertaken is as follows: Given a set of roads associated with capped volumes (let us call it capacity) to be tolled or subsidized, what is a pricing scheme. In other words, what is the price tag (toll or subsidy) of each road? In this section, we first articulate the mathematical expression of the problem and then propose the solution algorithm.

3.1 Formulation of the problem

The problem as set out above with the capped traffic volume essentially is a capacitated traffic assignment problem (CTAP). Consider \( G(N,\,A) \) a traffic network as a graph which consists of N nodes and A roads on which \( R \subset N \times N \) is a set of origin–destination pairs. Set of roads nominated as toll/subsidy is denoted by \( \bar{A} \subset A \). The CTAP can be formulated as a nonlinear programing problem as follows (throughout the paper, all terms are nonnegative unless otherwise stated):

[CTAP]:

$$ \hbox{min} \quad z(x) = \sum\nolimits_{a \in A} {\int_{0}^{{x_{a} }} {t_{a} (x)\,{\text{d}}x} }, $$
(1)

s.t.:

$$ \sum\nolimits_{p} {f_{p,r}^{{}} = q}_{r}^{{}} \quad \quad r \in R \to {\text{Lagrange multiplier }}(w_{r}^{{}} ), $$
(2)
$$ f_{p,r}^{{}} \ge 0\quad \quad p \in P_{r}^{{}} ,\,r \in R, $$
(3)
$$ x_{a}^{{}} = \sum\limits_{r} {\sum\limits_{p} {f_{p,r}^{{}} \,\delta_{a,p}^{r} } } \,\,\,\,\quad \quad a \in A,p \in P_{r}^{{}} ,\,r \in R, $$
(4)
$$ x_{a} \le C_{a} \quad \quad a \in \bar{A} \to {\text{Lagrange multiplier }}(\beta_{a} ), $$
(5)

where \( z \) is the Beckmann objective function to be minimized; \( x_{a}^{{}} \) is the traffic flow on road \( a \); \( q_{r}^{{}} \) is the travel demand pertaining to OD pair \( r \in R \); \( f_{p,r}^{{}} \) is the traffic flow on path \( p \) connecting OD pair \( r \); \( P_{r} \) is the set of all paths connecting OD pair \( r \); \( \delta_{a,p}^{r} \) is the road-path incidence (1 if road \( a \) belongs to path \( p \) connecting OD pair \( r, \) and 0 otherwise); \( C_{a} \) is the capped traffic volume of road \( a \).Footnote 5

3.2 Karush–Kuhn–Tucker conditions and Lagrange multiplier

It is proved that the CTAP subject to linear constraints is a strictly convex problem which renders a unique global optimal solution of road flows [49,50,51,52,53,54,54]. Let us consider \( w_{r}^{{}} {\text{ and }}\,\beta_{a} \) as Lagrange multipliers for travel demand and capacity constraints, respectively; hence, the Karush–Kuhn–Tucker (KKT) conditions are established as

$$ f_{p,r}^{{}} (u_{p,r}^{{}} + \sum\nolimits_{a \in A} {\delta_{a,p}^{r} \,\beta_{a} - w_{r}^{{}} } ) = 0\quad \quad p \in P_{r}^{{}} ,\,r \in R ,$$
(6)
$$ \beta_{a} (C_{a} - x_{a} ) = 0\quad \quad a \in \bar{A}, $$
(7)
$$ u_{p,r}^{{}} + \sum\nolimits_{a \in A} {\delta_{a,p}^{r} \,\beta_{a} - w_{r}^{{}} } \ge 0\quad \quad p \in P_{r}^{{}} ,\,r \in R, $$
(8)
$$ C_{a} - x_{a} \ge 0\quad \quad a \in \bar{A}, $$
(9)
$$ f_{p,r}^{{}} \ge 0\quad \quad p \in P_{r}^{{}} ,\,r \in R, $$
(10)
$$ \beta_{a} \ge 0\quad \quad a \in \bar{A}, $$
(11)
$$ \sum\nolimits_{p} {f_{p,r}^{{}} = } q_{r}^{{}} \quad \quad r \in R, $$
(12)

where \( u_{p,r}^{{}} = \sum\nolimits_{a \in A} {\delta_{a,p}^{r} \,t_{a}^{{}} } \) is the total travel time perceived by commuters on a path \( p \) connecting OD pair \( r \). Let us define \( \hat{u}_{p,r}^{{}} \), an “inflated” travel time of the respective path as

$$ \hat{u}_{p,r}^{{}} = \sum\nolimits_{a \in A} {\delta_{a,p}^{r} \,(t_{a} + \beta_{a} )} \quad \quad a \in \bar{A} $$
(13)

Introduction of (13) into (6) and (8) results in

$$ f_{p,r}^{{}} (\hat{u}_{p,r}^{{}} - w_{r}^{{}} ) = 0\quad \quad p \in P_{r} ,\,r \in R, $$
(14)
$$ \hat{u}_{p,r}^{{}} - w_{r}^{{}} \ge 0\quad \quad p \in P_{r} ,\,r \in R .$$
(15)

With respect to Eqs. (14) and (15), it can be proved that \( w_{r}^{{}} \) is the travel time of the shortest path connecting OD pair \( r \); hence, if a path takes some traffic volume \( (f_{p,r}^{{}} > 0) \), it is certainly the shortest path. In other words, the first principle of Wardrop holds and the global optimum solution of the CTAP is, in fact, a UE traffic flow. According to Eqs. (7), (9) and (11), if the capacity constraint is binding, which means saturation \( (x_{a} = C_{a} ) \), then the corresponding lagrange multipliers are nonzero \( (\beta_{a} > 0) \); otherwise, it is zero. There are two terms contributing to the augmented travel time [Eq. (13)]: normal or cruise travel time \( (t_{a} ) \) and beta \( (\beta_{a} ) \). The beta can be interpreted as toll/subsidy value or alternatively the additional delay to prevent the respective road become oversaturated (i.e., traffic volume above the capacity). In the literature, the beta is also interpreted as the waiting time caused by the queue buildup in the oversaturated roads [49,49,50,52, 55,55,56,58]. As shown, the beta which represents the value of toll or subsidy is a positive number, whereas the subsidy is supposed to be a negative value. We will later handle this issue when an especial provision is devised to prevent any negative cycle.

3.3 Augmented travel time

Let us first assume that there exists only toll and no subsidy. After we establish the formulation for the tolls, we will then extend the discussion to the subsidy. Let us consider the delay function as \( t_{a}^{{}} = t_{a}^{0} \,(1 + f(x_{a} )) ,\) where \( t_{a}^{0} \) is the free flow travel time perceived by commuters on the road \( a \) and \( f(x_{a} ) \ge 0 \) is a non-decreasing and convex function of \( x_{a} \) such that \( f(x_{a} ) = 0|x_{a} = 0 \). The aforementioned function can accommodate a variety of known delay functions including the widely used function proposed by the US Bureau of Public Roads (the BPR delay function) [59]. Therefore, we can intuitively argue that, as long as a road is empty or uncongested, its travel time is the free flow travel time \( (x_{a} \approx 0\, \Rightarrow \,\,t_{a}^{{}} \approx t_{a}^{0} \,) \); as the traffic volume builds up, \( t_{a} \), the travel time, increases—certainly higher than the free flow travel time (this increase can be considered as a factor of \( t_{a}^{0} \) which is greater than 1, i.e., \( ((1 + f(x_{a} )) \ge 1) \)). According to Eq. (13), \( \hat{t}_{a}^{{}} \) the inflated travel time of “saturated” roadFootnote 6 \( a \) can be formulated as

$$ \hat{t}_{a}^{{}} = (t_{a}^{0} + b_{a} )(1 + f(x_{a} )) ,$$
(16)
$$ \beta_{a} = \hat{t}_{a}^{{}} - t_{a}^{{}} = b_{a} (1 + f(x_{a} )), $$
(17)

where \( b_{a} \) is an additional penalty in the free flow time in Eq. (16). In other words, \( b_{a} \) is the value of beta at \( x_{a} = 0 \); hence, we referred to it as the “initial-beta.”

Note that we have proved that the additional penalty term in the travel time is identical to the Lagrangian value of the respective capacity constraint [see \( \beta_{a} \) in Eq. (13)]. For more information, interested reader can also consult with [60]).

As a result, the travel time in the CTAP is replaced with the augmented travel time \( (t_{a}^{{}} \leftarrow \hat{t}_{a}^{{}} ) \) and the capacity constraint is also dropped. It is due to the fact, the beta, the Lagrange multiplier of the corresponding capacity constraint, now contributes to the travel time. Therefore, the CTAP is transformed to an un-capacitated TAP. If we had already known the global optimum value of the beta (in the CTAP), we would have just needed to solve the TAP based on the inflated travel times using any known algorithm or software. Of course, this is not the case; hence, the values of the (initial) betas are updated iteratively in the course of solving the TAP as explained in the following section. Moreover, the value of the beta is zero unless the corresponding road is oversaturated. In other words, Eqs. (16) and (17) apply only to the oversaturated roads to bring them down at their capacity levels (to become saturated). Hence, nonzero values are assigned to the initial-beta of the (over)saturated roads while they are set to be updated in the next iterations.

The proposed formulation differs from the second-best method which is in fact a bilevel programing problem. In the second-best methods, the aim is to minimize the total travel time spent in the network by enforcing a pricing regime, which is extremely hard to solve due to the computational complexity (also known as NP hard problem). In our proposed method, the target traffic loads of a number of selected roads (to be either tolled or subsidized) are already known and the aim is to set up the amount of tolls and subsidies to reach the target values. The main contribution of this study therefore can be attributed to avoiding a bilevel programing formulation, instead to articulate it as a CTAP which is much easier to solve even for large-sized networks. The important point to note is that, although a CTAP is basically solved, the toll/subsidy values are derived from the Lagrangian values of the respective capacity constraints.

3.4 A heuristic method to update the initial-beta

A variety of algorithms has been proposed to solve the CTAP for which Bagloee and Sarvi [29] have recently presented a comprehensive review. The inherent mathematical complexities associated with the capacity constraints have resulted in solution algorithms laden with a number of parameters to be calibrated, which is a prohibitive factor. In addition, arriving at an initial feasible solution to launch the algorithm is also a challenge. Alternatively, in this study, the mathematical complexity of the CTAP is overcome by adopting an intuitive interpretation of capacity; that is, the Lagrange multipliers of the capacity constraints are interpreted as additional delay imposed on the oversaturated roads to make them saturated. Such an interpretation has also been employed in the congestion pricing, namely the “trial-and-error” methods [18, 61,62,63,64].

The concept embedded in Eqs. (16) and (17) is to lift the delay function of the oversaturated roads until the traffic volume stabilizes at capacity level. The value of the initial-beta in the delay functions is updated iteratively, for which the main challenge is to progress toward convergence and user equilibrium. There is a strong correlation between the capacitated traffic assignment and the general theory of the congestion pricing [18] which is centered on charging commuters based on the marginal cost. Inspired by such an observation, the amount of additional delay to be added to obtain the inflated travel time is derived from the concept of marginal cost as follows [16, 49, 65]:

$$ \tilde{t}_{a} (x_{a} ) = t_{a} (x_{a} ) + x_{a} \cdot \,\partial t_{a} (x_{a} )/\partial x_{a}, $$
(18)

where \( \tilde{t}_{a} (x_{a} ) \) is the marginal cost or travel time experienced by an additional commuter added to \( x_{a} \) who are already traversing road a, and \( \partial t_{a} (x_{a} )/\partial x_{a} \) is the additional travel time experienced by each driver among \( x_{a} \). The marginal cost enforces system-optimal flow across the network to extend the underutilized roads will absorb the traffic discharged off the oversaturated roads. Hence, \( x_{a} \cdot \,\partial t_{a} (x_{a} )/\partial x_{a} \)—the additional delay imposed on the respective road—is considered as a template to update the initial-betas as follows:

$$ \hat{t}_{a,C}^{(i)} = (t_{a}^{0} + b_{a}^{(i)} ) \,(1 + f(C_{a} )), $$
(19)
$$ \nabla b_{a}^{(i)} = (x_{a}^{(i)} - C_{a} ) \frac{{\hat{t}_{a}^{(i)} - \,\hat{t}_{a,C}^{(i)} }}{{C_{a} }}, $$
(20)
$$ b_{a}^{(i + 1)} = b_{a}^{(i)} + \nabla b_{a}^{(i)}, $$
(21)

where superscript \( i \) and \( a \) denote the current iteration and respective (over)saturated road; \( \hat{t}_{a,C}^{(i)} \) represents travel time at capacity \( (x_{a} = C_{a} ) \) of the inflated delay function; \( b_{a}^{(i)} \) is the initial-beta or an additional penalty to free flow travel time \( (b_{a}^{(i)} ) \);\( \nabla b_{a}^{(i)} \) is the pace of the initial-beta computed at the current iteration, while \( b_{a}^{(i + 1)} \) is the updated initial-beta computed for the next iteration, initialized by zero \( (b_{a}^{(1)} = 0) \).

The formulation provided in Eq. (20) follows the spirit of marginal cost (i.e., \( x_{a} \cdot\partial t_{a} (x_{a} )/\partial x_{a} \)) to update the pace values. The “x” is replaced by the excess traffic \( (x_{a}^{(i)} - C_{a} ) \) and the slope of the delay function (\( {\text{d}}t/{\text{d}}x \)) is replaced by the slope of the inflated travel time minus the pace value normalized by the value of the capacity \( (\hat{t}_{a}^{(i)} - \hat{t}_{a,C}^{(i)} )/C_{a} \). In Fig. 1, Exhibit 1, the above formulations [Eqs. (19)–(21)] for three iterations on the (inflated) delay function are graphically shown. In the first iteration when there is no initial-beta \( (b_{a}^{(1)} = 0) \), the volume stands at \( x_{a}^{(1)} > C_{a} \), and the pace \( \nabla b_{a}^{(1)} > 0 \) is computed as shown graphically, which lifts the delay function for the next iteration \( (b_{a}^{(2)} = 0 + \nabla b_{a}^{(1)} ) \). In the second iteration, even though it is an inflated travel time \( (\nabla b_{a}^{(1)} ) \), the traffic volume still stands above the capacity \( (x_{a}^{(2)} > C_{a} ) \). Hence, the value of the pace \( (\nabla b_{a}^{(2)} > 0) \) is uplifted again in the third iteration: \( (b_{a}^{(3)} = b_{a}^{(2)} + \nabla b_{a}^{(2)} ) \). The third iteration is executed and the volume stands at capacity \( (x_{a}^{(3)} = C_{a} ) \). The three keywords or components of the proposed algorithm, beta \( (\beta_{a} ) \), initial-beta \( (b_{a} ) \) and the pace \( (\nabla b_{a} ) \) are shown in Fig. 1. During this progressive approach, if an (over)saturated road is found unsaturated—at an intermediate iteration—its corresponding penalty is nullified \( (p_{a}^{(i + 1)} = 0|x_{a}^{(i)} < C_{a} ) \) as shown in Fig. 1, Exhibit 2.

Fig. 1
figure 1

Conceptual representation of the proposed methodology for the road delay functions

The above conventions for computing the initial-beta can be summarized by two rules:

$$ b_{a}^{(i + 1)} = \left\{\begin{array}{cc} b_{a}^{(i)} + \nabla b_{a}^{(i)} \,\,{\text{from}}\,\,\,(20) &\quad x_{a}^{(i)} \ge C_{a} \\ 0 &\quad x_{a}^{(i)}< C_{a}. \end{array} \right. $$
(22)

The above setup complies with the KKT conditions at the stationary point of the capacitated assignment problem, in which the beta of the non-saturated roads must be set zero [18].

For ease of reference, the proposed methodology is referred to as the “inflated travel time” (ITT) method, similar to the terminology used in the literature. In the context of the other methods, the numerical results showed promising convergence behavior.

3.5 Subsidy

We can now include the subsidy in the formulation. As noted early, a negative cycle may spoil the whole pricing process. To avoid such a case, we ensure the roads’ travel times to take always positive values. To this end, we first zero out the free flow travel times of all the toll/subsidy roads. We then initialize the initial-beta to the respective free flow travel time. Effectively, nothing changes and the delay functions remain intactFootnote 7:

$$ \left\{ {\begin{array}{*{20}c} {b_{a}^{(0)} = t_{a}^{0} } \\ {} \\ {t_{a}^{0} = 0} \\ \end{array} } \right.\quad \quad \begin{array}{*{20}c} {} \\ {a \in \bar{A}} \\ {} \\ \end{array} .$$
(23)

This process is graphically shown in Exhibit 3 of Fig. 1. Let us have a close look at Exhibit 3: In the end of the computations (the last iteration), due to the value of the beta (or the last initial-beta), the resulting delay function can either stand at A or B (above and below the free flow time axis). The beta minus the free flow time (\( \beta_{a} - t_{a}^{0} \)) can result in either a positive value (delay function A) or a negative value (delay function B) which represents toll or subsidy, respectively. Moreover, the absolute value of the subsidy cannot exceed the free flow time. In other words, no road will take a negative travel time; hence, no negative cycle will appear.

Therefore, given the outputs at the end of the last iteration (iteration \( n \)), the toll/subsidy values \( ( - t_{a}^{0} \le s_{a} \le + \infty ) \) can be calculated as follows:

$$ s_{a} = \hat{t}_{a}^{(n)} - t_{a}^{0} (1 + f(\bar{C}_{a} ))\quad \quad a \in \bar{A}. $$
(24)

3.6 Capacity feasibility

In case the travel demand is higher than the capacity of the network, the problem becomes infeasible. It is important for any algorithm to have some mechanism to detect and address the infeasibility cases. To this end, one can introduce a dummy node connected with all zones via un-capacitated roads associated with high travel time. Therefore, the problem always remains feasible.

The dummy roads not only obviate any feasibility concern, they are also devised to replicate reality. Consider a single OD connected via a single road with a capacity of 10 vehicles. Faced with 15 vehicles demand, the algorithm allows 10 vehicles to drive through while 5 vehicles have no chance to do so; instead, they are assigned to the dummy roads as the leftover traffic volume. In reality, these excess 5 vehicles have no choice except to change their departure times which is also studied separately in the literature. In other words, the residual demand remains off the network until the next available traffic assignment interval, which is discussed in the literature on dynamic traffic assignment [66].

3.7 Termination conditions

The proposed algorithm can be easily integrated into a conventional solution algorithm for the TAP such as Frank–Wolfe. Hence, given fixed rates of the initial-betas, the solution algorithm itself needs to converge and meet its own termination criterion which is mainly driven by a relative gap. Boyce, Ralevic-Dekic and Bar-Gera [67] and Dial [68] recommend a relative gap of 0.0001 to ensure a perfect convergence to road flow stability. This criterion is adopted in our numerical evaluations. It is also expected that the initial-betas show convergence behavior over the successive iterations. Given the gradual buildup of the initial-beta, a convergence criterion can be defined as pace values falling below a trivial value (denoted by \( \varepsilon \)):

$$ \mathop {\hbox{max} }\limits_{a} |\nabla b_{a}^{(i)} /\hat{t}_{a} |\,\, \le \,\varepsilon. $$
(25)

The numerical results showed that \( \varepsilon = 1\% \) is sufficient to obtain reliable results. Consequently, the algorithm does not terminate unless both criteria the relative gap and the relative pace values are met.

The attempt to find the Lagrange multiplier values of the capacity constraints iteratively is, in fact, equivalent to solving a partial dual transformation of the original CTAP [62]. It can be shown that the gradient of the partial dual problem leads to a vector of excessive traffic volume for which the sub-gradient method is deemed to be a competent solution algorithm [69, 70]. The ITT algorithm is essentially a sub-gradient method because the dual values are updated iteratively based on the excessive traffic volume.

A key component of the sub-gradient methods is the step size which is uniformly applied to all roads and may vary across iterations. In order to ensure the convergence, the values of the step size must comply with a set of rules [71]. Hence, finding an appropriate scheme for the step size is a challenge. Instead, we proposed an intuitive step-size mechanism (analogous to marginal cost), exclusive to each road and iteration.

Note that the optimality conditions as well as the convergence aspects of the ITT method have been discussed in the literature ([29]). The novelty of our method in this research is to translate the Lagrangian values as toll/subsidy values. To this end, free flow travel time is first zeroed out; when the CTAP terminates, the difference between the ultimate travel time and the original travel time function specifies the toll/subsidy values (this process is also graphically illustrated in Fig. 1). We are yet to prove that the step size devised for the ITT abides by the aforementioned rules.

In the sequel, we report on the numerical results. As a recap, Fig. 2 shows a flowchart of the proposed methodology for the toll/subsidy pricing scheme.

Fig. 2
figure 2

Flowchart of the proposed methodology for the toll/subsidy pricing. Note that the only loop forwarded to the second block in the flowchart (where it says “Solve the corresponding (un-capacitated) traffic assignment problem (TAP) for only one iteration”) is repeated until the corresponding TAP converges to a trivial relative gap. In other words, that does not mean the TAP is solved only for one iteration. In fact, it means a lot more iterations while being paused to provide an opportunity to update the initial-beta in the subsequent block

4 Numerical results

The methodology is coded in a “macro,” which is the programing language of the EMME 3 [72]—a transportation planning software, in which the un-capacitated TAP is solved using the Frank–Wolfe algorithm. A desktop PC with a 3.60 GHz CPU and 16 GB of RAM is employed. In this section, we first report of numerical analysis of a pedagogical example to illustrate the concept of toll-and-subsidy. Second, a large-sized road network is undertaken to show applicability of the proposed methodology to real-life examples.

4.1 A Pedagogical example based on the Braess’ network

Figure 3 depicts the famous Braess network consisting of 5 roads connecting an origin to a destination as well as roads’ delay functions in which the travel demand is 6 [16]. Figure 3a also shows traffic volumes of the existing condition with no toll/subsidy such that the total travel time becomes 552 (see also Table 1). It is proved that road #5 is a Braess tainted road; that is, if it is closed, the traffic circulation improves (or in other words, the total travel time in the network decreases to 498, see Fig. 3b and Table 1). To this end, a traffic authority would like to retain the road #5, instead to toll it to decrease the total traffic load from current 2 to 0.5. Figure 3c shows the results in which amount of toll imposed on the road #5 became 9.75 and the total travel time decreased to 506.63. As can be seen, the traffic volume of road #1 has reached 2.75. In the last try, the traffic authority would like to retain the traffic volume of the road #5 at 0.5 and increase the traffic volume of the road #1 to 3.5. It is intuitively conceivable that to increase more traffic passing through the road #1 there must be some incentive or subsidy, whereas the road #5 still has to be tolled. Figure 3d shows the results in which the target traffic loads have been achieved at the cost of toll price of 1.5 (for road #5) and subsidy of −16.5 (for road #1), while the total travel time stands at 519, which is still lower than that in the existing network scenario (Fig. 3a). All the calculations are reported in Table 1.

Fig. 3
figure 3

Braess example, different pricing regimes

Table 1 Braess example, different too/subsidy scenarios

4.2 Winnipeg case study

The ITT algorithm is applied to a large-sized network of the city of Winnipeg, Canada, which is widely used in the literature [73] (it is also provided by [72] as part of the EMME 3). The case study consisted of 154 zones, 943 nodes and 3075 directional roads with hourly travel demand of 56,219 vehicles. The delay functions are assumed to comply with the general format of the BPR (Bureau of Public Roads) functions.

The city is bisected by two rivers which merge in the middle of the city. According to traffic surveys and field observations, traffic authorities have come up with a desirable traffic distribution over the west-north river crossings to avoid any major gridlock in the AM peak hour. Figure 4 depicts 11 bridges which are subject to the traffic control.

Fig. 4
figure 4

Winnipeg case study, locations of the river crossing nominated for the TSS

Note that it is postulated that the target traffic values of the toll/subsidy roads are set in such a way the total system travel time improves. This can be derived from macroscopic fundamental diagrams [27] or also known as perimeter control which is an active area of research [74, 75].Footnote 8 In our study, the location and target traffic values of the toll/subsidy are known and are exogenously given. To this end, in Table 2, characteristics of the crossings including the exiting traffic volume, free flow travel times \( t_{a}^{0} \), physical capacities \( \bar{C}_{a} \) as well as the capped traffic volume \( C_{a} \) are indicated. In this section, we first report on results of above-mentioned network and demand scenario. As a sensitivity analysis effort, given the same network, we then project the travel demand to 5% and report results of the congestion pricing.

Table 2 Winnipeg case study, input and output values

4.2.1 Case study of existing network and existing travel demand

The algorithm runs for 237 iterations until the relative gap and the relative pace value falls below 0.0001 and 0.01, respectively. The computation time lasts only 15 s. The results including the traffic volumes and the inflated travel time as well as the toll/subsidy values are also shown in Table 2. The restrictions on the traffic volume are upheld, and the slight violations—as shown under \( x_{a} /C_{a} \) column—are less than 1%, which can be attributed to the fact that the ITT is a heuristic algorithm. Figure 5 demonstrates variations of traffic volume and the inflated travel times over successive iterations pertaining to the roads 1, 2 and 3 (note that we chose the first three roads; otherwise, the figure would be inundated with myriad of data becoming hard to read). The results indicate that the variation is volatile in early iterations, but it then stabilizes as the results head toward the convergence. Similar erratic variations in solving the CTAP have also been reported in the literature [55].

Fig. 5
figure 5

Winnipeg case study, the first three bridges over successive iterations: a variation of the traffic volumes; b variation of the inflated travel time

Figure 6a illustrates traffic volumes before and after implementation of the TSS. It is obvious that the traffic volumes are squeezed below the capped values. Variations of the pace values over successive iterations are depicted in Fig. 6b. In this figure, the maximum of the pace values (in absolute value) across the toll/subsidy roads are plotted. As can be seen, the pace values rapidly converge to zero. The overall convergence of the algorithm is depicted in Fig. 6c in which the relative gaps (in percentage) as well as the Beckmann values are shown. The last column in Table 2 presents the toll/subsidy value—in Canadian dollar unit—in which the value of time is assumed $16.69 per hour [76]. The maximum toll and subsidy found to attain the target crossing volume are $4.13 and $0.26 (twenty-six cents). Given the rapid expansion of communication technology, devises such as GPS are now indispensable gadget. Therefore, communicating a set of diverse rates of toll/subsidy—as is the case in the output—should not be a problem.Footnote 9

Fig. 6
figure 6

Winnipeg case study: a traffic volume on the screen line before and after TSS; b maximum of changes in penalty; c convergence behavior

4.2.2 Case study of existing network and projected travel demand of 5%

The travel demand is projected to 5% which stands at 59,029.94 vehicle trips. Growth of 5% can be considered as annual or biannual growth of urban traffic, so that one can gain some insight of the annual changes in the toll/subsidy amounts. Similar to Table 3, comparative results of the projected demand scenarios are shown in Table 3. As seen, the total traffic flow passing through the toll/subsidy road increases from 24,407.3 to 25,609.1, which is equivalent to 4.9% growth. The amounts of subsidy and toll also have seen moderate changes which is an endorsement to the stability of the final results.

Table 3 Winnipeg case studies, comparisons of two scenarios: existing demand and projected demand of 5%

5 Conclusion

We studied the idea of the TSS for congestion pricing. The element of subsidy is an appealing factor that can greatly compensate some shortcomings related to equity and acceptability in the previous schemes. A fully fledged TSS problem ought to be postulated as a bilevel programing problem which is highly intractable. Nonetheless, in practice, the problem is usually as simple as follows: The location of the tolled (or subsidized) roads is known; it could be a cordon line surrounding a CBD or screen lines bisecting the urban network. Even—based on the traffic survey or historical data—the target traffic volume on the tolled road is also given. Therefore, the question of interest is: What is the best pricing regime, or, which road should be tolled and which ones should be subsidized and how much? To answer this question, given the capped values of the target roads, we first transformed the problem to a conventional CTAP. Second, the Lagrangian values of the respective capacity constraints are interpreted as the toll/subsidy. Third, we relaxed the capacity constraints by moving their Lagrangian values to the respective roads’ delay function. In other words, the delay functions receive an additional term to be iteratively updated in such a way the traffic volumes remain below the capped values. We referred to the process as “inflated travel time.” The algorithm was then applied to the real dataset of the Winnipeg, Canada, in which 11 river crossings bridges of the city of Winnipeg were subject to the TSS. The maximum values of toll and subsidy were found to be 4 and 0.2 Canadian dollars.

The subsidy can also be well played as a leverage to address land-use-related problems. The area in the vicinity of subsidized roads can be a magnet for business and job opportunity. The immediate role of the subsidy can be attributed to the fact that it makes the pricing product more sellable to the public since it brushes off the stain of being another form of taxation.

The concept of the subsidy does not need any payment mechanism. The collected subsidy can be later reimbursed in many different ways such as toll payment, parking fee, public transport ticket, traffic fine, annual car registration to name a few. It is also easy to protect the system from any mall practices, for instance, total daily value of collected subsidies can be limited, and hence, no one can make money by simply driving in the city (a condition to ensure that the problem always has a solution [28]).

In the same spirit, Morosan and Florian [77] have recently solved a distance-based pricing with minimum and maximum toll rates based on which commuters are charged based on vehicle kilometers traveled (VKT). Therefore, integration of the VKT-based TSS can be a worthwhile research.

On other threads for future studies, the followings are worth mentioning. In this study, we assumed that the travel demand is fixed and all drivers have an identical perception about the toll/subsidy. It is, therefore, a worthy attempt to extend the research to multiclass travel demand in which the value of time varies across different sectors of the society. The elasticity of the travel demand to the changes in the pricing regime by itself deserves a further investigation. With the same spirit, consideration of the mutual impact of transit and private modes in the presence of congestion pricing needs to be further studied. As noted before, in order to subject different classes of cars (e.g., heavy truck vs. passenger cars), different amount of toll/subsidy, the multiclass modeling is inevitable. Furthermore, replacing the link-based traffic assignment with path-based method is a significant boon to the model, such that one can get more insight of the origin destination and the trajectory of the drivers. This information can then be used for planning purpose such as distance-based congestion pricing which is an interested line of research. Investigating the congestion pricing via driving simulator and its applications in the intelligent transportation system to promote the freight modes are some worthy stream of research [78, 79]