Introduction

Since the advent of the four-step transport planning framework in the 1950s, there has been a drive to incorporate linkages between transport networks and the broader economy. The simplest four-step models assume that origin–destination (OD) demands for trip assignment remain static as transport networks change. For small changes, this assumption is most likely realistic, but larger ones can lead to fundamental shifts in economies and travel demand. For example, improvements in travel time savings may lead to migration, changes in employment patterns, increased attractiveness of shopping regions and so forth.

One method of incorporating OD demand elasticity is to update the generation, distribution and mode split steps using travel times from the assignment step. Land-use transport interaction (LUTI) models (Wegener 2004) achieve this by simulating changes in land-use patterns resulting from changes in travel times or accessibility indicators, and then recalculating travel demand from the new land-use patterns. While this can allow for the partial equilibration of land markets, there is still a reliance on fixed economic parameters for markets other than transport and land as well as of prices for all markets.

Unabated population growth, urbanisation, and increase in consumption leads to higher pressure on transport services. In addition, changes in the production structure and economic patterns also can influence transport demand over time. As a result, a new body of research focuses on transport modelling, defined by the creation of transport input–output (IO) models at the urban and regional scale, where authors used IO, gravity, and SCGE approaches to model inter-regional freight demand (Ivanova 2014).

Furthermore, there is increasing interest in understanding the welfare gains and losses that occur due to market imperfections and technological externalities, known as wider economic impacts (WEIs). These are not captured in the conventional consumer surplus metric used in cost benefit analyses, whether driven by classic four-step modelling structures or contemporary activity-based models.

Computable general equilibrium (CGE) models have been adapted in recent years to analyse the economic effects of transport and to quantify WEIs (Wangsness et al. 2016). A CGE model comprises a set of equilibrium equations representing supply and demand in every market in a given economy. These equations are built from fundamental microeconomic models of household (consumer) and firm (producer) behaviour, and thus a CGE model can provide a wide array of welfare, price and GDP outputs. They can also be used to identify the eventual distribution of impacts both spatially and through markets from the equilibration of the entire economy.

CGE models for transport are typically applied to estimate productivity gains from transport improvements, where transport costs determine regional demand and freight costs (Robson et al. 2018). Some formulations represent household travel demand as well, in which case transport costs influence household decisions on where to live, work and shop. In most models, transport costs are static parameters calculated by a transport model. There are a limited number of models with feedback between CGE and transport submodels such that the CGE model generates demand for the transport model, and the transport model generates transport costs for the CGE model (Shahraki and Bachmann 2018).

This type of CGE model with endogenous travel costs provides a basis to develop an extended four-step planning model that accounts for a fuller range of economic responses. Since CGE models are simulators of economies, it is possible to generate trips at a very fundamental level—as a derived demand from the activities of households and firms, whose behaviour is determined from utility and profit maximisation models. A key advantage of linking these two models is that the one model could then be used to estimate both the magnitude and distribution of economic impacts, rather than relying on exogenous transport and economic models, generating a faster and richer set of outputs for strategic planning. The two-fold benefit towards both CGE and transport modelling practice would be that the transport model provides a complete endogenous response of transport costs for the CGE model, and the CGE model provides an endogenous economic response for the transport model.

However, very few, if any, existing CGE models with endogenous transport costs represent transport networks in such detail (Robson et al. 2018). For example (Broecker et al. 2004) modelled Socio-Economic and Spatial Impacts of EU Transport Policy using regional transport model linking to the CGE for the entire European Union. Transport models in the four-step framework are required to handle detailed networks, incorporate different time periods and generate a complete set of travel demands (including household and freight) to model transport costs accurately and at a resolution that permits the analysis of physical changes in infrastructure. Instead, existing CGE models with endogenous transport costs mostly represent sketch networks at the geographical scale of the CGE model, which are generally much coarser than the scale of travel zones in a four-step model, making the analysis of physical changes more difficult. Furthermore, existing models tend to only generate trips that are relevant to their model behaviour, e.g. commuting and shopping trips for an urban CGE model, or freight trips for a regional CGE model. Without representing the full set of travel demands, transport costs do not reflect those observed in the actual network.

This paper presents an integrated CGE and transport model that generates household and freight trips and simulates a detailed road network for different time periods. It is conceptually distinct from the model in Robson and Dixit (2017a) in that the transport submodel can be calibrated and run as a conventional transport model. Compared to that model, the transport submodel is new in that it is formulated separately, generates freight trips and allows for different geographies and time periods between the submodels. The CGE submodel in this paper is an extended version of the submodel in Robson and Dixit (2017a) by adding foreign market and government agents. Above all of these modifications, the Robson and Dixit work was based on a mixed complementarity formulation which was restricting the usage of the model for large scale problems. This is now relaxed in the proposed formulation of this paper.

This structure provides a universal platform for modelling the economy and transport networks together, which can be extended for land-use. The model is calibrated for the Sydney transport network and economy, and is tested by simulating the WestConnex motorway project currently under construction. At this stage, the model does not simulate mode choice, but can be implemented in future models. The proposed model paves the path of development of transport-based CGE models. This suggests a paradigm shift from CGE-based transport models, which are typically limited in terms of the level of complication that can be borrowed from a transport model such as number of zones/links of a realistic transport network or properties of the travel demand model. In introducing a transport-based CGE model, the transport aspects of the proposed formulation can be seen as an object-oriented structure which can be easily plugged in if needed without requiring many changes in the CGE model. As such, the proposed transport-based CGE model can better accommodate transport related policies to assess their effectiveness and robustness. Compared to the existing literature, Fig. 1 shows where the current paper stands in literature and the gap it fills with proposing transport-based CGE model.

Fig. 1
figure 1

A comparison between the existing literature and the current paper (Anas and Hiramatsu 2012; Tikoudis 2020; Tscharaktschiew and Hirte 2012; Dröes and Rietveld 2015)

In this paper, “Literature review” section reviews existing CGE models for transport. “Methodology” section develops the integrated model and discusses its assumptions. “Calibration” section describes data for the model calibration and details the calibration process. “Application” section describes the model application. “Results and discussion” section presents and discusses the results of the model application. “Conclusion” section concludes the paper.

Literature review

CGE models originate from the fields of IO modelling and general equilibrium theory. In the 1930s and 40s, Leontief (1941) developed a model to predict the required production levels of industries in an economy to meet a given final demand, based on a matrix of inter-industry commodity flows (an IO table). Johansen (1960) extended this model to include prices, which enabled behavioural models of households and firms to be introduced to the framework. Meanwhile, models of the economy were being developed in the field of general equilibrium theory, in particular the Arrow–Debreu model (1954), which the solution algorithms of Scarf (1973) made operational. However, the IO approach started to dominate by the 1980s due to its flexibility, and the terms and concepts of the two streams of literature effectively merged. CGE models from this era, such as ORANI (Dixon et al. 1982), became prominent for a range of high-level economic issues such as the setting of trade, tax and labour policies.

Many different fields have developed CGE models for transport with their own theory and practices. Robson et al. (2018) defined three main categories: urban CGE models, regional CGE models, and congestion and externality CGE models. Urban CGE models borrow more from the general equilibrium literature to simulate urban economies and incorporate urban-scale characteristics such as land-use and congestion. While early urban CGE models were theoretical in nature, more recent examples such as RELU-TRAN have been applied in real cities to study issues such as fuel prices, congestion tolling and job growth from infrastructure investment (Anas 2013). CGE models at the regional scale, including spatial CGE models, spatial price equilibrium-based (SPE) CGE models and new economic geography-based (NEG) CGE models, focus more on the economic impacts of inter-regional freight costs. Spatial CGE models are spatial extensions of conventional CGE models, and thus draw significantly from the IO literature. These have been developed for a diversity of applications globally, including in Europe (Bröcker 1998), North America (Wittwer 2017) and Asia (Kim et al. 2004a). NEG-based CGE models simulate the formation of urban agglomerations through industries with imperfect competition and increasing returns to scale. Some of these have been developed extensively for transport appraisal, such as the RAEM (Ivanova et al. 2007) family of models used to estimate WEIs in the Netherlands. Finally, congestion and externality CGE models are generally non-spatial, but represent travel commodities and household travel demand in detail to calculate congestion externalities. All of these fields have models that contain useful aspects for developing a CGE model that is aligned with the four-step transport planning model.

Shahraki and Bachmann (2018) identified three key aspects of CGE models that relate to transport networks: transport costs, network behaviour and trip generation. In spatial CGE models, transport costs are incorporated as a margin for industries to move goods from their region of production to their region of use. These margins are provided by an explicit transport industry in a full model. However, an alternative approach first proposed by Samuelson (1954) and popularised by the NEG literature (Fujita et al. 1999) is to assume that a portion of the transported good is used up in proportion to the freight cost (the ‘iceberg assumption’). While this method is computationally simpler, it can lead to serious misspecifications in production (Tavasszy et al. 2011). The transport costs themselves are normally outputs from an exogenous transport model without feedback from the CGE model.

In contrast, many urban and congestion/externality CGE models, as well as RAEM (Ivanova et al. 2007) and a few other regional models, represent household travel demand either as a commodity itself or derived from household activities. These trips can then be simulated in a transport model to determine demand for the travel commodity or underlying activity in the CGE model. Congestion and externality CGE models such as Mayeres’ model (2000) tend to have the most detailed household travel demands, but due to their non-spatial nature, cannot simulate congestion in transport networks other than through a single increasing function relating travel demands to costs. On the other hand, urban CGE models such as RELU-TRAN (Anas and Liu 2007) and Rutherford and van Nieuwkoop’s model (2011) are linked with traffic assignment models as they generate OD demands from household activities. In an iteration of RELU-TRAN, a CGE submodel is run to generate OD demands, which are then fed into a traffic assignment submodel to determine routing and travel costs for the next iteration of the CGE submodel. Rutherford and van Nieuwkoop’s model instead combines the CGE and traffic assignment submodels into a single mathematical problem. To do this, the model uses a multi-commodity flow (MCF) form of user equilibrium (UE) assignment, which can be expressed as a mixed complementarity problem and combined with the CGE submodel equations. There are also equivalent integrated CGE and transport models for freight trip generation in the SPE CGE literature such as Friesz et al.’s model (1994), but few, if any, models generate both household and freight trips.

Models such as RELU-TRAN (Anas and Liu 2007) and Rutherford and van Nieuwkoop’s model (2011) resemble the four-step framework, but lack detailed transport networks or the generation of a complete set of trips. Robson and Dixit’s model (2017a), based on Rutherford and van Nieuwkoop’s structure, generates a fuller set of trips (commuting, shopping and leisure), but only at the daily level. Another set of trips to consider is business trips, which is included in the RAEM model (Oosterhaven et al. 2001; Knaap and Oosterhaven 2011).

In summary, existing CGE models developed for transport applications have generally retained an economic focus, with a gradual relaxation of fixed transport cost assumptions through incorporating link congestion functions or separate transport network models. However, in almost all these cases, the transport models are somewhat additional to the CGE models, and thus lack emphasis on the necessary features required in transport modelling. These include separate geographies between the CGE and transport models to enable the simulation of detailed transport networks, rather than sketch networks, a full set of household and freight trip purposes, and the incorporation of different time periods for modelling the road network.

The proposed framework can easily incorporate population and employment modifications. The size and distribution of the population is captured in the model through household agents, where each household represents the people living in each combination of residence location, workplace location and employment industry. The size of each household agent is driven by their endowments of time and capital, with each household’s endowment being the sum of the endowments of the represented population. Population forecasts can therefore be incorporated by adjusting the endowments of each household for each forecast year. For example, if a region’s population is forecast to double (with employment and income distributions remaining the same), the endowments of the corresponding households can be doubled to forecast travel demand. This also enables a comparative static analysis for each forecast year by running the model both with and without the infrastructure/policy change. Likewise, workforce forecasts can be incorporated by adjusting the endowments of the corresponding households, except by employment region and industry rather than by residence region which is analogous to using the Pisserides search model (Pissarides 2000) in RAEM. However, unlike population forecasts, overall levels of employment are determined endogenously through modelling the labour-leisure decision and do not need to be modelled externally. Further, this model is designed so that mode split behaviour can be added without great difficulty, for example using an MNL model (resulting in a logsum calculation of trip costs) or CES function at the trip generation and trip cost stages of the framework.

Activity-based modelling is a more recent class of travel demand modelling that simulates the daily activities and corresponding movements of individuals, providing far greater detail than conventional four-step models. To the authors’ knowledge, there are no examples of integrated CGE and activity-based models for travel demand, although there are examples of integrated CGE and microsimulation models such as Bourguignon et al. (2005). It may be possible to develop such a model, but due to the additional complexity of activity-based models, the required computing time may be prohibitive.

Methodology

The proposed model is an integrated CGE and transport model that generates both household and freight trips and can simulate different geographical scales and time periods between the submodels. Four-step models typically have high spatial detail, with travel zones under 1 km2 in size to load demand precisely onto links. A range of time periods are also usually simulated as travel patterns vary significantly throughout the day. On the other hand, a CGE model grows rapidly as the number of regions increases, and becomes unwieldy at the number of zones required in a four-step model. Even if it were possible, it is debatable whether spatial variances in economic conditions justify CGE modelling at the travel zone level. In this model, the CGE submodel is an extended version of the submodel in Robson and Dixit (2017a), while the transport submodel is significantly different.

In terms of the structure of integration, unlike the model in Robson and Dixit (2017a) which was formulated as a single mixed complementarity problem, this model comprises separate CGE and transport submodels. In terms of the structure of integration, unlike the model in Robson and Dixit (2017a) which was formulated as a single mixed complementarity problem, this model comprises separate CGE and transport submodels in which the model tends toward equilibrium due to negative feedback reactions. The change is in view of tests that were conducted on the original model—when the transport network was scaled up, solvers for the MCF UE assignment formulation slowed considerably and often could not converge. In contrast, conventional assignment algorithms could solve the same networks in minutes due to their ability to adjust volumes across multiple links (path flows). As a result, the submodels have been split into explicit models, despite the original structure potentially affording efficiencies in allowing a solver to find the fastest solution path between the two submodels.

Both the CGE and transport submodels are static. The model is calibrated to a baseline equilibrium that is replicated when the model is run without changes. When an economic or transport parameter is changed, the model calculates a counterfactual equilibrium representing the long-term state of the economy, which provides a consistent basis for measuring impacts and comparing proposals. This is founded on the assumption that markets and transport networks tend towards equilibrium over time through the adjustment of prices and routing respectively. Any parameter in the CGE and transport submodels can be changed, including the structure of the transport network, yielding corresponding changes in prices, outputs, travel demand and trip costs.

Model structure

Figure 2 shows the structure of the model, which is implemented as a script in the mathematical programming software AMPL (Fourer et al. 2003). The model begins by running the CGE submodel to simulate the behaviour of households and firms interacting in markets over a set of regions R. A solution of the CGE submodel is characterised by a set of prices \({p}_{g,\left(i,r\right)}\), \({p}_{l,\left(i,r\right)}\) and \({p}_{k}\), firm outputs \({y}_{\left(i,r\right)}\) and tax collection g as described in “CGE submodel” section. Household activities in the CGE submodel are facilitated by commuting, shopping and leisure-associated vehicle trips, each of which costs time as given by the transport submodel (\({v}_{l,h},\;{v}_{c,h,\left(i,r\right)}\) and \({v}_{t,h}\) for commuting, shopping and leisure respectively). This generates a set of daily household vehicle OD demands between and within regions R, as in the first three steps of the four-step model. In addition, firms source their inputs from different regions as well as their own region, and therefore must purchase a freight service \({\rho }_{f,\left(j,r\right)}\) in proportion to travel time to transport commodities from their region of production to region of use. This generates a further set of freight vehicle OD demands between and within regions R.

Fig. 2
figure 2

Model flowchart

In the next stage of the model, the daily household and freight OD demands between and within regions R are converted into a set of OD demands between travel zones \(Z\) for time periods T. The trips are then assigned to the road network \(\left(N,L\right)\) in the transport submodel according to user equilibrium for each time period. Finally, the trip costs are converted back to daily time prices for household activities and freight requirements for firm production in the CGE submodel. Please note that the model does not incorporate time as a production input so changes in non-transport production time cannot be reported.

When changes in the CGE and transport submodel variables fall below a specified critical gap between iterations of the full model, the model is considered to have converged. The remainder of “Methodology” section details each stage of the model.

CGE submodel

The CGE submodel is an extended version of the one in Robson and Dixit (2017a). The submodel comprises a set of simultaneous nonlinear equations that are parameterised by travel time prices and freight requirements from the transport submodel. Commodity flows from the CGE submodel are then used to generate trips for the transport submodel. At any equilibrium, the established CGE model represents an economy with an excess demand function and a non-zero price vector. With a finite number of equilibria, the economy has a general equilibrium configuration (Mas-Colell et al. 1995). The model is based on the Arrow–Debreu development economy, which features competitive markets with no restrictions on free-market operation and atomistic competition. The CGE model's output function is a nested Constant Returns to Scale (CRS) Constant Elasticity of Substitution (CES) function with a regular structure, which was borrowed from the ORANI model (Dixon et al. 1982). The proportion of intermediate input demands between regions is fixed, and the Armington (1969) assumption of imperfect substitution between regional sources of goods is applied. The developed CGE model does not have a variable closure (fixed closure with no variability in closure), also known as a long-term closure, as it contains the same number of variables and equations (all macro variables endogenous).

Online Appendix A provides a full listing of the model sets, parameters, variables and equations. For the CGE model, all of the variables defined in Online Appendix A—Table A.4 are endogenous, while all of the parameters defined in Online Appendix A—Table A.2 are exogenous. Online Appendix C details the derivation of the CGE submodel equations, including the demand and supply equations of the economic agents.

The agents and commodities of the CGE submodel are built from two fundamental sets that govern the scope of the model:

  • A set of regions R.

  • A set of industries I, with one industry \({i}_{T}\in I\) specified as a transport industry to provide freight services. The remaining industries form set \(J=I\setminus \left\{{i}_{T}\right\}.\)

The CGE submodel simulates the behaviour of two core sets of economic agents:

  • A set of firms F, with each firm \(f\in F\) representing the production activity of industry \(i\in I\) in region \(r\in R\) such that \(F\subseteq I\times R.\)

  • A set of households \(H\), with each household \(h\in H\) representing the people residing in region \(r\in R\) and employed in firm \(f\in F\) such that \(H\subseteq R\times F\subseteq R\times I\times R.\)

Furthermore, there are two other economic agents that interact with the households and firms:

  • A foreign market agent that provides commodities produced externally (imports) for firm production at a fixed price, and purchases a fixed amount of commodities produced by firms for external use (exports).

  • A government agent that collects tax revenue from firm production and redistributes it to households.

Firms and households trade the following commodities in the markets of the CGE submodel:

  • A set of goods, with each good representing the output of each firm \(\left(i,r\right)\in F.\) Each good \(\left(i,r\right)\) has a price per unit \({p}_{g,\left(i,r\right)}.\)

  • A set of labour commodities, with each labour commodity representing the labour input of each firm \(\left(i,r\right)\in F\). Labour refers to time that is endowed to households and sold to firms. Each labour commodity \(\left(i,r\right)\) has a price per unit \({p}_{l,\left(i,r\right)}\).

  • Capital, representing the common production capital commodity in the submodel. Capital is endowed to households and sold to firms as per the putty-putty assumption of flexibility between industries and costless movement between regions. Capital has a price per unit \({p}_{k}\).

Firms F also purchase bundles of imports at prices per unit \({p}_{I,f}.\) For simplicity, households are assumed to purchase only local goods, although this assumption can be easily relaxed if necessary.

The economy is assumed to have open markets and perfect competition with constant returns to scale in production such that all economic agents are price takers and firms earn zero economic profit. This form of competition, as opposed to monopolistic competition, was chosen for this particular model to reduce potential issues with multiple equilibria. In addition, welfare measured from the CGE submodel (traditionally used for the measurement of indirect impacts) is not additive with welfare measured from the transport submodel (traditionally used for the measurement of direct impacts), and the CGE submodel should be taken as the single point of measurement of welfare.

The choices regarding labour and capital mobility were made in view of the strategic planning uses of the model. The infinite mobility of capital reflects a long-term perspective where there is sufficient time for it to transform and move as required, for consistency between counterfactual scenarios. Likewise, labour was assumed to be immobile to allow for consistent comparisons between groups of people, and to have the ability to incorporate workforce forecasts. Both of these assumptions will be reviewed in future models.

Households

Households maximise utility subject to both time and monetary constraints. Utility is specified as the nested constant elasticity of substitution (CES) function with constant returns to scale shown in Fig. 3, where consumption refers to the aggregate intake of goods. The chosen nesting structure accords with conventional utility functions in the literature in “Literature review” section to fit classic assumptions in microeconomic behavioural modelling and to enable the use of elasticity of substitution parameters from existing models.

Fig. 3
figure 3

Structure of utility function

At the top level, utility \({u}_{h}\) is a Cobb–Douglas combination of consumption \({c}_{h}\) and leisure \({t}_{h}\):

$${u}_{h}={c}_{h}^{{\alpha }_{c,h}}{t}_{h}^{{\alpha }_{t,h}}$$
(1)

Consumption itself is a Cobb–Douglas combination of aggregate sectoral goods \({c}_{h,i}\):

$${c}_{h}=\prod_{i\in I}{c}_{h,i}^{{\beta }_{h,i}}$$
(2)

Finally, each aggregate sectoral good is a CES combination of goods from each firm \({c}_{h,\left(i,r\right)}\):

$${c}_{h,i}={\left(\sum_{r\in R}{\gamma }_{h,\left(i,r\right)}{c}_{h,\left(i,r\right)}^{\frac{{\sigma }_{h,i}-1}{{\sigma }_{h,i}}}\right)}^{\frac{{\sigma }_{h,i}}{{\sigma }_{h,i}-1}}$$
(3)

To purchase goods, households sell their capital endowment \({\omega }_{k,h}\) and a portion of their daily time endowment \({\omega }_{t,h}\) as labour \({l}_{h}.\) Households are also allocated a share \({\psi }_{h}\) of government tax income \(g\). With price index for consumption \({p}_{c,h}\) and price (wage) for the household’s labour commodity \({p}_{l,h},\) the monetary constraint is:

$${p}_{c,h}{c}_{h}={p}_{l,h}{l}_{h}+{p}_{k}{\omega }_{k,h}+{\psi }_{h}g$$
(4)

Households are also constrained by time per day. Besides labour and leisure time itself, commuting trips are required to supply labour, shopping trips are required to obtain goods, and leisure-associated trips are required to access activities associated with non-labour time. Together, these trips aim to cover all sources of household travel demand. With travel time prices \({v}_{l,h},\;{v}_{c,h}\) and \({v}_{t,h}\) given for each unit of labour, consumption and leisure respectively, the daily time constraint is:

$${l}_{h}+{t}_{h}+{v}_{l,h}{l}_{h}+{v}_{c,h}{c}_{h}+{v}_{t,h}{t}_{h}={\omega }_{t,h}$$
(5)

Maximising utility in Eq. (1) subject to constraints in Eqs. (4) and (5) yields demand equations for goods \({c}_{h,\left(i,r\right)}\left({{\varvec{p}}}_{{\varvec{g}}},{{\varvec{p}}}_{{\varvec{l}}},{p}_{k},{v}_{l,h},{v}_{c,h},{v}_{t,h}\right)\) and supply equations for labour \({l}_{h}\left({{\varvec{p}}}_{{\varvec{g}}},{{\varvec{p}}}_{{\varvec{l}}},{p}_{k},{v}_{l,h},{v}_{c,h},{v}_{t,h}\right)\). The derivation of these equations is provided in Online Appendix C. Thus, any decision on where to shop or how to allocate time to labour or leisure depends on travel times.

In addition to the above utility structure, residential location choice is optionally allowed for household agents at the discretion of the analyst. This migration behaviour is not directly integrated into the utility function (for reasons that will be discussed below), and instead operates at a separate level after agents receive their utility for each cycle of the model.

The mechanism for migration is as follows. Each agent observes the utilities of all agents that are similar to them in terms of employment but dissimilar in terms of residential location. Any differences in utility would be attributed to location choice (as agents are otherwise homogenous), meaning that agents are inclined to move the region where they expect to receive the highest utility.

Mathematically, the mechanism for migration is as follows:

$${\%}_{migration}=\frac{{u}_{max}-{u}_{current}}{{u}_{max}}$$

where \({\%}_{migration}=\) the percentage of household agents that migrate out of a given region in each model cycle. \({u}_{max}=\) the maximum utility an agent expects to receive when migrating to the corresponding region. \({u}_{current}=\) the utility the agent currently expects to receive if they do not migrate.

  1. 1.

    The assumptions behind this mechanism is as follows: Agents are unable to directly observe how changing residential location would influence their utilities, and must instead observe the influence of residential location indirectly via the utilities other agents receive.

  2. 2.

    Agents can accurately and fully observe the received utility of all other agents identical to themselves. This is a necessary condition to assume rational decisions made by agents as the model is not stochastic on the CGE side.

  3. 3.

    Cost of migration is not modelled.

    There is no cost when deciding to migrate between residential locations. There are no costs to purchasing a new property/renting, no income from selling existing property (if any), no time or monetary cost of moving etc. This is due to the lack of a real estate market, which other CGE models have included (Anas and Liu 2007) but lacking in this one as it is primarily a tool for transport appraisal and not for pure economic analysis.

  4. 4.

    Zones have infinite room and can fit infinite number of agents.

The mechanism outlined above is dissimilar to many CGE models that treat migration endogenously within a household agent’s utility function, and instead takes the approach that an agent does not include residential location as part of their utility assessment as they are unable to predict their expected utility when moving to a new location. This is done partially because agents are in actuality not able to predict their expected utility, as well as the fact that within the utility function of the household agents as presented in the model, agents are primarily deciding between consumption and leisure. The mechanism between this tradeoff is time, as consumption is the result of time spent working and leisure is the result of time spent not working. The choice of residential location does not seem an appropriate third decision to include as it is not a time-based decision.

By separating residential decisions from the endogenous utility function, the migration mechanism trades accuracy for simplicity and flexibility. While it is true that this mechanism is not a highly accurate representation of migration, it has the advantage of being able to be added or removed to the model at will by the analyst, since it is not endogenous to the utility function of the agents. The result is that the analyst can look at short-term effects of economic shocks (such as the construction/improvement of road infrastructure like the WestConnex) as well as gauge the rough long-term effects of such decisions. More detailed migration models can be included in future work, but for this study, the current formulation is believed to be sufficient.

Firms

Firms maximise profits subject to a monetary constraint, but since firms earn zero economic profit, their behaviour is modelled as a cost minimisation problem with production levels determined from market equilibrium. Production is specified as the nested constant elasticity of substitution (CES) function with constant returns to scale shown in Fig. 4. Like the utility function in “Households” section, the chosen nesting structure accords with conventional production functions to fit classic microeconomic assumptions and enable elasticity of substitution parameters to be borrowed from existing models.

Fig. 4
figure 4

Structure of production function

At the top level, output \({y}_{f}\) is a Leontief (perfect complements) combination of intermediate inputs from each industry other than transport \({x}_{g,f,j},\) imports \({x}_{I,f}\) and a primary factor composite \({x}_{p,f}\):

$${y}_{f}=\mathrm{min}\left(\frac{{x}_{g,f,j}}{{\mu }_{g,f,j}}\forall j\in J,\frac{{x}_{I,f}}{{\mu }_{I}},\frac{{x}_{p,f}}{{\mu }_{p,f}}\right)$$
(6)

Each intermediate input is a CES combination of goods from each firm \({x}_{g,f,\left(j,r\right)}\):

$${x}_{g,f,j}={\left(\sum_{r\in R}{\theta }_{f,\left(j,r\right)}{x}_{g,f,\left(j,r\right)}^{\frac{{\upsilon }_{f,j}-1}{{\upsilon }_{f,j}}}\right)}^{\frac{{\upsilon }_{f,j}}{{\upsilon }_{f,j}-1}}$$
(7)

However, \({x}_{g,f,\left(j,r\right)}\) represents the input of good \(\left(j,r\right)\) after it has been transported to firm f’s region. This is accomplished by a Leontief combination of the good itself \({x}_{s,f,\left(j,r\right)}\) and a freight margin \({x}_{m,f,\left(j,r\right)},\) where \({\rho }_{f,\left(j,r\right)}\) represents the units of transport required to relocate a unit of good \(\left(j,r\right)\) from its source to destination:

$${x}_{g,f,\left(j,r\right)}=\mathrm{min}\left({x}_{s,f,\left(j,r\right)},\frac{{x}_{m,f,\left(j,r\right)}}{{\rho }_{f,\left(j,r\right)}}\right)$$
(8)

The freight margin is assumed to be proportional to the travel time from source to destination and is supplied by the transport industry in the source region such that:

$${x}_{s,f,\left({i}_{T},r\right)}=\sum_{j\in J}{x}_{m,f,\left(j,r\right)}$$
(9)

The primary factor composite is a CES combination of labour \({x}_{l,f}\) and capital \({x}_{k,f}\):

$${x}_{p,f}={\left({\zeta }_{l,f}{x}_{l,f}^{\frac{{\varphi }_{f}-1}{{\varphi }_{f}}}+{\zeta }_{k,f}{x}_{k,f}^{\frac{{\varphi }_{f}-1}{{\varphi }_{f}}}\right)}^{\frac{{\varphi }_{f}}{{\varphi }_{f}-1}}$$
(10)

With price index \({p}_{g,f,j}\) per unit of intermediate input from each industry, price \({p}_{I,f}\) per unit of imports and price \({p}_{p,f}\) per unit of primary factor composite, as well as production tax rate \({\pi }_{f},\) the cost \({e}_{f}\) for firm f to produce \({y}_{f}\) units of output is:

$${e}_{f}=\left(1+{\pi }_{f}\right)\left({p}_{I,f}{x}_{I,f}+{p}_{p,f}{x}_{p,f}+\sum_{j\in J}{p}_{g,f,j}{x}_{g,f,j}\right)$$
(11)

The parameters of \({\theta }_{f,\left(j,r\right)}\) and \({\zeta }_{l,f}\) which are firm share parameter for intermediate input of good and firm share parameter for labour respectively are estimated using IO tables with the calibration equation defined in Robson (2018). \({\upsilon }_{f,j}\) and \({\varphi }_{f}\) which are the firm elasticity of substitution between regional sources of intermediate inputs and firm elasticity of substitution between labour and capital are taken out from ORANI model (Dixon et al. 1982; Armington 1969).

Minimising the cost of producing \({y}_{f}\) units of output in Eq. (11) subject to Eq. (6) yields demand equations for goods \({x}_{s,f,\left(i,r\right)}\left({y}_{f},{{\varvec{p}}}_{{\varvec{g}}},{{\varvec{p}}}_{{\varvec{l}}},{p}_{k},{{\varvec{\rho}}}_{{\varvec{f}}}\right),\) imports \({x}_{I,f}\left({y}_{f},{{\varvec{p}}}_{{\varvec{g}}},{{\varvec{p}}}_{{\varvec{l}}},{p}_{k},{{\varvec{\rho}}}_{{\varvec{f}}}\right),\) labour \({x}_{l,f}\left({y}_{f},{{\varvec{p}}}_{{\varvec{g}}},{{\varvec{p}}}_{{\varvec{l}}},{p}_{k},{{\varvec{\rho}}}_{{\varvec{f}}}\right)\) and capital \({x}_{k,f}\left({y}_{f},{{\varvec{p}}}_{{\varvec{g}}},{{\varvec{p}}}_{{\varvec{l}}},{p}_{k},{{\varvec{\rho}}}_{{\varvec{f}}}\right).\) The derivation of these equations is provided in Online Appendix C.

Foreign markets and government

In the CGE submodel, foreign markets and government are treated in a relatively simple manner. Their incorporation in the model is primarily to establish their structures such that they can be extended as required in future models.

Foreign markets provide a source of imports and export demand. As described in the sections above, the foreign market agent provides imports \({x}_{I,f}\) to firms at prices \({p}_{I,f}.\) Hence, there is no substitution between imported and local goods, although the volume of imports scales with firm production and changes in the costs of imports are reflected in the costs of goods. Foreign markets also demand quantity \({x}_{e,\left(i,r\right)}\) of each good \(\left(i,r\right)\) for export.

The government agent imposes an ad valorem tax on firm production at rate \({\pi }_{f}\) for firm f. The total tax collected \(g\) is then redistributed in full to households at share \({\psi }_{h}\) for household h.

Equilibrium

Equilibrium conditions for the goods, labour and capital markets are:

$$\sum_{f\in F}{x}_{s,f,\left(i,r\right)}+\sum_{h\in H}{c}_{h,\left(i,r\right)}+{x}_{e,\left(i,r\right)}={y}_{\left(i,r\right)},\;\; \forall \left(i,r\right)\in F$$
(12)
$${x}_{l,f}=\sum_{r\in R}{l}_{r,f},\;\; \forall f\in F$$
(13)
$$\sum_{f\in F}{x}_{k,f}=\sum_{h\in H}{\omega }_{k,h}$$
(14)

The zero profit conditions for firms are:

$${p}_{g,\left(i,r\right)}={p}_{g,f}=\left(1+{\pi }_{f}\right)\left({\mu }_{I,f}{p}_{I,f}+{\mu }_{p,f}{p}_{p,f}+\sum_{j\in J}{\mu }_{g,f,j}{p}_{g,f,j}\right), \;\;\forall f\in F$$
(15)

Finally, the full redistribution condition for the government agent is:

$$g=\sum_{f\in F}{\pi }_{f}\left({p}_{I,f}{x}_{I,f}+{p}_{p,f}{x}_{p,f}+\sum_{j\in J}{p}_{g,f,j}{x}_{g,f,j}\right)$$
(16)

Import prices and export demand are fixed for the foreign market agent.

Trip generation

In the CGE submodel, both household and production activities are facilitated by vehicle trips, with costs derived from the transport submodel. This section describes how these trips are generated for the transport submodel.

Regional household trips

Households undertake three types of trips—commuting, shopping and leisure-associated i.e. all trips other than commuting and shopping—in time periods

A commuting trip is assumed to comprise a daily return journey from the household’s region of residence to their region of employment. With daily trip generation rate \({\delta }_{l,\left(r,\left(i,{r}^{{\prime}}\right)\right)}\) for each unit of labour, time period proportion for outbound legs and time period proportion for inbound legs, the number of commuting trips \({d}_{l,r,{r}^{{\prime}}}\) in time period from \(r\in R\) to \({r}^{{\prime}}\in R\) is:

(17)

Likewise, a shopping trip is assumed to comprise a daily return journey from the household’s region of residence to the good’s region of production, which is also assumed to correspond with the retail location. A retail sector can be added in calibrating the model to ensure shopping patterns are realistic. With daily trip generation rate \({\delta }_{c,\left(r,f\right),\left(i,{r}^{{\prime}}\right)}\) for each unit of each good, time period proportion for outbound legs and time period proportion for inbound legs, the number of daily shopping trips \({d}_{c,r,{r}^{{\prime}}}\) from \(r\in R\) to \({r}^{{\prime}}\in R\) is:

(18)

Finally, households are assumed to spend their leisure time in fixed proportions across all regions. With daily trip generation rate \({\delta }_{t,\left(r,f\right),{r}^{{\prime}}}\) for travel to region \({r}^{{\prime}}\in R\) for each unit of leisure, time period proportion for outbound legs and time period proportion for inbound legs, the number of daily leisure-associated trips \({d}_{t,r,{r}^{{\prime}}}\) from \(r\in R\) to \({r}^{{\prime}}\in R\) is:

(19)

Therefore, the number of household trips in time period from \(r\in R\) to \({r}^{{\prime}}\in R\) is:

(20)

Regional freight trips

Production activities of firms involve three types of commodity flows—local trade of goods, imports and exports. All of these flows require corresponding one-way trips in time periods . To accommodate imports and exports, an external region \({r}_{e}\) is added to the set of regions R, forming set \({R}_{e}=R\cup {r}_{e}\) to represent trips into and out of the model regions.

Local trade of goods requires freight from the region of production to the region of use. With daily trip generation rate \({\delta }_{L,\left(i,r\right),\left(j,{r}^{{\prime}}\right)}\) for each unit of good traded and time period proportion , the number of local freight trips \({d}_{L,r,{r}^{{\prime}}}\) in time period from \(r\in R\) to \({r}^{{\prime}}\in R\) is:

(21)

Flows of imports require freight from the external region to the region of use. With daily trip generation rate \({\delta }_{I,\left(i,r\right)}\) for each unit of imports and time period proportion the number of import trips in time period from the external region to \(r\in R\) is:

(22)

Similarly, flows of exports require freight from the region of production to the external region. With daily trip generation rate \({\delta }_{E,\left(i,r\right)}\) for each unit of exports and time period proportion the number of export trips in time period from \(r\in R\) to the external region is:

(23)

Therefore, the number of freight trips in time period from \(r\in {R}_{e}\) to \({r}^{{\prime}}\in {R}_{e}\) is:

(24)

Conversion to travel zone trips

Each region in R is then split into a number of travel zones Z for the transport submodel. In the example in Figs. 5a and 5b, region \({r}_{1}\) has been split into travel zones \({z}_{1,1}\) and \({z}_{1,2},\) and region \({r}_{2}\) has been split into travel zones \({z}_{2,1}\) and \({z}_{2,2}\). As a result, what was previously one OD pair at the regional level becomes four OD pairs at the travel zone level. Travel zones are also allocated to the external region at access points such as ports, airports, intermodal facilities and roads that enter and exit the boundary of the modelled regions.

Fig. 5
figure 5

Correspondence between regional and travel zone trips

In the model, the two types of OD trips (regional and travel zone) are treated separately, i.e. the regional OD trips are calibrated to a dataset of total household and freight trips by time period, and the travel zone OD trips are calibrated as a transport model with separate household demand (using light vehicles) and freight demand (using both light and heavy vehicles). The two levels of trips are then connected by assuming that demand in the travel zone OD pairs change in the same proportions as their corresponding regional OD pairs. This implies that spatial trip patterns within regions do not vary as the economy shifts. A justification for this assumption is that land use patterns within regions are controlled by local planning laws, and thus are likely to scale together.

From this assumption, the total daily trips at the regional level from the CGE submodel and as split into time periods may be represented by a separate set of travel zone OD demands for each time period (e.g. hourly flows). These travel zone OD demands need not sum to the total demand from the CGE submodel; they just need to be representative of typical conditions across the time period. In addition, the model allows for travel zone trip patterns to differ between household and freight trips, and therefore the calculations for household and freight trip costs are kept separate.

Furthermore, it may be computationally difficult to simulate the full road network due to the number of iterations required in the full model or the spatial extent of the CGE submodel. Figure 5c shows an example road network \(A-B-C-D\) connecting the travel zones, with the centroids represented by the triangles. While a trip from \({z}_{1,1}\) to \({z}_{2,1}\) is likely to use the given road network by first accessing node B, travelling from B to C, and then leaving node C, a trip from \({z}_{1,2}\) to \({z}_{2,2}\) may travel via local road networks not represented in the model. Therefore, to facilitate the use of more limited road networks, the travel zone OD trips are pre-assigned into:

  • A set of direct trips that bypass the road network \(\left(N,L\right).\) The cost of a direct trip in time period from \(z\in Z\) to \({z}^{{\prime}}\in Z\) is fixed.

  • A set of network trips that are assigned to the road network \(\left(N,L\right)\). The cost of a network trip in time period from \(z\in Z\) to \({z}^{{\prime}}\in Z\) comprises a fixed cost from the centroid of z to the nearest node \(n\in N,\) a variable cost from \(n\) to \({n}^{{\prime}}\in N\) (the nearest node to \({z}^{{\prime}}\)), and a fixed cost from \({n}^{{\prime}}\) to the centroid of \({z}^{{\prime}}\).

This pre-assignment is a proposed method to simplify the transport model, which may be required when using previously developed transport networks from existing models, as the integrated CGE and transport model requires multiple iterations to converge to equilibrium. The effect is that route choice and congestion are not simulated for trips that are not likely to use the simulated transport network, according to the pre-assignment criteria, which will likely reduce the magnitude of impacts of congestion. A fraction of local traffic using higher order networks might be considered with adjusting route choices depending on circumstances where travel times are relatively inelastic for small changes in flows.

The loading of trips into the network to determine variable costs from the transport submodel is captured by the multipliers and for household and freight trips respectively. For freight trips in particular, the multiplier covers the allocation of trips to the external region via access points and also captures the conversion of trips into passenger car units. With the number of regional household OD trips in time period from r to \({r}^{{\prime}}\) given in Eq. (20), the number of regional freight OD trips in time period from r to \({r}^{{\prime}}\) given in Eq. (24) and a set of background trips for preloading into the transport submodel, the number of network OD trips in time period from \(n\) to \({n}^{{\prime}}\) is:

(25)

For each regional OD pair, the total corresponding travel zone OD trips are captured by the multipliers and for household and freight trips respectively, and the fixed costs for both direct and network trips are captured by the multipliers and for household and freight trips respectively, as explained in “Trip costs” section. Depending on the data used, multipliers may exist for intra-zonal trips stemming from intra-regional activities in the CGE submodel. However, since these trips would have the same origin and destination, their cost would be zero.

Transport submodel

Following the generation of network OD trips , the trips are assigned to the road network \(\left(N,L\right)\) for each time period in the transport submodel to yield OD costs . In user equilibrium (UE) assignment, every user travelling from \(n\) to \({n}^{{\prime}}\) is assumed to independently choose a path that minimises their travel cost. The model is solved by finding an equilibrium set of flows such that each user cannot find a better path (route) that lowers their travel cost, subject to all other users remaining on their current paths. This is known as Wardrop's First Principle (1952)—as a result, all used paths from \(n\) to \({n}^{{\prime}}\) have the same travel cost, which is lower than the travel cost for any unused path.

This model adapts a nonlinear programming formulation of UE assignment from Sheffi (1984), based on the Frank–Wolfe algorithm (2006), that only requires a set of link flows and not the enumeration of paths. The algorithm to solve for a set of OD costs for each time period is described in Online Appendix D.

Trip costs

The final stage of the model, prior to the next iteration, is to calculate the travel time prices \({v}_{l,h},\;{v}_{c,h,\left(i,r\right)}\) and \({v}_{t,h},\) as well as the freight requirements \({\rho }_{f,\left(j,r\right)}\) for local trade from OD costs Freight costs for imports and exports do not impact the CGE submodel as both import prices and export demand are exogenous.

Household trip costs

Firstly, the number of household travel zone trips associated with each regional OD pair is:

(26)

Next, the sum of fixed costs for the household travel zone trips associated with each regional OD pair is:

(27)

Finally, using the variable costs from the transport submodel, the average OD cost for the household travel zone trips associated with each regional OD pair is:

(28)

The travel time prices \({v}_{l,h}\), \({v}_{c,h,\left(i,r\right)}\) and \({v}_{t,h}\) are then:

(29)
(30)
(31)

Freight trip costs

The calculation of regional OD costs for local freight trips parallels that of household trip costs. The number of freight travel zone trips associated with each regional OD pair is:

(32)

The sum of fixed costs for the freight travel zone trips associated with each regional OD pair is:

(33)

Using the variable costs from the transport submodel, the average OD cost for the freight travel zone trips associated with each regional OD pair is:

(34)

With the amount of freight margin required per unit of trip cost \({\epsilon }_{\left(i,r\right),\left(j,{r}^{{\prime}}\right)},\) the freight requirement \({\rho }_{\left(i,r\right),\left(j,{r}^{{\prime}}\right)}\) for firm \(\left(i,r\right)\) purchasing good \(\left(j,{r}^{{\prime}}\right)\) is:

(35)

Calibration

The proposed model has been calibrated for testing purposes using data for the Sydney transport network and economy. While the data was derived from real measurements, it should be treated as synthetic as it has not undergone thorough validation. The model's performance is assessed by convergence of model parameters over iterations. Five parameters, namely the price of goods, the price of labour, the output of firms, demand, and utility, were chosen and their values were observed during the model run. The percentage change in these variables compared to their values in the previous iteration is computed, and by the third iteration, the variables had reached a point of convergence with less than 0.01 change compared to the previous value in the previous iteration.

Transport submodel calibration

The calibration of the transport submodel was heavily based on the 2011 Sydney Strategic Travel Model (STM) (Bureau of Transport Statistics 2011), which is currently used as the standard four-step transport planning model in Sydney. The 2,404 travel zones in the STM covering the contiguous Sydney urban area were adopted for set Z, as well as the four time periods of ‘AM peak’, ‘interpeak’, ‘PM peak’ and ‘night’ for set T.

The road network is a condensed version of the 2011 STM network, which originally contained 63,747 links and 25,268 nodes. Even though this network is used in an existing four-step model, the multiple iterations required between the CGE and transport submodels made it necessary to simplify the network to a core structure appropriate for strategic modelling, in order to keep computation times reasonable. This involved removing links that would be insignificant to the model results, combining links where possible and checking for network consistency.

Starting from the original STM network, all links outside the Sydney SA2 regions R as well as all local and sub-arterial roads (and corresponding isolated nodes) were removed. This resulted in a number of extraneous midblock nodes along the remaining arterials, highways and freeways which no longer served any technical purpose. A macro was written to scan through every node in the network, identify midblock nodes and remove them by combining links. Further reductions were made by reclassifying all roads to either arterials or freeways, simplifying freeway interchange structures by combining freeway ramp lanes with mainline lanes, and removing additional midblock nodes along freeways. The final step was to rectify inconsistencies generated in the network that would prevent the model from operating correctly. All loop links (links that had identical start and end nodes), duplicate links (where a set of links shared common start and end nodes), dead-ends and isolated nodes were removed. The final road network for Sydney, shown in Fig. 6, contains 1898 links and 791 nodes representing all of Sydney’s arterials and freeways.

Fig. 6
figure 6

Base road network for Sydney

Next, the travel zone OD matrices for each time zone were extracted from the STM. Each travel zone centroid was then allocated to the nearest node by Euclidean distance, and approximate travel times by local or arterial roads were compared to allocate the travel zone OD trips into direct or network trips. This provided network OD matrices to enable the calibration of the link parameters. Following a number of runs of the transport submodel in which the link parameters were adjusted until the travel times closely matched those observed in Google Maps, the link parameters were finalised as:

  • Arterials 50 km/h free-flow speed; 1000 veh/h capacity; BPR multiplier of 1, BPR power of 5.

  • Freeways 75 km/h free-flow speed; 1500 veh/h capacity; BPR multiplier of 0.1, BPR power of 6.

CGE submodel calibration

Once the transport submodel calibration process is complete, the CGE model is calibrated to match travel demand “exactly”, which means that there is no need for the transport demand parameters to be estimated statistically.Calibration of a CGE model involves the specification of a benchmark economy that is assumed to be at equilibrium. The parameters of the CGE model are then back-calculated from the benchmark such that the model replicates the benchmark when run without changes. This is in contrast to the calibration of many transport models—it is an exact calibration, rather than a statistical one.

The calibration of the CGE submodel mirrors the calibration of the Robson and Dixit model (2017a). As in the Robson and Dixit model, the 14 Statistical Areas Level 4 (SA4) (Pink 1270) covering Sydney were chosen for the set of regions R, and the economy was aggregated to two industries: transport, and all others.

Prior to the calibration of the CGE submodel parameters, the multipliers , and for linking trips and trip costs between the CGE and transport submodels were derived. Data from the 2011 Household Travel Survey (Bureau of Transport Statistics 2013) was first obtained to determine regional OD demand by time period and purpose. Next, a script iterated through each travel zone OD pair and allocated trips and trip costs to the corresponding regional OD pair. This generated the required multipliers and enabled the calculation of travel time prices and freight margins after running the transport submodel.

The development of the calibration data, including sources of IO data, is detailed in Robson and Dixit (2017b) and the calibration process is described in Robson and Dixit (2017a).

Application

Sydney is the capital city of New South Wales, Australia and its 2016 population of 5,030,000 is the largest in Australia (Australian Bureau of Statistics 2017). The Sydney economy is dominated by the professional and financial services sectors (SGS Economics & Planning 2017), with economic activity concentrated in the Central Business District (Region 3). Several secondary but significant hubs of activity are also spread throughout the urban area. WestConnex is the largest road project by distance and funding under construction in Sydney. It comprises a largely underground 33 km long motorway through inner Sydney, connecting the M4 Motorway from the west of the city to the Sydney Central Business District, before proceeding south to Sydney Airport, Port Botany and the M5 Motorway. Construction on the A$16.8 billion project commenced in 2015 and is anticipated to be completed in 2023. The project is expected to deliver a benefit–cost ratio of 1.88 (NSW Government 2015).

Prior to WestConnex, the 22 km stretch of the M5 Motorway between Prestons and Beverly Hills was widened from two to three lanes in each direction. WestConnex itself comprises four stages:

  • Stage 1A M4 Widening: an additional lane in both directions on the existing 7.5 km stretch of M4 Motorway between Parramatta and Homebush.

  • Stage 1B M4 East: a new 6.5 km, mostly tunnelled motorway between the current terminus of the M4 Motorway in Homebush and the intersection of Parramatta Road and Wattle Street in Haberfield.

  • Stage 2 New M5: a new 9 km tunnelled motorway duplicating the existing M5 East Motorway from King Georges Road in Beverly Hills to a new interchange in St Peters.

  • Stage 3 M4-M5 Link: a new 8 km tunnelled motorway between the terminus of the M4 East in Haberfield and the terminus of the New M5 in St Peters.

New links were added to the road network in Fig. 6 and existing links were modified to represent the M5 widening and each stage of WestConnex. These links are shown in Fig. 7. The model was then run for the following six scenarios simulating the progressive introduction of each stage:

  • Scenario 1: 2011 base.

  • Scenario 2: 2011 base plus M5 widening.

  • Scenario 3: 2011 base plus M5 widening and WestConnex Stage 1A.

  • Scenario 4: 2011 base plus M5 widening and WestConnex Stages 1A and 1B.

  • Scenario 5: 2011 base plus M5 widening and WestConnex Stages 1A, 1B and 2.

  • Scenario 6: 2011 base plus M5 widening and WestConnex Stages 1A, 1B, 2 and 3.

Fig. 7
figure 7

Road network for model application

Results and discussion

Table 1 provides a summary of results for Scenarios 2 to 6 in relation to Scenario 1, including welfare as measured by equivalent variations and percentage changes in non-transport production, leisure hours, household travel costs and household travel demand. The results from Table 1 utilise the version of the model without residential location choice for household agents. Table 2 provides the same summary for the version of the model including location choice. It should be noted that the total welfare effect is indicated by EV, while double counting can occur through simple impact addition It is also important to note that the simplistic nature of the residential location choice mechanism used for the model leads to economic results that are highly skewed and do not represent the true impacts of the WestConnex project. Results from the model that are relevant to travel times and travel demand, however, can be useful.

Table 1 Summary of Scenario 2 to 6 results in relation to Scenario 1—no migration
Table 2 Summary of Scenario 2 to 6 results in relation to Scenario 1—migration inclusive

Validation of transport appraisals that handle WEIs can benefit from comparison of model implied value of time and real derived value of time. The value of time reported in the Transport for NSW report is 16.26$ per hour per person for a private car (TFNSW 2018). The developed CGE model provides a range of value of time for different people based on their residence and employment region, as well as the industry in which they work. The average value of time value from the model is equal to 17.62$ per hour per person for a private car which indicates realistic implied value of time.

In Figs. 8 and 9, spatial pattern of equivalent variations (EV) and change in non-transport production output are provided respectively to present the spatial pattern of economic impacts.

Fig. 8
figure 8

Spatial pattern of equivalent variations (EV) per capita (2011 A$)

Fig. 9
figure 9

Spatial pattern of change in non-transport production output

Scenario 1 was run to ensure the calibration data was replicated properly. Scenarios 2 to 6 were run for 20 iterations each, although the results largely converged to a band around 5% to 10% above and below the mean results within three iterations. These fluctuations were likely due to convergence criteria in the submodels being set too loosely, which is a subject for future investigation. Each model iteration took around 2 h to solve, almost entirely due to the transport submodel requiring around 30 min to converge for each time period. The transport submodel can be sped up in future models by implementing more efficient assignment algorithms and using multiple cores for processing.

For the model that did not include residential migration (i.e. for results presented in Table 1), in Scenario 2, the widening of the M5 Motorway resulted in a total increase in welfare of $A259.10 million per year. The largest beneficiaries per capita were residents in Region 9, largely due to the M5’s role as the primary road connection between Region 9 and the Sydney Central Business District. Correspondingly, non-transport production in Region 9 increased, but Region 13 had the largest increase due to its higher concentration of industrial activity. Residents in Regions 3, 5, 13 and 14 near the route of the M5 also experienced significant gains in welfare. On the other hand, residents in Region 8 experienced a slight disbenefit due to poorer traffic conditions from induced demand. The addition of lanes on the M4 Motorway in Scenario 3 led to more evenly spread gains in welfare across the metropolitan area, totalling A$300.17 million per year. Regions 1, 8 and 10 experienced the fewest benefits, being some of the furthest regions from the improvements. In general, the M4 widening in Scenario 3 only resulted in marginal welfare improvements from Scenario 2, with leisure time, travel time and household travel demand similar to the M5 widening alone.

The introduction of new WestConnex links between the M4 and M5 via the Sydney Central Business District in Scenarios 4, 5 and 6 led to significant welfare gains of A$593.14 million, A$881.83 million and A$1,602.84 million per year respectively, with every region experiencing improvements. In these three scenarios, Region 3 covering the Sydney Central Business District enjoyed the largest per capita increase in welfare, reflecting its position around the centre of WestConnex. In Scenario 4, the benefits were relatively spread across the metropolitan area, but over the progressive introduction of WestConnex, the benefits shifted towards the inner regions served by the new motorway. By Scenario 6, the inner city regions of 4, 5 and 6 in addition to Region 3 were the largest beneficiaries, with outlying regions in particular Regions 8 and 10 again experiencing the fewest benefits. In terms of non-transport production, Regions 3 to 6 correspondingly experienced the largest proportional increases in output due to their proximity to WestConnex. Production reduced in some outlying regions due to competition from the inner regions. In all scenarios and for all regions, the output of the transport industry shrunk in line with the reduction in freight requirements from improved travel times.

For results presented for the model including migration (i.e. for Table 2), it is important to note the large EV values presented in Table 2 should not be taken at face value. The mechanism of residential migration presented in this model is simplistic, and is intended to have higher flexibility in place of accuracy. It is likely that the lack of migration costs and the assumption that regions can contain an infinite number of household agents heavily skews the results. As such, the EV values presented in Table 2 should not be directly compared with those presented in Table 1.

Instead, the sign of the EV values obtained is noted to be mostly negative, which suggests that the WestConnex project may provide negative EV in the very long run once agents are able to move to their location of choice. It is also possible that the changes in utilities household agents experience as a result of their relocation is very large compared to the travel time changes the WestConnex project provides. It is more likely that the latter case is true, as the assumptions in the model of no relocation costs and regions having infinite housing capacity would incentivise heavy congregation in city CBD regions (region 13). Better modelling of residential location choice in future studies would improve the analytic capabilities of the model.

In both iterations of the model, however, the comparison of different scenarios' outcomes can be used as a decision-making criterion, policy recommendation, and economic implication. For Table 1, scenarios 4, 5, and 6 can be considered if total welfare gains are a priority, as these scenarios resulted in significant total welfare gains. In the same vein, the negative welfare impact of scenarios 2 and 3 in region 8 should be considered when making decisions on these two scenarios. If equal benefit distribution across regions is a key decision-making criterion, scenarios 3 and 4 can be chosen because they have the most evenly distributed welfare impact with the lowest normalised standard deviation of all scenarios in terms of welfare effect. If a particular region has received additional attention, scenarios can be chosen based on their impact on that region. As illustrated in Fig. 10, scenarios 4,5,6 benefit regions 3 and 6, respectively, whereas scenarios 2 and 3 benefit regions 9 and 14, respectively. Non-transport production output, labour time, leisure time, household travel time, and household travel demand are some of the other criteria provided by the model for multi-criteria decision making.

Fig. 10
figure 10

Regions equivalent variations (EV) per capita (2011 A$)

For Table 2 results, decisions should be made in line with the process outlined immediately above, but instead of considering EVs (which are likely highly inaccurate and reflect the results of the migration process rather than the infrastructure project), consideration should instead be made for the total travel time that agents experience, with emphasis placed on minimising travel times.

Figure 11 shows changes in link volumes from Scenario 1 to Scenario 6. From Fig. 11, traffic was diverted from parallel arterial routes onto WestConnex and the widened M5. On the other hand, many links feeding the motorway network experienced increases in demand. Overall, there was a 0.1737% increase in household travel demand (induced demand) across the whole metropolitan area. The network improvements led to a reduction in household travel time of 3.051%, even after accounting for induced demand. The travel time saved was retained by households as leisure time, and the relative reduction in the cost of leisure travel compared to commuting travel led to further substitution towards leisure time from labour time.

Fig. 11
figure 11

Change in link volumes from Scenario 1 to Scenario 6

To provide more detailed information about the transport related indicators, Tables 3 and 4 present the percentage change in travel time for different activity types. Table 3 provides the information for the model without residential location choice while Table 4 includes residential location choice. For Table 3, in all scenarios, leisure travel times reduce by the greatest proportion, followed by commuting and shopping. In each subsequent scenario (representing the progressive introduction of WestConnex), travel times for all three purposes continue to decrease, which is completely matching the intuition about the explanation of the network. Nonetheless, the essential point here would be the significance of such improvement in the level of service provided by the network expansion compared to the cost of the project and other economic benefits discussed under Tables 1 and 2.

Table 3 Total travel times (percentage change from Scenario 1)—no migration
Table 4 Total travel times (percentage change from Scenario 1)—migration inclusive

The results in Table 4 are slightly less intuitive. The greatest reduction in travel time is commute time in this model iteration instead of leisure travel time. There are instead increases in travel time in scenario 2, but this returns to being decreases in travel time in scenarios 3–6. There is still a decreasing trend in travel times as the project continues (with a slightly smaller decrease in scenario 4 compared to 3). The increases in scenario 2 is likely a result of the changes in the network due to migration causing overcrowding in certain regions. As the project has the smallest impacts at this stage, it is expected that changes in travel time caused by this overcrowding would be significant when compared to the reductions the project would provide. As the project progresses however, the reductions are now significant enough to offset the initial increases. It is worth noting that the large reductions in commuting travel time is likely partially due to the migration; household agents are likely to migrate to locations closer to their place of employment, and in so doing contributes to a reduction in commute travel time.

Another advantage of the proposed transport-based CGE model is that accessibility indicators derived from the true transport (the relatively close to the real world transport network of this paper) network can be extracted in interaction with economic indicators. Table 5 and 6 presents labour and retail accessibility based on accessibility for each region being estimated according to the inverse of exponential of travel time multiplied by labour and retail opportunities for each firm shown in Eqs. 36 and 37 (Goodwin 2019) with provided graphical presentation in Fig. 12.

Table 5 Accessibility values for labour and retail opportunities—migration inclusive
Table 6 Accessibility values for labour and retail opportunities—no migration
Fig. 12
figure 12

Graphical presentation of accessibility values for labour and retail opportunities

$$\mathrm{Labour\; accessibility}=\sum_{i=1}^{i=14}\frac{Labour\; demand \,(i)}{\mathrm{exp}\left(weight \times travel time \,(i)\right) }$$
(36)
$$\mathrm{Retail\; accessibility }=\sum_{i=1}^{i=14}\frac{Good\; production\, (i)}{\mathrm{exp}\left(weight \times travel time\right)\, (i)}$$
(37)

Table 5 shows changes in labour accessibility for residents in each region, in the migration inclusive model. Table 6 shows the same for the model with migration included. Labour accessibility is calculated as the sum of hours demanded by each firm, weighted by an exponential function of the travel time to each firm. The changes in accessibility generally align with changes in production as reported in Table 1 in the manuscript. This table also shows changes in retail accessibility for residents in each region. Similarly, retail accessibility is calculated as the sum of non− transport output from each firm, weighted by an exponential function of the travel time to each firm.

Table 6 shows generally negative changes in both labour and retail accessibility. As these metrics measure retail and labour opportunities, they are likely to be highly skewed due to migration (similar to Table 2’s results), and as such is likely not very useful for informing policy planning in the current state. With better modelled migration, these metrics would be more useful.

To further assess the accessibility formula used in estimation of the results of the previous table, Figs. 13 presents a sensitivity analysis of the weight factor in the accessibility function for labour accessibility values of Scenario 6 discussed in the previous table, and Fig. 14 shows a sensitivity analysis of the same for retail accessibility, varying from 0.5 to 3, while it was set to 1 in the result presented in Tables 5 and 6.

Fig. 13
figure 13

Sensitivity analysis of weight factor in labour accessibility function

Fig. 14
figure 14

Sensitivity analysis of weight factor in retail accessibility function

It can be seen from the figures above that the percentage change in the estimated accessibility values can be linearly proportional to the accessibility weight factor of travel time in the impedance function.

Sensitivity analysis

To determine the model's empirical robustness, a sensitivity analysis is performed. This is accomplished by altering particular input variables one at a time to examine the degree of change in the model's outputs; significant unintuitive changes in the model’s outputs from minor input changes would suggest potential model instability.

Testing is carried out on variables from both the CGE and the transport submodels of the integrated model. Endowment of capital (\({e}_{k}\)) and endowment of time (\({e}_{t}\)) from CGE submodel and link free flow costs (\({z}_{0}\)) and link capacities (\({q}_{c}\)) from transport submodel are the variables that have altered.

The two CGE submodel variables are selected as they are easy to understand motivators for household agent behaviour. Changes in the model outputs should be able to be reasonably intuitively understood based on changes in these variables. Abnormal changes are therefore evidence for model instability. The transport submodel variables were similarly selected as they are intuitive to understand. Additionally, typical hard infrastructure upgrades primarily affect these two aspects of transportation, and so ensuring they do not contribute to significant model instability is especially important for policy considerations.

For each variable outlined above, four tests are conducted with reducing and increasing the input variable by 5% and 10%. The stability of outputs was then assessed by comparing results to the base model. The equivalent variation (EV) of the network and the percentage change in income compared to total household income in the base scenario are calculated.

Tables 7 and 8 shows the sensitivity analysis on the impact of variables from the CGE submodel and transport submodel on EV and the percentage change in income compared to total household income in the base scenario.

Table 7 Sensitivity analysis on impact of endowment of capital (\({e}_{k})\) and endowment of time (\({e}_{t})\) on EV and the percentage change in income compared to total household income in base scenario
Table 8 Sensitivity analysis impact of link capacities (\({q}_{c}\)) and link free flow costs (\({z}_{0})\) on EV and the percentage change in income compared to total household income in base scenario

Changes in the model outputs for income and utility are intuitively explained from the input variable changes, where any additional costs or reduction in endowments resulted in negative EVs and the reverse resulted in positive EVs with no significant fluctuations in the model outputs. Changes Other economic model outputs (the goods gap and labour gap) follow an intuitive trend. For example, reductions in agent capital endowments lead to a large labour gap in each region, as less labour is supplied at the same demand. In the same line, increasing in network links capacity and decreasing of the links free flow cost resulted in positive EVs and income change as expected without any substantial changes in the model outputs.

Conclusion

Existing models in the four-step transport planning framework can simulate travel demands and interactions in transport networks with a high degree of detail, but many rely on fixed economic parameters for markets other than transport and land. With recent interest in the wider economic impacts of transport, CGE models have been increasingly applied to analyse transport projects and policies. Since CGE models are simulators of entire economies, they are able to estimate the magnitude and distribution of impacts both spatially and through markets, while also providing a rich set of outputs including GDP and welfare.

This paper presented an integrated CGE and transport model and calibrated it for the Sydney road network and economy. The contributions of this model are both conceptual—in applying the CGE submodel as a trip generator and distributor in the four-step framework—and technical—in developing the linkages between the submodels that enable them to be calibrated as full CGE and transport models. Several existing models link CGE and transport network models, but very few incorporate detailed networks or generate a complete set of travel demands (including household and freight), both of which are necessary in the four-step framework. In the presented model, the CGE submodel simulates the behaviour of households and firms interacting in markets, where their behaviour takes trip costs into account. The model then generates trips as a derived demand from agent activities and assigns them to the road network according to user equilibrium, before feeding back trip costs to the CGE submodel. The model was tested by simulating the WestConnex motorway project under construction in Sydney. The results showed significant increases in welfare for regions close to the improvements.

Like any model, the presented model was built for a purpose—to provide a tool for the rapid strategic assessment of transport projects and policies when economic responses cannot be assumed to remain static. Compared with some activity-based models and discrete choice modelling frameworks, CGE models tend to represent the behaviours of travel markets in less detail, but do provide a comprehensive structure that allows for the incorporation of more complex models where required.

There are a multitude of avenues for expansion of the presented model. The most immediate feature required in the four-step framework is mode choice. From this, land markets can be incorporated as additional commodities in the CGE framework to replicate the behaviours of a LUTI model. The inclusion of a better and more accurate residential location choice mechanism within the model could also lead to more accurate economic analysis. Incorporating monopolistic competition as per new economic geography models could also be investigated as it would enable the assessment of wider economic impacts stemming from agglomeration. This could lead to further improvements in productivity from increasing returns to scale, and correspondingly worse congestion. Further transport interactions could be added, for example business trips, which may have a significant impact on the productivity of service industries that dominate cities like Sydney.

While the developed model is investment-driven due to expenditures on capital stock and investment activities, other studies have focussed on models with saving driven closure. For instance, Cardenete and López-Cabaco (2021) provided saving account where the closure establishes the level of investment. In another study by Kim et al. (2004a), aggregate savings determine investments. There is only one capital market, and savings come from four main sources: household savings, regional production sector corporate savings, private borrowings from overseas, and government savings. Future work can be directed to providing a saving driven closure for the model. When defining macro closure, the impact on macroeconomic factors should also be considered. Closure rules have a greater impact on macroeconomic factors than on distributional ones, where the effect of the macro adjustment on the size of income distribution is generally minimal and indifferent to the closure rule (Adelman and Robinson 1988).

While the proposed model in this study is used to assist in capturing the wider economic benefits of implemented policies with a focus on detailed transport behaviour, various studies have assessed the impact of financing methods and expenditure patterns on the economy. Any infrastructure improvement, such as increased capability, must be funded. It should be noted that infrastructure funding may have a significant impact on welfare. The economic effects depend on the magnitude and source of funding (i.e., tax revenues, government funding, or private funding) (Kim et al. 2017). For instance, the effects of the transportation investment on the Korean economy is analysed by Kim (1998). According to results, financing alternatives for infrastructure projects impact elasticities of infrastructure investment such as GDP, exports, private utility, and inflation. The effect of transportation investment on inflation is found to be minimised if transportation investment expenditure is entirely funded by tax revenues. In the same line, Kim et al. (2011) investigated the effects of highway development using various financing methods, such as using the current tax system or imposing earmarked taxes. According to the findings, imposing regional earmarked taxes has a greater impact on income growth and the reduction of regional income inequality than the current tax system. In addition, Chen et al. (2017) investigated transportation infrastructure public–private partnership's socioeconomic impacts using a dynamic computable general equilibrium (CGE) model. The simulation results show that public–private partnerships generate greater positive gross economic output and welfare impacts compared to traditional public financing models by reducing the regional economy's collected tax burden. The effect of funding can be divided into two categories: short-term construction and long-term operation and maintenance (Kim et al. 2004b). While this study considers the yearly impact, the Financial Computable General Equilibrium (FCGE) model proposed by Kim et al. (2017) can be used to model the short- and long-term economic impacts of financing.

The focus of the current study is on detailed transport behaviour. Greater focus on including financing method (defining source of financing) and allocation of expenditure pattern could produce an overall impact assessment which could be explored in further research. In addition, future work could be directed to incorporate a saving account in the model to provide a complete impact assessment. The savings account can be used to determine whether transportation development costs will be paid for from government funds or from private savings. The resulting model would provide a universal tool in which a modeller could input proposed changes to a transport network, and from the one model estimate link flows and induced demand, extract welfare metrics and forecast long-term changes to the urban economy.