1 Introduction

Empirical findings of travel behaviour studies have provided evidence that commuters presented with integrated multimodal traveller information (IMTI) in congested travel environment, exhibited a certain degree of mode switching [14]. However, from the transportation system’s perspective, the potential efficiency and success of various policy measures for eliminating or reducing traffic problems in metropolitan areas largely depend on the level of public acceptance of the traveller information [57]. A variety of travel behaviour models based on revealed and stated preference surveys have been developed to study the nature and dynamics of traveller’s choice behaviour [811]. Despite the obvious need for assessing user acceptance and the potential impacts of these systems in terms of improving traffic conditions for individual drivers and the overall transportation system, there is still a lack of models to evaluate their full impacts. The idealised analytical models are not appropriate for observing commuters’ behaviour dynamics as they lack the ability of representing decision-making processes occurring in a complex dynamic environment. On the other hand, treating travel behaviour within the framework of discrete choice models may not be that effective because, with the spatial and temporal dimensions, the choice set becomes astronomically large if one wishes to gain adequate levels of precision in forecasts. Furthermore, these tests and experiments are very expensive to conduct and do not allow for effective evaluation of different alternatives. Realising the limitations of other approaches with respect to the dynamics of travellers’ behaviour under the influence of real-time information, several researches have invested considerable effort in laboratory experiments being conducted with the support of the simulation models [1216].

Currently, the simulation models are not sufficiently capable such that all the existing ITS components can be evaluated. Furthermore, in many modelling applications a more disaggregated approach to model socio-economic processes is generally desirable to reduce the model aggregation [1721]. These two observations together have led inevitably to the adoption of a sophisticated micro-simulation platform to model the ITS systems in an intelligent environment, where the simulation is closer to reality based on the production-rule system approach. The premise underlying a production-rule system is that choice behaviour in a certain context can be better described by a series of “if-then” rules. If the condition in the “if” part is true, then the action of the “then” part follows. This approach utilises an external element (agent) that is integrated with the system to rationalise the “if-then” process within the system. Such an approach is known as the agent-based simulation and has been shown to have the potential to model complicated cognitive processes. This is the approach taken in order to fill the gap of existing simulation research into the impact of IMTI system, the authors have developed a rule-based mode choice model, the intelligent network simulation model (INSIM), as reported in an earlier study [22]. The INSIM possesses the sophistication to imitate the commuters’ mode choice behaviour in an information-rich environment.

INSIM has four main components: INSIM expert system (IES), transportation network simulation model (based on PARAMICS), application programming interface (API), and data transfer interface (DTI), as shown in Fig. 1. The IES is responsible for generating mode choice decision. The transportation network simulation model simulates the traffic and generates travel statistics as output files. These output text files are collected by the API, and transformed to Microsoft Access Database (MDB) files and exported to DTI, which is a database that stores the PARAMICS output files in MDB format while retaining intact the original time-dependent updated origin and destination (OD) matrix file. The API takes in the information in the OD matrix file and releases the vehicles in the simulated network according to the pattern specified in the file. The IES is designed in a way to provide the flexibility to adopt an alternative mode choice model that can give prediction of commuters’ mode choice. Therefore, the impacts of real-time IMTI on the overall performance of the transportation network can be analysed.

Fig. 1
figure 1

Architecture of the INSIM

In most current research on intelligent network simulation model, a risk strategy is typically adopted to handle the risk of network uncertainty [2328]. In the similar manner, the most important characteristics of the IES are its high-quality performance and adaptive learning. The high-quality performance assures system reliability and timeliness. The adaptive learning allows the IES to update dynamically and learn on its own through its experience. The reliability of the IES depends upon its decision-making process, which can be observed by two specific features, i.e. the capability to explain about the decision taken, and to deal with uncertainty. The IES is capable to explain how and why a decision was reached. The execution of rules tending towards certain decisions can be explained by the inference chain or the rule flow. Furthermore, the rule of inference has also been designed to deal with knowledge that is incomplete or not completely certain. A heuristic algorithm is developed to solve the uncertainty problem and generate a suitable decision for the provided facts. This ability of the IES allows it to provide a solution even when the provided information is not 100 % accurate or is incomplete. In this regard, it is also necessary to analyse the level of accuracy of the provided solution. To check the accuracy of the designed algorithm, an experiment was conducted and its level of mode prediction accuracy was analysed and has been reported in an earlier study by the Authors [22].

Accurate simulation results are highly dependent on the selection of model parameters and the calibration methodology [29, 30]. Calibration studies have been performed for various traffic simulation tools [3133]. The typical calibration method is to minimise the differences between the observed data and simulation output during a certain time period on a typical day. Before application of INSIM, each module in INSIM is to be well calibrated with real traffic data to ensure realistic simulation results. This paper covers the important task involved in the calibration of the INSIM, especially IES, and is organised as follows. The study network and network coding are presented in Sect. 2. Detailed discussions regarding the calibration of INSIM are then provided in Sect. 3 and followed by conclusions in Sect. 4.

2 Study network

To assess the capabilities of the integrated IES with the transportation network simulation model, the transportation network for the central and western part of Singapore was selected for the simulation. The simulated network covers the area bounded by 5 major expressways and numerous arterial streets. The expressways are Pan-Island Expressway (18 km), Ayer Rajah Expressway (19 km), Central Expressway (9 km), Bukit Timah Expressway (5 km), and Kranji Expressway (9 km). In the multimodal network established for this study, there are a total of 21 residential and commercial zones, 2 Mass Rapid Transit (MRT) Lines, 23 MRT stations, 2 MRT interchanges, 38 selected signalised road junctions, 14 flyovers, and 1 roundabout. Every MRT station has feeder bus services, which traverse between nearby zones and MRT stations. Figure 2 shows the overall view of the simulated multimodal transportation network.

Fig. 2
figure 2

Overall view of the simulated transportation network (source: Onemap, Singapore)

The physical data of the selected transportation network were collected through a survey of the study area, and also from several reliable web-based sources. All the selected expressways as well as two arterial streets (i.e. Jalan Boon Lay and Pioneer North Road) were covered. The transport network design criteria provided by the Land Transport Authority (LTA) of Singapore [34] was also consulted. The road geometrics and details of the signalised junctions were observed and sketched during site surveys. The details of public mode of transport were collected from Singapore Mass Rapid Transit’s website [35].

2.1 Geometric design

The imported map of the study area provided all the essential details regarding the road network layout, location of flyovers, roundabouts and road junctions, MRT network, MRT stations, and bus interchanges. It also gave the details about residential, recreational, and commercial zones. This map was loaded and displayed as the background, over which nodes, links, bus stops, and MRT stations were placed.

The expressways are mostly 3-lane dual carriageways, except at certain locations where they are upgraded to 4-lane dual carriageways. The expressways have on-ramps and slip lanes that allow the vehicles to enter or exit, respectively. All the arterial roads are 2-lane dual carriageways. The lane widths are 3.7 m for expressways and 3.4 m for other roads. The speed limits vary from 80 to 90 km/h for expressways and 60 to 70 km/h for arterials roads.

The road network in PARAMICS was coded according to the road geometrics, including the junctions, roundabouts, and flyovers. The type of links for expressways and arterial streets were specifically designed and provided in the “category” file, where details about number of lanes and speed limits were defined. The speed limit for expressways was standardised at 85 km/h and for arterial streets at 55 km/h, given that PARAMICS models simulate the average travelling speed at 10 km/h higher than the defined values.

2.2 Surveillance equipment

In the real network, traffic streams are being routinely observed via video cameras as well as loop detectors, and the statistics regarding traffic volumes and speeds are estimated and provided to certain users through web-based services. The data for the selected network were extracted from the archived database of such records.

To collect the simulated traffic statistics, 18 loop detectors were coded at points where surveillance cameras and loop detectors were present in the physical network. The loop detectors provided information about traffic counts and average travelling speeds on specific links. The gathered traffic data were also utilised in the calibration process of the simulated transportation network model.

2.3 Signalised junctions

A total of 38 signalised junctions were selected for the simulation model. These junctions were either cross-junctions or T-junctions, with slip lanes for left turning vehicles. The signal phasing patterns are fixed with varying signal cycle lengths. All the signalised junctions have pedestrian cross-walks. The signal phasing pattern is such that there are 3 s amber time and 2 s all-red time. The maximum cycle time is 120 s and maximum green time of any approach is 50 s. The details regarding the coded phasing pattern schemes are shown in Fig. 3. The un-signalised junctions are as follows: no control (57 junctions), yield sign control (27 junctions), and stop sign control (33 junctions).

Fig. 3
figure 3

Coded phasing pattern schemes

2.4 Public mode of transport

The public mode of transport consists of two modes: one is the bus service and the other is the MRT. It was found from the travel behaviour survey that more than 90 % of commuters in the sample reported that they had access to MRT, and they were willing to travel by public mode of transport if it provided better service [36]. One of the service criteria was the journey time, which is assumed to be lower in the case of MRT as compared to bus service. Thus, the bus service was modelled as a feeder service to MRT, and MRT was taken as the major mode of public transport for work/school trips.

The bus routes that served as feeder service to MRT and the bus stops associated to these bus routes were coded and modelled by geographical location. Similarly, the MRT lines and stations were also coded. The MRT lines were modelled as single-lane dual carriageways, with rights-of-way dedicated to MRT services only. Each MRT station was designated a zone, and explicitly simulated in such a way that the residential and commercial zones were connected to the MRT zones (stations) by feeder bus services. The simulated buses were 12 m long with a seating capacity of 47 passengers and a top speed of 20 km/h. The simulated trains had 6 cars with a total length of 138 m and a top speed of 45 km/h. Each train had a seating capacity of 372 passengers.

2.5 Traffic information and OD matrix

It is assumed that travellers commute in an information-rich environment, where pre-trip IMTI and en-route traffic information were available. All the driver vehicle units (DVU) were coded as controlled by familiar drivers and dynamic feedback assignment was adopted. The PARAMICS model disseminated interval traffic information to all the familiar drivers and information was updated at every minute.

An OD matrix was estimated based on the traffic counts for calibrating the developed transportation network. A traffic survey was conducted and the statistics such as traffic counts, average travelling speed, and average travel time were gathered for the calibration.

3 Calibration of the INSIM network

The calibration of the simulated network was done from two different perspectives. The first one was to adjust the traffic counts and the OD matrix in the simulation as compared with the existing situations, such that the simulated network within the complete simulation framework can be close to the real traffic conditions. The second perspective was the calibration of behavioural parameters of the PARAMICS simulation model, such that they imitated the local commuters’ travel behaviour. The calibration of the behavioural parameters was necessary because the IES was developed based on the commuters’ travel behaviour as observed/estimated from the travel behaviour surveys conducted in Singapore. To integrate IES with PARAMICS to simulate the local traffic environment in accordance with IES, both models should have the same behavioural assumptions, so that the behavioural parameters that govern the PARAMICS simulation model can realistically represent the behaviour of local commuters.

3.1 Study site and basic input data

The network chosen for the simulation model is shown in Fig. 4. To calibrate the behavioural parameters, a portion of the selected study network (marked with blue broken line) in Fig. 4, was considered. This portion of the network was selected because it covers an area bounded by two expressways (i.e. Pan-Island Expressway and Ayer Rajah Expressway), four arterials (i.e. Jalan Boon Lay, Boon Lay Way, International Road, Pioneer Road North), and one collector (i.e. Jurong West Street 75).

Fig. 4
figure 4

Detailed map and overview of the simulated network

There is a major bus interchange integrated with the MRT station. Such an arrangement facilitated possible transfer between feeder buses and the transit mode. Thus, this portion of the network provided the opportunity to collect travel data for both the public and private modes, for similar trips within the same corridor. The details and overview of the simulated network are presented in Figs. 4 and 5.

Fig. 5
figure 5

Detailed simulated network showing zones, links and nodes

The basic input data include network geometry, driver behaviour, vehicle characteristics, transportation zones, travel demands, traffic control systems, traffic detection systems, bus routes, and bus stops. The road geometry and infrastructure locations were obtained from field surveys. The transportation zones were located at the entrance and exit nodes of all the major roads.

3.2 Required data for calibration

The calibration involved the checking of model results against observed data and adjusting the values of the parameters until the model results converged within an acceptable error threshold. The collected data for model calibration included traffic volume, average travelling speed, and travel time data. The traffic volume and average travelling speed data on expressways and arterial streets were obtained from an archived database. The travel time data were obtained by floating car method for Pioneer North Road and Boon Lay Way.

To construct the real-world traffic variations in the simulation, a typical traffic variation can be represented by the traffic conditions of a typical day, of which the traffic data were the target of calibration process. The selection of a typical day can be implemented based on the comparison of traffic volume (at any selected station) of a candidate day with the average traffic volume of all candidate days using the GEH statistic, used by British engineers:

$$\text{GEH} = \sqrt {\frac{{\left( {E - V} \right)^{2} }}{{{{\left( {E + V} \right)} \mathord{\left/ {\vphantom {{\left( {E + V} \right)} 2}} \right. \kern-0pt} 2}}}},$$
(1)

where E is the candidate data and V is the average data. If the GEH values for more than 85 % of the selected stations are less than 5, the traffic condition and the demand pattern of the candidate day are considered typical. There were 35 stations for which data were available. These stations were located on the two expressways i.e. PIE and AYE, and various arterial streets within the network. The data were collected for 1 h during the morning peak period from 8 to 9 AM. It was found that traffic condition on Wednesday was typical with respect to all weekdays. Therefore, the traffic volume and average travelling speed data on a certain Wednesday were chosen for calibration purpose. It was observed from the extracted data that the aggregated average traffic volume on PIE was 3600 vehicles per hour with an average travelling speed of 94 km/h, whereas on AYE the aggregated traffic volume was 5300 vehicles per hour with an average travelling speed of 87 km/h.

The travel time data were collected by floating car method. On the typical day (Wednesday) during the peak period (8–9 AM) several runs were made between Nanyang Technological University (NTU) and Boon Lay Interchange (BLI). The length of the selected route was 4.2 km. The number of floating car runs was obtained from the following equation:

$$n = \left( {\frac{ts}{\mu \varepsilon } \times 100} \right)^{2},$$
(2)

where n is the required number of runs, \(\mu\) is the mean travel time in the runs, s is the standard deviation, \(\varepsilon\) is the desired margin of error (percent of \(\mu\)), and t (Student’s t-statistic) is the confidence coefficient.

The standard deviation was unknown prior to runs being conducted. A total of 12 runs were performed. It was observed from the floating car runs that the average travel time was 314 s and the standard deviation was 22.3 s. The estimated number of runs with margin of error at 5 % was \(9.75 \approx 10\). The 95 % confidence interval at 12 runs was estimated to be ±14.2 s, and it was inferred that the true mean lies between 300 and 328 s. The corresponding average travelling speed on this route was estimated to be between 46.1 and 50.4 km/h. The average travelling speed on the typical day for this route was observed to be 50.4 km/h with standard deviation of 14 km/h. These results provided evidence that the estimated travel time statistics by floating car method gave the details regarding the basic travelling conditions.

Similarly, the travel time data for the public mode of transport were collected. Two bus routes were considered: the first one (SBS 199) served between NTU and Boonlay Interchange (BLI), and the other (SBS 242) served between Jurong West area and BLI. The route length of SBS 199 is 4.2 km, and it is associated with 9 bus stops out of which 3 have bus bays. The route length of SBS 242 is 3 km with 13 bus stops, out of which 3 have bus bays. The frequency of SBS 199 is 4 min during peak hours and 7 min during off-peak hours and for SBS 242 is 3 min during peak hours and 5 min during off-peak hours on week-days, respectively. On each route, there are 4 signalised junctions and 2 mid-block pedestrian signals.

Initially, 30 runs were conducted on each bus route, and the travel time data were collected. For SBS 199 route, it was found that the average travelling time was 752 s with a standard deviation of 51.4 s. With an error margin of 5 %, the estimated number of runs was 7.8 ≈ 8. The resulting 95 % confidence interval was ±19.5 s for 30 runs, which means that the true mean travel time can be between 732 and 772 s. For SBS 242 route, the average travelling time was 556 s and the standard deviation was 46.2 s. With an error margin of 5 %, the estimated number of runs was 11.6 ≈ 12. The resulting 95 % confidence interval was estimated to be 17.5 s, which means that the true mean travel time can be in-between 538 and 574 s. The average travelling speeds based on the observed travel time values on routes SBS 199 and SBS 242 were 23.9 and 32.4 km/h, respectively.

The PARAMICS simulation model needs an OD matrix as a starting point of the calibration process. The time-dependent OD matrix was generated by IES. The OD estimation process was dependent on commuters release profile and zone-based trip distribution. It is necessary that the generated OD matrix should match with some reference OD matrix, so that the PARAMICS model can simulate realistic traffic conditions. The reference OD matrix was estimated from the traffic count data obtained for the typical day.

The cordon points of the network were taken as zones, and 15-min interval traffic counts at all cordon points were gathered from the data. The total traffic attractions and generations of each zone were then assumed to be the distribution of traffic with respect to the OD matrix. The available loop detector data were then compared for checking the network-wide equilibrium of traffic volumes, such that the traffic entering the network should be equal to the traffic leaving the network. To estimate the OD matrix, the observed traffic counts at the cordon points were taken into consideration. These traffic counts were assumed to be the total attraction and generation from the respective zones corresponding to the cordon points. Once the traffic volume being generated and attracted to each zone was estimated, it was then compared with a reference OD matrix. This reference OD matrix was obtained from the study done by Ang [37], which provided the details regarding the traffic volumes being generated during the morning peak period.

The data gathered from the traffic survey provided details about the traffic volumes for 7 zones. The traffic volume data for these zones were compared with those provided in the reference OD matrix and the growth factor was estimated. Based on the estimated growth factor, which was found to be 5.69 % per annum, the total productions and attractions for each zone of the reference OD matrix were revised. The adopted expansion procedure was

  1. (a)

    the total production/attraction for each zone of the reference OD was expanded, excluding the total productions/attractions for the zones that were obtained from the traffic survey;

  2. (b)

    all the expanded total productions/attractions were summed up, and compared with the expanded total traffic volume, to estimate the overall difference,

  3. (c)

    the estimated overall difference was distributed among all the zone-based total productions/attractions depending upon their share of traffic volume aggregated in the total traffic volume;

  4. (d)

    each zone-based total productions/attractions were then adjusted according to the allocated share of the estimated difference;

  5. (e)

    the adjusted zone-based total productions/attractions were again summed up, such that their sum was now equal to the expanded total traffic volume; and

  6. (f)

    the finalised zone-based total productions/attractions were taken as the target values to be used in the final version of the estimated OD matrix.

Each cell value of the reference OD matrix was also expanded, corresponding to the expanded total traffic volume. The zone-based total productions/attractions for these cell values were estimated and taken as the base case values. The Furness technique was then used for balancing the base case values with the target values of the zone-based total productions/attractions. Only the expanded values of the zone-based total productions/attractions were changed, and the total productions/attractions estimated from the traffic survey were unchanged. Based on the estimated OD matrix, the percentages of trips being productions and attractions for each zone were calculated. The zone-based distribution of trips was estimated and is presented in Table 1. The zone-based trip distribution was utilised by IES to generate the time-dependent OD matrix.

Table 1 Zone-based trip distribution (%)

3.3 Preliminary testing

In INSIM, three types of demand-related data are necessary: zone-based percentile distribution of trips, percentage of total traffic flow to be released onto the network in each time interval, and the total number of trips to be generated. Field data were processed to gather the demand-related data along with the required performance measures. The transportation network model was simulated based on the demand-related data, and was calibrated by comparing the simulated output results with the gathered performance measures. It was observed from the field data that on a typical day the peak period was from 8 to 9 AM, and 33,120 private mode trips were generated within the physical geography of the simulated network during this period. The peak period was defined by observing the traffic volumes on a typical day from 7 to 10 AM. It was found that the traffic volumes were highest during the time period of 8–9 AM. Available information in recent years [38] revealed that roughly 40 % commuters took private mode for their work trip and 50 % took public mode, and 10 % did not require any motorised transport. Based on these statistics, a total of 79,877 trips were simulated for a 1-h time period. The estimated zone-based percentile trip distribution (Table 1), and a uniform trip release profile was also defined. Default values for vehicle population composition, vehicle characteristics, and driver aggressiveness and awareness distributions were not changed. The dynamic feedback assignment model provided by PARAMICS was adopted to replicate route choice behaviour.

Initially, 10 simulation runs were conducted in order to identify critical variables that may significantly influence the performance of PARAMICS. Variables related to driver behaviour (e.g. the distributions of aggressiveness and awareness), compositions of vehicles, driver’s familiarity, etc. were examined. The number of simulation runs was based on number of vehicles (NV). The average NV value was 32,580 and the standard deviation was 335. The estimated number of runs with a margin of error of 1 % was 5.12 ≈ 6. The required simulation runs were conducted with different seed numbers. The numerical outputs of the median simulation run were then compared to corresponding real data.

Outputs used for comparison were related to the total demand generated and the total flow reaching the destination. The outputs were aggregated and analysed using Microsoft Excel. Outputs aggregated in this step for further comparison included total generations (total vehicles released in 1 h), and total attractions (total number of vehicles completing their journey in 1 h). Results in Table 2 indicate a shortage in the number of vehicles being released into the network, and the number of vehicles reaching their destinations. The difference in productions and attractions is due to vehicles which were released in the network but have not reached their destinations. Thus, the difference between the productions and attractions is equal to the total number of vehicles which are still in the network. Therefore, parameter values other than the default needed to be explored. In order to improve the accuracy of the simulation at this stage, the types of vehicles using the network and their respective percentages in the vehicle population were adjusted. There was no systematic way for adjusting the percentage of different types of vehicles except to experiment different combinations so as to get acceptable results. After several combinations the final percentages of different types of vehicles used in the simulation are presented in Table 3.

Table 2 Output data comparison with the un-calibrated model
Table 3 Calibrated percentages of different types of vehicles

It was observed during the simulation that the DVU traversing on expressways did not use the outermost lanes, specifically at the merging points at on-ramps along expressway lanes. At such locations, the vehicles slowed down for longer time periods, and such behaviour resulted in heavy congestion. To analyse such issue, three parameters concerned with driver behaviour were investigated. Two parameter sets in particular, aggressiveness and awareness, were examined—since these two significantly influence the behaviour of drivers. There are four default types of distributions associated with each of these two parameters. Numerous runs with different combinations of distributions were conducted until the outputs of lane use were improved further. The third parameter was the ramp awareness distance. It was observed from the PARAMICS graphical interface of the simulated network that increasing the length of ramp awareness distance increased the flow on expressways and improved the outermost lane utilisation. The ramp awareness distance was adjusted one by one for all the ramps, and the minimum value was adjusted to 150 m. Using combination of the above adjustments, 10 simulation runs were conducted and the improved results are summarised in Table 4.

Table 4 Output data comparison based on the calibrated model

Theoretically, it is a dynamic OD demand estimation problem. To date, there are some OD estimation tools e.g. Estimator of PARAMICS, but their capabilities and potentials for a corridor network are still unclear [39]. The time-dependent OD matrix was estimated on the basis of zone-based percentile trip distribution. The proposed dynamic OD estimation process could be regarded as a process that assigns the total OD to a series of consecutive time stamps. The time-dependent OD demand for each time stamp was then extracted by reconstructing the dynamic OD matrix based on a set of demand profiles for every time stamp, e.g. 1 or 5 min. INSIM allows the user to define the time-dependent releases of demand (trips) and the zone-based trip distribution.

The field data provided 15-min interval counts for all the cordon points in the network. Based on these data, the profile of vehicle production from any origin zone and that of vehicle attraction to any destination zone were thus estimated. Four different trip release profiles for different time stamps were developed as shown in Fig. 6, such that:

Fig. 6
figure 6

Different trips release profiles

  1. (a)

    A uniform distribution was assumed for the demand release profile, i.e. RP1. The total demand was uniformly spread over 60 min of the simulation period with a time stamp value of 5 min. For the first four time stamps, 9 % of the total volume was released, and in the remaining 8 time stamps 8 % were released.

  2. (b)

    A nonuniform distribution was assumed for the demand profile RP2. The 15-min counts were segregated into 5-min counts, by uniformly distributing the 15-min counts over 3 periods of 5-min duration, such that 7 % of demand was released in each 5-min time stamp for the first 15-min time interval, 9 % of demand was released in each 5-min time stamp for second 15-min time interval, and so on.

  3. (c)

    A normal distribution was assumed for the demand profile RP3, such that the maximum release of trips occurred during the 2nd and 3rd quarters of the simulation period. In this profile the demand increased uniformly and arrived at the peak discharge of 12 %, after which it decreased uniformly.

  4. (d)

    A normal distribution skewed towards the right was assumed for the demand profile RP4. In this profile the peak discharge of 12 % occurred in the 3rd quarter of the simulation period. This profile imitated the real traffic flow condition.

3.4 Calibration of route choice model

In the OD estimation process, the network, traffic and route choice behaviour parameters were to be fixed initially, as this process is based on the traffic assignment matrix that is affected by any change in the simulation input parameters.

Due to the existence of expressways and parallel streets in the study network, the routing algorithm adopted in the PARAMICS simulation was important. The network was calibrated using the dynamic feedback assignment model provided by PARAMICS. Dynamic feedback assignment in PARAMICS assumes that different drivers perceive different costs from a decision node to the destination. The perceived cost was calculated, and the perceived shortest route was chosen at the decision node. At this stage the parameter to be calibrated for the route choice model is the number of drivers who are familiar with the road network. Since there was no data to calibrate it, it was assumed that most drivers in the morning peak period were familiar drivers who had the knowledge of road network and traffic conditions. Therefore, it was assumed that 95 % of drivers were familiar drivers, who could choose their route from the available options.

3.5 Secondary testing

Multiple simulation runs for each parameter combined with the each trip release profile were also performed. During these simulations, certain changes in the cell values (i.e. specific origin to destination traffic volumes) of the estimated OD matrix were incorporated. After multiple iterations, the calibration criteria were satisfied. The highest GEH value was obtained by adopting RP4 trip release profiles with 95 % of familiar drivers. The calibration results of this step are shown in Table 5, which presents the 15-min intervals traffic counts at the selected measurement stations. It shows that for more than 87.5 % of all measurement locations, their GEH values were lower than 5, which satisfied the calibration acceptance criterion of this step.

Table 5 Traffic count calibration results for release profile RP4

3.6 Calibration of driving behaviour model

The last step was to calibrate the driver behaviour model to reflect local driver characteristics. The local driver characteristics can be examined through the comparison of the simulated and observed volume-occupancy curves drawn based on aggregated point detector data, and the point-to-point travel time measurements. Thus, the network was further calibrated under the following conditions: the network has been coded and partially calibrated for the parameters in above section; there was no data that can support the calibration of route choice model; the dynamic assignment model in the microscopic simulator can be accepted; and the zone-based percentile trip distribution and trip release profile have been obtained.

The objective function is to minimise the deviations among the observed and the simulated volume-occupancy curves, and the point-to-point travel time measurements, subject to the condition that the traffic counts, which have already been calibrated, do not change. But if the traffic counts do change, then they must satisfy the GEH calibration acceptance criteria. The point-to-point travel time match was performed only for one route between NTU and Boon Lay Interchange (BLI). The driver behaviour model can be calibrated by adjusting the car-following and lane-changing models. The global parameters for the car-following and lane-changing models are mean headway and driver reaction time, which can drastically influence the simulated driver behaviour. These two parameters were fine-tuned by trial-and-error. The purpose was to reconstruct traffic variations and match the congestion pattern of the study network, so that the resulting simulated travel time measurement matched with the real life estimated travel time values. Five different combinations of mean headway and driver reaction time were analysed. For each combination 5 simulation runs were performed and the median NV value was taken into consideration. It was observed that higher values of mean headway and reaction time resulted in lower travelling speeds and vice versa.

The final calibrated mean target headway and driver reaction time were 0.85 and 0.75, respectively. The calibrated demand profile for the simulated peak period is shown in Fig. 7.

Fig. 7
figure 7

Calibrated demand profile

The root mean square percent error (RMSP) was estimated to examine the error between the simulated traffic demand and the actual observed traffic demand, and it was found to be 8.29 %. Figure 8 shows the comparison of observed and simulated point-to-point travel time for the selected route, which has an RMSP value of 2.49 %. Based on the lower value of RMSP, it can be stated that calibration process has been effective as indicated by the test statistics.

Fig. 8
figure 8

Observed and simulated point-to-point travel time

3.7 Calibration of public mode of transport

The last stage of calibration process was to calibrate the public mode of transport. Two public modes of transport i.e. train and bus services were simulated. As the bus shares the right-of-way on the surface street network, its journey time can be affected by the congestion occurring in the network. The bus journey time is dependent on two parameters namely the average travelling speed of buses and the number of passengers waiting at bus stops.

The number of waiting passengers can influence the travel time by affecting the bus dwell time at the bus stops, which is an integral part of travel time. Increasing the number of passengers can increase the boarding passengers’ queue length resulting in longer bus dwell time at the bus stops and vice versa. Thus, the bus dwell time can be a function of number of passengers arriving at the bus stop. The public mode of transport was calibrated by adjusting the bus travel time. The selected parameters were the average travelling speed by bus and the arrival rate of passengers at the bus stop. The objective function was to minimise the difference between the simulated and observed bus travel times. The field data were available for two bus routes, i.e. SBS 242 and SBS 199. Along with other bus services these two routes were simulated.

Initially the bus travel time was calibrated by adjusting the travelling speed. Three different speeds, i.e. 15, 20, and 25 km/h, were simulated. Each speed was simulated for 5 times with different seed number. Then the bus travel time was fine-tuned by adjusting the arrival rate of passengers at each bus stop. After multiple simulation runs, the bus journey time was found to be acceptable. The average travel time measured for buses from simulation were compared with the average travel time for buses measured in the field. The comparison of observed and simulated point-to-point travel times on bus route SBS 242 and SBS 199 are presented in Figs. 9 and 10, respectively. The RMSP was estimated to be 9.54 % for SBS 242, and 9.01 % for SBS 199 route.

Fig. 9
figure 9

Observed and simulated travel time by bus route SBS 242

Fig. 10
figure 10

Observed and simulated travel time by bus route SBS 199

4 Conclusion

This paper calibrated a self-developed traffic simulation model, INSIM, with the observed field data from the central and western part of Singapore. Results reflected that the improved model and the generated outputs are acceptable. The network coding errors were corrected and a reliable time-dependent OD matrix was estimated. The INSIM was adjusted, such that the commuter release profile and the percentile trip distribution resulted in simulated traffic counts that were acceptable according to the GEH statistics. The PARAMICS route choice model was adopted and 95 % of the driver population was assumed to be familiar with the road network. The driver behaviour model was calibrated, and it reflects the local driver behaviour. The mean headway and driver reaction time were adjusted to achieve the calibration of the driver behaviour model. Finally, the public mode of transport was calibrated by adjusting the average travelling speed of buses and the arrival rate of passengers at the bus stops.

Model calibration is a necessary process for various kinds of traffic simulation model, especially for newly developed rudiment before application. This paper provides the whole calibration procedure using local travel network and data. It can simulate the travel time and travelling speed parameters, which shall be utilised by the IES to generate commuters’ travel mode choice under the influence of integrated traveller information, within the statistically acceptable limits, as compared to the observed data. Future tasks include applying the calibrated INSIM in various kinds of real networks to achieve its functions.