1 Introduction

Czech motorways and national roads constitute the primary road network. As one of the densest road networks in Europe, it carries the highest portion of traffic and connects the most important destinations in the Czech Republic [70]. This is challenging in terms of ensuring sufficient operation and maintenance, as well as safety, especially when the oldest part of motorway network is currently undergoing a large-scale renovation. For Czech national road agency, to be able to rationally manage network safety in both planning and operation stages, quality network-wide data and tools are needed. While such tools already exist in some countries, their transferability is limited. This motivated authors to develop the tools for the Czech conditions.

The paper describes the process of developing accident prediction models for all types of Czech road network elements. Unlike a number of international studies, which usually dealt only with a selected road category, the study focused on the whole network in its complexity, covering motorways and national roads, and including road sections, intersections and interchanges. The authors conducted their own traffic survey, collected and processed all necessary data, and transformed the results into practical on-line tools. The developed tools are able to improve the effectiveness of road safety impact assessment (in the planning stage), as well as network safety ranking (identification of hazardous road locations in the operation stage).

Next Section 2 presents Literature review, followed by Data and methods (Section 3), Modelling, results and applications (Section 4) and concluded by Summary, discussion and conclusions.

2 Literature review

In recent years, European Union (EU) road safety orientation has been steered by EU Directive 2008/96/EC on road infrastructure safety management. Its main purpose is to establish management procedures to ensure safe road network, both in planning and operation stages. The Directive introduces four procedures: (1) road safety impact assessments and (2) road safety audits to be carried out for new road constructions or reconstructions; for existing roads, (3) network safety ranking to find the critical sections and (4) road safety inspections are to be periodically conducted. Two of these procedures – road safety impact assessment and network safety ranking – are further referred to in the text.

2.1 Road safety impact assessment

The idea of road safety impact assessment (RSIA) is that road safety should play an important role in the decision making during the design/planning stage [32, 50, 91]. The 2008 Directive defines RSIA as “a strategic comparative analysis of the impact of a new road or a substantial modification to the existing network on the safety performance of the road network”, conducted alongside with other impact assessments, such as environmental impact assessment (EIA). By 2011, EU member states implemented the Directive procedures, including RSIA, to their national guidelines. However, independent analysis [79] concluded that in most cases, implementation only resulted in legislative documents, which offer mere Directive translations, rather than detailed guidelines for practical use. This is also probably a reason why RSIA remains the least used of Directive processes [79].

In principle, RSIA process involves definition and comparison of project variants (“do nothing” and one or more alternative scenarios), of which road safety performance needs to be assessed, including impacts on adjacent road network. Therefore, the underlying principle is quantification of safety. For this purpose, rates derived from accidents and injuries have been traditionally used, based on Police accident records and tabulated for several road network categories. This approach has been used in majority of European countries [12, 14, 19, 33, 36, 53, 85, 87]. However, in recent years, general use of accident rates is declining, since they were found to incorrectly assume a linear relationship between accident frequency and the degree of exposure [31, 41, 66]. Accident prediction models (APMs, or safety performance functions, SPFs) are instead preferred, since they are able to model accident frequency non-linearity, while also taking into account multiple explanatory variables (potential risk factors).

Another element of state-of-the-art approach to safety estimation involves accident modification factors (AMFs, also known as crash modification factors, CMFs). These are multiplicative factors, used for calculating the expected number of accidents after implementing safety measure at a specific site, through multiplication with expected accident frequency without treatment [38]. An AMF value higher than 1.0 indicates an expected increase in accidents, while a value lower than 1.0 indicates an expected reduction in accidents after the treatment. Various methodologies may be used in order to obtain AMF values, while before-after methodology with Empirical Bayes adjustment, has been deemed the most suitable [42, 64, 74]. In addition, AMFs were recently put into the centre of evidence-based decision-making [92], as a foundation of efficiency assessments to be applied in all cases, where “lack of reliable knowledge of the effects of countermeasures is a key barrier to the advancement of many critical, life-saving initiatives” [59].

To sum up, road safety impact assessment should be based on state-of-the-art assessment approach, using accident prediction models (APMs) and accident modification factors (AMFs). These elements constitute the foundations of evidence-based road safety management [28, 45, 96]. However, most of existing APMs and AMFs were developed in North America, Australasia or North-Western Europe [95], and it is known that these results may not be easily transferable [40, 59]. This means that should state-of-the-art approach be applied, a number of EU member states, including the Czech Republic, need to develop their own tools. [9]

In Europe, APMs and AMFs are used for example in Denmark, Sweden and Finland [62, 82, 88]. Analysis of their guidelines and manuals showed that:

  1. 1.

    Applied models are relatively simple, developed for specific road types and intersection types and often involving only traffic volume. This practice is consistent with pilot Czech applications, where simple models were also found to provide sufficient quality of safety estimates [4, 6, 11].

  2. 2.

    Accident modification factor sets are usually combined from local estimates and international sources, mainly the Norwegian Handbook of Road Safety Measures [31]. While this practice may be sufficient for relatively similar conditions of Sweden and Finland, suitability in the Czech Republic is uncertain.

2.2 Network safety ranking

The Directive states, that “safety ranking and management of the road network in operation” (in short, network safety ranking, NSR) should take into account the number of accidents, that have occurred in previous years per unit of road length in relation to the volume of traffic; and it should result in a priority list of road sections where an improvement of the infrastructure is expected to be highly effective. However, this description is rather short and thus provides, as well as RSIA description in the previous subsection, space for users in EU countries to apply their own approaches.

Several reviews indicated that in most countries, it is common to rely on traditional approaches, based on exceeding a fixed threshold of number of Police-reported accidents, without considering traffic volume and other risk factors [22, 30, 37, 52]. Such definitions have also been applied in the Czech Republic [12, 86]. However, the research has shown that safety performance is influenced not only by accident occurrence, but also by its random variations, known as regression to the mean [42]. Due to this effect, the locations (road sections or intersections) identified as critical in one period, may not be critical in another period. The effect of these random fluctuations around a long-term mean may be corrected for by adding information on safety of similar units. The previously mentioned Empirical Bayes (EB) method, i.e. combination of the information contained in accident counts with the information contained in knowing the safety of similar entities, through using an accident prediction model, is considered a state-of-the-art method of road safety assessment [30, 44, 56]. With the EB approach, in the end the list is produced which enables ranking the locations based on their potential for safety improvement [1]. The list includes not only accident locations, but also potentially hazardous locations, where no accidents have yet occurred.

Nevertheless, to be able to apply the EB method on a primary road network, it is necessary to develop a set of accident prediction models for all types of network elements (road sections, intersections, interchanges, etc.). For example, US Highway Safety Manual (HSM) introduced a number of US-specific accident prediction models. It was found that these models are well transferable between the US states [21, 76, 94]; however, transferability outside of the US was not always successful, as indicated by studies in Canada or Italy [65, 71, 97]. In the USA, also interchange safety analysis tools are available [20, 80]. But like with HSM transferability, calibration of these tools may be uncertain [25, 81]. At the same time, several European countries have developed their own accident prediction models. However, they differ in their level of complexity: sometimes they are rather accident rates (i.e., implying linear relationship between traffic volume AADT and accidents) and they often consider either sections or intersections only [47, 62, 82]. In terms of data requirements, motorways can be even more challenging due to complexity of interchanges. This is probably why motorway safety studies usually omit interchanges or consider them part of sections [24, 27, 55, 73, 98].

Based on data from several European countries, a recent PRACT project has developed the base model to be calibrated to local conditions based on historical data [49]; limitation is that the models do not consider intersections, and the calibration tool is not yet available.

Following the international research, EB method with accident prediction model was recently applied in a part of Czech secondary road network [6]. However, its application in more complex primary road network has not yet been attempted.

2.3 Summary

The literature review, focusing on international state-of-the-art practice of road safety impact assessment (RSIA) and network safety ranking (NSR), concluded that both procedures should ideally be based on accident prediction models (in addition, RSIA also requires accident modification factors). The process of developing accident prediction models (APMs) has been known from several international studies and applications; however, these usually did not cover complete road networks. In the example of motorways, the models need to consider all the elements, including road sections, various types of intersections, interchanges, etc. How this was done in case of Czech primary roads (motorways and national roads), is described in the present paper.

3 Data and methods

In order to develop accident prediction models, as the main tool for both road safety impact assessment (RSIA) and network safety ranking (NSR), following points need to be resolved:

  1. 1.

    Sample definition

  2. 2.

    Network segmentation

  3. 3.

    Definition of variables

  4. 4.

    Selection of model function form

These steps are described in the following sections.

3.1 Sample definition

Since the idea of network safety ranking is to screen the entire road network, the sample should in theory comprise the whole population. However, the studied elements should be described by predictors, of which values are constant through the studied period. On the other hand, for the purpose of road safety impact assessment, accident prediction models may be based on a representative sample only.

Based on these requirements, following decision guided the definition of the studied samples:

  • The parts of motorway network under renovation (including also the main motorway D1 between two largest cities Prague and Brno) were removed from the sample.

  • The categories covered typical cross-sections, such as undivided/divided sections or 3-leg/4-leg intersections. Less frequent categories were omitted.

3.2 Network segmentation

Given the variety of Czech primary road network types, it was decided to distinguish motorways and national roads, as well as intersections (interchanges on motorways) and sections. Since RSIA also assesses impacts on adjacent road network, in addition to motorways and national roads, secondary roads were considered as well. At the same time, it was found that some categories were relatively rare: for example, 6-lane or 2 + 1 sections are not as frequent, as would be required for developing a reliable accident prediction model. The same held for secondary roads, sample size of which was limited by unavailability of AADT data.

The final 11 categories were the following:

  • Motorways

    • ○ Interchanges (comprising conflict points and ramps)

    • ○ Sections (divided)

  • National roads

    • ○ Intersections (3-leg, 4-leg or roundabouts)

    • ○ Sections (undivided or divided)

  • Secondary roads

    • ○ Intersections (3-leg or 4-leg)

    • ○ Sections

In Table 1, the models are labelled as M01, M02, …, M11.

Table 1 Overview of categorization into 11 road network element types

Regarding road sections, it is necessary to realize the consequences of various section lengths. In case of using homogeneity principle, i.e. keeping constant values of explanatory variables within a section, one may obtain too short sections [67]. In addition, for network ranking, it should be kept in mind that the follow-up on-site visits require manageable lengths (say, in units of kilometres). Therefore, the goal was to define logical section, while also trying to minimize differences within the sections.

On motorways, average length of sections between interchanges was 5 km. These sections were thus taken as motorway sections. Subsequently, each interchange was split into conflict points (merging, diverging, crossing, etc.) and sections between them (ramps). On national roads, sections were defined in following steps:

  1. 1.

    Intersections with complete AADT data (on both major and minor roads) were identified. Sections were defined between these intersections, usually between two settlements.

  2. 2.

    In case of sections over 10 km, some of intermediate intersections (even with incomplete AADT data) were used to split the sections into two.

Note that data contained both divided and undivided roads. On divided sections, sections in two directions were defined independently. Undivided sections comprised both directions.

3.3 Definition of variables

To develop accident prediction models, three basic datasets were needed (accident data, traffic data, road data), which will be described in the following paragraphs.

3.3.1 Accident data

Geo-located accidents have been collected by Czech Traffic Police since 2007, which makes it a suitable network-wide data source. However, this data is not routinely linked to the units of interest, for example interchange conflict points and ramps. Thus for this study, the linkage was done manually in GIS environment. Unfortunately, location precision was to some extent limited, and did not allow for example distinguishing accidents in acceleration/deceleration lanes or merging zones. Therefore, instead of using them to develop separate models, they were considered a subset of road section accidents.

Typically, 3-year period of accident data is used [30]. However, since motorways have relatively low accident frequencies, 7 years of accident data (2009–2015) were used in the study.

In RSIA, modelling outputs (estimated accident frequencies) are to be further converted to accident costs and used in economic valuation of contemplated project variants. In this context, the models should be aligned with categories, used in Czech accident cost guidelines [90]. Thus, following accident variables were defined:

  • number of accidents, which involved a fatality (FAT)

  • number of accidents, which involved a severe injury (SEV)

  • number of accidents, which involved a slight injury (SLI)

  • number of accidents, which involved only property damage (PDO)

  • including combinations, such as injury accidents (FAT+SEV + SLI) and total accidents (FAT+SEV + SLI + PDO)

Accidents were split between sections and intersections based on their distance from intersections. Following Avelar et al. [13], all accidents within 100-m area around the intersection centre were considered intersection accidents.

3.3.2 Traffic data

Regarding traffic data, Czech national traffic census covers most of the primary road network; however, it was observed that it does not include interchange ramps on motorways. Since this data is required for developing interchange models, manual traffic counts needed to be performed on all ramps of approx. 450 interchanges (in more than 1000 profiles). Each profile was observed for 2 h of a working day and the results were factored up to AADT, using national guidelines [16].

Also on national roads, national traffic census data was found incomplete. AADT was missing on large number of minor roads, which intersect national roads – therefore it would not be possible to develop intersection models. Due to immense number of these intersections, it was not feasible to complement it with own survey. Therefore, simplified approach was taken, similar to the Norwegian practice [47] – i.e., not considering intersections with minor roads as individual units, but in terms of their frequency. Feasibility of this approach was previously tested in Czech conditions and found satisfactory [11].

3.3.3 Road data

Road data is digitally maintained by Road and Motorway Directorateʼs Road Databank and updated twice a year. This data was imported to authorsʼ own GIS environment and used to obtain selected road parameters. Additional information was collected manually from public online maps Mapy.cz or Google Maps.

Selection of specific explanatory variables (risk factors) should be guided by a theory, or previously documented evidence. Literature review indicated following typically used road-related variables:

  • on interchanges: horizontal alignment, ramp type (on/off), ramp length, area type (rural/urban), etc. [17, 35, 51, 78, 83]

  • on road sections: geometric characteristics (alignment), cross-section parameters (lane, median and shoulder widths), roadside hazard rating, speed-related variables, pavement quality [2, 6, 23, 39, 55]

  • on at-grade intersections: number of legs, lanes, turn lanes, type of traffic control device (signalized/unsignalized), intersection angle, sight distance, etc. [15, 18, 60, 93]

Based on the review, the following road-related explanatory variables were defined in this study:

  • Interchange conflict points were described by their type, traffic control device, presence/absence of channelization by road marking, and the number of driving directions. Interchange ramps were described by their length, type, curvature and radius.

  • For intersections on national roads, traffic control device and presence/absence of turn lanes and bypass lanes was recorded. Following previous Czech studies [3, 77], roundabouts were further described by number of legs, inscribed circle diameter, central island diameter, average of roundabout entry angles, average of deviation angles, width of circulatory lane, and width of truck apron.

  • Sections of national roads were described by length and curvature change rate (CCR). In order to consider minor intersections, their number was used; analogically, number of accesses from petrol stations or rest areas was used on motorways. Both variables were used to calculate density (frequency per 1 km). In addition, number of available parking space was used as a proxy for potential traffic flow to/from motorway rest areas.

Motorway network is located primarily outside of urban areas. Intersections on national roads were labelled according to their rural/urban location. Road sections, which comprised mix of both conditions, were described by a ratio of location in urban areas (e.g., 0.3 meaning 30% and 70% of length in urban and rural area, respectively). The list of explanatory variables, including their symbol, unit and data source, is provided in Table 2.

Table 2 List of explanatory variables, with their symbol, unit and data source

Additional Table 3 provides information on numbers of elements and descriptive characteristics of their accident data.

Table 3 Numbers of elements and descriptive characteristics of their accident data

3.4 Selection of model function form

Accident prediction model equation consists of two parts: (1) exposure, i.e., AADT (and length, in case of sections), and (2) risk factors. In literature, various forms of considering road section risk exposure may be found, for example:

  • both length and AADT in a power form [34, 39, 69]: \( {L}^{\beta_1}\bullet {(AADT)}^{\beta_2} \)

  • length as an offset and AADT in a power form [20, 80]: \( L\bullet {(AADT)}^{\beta_1} \)

  • length in exponential form [17, 61]: \( \exp \left({\beta}_1\bullet L\right)\bullet {(AADT)}^{\beta_2} \)

In a previous Czech study [4], several model function forms were compared in terms of proportion of explained systematic variation, and models with power function of section length proved better. Therefore, it was also chosen for this study. For the models of intersections or conflict points, there is a relative consensus in literature [57, 68, 72, 89]: exposure is usually defined as a product of conflicting flows or a product of major flow and proportion of minor flow. Model function forms adopted for this study, together with their linearized form, which will be used for modelling, were as follows:

  1. 1.

    for sections:

$$ \widehat{N}=\exp \left({\beta}_0\right)\bullet {L}^{\beta_1}\bullet {(AADT)}^{\beta_2}\bullet \exp \left(\sum \limits_{i=3}^n\left({\beta}_i\bullet {x}_i\right)\right) $$
(1a)

in linearized form:

$$ \ln \left(\widehat{N}\right)={\beta}_0+{\beta}_1\bullet \ln (L)+{\beta}_2\bullet \ln (AADT)+\sum \limits_{i=3}^n\left({\beta}_i\bullet {x}_i\right) $$
(1b)
  1. 2.

    for intersections:

$$ \widehat{N}=\exp \left({\beta}_0\right)\bullet {\left({AADT}_{major}\right)}^{\beta_1}\bullet {\left({AADT}_{minor}\right)}^{\beta_2}\bullet \exp \left(\sum \limits_{i=3}^n\left({\beta}_i\bullet {x}_i\right)\right) $$
(2a)
$$ \widehat{N}=\exp \left({\beta}_0\right)\bullet {\left({AADT}_{major}\right)}^{\beta_1}\bullet {\left({AADT}_{proportion}\right)}^{\beta_2}\bullet \exp \left(\sum \limits_{i=3}^n\left({\beta}_i\bullet {x}_i\right)\right) $$
(3a)
$$ \widehat{N}=\exp \left({\beta}_0\right)\bullet {\left({AADT}_{sum}\right)}^{\beta_1}\bullet \exp \left(\sum \limits_{i=2}^n\left({\beta}_i\bullet {x}_i\right)\right) $$
(4a)

in linearized form:

$$ \ln \left(\widehat{N}\right)={\beta}_0+{\beta}_1\bullet \ln \left({AADT}_{major}\right)+{\beta}_2\bullet \ln \left({AADT}_{minor}\right)+\sum \limits_{i=3}^n\left({\beta}_i\bullet {x}_i\right) $$
(2b)
$$ \ln \left(\widehat{N}\right)={\beta}_0+{\beta}_1\bullet \ln \left({AADT}_{major}\right)+{\beta}_2\bullet \ln \left({AADT}_{proportion}\right)+\sum \limits_{i=3}^n\left({\beta}_i\bullet {x}_i\right) $$
(3b)
$$ \ln \left(\widehat{N}\right)={\beta}_0+{\beta}_1\bullet \ln \left({AADT}_{sum}\right)+\sum \limits_{i=2}^n\left({\beta}_i\bullet {x}_i\right) $$
(4b)

where:

\( \widehat{N} \) :

predicted accident frequency for each severity level

L :

section length

AADT :

annual average daily traffic

AADT major :

major road AADT

AADT minor :

minor road AADT

AADT sum :

sum of entering AADTs on roundabout legs

AADT proportion :

proportion of minor road AADT to total AADT \( \left(\frac{AADT_{minor}}{AADT_{major}+{AADT}_{minor}}\right) \)

xi (i = 1, 2, …):

other explanatory variables (risk factors)

β 0 :

regression constant, to be estimated in modelling

βi (i = 1, 2, …):

regression coefficients, to be estimated in modelling

4 Modelling, results and applications

This section describes development of accident prediction models and accident modification factors and their application in road safety impact assessment and network safety ranking.

4.1 Accident prediction models

Using the described data, modelling was conducted with aim of developing accident prediction models for 11 categories (Table 1) and 8 severity levels (FAT, SEV, SLI, PDO, injury accidents, total accidents, plus combinations). The modelling, i.e. calibrating the mentioned function forms, entails estimating regression coefficients βi and obtain model equations. Explanatory variables (xi) were added stepwise, while controlling for achieved statistical significance and discarding non-significant variables. The process was performed using generalized linear modelling (GLM) procedure in IBM SPSS, with a negative binomial error structure and logarithmic link function.

However, it was found that modelling for categories of individual severity levels did not result in quality APMs, due to low frequencies of severe and fatal accidents. Therefore, consistently with Srinivasan and Bauer [75], alternative approach was taken:

  1. 1.

    Developing APM for total accident frequency (#total).

  2. 2.

    Determining average ratios of individual severity levels. For example, for fatalities (FAT):

$$ {(ratio)}_{FAT}=\# FAT/\# total $$
(5)
  1. 3.

    Regression constants of APMs for individual severity levels were obtained by multiplying the regression constant of total APM by individual ratios. For example:

$$ {\left({\beta}_0\right)}_{FAT}={\left({\beta}_0\right)}_{total}\bullet {(ratio)}_{FAT} $$
(6)

This means, that APMs for individual severity levels of a specific category have different regression constants β0, but uniform regression coefficients β1, β2, …

During modelling, effect of several explanatory variables was found insignificant, for example:

  • interchange conflict points: channelization by road marking, number of driving directions

  • motorway sections: density of accesses, available parking space

  • intersections on national roads: presence of bypass lane or turn lanes, location in rural/urban areas

  • sections of national and secondary roads: location in urban areas, curvature change rate

More information, as well as comparison with other studies, is provided in a previous paper [10]. The list of final model forms is given in Table 4. Specific values of regression parameters for each accident severity level are provided in the Appendix.

Table 4 Overview of final accident prediction model forms

4.2 Accident modification factors

In addition to accident prediction models (APMs), also accident modification factors (AMFs) are required for conducting road safety impact assessments. However, as mentioned in the literature review, while AMFs are available internationally (especially in Nordic countries or Northern America), no comparable AMF studies for Czech condition exist. The exceptions were two before-after EB studies:

  • Ambros et al. [5] estimated AMF for roundabout conversions. The obtained values were within a range indicated in an international review [29].

  • Recently, Ambros et al. [7] also estimated effectiveness of traffic signal conversions. The results were partly consistent with international values [31].

The results of both studies were therefore relatively comparable to the international findings. While more studies are definitely needed to reach definite conclusion about transferability possibilities, for the current purpose it was assumed that using international AMF values will be acceptable. The approximate values of AMFs for typical countermeasures were collated, based on international sources [26, 31, 46, 48, 58, 84] and used as a recommended list to be used for conducting RSIA.

4.3 Application in road safety impact assessment

The APMs and AMFs, described in previous text, present the background for conducting RSIA on Czech primary roads. To be useful for practitioners, two tools were provided by CDV:

  • Practical guidelines, describing both theory and practice of RSIA, including worked example application of assessing road safety impacts of a planned city bypass [8].

  • On-line application,Footnote 1 which enables selection of bypass route in a map, entering data (AADT, road category, etc.) for both route elements, and calculation of predicted accident frequency (see Fig. 1).

Fig. 1
figure 1

Illustration of a principle of the developed on-line road safety impact assessment application

The final predictions, summed over a selected route and transformed into accident costs, present the predicted costs of a contemplated project variant (for example, bypass). These may be compared to a do-nothing scenario (i.e., not building a bypass) in order to determine the expected safety change. Further comparison of these benefits to the project costs enables a simple cost-benefit analysis and economic evaluation.

4.4 Application in network safety ranking

The developed APMs were used to obtain predicted accident frequency (\( \widehat{N} \)) for each section (i). For the purpose of network safety ranking, Empirical Bayes estimate of expected accident frequency (EB) was then calculated, using predicted accident frequency, reported accident frequency (N) and length-dependent overdispersion parameter [43]. Finally potential for safety improvement (PSI) was obtained as a difference between predicted accident frequency and EB estimate [63]:

$$ {EB}_i={w}_i\bullet {\widehat{N}}_i+\left(1-{w}_i\right)\bullet {N}_i $$
(7)
$$ {w}_i={k}_i/\left({k}_i+{\widehat{N}}_i\right) $$
(8)
$$ {k}_i=k\bullet {L}_i $$
(9)
$$ {PSI}_i={EB}_i-{\widehat{N}}_i $$
(10)

where:

EB i :

Empirical Bayes estimate

w i :

weight

\( {\widehat{N}}_i \) :

predicted accident frequency

N i :

reported accident frequency

k i :

overdispersion parameter

L i :

section length

PSI i :

potential for safety improvement

Values of PSI were used for network screening and ranking. Given smaller sizes of 3-legs, 4-legs and roundabouts samples, they were combined into one group. Thus, descendent ranking enabled identifying hot spots in following five groups:

  1. 1.

    Interchange conflict points

  2. 2.

    Interchange ramps

  3. 3.

    Motorway sections

  4. 4.

    At-grade intersections on national roads

  5. 5.

    Sections of national roads

The ranked list of locations were visualized in an on-line mapFootnote 2 and handed over to the road agency, which will use them to prioritize and perform necessary steps to improve safety of Czech primary road network. Illustration is provided in Fig. 2: user may select specific elements using filter, and study their location including visualization of input data and a link to Google Street View map.

Fig. 2
figure 2

Illustration of a principle of the developed on-line map with the results of network safety ranking

5 Summary, discussion and conclusions

Conduct of evidence-based road safety impact assessment and network safety ranking needs to be based on scientific tools: accident prediction models, accident modification factors and Empirical Bayes method. While such tools already exist in some countries, their transferability is limited. This motivated the authors to develop these tools in the Czech conditions, and describe this exercise in this paper. Unlike a number of similar international studies, which usually dealt only with a selected road category, the data and developed models had to consider not only road sections, but also intersections and interchanges. The focus was on Czech primary road network, consisting of motorways and national roads. In addition, the focus was widened to include also secondary roads, in order to enable assessment of impacts on adjacent road network.

The developed models enabled both identification of hazardous road locations (network safety ranking) and evaluation of safety of planned infrastructure (road safety impact assessment). The models also provided interpretation of influence of statistically significant risk factors. Effects of typical variables (AADT, length, number of intersection legs, etc.) were consistent with literature [68]. On the other hand, several variables did not have sufficiently significant effect (for example channelization, parking space along motorway or intersection bypass lanes). However, it is important to note, that the reported effects are only associations; in order to capture causal effects, one would need to conduct before-after studies. Another caveat is that although original dataset was relatively large, there is always a risk of omitted variable bias [54]. Examples of variables, which were not taken into account, may include for example speed behaviour. Another issue was use of proportions of observed accident severities in obtaining accident severity frequencies, caused by sample size limitations. While this has been used by some authors [58], it was noted as potentially biased, due to ignoring correlations between severities and traffic volumes [75].

Nevertheless, the current study helped establishing the framework for two practical procedures: road safety impact assessment and network safety ranking, including the examples of developed on-line tools. It may serve as an example for other member countries, which also lack their local tools. Follow-up studies may focus on future model updating and improvements (for example by adding some less available data such as grade, vertical alignment, pedestrian traffic or land use), as well as development of Czech accident modification factors to be used in selecting the most suitable countermeasures.