Introduction

Accidents are occurring more and more frequently due to the continuous and rapid development of road traffic, causing an increase in the numbers of injuries and fatalities each year [48]. According to the “Global Plan for the Decade of Action for Road Safety 2011–2020” released by the World Health Organization, the annual global death toll caused by road traffic related collisions is nearly 1.25 million [55]. Moreover, traffic accidents have become the third leading cause of death worldwide [32]. From an economic point of view, the losses caused by traffic accidents account for about 3% of the Gross Domestic Product (GDP) of low- and middle-income countries [55]. Road transportation is a complex and open system, which is affected by many factors such as the environment, infrastructure, hardware, software, human factors, and their interaction.

In European countries, safety management action has been implemented through Safety Management Systems (SMS) since the 1970s. The concepts in SMS cover the managerial actions for all accidental risks to life, health, property and the environment [15]. Li and Guldenmund [29] pointed out that new technologies, audit tools and standard procedures have been developed to improve the SMS over the past 20 years. There are also several other widely used risk management models, including the Systems-Theoretic Accident Model and Processes (STAMP) developed by Newnam and Goode [34], the Haddon matrix [51], Human Factors Analysis and Classification System (HFACS) [45], and many others.

In recent years, many studies have focused on the use of HFACS for transport system risk assessment and management studies that have proved its effectiveness and reliability. This model describes the four levels of human negligence, namely “unsafe behavior,” “preconditions for unsafe behavior,” “unsafe supervision,” and “organizational influence.” Reinach and Viale [41] improved the original framework and introduced a fifth aspect, “external factors,” as the highest level to optimize the relevant failure modes within the entire framework. The advantage of HFACS is that it can actively track the causes of accidents and find and determine the root cause of hazardous events. In addition, the model can effectively and reasonably analyze the human factors in system problems as they happen. It provides a wider range of theoretical and methodological support for accident analysis and prevention in safety science. This model has been widely used in various fields for the analysis of natural gas pipelines [53], chemical storage [43], and mining safety [27]. In regards to road safety, Zhang et al. [56] explored the causal relationships between traffic accident factors, and discussed the interaction between the various HFACS levels to establish a systematic assessment method. Their results showed “unsafe behavior” to be the most frequent failure mode in road accidents. The priorities for accident categories are rollover, leaks, fire, explosion, and collision. Zhang et al. [55] also identified “unsafe behavior” as one of the most important failure modes for traffic accidents. After a review of many bus accidents they found the four most important human factors to be “organizational process,” “insufficient supervision,” “personal readiness,” and "violations.” Obviously, there are multiple paths which can lead to failure while passing through from the bottom to the top level, which is a strong implication of the interdependence between the various human factors.

Although HFACS is a useful tool to explore the possible failure modes in risk analysis, it is still insufficient when used alone for road accident analysis. First, it can only identify the potential failure modes, because of the lack of quantitative risk assessment techniques [55]. Second, it does not consider the severity, occurrence or detection of the different failure modes [26]. Third, it is unable to identify or prioritize the risk degree of the failure modes. Therefore, this study proposes a novel multi-criteria decision-making (MCDM) based risk assessment model for the improvement of road traffic safety, especially in relation to inter-city bus traffic. The model uses HFACS to identify the potential failure modes of road accidents. Failure mode and effects analysis (FMEA) is then applied to the proposed risk factors, including severity (S), occurrence (O), detection (D), and expected cost (E), to review the potential failure modes and obtain a risk assessment matrix. Many novel FMEA models include E as a practical risk factor, which reflects the necessity of resource allocation [31, 32].

Finally, the Z-number-based best–worst method (Z-BWM) is used to determine the weights of the risk factors, and Z-number-based weighted aggregated sum product assessment (Z-WASPAS) is applied to calculate the risk degree of the potential failure modes. The experts’ judgements integrated by using the Z-number method are necessary applied to estimate the failure modes because there are insufficient data to estimate the severity, occurrence and so on for most of the failure modes. The Z-number allows one to effectively integrate the uncertain or vague information of the experts’ judgements. The Z-number-based analytic hierarchy process (Z-AHP) requires an excessive number of pairwise comparisons and has poor consistency but Z-BWM improves these problems. In addition, the calculation process of Z-WASPAS is easier to understand and carry out than either the Z-number-based VIseKriterijumska Optimizacija I Kompromisno Resenje (Z-VIKOR) or Z-number-based technique for order preference by similarity to ideal solution (Z-TOPSIS). Z-WASPAS also includes the concept of the Risk Priority Number (RPN) found in the original FMEA calculation. The proposed model identifies potential failure modes for every aspect and prioritizes the risk of failure modes in road traffic accidents. Finally, a case study based on data for inter-city bus traffic in Taiwan is carried out as an empirical example and managerial implications are provided. The specific contributions and characteristics of this research are summarized below.

  1. 1.

    This study develops a novel risk assessment model, which uses HFACS to summarize the potential failure modes for inter-city bus transportation.

  2. 2.

    The risk factors for assessment are determined based on FMEA theory, to obtain the S, O, and D. In addition, considering the actual situation of resource constraints, a fourth risk factor, “E” is added.

  3. 3.

    Z-numbers are used to reflect the uncertainty and confidence level of the experts’ risk assessment responses.

  4. 4.

    Z-BWM is used to overcome the problem of too many pairwise comparisons required by classic analytic hierarchy process method.

  5. 5.

    Z-WASPAS is used to calculate the risk level of the failure modes and decide the priorities for improvement.

  6. 6.

    An empirical case study of an inter-city bus system in Taiwan is carried out to help understand the practical implications.

The rest of the paper is organized as follows. "Literature review" gives a brief review of recent related FMEA and HFACS studies, and points out gaps in the research in this field. The calculation processes for HFACS, FMEA, Z-BWM and Z-WASPAS used in this study are described in "The proposed hybrid model". In "Illustration in a real case study", the inter-city Taiwan bus system used to illustrate the proposed model is outlined. Some theoretical and practical implications are discussed in "Discussion". Finally, a summary of the findings, contributions of this research, and future research directions is included in "Conclusions and future work".

Literature review

Road traffic safety and accidents

Road transport is comprised of a complex system involving personnel, vehicles, roadways, and the surrounding environment. To effectively reduce risks in such a complex system, it is necessary to have a thorough understanding of the various factors that lead to traffic accidents, for instance, the design of vehicles, roads and traffic systems, driving regulations, as well as road and vehicle maintenance. Peden et al. [38] pointed out that the most common causes of traffic accidents arise from infrastructure, environmental, vehicular and human factors. Stanton and Salmon [46] argued that there is no effective structured method for assessing the effect of human error in road transportation accidents. An effective system for accurate classification of various causative factors is also lacking, although they found that human error and driving behavior errors triggered about 75% of traffic accidents. Di Pasquale et al. [12] found human error-related factors to be the cause of about 60–90% of accidents in the road transport system, with the remaining accidents being attributable to technical defects. Thus, about 85% of road transport accidents are caused by human error.

Numerous studies exploring the causes of traffic accidents have appeared in the literature. According to Laaraj and Jawab [22], the current methods can be divided into two categories: traditional and systematic. Traditional methods mainly focus on exploring human factor errors, such as the driver's personality traits, driving while fatigued, and deviations in driving behavior. Beanland et al. [6] analyzed responses to driver behavior questionnaires to uncover the personality traits that lead to abnormal driving behaviors. Jiang et al. [20] used detailed statistical data from 45 serious traffic accidents to sort out key error factors in terms of “direct cause”, “type of accident”, and “accident responsibility”, which included “speeding”, “improper driver operation”, “vehicle overload”, “fatigue driving”, and “bad driving habits.” The traditional methods focus on driver error, but this perspective might be too narrow to analyze the multi-dimensionality and complexity of a transportation system.

Systematic methods, on the other hand, consider overall factors (e.g., environmental, human or machine) and their interaction within the road system. These methods include the accident map (Accimap) [39], cognitive reliability and error analysis method (CREAM) [47], HFACS [50], STAMP [23], etc. System analysis models are more suitable than traditional methods for understanding problems in the real world. However, both types of methods show that human factors, especially driving behavior, are the key cause of road traffic accidents. Thus, a comprehensive analysis of the effect of human factors in the system is crucial.

HFACS

HFACS was developed for the classification of human errors. It was originally proposed by Shappell and Wiegmann [45] as a tool for investigating human factors and database classification structure. It was mainly used by the US military to investigate and analyze the human causes of aviation accidents at that time. The causality framework for human caused accidents proposed by Reason [40] used in this framework is based on the identification of “active failures” and “latent failures” in human error (the so-called Swiss cheese model of human error) distilled into four levels, namely, “unsafe behavior,” “preconditions for unsafe behavior,” “unsafe supervision,” and “organizational influence” [50].

The HFACS model has been adapted to make it more suitable for application in different circumstances, such as for the investigation of errors in railway, sea freight, healthcare, mining, and construction settings, to name a few. HFACS has been combined with other methods for risk analysis as a means to remedy the shortcomings of traditional human error identification methods. For example, Celik and Cebi [8] combined the Fuzzy AHP with HFACS to analyze the human factors leading to marine accidents. Wei et al. [49] combined HFACS with expert subjective assessment methods and Grey System Theory to analyze human error in aviation accidents. Chiu et al. [10] used a fuzzy TOPSIS in combination with HFACS to assess air transportation maintenance tasks. Akyuz [2] used the analytic network process (ANP) combined with the HFACS to explore serious gas leakage incidents by liquefied petroleum gas carriers. Hsieh et al. [18] combined AHP and TOPSIS with HFACS to evaluate human error in Intensive Care Units (ICUs) in Taiwan. Havle and Kılıç [16] applied a combination of triangular fuzzy numbers and AHP with HFACS to analyze air traffic navigation errors in the North Atlantic. Chen et al. [9] combined interval type-2 fuzzy numbers and Prospect Theory with HFACS to explore the human factors leading to vessel accidents.

In addition, there have been many studies focusing on verifying the reliability of HFACS [11, 13, 28, 35, 36] proving that HFACS can be used to analyze human factors in a system or organization. Zhang et al. [55] and Zhang et al. [56] used HFACS to investigate the factors influencing major road traffic accidents in China. They summarized the human factors into the five categories of “unsafe behavior,” “preconditions for unsafe behaviors,” “unsafe supervision,” “organizational influence,” and “external factors.” Past studies have shown that the consideration of external factors can make the HFACS model broader in scope. It can be applied for comprehensively and systematically identifying the factors of active failure and latent failure in an organization. It can further help the user to understand potential causes and correlations that have led to the accident. However, the HFACS model does not take into account that information may be difficult to quantify or that part of the information may be missing, and lacks the ability to assess the S, O, and D of the real problem.

FMEA combined with MCDM

The FMEA model was developed by the Grumman Aircraft Corporation in the 1960s to analyze the flight control systems of naval aircraft [14]. It is a proactive risk assessment technology [7]. Over the past 50 years, it has been used in various fields, especially for the evaluation of the failure modes of products or systems. Failure is unavoidable in most systems. It is important to eliminate possible failures and prevent system-related problems in advance. The FMEA assessment is based on three risk factors, the S, O, and D, which together comprise the RPN = S × O × D, which is quantified based on the opinions of experts expressed through linguistic variables or through estimated values and critical analysis of the failure modes [25]. However, the RPN has often been criticized as having the following shortcomings [31]:

  1. 1.

    The FMEA assumes that these three factors have same importance weights.

  2. 2.

    Different S, O, and D ratings may produce the same RPN value.

  3. 3.

    The RPN calculation is too simplistic and problems exist because the multiplication process makes the results sensitive to any small change in the evaluation of the risk factors.

  4. 4.

    Only three factors are considered while other important factors, such as cost, correction time, etc. are ignored.

Many methods have been proposed to improve the traditional FMEA by overcoming the shortcomings mentioned above. For example, Liu et al. [24] reviewed 169 papers related to the application of MCDM method published between 1979 and 2018 in efforts to solve the shortcomings and improve the evaluation efficiency of the traditional FMEA. In the real world, evaluation teams are often composed of experts with different areas of expertise and from different departments. Uncertainties in their assessment can arise due to time pressure, insufficient information, differences in linguistic scores and differences of opinion between the experts, or simply because of the vagaries of the risk assessment process itself, as well as inaccuracy, uncertainty and hesitation. Uncertainty theory methods have been applied to overcome these problems. The most common approach has been to use fuzzy set theory to deal with the uncertainty issue, followed by evidence theory, intuitionistic fuzzy set theory, 2-tuples fuzzy language theory, etc. In recent years, FMEA has been applied in combination with novel fuzzy theories and linguistic variable methods. For example, Jiang et al. [19] combined FMEA with Z-numbers for the evaluation of aircraft turbine runner blades; Mohsen and Fereshteh [33] merged FMEA with Z-numbers to study the failure modes of geothermal power plant systems; Lo and Liou [30] combined FMEA with grey theory to explore potential failure modes of smartphones. Clearly, the problems of ambiguity, inaccuracy, and uncertainty in decision-making and judgment of risk assessment can be more effectively dealt with using uncertainty theory. However, to date, no study has used combined Z-numbers with the BWM method to increase the accuracy of this method for calculating the importance of the risk factors.

Research gap

A review of the literature review shows little effort to integrate HFACS and FMEA to form a comprehensive risk assessment framework for traffic safety and transportation management. Therefore, in this study, we first use HFACS to explore the potential failure modes of intercity bus traffic accidents. Then, FMEA is applied to evaluate the risk factors and obtain the risk evaluation matrix. Finally, Z-BWM is used to determine the weights of the risk factors, and Z-WASPAS is used to calculate degree of risk of the failure modes. In the real world, experts are prone to “fuzzy, uncertain, and inaccurate” risk assessments. Therefore, Z-numbers are applied to represent the reliability of this fuzzy information. The evaluation of road traffic risk through Z-BWM combined with Z-WASPAS allows us to obtain more reliable results.

The proposed hybrid model

HFACS

The HFACS is a classification method for the exploration of human error mainly for the analysis of accidents caused by human behavior. It was first used by the US Air Force to investigate and analyze the human factors leading to aviation accidents. This method was developed by Shappell and Wiegmann [45] within the framework of human accident causality based on the so-called Swiss cheese model of human error originally proposed by Reason [40].

As in the Swiss cheese model, there are four layers in the HFACS framework. Each layer represents a different level of protection in the system, with the upper level affecting the next levels. The aim in risk management, is to try to reduce the errors that occur at all levels in the system, in order to reduce the occurrence of accidents. This framework has been widely applied in different fields, for rail transport, shipping, medical care and biochemistry, but especially in transportation systems, attracting the time and energies of a large number of researchers. However, it has been found that there are still shortcomings to human error identification methods. To improve the shortcomings of the traditional HFACS model for accident analysis, it is combined other methods. For example, the Fuzzy AHP has been combined with the HFACS framework to identify the key factors of human error in the control room of a nuclear power plant [21]. The AHP and fuzzy TOPSIS were combined with the HFACS method, to identify and prioritize failure modes based on the analysis of 157 adverse medical events [17]. Sarıalioğlu et al. [44] used fuzzy fault tree analysis (FTA) combined with an HFACS model to analyze vessel engine room fires and explosion accidents. The afore-mentioned studies have proven that the systematic and complete model architecture of HFACS can effectively and reasonably analyze the human factors that cause system problems.

FMEA

FMEA is widely used in product design, production processes, and for quality assurance or system operation safety and reliability requirements. This method requires the formation of an expert team from different professional fields for identification of failure modes based on the collected opinions of the team members. The failure modes are then sorted according to their risk level. This is a proactive assessment technology for the elimination of potential risks [7]. The evaluation and selection of key factors is the first step in risk assessment using FMEA. In the traditional model, the experts use linguistic scoring methods to assess the severity of the three risk factors of severity, occurrence, and detection. The scores are then simply multiplied to obtain the risk priority number for analyzing the failure mode [25].

In this study, it is assumed that any accidents in an intercity bus system will cause significant casualties and financial losses, so a fourth risk factor, E, is added. Table 1 shows the four risk factors of the risk assessment scale. The corresponding linguistic variables are given in the table, and responses range in value from 1 to 10. The higher the value of each risk factor, the higher the risk of the failure mode. Severity refers to the magnitude of the “impact” of system failure. If the problem is serious, a higher severity value will be assigned. Occurrence rate refers to the frequency with which failure modes may occur and is described qualitatively. Detection refers to the possibility of detecting and discovering the failure before it occurs [4]. The expected cost refers to the cost for the implementation of risk prevention measures. If the costs of the occurrence of failure and action for improvement are underestimated or ignored, it may lead to greater losses. The final risk priority index is used to assess the severity of risks and failures, and to judge the priority for improvement.

Table 1 The four risk factors on the risk assessment scale

The original RPN index ignores the relative importance of the risk factors in the key analysis. Hence, various weighting methods have been used to determine the weights of the risk factors [24]. The shortcomings of the traditional FMEA risk assessment method can be overcome by combining it with the MCDM method. The BWM is chosen as the weighting method in this study because of its ease-of-operation for the decision maker. Also, fewer pairwise comparisons are needed than for the popular AHP method. In addition, considering the ambiguity, inaccuracy, and uncertainty of the expert judgments, this study uses Z-numbers to integrate this vague and uncertain information.

Z-BWM

The Z-numbers, originally proposed by Zadeh [52] as a soft calculation method for dealing with incomplete or uncertain information. The Z-number contains two kinds of fuzzy information, the evaluation value, and the evaluation confidence. The degree of certainty of a fuzzy event can be measured by the “probability” and the “confidence.” With the application of Z-numbers these two types of information are converted into a set of fuzzy numbers. The Z-BWM results clearly reflect the degree of confidence in the expert judgements, so as to obtain more realistic appraisal results from the experts. For a more detailed descriptions of the original BWM method see Rezaei [42].

The scale used for pairwise comparison of the importance between risk factors is as follows: “Equally Important (EI) (1, 1, 1),” “Weakly Important (WI) (2, 3, 4),” “Fairly Important (FI) (4, 5, 6),” “Very Important (VI) (6, 7, 8),” “Absolutely Important (AI) (8, 9, 9).” The five-point scale adopted for confidence assessment of expert judgments includes “Very Low (VL) (0, 0, 0.3),” “Low (L) (0.1, 0.3, 0.5),” “Medium (M) (0.3, 0.5, 0.7),” “High (H) (0.5, 0.7, 0.9),” “Very High (VH) (0.7, 1, 1).” The linguistic variables and corresponding membership functions for Z-BWM are shown in Table 2. For example, when the decision maker evaluates the importance of factor S in comparison to factor O, it is given as “FI,” and the degree of confidence in the evaluation scale is “H”. The corresponding membership function is thus (3.35, 4.18, 5.02).

Table 2 The linguistic variables for Z-BWM and their membership functions

The steps of Z-BWM are summarized as follows:

Step 1. Decide on the most and least important risk factors

From the risk factors defined by the FMEA method, select the most important (i.e., most satisfactory, best, or most preferred) and the least important (i.e., most unsatisfactory, worst, or least preferred).

Step 2. Compare the most important factor with the other factors to produce the Z-BO vector

The relative importance of the most important risk factor and other risk factors is assessed by the experts using the evaluation scale, as shown in Table 2. The linguistic variable assessments range from “〈EI, VL〉, (1, 1, 1)” to “〈AI, VH〉, (3.33, 3.80, 4.28).” Each evaluation item is given a set of Z-numbers. The Z-BO vector can be generated by

$$ \otimes A_{Bj} = \left( { \otimes a_{B1} {, } \otimes a_{B2} {,} \ldots {, } \otimes a_{Bn} } \right), $$
(1)

where “\(\otimes\)” is represented as a Z-number and \(\otimes a_{Bj}\) represents the importance of the most important risk factor B relative to risk factor j. The comparison between the most important risk factor and itself must be “EI” and “VH” (i.e., \(\otimes a_{BB}\) = (1, 1, 1)).

Step 3. Compare the remaining factors with the least important factor to produce the Z-OW vector

Similar to Step 2, the relative importance of the other risk factors is assessed by the experts by comparison with the least important risk factor, to generate the Z-OW vector, as follows:

$$ \otimes A_{jW} = \left( { \otimes a_{1W} {, } \otimes a_{2W} {,} \ldots {, } \otimes a_{nW} } \right)^{T} , $$
(2)

where \(\otimes a_{jW}\) represents the importance of the remaining risk factor j relative to the least important risk factor W, and \(\otimes a_{WW}\) = (1, 1, 1).

Step 4. Calculate the optimal weight value of each risk factor \(\left( { \otimes w_{1}^{*} {, } \otimes w_{2}^{*} {,} \ldots {, } \otimes w_{n}^{*} } \right).\).

The two vectors of Z-BWM (Z-BO and Z-OW vectors) can be converted into a nonlinear constrained optimization problem. Please refer to Aboutorab et al. [1] for details about the conversion process and principles. The nonlinear mathematical equations generated by Z-BWM are

Min \(\otimes \zeta\).

$$ {\text{s.t.}} \left\{ {\begin{array}{*{20}c} {\left| { \otimes w_{B} - \otimes a_{Bj} \cdot \otimes w_{j} } \right| \le \otimes \zeta } \\ {\left| { \otimes w_{j} - \otimes a_{jW} \cdot \otimes w_{W} } \right| \le \otimes \zeta } \\ {\sum\limits_{j = 1}^{n} {R\left( { \otimes w_{j} } \right) = 1} } \\ {l_{j}^{w} \le m_{j}^{w} \le u_{j}^{w} } \\ {l_{j}^{w} \ge 0} \\ {j = 1{, }2{,} \ldots {, }n} \\ \end{array} } \right.. $$
(3)

where \(\otimes \zeta = \left( {l^{\zeta } {, }m^{\zeta } {, }u^{\zeta } } \right)\), \(\otimes w_{j} = \left( {l_{j}^{w} {, }m_{j}^{w} {, }u_{j}^{w} } \right)\), \(\otimes a_{Bj} = \left( {l_{Bj} {, }m_{Bj} {, }u_{Bj} } \right)\), and \(\otimes a_{jW} = \left( {l{}_{jW}{, }m_{jW} {, }u_{jW} } \right)\). Considering \(l^{\zeta } \le m^{\zeta } \le u^{\zeta }\), we assume \(\otimes \zeta^{*} = \left( {k^{*} {, }k^{*} {, }k^{*} } \right)\), \(k^{*} \le l^{\zeta }\). Equation (3) can be converted to Eq. (4):

Min \(\otimes \zeta^{*}\).

$$ {\text{s.t.}} \left\{ {\begin{array}{*{20}c} {\left| {\left( {l_{B}^{w} {, }m_{B}^{w} {, }u_{B}^{w} } \right) - \left( {l_{Bj} {, }m_{Bj} {, }u_{Bj} } \right) \cdot \left( {l_{j}^{w} {, }m_{j}^{w} {, }u_{j}^{w} } \right)} \right| \le \left( {k^{*} {, }k^{*} {, }k^{*} } \right)} \\ {\left| {\left( {l_{j}^{w} {, }m_{j}^{w} {, }u_{j}^{w} } \right) - \left( {l_{jW} {, }m_{jW} {, }u_{jW} } \right) \cdot \left( {l_{W}^{w} {, }m_{W}^{w} {, }u_{W}^{w} } \right)} \right| \le \left( {k^{*} {, }k^{*} {, }k^{*} } \right)} \\ {\sum\limits_{j = 1}^{n} {R\left( {\frac{{l_{j}^{w} + 4 \cdot m_{j}^{w} + u_{j}^{w} }}{6}} \right) = 1} } \\ {l_{j}^{w} \le m_{j}^{w} \le u_{j}^{w} } \\ {l_{j}^{w} \ge 0} \\ {j = 1{, }2{,} \ldots {, }n} \\ \end{array} } \right.. $$
(4)

The optimal fuzzy weights \(\left( { \otimes w_{1}^{*} {, } \otimes w_{2}^{*} {,} \ldots {, } \otimes w_{n}^{*} } \right)\) can be obtained by solving Eq. (4).

Step 5. Integrate the Z-BWM analysis results from the experts

This study uses the arithmetic mean to integrate the Z-BWM results from the experts, that is, the obtained multiple sets of weights are averaged to generate the criterion group weights \(\otimes w_{j}^{agg}\).

Z- WASPAS

In the original FMEA method, the risk priority number is obtained simply by multiplying the three risk factors together. However, different combinations might generate the same RPN number and they are strongly sensitive to variations in the RPN element evaluations [31]. To remedy the above shortcomings, WASPAS, a hybrid MCDM method, was developed by combining two systems the weighted sum model (WSM) and weighted product model (WPM). This method has been used to deal with various MCDM problems. Its ranking accuracy is higher than the traditional methods [3, 5]. The Z-WASPAS method proposed in this paper is used to determine the risk scores of the failure modes and their ranking. The Z-WASPAS operation steps are as follows:

Step 1. Obtain the initial assessment matrix

The linguistic variables in Table 1 are applied to identify the scores of the failure modes. Next, the experts assign confidence levels for each score according to the five-level scale: “Very Low (VL) (0, 0, 0.3),” “Low (L) (0.1, 0.3, 0.5),” “Medium (M) (0.3, 0.5, 0.7),” “High (H) (0.5, 0.7, 0.9),” or “Very High (VH) (0.7, 1, 1).” The linguistic variables for Z-WASPAS and its corresponding membership functions are listed in Table 3.

Table 3 The linguistic variables for Z-WASPAS and their membership functions

Assume that the FMEA team has k experts. These experts assess the score of failure mode i for risk factor j to obtain the initial D assessment matrix (Eq. 5), where k = 1, 2,…, p; j = 1, 2,…, n; and i = 1,2,…, m.

$$ \otimes {D}^{(k)} = \left[ {\begin{array}{*{20}c} { \otimes d_{11}^{(k)} } &\quad { \otimes d_{12}^{(k)} } &\quad \cdots &\quad { \otimes d_{1n}^{(k)} } \\ { \otimes d_{21}^{(k)} } &\quad { \otimes d_{22}^{(k)} } &\quad \cdots &\quad { \otimes d_{2n}^{(k)} } \\ \vdots &\quad \vdots &\quad \ddots &\quad \vdots \\ { \otimes d_{m1}^{(k)} } &\quad { \otimes d_{m2}^{(k)} } &\quad \cdots &\quad { \otimes d_{mn}^{(k)} } \\ \end{array} } \right], $$
(5)

where \(\otimes d_{ij}^{(k)}\) represents the score of the kth expert’s assessment of failure mode i under risk factor j, and \(\otimes d_{ij}^{(k)} = \left( { \otimes d_{ij}^{(k),l} , \, \otimes d_{ij}^{(k),m} , \, \otimes d_{ij}^{(k),u} } \right)\).

Step 2. Use Eq. (6) to calculate the average assessment matrix

$$ \otimes {X} = \left[ {\begin{array}{*{20}c} { \otimes x_{11}^{{}} } &\quad { \otimes x_{12}^{{}} } &\quad \cdots &\quad { \otimes x_{1n}^{{}} } \\ { \otimes x_{21}^{{}} } &\quad { \otimes x_{22}^{{}} } &\quad \cdots &\quad { \otimes x_{2n}^{{}} } \\ \vdots &\quad \vdots &\quad \ddots &\quad \vdots \\ { \otimes x_{m1}^{{}} } &\quad { \otimes x_{m2}^{{}} } &\quad \cdots &\quad { \otimes x_{mn}^{{}} } \\ \end{array} } \right], $$
(6)

where \(\otimes x_{ij} = \frac{{\sum\nolimits_{k = 1}^{p} { \otimes d_{ij} } }}{k}\) and \(\otimes x_{ij}^{(k)} = \left( { \otimes x_{ij}^{(k),l} , \, \otimes x_{ij}^{(k),m} , \, \otimes x_{ij}^{(k),u} } \right)\).

Step 3. Calculate the normalized matrix

To unify the units and score range of the assessment risk factors, the normalized matrix Eq. (7) is obtained through Eq. (8) as follows:

$$ \otimes {P} = \left[ { \otimes p_{ij} } \right]_{m \times n} , $$
(7)
$$ \otimes p_{ij} = \frac{{ \otimes a_{ij} }}{{{10}}}. $$
(8)

Step 4. Calculate the performance indexes WSM and WSP

In this step, the group weights obtained for Z-BWM are used as the parameter values for the calculation of performance indexes WSM and WSP.

$$ \otimes SQ_{i} = \sum\limits_{j = 1}^{n} { \otimes p_{ij} . \otimes w_{j}^{agg} } , $$
(9)
$$ \otimes PQ_{i} = \prod\limits_{j = 1}^{n} {( \otimes p_{ij} )^{{ \otimes w_{j} }} } . $$
(10)

Step 5. Calculate the integrated risk index \(\varphi\).

The centroid method is used to deburr the fuzzy value, for example, \(\otimes \theta = \left( {\theta^{l} , \, \theta^{m} , \, \theta^{u} } \right)\), to obtain the crisp value (\(\theta\)), as shown in Eq. (11).

$$ \theta = \frac{{\theta^{l} + 4 \cdot \theta^{m} + \theta^{u} }}{6}. $$
(11)

The next step is to de-fuzzify all fuzzy numbers using Eq. (11). Both WSM and WSP are considered in Z-WASPAS, and the preference ratio of these two is determined by the parameter. The integrated risk index \(\varphi\) is

$$ \varphi_{i} = \lambda sQ_{i} + (1 - \lambda )pQ_{i} . $$
(12)

The failure modes are ranked according to the index \(\varphi\). The failure mode with the highest performance value \(\varphi\) has the highest priority for improvement.

Illustration in a real case study

This study uses data for inter-city buses in Taiwan for empirical analysis. Many highway bus accidents, often causing a large number of casualties, have occurred in recent years. For example, in September 2017, an inter-city highway bus dodging another vehicle lost control and crashed into a guardrail, resulting in 6 deaths and 11 injuries. Another highway bus roll-over accident which occurred in June 2019 caused 3 deaths and 13 injuries. In addition to the injuries and deaths, serious traffic accidents also pose a huge challenge to the economy. The annual economic losses caused by traffic accidents accounted for about 3.13% of Taiwan’s GDP [54]. All of this makes effective reduction of possible accidents before they occur an urgent issue for government.

Identifying the failure modes through HFACS

The literature review in "Literature review" shows that although the HFACS has been used for comprehensive risk assessment in various fields, most of these studies did not consider the impact of external factors. This study thus adds “external factors” to the HFACS analysis. Initially, 5 types of human factors and 15 failure modes were derived from the literature review. Then, 10 experts were invited as an assessment team. The team members included supervisors from government agencies, senior researchers from a transportation institute, mangers of inter-city bus companies, and accident investigators. The experts have an average of more than 10 years’ experience in related fields with rich levels of professionalism, knowledge, and industry-related practical experience, thus, their assessment results are deemed credible.

The five human factors include “external factors,” “organizational influence,” “unsafe supervision,” “preconditions for unsafe behavior,” and “unsafe behavior.” Based on the HFACS analysis and two rounds of interviews, some of the initial failure modes were modified and five more added to more accurately reflect the operating environment in Taiwan. Two failure modes from external factors and 18 failure modes from internal factors comprise the evaluation framework in this case study. The two failure modes for external factors are “Regulatory omissions (FM1)” and “Administrative omissions (FM2)”. The 18 failure modes within internal organizations include “Human resources (FM3),” “Financial resources (FM4),” “Equipment and resources (FM5),” “Safety atmosphere (FM6),” “Organizational operation (FM7),” “Insufficient supervision (FM8),” “Incompletely planned operating mechanism (FM9),” “Failure to correct known errors (FM10),” “Regulatory violations (FM11),” “Personal readiness (FM12),” “Human resource management (CRM) (FM13),” “Poor mental state (FM14),” “Poor physical state (FM15),” “Physical/Psychological limitations (Intelligence) (FM16),” “Driver decision-making error (FM17),” “Driver operating error (FM18),” “Driver perception error (FM19),” and “Driver violations (FM20).”

The five types of human factors and the final 20 failure modes identified by the HFACS method are listed in Fig. 1 and described in detail in Table 4.

Fig. 1
figure 1

Proposed HFACS framework for analysis of potential failure modes for inter-city buses

Table 4 Proposed failure mode evaluation

The HFACS framework is a useful tool to assist in the investigation and establishment of preventive measures for traffic accidents. The expert team can effectively and systematically distinguish potential failure modes that cause accidents within the organization. However, the failure modes obtained by the HFACS model cannot quantify their degree of risk and lacks a tool for the assessment of the severity, frequency, and measurability of actual problems.

To remedy the shortcomings of the original FMEA method, the modified model considers factor importance and the RPN value calculated by the WASPAS method. In addition, Z-numbers are incorporated to consider the ambiguity and uncertainty of the experts’ survey responses. The proposed model uses Z-numbers combined with BWM to determine the importance of the risk factors, and the WASPAS method to prioritize the failure modes.

Weighted RPN elements through Z-BWM

The advantages of the Z-BWM and its calculation process are explained in detail in "Z-BWM". First, each expert is required to select the most important and least important risk factors. The steps to obtain the Z-BO and Z-OW vectors from each expert are based on the evaluation scale in Table 2. The results obtained from professional feedback from the 10 experts are shown in Tables 5 and 6. For example, as indicated in Table 5, the first expert believes that S is the most important risk factor. Therefore, compared with other risk factors, the Z-BO vector formed by S is {(1, 1, 1), (3.35, 4.18, 5.02), (7.59, 8.54, 8.54), (4.24, 4.95, 5.66)}. Similarly, D is selected as the least important risk factor, and the transposed Z-OW vector is {(7.59, 8.54, 8.54), (5.02, 5.86, 6.69), (1, 1, 1), (4.24, 4.95, 5.66)}, as shown in Table 6. All experts’ responses are processed similarly. Before performing the Z-BWM calculation, the consistency ratio (CR) is calculated for all BWM questionnaire responses to confirm the logic and reliability of the experts’ answers. The average CR value is 0.035, indicating a high degree of consistency [42].

Table 5 Z-BO vectors
Table 6 Z-OW vectors

Expert 1’s Z-BO factors and Z-OW factors are used as an example to demonstrate calculation process of the Z-BWM, the nonlinear mathematical model, as shown in Eq. (13).

Min \(\otimes \zeta^{*}\).

$$ {\text{s.t.}} \left\{ {\begin{array}{*{20}c} {\left| {\left( {l_{S}^{w} {, }m_{S}^{w} {, }u_{S}^{w} } \right) - \left( {{3}{\text{.35, 4}}{.18, 5}{\text{.02}}} \right) \cdot \left( {l_{O}^{w} {, }m_{O}^{w} {, }u_{O}^{w} } \right)} \right| \le \left( {k^{*} {, }k^{*} {, }k^{*} } \right){;}} \\ {\left| {\left( {l_{S}^{w} {, }m_{S}^{w} {, }u_{S}^{w} } \right) - \left( {{7}{\text{.59, 8}}{.54, 8}{\text{.54}}} \right) \cdot \left( {l_{D}^{w} {, }m_{D}^{w} {, }u_{D}^{w} } \right)} \right| \le \left( {k^{*} {, }k^{*} {, }k^{*} } \right){;}} \\ {\left| {\left( {l_{S}^{w} {, }m_{S}^{w} {, }u_{S}^{w} } \right) - \left( {{4}{\text{.24, 4}}{.95, 5}{\text{.66}}} \right) \cdot \left( {l_{E}^{w} {, }m_{E}^{w} {, }u_{E}^{w} } \right)} \right| \le \left( {k^{*} {, }k^{*} {, }k^{*} } \right){;}} \\ {\left| {\left( {l_{O}^{w} {, }m_{O}^{w} {, }u_{O}^{w} } \right) - \left( {{5}{\text{.02, 5}}{.86, 6}{\text{.69}}} \right) \cdot \left( {l_{D}^{w} {, }m_{D}^{w} {, }u_{D}^{w} } \right)} \right| \le \left( {k^{*} {, }k^{*} {, }k^{*} } \right){;}} \\ {\left| {\left( {l_{E}^{w} {, }m_{E}^{w} {, }u_{E}^{w} } \right) - \left( {{4}{\text{.24, 4}}{.95, 5}{\text{.66}}} \right) \cdot \left( {l_{D}^{w} {, }m_{D}^{w} {, }u_{D}^{w} } \right)} \right| \le \left( {k^{*} {, }k^{*} {, }k^{*} } \right){;}} \\ {\frac{{l_{S}^{w} + 4 \cdot m_{S}^{w} + u_{S}^{w} }}{6} + \frac{{l_{O}^{w} + 4 \cdot m_{O}^{w} + u_{O}^{w} }}{6} + \frac{{l_{D}^{w} + 4 \cdot m_{D}^{w} + u_{D}^{w} }}{6} + \frac{{l_{E}^{w} + 4 \cdot m_{E}^{w} + u_{E}^{w} }}{6} = 1{;}} \\ {l_{S}^{w} \le m_{S}^{w} \le u_{S}^{w} {; }l_{O}^{w} \le m_{O}^{w} \le u_{O}^{w} {; }l_{D}^{w} \le m_{D}^{w} \le u_{D}^{w} {; }l_{E}^{w} \le m_{E}^{w} \le u_{E}^{w} {;}} \\ {l_{S}^{w} {, }l_{O}^{w} {, }l_{D}^{w} {, }l_{E}^{w} \ge 0.} \\ \end{array} } \right. $$
(13)

Equation (13) is further expanded to get Eq. (14).

Min \(k^{*}\).

$$ {\text{s.t.}} \left\{ {\begin{array}{*{20}c} {\left| {l_{S}^{w} - {3}{\text{.35}} \cdot u_{O}^{w} } \right| \le k^{*} {; }\left| {m_{S}^{w} - {4}{\text{.18}} \cdot m_{O}^{w} } \right| \le k^{*} {; }\left| {u_{S}^{w} - {5}{\text{.02}} \cdot l_{O}^{w} } \right| \le k^{*} {;}} \\ {\left| {l_{S}^{w} - {7}{\text{.59}} \cdot u_{D}^{w} } \right| \le k^{*} {; }\left| {m_{S}^{w} - {8}{\text{.54}} \cdot m_{D}^{w} } \right| \le k^{*} {; }\left| {u_{S}^{w} - {8}{\text{.54}} \cdot l_{D}^{w} } \right| \le k^{*} {;}} \\ {\left| {l_{S}^{w} - {4}{\text{.24}} \cdot u_{E}^{w} } \right| \le k^{*} {; }\left| {m_{S}^{w} - {4}{\text{.95}} \cdot m_{E}^{w} } \right| \le k^{*} {; }\left| {u_{S}^{w} - {5}{\text{.66}} \cdot l_{E}^{w} } \right| \le k^{*} {;}} \\ {\left| {l_{O}^{w} - {5}{\text{.02}} \cdot u_{D}^{w} } \right| \le k^{*} {; }\left| {m_{O}^{w} - {5}{\text{.86}} \cdot m_{D}^{w} } \right| \le k^{*} {; }\left| {u_{O}^{w} - {6}{\text{.69}} \cdot l_{D}^{w} } \right| \le k^{*} {;}} \\ {\left| {l_{E}^{w} - {4}{\text{.24}} \cdot u_{D}^{w} } \right| \le k^{*} {; }\left| {m_{E}^{w} - {4}{\text{.95}} \cdot m_{D}^{w} } \right| \le k^{*} {; }\left| {u_{E}^{w} - {5}{\text{.66}} \cdot l_{D}^{w} } \right| \le k^{*} {;}} \\ {\frac{{l_{S}^{w} + 4 \cdot m_{S}^{w} + u_{S}^{w} }}{6} + \frac{{l_{O}^{w} + 4 \cdot m_{O}^{w} + u_{O}^{w} }}{6} + \frac{{l_{D}^{w} + 4 \cdot m_{D}^{w} + u_{D}^{w} }}{6} + \frac{{l_{E}^{w} + 4 \cdot m_{E}^{w} + u_{E}^{w} }}{6} = 1} \\ {l_{S}^{w} \le m_{S}^{w} \le u_{S}^{w} {; }l_{O}^{w} \le m_{O}^{w} \le u_{O}^{w} {; }l_{D}^{w} \le m_{D}^{w} \le u_{D}^{w} {; }l_{E}^{w} \le m_{E}^{w} \le u_{E}^{w} {;}} \\ {l_{S}^{w} {, }l_{O}^{w} {, }l_{D}^{w} {, }l_{E}^{w} \ge 0.} \\ \end{array} } \right.. $$
(14)

The weight value of the risk factor from Expert 1 can be obtained through the single-objective solution, which is presented as follows:

\(\left( {l_{S}^{w} {, }m_{S}^{w} {, }u_{S}^{w} } \right) = \left( {0.618{, }0.658{, }0.692} \right)\); \(\left( {l_{O}^{w} {, }m_{O}^{w} {, }u_{O}^{w} } \right) = \left( {0.084{, }0.093{, }0.104} \right)\); \(\left( {l_{D}^{w} {, }m_{D}^{w} {, }u_{D}^{w} } \right) = \left( {0.056{, }0.062{, }0.071} \right)\); \(\left( {l_{E}^{w} {, }m_{E}^{w} {, }u_{E}^{w} } \right) = \left( {0.170{, }0.188{, }0.205} \right)\).

The group optimal weights integrated from the 10 experts’ responses are listed in Table 7. Obviously, S is the risk factor with the highest weight, (0.381, 0.419, 0.451). The median value of the fuzzy Z-number of the risk factor is shown in the schematic diagram of the relative importance of weights in Fig. 2. Next, the Z-BWM results are used as one of the input parameters for the Z-WASPAS to find the aggregated RPN for each failure mode.

Table 7 Weight results for Z-BWM
Fig. 2
figure 2

Schematic diagram of the relative importance of risk factors

Ranking failure modes through Z-WASPAS

Inter-city bus system operations are complex and involve many internal and external organizations. Prioritizing the failure modes can allow decision makers to more effectively concentrate on certain critical failure modes. WASPAS is one of the most effective ways to solve this type of problem because its analytical process is simple and fast. It meets the needs of decision makers and supports the formulation of strategies for improvement and prevention for key failure modes. The number of failure modes does not affect the efficiency and reliability of this method. In this study, 10 experts evaluated the risk degrees of 20 potential failure modes using semantic variables as shown in Table 3. The average assessment Z-matrix of the 10 experts is shown in Table 8.

Table 8 The initial assessment matrix

The differences in importance of the risk factors are considered based on the results of the weights calculated by Z-BWM and are included in the Z-WASPAS calculation. Table 9 shows the Z-WASPAS results. The top five failure modes in the ranking are FM12, FM3, FM17, FM2, and FM18. It is determined that failure mode FM12 has the highest integrated RPN, which is 0.5527, therefore, it should have the highest priority for prevention and correction.

Table 9 The ranking results from Z-WASPAS

Discussion

Management implications

Using the HFACS method, the potential failure modes can be easily and systematically identified. The method considers five factors and their interaction for more effective detection of any possible failures. The modified FMEA and MCDM methods are further applied to remedy the shortcomings of the HFACS. According to the empirical results of Z-BWM, the priority is SDEO with the risk factor of severity having the highest Z-number (0.381, 0.419, 0.451). For bus companies and government agencies, severity represents the consequences of human error as a threat to human life. This is the most important risk factor. The second place ranking is D. This risk factor can be divided into pre-detection and degree of discoverability when risk occurs and can be greatly reduced by regular testing and evaluation of drivers and vehicles before human error occurs. The third ranked factor is E, which refers to the economic cost that needs to be invested or the resources required to repair or restore functioning after failure has occurred. How to prevent or reduce the risk of high-cost human error is one of the key factors. Finally, O represents the occurrence of traffic accidents due to human behavior and faulty decision-making. The occurrence rate will be reduced if the first three risk factors are properly controlled.

The Z-WASPAS method is used to rank the failure modes. The results of the evaluation show the following risk priority index from high to low: FM12 ≻ FM3 ≻ FM17 ≻ FM2 ≻ FM18 ≻ FM14 ≻ FM19 ≻ FM6 ≻ FM4 ≻ FM20 ≻ FM1 ≻ FM15 ≻ FM16 ≻ FM13 ≻ FM8 ≻ FM5≻ FM9 ≻ FM7 ≻ FM10 ≻ FM11. The subsequent analysis is dedicated to those failure modes ranked from number 1 to 6, because these are the modes requiring the most urgent attention and the implementation of solutions by the management team.

The first ranked failure mode is personal readiness (FM12) with an RPN value of 0.5527, giving it the highest priority for improvement. The job of inter-city bus drivers entails long working hours and requires intense concentration while on duty. Serious accidents are more likely to happen if the driver fails to effectively manage off-duty time or maintain good physical condition. In addition, the driver needs to maintain good health. Human resource (FM3) is ranked in second place with an RPN value of 0.5005. The possible mistakes related to this include employing drivers with inappropriate personality traits, insufficient risk perception training, and work force shortages leading to excessive overtime shifts. It is recommended that managers should consider the personality traits of the drivers and strengthen training systems for worker recruitment. Another recommendation is to change the salary structure so that the driver is not driving in poor mental condition and is regularly paid for overtime. Driver’s decision-making error (FM17) and administrative negligence (FM2) are ranked in third and fourth place. Driver decision-making errors include inappropriate response to emergencies and poor judgment of the direction and status of oncoming vehicles. Improper operation or behavior are the main causes for the above failures. Managers should provide good training and educate drivers about the importance of following the standard operating procedures. A monitoring system or warnings about excessive speed might be possible enforcement solutions. Administrative negligence can include governmental negligence in implementing the relevant traffic rules and guidelines for supervision and management responsibilities, and not ensuring that transportation companies conduct safety training and are managed well. Bus companies tend to disregard regular employee training to save on costs and capital investment for vehicles maintenance and replacement. It is recommended that the relevant government departments should strengthen safety regulations and subsidy policies, by requiring bus companies to build safety management systems and/or offering subsidies for maintenance of equipment and vehicle replacement. In addition, government departments could request bus companies to hold training seminars to strengthen driver safety knowledge and awareness. “Driver operating error (FM18)” and “poor mental state (FM14)” ranked fifth and sixth. Driver mistakes can include overly rapid acceleration or deceleration, improper lane changes, failure to maintain safe driving distances, etc. Poor mental state can include personal attitudes, not caring about safety, driver fatigue, mental fatigue, and so on. Bad moods also increase the chance of accidents. Therefore, these human error factors are also the key failure modes that decision makers need to pay attention to and improve.

Model comparison

In this section, a variety of methods are compared and discussed starting with three weighting methods, which include AHP, the Full Consistency Method (FUCOM) [37], and BWM. These three methods are all based on the use of pairwise comparisons to derive the criteria weights. The AHP requires the largest number of pairwise comparisons, which is n (n − 1)/2 followed by FUCOM and BWM with n − 1 and 2n − 3, respectively. The higher the number of pairwise comparisons, the less consistent results may result. Obviously, FUCOM’s consistency performance is the best. However, although BWM offers slightly weaker consistency results, its input data are comprised of two conceptually different vectors (BO and OW vectors), which can more accurately reflect the expert’s evaluation. The three methods were tested and the weights of the risk factors found to be only slightly different. The importance ranking remained the same SDEO for all three methods, and there was no effect on the ranking of the failure modes. The results indicated that all three methods are applicable but the BWM method was selected for case analysis in this study because it can reflect the experts' assessment information from two different perspectives.

A comparison of six alternative ranking methods was performed to illustrate the practicality of the proposed integration model for the ranking of failure modes. The risk factor weight settings for these six models are all based on those in Table 7, to ensure consistency. Model 1, Z-WASPAS, is the method proposed in this study, model 2 is the original FMEA calculation method, where the values of S, O, D, and E are multiplied. In addition, four common MCDM methods are included in the model comparison, Z-number-based Additive Ratio ASsessment (Z-ARAS), Z-TOPSIS, Z-VIKOR, and Z-number-based grey relational analysis (Z-GRA). Z-numbers are introduced into all six models to reflect the uncertainty and confidence level of the experts. Table 10 shows the failure mode ranking results for the six models. Obviously, F12 is ranked the highest risk failure mode by all models. However, the ranking of the other failure modes is inconsistent, as shown in Fig. 3. A further step is carried out using the Spearman correlation coefficient to identify the correlation in the ranking results obtained with the six methods. Z-WASPAS, the original FMEA, Z-ARAS, and Z-GRA have higher correlation coefficients (greater than 0.9), as shown in Table 11. On the other hand, Z-TOPSIS and Z-VIKOR are based on the concept of distance, thus, their results are different from the Z-WASPAS results. Although we cannot prove that the proposed Z-WASPAS method offers excellent results, this method is intuitive and simple to implement, which helps FMEA experts to analyze and find the root causes in real circumstances. In addition, one of the calculation parameters of Z-WASPAS is based on the original concept of FMEA multiplication which also corresponds to the FMEA calculation process.

Table 10 Ranking results of failure modes for the six models
Fig. 3
figure 3

Ranking results of the six models

Table 11 Correlation coefficients between models

The risk factor E is included in our proposed model, which is significantly different from the conventional FMEA (only three risk factors S, O, and D) calculation. Figure 4 shows the difference between E and without E. For example, FM3 is ranked 2nd and 6th, respectively, which will affect the improving strategies of decision-makers. In addition to FM3, there are also many failure mode sequencing differences. In practice, risk costs are generally divided into three categories, that is, tangible costs, intangible costs, and prevention or control costs of risk losses. In the transportation system, the occurrence of risk accidents will cause the loss of the expected economic benefits of the enterprise. This paper proposes HFACS-FMEA for the first time to summarize and analyze the potential failure modes and risks of the inter-city bus accidents. Due to the rising awareness of risk costs management, this study incorporates this concept to make the analysis results closer to practice and more reliable.

Fig. 4
figure 4

The difference between E and without E

Sensitivity analysis

The most important risk factor is S with a weight value of (0.381, 0.419 and 0.451). Obviously, it has a significant influence in the overall evaluation system. In this section, we explore whether changes to the weight of risk factor S will affect the overall ranking results of the evaluation system. Sensitivity analysis is conducted to observe whether there will be any significant change in the priority ranking of the failure modes analysis. We changed the value of the weight of S from 0.1 to 0.9 and adjusted the weights of other risk factors proportionally. Since the Z-number is comprised of a set of three values, the upper boundary value was used as the basis for adjustment, as shown in Table 12. According to Fig. 5, changing the weights led to a change in the ranking of all failure modes except for FM12, which remained in first place. This indicates the extreme sensitivity of the other failure modes. This phenomenon indicates that it is necessary to determine the weights of the risk factors, for example by using the Z-BWM evaluation method described in this study is an essential step to estimate the weights of the risk factors while conducting a risk assessment.

Table 12 Nine weight combinations
Fig. 5
figure 5

Ranking results from the nine sensitivity analyses

Conclusions and future work

This study proposes a new hybrid model for risk assessment regarding inter-city bus transportation. The model improves several shortcomings of the HFACS. First the HFACS framework is used to establish possible failure modes. Then the FMEA is combined with MCDM for risk assessment. Z-numbers are used in the analysis in combination with BWM to determine the importance of the risk factors, and finally the Z-WASPAS method is applied to rank the failure modes. The conclusions are summarized below:

  1. 1.

    The results show that severity has the highest weight value, thus, management needs to assign it the first priority for problem solving.

  2. 2.

    The results show that FM12 (personal readiness) is the number one ranked failure mode. Management should prioritize it for improvement.

  3. 3.

    Z-numbers are utilized to deal with the vagueness and uncertainty of expert judgements, allowing consideration of the confidence level and differences in reliability. In addition, to avoid missing information by using linguistic variables while conducting expert survey, the weights of the risk factors and values of failure modes are all fully retained through fuzzy interval values during the calculation process, which makes the evaluation results more reliable.

  4. 4.

    The risk factor of expected cost is added to the risk assessment model and failure modes extended to include external factors in response to real circumstances. Management can adopt more comprehensive strategies for improvement in the safety of inter-city bus service.

This study proposes a hybrid model to effectively identify potential risks before system failure occurs with the aim of eliminating the risk of failure by preventing and improving the risk factors. However, there are some restrictions and suggestions for decision-makers and further research.

  1. 1.

    The assessment of the risk factors by the 10 experts was integrated by an arithmetic mean method. However, due to differences in the area of expertise and degree of experience of each expert, we can also consider giving different weight values or using the multiple attribute group decision-making (MAGDM) method to consider the different weights.

  2. 2.

    The proposed evaluation model can be applied to other types of public transportation, such as railways, ferries, airplanes, subway systems, and the like. In addition, it could be adapted for other industries, for example, for product quality or supplier selection problems in manufacturing.

  3. 3.

    In the past, there have been very few studies in the transportation field that have combined MCDM and FMEA to produce hybrid model. This study focuses on the application of Z-numbers combined with BWM and Z-WASPAS calculations. This hybrid model can be extended and compared with other MCDM methods, such as TOPSIS, VIKOR, COPRAS, and GRA in the future. In addition, Z-numbers can be compared with other fuzzy theories, such as Pythagorean fuzzy sets, spherical fuzzy sets, Fermatean fuzzy sets, etc., to select the most suitable fuzzy theory for each application.

  4. 4.

    This research focuses on ranking the risk factors but assumes that they are independent of each other. It is suggested that DEMATEL can be introduced to consider the interdependence of the risk factors, to determine the degree of mutual influence as laid out in a causal relationship diagram.