1 Introduction

The fuzzy multi-criteria decision-making (MCDM) methods have been widely used on evaluation and selection problems with uncertainty (Mardani et al., 2015) such as supplier selection (Chai et al., 2013), site selection (Yap et al., 2019; Deveci et al., 2021), service selection (Masdari & Khezri, 2021), healthcare technology selection (Mardani et al., 2019; Deveci, 2023), circular economy assessment (dos Santos Gonçalves & Campos, 2022; Bai et al., 2022; Lee et al., 2023), energy policymaking (Kaya et al., 2019), urban transport policymaking (Deveci et al., 2023a, b, c; Jeevaraj et al., 2023; Mishra et al., 2023), construction management (Chen & Pan, 2021). Notably, the pursuit of reliability in assessment and selection methods increases particularly for high-investment and long-life cycle projects (Önüt et al., 2009; He et al., 2016; Escrig-Olmedo et al., 2017; Wu et al., 2018; Kaya et al., 2019; Pouyakian et al., 2022). The nature of such projects motivates managers to commit to the choices of suppliers, sites, technologies, or strategies for the long haul. Switching to alternative options halfway could incur significant costs or environmental harm. Therefore, the choices made initially should be forward-looking and reliable enough to accommodate the projects with costly and long-term features. For example, a manufacturing project of lightweight vehicles powered by hydrogen fuel cells with hydrogen storage challenges has the feature of high-investment and long-life cycle. Such a hydrogen vehicle can cost around USD 80k. Once the hydrogen storage method is chosen and determined, this technology could be applied to several models, and probably would not be replaced easily during the 20-year lifespan of vehicles. Furthermore, the life of vehicles can be prolonged by applying the circular economy principles that are being widely popularised (Aguilar Esteva et al., 2021). The hydrogen storage tanks can be refurbished to extend their use and the components can be reused within the same type of vehicles.

However, there are no sufficient studies that improved the conventional fuzzy MCDM methods, particularly in response to the need for long-lasting choices. The two essential factors that ensure a reliable assessment, i.e., the satisfaction with the choice and the risk of failure, have yet to be considered at the same time in MCDM. Either the satisfaction (Ghorbani et al., 2013; Avikal et al., 2014) or the risk analysis (Li & Zeng, 2016; Liu et al., 2019) was individually applied to strengthen an assessment in fuzzy MCDM. This gap in research motivates us to originate and develop a new fuzzy MCDM method in this paper to enhance the reliability of assessments to guarantee the results are applicable and effective over a long period.

We innovatively introduce a ‘time-based’ satisfaction weight for the decision criteria, which incorporates decision-makers’ satisfaction with criteria over different project time horizons—short, medium, and long terms. As per our knowledge, our approach is the first one that differentiates and quantifies decision-makers’ perceptions across various stages of a project. It is achieved by involving a time dimension in the commonly used Kano model (Kano, 1984; Berger et al., 1993), which effectively attaches enough importance to the satisfaction level that is likely to vary throughout the project lifecycle. The integration of this time-based satisfaction weight can increase the reliability of assessments in high-investment and long-life cycle projects. Simultaneously, a risk discounted weight is used to raise the reliability of the assessment by tuning the performance of decision criteria based on the failure mode and effects analysis (FMEA) method (Li & Zeng, 2016). This addition is critical given the intricate risk profiles are typically associated with high-investment and long-life cycle projects. There is a high possibility that the risks will affect the actual performance of criteria and cause them to deviate significantly from the anticipated outcomes. Proactively incorporating the risk factor into assessments can diminish the possibility of such deviations.

Since it is our initiation to include the time-based satisfaction and risk factors in the assessment method to strengthen the reliability, there are no existing MCDM methods that can be directly referred to technically develop this new conception. Thus, in our assessment method, as we will utilise the Kano model for the time-based satisfaction factor, and the FMEA for risk factor as the underpinning, we integrate the three approaches for the first time, i.e., Kano model, FMEA, and fuzzy MCDM consisting of fuzzy analytic hierarchy process (AHP) and fuzzy technique for order preference by similarity to the ideal situation (TOPSIS). Inventively, three phases are specifically designed for this novel MCDM method, where Phase I applies the fuzzy AHP and Kano model for weight determination of the decision criteria; Phase II is based on the FMEA for risk assessment on the criteria for alternatives; and finally, Phase III uses the fuzzy TOPSIS to determine scores for alternatives. An elaborative framework of the proposed 3-phase MCDM method is illustrated in Fig. 1.

Taking a step further in examining the practical applications of the MCDM methods, only a handful of literature such as (Hamdan & Cheaitou, 2017; Grazioso et al., 2017) created the software implementations with practical, hands-on software with GUIs. Therefore, we are inspired to transfer the proposed 3-phase MCDM method to a free, interactive web application with graphical user interface (GUI), especially for evaluating different hydrogen supply chain stakeholders. It is built in the HyChain (accessible via https://hychain.co.uk/), which is a smart hub founded by us to provide hydrogen supply chain solutions and knowledge to diverse audiences, including industry professionals, academics, and the general public. Creative efforts are invested into the GUI design, ensuring an intuitive and engaging user experience. This first-of-its-kind web application extends the reach of our 3-phase MCDM method beyond the academic domain. It enables any users to effortlessly apply it to real-world assessments of hydrogen supply chains, or more broadly, to any high-investment and long-life cycle projects. The tool can provide the facility to replicate the example case detailed in the paper to select the best hydrogen storage method for lightweight vehicles.

The contributions of this paper are summarised as follows:

  1. 1.

    The implementation of reliable assessment through a novel MCDM method that considers the time-based satisfaction and risk factors simultaneously.

  2. 2.

    The technical development of a 3-phase MCDM method combining fuzzy MCDM with Kano model and FMEA.

  3. 3.

    The web application development of the proposed MCDM method that pertains hydrogen supply chain stakeholder assessment.

The remainder of the paper is organised as follows. Section 2 reviews the literature on reliability-oriented methods in fuzzy MCDM. Section 3 develops a novel 3-phase MCDM method for reliable assessment, which combines the Kano model, FMEA, fuzzy AHP, and fuzzy TOPSIS. In Sect. 4, an example case of hydrogen storage method selection for lightweight vehicles is applied to demonstrate step by step the proposed 3-phase MCDM method. Section 5 introduces the MCDM web application design and interface. The potential policy implications on business management are also discussed. Section 6 in the end summarises this research and provides insights on future directions.

Fig. 1
figure 1

Framework of the proposed 3-phase MCDM method for reliable assessment

2 Literature review

The fuzzy MCDM, based on the fuzzy set theory (Zadeh, 1965), is a systematic decision-making tool for evaluating the alternatives and choosing the best option in uncertain and ambiguous environments where linguistic variables need to be well defined (Kahraman et al., 2015; Mardani et al., 2015). In this section, we first review the literature that used reliability-oriented methods in fuzzy MCDM as summarised in Table 1, referring to the studies for improving the reliability of assessment and selection problems. The purposes of employing these methods are interpreted in Column 4 of the table. Regardless of the different application contexts, the so-called reliability of assessment is mainly manifested through considerations of stakeholder voices or satisfaction, risk, quality, and psychological behaviours. Based on the practical needs of targeting a reliable assessment method for high-investment and long-life cycle projects, we identify that satisfaction and risk are two imperative, effective factors to be simultaneously considered for this type of project. However, these two factors have not yet been taken into account at the same time. This gap gives us space to propose a new method that not only incorporates both the factors but furthermore reinforces the satisfaction factor to feature the time effects (see the last row in Table 1). Appropriate technical methods can then be correspondingly determined, i.e., the Kano model for satisfaction measurement and FMEA for risk analysis. Both of them would be integrated with two typical fuzzy MCDM methods, i.e., fuzzy AHP (Liu et al., 2020; Pereira & Bamel, 2023) and fuzzy TOPSIS (Salih et al., 2019) (like most of the literature in Table 1 have done so and validated the suitability of adopting fuzzy AHP and fuzzy TOPSIS as the representatives of fuzzy MCDM) to generate a new MCDM method in this paper.

Table 1 Literature of reliability-oriented methods used in fuzzy MCDM

The Kano model is used in the proposed MCDM method for satisfaction measurement. It is a tool initially proposed by Kano (1984) in 1984 to classify the product requirements into different dimensional quality attributes, i.e., must-be, one-dimensional, and attractive attributes. The quality attributes depict different linear or nonlinear relationships between product performance and customer satisfaction. The original Kano model has been extended to different variations over the years (Shahin et al., 2013). The main advantage of using the Kano model in our MCDM as the basis to express the time-based satisfaction of decision-makers with criteria is that through a better understanding of the characteristics of criteria, we can recognise the criteria with higher weights in satisfaction. They will drive greater influence on the performance evaluation of alternatives. By this means, guided by the differentiated satisfaction levels in short, medium, and long terms, the assessment results would be reliable, even for a long-term period.

The FMEA is also employed in the proposed MCDM method for risk analysis. FMEA was created in 1949 for the US Department of Defence (Stamatis, 2003), and has been widely combined with different sorts of MCDM methods and applied in various industries (Liu et al., 2019; Huang et al., 2020). It is a powerful risk and reliability management approach to proactively identifying failure modes and their effects, causes, and control mechanisms, and ranking the failure modes in these three aspects to obtain the severity, likelihood and control scores. Traditionally, the numeric scores can be multiplied together to form a risk priority number (RPN) for each failure mode. However, the traditional RPN was criticised due to its shortcomings as summarised in Liu et al. (2013). Therefore, we use an improved format of RPN as suggested in Li and Zeng (2016) for our MCDM to examine the potential risks of criteria to make the assessment more reasonable and reliable.

Since it is the first time that the Kano model and FMEA will be integrated into fuzzy MCDM (specifically fuzzy AHP and fuzzy TOPSIS) in the proposed method, we refer to the existing four studies in the field that incorporated Kano and FMEA from the technical point of view even though they were not used in fuzzy MCDM.

An approach to enhance FMEA capabilities was proposed in Shahin (2004) through its integration with Kano model. The traditional ways of deciding the severity score for an effect of failure mode, as well as defining the RPN in FMEA were modified by classifying severity from customers’ perceptions, rather than managers’ eyes. The Kano model was used to picture the relationship between the frequency and severity of the effect of failure mode. A new index called correction ratio was proposed as a failure prioritisation to replace RPN. This approach enables managers to prevent failures at the early stage of design and can be used before and after the production stages.

With the same goal in mind of developing a customer-oriented FMEA, another new FMEA was designed in Koomsap and Charoenchokdilok (2018) to improve the earlier work (Shahin, 2004) by getting a better reflection of customer voices. Kano model was applied to identify how customers perceive failure mode effects. Customer dissatisfaction was integrated into this approach, where severity and likelihood were viewed as the factors influencing customer dissatisfaction. Similarly, a new PRN was developed. In contrast to the approach in Shahin (2004) and traditional FMEA, this new customer-oriented FMEA proved that how customers perceive the failure mode effects has the greatest impact on prioritisation.

A more precise method for determining categories of requirements was designed in Madzík and Kormanec (2020) by using requirement curves. It mainly addressed three improvements compared with the previous Kano model and FMEA integration work (Shahin, 2004)—requirement categorisation to eliminate imprecision, RPN calculation including the effect of the characteristics of requirements, and prioritisation of preventive measures to reduce the risk of dissatisfaction.

Apart from the three studies above which do not combine with MCDM methods, a recent research integrating Kano model and FMEA into a non-fuzzy MCDM called VIKOR (an acronym in Serbian of multicriteria optimization and compromise solution) was presented in Hettiarachchi et al. (2022). VIKOR was integrated with a power law-based customer-oriented FMEA to achieve more logical and reliable prioritisation in comparison with other customer-oriented FMEAs.

There are only limited studies combining Kano model with FMEA, while no one has applied both of them to fuzzy MCDM. Hence, this technical gap motivates us to develop a novel MCDM method containing three phases for blending fuzzy MCDM with Kano model and FMEA as shown in Fig. 1.

3 3-phase MCDM method for reliable assessments

The proposed MCDM method is developed to evaluate and select alternatives reliably for high-investment and long-life cycle projects in three phases, namely (1) weight determination of the decision criteria, (2) risk assessment on the criteria for alternatives, and (3) score assessment for alternatives. The framework of this 3-phase structure is given in Fig. 1. In the following subsections, we will develop the 3-phase MCDM method step by step.

3.1 Phase I: Weight determination of the decision criteria

Phase I (6 steps) is for weight determination of the decision criteria. A team of \(k_1\) decision-makersFootnote 1 is established for the assessment of alternatives in a high-investment and long-life cycle project.

3.1.1 Building decision hierarchy structure

The decision-maker panel first determines a comprehensive decision hierarchy structure with a goal to assess \(k_2\) alternatives based on \(k_3\) main criteria and \(k_4\) sub-criteria. Each sub-criterion is subordinate to a main criterion.

3.1.2 Identifying weight of quality attributes for criteria by Kano model

The original Kano model (Kano, 1984) used functional and dysfunctional Kano Questionnaire (KQ) and a 5-by-5 evaluation table to classify the quality attributes of a product (Berger et al., 1993). This paper proposes a Staged Kano Questionnaire (SKQ) to determine the quality attributes of the decision criteria. Compared with the KQ, the SKQ takes the time factor into consideration as the criteria may perform differently in different stages of the project life cycle, which is a key improvement in the evaluation of alternatives in high-investment and long-life cycle projects. The SKQ contains three pairs of functional (f.1–f.3) and dysfunctional questions (d.1–d.3) for the short, medium, and long term of each criterion x as follows:

  1. (f.1)

    If the criterion x works well in the short term, how do you feel?

  2. (d.1)

    If the criterion x does not work well in the short term, how do you feel?

  3. (f.2)

    If the criterion x works well in the medium term, how do you feel?

  4. (d.2)

    If the criterion x does not work well in the medium term, how do you feel?

  5. (f.3)

    If the criterion x works well in the long term, how do you feel?

  6. (d.3)

    If the criterion x does not work well in the long term, how do you feel?

The 5-by-5 evaluation table known as Kano evaluation Table (in Table 2), categorises the quality attributes into the following six types based on five-dimensional answers,Footnote 2 (Berger et al., 1993):

  1. 1.

    Attractive attribute (A): refers to sufficient quality attributes leading to customer satisfaction and excitement, and absence does not lead to customer dissatisfaction.

  2. 2.

    Must-be attribute (M): refers to quality attributes that are not mentioned unless not included, sufficiency will not result in more satisfaction but insufficiency will lead to strong dissatisfaction.

  3. 3.

    One-dimensional attribute (O): refers to sufficient quality attributes leading to customer satisfaction, and insufficiency leading to customer dissatisfaction.

  4. 4.

    Questionable attribute (Q): refers to quality attributes that the customer probably does not understand.

  5. 5.

    Indifferent attribute (I): refers to quality attributes that sufficiency or insufficiency will not affect customer satisfaction.

  6. 6.

    Reverse attribute (R): refers to sufficient quality attributes leading to customer dissatisfaction or vice-versa.

Table 2 Kano evaluation table (Berger et al., 1993)

Each decision-maker \(DM_p\) needs to answer the SKQ, and to weight the importance of short (\(ws_p\)), medium (\(wm_p\)) and long (\(wl_p\)) term effects with regard to the criteria performance, where \(ws_p+wm_p+wl_p=1\), for all \(p=1,~\ldots ,~k_1\). In terms of the way to answer the SKQ, each of the six questions should come back with a degree of agreementFootnote 3 on the five-dimensional answers from Kano evaluation Table, denoted as \(mP^{q}_r\) for functional questions f.1-f.3 or \(mN^{q}_r\) for dysfunctional questions d.1-d.3, where r refers to the corresponding five-dimensional answers, and q refers to the short, medium and long terms. An example of a decision-maker’s answer to the SKQ for one criterion is given in Table 3. For instance, \(mN_2^{short}=0\) means that when being asked the question d.1, the decision-maker feels that it ‘not at all’ must be that way, while \(mP_1^{mid}=4\) means that the decision-maker ‘strongly’ like it that way for the question f.2.

Table 3 An example of a decision-maker’s answer to the SKQ for one criterion

The degree of agreement with the answers for each time stage can be normalised by \({mP^{q}}={mP_r^{q}}/{\sum _{r=1}^{5}mP_r^{q}}\) and \({mN^{q}}={mN_r^{q}}/{\sum _{r=1}^{5}mN_r^{q}}\). For instance, the two normalised degree vectors for the long term can be calculated from Table 3 as,

$$\begin{aligned} \begin{array}{l} \textbf{mP}^{long}=\left( 0.5, 0.25,0.125,0.125,0\right) ,\\ \textbf{mN}^{long}=\left( 0,0,0.25,0.25,0.5\right) . \end{array} \end{aligned}$$

By comparing the 5\(\times \)5-matrix \(\textbf{S}^{q} = (\textbf{mP}^{q})^{T}\times \textbf{mN}^{q}\) with the Kano evaluation Table in Table 2, we can add up the degree values in \(\textbf{S}^{q}\) with the same quality attribute to generate a quality attribute weight vector \(\textbf{T}^{q}\) for each criterion. To continue the example above,

$$\begin{aligned} \begin{array}{ll} \textbf{S}^{long} &{}= (\textbf{mP}^{long})^{T}\times \textbf{mN}^{long} \\ &{}=\left( \begin{matrix}0&{}\quad 0&{}\quad 0.125&{}\quad 0.125&{}\quad 0.25\\ 0&{}\quad 0&{}\quad 0.063&{}\quad 0.063&{}\quad 0.125\\ 0&{}\quad 0&{}\quad 0.031&{}\quad 0.031&{}\quad 0.063\\ 0&{}\quad 0&{}\quad 0.031&{}\quad 0.031&{}\quad 0.063\\ 0&{}\quad 0&{}\quad 0&{}\quad 0&{}\quad 0\end{matrix}\right) . \end{array} \end{aligned}$$

As there are three A—attractive attribute in Kano evaluation Table, we add up the values in the corresponding rows and columns in \(\textbf{S}^{long}\), i.e., \(s_{1,2}^{long}=0\), \(s_{1,3}^{long}=0.125\) and \(s_{1,4}^{long}=0.125\), and get the sum as \(t^{long, A}=0.25\). Thereby, the weight of quality attributes for the long term is as,

Combining the three-time stages, the weight of quality attributes \(\textbf{ST}_p\) that the decision-maker \(DM_p\) evaluates on one criterion can be obtained as,

$$\begin{aligned} \begin{array}{l} \textbf{ST}_p=ws_p \cdot \textbf{T}^{short}_p+wm_p \cdot \textbf{T}_p^{mid}+wl_p \cdot \textbf{T}_p^{long}. \end{array} \end{aligned}$$
(1)

We then aggregate all \(k_1\) decision-makers’ \(\textbf{ST}_p\) to calculate the weighted average of the weight of quality attributes on one criterion, i.e., \(\textbf{AT}=(at_A, at_M, at_O, at_Q, at_I, at_R)\), based on the importance weight of decision-makers \(DM_p^{weight}\) as,

$$\begin{aligned} \textbf{AT}=\frac{\sum _{p}^{k_1}DM_p^{weight} \cdot \textbf{ST}_p}{\sum _{p}^{k_1}DM_p^{weight} }. \end{aligned}$$
(2)

3.1.3 Determining importance weight of quality attributes by fuzzy AHP

The fuzzy AHP method can capture imprecise human qualitative judgements by using linguistic variables. The decision-makers define a 5-level linguistic scale based on triangular fuzzy numbers as the relative importance scale (see Table 4). It is used in a fuzzy pairwise comparison matrix \(\widetilde{\textbf{AP}} =(l_{ij},m_{ij},u_{ij})_{n\times n}\) for scoring the relative importance of the six quality attributes, i.e., A, M, O, Q, I, and R (here \(n=6\)). Before accepting the pairwise comparison matrix \(\widetilde{\textbf{AP}}\), it is required to pass the consistency check. First, the fuzzy matrix \(\widetilde{\textbf{AP}}\) should be converted to a crisp one by \(\textbf{AP}^{crisp} =(1/6 \cdot l_{ij}+2/3\cdot m_{ij}+1/6 \cdot u_{ij})_{n\times n}\), (Yu & Hua, 2003).

Then, the consistency ratio CRFootnote 4 needs to be verified for \(\textbf{AP}^{crisp}\).

Table 4 Relative importance scale for pairwise comparison matrix
Table 5 Random consistency index (RI) (Avikal et al., 2014)

Next, the fuzzy synthetic extent \(\widetilde{{AS}}_i\)Footnote 5, for A, M, O, Q, I, R where \(i=1,~\ldots ,~6\), can be calculated from the pairwise comparison matrix \(\widetilde{\textbf{AP}}\) as,

$$\begin{aligned} \begin{array}{l} \displaystyle \widetilde{{AS}}_i = \left( \sum \nolimits _{j=1}^{n}l_{ij},\sum \nolimits _{j=1}^{n}m_{ij},\sum \nolimits _{j=1}^{n}u_{ij}\right) \bigotimes \\ \quad \displaystyle \left( \frac{1}{\sum _{i=1}^{n}\sum _{j=1}^{n}u_{ij}},\frac{1}{\sum _{i=1}^{n}\sum _{j=1}^{n}m_{ij}},\frac{1}{\sum _{i=1}^{n}\sum _{j=1}^{n}l_{ij}}\right) , \end{array} \end{aligned}$$
(3)

and the degree of possibility of one fuzzy synthetic extent larger than the other, e.g., \(\widetilde{{AS}}_1 = (l_1, m_1, u_1)\ge \widetilde{{AS}}_2 = (l_2, m_2, u_2)\) can be computed as,

$$\begin{aligned} \begin{array}{l} \displaystyle \textrm{Pos}\, (\widetilde{{AS}}_1 \ge \widetilde{{AS}}_2) = \displaystyle {\left\{ \begin{array}{ll} 1, \qquad \qquad \qquad \qquad \quad \,\,\,\,\,\,\, \text { if } m_1\ge m_2\\ \displaystyle \frac{u_1-l_2}{(u_1-m_1)+(m_2-l_2)}, \text { if } u_1\ge l_2\\ 0, \qquad \qquad \qquad \qquad \quad \,\,\,\,\,\,\, \text { otherwise. } \end{array}\right. } \\ \end{array} \end{aligned}$$
(4)

On this base, the degree of possibility of \(\widetilde{{AS}}_i\) larger than all the other fuzzy synthetic extents is given by,

$$\begin{aligned} \begin{array}{l} \displaystyle \textrm{Pos}\,(\widetilde{{AS}}_i \!\ge \! \widetilde{{AS}}_j \vert j \!= \!1,\dots ,n;j \!\ne \! i)\!=\!\min _{j\in \{1,\dots ,n,j\ne i\}}\!\textrm{Pos}\,(\widetilde{{AS}}_i \!\ge \! \widetilde{{AS}}_j),\\ \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \quad i=1,\dots ,n. \end{array} \end{aligned}$$
(5)

Thus, the importance weight vector \(\textbf{AW}=(aw_1, \dots , aw_n)^T\) can be calculated, where each

$$\begin{aligned} \displaystyle aw_i=\frac{\textrm{Pos}\,(\widetilde{{AS}}_i\ge \widetilde{{AS}}_j\vert j=1,\dots ,n;j\ne i)}{\sum _{k=1}^n\textrm{Pos}\,(\widetilde{{AS}}_k\ge \widetilde{{AS}}_j \vert j=1,\dots ,n;j\ne k)}. \end{aligned}$$
(6)

3.1.4 Calculating satisfaction weight of criteria

Integrating the weight of quality attributes for all \(k_4\) criteria \(\textbf{AT}\) (a \({k_4\times 6}\) matrix) with the importance weight of quality attributes themselves \(\textbf{AW}\), we can obtain a satisfaction weight of all criteria \(\textbf{SW}=(sw_1,\dots ,sw_{k_4})^T\), where

$$\begin{aligned} \displaystyle sw_v=\sum \nolimits _{i=1}^{6} at_{vi} \cdot aw_i,\quad v=1,\dots , k_4. \end{aligned}$$
(7)

The satisfaction weight is newly proposed in our method. It measures how well the decision criteria can satisfy the decision-maker’s needs in short, medium and long terms, which is a critical factor to be taken into account for high-investment and long-life cycle projects. The satisfaction weight of criteria will be incorporated with the conventional importance weight to define a more comprehensive weight for decision criteria.

3.1.5 Determining importance weight of criteria by fuzzy AHP

By using the same fuzzy AHP approach to determining the importance weight of quality attributes in Sect. 3.1.3, the decision-makers can obtain the traditional importance weight of criteria. Since there are \(k_3\) main criteria containing \(k_4\) sub-criteria, one fuzzy pairwise comparison matrix for all the main criteria, and \(k_3\) number of comparison matrices for all the groups of sub-criteria should be developed. For each of these fuzzy pairwise comparison matrices, we can get an importance local weight vector \(\textbf{LW}\). Combining \(\textbf{LW}\) of the main and sub-criteria based on hierarchy structure, a normalised importance weight vector of all the criteria can be produced as \(\textbf{GW}=(gw_1, \dots , gw_{k_4})^T\).

3.1.6 Integrating satisfaction and importance weight for consolidated weight

Eventually, a consolidated weight \(\textbf{CW}\) can be established based on the outputs of the two steps above as,

$$\begin{aligned} \textbf{CW} = \textbf{SW}\cdot \textbf{GW}, \end{aligned}$$
(8)

and then normalised to \(\textbf{W}=(w_1, \dots , w_{k_4})^T\). The consolidated weight of criteria \(\textbf{W}\) jointly evaluates the satisfaction and importance factors of all decision criteria. It is the final outcome of Phase I of our 3-phase MCDM method.

3.2 Phase II: Risk assessment on the criteria for alternatives

Phase II (3 steps) is for risk assessment on the decision criteria for all the alternatives. The comprehensive analysis of risks reduces the gap with the expected performance of criteria, which can make the final assessment results more reliable. This is particularly crucial for the high-investment and long-life cycle projects. This risk assessment phase employs the FMEA method (Li & Zeng, 2016), which contains three steps as follows.

3.2.1 Designing FMEA evaluation schemes

For each of the \(k_4\) number of decision criteria, the decision-maker panel needs to define a specific FMEA evaluation scheme, i.e., depict a series of potential risk situations in the three risk dimensions (likelihood, severity and control), and map the risk situations to a 10-point scale (larger points indicate higher risks). For instance, a generic FMEA evaluation scheme is illustrated in Table 6 for one criterion (Li & Zeng, 2016). The \(k_4\) FMEA evaluation schemes will later be used to rank the failure modes of criteria for the alternatives.

Table 6 Generic FMEA evaluation scheme (Li & Zeng, 2016)

3.2.2 Defining failure modes and ranking risks of criteria for alternatives

The decision-maker panel defines one or more failure modes under each criterion for each alternative. More than one set of the effects with severity ranks, the causes with likelihood ranks, and the detection or control approaches with control ranks can be recognised for each failure mode. The designed FMEA evaluation schemes from the previous step are used to rate the severity, likelihood and control levels. Thus, all the failure modes of criteria, along with the ranks in three dimensions compose an FMEA document for each alternative.

A partial FMEA document, which is an example of two failure modes of one criterion ‘storage system cost’ for one alternative ‘hydrogen storage method A’ is illustrated in Table 7.

Table 7 Example failure modes of one criterion ‘storage system cost’ for one alternative ‘hydrogen storage method A’

3.2.3 Calculating risk discounted weight for criteria scores

Based on the severity (S), likelihood (L), and control (C) ranks in the FMEA documents, a risk discount can be generated for each criterion of every alternative (Li & Zeng, 2016). The risk discounts will be used to tune the initial assessment scores on the criteria of alternatives, which can add the risk impacts to the final scores.

In order to calculate the risk discounts, first, a risk number RN can be defined by combining the risk severity and likelihood in a failure mode as \(RN=S\times L\), where \(1\le RN\le 100\). Then, an original risk discount can be formulated as \(od=(RN-1)/99\), where \(0\le od\le 1\). The lower the risk RN, the smaller the risk discount od. The original risk discount od is adjusted via an exponent ep,

$$\begin{aligned} d=od^{ep}=\left( \frac{S\times L-1}{99}\right) ^{ep}, \end{aligned}$$
(9)

where \(ep=-0.1C+1.55\), as the risk control C stands for the capability to detect and reduce the risk. The two parameter values in ep are tailored specifically for this paper due to the 10-point scale used in the FMEA evaluation schemes. We can find that \(ep=1, d=od\) when \(C=5.5\), i.e., the median of C range. Finally, we calculate the mean \({\bar{d}}\) if there are more than one failure mode under each criterion. Taking the example in Table 7, the risk discount of the criterion ‘storage system cost’ \({\bar{d}}=\{[(2 \times 2-1)/99]^{-0.1 \times 2+1.55}+[(1 \times 1-1)/99]^{-0.1 \times 2+1.55}\}/2=0.0045\).

The risk discounts \({\bar{d}}\) will be used for tuning purposes in Phase III. Those criteria which will be heavily discounted (i.e., with large risk discounts) indicate high risks in performance attainment and vice versa. A risk discounted weight \(rw=1-{\bar{d}}\) should be multiplied on the initial assessment score of each criterion. For the example above, the risk discounted weight of the criterion ‘storage system cost’ \(rw=1-0.0045=0.9955\). The risk discounted weight of all the criteria for all the alternatives can form a risk discounted weight matrix as \(\textbf{RW}=(rw_{hv})_{k_2 \times k_4}\), which is the outcome of Phase II.

3.3 Phase III: Score assessment for alternatives

Phase III (3 steps) is to assess the scores for all alternatives by combining the outcomes of Phases I (consolidated weight of criteria \(\textbf{W}\)) and II (risk discounted weight of criteria \(\textbf{RW}\)). Thereby, the final assessment scores fully take the decision criteria’s satisfaction, importance, and risk factors into consideration. This phase fundamentally builds on the fuzzy TOPSIS approach (Hwang & Yoon, 1981) to score the alternatives.

3.3.1 Calculating initial criteria scores for alternatives by fuzzy TOPSIS

Each of the \(k_1\) decision-makers needs to grade all the \(k_2\) potential alternatives with respect to the performance of the \(k_4\) decision criteria, by using a 5-level linguistic scale based on triangular fuzzy numbers as defined in Table 8.

Table 8 Criteria performance scale for rating alternatives

This fuzzy criteria performance rating is denoted as \(\widetilde{\textbf{CP}}=(a_{hvp},b_{hvp},c_{hvp})_{k_2\times k_4 \times k_1}\). Consolidating all the decision-makers’ ratings, an aggregated fuzzy criteria performance rating \({\widetilde{cp}}_{hv}=(a_{hv},b_{hv},c_{hv})\) can be obtained, where

$$\begin{aligned} a_{hv}=\min _{p\in k_1}\{a_{hvp}\},\,\, b_{hv}=\frac{1}{k_1}\sum _{p=1}^{k_1}b_{hvp},\,\, c_{hv}=\max _{p\in k_1}\{c_{hvp}\}. \end{aligned}$$
(10)

A normalised fuzzy score matrix can then be established as \(\widetilde{\textbf{R}}=({\widetilde{r}}_{hv})_{k_2\times k_4}\), where

$$\begin{aligned} \begin{array}{l} \displaystyle \widetilde{{r}}_{hv}=\displaystyle \left( \frac{a_{hv}}{c_v^*},\frac{b_{hv}}{c_v^*},\frac{c_{hv}}{c_v^*}\right) \!\text { and } c_v^*=\max _{h\in k_2} c_{hv}, v\in SBC\\ \displaystyle \widetilde{{r}}_{hv}=\displaystyle \left( \frac{a_v^-}{c_{hv}},\frac{a_v^-}{b_{hv}},\frac{a_v^-}{a_{hv}}\right) \!\text { and } a_v^-=\min _{h\in k_2} a_{hv}, v\in SCC \end{array} \end{aligned}$$
(11)

where SBC and SCC are the sets of benefit criteria and cost criteria, respectively. The benefit criteria refer to those the larger the better, while the cost criteria mean those the smaller the better. Thus, the weighted normalised fuzzy score matrix \(\widetilde{\textbf{Z}}=({\widetilde{z}}_{hv})_{k_2 \times k_4}\) is calculated by aggregating the consolidated weight of criteria \(\textbf{W}\) got from Phase I as,

$$\begin{aligned} \displaystyle {\widetilde{z}}_{hv}={\widetilde{r}}_{hv}\bigotimes w_v. \end{aligned}$$
(12)

\(\widetilde{\textbf{Z}}\) are regarded as the initial assessment scores of criteria, which will be adjusted by the risk discounted weight in the next step.

3.3.2 Tuning initial criteria scores by risk discounted weight

A risk-tuned fuzzy score matrix \(\widetilde{\textbf{RZ}}=({\widetilde{rz}}_{hv})_{k_2 \times k_4}\) is calculated by assigning the risk discounted weight \(\textbf{RW}\) obtained from Phase II to the weighted normalised fuzzy score matrix \(\widetilde{\textbf{Z}}\) as

$$\begin{aligned} \displaystyle {\widetilde{rz}}_{hv}={\widetilde{z}}_{hv} \bigotimes {rw}_{hv}. \end{aligned}$$
(13)

3.3.3 Determining final scores of alternatives by fuzzy TOPSIS

Continuing with the fuzzy TOPSIS approach, it computes in this step the fuzzy positive ideal solution (FPIS, \(\textbf{A}^*\)) and fuzzy negative ideal solution (FNIS, \(\textbf{A}^-\)) as,

(14)

Then the distances between the risk-tuned fuzzy scores \(\widetilde{\textbf{RZ}}\) and FPIS or FNIS for each alternative are defined respectively as,

$$\begin{aligned} \begin{array}{l} \displaystyle fd_h^+=\sum \nolimits _{v=1}^{k_4}dis({\widetilde{rz}}_{hv},{\widetilde{rz}}^*_v),\\ \displaystyle fd_h^-=\sum \nolimits _{v=1}^{k_4}dis({\widetilde{rz}}_{hv},{\widetilde{rz}}^-_v), \end{array} \end{aligned}$$
(15)

where dis states the distance between two fuzzy variables as,

$$\begin{aligned} \displaystyle dis({\widetilde{x}},{\widetilde{y}})=\sqrt{\frac{1}{3}\left[ (a_x-a_y)^2+(b_x-b_y)^2+(c_x-c_y)^2\right] }. \end{aligned}$$
(16)

Lastly, the closeness coefficient of each alternative \(CC_h\) considers the distances from \(\widetilde{\textbf{RZ}}\) to FPIS and FNIS simultaneously, which is formulated as,

$$\begin{aligned} \displaystyle CC_h=\frac{fd_h^-}{fd_h^-+fd_h^+}. \end{aligned}$$
(17)

Up to here, the final scores of alternatives \(CC_h\) have been obtained. The alternatives can be ranked from the best to worst based on \(CC_h\) values in descending order, for decision-makers’ convenience to further select the best option.

4 Application of 3-phase MCDM method: Hydrogen storage method selection for lightweight vehicles

Sustainable transport contributes to the reduction of carbon emissions. Some countries provide incentives to shift petroleum-fueled vehicles to hydrogen vehicles in order to mitigate environmental damage caused by the widespread use of gasoline and diesel fuel. The ensuing need for reliable decision mechanisms for automotive companies to assess the hydrogen-related alternatives and select the best option can be met by the proposed 3-phase MCDM method.

In this section, we use a numerical example case of hydrogen storage method selection for lightweight vehicles, which has the characteristics of high investment and long-life cycle. Our 3-phase MCDM method is demonstrated according to Sect. 3 on the assessment and selection of hydrogen storage methods.

4.1 Phase I: Weight determination of the decision criteria

Phase I mainly applies the fuzzy AHP and modified Kano model. A team of \(k_1=3\) decision-makers is established, containing a chief technology officer (CTO) of a hydrogen storage tank manufacturer (\(DM_1\)), a chief financial officer (CFO) of a hydrogen storage tank manufacturer (\(DM_2\)), and an automotive manufacturer (\(DM_3\)). The importance weights of the three decision-makers are rated as 5, 4, and 3, respectively, as per their dominance in the decision-making.

4.1.1 Building decision hierarchy structure

The three decision-makers determine a decision hierarchy structure (see Fig. 2). It includes \(k_3=4\) main decision criteria, namely capacity, system cost, operation, and health & safety. \(k_4=8\) sub-criteria in total can be found under all the main criteria. \(k_2=3\) alternatives of hydrogen storage methods will be assessed for selection. The hydrogen storage alternative A uses a compressed gaseous hydrogen tank (type IV hydrogen tank at 700 bar and atmospheric temperature), the hydrogen storage alternative B uses a liquid hydrogen tank (atmospheric pressure and -253\(^{\circ }\)C), while the hydrogen storage alternative C uses cryo-compressed hydrogen tank (250–350 bar and at cryogenic temperature).

Fig. 2
figure 2

Decision hierarchy structure of hydrogen storage method selection (Alternative A, B and C) for lightweight vehicles

4.1.2 Identifying weight of quality attributes for criteria by Kano model

Eight sets of SKQs for the eight criteria, each of which consists of functional questions f.1-f.3 and dysfunctional questions d.1-d.3 are distributed to every decision-maker to collect answers. Here we present the example answers from \(DM_1\) in Table 9 and omit the answers from \(DM_2\) and \(DM_3\). Table 3 in the last section illustrated answers from hydrogen storage tank manufacturer (\(DM_2\)) to the SKQ for criterion boil-off loss target (C4-1). The answers to each SKQ are compared with Table 2 for calculating the weight of quality attributes for all the eight criteria \(\textbf{AT}\). Combining with the decision-makers’ weighting on the importance of short, medium and long-term effects,Footnote 6 the results of \(\textbf{AT}\) are shown in Table 10.

Table 9 CTO’s (\(DM_1\)) answer to the SKQ for all criteria
Table 10 Weight of quality attributes for all criteria

4.1.3 Determining importance weight of quality attributes by fuzzy AHP

A fuzzy pairwise comparison matrix \(\widetilde{\textbf{AP}}\) can be obtained by scoring the relative importance between the six quality attributes as illustrated in Table 11. The importance weight vector of the quality attributes can be calculated as \(\textbf{AW}=(0.3875, 0.4677, 0.1384, 0.0055, 0.0005, 0.0005)^T\), which indicates that the ‘must-be’ attribute is the most important one.

Table 11 Fuzzy pairwise comparison matrix of quality attributes

4.1.4 Calculating satisfaction weight of criteria

The satisfaction weight of all the eight criteria \(\textbf{SW}\) integrates the weight of quality attributes \(\textbf{AT}\) and the importance weight of quality attributes \(\textbf{AW}\). The result of \(\textbf{SW}=(0.2342, 0.2276,\) \(0.2300, 0.2118, 0.2021, 0.1736, 0.2143, 0.2813)^T\).

4.1.5 Determining importance weight of criteria by fuzzy AHP

By using fuzzy pairwise comparison to score the relative importance between the four main criteria (see Table 12), as well as the sub-criteriaFootnote 7 under each main criterion, the importance local weight vectors \(\textbf{LW}\) can be computed. As shown in Column 2 of Table 13, the importance weight of the four main criteria \(\textbf{LW}=(0.3504, 0.0005, 0.1561, 0.4929)^T\). The results of the importance weight vectors of the sub-criteria under the four main criteria are in Column 4. Column 5 refers to the normalised importance weight vector of all the eight sub-criteria \(\textbf{GW}\).

Table 12 Fuzzy pairwise comparison matrix of main criteria
Table 13 Importance weight of criteria

4.1.6 Integrating satisfaction and importance weight for consolidated weight

The consolidated weight of all the eight criteria \(\textbf{W}\) is obtained by combining the satisfaction weight \(\textbf{SW}\) with the importance weight \(\textbf{GW}\). As the final outcome of the Phase I, \(\textbf{W}=(0.1695, 0.1648, 0.0005,\) 0.0683, 0.0651, 0.0001, 0.1310, 0.4008), where C4-2 (permeation & leakage) is the most important sub-criterion, and C4 (health & safety) is the most important main criterion.

4.2 Phase II: Risk assessment on the criteria for alternatives

Phase II is based on the FMEA method, which comprises the following three steps.

4.2.1 Designing FMEA evaluation schemes

The decision-maker panel defines the FMEA evaluation schemes in view of severity (Table 14), likelihood (Table 15), and control (Table 16) dimensions, respectively, for all the eight criteria.

Table 14 FMEA evaluation scheme for risk severity (S) for all criteria
Table 15 FMEA evaluation scheme for risk likelihood (L) for all criteria
Table 16 FMEA evaluation scheme for risk control (C) for all criteria

4.2.2 Defining failure modes and ranking risks of criteria for alternatives

For each alternative, the decision-maker panel develops an FMEA document, which is composed of failure modes with the effects, causes and control approaches for all the eight criteria, as well as the ranks in severity (S), likelihood (L) and control (C) dimensions based on the defined FMEA evaluation schemes. The FMEA documents for all three alternatives are combined and given in Table 17.

Table 17 FMEA document for all alternatives

4.2.3 Calculating risk discounted weight for criteria scores

The risk discounted weight of all the eight criteria \(\textbf{RW}\) is calculated for all three alternatives based on the FMEA documents Table 17. \(\textbf{RW}\) is the final outcome of Phase II, the values of which can be found in the square brackets in Table 18.

Table 18 Weighted normalised fuzzy scores \(\widetilde{\textbf{Z}}\) and risk discounted weight \(\textbf{RW}\) of all criteria for all alternatives

4.3 Phase III: Score assessment for alternatives

Phase III utilises the fuzzy TOPSIS approach.

4.3.1 Calculating initial criteria scores for alternatives by fuzzy TOPSIS

The decision-makers grade all three alternatives regarding the performance of all the eight criteria and get \(\widetilde{\textbf{CP}}\) (see Table 19). On top of this, the weighted normalised fuzzy score matrix \(\widetilde{\textbf{Z}}\)Footnote 8 can be computed by compositing the consolidated weight of criteria \(\textbf{W}\). The values in \(\widetilde{\textbf{Z}}\) are provided in the round brackets in Table 18, which are considered as the initial assessment scores of criteria.

Table 19 Fuzzy criteria performance rating

4.3.2 Tuning initial criteria scores by risk discounted weight

The initial criteria scores, i.e., the weighted normalised fuzzy score matrix \(\widetilde{\textbf{Z}}\) should be tuned by the risk discounted weight \(\textbf{RW}\) to produce the risk-tuned fuzzy score matrix \(\widetilde{\textbf{RZ}}\). The values of \(\widetilde{\textbf{RZ}}\) can be calculated from Table 18.

4.3.3 Determining final scores of alternatives by fuzzy TOPSIS

The final score, i.e., the closeness coefficient of each alternative \(CC_h\) can be calculated based on the distance between the risk-tuned fuzzy scores \(\widetilde{\textbf{RZ}}\) and FPIS or FNIS. We can get the result of \(CC_1=0.6159\) for alternative A, \(CC_2=0.6191\) for alternative B, and \(CC_3=0.6195\) for alternative C. It is indicated that hydrogen storage method C is ranked highest in the assessment as the best option to be recommended for use in lightweight vehicles, while method B is the second, and method A is the last in this ranking.

4.4 Discussions

A sensitivity analysis is performed to assess the robustness of time-based satisfaction factor, and a comparison with a classic fuzzy MCDM method is conducted. For sensitivity analysis, our proposed method is compared with ones that consider only short-, medium-, and long-term effects when decision-makers respond to the SKQ for all criteria. The results are presented in the first four columns of Table 20. Discrepancies are observed in the consolidated weight of criteria \(\textbf{W}\) with a rank reversal for criteria C3-1 and C3-2 between our method and the long-term-only method. The findings reveal a notable shift from the preferred hydrogen storage alternative C (\(CC_3=0.6195\)) in our method to alternative B (\(CC_2=0.6217\)) in the short-term-only method. It is evident that our method offers a more comprehensive perspective. The divergence in short-term-only and long-term-only results underscores the significance of our method in considering the time effects.

The results of method comparison with a classic fuzzy MCDM (fuzzy AHP and fuzzy TOPSIS) are illustrated in the first and last columns of Table 20. While the rankings of criteria weights and alternatives share the same one as our method, specific differences in the weights and closeness coefficient scores are observed. Our method integrates time-based satisfaction and risk factors, effectively preventing substantial biases or deviations that might surface as the project progresses. Consequently, our method ensures a more accurate and inclusive assessment, thereby enhancing the reliability and applicability of overall findings in planning high-investment and long-life cycle projects.

Table 20 Sensitivity analysis and method comparison

Our method is applied to the sustainable transport industry, especially for automotive companies seeking the best hydrogen storage method for lightweight vehicles. The results of the example case have demonstrated its efficacy in tackling the complex decision-making process involved in evaluating and selecting options that require high investment and a long-life cycle. When considering the four primary criteria identified, health & safety dominated the importance weight followed by capacity. The consolidated weight of criteria indicates permeation & leakage is the most important criterion, followed by gravimetric capacity and volumetric capacity with similar weights. The fill time is the least important criterion which accounts for only 0.0001.

Risk assessment of all alternatives shows that hydrogen storage alternative A exposes the least to all risks. The highest risk discount hits criterion temperature for hydrogen storage alternative B: elevated temperature over \(5\%\) of standard temperature due to poor quality control will cause a serious leak and tank cracking. This risk can be controlled by cooling system temperature monitoring and stopping charging as soon as possible. After assessing the performance of alternatives on each criterion and tuning the score by risks, the closeness coefficient of each alternative can be determined by fuzzy TOPSIS. In this implication, the most favorable method is hydrogen storage alternative C which uses a cryo-compressed hydrogen tank with 250–350 bar at cryogenic temperature.

5 Web application and policy implications

A free, user-friendly, and interactive web application with GUI is translated from the proposed 3-phase MCDM method. It is built in the HyChain platform to evaluate any stakeholders in the hydrogen supply chains (see the homepage in Fig. 3, accessible via https://hychain.co.uk/). Decision-makers can create a decision hierarchy structure, evaluate alternatives, and select the best option for stakeholders. This section briefly illustrates the user interface of the application by using it to solve the example case in Sect. 4.

As the first step, the primary form needs to be filled out to define the dimensions of the decision structure (as shown in Fig. 4, which is based on Sect. 4’s data). Under ‘Criteria’, users can click the ‘+’ or ‘-’ button on the right to add or delete a main criterion. The names of all the main and sub-criteria are required, as well as the number of failure modes under each sub-criterion. Users need also to specify the number of decision-makers, their importance, and the names of alternatives at this stage. Once the initial inputs are done, users should click the ‘Build Secondary Form’ button, which proceeds to generate the remaining forms based on the dimensions stipulated. Users then need to fill out the secondary forms, which contain the sections of ‘Criteria Type’ (Fig. 5), ‘Kano Questionnaire’ (Fig. 6), ‘Pairwise Comparisons for Relative Importance’ (Fig. 7), ‘Failure Mode and Effects Analysis (FMEA)’ (Fig. 8), and ‘Criteria Performance Scoring for Alternatives’ (Fig. 9)Footnote 9. Some forms, e.g., the ‘Kano Questionnaire’ (Fig. 6) requires users to switch sub-forms by picking specific ‘Decision Maker’, ‘Main Criteria’, and ‘Sub Criteria’ from the drop-down lists. After all the secondary forms are completed, users should click the ‘Calculate Results’ button. The back-end 3-phase MCDM method programmed in MATLAB computes and returns the final results, given in the ‘Final Ranking of Alternatives’ form (Fig. 10). The results match what are obtained from Sect. 4.3.3. The web application allows users to import input data and export output data via Microsoft Excel spreadsheet as well.

This tool can be customised to suit the specific circumstances of various industries or companies’ contexts. However, practitioners need to be mindful of several critical aspects: It is essential that all members of the decision-making team receive comprehensive training and ensure each step of the tool is clearly defined, discussed within the team and properly executed. Additionally, there exists the possibility of decision-makers not fully grasping the risk landscape, which can lead to the potential overlooking of critical risks. Finally, it is vital to communicate the significance and benefits of the method to all stakeholders, particularly to the high-level managers, who may need to be convinced of the value of engaging.

This tool can be strategically employed by companies and policymakers to guide the allocation of resources, particularly for projects that require long-term investment. Our research supports the creation of comprehensive regulatory frameworks that reinforce reliable and viable business decisions, where the mandatory risk evaluation emphasises the importance of foreseeing and mitigating potential risks. The application of the method enhances the understanding of the returns and risks across the lifespan of projects, leading to projects that not only meet immediate needs but also contribute to long-term sustainability goals. The integration of AI-powered decision support tools into educational programs for future managers and policymakers will ensure the new generation of leaders who are well-versed in sophisticated project reliable assessment methodologies. On an international scale, such assessment tools can inform the development of global standards for evaluating and benchmarking long-life cycle projects, thereby encouraging international collaboration and investment.

Fig. 3
figure 3

Homepage of the web application for hydrogen supply chain stakeholder assessment

Fig. 4
figure 4

The primary form interface of the web application (filled out based on Sect. 4)

Fig. 5
figure 5

‘Criteria Hierarchy’ and ‘Criteria Type’ interface in the secondary form (based on Sect. 4)

Fig. 6
figure 6

‘Kano Questionnaire’ interface in the secondary form (based on Sect. 4)

Fig. 7
figure 7

‘Pairwise Comparisons for Relative Importance’ interface in the secondary form (based on Sect. 4)

Fig. 8
figure 8

‘Failure Mode and Effects Analysis (FMEA)’ interface in the secondary form (based on Sect. 4)

Fig. 9
figure 9

‘Criteria Performance Scoring for Alternatives’ interface in the secondary form (based on Sect. 4)

Fig. 10
figure 10

‘Final Ranking of Alternatives’ as the final results (based on Sect. 4)

6 Conclusions

Selecting and evaluating sustainable alternatives, such as suppliers, sites, and technologies for investment is a critical and complex decision for business managers, particularly due to the inflexible nature of such choices once they are made. The classical fuzzy MCDM approaches like fuzzy AHP and fuzzy TOPSIS can identify the core of investors’ requirements and rank the alternatives based on their closeness to the ideal solutions. However, our results indicate that these classical approaches fail to address the key aspects of the selection and evaluation process, particularly the decision-makers’ satisfaction and potential risks, leading to questionable reliability.

This study addresses these limitations by introducing an innovative 3-phase fuzzy MCDM method that uniquely integrates time-based satisfaction and risk factors as dual pillars of reliability assessment in high-investment, long-life cycle projects. This method enhances the conventional MCDM framework by:

  1. 1.

    Pioneering SKQ in the time-based satisfaction weight, which applies a dynamic mechanism in Kano model. This adaptation considers uncertainties in long-life cycle projects, quantifying the vagueness of human thinking by including a 0–5 scaled degree of agreement on SKQ.

  2. 2.

    Tuning the initial criteria scores using a risk-discounted weight from FMEA method. This adjustment diminishes the result deviation between expected and actual outcomes caused by risks, acknowledging that FMEA documents are live and can evolve with new information and collective team experience to refine decision-making accuracy.

The application of this advanced MCDM method is demonstrated through a case study on a hydrogen storage method selection for lightweight vehicles, a decision that requires sustainability and is typically set for the vehicle’s lifespan of up to 20 years. The results indicate that ‘health & safety’ with a weight of 0.5318, emerges as the most important main criterion, in which ‘permeation & leakage’ with a weight of 0.4008, is the most important sub-criterion among all. The results identify ‘hydrogen storage method C (cryo-compressed hydrogen tank)’ as the optimal choice among the alternatives, followed by method B (liquid hydrogen tank) and method A (compressed gaseous hydrogen tank).

Debates over the final decision are inevitable. This novel 3-phase fuzzy MCDM method for reliable assessment not only ranks alternatives overall but also directs decision-makers to the consolidated weight of criteria and risk discounted weight as shown in Tables 13 and 18 to check for the most important and risk-related criteria. Therefore, this insight offers a transparent advantages and disadvantages analysis for each alternative to investors.

Furthermore, we have developed the proposed MCDM method into a user-friendly web application to ensure accessibility and continuous improvement. The tool is freely available, empowering end-users across various industries to engage in a reliable assessment of sustainable hydrogen supply chain management, or more broadly, any other high-investment and long-life cycle projects.

In conclusion, this paper bridges the gaps between the theoretical and practical development of reliable assessment, offering a reliable, dynamic, and comprehensive decision-making tool for large-scale and long-term sustainable investments. This tool can be strategically utilised by businesses and policymakers to guide resource allocations and to promote the establishment of comprehensive regulatory frameworks that underpin reliable and sustainable business decisions.

Finally, the future direction of research is broad and promising. A primary area involves the integration of additional reliability-oriented factors that can be considered and incorporated in more harmonised and effective ways, as suggested by (Kannan et al., 2023). In addition, other suitable (fuzzy) MCDM can be adapted to combine with the Kano model and FMEA for creating a new MCDM for reliable assessment (Amor et al., 2023). Moreover, alternative methods except the Kano model can be stimulated to refine the performance of the time-based satisfaction factor and the risk factor (Choudhary et al., 2023).