Introduction

The drivers of climate change within socio-ecological systems, such as land-use change and greenhouse gas emissions, alter anthropogenic and climatic pressures within those systems (Schröter et al. 2005; Folke 2006; Holman et al. 2016). The interconnectedness of these drivers adds uncertainty to our understanding of recent climate behaviour and of future climate change projections, impacts and vulnerabilities. Such uncertainties have often been accommodated using scenarios to systematically answer “what-if” questions (van Ittersum et al. 1998; Zurek and Henrichs 2007; van Vuuren et al. 2012). Unlike predictions and forecasts, scenarios do not imply a probability or likelihood (van Vuuren et al. 2012). Instead, scenarios have been defined as “plausible descriptions of how the future may develop, based on a coherent and internally consistent set of assumptions about key relationships and driving forces” (Alcamo et al. 2005). As such, scenarios can take quantitative (numerical) and qualitative (narrative) forms (Bamberger 2000; Philcox et al. 2010).

The complementarity of qualitative and quantitative scenarios is considered a potential strength in addressing complex problems (Vermeulen et al. 2013), particularly since the so-called story and simulation (SAS) approach (Alcamo et al. 2006) became mainstream in scenario development (Kok 2009). SAS is a ten-step approach for developing and translating (often stakeholder-led) narratives into (often scientist-led) model quantifications, iterating and revising them until they are linked (van Vliet et al. 2010). SAS yields credible, plausible and innovative scenarios because it combines expert models with the creative elements introduced by stakeholders (Alcamo and Henrichs 2008). The co-production also ensures consistency between stakeholder input and model results (Kemp-Benedict 2012; Schweizer and Kriegler 2012). The final scenarios are more relevant and legitimate for end-users because stakeholders can recognise their views (i.e. stakes) in the narratives.

Although conceptually strong, operationalising SAS raises practical issues. Alcamo (2008b) identified two SAS pitfalls: the ‘reproducibility’ and ‘conversion’ problems. The ‘reproducibility’ problem exists because assumptions and mental models are not made explicit when a scenario narrative is developed, whereas the ‘conversion’ problem exists because narratives cannot be directly translated into quantifications. Moreover, the distinction between the two problems is often not straightforward. For example, fuzzy cognitive maps (Kosko 1986), recently applied by Kok (2009) and van Vliet et al. (2010), ‘map’ variables and connections by assigning a weight to each connection. In the literature, this method has been described as improving the structure and reproducibility of qualitative scenarios (Alcamo 2008a) and has been applied as a conversion tool between qualitative and quantitative scenarios (Mallampalli et al. 2016). Nevertheless, most state-of-the-art studies have tended to focus on the ‘reproducibility’ problem, as evidenced by the proliferation of systematic stakeholder-based modelling (Voinov and Bousquet 2010), whilst studies focusing on the ‘conversion’ problem show methodological trade-offs between model compatibility, stakeholder expertise and development of narratives (Mallampalli et al. 2016). Hence, further studies addressing the ‘conversion’ problem are urgently needed. However, in order to tackle the ‘conversion’ problem in SAS, there is a need to take a step back, i.e. to better understand the gaps in knowledge within both qualitative and quantitative scenarios separately, before combining them (van Vliet et al. 2010). Scenario narratives integrate imagination in strategic thinking and combine short-term preoccupations in long-term planning with analytical thinking and creative visioning (Rasmussen 2005). Because narratives provide “holistic views” of the future that transcend the sum of their parts (Rasmussen 2005), reducing narratives to a selection of model variables is highly complex. Whilst acknowledging this gap, we argue that narratives can be bridged to models while maintaining their original characteristics.

In SAS, two sources of uncertainty matter: epistemic and linguistic uncertainty (Regan et al. 2002; Uusitalo et al. 2015) (Fig. 1). Epistemic uncertainty is due to imperfect knowledge about something that is theoretically knowable. In statistical models, epistemic uncertainty is further distinguished from aleatory uncertainty, which relates to irreducible and unavoidable variation in stochastic processes (Uusitalo et al. 2015). Statistical models represent uncertainty “in terms of (aleatory) probability distributions and (epistemic) parameters” (O'Hagan et al. 2006). Linguistic uncertainty is inherent to our natural language and includes vagueness and ambiguity. Here, we categorise aleatory uncertainty as part of a broader “epistemic uncertainty” category (Regan et al. 2002) and distinguish it from “linguistic uncertainty”.

Fig. 1

Sources and treatment of uncertainty concepts when constructing scenarios with the story and simulation approach. The combined result consists of a narrative (large circle) and a set of quantified inputs to a model (large square). Both narratives and model inputs can be made more consistent and transparent if translated into similar terms (small triangle, small circle and small square). Adapted from Regan et al. (2002)

The linguistic and epistemic sources of uncertainty remain separate until narratives are translated to produce quantifications. Therefore, narratives and models are treated as two separate products (a circle and a square in Fig. 1). Even though methods have been created to translate narratives into quantifications, a clear operational link between narratives and models is still lacking (Houet et al. 2016). Currently, two main approaches address the SAS ‘conversion’ problem systematically and transparently: a Bayesian reasoning approach, as outlined by Kemp-Benedict (2010), and a fuzzy-set-based approach, as outlined by Kok et al. (2014) and Alcamo (2008b). Bayesian approaches are frequently used because they allow stakeholder input (prior distributions) to be refined (to produce posterior distributions) through confrontation with data (e.g. Apostolakis (1990) and Van der Sluijs (2007)). Kemp-Benedict (2010) uses Bayesian statistics to propose a direct quantification of narratives in terms of how much they differ from a reference or historical data set. The Bayesian statistics approach tackles the ‘conversion’ problem by converting qualitative elements directly into the desired model input, without extra data processing. Although statistical approaches (both frequentist and Bayesian) structure uncertainty due to unavoidable randomness and imperfect knowledge, we argue that this alone poorly covers the full spectrum of uncertainty in participatory (stakeholder) scenario development. Even a systematic and transparent approach such as Kemp-Benedict’s (2010) is not universally applicable: assumptions are quantified in prior distributions, and we question whether starting from such assumed distributions is realistic when the participatory setting is characterised by diverging stakeholder expertise.

The real challenge for SAS is to account for vagueness when narratives are developed in participatory settings, especially because stakeholder engagement “has become almost a ‘must’” (Voinov and Bousquet 2010). The socioeconomic scenarios that we analyse in this paper are to be understood in this context: they are co-developed by scientists and stakeholders. The socioeconomic scenarios therefore include vague or imprecisely defined terms and elements within the narratives, owing to the heterogeneity of assumptions arising from a wide range of viewpoints and expertise (Mallampalli et al. 2016). One approach to overcoming these issues is to combine stakeholder and other expert opinions to define numerical ranges with associated levels of confidence, and probability density functions that introduce extra assumptions (Schoemaker 1991; Refsgaard et al. 2007; Brown et al. 2014). Such an approach, however, has two deficiencies: it requires a (generally opaque) mixture of inputs from two distinct groups, i.e. modellers and stakeholders, and it ignores any outliers and asymmetries within the stakeholder-generated input data. The fuzzy sets approach, in contrast, relies on the existing structure of the stakeholder data to generate probability density functions. Consequently, the method prioritises fidelity and transparency, with the important advantage that final outputs can be directly linked to stakeholders’ inputs. In addition, fuzzy sets allow for weighting on the basis of stakeholders’ professed confidence levels.

The objective of this paper is to develop and apply a systematic approach to translate the linguistic uncertainty in narratives into the epistemic uncertainty of model input, thereby addressing the ‘conversion’ problem when linking narratives with models. After developing the participatory scenarios, we assign the concept of linguistic uncertainty to the narratives and that of epistemic uncertainty to the model quantifications of the scenario study. We operationalise these concepts by first translating the vagueness of narratives into linguistic variables and fuzzy sets, and secondly deriving probability density functions from these to generate model input. This paper complements earlier studies on fuzzy sets, e.g. Alcamo (2008b) and Kok et al. (2014), that focus on results only. The discussion addresses our choice to ‘bridge’ linguistic and epistemic uncertainties, rather than attempting to reduce them.

Design and methods

Developing participatory scenarios

The socioeconomic scenarios analysed in this paper were developed within the EU-funded IMPRESSIONS project (Harrison et al. in review). The objective of the scenarios is to provide the context for understanding future impacts, adaptation and vulnerability to climate change at different scales in Europe. Because of their geographic breadth and their combined narrative and quantitative character, the IMPRESSIONS scenarios can be used to test the fuzzy sets methodology and to compare results across heterogeneous stakeholder groups. The IMPRESSIONS scenarios were intended to directly inform numerical inputs to a European-scale model known as the Integrated Assessment Platform (IAP) (Harrison et al. 2015), with the scenario-specific default values and uncertainty ranges of model inputs being derived from the stakeholder inputs. The quantifications derived from stakeholder input explicitly recognise both linguistic and epistemic uncertainty: the stakeholders intended the ranges (and the scenarios within which they occur) to allow for epistemic uncertainties in the quantities described (section “Producing narratives and quantifications in a participatory scenario process”), while the fuzzy-set and probabilistic interpretation accounts for linguistic uncertainty (sections “Measuring vagueness using fuzzy sets” and “Probabilistic interpretation of vagueness”).

The results discussed in this paper are based on scenario quantification for five case studies covering Europe and Central Asia: Europe as a whole (Europe) (Kok et al. in review); Central Asia (Central Asia); national-scale scenarios for Scotland (Scotland); river basin-scale scenarios for Iberia (Iberia); and municipality-level scenarios for Hungary (Hungary). For the Central Asia, Iberia and Hungary case studies, heterogeneous groups of stakeholders were engaged. For Europe and Scotland, scientists acted as stakeholders to modify a set of existing stakeholder-developed scenarios (Harrison et al. 2013, 2015). In this paper, we refer to these scientists as ‘expert stakeholders’.

Producing narratives and quantifications in a participatory scenario process

Stakeholders were invited to produce narratives and key quantifications in a 2-day workshop within each case study. Stakeholders were selected to cover a wide range of expertise across different sectors and to vary in age, country and educational background (see Gramberger et al. 2015, and Supplementary Material 1 for details on the stakeholder selection process). All scenario products were produced by stakeholders through a workshop process that alternates group brainstorming sessions with plenary discussions, known as the STIR approach (Gramberger et al. 2015). The approach is designed so that narratives and key input quantifications become intrinsically connected, because the same mix of facilitators and stakeholders both produces the scenarios and quantifies the key drivers. Such an approach is fundamental for consistent co-production of narratives and quantifications (see Supplementary Material 2 for all steps in the co-production of narratives and quantifications).

The workshop process consists of three main components. In the first component, stakeholders are guided to list, discuss and select key uncertainties relevant to all scenarios and to develop narratives for each individual scenario. In the second component, narratives are discussed in groups (group exercise), where stakeholders are asked to provide qualitative (linguistic) trends for key variables in a written questionnaire (an example of the group exercise questionnaire is provided in Supplementary Material 3). The variables’ descriptions are presented to stakeholders in the modellers’ original wording to avoid misinterpretation. The third component is an individual exercise (an example of the individual exercise questionnaire is provided in Supplementary Material 3), in which each stakeholder provides a personal opinion on what quantitative ranges represent those qualitative trends.

Given inevitable time constraints and the importance of completing all steps within the workshop, the maximum number of variables that could be quantified by stakeholders was limited to three or four. These variables were selected based on two criteria. First, the variables had to reflect the expertise of most of the invited stakeholders and align well with the key issues for the case study. Second, the variables should relate to model input parameters that were among the most sensitive in the model. For the Europe case study, the results of a full sensitivity analysis of the IAP (Kebede et al. 2015) were available to support the choice of sensitive variables. In addition, stakeholders provided qualitative guidance to inform the modellers’ quantification of a much wider range of socioeconomic variables used within the impact models. For example, stakeholders provided trends for four capitals (human, social, manufactured and financial) of resource availability (Porritt 2007) to be applied to model vulnerabilities to climate change (Dunford et al. 2015). This paper only analyses the trends for the stakeholder-quantified variables; analysis of the capital trends is therefore excluded.

Together with the trends and quantification of variables, stakeholders were asked to indicate their confidence when quantifying each variable, providing qualitative information on the stakeholders’ professed confidence levels about the quantifications. The question asked of both stakeholders and expert stakeholders after the quantification of each variable was ‘How confident are you for the quantification you provide for this variable based on your background knowledge (0–10)? (0 = not confident; 10 = very confident)’. We classified the data into four categories to analyse a qualitative ‘confidence index’. The sample was not suited to statistical inference: firstly, the sample size was too limited due to resource limitations; secondly, the motives behind confidence levels cannot be assessed; thirdly, a psychological and cultural analysis of the stakeholders’ backgrounds is beyond the scope of this analysis. We therefore decided to analyse the confidence index qualitatively. The analysis of this index follows one of the main assumptions introduced in this study, i.e. that incorporating a subjective level of confidence could alter the value or weight of a quantification (Van der Sluijs 2007).
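For illustration, the binning of the 0–10 scores into a four-category ‘confidence index’ could look like the following minimal sketch in Python. The category labels and cut-points below are hypothetical assumptions for illustration only; the exact boundaries used in the study are not reproduced here.

```python
def confidence_category(score):
    """Bin a 0-10 self-reported confidence score into one of four qualitative
    categories. Cut-points and labels are illustrative assumptions, not the
    boundaries actually used in the study."""
    if score <= 2:
        return "not confident"
    if score <= 5:
        return "somewhat confident"
    if score <= 8:
        return "confident"
    return "very confident"

print([confidence_category(s) for s in (1, 4, 7, 10)])
```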

In the next section, we explore how to proceed with the quantification of a given variable using the ranges provided by stakeholders during a facilitated workshop. We use the variable ‘Change in food imports from 2010’ for the Europe case study as an example to follow the methodological steps and to illustrate the results for other linguistic variables.

Addressing the ‘conversion’ problem: quantification of narratives

Measuring vagueness using fuzzy sets

Vagueness in stakeholder-driven scenarios exists because each stakeholder could have a different interpretation of the linguistic term ‘high increase’ in food imports compared to 2010. To measure this vagueness, each stakeholder was asked to define what he/she personally meant by ‘high increase’ by providing a numerical range. The analysis of ranges derived from stakeholder values assumes that each stakeholder has a different, but equally valid, interpretation of the same statement due to different backgrounds, beliefs, knowledge, and so on.

Numerical ranges provided by each stakeholder can be represented with fuzzy numbers in a membership function (see, for example, Cornelissen et al. 2001). According to fuzzy logic, ‘high increase’ is a vague statement that should be addressed in a mathematical form (Zadeh 1975a, b), i.e. by quantifying the degrees of membership rather than assessing the likelihood or frequency of the linguistic term ‘high increase’. By providing a range for the linguistic term ‘high increase’, stakeholders estimate the vagueness of the linguistic term, thus addressing linguistic uncertainty. All the individual ranges constitute the input of each linguistic value for each linguistic variable (Fig. 2).

Fig. 2

Representation of the linguistic variable ‘Change in Food Import compared to 2010’ and its linguistic values, together with the numerical ranges provided by each stakeholder for a linguistic variable and the fuzzy restrictions. Membership functions define the degree of membership (μ_Ã on the y-axis) of fuzzy numbers on a 0–1 scale. Adapted from Zadeh (1975a)

We define the analysed variable as a ‘linguistic variable’ with a ‘linguistic value’. A linguistic value is the vague analogue of a numerical value, i.e. the imprecise, non-numerical value of a linguistic variable (Zadeh 1975a). The linguistic variable and the model variable have the same meaning in order to reduce the risk of misinterpretation: the model variable is presented to stakeholders as a linguistic variable without reducing or simplifying the meaning of the original model variable.

A linguistic variable can have several linguistic values. In our example, the linguistic variable ‘Change in food imports from 2010’ has five linguistic values ranging from ‘high decrease’ to ‘high increase’ (Fig. 2). The purpose of the linguistic values is to inform the calculation of fuzzy membership functions that will be used as model inputs. The membership function μ_A of a linguistic variable assigns a numerical value to each range (x), depending on how much the range “belongs” to its linguistic variable. For instance, in Fig. 3, we assume that all ranges of a linguistic variable lie within a space bounded by the median, which belongs completely to the linguistic variable (μ_A(x_median) = 1), and the maximum and minimum of the ranges, which belong least to the linguistic variable (μ_A(x_min) = μ_A(x_max) = 0).
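To make this step concrete, a minimal sketch of such a triangular (min, median, max) membership function is given below, assuming the shape described above. The stakeholder endpoints are hypothetical and serve only to illustrate the construction.

```python
import numpy as np

def membership(x, x_min, x_median, x_max):
    """Triangular membership function: the degree of membership mu_A(x) rises
    linearly from 0 at x_min to 1 at x_median and falls back to 0 at x_max."""
    x = np.asarray(x, dtype=float)
    rising = (x - x_min) / (x_median - x_min)
    falling = (x_max - x) / (x_max - x_median)
    return np.clip(np.minimum(rising, falling), 0.0, 1.0)

# Hypothetical endpoints (in %) pooled from stakeholder ranges for 'high increase';
# these numbers are illustrative, not taken from the workshops.
endpoints = [50, 120, 80, 100, 90]
x_min, x_med, x_max = min(endpoints), float(np.median(endpoints)), max(endpoints)
print(membership([x_min, x_med, x_max], x_min, x_med, x_max))  # -> [0. 1. 0.]
```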

Fig. 3

Defuzzification of the linguistic values ‘high decrease’ and ‘low increase’ for the linguistic variable ‘change in food import compared to 2010’. Defuzzification uses the centre of gravity method

The last step is the defuzzification of the linguistic value using one of the various existing operators. The operator used here is the centre of gravity (CoG) (Fig. 3), a continuous defuzzification operator, meaning that small variations in an input result in small changes in output values (Leekwijck and Kerre 1999). The centre of gravity of the membership function is defined in Eq. 1 by the minimum, median and maximum values of each linguistic value obtained from the results of the entire stakeholder group (Kok et al. 2014). The CoG was selected because it represents the most “central” value between the membership function’s minimum, median and maximum, and was therefore considered a better indicator of higher likelihood than the median, which is more sensitive to intermediate values. The disadvantage, however, is that the CoG is more sensitive to changes in extreme ranges than other metrics such as the mode or median.

$$ \mathrm{CoG}(x)=\frac{1}{3}\left(\min + \mathrm{median} + \max\right) $$
(1)

The CoG provides the basis for converting the qualitative changes into quantitative changes for each scenario, as described below. The quantified changes are then run in the IAP. The uncertainty around these changes is handled as set out below.
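In code, Eq. 1 reduces to a one-line computation over the pooled stakeholder endpoints. The sketch below reuses the hypothetical endpoints from the previous example.

```python
import numpy as np

def centre_of_gravity(endpoints):
    """Eq. 1: CoG of the triangular membership function defined by the
    minimum, median and maximum of the pooled stakeholder endpoints."""
    vals = np.asarray(endpoints, dtype=float)
    return (vals.min() + np.median(vals) + vals.max()) / 3.0

# Reusing the hypothetical 'high increase' endpoints from the previous sketch:
print(centre_of_gravity([50, 120, 80, 100, 90]))  # (50 + 90 + 120) / 3 ≈ 86.7
```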

Probabilistic interpretation of vagueness

The scenario quantification in the IAP recognises both epistemic uncertainty and the validity of each stakeholder’s perspective on future changes through the derivation of probability density functions for each model input. For each linguistic value of each linguistic variable, the CoG represents the single output of a fuzzy set. We define the CoG as the default value of the membership function. The variation in values around this default value is taken to define the linguistic uncertainty, which must be taken into account in subsequent analysis (modelling) steps. Generally, this can be achieved by representing the variation as a probability density function (PDF), allowing the form and range of the stakeholders’ quantification to be retained and also allowing parameter sampling for rigorous sensitivity or uncertainty analyses. Ideally, the form of the PDF will be derived from the frequency of values suggested by stakeholders, with Gaussian and uniform distributions offering particularly useful and contrasting alternatives. However, in many cases, the appropriate function is not clear, either because the frequency is asymmetric or multimodal, or because inadequate frequency data are available. In these cases, it may be preferable to define discontinuous probabilities that minimise the need for additional assumptions. In the case of the IAP, input parameter values were assigned different (linguistic) ranges: a ‘credible’ range within which the ‘default’ value occurs and a wider ‘possible’ range. Assuming a probability distribution for the CoG, we define, for each linguistic variable, a ‘default’ value, a ‘credible’ range and a ‘possible’ range. Beta distributions are often chosen in the literature when non-normality is assumed, owing to their flexibility and limited ranges (Brown et al. 2014); however, they are not universally appropriate. For consistency with previous work, we use beta distributions here, but the fitted distributions all had low alpha and beta parameters (between 1 and 2), suggesting that uniform distributions may have provided equally adequate fits.

Fitting of the distributions to each linguistic variable’s CoG was carried out with an online tool that requires a modal value scaled to [0,1] and 10% or 90% limits (Brown et al. 2014). The CoG was therefore scaled as in Eq. 2:

$$ \mathrm{Scaled\ CoG}(x)=\frac{\mathrm{median}-\min}{\max -\min} $$
(2)

This scaled CoG value was defined as the modal or ‘default’ value, with the 90% range taken to define the ‘credible range’ and the 95% range taken to define the ‘possible range’. This choice is based on confidence ranges previously used with the IAP (Brown et al. 2014) and on expert judgement about the distributions fitted to the stakeholder data, but other choices would have been equally possible.
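The online fitting tool (Brown et al. 2014) is not reproduced here; the sketch below illustrates one plausible way to perform an equivalent fit, parameterising the beta distribution by its mode (the scaled CoG of Eq. 2) and a single concentration parameter solved against a chosen percentile limit. The helper names (`beta_from_mode`, `fit_beta`) and all numbers are hypothetical.

```python
from scipy.stats import beta
from scipy.optimize import brentq

def beta_from_mode(mode, k):
    """Beta(a, b) with the requested mode; k = a + b - 2 controls concentration
    (k -> 0 recovers the uniform Beta(1, 1))."""
    return beta(1.0 + mode * k, 1.0 + (1.0 - mode) * k)

def fit_beta(mode, q90):
    """Solve for the concentration k at which the 90th percentile equals q90."""
    f = lambda k: beta_from_mode(mode, k).ppf(0.90) - q90
    return beta_from_mode(mode, brentq(f, 1e-9, 1e4))

# Scaled CoG (Eq. 2) as the modal value, with a hypothetical 90% limit:
scaled_cog = (90.0 - 50.0) / (120.0 - 50.0)   # ~0.57 on the [0, 1] scale
dist = fit_beta(scaled_cog, q90=0.85)
credible = dist.ppf([0.05, 0.95])    # 90% 'credible range'
possible = dist.ppf([0.025, 0.975])  # 95% 'possible range'
```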

Results

Addressing the ‘conversion’ problem: quantification of scenario trends and input to the integrated assessment platform

As the CoG is the defuzzification of a single linguistic value, all linguistic values of a given linguistic variable must be defuzzified to quantify the scenario trends until 2100. To exemplify this, we follow on from the example used in the Methods section of the linguistic variable ‘Change in food imports compared to 2010 [in %]’. For this linguistic variable, we have defuzzified five linguistic values: ‘high increase’ has a CoG of 88%; ‘low increase’ has a CoG of 22%; ‘no change’ has a CoG of 0%; ‘low decrease’ has a CoG of −27%; and ‘high decrease’ has a CoG of −63% (Fig. 4). These values are then applied according to the scenarios generated by stakeholders. For example, based on the narrative for the European SSP1 scenario, the expert stakeholders were asked to draw qualitative trends for the variable ‘Change in food imports compared to 2010 [in %]’. They decided that the trend should be a ‘low increase’ in 2025, a ‘low increase’ in 2055 and finally a ‘high decrease’ by 2100. With defuzzified values: in the European SSP1 scenario, food imports in Europe will be 22% higher in 2025 than in 2010, remain constant at 22% higher in 2055, and then decrease to 63% below 2010 levels by 2100 (Fig. 4). These values for each SSP, based on the CoG of each linguistic value, represent the default input values for the impact models used in the case studies analysed.
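This mapping step can be written out explicitly as a small lookup, shown below using the CoG values and the SSP1 trend reported above (a plain sketch, not project code).

```python
# CoG values as reported above (in % change from 2010):
cog = {"high increase": 88, "low increase": 22, "no change": 0,
       "low decrease": -27, "high decrease": -63}

# European SSP1 linguistic trend for 'Change in food imports compared to 2010':
ssp1_trend = {2025: "low increase", 2055: "low increase", 2100: "high decrease"}
ssp1_default = {year: cog[label] for year, label in ssp1_trend.items()}
print(ssp1_default)  # {2025: 22, 2055: 22, 2100: -63}
```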

Fig. 4

Quantification of the centre of gravity for the five linguistic values (high decrease–low decrease–no change–low increase–high increase) of the linguistic variable ‘Change in Food Import compared to 2010 [in %]’. Four future trends are shown for the European Shared Socioeconomic Pathways

The results suggest that the distribution fitted to the scaled CoG (interpreted as the mode of the distribution) is close to a uniform distribution. Based on an assumption of quasi-uniformity (i.e. apparent but not precise uniformity, see Supplementary Material 4), the PDF ranges were similar to the membership-function ranges for all quantified IAP variables. Ninety percent of the PDF was thus close to a 0.1 degree of membership, and 95% of the PDF was close to a 0.05 degree of membership, in the fuzzy set function (Fig. 5). The quasi-uniform distributions suggest that fuzzy numbers could directly provide all IAP input ranges (default, credible and possible) and account for nearly all stakeholder ranges, as they stay very close to the minimum and maximum of the fuzzy number (down to a membership of 0.05). From a conceptual perspective, the quasi-uniformity is also consistent with our decision to let stakeholders define the likelihood of their quantifications by providing a ‘range’, instead of a ‘most likely value’, for each linguistic value together with a ‘confidence’ index; presumably, it also reflects substantial uncertainty across stakeholders, with wider ranges resulting where more stakeholders are included.
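As an illustrative check of this correspondence (with made-up numbers, not the study data), one can compare the central 90% interval of a near-uniform beta fit with the points at which a triangular membership function drops to a membership of 0.1:

```python
import numpy as np
from scipy.stats import beta

a, m, b = 50.0, 90.0, 120.0   # hypothetical min, median and max of stakeholder ranges
dist = beta(1.3, 1.5)          # illustrative near-uniform fit on the [0, 1] scale

# Central 90% probability interval, rescaled to the original units:
p90 = a + dist.ppf(np.array([0.05, 0.95])) * (b - a)
# x-values at which the triangular membership function drops to 0.1:
mu_01 = np.array([a + 0.1 * (m - a), b - 0.1 * (b - m)])
print(np.round(p90, 1), np.round(mu_01, 1))  # similar intervals in the near-uniform case
```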

Fig. 5

Visualisation of ‘default’ and ‘credible range’ for the integrated assessment platform using stakeholder-derived quantification

Analysis of assumptions in quantifications of scenario trends

To test whether uniformity was the best assumption, we qualitatively analysed the ‘confidence’ index that the stakeholders and expert stakeholders provided with their quantification of the linguistic values for each linguistic variable (Supplementary Material 5). Three possible interpretations of the results were considered: (a) stakeholders, aware of their insufficient background knowledge, provide ‘very unlikely’ values and a low ‘confidence’ index; (b) stakeholders provide what an expert would consider a ‘reasonable or realistic’ estimate and are either ‘confident’ or ‘less confident’; or (c) stakeholders under- or overstate their expertise.

The qualitative analysis of the ‘confidence’ index supports our choice of a uniform-like probabilistic representation of likely values for two reasons. Firstly, we observed similar patterns in the ‘confidence’ index within the same case study rather than for the same variable. For example, confidence is generally lower for the Europe and Scotland case studies than for Central Asia, Hungary and Iberia. Stakeholders and expert stakeholders all tended to provide a similar confidence level independent of the variable, and thus independent of background knowledge. Secondly, we found no obvious correlation between ‘reasonable’ quantifications and the ‘confidence’ index for stakeholders or expert stakeholders: low confidence may indicate a lack of knowledge where unrealistic ranges were provided, or a critical attitude where reasonable ranges were provided. With the information available, however, we cannot infer whether, and how much, cultural and other personal factors (such as a critical attitude or understanding of the exercise) played a role in the ‘confidence’ index.

Ranges provided by stakeholders and scientists acting as stakeholders (‘expert stakeholders’) were also compared across case studies (Fig. 6) to understand whether minima and maxima could lead to different distributions (e.g. multimodal or particularly skewed for either stakeholders or expert stakeholders). From a qualitative point of view, the analysis did not show any specific pattern in either stakeholders’ or expert stakeholders’ ranges. The mean and standard deviation of ranges are higher for the more extreme linguistic values of the variables (i.e. ‘high decrease’ and ‘high increase’). The ‘confidence’ index seems to be inelastic to variation in ranges, with narrower ranges not consistently associated with greater confidence. For example, both Scotland case study expert stakeholders and Central Asia stakeholders provided single values rather than ranges for one linguistic value (represented by the leftmost column of each graph in Fig. 6), but Central Asia stakeholders produced more extreme values than Scotland case study expert stakeholders. Although Central Asia stakeholders and Scotland case study expert stakeholders quantified differently, they reported similar ‘confidence’ for the same variables (GDP and population trends). The ‘confidence’ index also differed between expert stakeholders: Europe case study expert stakeholders provided the fewest single-value (zero-width) ranges and Scotland case study expert stakeholders the most (Fig. 6). Assuming that scientists possess the background knowledge for the model input quantification, such differences in ranges may indicate a preference of some scientists for providing a most likely value instead of a range.

Fig. 6

Log of ranges for all linguistic variables for five case studies. The blue line is the theoretical normal distribution for all ranges (excluding single values, in the leftmost column) and the red dotted line is the empirical distribution across all linguistic values. Linguistic value ranges for Europe and Scotland were quantified by expert stakeholders, while linguistic value ranges for Hungary, Iberia and Central Asia were quantified by stakeholders

Discussion

Our analysis demonstrates and applies a method to translate vagueness into probabilities that serves two purposes: it improves current practice in operationalising the story-and-simulation approach, and it improves statistical and fundamental understanding of how uncertainties are perceived and dealt with.

To this end, we addressed linguistic and epistemic uncertainties by ‘bridging’ them, rather than attempting to reduce them, because narratives provide ‘holistic views’ of the future that models cannot fully capture. In our approach, both narratives and models remain ‘black boxes’ (the round and square shapes in Fig. 1) throughout the analysis: neither the linguistic uncertainties of the narratives nor the epistemic uncertainties of the modelling are reduced, but ‘bridged’. Effective methods do exist to unravel the individual ‘black boxes’. These methods have the advantage of adding structure to either the narratives (van Vliet et al. 2012) or to both narratives and models (see, for example, the cross-impact balance approach for narratives in Schweizer and Kriegler (2012) or fuzzy cognitive mapping to link narratives and models in van Vliet et al. (2010)). However, these methods do not yet address the different uncertainties, and they are less transparent or too complicated to carry out in stakeholder workshops.

We chose to represent the stakeholders’ fuzzy numbers with uniform-like distributions for input to the IAP to avoid adding further assumptions and unintended interpretation. However, we provide a description of stakeholder ranges to enable a qualitative comparison with direct quantification of stakeholder-led narratives by impact modellers (Fig. 6). This analysis shows that linguistic values in all case studies lead to different PDFs. In the Hungarian case study, Gaussian probabilities could approximate the distributions reasonably well in most cases, once the ‘0’ values (or zero uncertainty) were removed. We interpret this result as supporting the idea that stakeholders themselves can ‘bridge’ linguistic and epistemic uncertainty within the fuzzy sets approach. Stakeholders may provide reasonable ranges and could substitute for expert judgement by impact modellers, at least for selected variables.

In sections “Introduction” and “Design and methods”, we introduced the assumption that direct quantification of stakeholder-led narratives by impact modellers could add ‘assumptions on assumptions’, and fail to simply translate uncertainties, if impact modellers rely solely on their own ad hoc judgements (Mallampalli et al. 2016) or simply misinterpret stakeholders’ reasoning and opinions. Impact modellers are well experienced in addressing epistemic/aleatory uncertainty but address linguistic uncertainty less systematically (Regan et al. 2002). We tested this assumption by qualitatively comparing trends of stakeholders’ and expert stakeholders’ ranges across case studies. Instead of comparing trends across case studies, an alternative approach could have been to make a single assumption about interpreting all variable ranges in terms of PDFs, or a different assumption for each variable. However, after preliminary screening of the results, a direct comparison among similar variables proved impossible, given the different types of variables analysed and the limited number of participants and variables. Alternatively, a quantitative analysis could be useful to validate the use of stakeholder-led quantifications from a modelling output perspective. For example, Monte Carlo sampling from the PDFs generated for sensitive variables such as GDP, population and food imports (Kebede et al. 2015; Brown et al. 2014) could be performed to analyse, within the impact models, the propagation of both the stakeholder-led input uncertainty around the CoG and the uncertainty in PDFs generated by direct quantification of stakeholder-led narratives by impact modellers.
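A minimal sketch of this suggested Monte Carlo step is given below, assuming a hypothetical model wrapper `run_iap` (not part of the actual IAP code) and illustrative numbers throughout.

```python
import numpy as np
from scipy.stats import beta

rng = np.random.default_rng(0)
dist = beta(1.3, 1.5)              # illustrative near-uniform fit on the [0, 1] scale
lo, hi = 50.0, 120.0               # hypothetical stakeholder min and max (in %)
samples = lo + dist.rvs(size=1000, random_state=rng) * (hi - lo)

# outputs = [run_iap(food_imports_change=s) for s in samples]
# `run_iap` is a hypothetical wrapper around the IAP; the spread of `outputs`
# would show how stakeholder-led input uncertainty propagates to model results.
```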

We have also assumed that, even if probability-based quantification is highly appropriate given the inevitable approximations, there is no further information that can be introduced to define the form of the PDF. In contrast, mainstream alternatives generally structure both narrative and quantification assumptions using probabilistic methods based on Bayesian statistics, such as Bayesian networks (e.g. Henriksen and Barlebo 2008) and Bayesian reasoning (Kemp-Benedict 2013). We concluded that both methods would have been either incomplete or misleading in our analysis. Bayesian-based methods can structure uncertainties more transparently, with prior and posterior distributions that quantify changing assumptions as information is acquired. However, even such methods can be problematic if (1) little agreement exists on the source of the data, especially in data-scarce case studies such as Central Asia, and (2) participants have different expertise. Our qualitative analysis of the ‘confidence’ index also shows that stakeholders’ and scientists’ (expert stakeholders’) assumptions may be very different and difficult to predict. Even in ‘expert stakeholder’ participatory contexts, extra assumptions need to be minimised.

We suggest further consolidating these results with a quantitative uncertainty analysis from the IAP output perspective and with efforts to strengthen the input from a stakeholder workshop perspective. The stakeholders appreciated the usefulness of this exercise and generally agreed that stakeholders can help modellers in quantifying key scenario drivers. However, some stakeholders found the exercise difficult, and this may have resulted in the generation of ‘outliers’. Nonetheless, ‘outliers’ were included in the quantification of the CoG because we interpreted them as extreme values or ranges consciously provided by the stakeholders; we limited ourselves to excluding physically impossible values. The most important exception was the quantification of GDP for the Central Asia case study, where extreme growth trends resulted in two very extreme scenarios. When faced with a choice between their quantification and model-led trends, stakeholders chose the (very extreme) trends resulting from their quantification. In this case, a compromise had to be made by applying the trend provided by stakeholders at the upper limit of what the models could represent. As the confidence index analysis also suggests, however, there could be different but equally legitimate reasons for stakeholders to provide their trends. To reduce ‘outliers’ in future exercises, we suggest improving the participatory process underlying this quantification by better adapting questionnaires and processes to the stakeholders involved in the workshop, e.g. by including a formal stakeholder-mapping exercise prior to the workshops to address uncertainty about the representativeness of stakeholders.

Conclusions

The fuzzy set method has been recognised as a simple and transparent method (Alcamo 2008b; Kok et al. 2014; Houet et al. 2016) applicable from global to local case studies, despite room for further improvement from a stakeholder engagement perspective. This analysis, based on the assumption that stakeholder values are the best available (or at least better than a poorly defined combination of stakeholder and modeller inputs), is the first to show that stakeholders can provide reasonable ranges and that these can be used without adding further assumptions. Ongoing studies applying stakeholders’ quantifications with fuzzy sets within impact models (e.g. Li et al. 2017) will further explore this potential in different modelling environments, aiming at a universally accepted tool to produce quantifications from narratives. Even though a formal validation step may lead to changes in the method, the core steps described are transparent and can be reproduced by practitioners in the field in any workshop setting.

At an epistemological level, this analysis contributes to enhanced dialogue and understanding between modeller-led and local, stakeholder-led communities, and to the linkage of qualitative and quantitative approaches, by bridging the different uncertainty concepts (linguistic, epistemic and aleatory uncertainties) addressed by their research questions. We therefore did not simplify the relevant uncertainties but, by combining fuzzy logic and probabilities, created a common, systematic language between the two communities. We further hope that our research will draw more attention to the fundamental issue of different sources of uncertainty in participatory scenario development.