Introduction

A reliable probabilistic volcanic hazard assessment (PVHA) is the basic scientific component for planning long-term risk mitigation actions. Despite its importance, we argue that very few efforts have been made to verify the influence of the many natural uncertainties on the final assessment. Here, we explore this point, with particular emphasis on the effects of the variability in eruption size class (with this term, we mean any kind of parametrization of the magnitude, intensity, or type of eruption; see Marzocchi et al. 2010) and vent location upon PVHA. This variability represents an objective difficulty in achieving a reliable PVHA. Indeed, PVHA is usually based on single eruptive scenarios analysis (e.g. Barberi et al. 1995, 2008; DPC 1995), on comparison among single scenarios (e.g. Costa et al. 2009), and/or analysis of phenomena which have occurred in the past (e.g. Orsi et al. 2004). To date, most PVHA applications are based on the analysis of single scenarios selected as the most probable or the most representative. In this way, the aleatory variability associated with the uncertainty in terms of the size class and vent locations is completely ignored.

The single scenario approach is often assumed because of the difficulty in merging together, in a consistent way, the information from different “eruptive settings” (ESs), defined as the occurrence of an eruption of a specific size class from a vent in a specific location (see Marzocchi et al. 2010). This is a crucial issue since both central volcanoes and calderas may erupt with wide variability in terms of size of the eruption and location of the vent (e.g. Simkin and Siebert 1994). In short-term applications, some attempts to reduce such variability have been adopted (e.g. Lindsay et al. 2010). In long-term applications, as in the present study, it is practically unavoidable, given that nowadays, it is impossible to “predict” the size and vent location of the next eruption.

In this paper, we consider the case of long-term tephra fall hazard at Campi Flegrei. Campi Flegrei caldera (CFc) is a highly urbanized area that has experienced significant variability both in vent location and size of eruptions (e.g. Orsi et al. 2004). In practice, we analyze the effects of the inclusion of different ESs and other uncertainties on the hazard posed by tephra fall at CFc using the Bayesian Event Tree for Volcanic Hazard (BET_VH) code described in the companion paper (Marzocchi et al. 2010). In this way, we can also directly check the reliability of PVHA based on single scenarios.

Furthermore, in this paper, we propose and apply a scheme that formally merges information coming from different models and field data relative to a generic volcanic outcome (e.g. pyroclastic flow, lava flow, tephra fall, lahars etc.) into the BET_VH model. The BET_VH model (Marzocchi et al. 2010) is a formal Bayesian inference procedure to assess volcanic hazard, based on an event tree schema. While the use of models, theoretical beliefs, past data etc., relative to eruption forecasting, has been widely discussed in previous papers (e.g. Marzocchi et al. 2008; Sandri et al. 2009), the inference of the long-term hazard posed by a generic volcanic outcome has not yet been treated. In this paper, we present a scheme that integrates the output of models with different reliability and coverage in order to provide a single prior probability distribution associated with each selected volcanic outcome. In this scheme, the output of each model is included by weighting it according to the reliability of the model itself. For instance, simple models can be used to provide a full description of all possible ESs and relative initial-boundary conditions; this is usually possible because simple models are computationally very fast, and thus, many simulations can be performed. On the other hand, more sophisticated models, characterized by longer computational times, may be used to better define a single ES of particular interest, such as the most likely ES. In this way, the prior distribution may be more precise for some specific ESs, reducing the epistemic uncertainty as much as possible. At the same time, it may provide an unbiased assessment of any other realistic possibility. Eventually, such a prior distribution may be then merged with field data collected from past eruptions in order to provide a full picture of the hazard posed by the selected volcanic outcome (posterior distribution).

To summarize, here we apply the BET_VH model to long-term tephra hazard assessment at the CFc with two goals: (1) to explore the effects of the uncertainty in size class and vent locations on the final assessment and (2) to provide a tutorial example of how the outputs of tephra dispersion models can be embedded into BET_VH. We emphasize that this tutorial example can be generalized in a straightforward fashion to any other kind of phenomena, such as pyroclastic flows, lava flows, or lahars, and to any other volcano.

From simulation output to prior distribution parameters at nodes 7 and 8 in BET_VH

The model BET_VH (Marzocchi et al. 2010) enables us to achieve a full volcanic hazard assessment for each generic volcanic outcome (e.g. pyroclastic flow, lava flow, tephra fall, lahars etc.). The Bayesian inference scheme applied to an event tree is schematized in Fig. 1a (see Marzocchi et al. 2010).

Fig. 1
figure 1

Event tree of the model BET_VH (a) for a generic volcano and (b) for Campi Flegrei

In this section, we describe how to implement the prior distribution relative to nodes 7 and 8. In practice, at such nodes, BET_VH describes the diffusion process of a selected volcanic outcome, given that an eruption (nodes 1–2–3) in a specific vent (node 4) with a specific size class (node 5) has produced the outcome (node 6). More specifically, at node 7, the possibility that a given area is reached by the selected volcanic outcome is estimated; we call this possibility “conditional probability of reaching episodes”. At node 8, the possibility that, in the same area considered at node 7, the volcanic outcome overcomes a specific threshold is estimated; we call this possibility “conditional probability of overcoming episodes”.

In order to define the prior probabilities at nodes 7 and 8, we need mathematical models that describes the spatial dispersion of the selected outcome in the volcano surrounding areas. In a second step, these prior probabilities may be eventually combined with field data (see Marzocchi et al. 2010). In “Appendix”, we develop a method to convert the output of one or more theoretical models into the parameters of the prior distribution. Here, we report the final equations only; further details can be found in “Appendix”. To help readers, in Tables 1 and 2, we report a complete list of indexes and symbols used in this paper and in Marzocchi et al. (2010). Note that here and in Marzocchi et al. (2010), each index reported in Table 1 refers always to the same variable (e.g. i refers always to vent positions etc.).

Table 1 Indexes for the BET_VH parameters (see Marzocchi et al. 2010)
Table 2 Symbols for the BET_VH parameters (see Marzocchi et al. 2010)

The prior distribution of the volcanic outcome (i.e. tephra fall in this application) at a generic k-th area produced by an eruption occurred at a generic i-th vent location and the generic j-th size class is completely defined by \(\Theta_{7;i,j}^{(k)}\) and \(\Lambda_{7;i,j}^{(k)}\), for node 7, and \(\Theta_{8;i,j}^{(k)}\) and \(\Lambda_{8;i,j}^{(k)}\), for node 8. Such values represent the average probability and the equivalent number of data (related to the variance), for nodes 7 and 8, respectively (Marzocchi et al. 2010). Such parameters can be estimated as:

$$ \left \{ \begin{array}{lll} \Theta_{7;i,j}^{(k)} =& \dfrac{\overline\alpha_{7;i,j}^{(k)}}{\overline \alpha_{7;i,j}^{(k)} + \overline \beta_{7;i,j}^{(k)} } \left\{ \dfrac{\overline\alpha_{7;i,j}^{(k)} + \overline \beta_{7;i,j}^{(k)} }{ \overline \alpha_{7;i,j}^{(k)} +\overline \beta_{7;i,j}^{(k)} + \Lambda_m} \right\} \\[16pt] &+ \dfrac{\nu_{7;i,j}^{(k)} }{N_{r}} \left\{ \dfrac{\Lambda_m }{ \overline \alpha_{7;i,j}^{(k)} + \overline \beta_{7;i,j}^{(k)} + \Lambda_m} \right\} \\[16pt] \Lambda_{7;i,j}^{(k)} =& \overline \alpha_{7;i,j}^{(k)} + \overline \beta_{7;i,j}^{(k)} - 1 + \Lambda_m \end{array} \right. $$
(1)
$$\label{eq2} \left \{ \begin{array}{ll} \Theta_{8;i,j,k}^{(s)} =& \dfrac{\overline\alpha_{8;i,j,k}^{(s)} }{\overline \alpha_{8;i,j,k}^{(s)} + \overline \beta_{8;i,j,k}^{(s)} } \left\{ \dfrac{\overline\alpha_{8;i,j,k}^{(s)} + \overline \beta_{8;i,j,k}^{(s)} }{ \overline \alpha_{8;i,j,k}^{(s)} +\overline \beta_{8;i,j,k}^{(s)} + \Lambda_m} \right\}\\[16pt] & + \dfrac{\nu_{8;i,j,k}^{(s)} }{\nu_{7;i,j}^{(k)}} \left\{ \dfrac{\Lambda_m }{ \overline \alpha_{8;i,j,k}^{(s)} + \overline \beta_{8;i,j,k}^{(s)} + \Lambda_m} \right\} \\[16pt] \Lambda_{8;i,j,k}^{(s)} =& \overline \alpha_{8;i,j,k}^{(s)} + \overline \beta_{8;i,j,k}^{(s)} - 1 + \Lambda_m \end{array} \right. $$
(2)

The parameters contained into these equations (see Table 2) are of two types: the model-dependent parameters \(\left(N_{r},\nu_{7;i,j}^{(k)}, \nu_{8;i,j,k}^{(s)} \text{and} \Lambda_m \right)\) and the hyper-prior parameters \(\left(\overline \alpha_{7;i,j}^{(k)}, \overline \beta_{7;i,j}^{(k)}, \overline \alpha_{8;i,j,k}^{(s)} \text{and} \overline \beta_{8;i,j,k}^{(s)}\right)\).

Hyper-prior parameters

The hyper-prior parameters are relative to a sort of hyper-prior Beta distribution for the nodes 7 and 8 (see “Appendix”). The use of such a hyper-prior distribution has two main goals.

The first goal is to substitute the value of zero probability for the areas that have never hit by numerical simulations with a more realistic (low) probability. In this case, these values have to represent a first-order guess of the probability at nodes 7 and 8, and they can be defined using empirical observations in other volcanoes (see, for instance, Tables 2 and 3 in Newhall and Hoblitt 2002), whenever such generalization is assumed to hold. This point is particularly important when the model(s) does not cover all the possible ESs combinations. In the simplest case, we can set \(\overline \alpha_{7;i,j}^{(k)} = \overline \beta_{7;i,j}^{(k)} = \overline \alpha_{8;i,j,k}^{(s)} = \overline \beta_{8;i,j,k}^{(s)} =1\), reproducing the least informative starting hypothesis (see discussion in Marzocchi et al. 2004, 2008, 2010).

The second and maybe most important goal of the hyper-prior parameters is that they allow different models, with different coverage, resolution and physical details, to be applied recursively. In practice, the parameters of the Beta distribution obtained from one model can be set as new hyper-prior parameters for the second model and so on (see “Appendix” for further details).

Model-dependent parameters

The model-dependent parameters are N r , \(\nu_{7;i,j}^{(k)}\), \(\nu_{8;i,j,k}^{(s)}\) and Λ m . The parameter N r is the number of runs of a model. The runs are produced to account for the statistical variability of the model input, given an eruption occurred in a specific vent location (node 4) and size class (node 5). For example, for tephra dispersion models, the runs account for the statistical variability of the winds distribution and eruptive conditions.

The parameter \(\nu_{7;i,j}^{(k)}\) counts the number of times, out of the N r runs of the model, for which the k-th area is reached by the volcanic outcome, given an eruption of size class j occurred at the vent location i; note that \(\nu_{7;i,j}^{(k)}\) is the results of a counting over the model’s results, and it may be equal to 0; thus, Eq. 2 may result undefined. In this case, the values of \(\Theta_{8;i,j,k}^{(s)}\) and \(\Lambda_{8;i,j,k}^{(s)} \) rely completely to the hyper-prior parameters (see “Appendix” for further details). In analogy, \(\nu_{8;i,j,k}^{(s)}\) counts the number of times, out of the \(\nu_{7;i,j}^{(k)}\), for which the threshold s of the volcanic outcome is exceeded (e.g. in terms of loading, dynamic pressure etc.) in the k-th area, given an eruption of size class occurred at the vent location i.

The only model-dependent parameter that is not set by the runs is Λ m , i.e. the equivalent number of data to be assigned to the model. Λ m must be set depending on the (subjective) reliability that the researcher gives to the modeling procedure adopted, i.e. not only to the model itself but also the underlying assumptions and the capability of the N r realizations to successfully describe the whole physical system (e.g. for tephra fall, wind variability, emission rates variability during the eruption, DEM models etc.). The choice of Λ m is subjective, but it is unavoidable. In practice, Λ m controls the decrease of the variance of the Beta distribution, i.e. the uncertainty on the estimation, due to the model results (see “Appendix” and Marzocchi et al. (2010) for further details).

Application to tephra fallout at the Campi Flegrei caldera

In this application, we focus on the tephra fallout hazard assessment for CFc. In particular, we analyze the effects of combining the output of different ESs compared to the same hazard calculated for just one single ES, where the uncertainty in the vent location and in the size class of the eruption are neglected. The event tree used for this application is reported in Fig. 1b. In the following, we describe the setting for each node and the tephra fallout model used, providing a complete tutorial for BET_VH applications (Marzocchi et al. 2010) and a description of the scheme to integrate output of different models (previous paragraph and “Appendix”). The results of the application to the CFc will be discussed in the sections below.

BET_VH: nodes 1 to 6

All the hazard maps shown in the followings will be conditioned to the occurrence of an eruption. For this reason, we do not focus our attention on the probability estimation at nodes 1–2–3 (probability of eruption in a time window τ). We remark that such an estimation would translate all the conditioned probability maps shown in this paper into absolute probability maps (see Marzocchi et al. 2010).

For node 4, we consider the probability map of vent opening proposed by Selva et al. (2007); this map accounts for geostructural knowledge of the caldera as well as past data observed for the last 5 kyr. The CFc is divided into 700 possible square vent locations (I 4 = 700) with size of 500 m. The probability map of best guess values is reported in Fig. 2a.

Fig. 2
figure 2

Average (best guess) conditional probability of (a) vent opening, i.e. node 4, from Selva et al. (2007), and (b) size class of eruption, i.e. node 5, from Orsi et al. (2009)

For node 5, according to the results of the recent Italian project V3-2 Campi Flegrei (ended in July 2007, DPC/INGV PROJECT V3), we classify CFc eruptions in four representative size classes (J 4 = 4), regardless of the vent location: effusive eruptions, small, medium and large explosive eruptions. Such grouping is based on the analysis of several parameters (in particular volume and magnitude that, for CFc, show a good agreement); an exhaustive description of such size classes can be found in Orsi et al. (2009) and references therein. Basically, the prior distribution of the size class probability is set through a power-law distribution for explosive classes (classes 2 to 4), observed in the size distribution of worldwide eruptions, plus a 0.05 probability for effusive eruptions (size class 1); this prior distribution is then combined with past data of the last 5 kyr. All of this information contributes to form the posterior distribution for node 5, giving best guess probabilities as reported in Fig. 2b.

For node 6, we assume that all eruptions of size classes 2 or larger certainly produce tephra; eruptions of size class 1 do not produce tephra (Simkin and Siebert 1994; Newhall and Hoblitt 2002). This choice is certainly a simplification and it implies the assumption that the effect of tephra dispersal from eruption of size class 1 is negligible respect to the ones from the other classes. This is justified by the fact that past effusive eruptions generated only small-volume lava domes and lava flows, while explosive eruptions generated particles fallout (e.g. Orsi et al. 2009 and references therein). In terms of BET_VH symbols, we have Θ6;1 = 0, Θ6;2 = Θ6;3 = Θ6;4 = 1. These values of Θ (0 or 1, states of certainty) imply that in the Beta distributions, either α or β will be equal to 0. In this case, the equivalent number of data (Λ6;j ) is not relevant, and the Beta distribution will be a Dirac’s delta centered in 0 and 1, respectively, i.e. prior probabilities are expressed in terms of certainties and thus the probability distributions will have zero variance (see, e.g. Marzocchi et al. 2004; Gelman et al. 1995).

Tephra fallout model

Dispersion and sedimentation of volcanic particles released from a sustained eruption column in the medium and distal areas can be fairly well determined from wind transport, turbulent diffusion and settling by gravity. For relatively small computational domains (i.e. <100 km) and short durations (i.e. < few days), wind field can be assumed to be constant and horizontally uniform. Moreover, vertical diffusion coefficient and vertical wind component can be reasonably neglected. Under these assumptions, the mass conservation equation for each class of particles with a given settling velocity can be solved by using a semi-analytical solution as described in Macedonio et al. (2005) and Pfeiffer et al. (2005).

Here, the goal is to model all possible kinds of fallout deposit produced by sustained columns at CFc, i.e. to cover all the possible combinations of vent position and size class (ESs). As a first approximation, we choose to neglect orography effects. This is a reasonable assumption since in the case of CFc, topographic differences are much smaller than the eruption columns we consider in this study. Therefore, we model the tephra dispersion for all the three size classes producing tephra, but using only one single vent location; then, we translate the virtual vent to each of all the possible 700 vent locations.

In order to set the emission rate and volume of the simulated eruptions, we select single values to each one of the size classes, simulating one reference eruption size for each class. We select the Agnano Monte Spina, the Astroni 6 and the Averno 2 eruptions as representative of the high-, medium- and low-magnitude eruptions, following Orsi et al. (2004), Costa et al. (2009) and Orsi et al. (2009). The input parameters of these three reference eruptions can be found in Table 3. In order to account for possible wind distribution variations, for each size class, we compute the solution relative to 13,149 different wind profiles for the Campanian region, i.e. 36 years of daily wind profiles, obtained from National Oceanic and Atmospheric Administration (NOAA) reanalysis (Costa et al. 2009).

Table 3 Input parameters used for the different size classes, from Table 1 in Costa et al. (2009)

The approximation of using a fixed emission rate for each size class is comparable to deterministic approaches (e.g. Barberi et al. 1990; Macedonio et al. 2008). It may be argued that this approximation may introduce a bias into the final hazard assessment because an eruption of a generic size class is usually characterized by a range of emission rates not just by a single value (Cioni et al. 2003). Nonetheless, the variability of the emission rate among different size classes is much larger than the variability of the emission rate for one specific size class. Therefore, a hazard assessment that accounts for the variability of different ESs (as we will show later) may be only slightly affected by the inclusion of variability of emission rates within each size class.

Finally, it is worth noting that the assumption of a constant wind (we consider only daily wind variations) may be not general for small scale eruptions (sometimes characterized by a prolonged activity over periods of days to weeks, during which low-level winds can be largely variable). However, in the case of the eruption chosen as reference for the low-magnitude eruption, i.e. Averno 2, the proximal deposit shows a uniaxial distribution (Costa et al. 2009) that justifies this assumption.

BET_VH: nodes 7 and 8

To set the prior probabilities at nodes 7 and 8, we follow the scheme presented above and in “Appendix”. First, we define the potentially impacted region as a rectangle of 80×100 km. Then, we divide it into 2,613 areas with size increasing with the distance from the caldera. In each of these areas, the probability is assumed to be homogeneous. The distance between the centers of adjacent areas ranges from a minimum of 1 km (within the caldera) to a maximum of 4 km (at the limit of the rectangular region); Fig. 3 shows a map representing the central points of each cell. As regards the threshold for tephra loading at node 8, we select s = 300 kg m − 2 according to Costa et al. (2009) and references therein.

Fig. 3
figure 3

Grid point for the probability maps at nodes 7 and 8. The step of the grid becomes larger moving away from the possible sources of eruptions; see text for more details. Adapted from Google Earth (2010 Europa Technologies; 2010 Tele Atlas)

To limit the CPU time for the evaluation of the BET_VH parameters, we randomly selected 1,000 runs of the model (N r  = 1,000) among the 13,149 different wind profiles. This choice does not affect the results since this will only limit (approximately to 0.01) the precision of the estimate. In principle, we can even select only runs relative to a given season for which the wind variability is smaller, obtaining the probability for an event occurring in spring, summer, autumn and winter, respectively.

Finally, we set the equivalent number of data of the model Λ m  = 10 and \(\overline \alpha_{7;i,j}^{(k)}= \overline \beta_{7;i,j}^{(k)}= \overline \alpha_{8;i,j,k}^{(s)} = \overline \beta_{8;i,j,k}^{(s)} = 1\). The choice of taking a uniform distribution for the hyper-prior is not critical since the fallout model results are assigned a much larger weight (Λ m  = 10), with the model covering all the considered ESs, i.e. all size classes and vent locations.

With the choices described above, we have the following input parameters for node 7:

$$ \Theta_{7;i,j}^{(k)}= \dfrac{1}{2} \left\{ \dfrac{2}{12} \right\} + \dfrac{\nu_{7;i,j}^{(k)}}{\text{1,000}} \left\{ \dfrac{10}{12} \right\} $$
(3)
$$ \Lambda_{7;i,j}^{(k)} = 11 $$
(4)

where i = 1,2,...,700 indicates the vent location, j = 2,3,4 indicates the size class, k refers to the k-th area and \(\nu_{7;i,j}^{(k)}\) is the number of times that tephra reaches the k-th area in the 1,000 random realizations of the model (see Table 2).

Similarly, at node 8, we have:

$$ \label{eqap8a} \Theta_{8;i,j,k}^{(s)} = \left \{ \begin{array}{ll} \dfrac{1}{2} \left\{ \dfrac{2}{12} \right\} + \dfrac{\nu_{8;i,j,k}^{(s)} }{\nu_{7;i,j}^{(k)}} \left\{ \dfrac{10}{12} \right\} & \textrm{if } \nu_{7;i,j}^{(k)}> 0 \\ \dfrac{1}{2} & \textrm{if } \nu_{7;i,j}^{(k)}= 0 \end{array} \right. $$
(5)
$$ \label{eqap8b} \Lambda_{8;i,j,k}^{(s)} = \left \{ \begin{array}{ll} 11 & \textrm{if } \nu_{7;i,j}^{(k)}> 0 \\ 1 & \textrm{if } \nu_{7;i,j}^{(k)}= 0 \end{array} \right. $$
(6)

where \(\nu_{8;i,j,k}^{(s)}\) is the number of times that the selected threshold of 300 kg m − 2 is overcome in the k-th area in the 1,000 random realizations of the model (see Table 2).

Results

To analyze the effects of the combined ESs approach, we present the results relative to a selection of single and combined ESs. As single ESs, we select the following settings:

  1. 1.

    S1: probability of an eruption similar to the Averno 2 event. For this ES, we select vent location 480 at node 4 and size class 2 at node 5.

  2. 2.

    S2: probability of an eruption similar to the Astroni 6 (prototype of medium explosive eruptions at CFc). For this ES, we select vent location 521 at node 4 and size class 3 at node 5.

According to the results reported by Orsi et al. (2009), S1 reflects the most likely size class of eruptions (small explosive), while S2 represents the medium explosive eruption class, often adopted for civil protection application (e.g. for Vesuvio, Barberi et al. 1995; 2008; DPC 1995).

We also consider the following ESs combinations:

  1. 1.

    COMBO1: probability of an eruption of any size class in the most probable vent location, i.e. within the Agnano crater (see Fig. 2). For this case, we select vent location 520 at node 4 and all size classes at node 5.

  2. 2.

    COMBO2: probabilities of an eruption of any size class from all possible vent locations. For this case, we select all vent locations at node 4 and all size classes at node 5; each vent and size class is weighted by its probability of occurrence.

In summary, both COMBO1 and COMBO2 are a weighted average of ESs, where each ES is weighted by its own probability of occurrence (see Marzocchi et al. 2010, Eqs. 20 and 21). Both COMBO1 and COMBO2 account for all combinations of size classes. Only COMBO2 also includes the variability of all vent locations.

The results for S1, S2, COMBO1 and COMBO2 cases are reported in Figs. 4, 5, 6 and 7, respectively; we plot the conditional best guess (average) probability maps for node 7 (upper panel, probability that tephra will reach a specific area) and for node 8 (lower panel, probability that tephra will overcome a loading of 300 kg m − 2 in each area). Referring to the symbols in Marzocchi et al. (2010, Eqs. 20 and 21), we plot the average of [φ c ] and [φ d ] for areas k = 1,...,2613. Note that the maps relative to the node 8 are zoomed. All these probabilities are conditional to the occurrence of an eruption belonging to the selected single or combined ESs. The absolute probability [Φ] in the time window τ is obtained multiplying these probabilities by the conditional probability of the selected single or combined ESs ([φ ES ]) and the probability to have an eruption in the time frame τ ([θ 1 − 2 − 3], independent from the ES), that is

$$[\Phi] = \left \{ \begin{array}{ll} \left[ \theta_{1-2-3} \right][ \left[ \phi_{ES}\right] \left[ \phi_{c} \right] & \textrm{for node 7} \\[2pt] \left[ \theta_{1-2-3} \right][ \left[ \phi_{ES}\right] \left[ \phi_{d} \right] & \textrm{for node 8} \end{array} \right. \label{phi} $$
(7)

Notably, the conditional probability of occurrence of single or combined ESs, i.e. [φ ES ], strongly varies across the possible ESs. We may interpret such probabilities as the relative weight of each output probability map (at nodes 7 and 8, as in Figs. 47). In other words, if [φ ES ] is relatively small, there is a relatively high probability that next eruption will occur with a different ES, i.e. the map will not be representative. On the contrary, if the probability of occurrence [φ ES ] is close to 1, it means that the output map is based on almost the totality of the possible eruptive settings (in terms of size class and vent location), i.e. the characteristics of the next eruption will be represented well by the probability values shown in the map. Therefore, [φ ES ] symbolizes the degree of representativeness of the map with respect to all possible ESs. As regards the selected ESs, S1, S2, COMBO1 and COMBO2, the average of [φ ES ] is 2.24 ×10 − 3, 1.19 ×10 − 3, 4.79 ×10 − 3 and 1.0, respectively. Note that the representativeness of COMBO1 is several times greater than the single ESs S1 and S2. On the other hand, COMBO2 has a representativeness three orders of magnitude greater than all the other cases (S1, S2 and COMBO1).

Fig. 4
figure 4

Probability maps for node 7 (a) and node 8 (b) for S1 (Averno 2). Further details can be found in the text. Adapted from Google Earth (2010 Europa Technologies; 2010 Tele Atlas)

Fig. 5
figure 5

Probability maps for node 7 (a) and node 8 (b) for S2 (Astroni 6). Further details can be found in the text. Adapted from Google Earth (2010 Europa Technologies; 2010 Tele Atlas)

Fig. 6
figure 6

Probability maps for node 7 (a) and node 8 (b) for COMBO1 (most likely vent and all size classes). Further details can be found in the text. Adapted from Google Earth (2010 Europa Technologies; 2010 Tele Atlas)

Fig. 7
figure 7

Probability maps for node 7 (a) and node 8 (b) for COMBO2 (all vents and all size classes). Further details can be found in the text. Adapted from Google Earth (2010 Europa Technologies; 2010 Tele Atlas)

The results for S1 and S2 were discussed by Costa et al. (2009). Here, we highlight the comparison between results for single ESs (S1 and S2) and those obtained considering a combination of ESs (COMBO1 and COMBO2). As it appears clear from the figures, the maps relative to COMBO1 and COMBO2 plotted in Figs. 6 and 7 are significantly different from those obtained for single ESs (Figs. 4 and 5). Figure 6, relative to COMBO1, shows the effect of the uncertainty on the eruption size. Even though the most likely size class is the smallest explosive one, the probability map for COMBO1 (at both nodes 7 and 8) is more spread than for S1 (Fig. 4). This is due to the fact that COMBO1 considers events of greater size classes; therefore, more distant areas can be reached by tephra. Notably, the results for COMBO1, in terms of most dangerous areas, are between the ones obtained for S1 and S2. This may have an important impact in PVHA for central volcanoes, where the uncertainty on vent location is relatively less important; in fact, the extension of the most likely damaged areas (node 8) in COMBO1 is smaller than the extension of the most likely damaged areas for S2, where only and event of size class 3 is modeled. In other words, if the latter is taken as reference event, we show that the hazard map derived by this event may represent an overestimation of the real hazard.

In Fig. 7, the effect of the inclusion of vent location uncertainty on the probability estimation for nodes 7 and 8 is shown. At CFc, as well as in all large calderas with many possible eruptive centers, such spatial uncertainty is one of the greatest sources of uncertainty. In fact, the changes in the probability maps are striking. In particular, the spatial uncertainty strongly increases the spreading of the probability distribution since all areas inside the caldera find themselves close to one possible vent. A second aspect to be considered is that the maximum value of the conditional probability is lower than the maximum values in all previous cases (note the color scales in the figures). This is clearly an effect of the uncertainty in the vent location since the probability of high tephra loading close to a vent is high, but the position of the vent is not certain as in the other cases (S1, S2 and COMBO1). That said, it is noticeable that the peak value of the absolute probability for COMBO2 is the largest since the relative [φ ES ] for COMBO2 is the highest compared to S1, S2 and COMBO1 (see Eq. 7).

A third interesting feature to be noted is that the central-eastern part of the caldera has a non-negligible probability both at nodes 7 and 8; this is a stable and common feature that only partially depends on the vent opening probability distribution used (node 4; Fig. 1a). In fact, this feature mostly depends on the fact that this area is downwind with respect to the vast majority of vents, thus a stacking effect is present. The global result of the spatial uncertainty is that the probability of reaching and overcoming episodes has an almost uniform and relatively high (about 0.6 and 0.2, respectively) value within the whole caldera, in particular in its central-eastern part.

Discussion and final remarks

The main goal of this paper is twofold: to explore the effects of including intrinsic uncertainties onto PVHA and to describe a realistic tutorial example for the use of the BET_VH code (described in the companion paper, Marzocchi et al. 2010) that can be easily extrapolated to many other applications. These goals are achieved through the application of the BET_VH model to the long-term tephra fall hazard for CFc.

The analysis reported here clearly shows that the inclusion of some uncertainties usually neglected in PVHA has a major impact on the final assessment. In particular, the use of combined ESs (COMBO 1 and COMBO 2) give results significantly different from single ESs (S1 and S2). The combination of all ESs generally gives smaller values of conditional probability and more spread distributions over the region around the volcano. As practical consequence of this, the most likely impacted area for the next eruption obtained combining different ESs is much larger than the one estimated by using a single ES, leading to a greater uncertainty about the most dangerous areas. This result suggests that the common practice in PVHA to use single “reference” events, such as the “maximum expected event”, the “most likely event”, or the “worse expected event”, may lead to significant biases, in particular for calderas. Single ESs are not able to capture the wide variability of the hazard, and the combination of all possible ESs becomes essential to achieve an unbiased PVHA. Specifically, in this application, we note that the uncertainty in vent location plays a major role, and it must be included to get a reliable PVHA. This is a basic point because a reliable PVHA must be accurate (i.e. absence of significant biases) because a biased estimation would be useless in practice; on the other hand, it may have a low precision (i.e. a large uncertainty) that would reflect our scarce knowledge of some physical processes involved (Marzocchi et al. 2004, 2008).

Noteworthy, we do not attempt to obtain the “best” PVHA for CFc because this would require the inclusion of more models and field data. Anyway, we argue that this attempt, including different ESs, may represent a drastic reduction of bias into the long-term tephra fall hazard assessment for CFc.

Finally, with this application, we have shown how BET_VH model may manage the output of different models, even models that describe just a single scenario, producing a prior probability distribution that averages them. This strategy has several similarities to the ensemble forecasting used in climate and weather forecasting (e.g. Krishnamurti et al. 2000), and it may be potentially used for any kind of volcanoes.