Introduction

The development of probabilistic eruption forecasting (Marzocchi and Bebbington 2012) has concentrated largely on forecasting the next initial onset time using the previous reposes and/or monitoring data, supplemented with analogues where necessary (Jenkins et al. 2012; Whelley et al. 2015). Models for long-term forecasts (Wickman 1966; Klein 1982; Bebbington and Lai 1996; Green et al. 2013) are based on past eruption records (Decker 1986) and appear to have been reasonably accurate (Bebbington 2013). Short-term forecasts instead use precursory seismicity (Voight 1988; Kilburn and Voight 1998; Bell et al. 2011; Boue et al. 2016), or multiple monitoring data (Marzocchi et al. 2008; Sobradelo et al. 2014). A more recent innovation has been the attempt to forecast the size of the future eruption (Burt et al. 1994; Marzocchi and Zaccarelli 2006; Marzocchi et al. 2010; Bebbington 2014). Much less attention has been focussed on forecasting how activity evolves once the eruption begins, although new multiparameter investigations of volcano-tectonic seismic signatures exhibit some promise (McCausland et al. 2019). However, volcanic eruptions are complex, cascading events with durations up to years in length, which are not easily reducible to an instant in time. This can lead to problems when using statistical models from the family of point processes (Bebbington 2008; Sheldrake et al. 2016). Once the eruption has begun, much emergency management focus is placed on when or if a major phase of activity will begin and when the end of the eruption will be. Hence, forecasting intra-eruptive activity is critically important. The question of what can be forecast in such circumstances, and how, has not previously been addressed.

While it is technically possible to express the progression of an eruption as an event tree (Newhall and Hoblitt 2002; Wright et al. 2019), facilitating the incorporation of monitoring data (Marzocchi et al. 2008), the number of branches grows rapidly with the number of phases. It is more efficient to use a Markov chain, with all the associated mathematical power, to model the indefinite number of possible phases. An introduction to Markov chains is provided by Guttorp (1995); semi-Markov models are described by Limnios and Oprisan (2001).

An exploratory analysis, which did not incorporate a forecastable model, of the multi-phase makeup of large explosive (VEI ≥ 4) eruptions was performed by Jenkins et al. (2007), using unpublished data held by the Smithsonian Institution’s Global Volcanism Program (GVP). A phase (or ‘stage’ in the terminology of Jenkins et al. 2007) reflects a discrete time period that is characterised by one predominant style of activity. The authors analysed 604 phases from 125 events and found that more than half of their events were multi-phase, the latter exhibiting a median of six stages, including quiescence. The study found that multi-phase eruptions were more likely to be characterised by declining explosivity and increasing effusivity throughout an event, with effusive phases typically of longer duration and interspersed with fewer quiescent phases than the explosive activity. This paper builds on the exploratory work of Jenkins et al. (2007), by constructing a model for forecasting the phase progression of an eruption in progress, using a much larger database coded in a substantially different manner.

The rest of the paper is organized as follows. We first describe the data, and how it is classified and coded for the model. The section following outlines the stochastic model used, basically a quiescence/duration augmented semi-Markov chain with time-varying transition probabilities. Results then include an example validating the model against expert opinion elicited during the 2012 Te Maari and 1991 Pinatubo eruption sequences. The paper concludes with a discussion of possible uses, improvements (particularly the incorporation of monitoring data and better use of analogue volcanoes) and limitations.

Data

The unpublished dataset on intra-eruptive activity made available by the GVP to Jenkins et al. (2007) has been greatly expanded in the years since their study. The new data, still unpublished, generously made available to us by Ed Venzke (Smithsonian Institution) and Sarah Ogburn (USGS), comprised a total of 781 multi-phase eruptions from AD79 to 2015. The dataset included information on the eruption VEI, start and end date (where available) and, critically, the eruption description. While these eruptions had already been decomposed into phases, these did not always reflect the eruption description, and so we recoded the entire dataset into discrete styles of activity in a consistent manner using the eruption descriptions provided by the GVP record. Where we could identify a more precise record from published sources, including the GVP bulletins, we used these sources to supplement the GVP eruption description in more accurately assigning the phase states and durations.

Coding states of activity

In order to permit our intended analysis, which is based on Markov chain theory (Guttorp 1995), each phase needs to be assigned a discrete ‘state’ describing the activity during that phase. The states are described in Table 1. Definition of the states was driven by (1) the descriptive terms in the data, (2) the need for states to be discrete and (3) their use in forecasting the potential hazard. Given the large number of possible transitions, and the need to have adequate data for each transition, nine states were considered enough to be descriptive while remaining manageable. In addition, there are two nominal states: 0 = Start; 10 = End.

Table 1 State definitions

A new phase is deemed to start when the activity style changes. This required some recoding of the GVP data, inserting additional phases. Our definition is purely temporal, with phases not overlapping in time, and so separate phases in the GVP data referring to different locations overlapping in time were rearranged to overlap in location rather than time. The definition of a phase in the GVP seems to have been variable, depending in part on the source, and in some cases we collapsed multiple phases into one. In addition, a number of date typos were corrected. Several eruptions included quiescent phases longer than three months. The GVP arbitrarily treats two eruptions separated by more than 3-month surface quiet as distinct eruptions (Siebert et al. 2010). Conversely, if work shows two phases with an intervening quiescence longer than 3 months to be non-distinct, they are considered part of the same eruption. Thus, quiescent phases longer than 3 months exist in our dataset and we have not separated such events out into two consecutive eruptions.

In general, explosions were coded as continuous (State 3) if noted as ‘continuous’ or ‘Strombolian’, and as intermittent (State 4) if described as such or as ‘Vulcanian’, or if a date range was given. Where eruptions on individual days were noted, they were coded as minor eruptions (State 5). Effusive and explosive activity (State 2; State 1 being effusive only) is assigned where the description specifically indicates it as such. States 5–7 are for discrete events with c. < 2-day duration, as opposed to State 4, which is generally assigned to longer duration events where specific dates are not given. The remaining phase states (8 and 9) are as described in Table 1.

Time is measured in days, start and end times being given in whole days only. A ‘quiescent’ phase (State 9) was inserted between any eruptive phases separated by more than one day. To account for partial days, especially when the start and end dates were the same, 0.5 day was added to the duration of all non-quiescent states for numerical calculation purposes. In many cases, partial details of end dates (and, less usually, start dates) were missing. These were assigned randomly within the implicit limits of the data and the other phases in the eruption. For example, an end date described as ‘June’ would have the day chosen randomly from a discrete uniform distribution on {1, 2, … , 30}, unless a subsequent phase started later in the month. Phases that were described as occurring without a break were coded as such, allowing appropriately for randomness in the case where details of both the end and subsequent start dates were missing. In other cases, there may or may not have been a quiescent phase between the end of one eruptive phase and the beginning of the next. For these entries, we allocated end and/or start dates randomly, preserving the order of phases and allowing for quiescent phases or not, depending upon the simulated dates. Thus, any run of the model may produce slightly different numbers of phases and durations based upon the date assignations, but the effect on model probabilities is negligible.

As an example of how eruptions can be coded, consider the example of Sheveluch 1991. This is a VEI 4 eruption, with the description (GVP Bulletin 2011):

Ash eruptions were recorded at Sheveluch on August 15, October 27, November 1 and November 24. On December 27, a possible gas-and-ash plume was reported. Intermittent gas-and-ash explosions continued into 2002. A short-lived explosive eruption was observed at 11:35 on 29 August sending an ash-rich plume to an estimated altitude of 10 km. The ash cloud drifted SE, and was recorded by geostationary weather-satellite imagery moving E across the Bering Sea. Following a strong tremor on May 7, 2002, a new lava dome was first seen on May 12 between the 1980 dome and the NW crater wall. A major explosion on May 22 destroyed the new dome and the west part of the old dome and produced a 20-km-high eruption column. Dome growth, gas-and-ash emissions and occasional pyroclastic flows were recorded through 2005. Dome growth and occasional thermal anomalies were reported in 2006, although no ash eruptions were reported until December 4. Dome growth, intermittent ash eruptions and occasional pyroclastic flows continued into 2008. On March 31, 2007, a mudflow covered an approximately 900-m-long section of road, in an area 20 km from Sheveluch. A large explosive eruption on October 28, 2010 produced an ash plume to 12-km altitude and pyroclastic flows that travelled 15 km; half of the lava dome was destroyed. Dome growth and intermittent explosive activity continued into 2011.

From this, we extract the state sequence in Table 2.

Table 2 State sequence of Sheveluch 1991, coded from the GVP eruption bulletins. Eff = Effusive, Exp = Explosion(s), Cts = Continuous, Int = Intermittent, E = Eruption

From the initial eruption dataset, we omitted eruptions with insufficient description to enable coding. This included purely submarine eruptions, and most eruptions with VEI 0. The final dataset contains 698 eruptions from 187 volcanoes. There are 2785 eruptive phases (including only eight deformation phases), and between 4302 and 4312 total phases depending on the random occurrences of intervening quiescence.

Figure 1 shows the number of phases (both total and eruptive) per event, broken down into three VEI categories. The only clear difference between large (VEI 4+) and medium (VEI 2–3) eruptions is the tendency of the smaller events to have more quiescent phases interspersing each eruptive phase, as shown by the uneven distribution of phases in the upper plot, and the much reduced number of non-quiescent phases relative to total phases. VEI 0-1 events tend to have fewer phases, but actually include the event with the greatest number of phases (Kilauea 1983) which involved 50 eruptive phases.

Fig. 1
figure 1

Number of phases per eruption. Left column all phases, right column eruptive phases only. Top row VEI 4+, middle row VEI 2–3, bottom row VEI 0-1

The durations of the individual phase types are shown in Fig. 2. The explosive phases (including [Effusive and Explosive]) appear to have identifiable modal durations of tens of days, whereas the effusive state durations have a mode of 1 or 2 days . It may be interesting to speculate whether these short duration effusive phases are prominent because this study is limited to multi-phase eruptions, excluding long single–phase effusive events which might make up a large proportion of effusive eruptions. Aside from the short duration events, the effusive duration distribution is similar to effusive and explosive phases, with modal durations typically in the order of tens of days. The quiescence durations clearly show some sort of incomplete truncation at around 100 days, which partially reflects the GVP definition of ‘up to 90-day surface quiet’.

Fig. 2
figure 2

Durations of different phases

The transitions between different phase types are shown in Table 3. Our database is dominated by intermittent and minor explosions, but includes 143 major eruption phases and 28 Plinian phases.

Table 3 Summary of the numbers of observed phase transitions

Markov chain analysis

The central idea behind using a Markov chain instead of a tree structure (Newhall and Hoblitt 2002) within which the eruption can only traverse in one direction is that the analysis allows for repeated visits to the same state. A Markov chain thus requires fewer transition probabilities to be estimated. Markov chains have been widely used in volcanic hazard analysis, to model shifts in location (Cronin et al. 2001; Eliasson et al. 2006), the quiescence-unrest-eruption cycle (Wickman 1976; Carta et al. 1981; Bebbington 2007), the unrest-eruption sequence (Aspinall et al. 2006) and volcanic regimes (Bebbington 2007). A Markov chain is characterised by having a ‘state’ X(k) ∈ {0, 1, … , m}, where k = 1, 2, … indexes time, governed by ‘transition probabilities’ \(p_{ij} = \Pr \left (X(k+1) = j | X(k) = i \right )\) which are independent of time (k), and of the previous states: X(1),X(2), … , X(k − 1). This independence is referred to as the ‘Markov property’.

From the database, we can count the number of transitions, nij from state i to state j (Table 3). If the number of departures from i, \(n_{i\cdot } = {\sum }_{j} n_{ij}\), then the maximum likelihood estimate of the transition probability from state i to state j is

$$ \hat{p}_{ij} = n_{ij} / n_{i\cdot} $$
(1)

(Guttorp 1995). For example, from Table 3, nEff⋅ = 476 (the sum of the second row in Table 3) and hence the estimated probability of transitioning from an effusive to quiescent state is nEff,Qui = 318/476 = 0.67; that is, two thirds of effusive phases are followed by quiescence. These estimated probabilities make up the transition matrix \(P = \left (\hat {p}_{ij}\right )\). Because we have an absorbing state (eruption end), we can decompose this as

$$ P = \left( \begin{array}{cc} Q & R \\ 0 & 1 \end{array} \right) $$
(2)

where Q is the transition matrix among the transient states (including eruption start), R is a vector giving the probability that a given state is the last before the end, and 0 is a vector of zeros (it is not possible to transition out of the [End] state).

Several interesting estimates are immediately available. The expected number of future visits to state j given that one is in state i is N = (IQ)− 1, where I is the identity matrix. The expected number of phases before absorption given that we are in state i is then E = N1, where 1 is a vector of ones. Finally, the probability of a future visit to state j given that one is in state i is W = (NI)(NI)− 1, where ⊗ denotes the Hadamard product (element by element multiplication).

Including durations

If we include quiescence as a state, then the Markov property (probability of next state depends only on current state) means that the majority of transitions into an eruptive state will be from quiescence, in effect making the next eruptive state completely random (or at least independent of all previous eruptive states). The obvious way to incorporate information on the previous eruption phase is to use a second-order Markov chain, which treats every two-state pair as a separate state. If the current state is (i,j), then the next state must be (j,k). This would have 9 × 8 = 72 states (the same phase cannot occur twice). However, as consecutive states overlap, only 36 of the 72 states can be reached in one transition. This number of states would require estimating 72 × 36 = 2592 transition probabilities, which is prohibitive for the data available, so we need to be more creative. The way we have chosen to deal with this is not to have quiescence as a state, instead the quiescence length becomes a ‘mark’ on the eruptive state, augmenting it as was done by Bebbington (2007). In fact, there will be two such marks, the quiescence length before the eruptive state (‘pre-quiescence’, denoted by Q0) and the quiescence length following the eruption state (‘post-quiescence’, denoted by Q1).

Our data give us information of the form \(\Pr (D_{i} = d | i \rightarrow j)\) where Di is the duration in state i, and \(i \rightarrow j\) indicates that the subsequent transition is from state i to state j. If quiescence is not treated as a separate state, we can augment this by including the quiescence before or after state i. We thus have (in obvious notation) \(\Pr (D_{i} = d, Q_{0} = q | i \rightarrow j)\) if we include quiescence before state i, or \(\Pr (D_{i} = d, Q_{1} = q | i \rightarrow j)\) if we use the quiescence between states i and j.

Although the phase duration and quiescence times are recorded in discrete days, we will model them as continuous random variables. It appears that the log-normal distribution (Bebbington and Lai 1996) is suitable for the purpose, as the phase durations in Fig. 2 can be represented as a mixture of log-normal distributions, where on the log scale they appear as a mixture of normals. A formal test is not feasible due to this mixing, and the fact that some state pairs only exhibit one or two transitions. Hence, we make the assumption based on mathematical and computational tractability, noting that it does not conflict with the appearance of the data.

If Z has log-normal distribution with parameters μ and σ, then log Z has a normal distribution with mean μ and variance σ2. The mean of Z is \(\exp \left (\mu + \sigma ^{2}/2\right )\) and the variance of Z is \(\exp \left (2\mu + \sigma ^{2}\right )\left (\exp \left (\sigma ^{2}\right ) - 1 \right )\). If we let fij(d) be the probability density for \(D_{i} = d | i \rightarrow j\), then

$$ f_{ij}(d) = \frac{1}{d \sigma_{ij} \sqrt{2\pi}} \exp \left( -\frac{1}{2 \sigma^{2}_{ij}} \left( \ln d - \mu_{ij} \right)^{2} \right) $$
(3)

where μij and σij are estimated as the mean and standard deviation of the relevant observed (and log-transformed) phase durations. If there are no transitions between states i and j, pij = 0 and so we do not need to estimate the duration distribution. Some duration distributions are exemplified by very few (often one) such transitions. In the case of only one transition, we can not simultaneously estimate both μ and σ. In the case of very few observations, they may be too clustered for numerical stability, with the resulting distribution approaching a point mass, which is very unrealistic. Hence, we estimate μ (and σ if possible) from the data, but ensure that σ is not unrealistically small by setting the minimum coefficient of variation as 1/n, where n is the number of observed transitions. As the coefficient of variation (standard deviation divided by the mean) of a log-normal distribution is \(\sqrt {\exp (\sigma ^{2})-1}\), this means setting \(\sigma = \sqrt {\log (1 + 1/n^{2})}\) as a minimum.

The probability densities gij(q) for the event ‘\(Q_{0} = q | i \rightarrow j\)’ and hij(q) for the event ‘\(Q_{1} = q | i \rightarrow j\)’ are handled similarly to f, with the addition of an atom at 0 representing the possibility of no intervening quiescence i.e.

$$ \begin{array}{@{}rcl@{}} g_{ij}(q) &=& q^{(0)}_{ij} \delta_{\{0\}}(q) + \frac{1-q^{(0)}_{ij}}{q \sigma^{(0)}_{ij} \sqrt{2\pi}}\\ &&\exp \left( -\frac{1}{2 \sigma^{(0)^{2}}_{ij}} \exp \left( \ln q - \mu^{(0)}_{ij} \right)^{2} \right) \end{array} $$
(4)

where \(q^{(0)}_{ij}\) is the observed probability of there being no quiescence between the respective eruptive phases, and δA(q) is the mathematical delta function with the properties that δA(q) = 0 if qA, and \(\int \delta _{A}(q) \d q = 1\).

Even a couple of thousand phases is not many to populate a model such as this. We will thus want to model the phase duration (quiescence length) as independent of the previous quiescence length (phase duration) for tractability. Since there are several dozen state pairs, a few can be expected to appear significantly correlated. This is known as the multiple comparison problem in Statistics (Miller 1981). We deal with this by noting that if we treat the P values from all the state-pair tests as a sample, the null hypothesis (no correlation) is that the P values should be uniformly distributed on (0,1). Such a meta-analysis of the rank correlations between phase durations and quiescence length (Fig. 3) shows that there is no evidence of dependence between the durations and either the previous or subsequent quiescence length. Using the resulting assumed independence between phase durations and quiescence lengths, we have

$$ \Pr(i \rightarrow j | D = d, Q_{0} = q) = \frac{p_{ij} f_{ij}(d) g_{ij}(q)} {{\sum}_{k} p_{ik} f_{ik}(d) g_{ik}(q)} $$
(5)

and similarly for Q1, with h replacing g. These are the joint probabilities i.e. assuming that we know d and q. In reality, we will know one of these, and know that the other will be at least as long as that observed to the present. It is then easy to calculate the conditional probabilities

$$ \Pr(i \rightarrow j | D_{i} > d, Q_{0} = q) = \frac{p_{ij} g_{ij}(q) {\int}_{d}^{\infty} f_{ij}(u) \text{d}u} {{\sum}_{k} p_{ik} g_{ik}(q) {\int}_{d}^{\infty} f_{ik}(u) \text{d}u} $$
(6)

or

$$ \Pr(i \rightarrow j | D = d, Q_{1} > q) = \frac{p_{ij} f_{ij}(d) {\int}_{q}^{\infty} h_{ij}(u) \text{d}u} {{\sum}_{k} p_{ik} f_{ik}(d) {\int}_{q}^{\infty} h_{ik}(u) \text{d}u} , $$
(7)

which give the probability that the next eruptive state will be state j, given that the previous eruptive state was i, and the observed previous and current durations (one eruptive phase, the other quiescence). Equation (6) can be used for day-to-day forecasting during an eruptive phase, similarly (7) can be applied during a quiescence.

Fig. 3
figure 3

Meta-test of phase duration/quiescence correlation. If there is no overall correlation, the P values from the rank correlation test should be uniformly distributed between 0 and 1. This can be examined using a Kolmogorov-Smirnov test, with statistic Dn, as shown for pre-quiescence A and post-quiescence B

Validation

We compare the modelled probabilities for state transitions through time with those previously established by expert judgement for the 2012 VEI 2 Te Maari eruption from Mt Tongariro, New Zealand (Jolly et al. 2014), and the 1991 VEI 6 eruption of Pinatubo, Philippines (Newhall 2000). This direct comparison with expert forecasts, which were based on monitoring data and additional insights into the volcano and eruption, offers a form of validation for our model estimates.

Results

We have analysed the data as a whole, leaving for the discussion the prospects for an analysis at finer detail. As there were only eight recorded deformation states, we will suppress the deformation state in what follows. We first find that the estimated transition matrix between eruptive states is (in which Eff = Effusive, Exp = Explosion(s), Cts = Continuous, Int = Intermittent, E = Eruption; as described in Table 1)

where the current state is the row, and the next state is the column. We see that solely effusive behaviour is unlikely to presage violent explosions, but that continuous or intermittent explosions, whether accompanied or not by effusive behaviour, are much more likely to herald a major eruption.Major, and especially Plinian, eruptions indicate a sequencewhich is (probably) not at an end. We next calculated the expected number of visits given occupancy of a particular state,

Here, the current state is the row, and each column (state) then gives the expected number of visits to that state from the current state. Note that the expected number of visits includes the current one, which is why the diagonal elements are greater than one. Also, \({\sum }_{j} N_{\text {Start},k}\) matches the observed average number of eruptive phases per eruption, 2777 phases from 698 eruptions. We see that being in a Plinian eruption phase is the state most likely to result in a further Plinian phase. Finally, the probability of visiting a future eruptive state at any point in the sequence, conditional on the currently (or last, if in quiescence) occupied eruptive state is given by

Here, the current state is the row, and each possible future state is the column. This outcome has informationrelevant to living with persistent volcanism; that is: effusive behaviour, continuous or intermittent explosions and minor eruptions do not elevate the probability of a major or Plinian eruption above the ‘at start’ values. In other words, Plinian eruptions do not become more likely during a sequence, unless a major eruption occurs.

For illustrative purposes, we will show examples of the conditional probabilities of the next eruptive state when the current state is [Effusive and Explosive] (Fig. 4, post-quiescence, and Fig. 5, pre-quiescence > 0 days). In the case of a pre-quiescence of 0 days (two consecutive eruptive states), the probabilities for all current states are shown in Fig. 6. Where we have quiescence following an effusive and explosive phase (Fig. 4), the probability of an eruption end increases as quiescence lengthens, which is hardly surprising, but the effect is nonlinear in the previous phaseduration, with longer phases supporting a greater probability of further effusive and explosive or continuous explosions. In the case of an ongoing effusive and explosive phase (Fig. 5), as the duration increases, transitions to [Effusive plus Explosive] (Fig. 5b), [Major Eruption] (Fig. 5f) or [Eruption End] (Fig. 5h) states become more likely, while [Effusive] (Fig. 5a), [Intermittent Explosive] (Fig. 5d), [Minor Eruption] (Fig. 5e) and [Plinian Eruption] (Fig. 5g) become less likely. Interestingly, the probability of a Plinian phase is much greater if the previous quiescence extended to c. 50 days. When states of activity are not interspersed with quiescent phases (Fig. 6), the transition probabilities are relatively insensitive to longer phase durations.

Fig. 4
figure 4

Probabilities of eruptive state following [Effusive and Explosive] given the duration of the last eruptive phase and the current quiescence. The panel titles are the next state.

Fig. 5
figure 5

Probabilities of eruptive state following [Effusive and Explosive] given the duration of the previous quiescence (assuming quiescence preceded the current state), and the current phase duration. The panel titles are the next state

Fig. 6
figure 6

Probabilities of next eruptive state, given the duration of the current state, and no quiescence preceding it. The panel titles are the current state

As an example to validate the model, we will consider the 2012 Te Maari eruption from Mt Tongariro, New Zealand. The eruption is described by Jolly et al. (2014), and consisted of a small (5 × 105m3 of tephra ejected to a height of 6 km) eruption on August 6, 2012, followed by a smaller eruption on November 21, 2012. In our model, these are both minor eruptions, separated by a quiescent period. Between the two eruptions, there were conflicting monitoring signals from seismic and gas measurements, which prompted the Science Advisory Panel to use expert elicitation to estimate probabilities at several points during the sequence (Jolly et al. 2014). Figure 7 shows the median of the expert’s probabilities (Jolly et al. 2014) that there will be no further eruption in the next 30 days against the conditional probability derived from the model that the next state will be [Eruption End]. The expert-derived median probabilities from Jolly et al. (2014) start lower and increase more rapidly following the eruptions than for our model, at least partially reflecting the difference between forecasting 30 days of quiet versus eruption end (c. > 90 day quiet). Neither approach forecasts the second minor eruption, although the model avoids the tendency of the experts’ probability estimate to quickly saturate at one, in spite of (or perhaps because of) their access to monitoring data and familiarity with the volcano. Nevertheless, the similarity in trends suggests that the model is providing rational, evidence–based forecasts that align with those of experts who may have additional insights on the volcano and eruption in question. A factor that may have weighed with the experts, that is not represented within our generic model, is that this was a phreatic eruption of almost insignificant size.

Fig. 7
figure 7

Model and expert probabilities for the Te Maari eruption

Discussion

The obvious use of this model is in emergency management and decision making. To examine this, we will consider the 1991 eruption of Pinatubo (Newhall et al. 1996). We first deleted the 1991 Pinatubo eruption from the data and re-ran the model to avoid biasing the estimates. Figure 8 shows the day-by-day probabilities estimated by our model for a major or Plinian eruption phase during the first 86 days of the sequence. Of interest is the spike in the probability of a Plinian eruption at 0.095 on 12 June 2 days before the 20–40-km eruption columns of 14–15 June. To put this in context, Newhall (2000) report an estimated expert–based probability on 10 June of 0.08 for a pyroclastic flow to at least 15-km distance occurring within 24 h (Woo 2008). Given that the experts had access to monitoring data and an intimate knowledge of the volcano, neither of which are included in our model, the model probabilities and trends are again reassuringly commensurate to those derived by experts during the crisis. The model probability from 12 June (0.095) is comparable with the evacuation threshold calculated on the basis of cost-benefit analysis for Vesuvius of 0.101 (Marzocchi and Woo 2007, 2009) or 0.087 (Bebbington and Zitikis 2016), which are based on a much smaller eruption.

Fig. 8
figure 8

Upper) Coded phases of activity; and Lower) Model probabilities for the 1991 Pinatubo eruption. The vertical black line marks the onset of the Plinian phase

A second use for the model is in providing an evidence base for assigning prior probabilities to each of the nodes in an event tree e.g. (Neri et al. 2008; Garcia-Aristizabal et al. 2013; Newhall and Pallister 2015). For example, taking a very generic approach where we use all of the phase database rather than a subsection of analogue volcanoes, our model would forecast that the most likely initial phase of an eruption is a minor explosive eruption or intermittent explosions, both with a probability of c. 0.33. Our model can then be used to dynamically update the event tree conditional on current activity or lack of it. Should the eruption begin with, for example, 50 days of effusive and explosive activity followed by quiescence, then the probabilities associated with future eruption states i.e. separate nodes on the event tree can be derived from Figure 4. Such prior probabilities can then be updated within a Bayesian framework using indicators such as monitoring data.

The current database contains only those eruptions identified as multi-phase. To support the pre-eruption development of an event tree or probabilistic forecasts, we will need to include also those eruptions that are demonstrably single-phase. Without these, the probability of an eruption continuing, and hence its duration, may be overestimated and the starting phases incorrectly weighted. A preliminary analysis (Ang Pei Shan, Nanyang Technological University, personal communication, 2018) of the GVP eruption descriptions available online suggests that approximately 35 to 40% of recorded historical eruptions can be considered multi-phase. If the methodology is to be extended to non-historical eruptions (i.e. those observed solely in the geological record), attention to under-reporting issues will be required.

An obvious avenue for future development is the incorporation of real-time monitoring data into the model to allow updating of the probabilities. In order to make the state transition probabilities conditional on monitoring signals, the latter needs to be transformed into a tractable form. One option, exemplified by the methodology in event trees (Newhall and Hoblitt 2002), is to discretize the possible signal levels via the establishment of thresholds, and apply fuzzy logic (Marzocchi et al. 2008) to effectively smooth the probabilities. Alternatively, a normal copula formulation (i.e. a multivariate probability distribution) in a Bayesian belief network (Hincks et al. 2018) could be explored. A less computationally intensive possibility would be to use a generalized linear model with the monitoring data as the independent variables. None of these are simple to develop in a volcano-independent manner. The state duration distributions could potentially also be modified using a proportional hazards formulation (Faenza et al. 2003; Green et al. 2013), given enough monitoring data.

Assigning states to sometimes terse eruption descriptions is occasionally problematic. Thus, there is a certain dichotomy between intermittent explosions and series of minor eruptions. In some cases, the explosions are truly intermittent (e.g. Vulcanian), but in many cases, they may simply reflect a lack of detailed observation about when the multiple minor eruptions actually occurred, or their exact number. The intermittent explosion category also covers the spectrum of possibilities between these end members. In some cases, the number of explosions in an intermittent explosions phase is known, and these are shown against the phase duration in Fig. 9. We see that, in the main, [Intermittent Explosion] is a collection of minor eruptions, as the modal eruption frequency is less than one a day. A substantial extension to this work could use this information regarding number of explosions as a basis for resampling those occurrences where explosion numbers are not known, allowing many of them to be decomposed into individual events. Repeating the resampling in a Monte Carlo framework would result in a distribution for the relevant transition probabilities, and a (meta-)distribution for the related distributions of durations and quiescences. Incorporating these into the model will improve its robustness with regard to individual explosions.

Fig. 9
figure 9

Number of observed explosions in intermittent explosion phases, where known

In terms of robustness, the multi-phase data are dominated by the more active and longer observed or well-studied volcanoes (Table 4; See Supplementary Table 1 for the full list). Vesuvius and Etna are somewhat atypical for the GVP database, in terms of the level of study and therefore phase data available. Thus, we have performed a sensitivity analysis by omitting them and recalculating the transition matrix P. This leaves 2413 eruptive phases in 631 eruptions from 185 volcanoes, and a recalculated transition matrix

While some of the probabilities change considerably in relative terms, for example, the transitions [Continuous explosion] → [Effusive] (0.147 to 0.066), [Minor eruption] → [Plinian eruption] (0.0026 to 0.0013) and [Continuous explosion] → [Major eruption] (0.056 to 0.088), the changes in absolute terms are small (only one probability changes by more than 0.05, this being the decrease in [Continuous explosion] → [Effusive] noted above).

Table 4 Eruptive phases per volcano

The phases database is largely compiled from eruption descriptions available in the GVP Bulletins, and hence wecan deduce that each eruption was observed in some detail. We also defined states to be relatively robust to reporting biases. However, as usual, the record of eruptions (and phases) increases nonlinearly with time as under-reporting diminishes. This is not an issue for our intra-eruption model provided that the catalogue is time homogeneous. Figure 10 shows the number of eruptions with a given minimum number of phases, along with the total number of phases. It is clear that the level of all the curves is proportional throughout time (the convergence at the end corresponds to the 1983–2018 Kilauea eruption), indicating that the catalogue is time homogeneous. We are unsure what has caused the post-1990 plateauing.

Fig. 10
figure 10

Cumulative numbers of eruptions and phases over time

As a final check, we applied the model to a smaller subset of eruptions (starting from 1950 onwards). Comparing the resulting transition probability matrix with that in Eq. 2, we observed only two noticeable differences: Firstly the transition probability from [Continuous explosion] to [Effusive] drops. Part of this can be ascribed to the deletion of all the eruptions from Vesuvius (where the last eruption was in 1944). In the full data set, there are only 20 such transitions, so this appears to be a small-size effect. The second difference is that the [Plinian] transition probabilities become increasingly grainy due to the reduced number of what is naturally the rarest eruptive phase type. As most Plinian phases occur as the initial phase of an eruption, we do not consider using the full data for intra-eruption forecasting unreasonable.

Our coded database of 698 eruptions from 187 volcanoes between AD79 and 2015 represents a fraction of the GVP Holocene record (currently host to information regarding more than 9800 eruptions (c. 2900 with text descriptions) from 871 volcanoes). With such a large database of phases currently, and the potential to significantly expand, future work should look to subdivide the data by volcano or eruption analogues, and recalculate the transition matrices for specific contexts. For example, if the volcanoes are stratified according to type and dominant composition, do we find that the records from andesitic stratovolcanoes are more informative about the behaviour of an andesitic stratovolcano than, say, rhyolitic calderas? The aim here would be to determine the weights in an ensemble-type forecast (Marzocchi et al. 2012), which will quantify the role of analogue volcanoes (Rodado et al. 2011). We see some variation in phase durations when we subset the current phases database by volcano type or tectonic setting, although the database is dominated by andesitic stratovolcanoes. In order to assess the utility and robustness of the intra-eruptive forecasting for individual volcanoes, we need to compile and analyse as much phase data as possible. Expanding the database presented here is a significant task currently underway.

Conclusions

Forecasting activity after eruption onset is of prime importance to emergency managers, particularly the question of when or if major explosive activity will begin and when the activity or eruption will end. To address this, we built upon the previous empirical study of intra-eruptive activity by Jenkins et al. (2007) by coding a greatly expanded eruption database for use with a semi-Markov chain model for forecasting the probability of transitions between phases. The unpublished eruption database provided by GVP and USGS was coded for the distinct styles of activity, and their durations, exhibited within 698 multi-phase eruptions from 187 volcanoes around the world. The resulting database of nearly 3000 eruptive phases (plus a further c. 1500 quiescent phases) was used to identify the probabilities of transitioning between different phases. Applying first- and second-order semi-Markov chain models to the data allowed us to condition the transitional probabilities on the previous and current phase duration, as well as the duration of pre- and post-quiescence. Comparing our model probabilities, with observed activity and expert-derived probabilities during eruptions at Tongariro and Pinatubo, shows that the model provides similar trends and values to those of experts, who have access to monitoring data and are very familiar with the volcano. This suggests that the model is providing rational, evidence–based forecasts and has potential as a supporting tool in intra-eruptive forecasting prior to or during an eruption. Future work could look to expand the database, and re-assess across subsets of volcanoes and/or eruptions. The potential to incorporate monitoring data into the model and output probabilities is also an interesting potential avenue of study.