Forecasting the duration of volcanic eruptions: an empirical probabilistic model

Gunn, L. S.; Blake, S.; Jones, M. C.; Rymer, H.

doi:10.1007/s00445-013-0780-8

Forecasting the duration of volcanic eruptions: an empirical probabilistic model

Research Article
Open access
Published: 05 December 2013

Volume 76, article number 780, (2014)
Cite this article

Download PDF

You have full access to this open access article

Bulletin of Volcanology Aims and scope Submit manuscript

Forecasting the duration of volcanic eruptions: an empirical probabilistic model

Download PDF

L. S. Gunn¹,
S. Blake¹,
M. C. Jones² &
…
H. Rymer¹

3968 Accesses
9 Citations
1 Altmetric
Explore all metrics

Abstract

The ability to forecast future volcanic eruption durations would greatly benefit emergency response planning prior to and during a volcanic crises. This paper introduces a probabilistic model to forecast the duration of future and on-going eruptions. The model fits theoretical distributions to observed duration data and relies on past eruptions being a good indicator of future activity. A dataset of historical Mt. Etna flank eruptions is presented and used to demonstrate the model. The data have been compiled through critical examination of existing literature along with careful consideration of uncertainties on reported eruption start and end dates between the years 1300 AD and 2010. Data following 1600 is considered to be reliable and free of reporting biases. The distribution of eruption duration between the years 1600 and 1669 is found to be statistically different from that following it and the forecasting model is run on two datasets of Mt. Etna flank eruption durations: 1600–2010 and 1670–2010. Each dataset is modelled using a log-logistic distribution with parameter values found by maximum likelihood estimation. Survivor function statistics are applied to the model distributions to forecast (a) the probability of an eruption exceeding a given duration, (b) the probability of an eruption that has already lasted a particular number of days exceeding a given total duration and (c) the duration with a given probability of being exceeded. Results show that excluding the 1600–1670 data has little effect on the forecasting model result, especially where short durations are involved. By assigning the terms ‘likely’ and ‘unlikely’ to probabilities of 66 % or more and 33 % or less, respectively, the forecasting model based on the 1600–2010 dataset indicates that a future flank eruption on Mt. Etna would be likely to exceed 20 days (± 7 days) but unlikely to exceed 86 days (± 29 days). This approach can easily be adapted for use on other highly active, well-documented volcanoes or for different duration data such as the duration of explosive episodes or the duration of repose periods between eruptions.

Forecasting eruptions from long-quiescent volcanoes

Article Open access 12 February 2022

Estimating the Intervals Between Mount Etna Eruptions

Intra-eruption forecasting

Article Open access 23 May 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The anticipated duration of future or on-going volcanic eruptions is often a topic of much concern in volcanically active areas, yet systematic studies of eruption duration are rare (Mulargia et al. 1985; Stieltjes and Moutou 1989; Simkin 1993; Sparks and Aspinall 2004; Mastin et al. 2009). Analyses of eruption durations can provide probabilistic constraints on the likely duration of future or on-going eruptions which could greatly benefit emergency response planning at times of volcanic crisis. Although much research has been conducted on forecasting the likely start of eruptions using statistical analysis of repose intervals (see Marzocchi and Bebbington (2012) for a review), the same cannot be said for duration data as a tool for forecasting the ends of eruptions. The aims of this paper are therefore to present a set of duration data and use it to illustrate a general statistical method of forecasting likely duration (independent of any other information) using Mt. Etna as a case study, chosen for its well-documented historical record.

The duration of a volcanic eruption can be defined as the period of time when fresh volcanic material is being emitted at the Earth’s surface. Here, we consider a period of continuous magma discharge as the basic building block of an eruption. However, the intensity of volcanic activity during an eruption is rarely constant. More often, discrete phases of heightened activity separated by periods of surface quiescence lasting hours, days or months can be observed (Simkin 1993; Siebert et al. 2010). The Smithsonian Institution’s Global Volcanism Program considers eruptive phases separated by periods of quiescence of less than 3 months as the same eruption, unless there are significant reasons to treat them as distinct events (Venzke et al. 2002; Siebert et al. 2010). However, the degree and duration of a quiescent pause required to warrant grouping a series of eruptive phases as one eruption, or splitting a series of eruptive phases into more than one eruption, is likely to depend on local circumstances. A similar argument applies to defining durations of repose periods.

This paper begins by critically assessing the available data on the duration of flank eruptions at Mt. Etna and presents a list of reliable eruption duration data. It goes on to describe and summarise these data using empirical survivor function plots and to assess variations in the distribution of eruption duration with time and location. The paper ends by demonstrating how survivor function statistics can be used to forecast the duration of future and on-going eruptions. Although the focus of this paper is Mt. Etna, the methods used to describe and forecast eruption durations are applicable to other volcanoes with well-documented historical activity.

Data selection

Mt. Etna background

Mt. Etna is the most active volcano in Europe, and consequently, it is one of the most widely studied and documented volcanoes in the world (Andronico and Lodato 2005). Hazard studies of Mt. Etna began in the late 1970s and early 1980s focussing on patterns in historic eruptions and predicting the location of future activity (Frazzetta and Romano 1978; Guest and Murray 1979; Duncan et al. 1981). Since then, numerous studies have built on this work by analysing catalogues of historic eruptions (Mulargia et al. 1985; Behncke and Neri 2003; Branca and Del Carlo 2004; 2005; Salvi et al. 2006; Neri et al. 2011; Smethurst et al. 2009; Passarelli et al. 2010; Proietti et al. 2011) and producing susceptibility and probabilistic hazard maps of surrounding areas (Andronico and Lodato 2005; Bisson et al. 2009; Behncke et al. 2005; Crisci et al. 2010; Harris et al. 2011; Cappello et al. 2012, 2013).

Two types of volcanic activity have been recognised in the historical records of Mt. Etna: persistent activity from summit vents and periodic activity from eruptive fissures on the volcano’s flanks (Guest and Murray 1979; Duncan et al. 1981; Acocella and Neri 2003; Behncke and Neri 2003; Branca and Del Carlo 2005; Crisci et al. 2010). Despite the typically explosive nature of summit activity, its effects are often localised to within a few hundred/thousand metres of the eruption site and therefore its threat to property and surrounding populations is confined above 1600–1800 m above sea level; consequently, only the tourist facilities are potentially exposed to the risk of lava invasion (Duncan et al. 1981; Proietti et al. 2011; Cappello et al. 2013). However, flank eruptions tend to produce lava flows that can extend for far greater distances and to lower elevations making them the greatest hazard on Mt. Etna (Duncan et al. 1981; Chester et al. 1985; Behncke and Neri 2003; Andronico and Lodato 2005; Behncke et al. 2005; Proietti et al. 2011). This greater relevance to lava flow hazard assessment, and the fact that the historical record of flank eruptions is considered reliable and nearly complete after 1600 AD (Mulargia et al. 1985; Behncke and Neri 2003; Branca and Del Carlo 2004; Behncke et al. 2005; Branca and Del Carlo 2005; Tanguy et al. 2007), whereas that of summit eruptions is only considered reliable after the late nineteenth century (Chester et al. 1985; Andronico and Lodato 2005; Branca and Del Carlo 2005; Proietti et al. 2011), led us to exclude summit activity from this analysis and focus only on flank eruptions. Mt. Etna’s flank eruptions occur from vents that are distributed unevenly across the volcano, being mostly concentrated in three rift zones and the Valle del Bove (Duncan et al. 1981; Acocella and Neri 2003; Behncke et al. 2005). Our compiled data includes information on vent location in order to investigate any relationships between eruption duration and location.

Mt. Etna eruption duration data

The dataset used here contains flank eruptions from 1300 to 2010. It is a result of a critical examination of the catalogues and descriptions of summit and flank activity compiled by Tanguy (1981), Mulargia et al. (1985), Behncke and Neri (2003), Branca and Del Carlo (2004), Behncke et al. (2005), Branca and Del Carlo (2005), Tanguy et al. (2007) and Neri et al. (2011) and, in specific cases, additional information gleaned from other sources. For this study, we are primarily interested in the duration of each flank eruption, so in those cases where flank activity occurred during a longer period of summit activity, the dates used are restricted to those of the flank component only. For example, volcanic activity began from both summit and flank vents on 18 May 1780. Summit activity continued into July (Tanguy et al. 2007), whereas the flank component of this eruption ended earlier, with reported end dates ranging from 28 to 31 May 1780 (Branca and Del Carlo 2004; Behncke et al. 2005; Branca and Del Carlo 2005; Tanguy et al. 2007). For this study, the dates of the flank activity are used and this eruption is reported as starting on 18 May and ending on 29 May 1780. In a few other cases (e.g. May 1759), the precise dates of flank activity during times of summit activity are not reported. These flank eruptions have been excluded.

Some eruptions on Mt. Etna consist of more than one eruptive phase separated by periods of quiescence ranging from hours to days. An argument could be made that each phase constitutes a separate eruption; however, because some eruptions are described in detail whereas others are more vague, it is unrealistic to assume that we have information about every quiescent period that occurred on Mt. Etna between the years 1300 and 2010. Instead, we propose that periods of quiescence of less than 10 days between eruptive phases are not sufficient enough to warrant separating an eruptive sequence into two eruptions.

Accounting for uncertainty

Uncertainties in the start and/or end dates of each eruption were considered in detail. One source of uncertainty is contradictory reporting. For example, the 1911 flank eruption is documented by Acocella and Neri (2003), Behncke and Neri (2003), Andronico and Lodato (2005), Behncke et al. (2005) and Neri et al. (2011) as starting on 10 and ending on 22 September, and these dates were chosen as the preferred start and end dates of this eruption in this study. However, Mulargia et al. (1985) reported this eruption as starting 1 day earlier (9 September). To account for this, an uncertainty in the duration of + 1 day has been assigned to the eruption’s start date. Furthermore, Tanguy (1981) and Tanguy et al. (2007) reported this eruption as ending 1 day earlier (21 September), whereas Branca and Del Carlo (2004) and Branca and Del Carlo (2005) reported it as ending 1 day later (23rd September). Here, an uncertainty in the duration of both + and − 1 day has been assigned to the eruption’s end date. This results in a preferred eruption duration of 12 days (10 to 22 September) with a maximum duration uncertainty of +2 days (9 to 23 September) and − 1 day (10 September to 21 September), thus the total duration of this eruption could range from 11 to 14 days. This method has been applied to all eruptions with contradictory start and/or end dates reported in the literature.

A second source of uncertainty arises where the start and/or end date of an eruption has been reported only to the nearest month or year. Here, a date was assigned along with a number of days uncertainty, according to the method adopted by Bebbington and Lai (1996) and Benoit and McNutt (1996) (Table 1). Sometimes, despite an eruption’s start or end only being known to the nearest month, slightly more qualitative information is provided indicating that it was ‘early,’ ‘mid’ or ‘late’ in that month. Again, the method of Benoit and McNutt (1996), summarised in Table 1, was applied.

Table 1 Table of assigned dates and uncertainties

Full size table

Where all sources examined give the same start and end date for an eruption an uncertainty value is assigned based on whether the eruption is reported to the nearest day or whether hourly resolution is provided in the primary literature (Table 1).

Some eruptions carry both literature-derived uncertainties and assigned uncertainties. For example, the 1755 eruption has a preferred duration of 6 days. This duration carries a + 1 day uncertainty which is derived from differences in the reported start date. The precise times of day that the eruption started and ended are unknown and although this literature-derived uncertainty covers the potential for the eruption duration to have been slightly longer than 6 days, it does not allow for it to be slightly shorter. To account for this, a − 0.5 day uncertainty in the eruption duration is assigned according to the ‘nearest day’ category of Table 1. The maximum uncertainty in the duration for this eruption is therefore + 1 day and − 0.5 days.

Eighty known or suspected flank eruptions are reported from 1300 AD to 2010, however, three of these are excluded as their location is ambiguous and may be best described as summit eruptions (September 1869, February 1999 and July 2006). A further 11 eruptions have unknown durations (1333, August 1381, 1444, September 1446, September 1578/79, June 1607, March 1689, May 1759, 1764, July 1787, and November 1918) and four were excluded due to their duration uncertainty being greater than 50 % of their total preferred duration (November 1566, September 1682, August 1874 and December 1949). This results in 62 eruptions considered to have reliable durations (listed in Table 2) that can be used in the following analyses, 49 of these eruptions carry duration uncertainties of less than ±10 %.

Table 2 Dataset of historical Mt. Etna flank eruptions with known durations, 1300–2010

Full size table

Additional information on specific eruptions

Tanguy et al. (2007) provide the most comprehensive catalogue of historical Etna eruptions extending from 1600 to 2003. The majority of the eruptions within this time period that are included in Table 2 are also reported by Tanguy et al. (2007), although sometimes, where numerous other sources give alternative dates, their dates are not used but are covered in the eruption’s assigned uncertainty. Two eruptions, however, are used here but not included by Tanguy et al. (2007). These are the February 1643 and the January 1968 eruptions (#8 and #41, Table 2). The latter eruption is documented in numerous other sources, including Tanguy (1981). Its exclusion by Tanguy et al. (2007) may have been an oversight, with other eruptions between 1966 and 1970 included in Tanguy (1981) but missing from Tanguy et al. (2007). The 1968 eruption is therefore included in our dataset using information from other sources (Table 2). The February 1643 eruption is excluded by Tanguy et al. (2007) due to some confusion in the literature between its vent location and the location of the 1646-7 lava flows (Tanguy et al. 2007); however, we include this eruption here, using the dates reported by Behncke et al. (2005) and Tanguy (1981).

Information about the dates of three other eruptions differs significantly from that recorded within the catalogue of Tanguy et al. (2007). These are the March 1956 and the February and November 1975 eruptions (#39, #45 and #46, Table 2). The flank eruption of March 1536 (#3, Table 2) was accompanied by summit activity that continued until the end of the year (Siebert et al. 2010; Tanguy et al. 2007). The flank component of this eruption is reported as ending in April (Behncke et al. 2005), whereas the information within Appendix 1 of Tanguy et al. (2007) states that the eruption ‘probably ended on 8 April.’ To account for this uncertainty, the precision to which the end date is known is considered to be in the ‘early month’ category of Table 1 so the 5 April is assigned with a ±5 day duration uncertainty (Table 2).

The two 1975 flank eruptions also occurred during a period dominated by summit activity. Such close association between the summit and flank activity makes isolating the dates of the flank component difficult and Tanguy et al. (2007) have simply recorded these eruptions within the longer summit activity. Other workers tried to resolve this, and it is the dates and uncertainty within these alternative references that are included in Table 2.

Mt. Etna vent location data

Flank eruptions at Mt. Etna are often associated with multiple aligned vents or fissures radiating from the volcano’s summit (Acocella and Neri 2003). Table 2 and Fig. 1 contain information about the location of each eruption, derived from maps by Romano et al. (1979), Chester et al. (1985), Acocella and Neri (2003) and Branca et al. (2011).

The East flank of Mt. Etna is dominated by the large collapse feature of the Valle del Bove (Guest et al. 1984) and smaller Valle del Leone. The 19 eruptions with vents/fissures located within the Valle del Bove and the one eruption within the Valle del Leone are identified as ‘VDB’ or ‘VDL’ in the location column of Table 2; however, for the remainder of this paper, the Valle del Leone eruption (#56, Table 2) will be grouped with the Valle del Bove eruptions and referred to as such.

The April 1971 eruption (#42, Table 2) was a complex flank eruption (Tanguy et al. 2007). The activity occurred at three vents on the upper South flank and a series of vents on the East flank of the volcano within the Valle del Bove and extending onto the NE flank (Branca and Del Carlo 2004; 2005; Tanguy et al. 2007; Le Guern 1972). Despite the varying location of activity during this eruption, and its association with the early formation of the summit’s South-East crater, it is included here as one event with a duration of 68 days on the ENE flank.

The May 1879 and October 2002 eruptions (#27 and #59, Table 2) both involved more than one vent located on different flanks of the volcano. Here, the vent which was active for each eruption’s entire duration is used, although the erupted material from both vents is shown on the map in Fig. 1. Precise vent locations could not be found for two of the eruptions in Table 2 (#8 and #45); however, examination of the literature and careful location of their erupted products has given enough evidence to assign approximate locations for these eruptions, with both eruptions #8 and #45 affecting the North–North–East region of the volcano.

The completeness of the historical record

The completeness of the eruption record requires some consideration when investigating past eruptive activity. It is important to recognise that some eruptions may have gone unnoticed or unrecorded entirely and that as a result our data (Table 2) is a sample of recorded eruptions only. The recording of Mt. Etna’s eruptive activity dates back to Greek and Roman epochs (Branca and Del Carlo 2004; 2005; Tanguy et al. 2007). However, the records are often only considered to be complete after 1600 AD (Mulargia et al. 1985; Behncke and Neri 2003; Branca and Del Carlo 2004, 2005; Behncke et al. 2005; Tanguy et al. 2007; Cappello et al. 2013). Figure 2a shows an apparent increase in eruption frequency since 1300 AD which is most probably an artefact of reporting. Prior to 1600 AD, data are scarce, and eruptions are often excluded due to insufficient information regarding their duration. Following 1600 AD, the steepness of the curve increases and fewer eruptions are excluded due to the dataset becoming a more complete representation of flank activity at Mt. Etna. All flank eruptions after 1970 have accurately known durations.

Figure 2b shows that this increased reporting of eruptions with time is accompanied by an increase in the number of reported eruptions with short durations. This may suggest that the early eruption record is biased towards eruptions which made the most impact on surrounding areas (Andronico and Lodato 2005). This reporting bias appears to reduce during the eighteenth century (Fig. 2b) and may reflect a shift towards more modern approaches in observing and documenting volcanic activity after the large 1669 flank eruption (Branca and Del Carlo 2004; 2005).

A regional bias in the quality and completeness of eruption records may also exist on Mt. Etna. The volcano’s Western flank appears to have experienced fewer flank eruptions than other areas of the volcano (Fig. 1). Geological maps of Mt. Etna (Romano et al. 1979; Branca et al. 2011) show more lava flows on this flank than are represented in this study; however, these are either a result of eruptions prior to 1300 AD, and therefore outside the range of this investigation, or have undocumented eruption years. Although the reduced number of eruptions, especially in recent years, from vents located on Mt. Etna’s West flank may reflect a preference for eruptive vents to open on other flanks, some of this may be a reporting bias due to the Western flank being the least populated region of Mt. Etna (Behncke et al. 2005). Similarly, 95 % of the reported eruptions within the uninhabited and poorly accessible Valle del Bove post-date 1600 AD (Table 2), which may reflect a reporting bias here too.

Data before 1600 AD may be a poor representation of Mt. Etna’s activity due to the reporting biases discussed and therefore cannot be used to make reliable forecasts about future activity. Data from before 1600 AD has therefore been excluded from the analyses in the remainder of this paper.

Statistical analysis

Survivor functions

The duration of a volcanic eruption can be considered as a type of survival time measurement. Survival analysis was first employed as a method of costing insurance premiums. It is now commonly used in medical studies to assess the length of remission following different treatments or in engineering situations to investigate the length of time before failure of an appliance or system (Machin et al. 2006). As with these types of data, eruption duration can be displayed graphically in an empirical survivor function plot, constructed by placing the observed durations (x _i) in rank order so that x ₁ ≤ x ₂ ≤ … ≤ x _N, where N is the total number of observations. The empirical survivor function (F̂(x _i)) is then plotted at duration x _i, where

$$\hat{F}(x_{i}) = \frac{N-i}{N}, \qquad i=1,\ldots,N. $$

(1)

The resultant empirical survivor function curve provides information about the survival experience of that dataset. Typically these curves have an inverse ‘S’ shape with shallow gradient tails to the distribution representing rarer events with unusually long or short durations and a steeper central portion where the majority of eruption durations plot. Figure 3 shows the empirical survivor function curve for preferred eruption duration data between the years 1600 to 2010 along with curves for the maximum and minimum possible eruption durations, derived from individual eruption duration uncertainty (discussed previously and reported in Table 2). This plot demonstrates that the overall shapes and positions of the three empirical survivor function curves are very similar, implying that individual eruption duration uncertainty has a negligible effect on the overall distribution of the data.

Temporal variation in eruption duration

A fundamental assumption of any investigation using historical eruption data as an insight into future activity is that the character of past eruptions is a good indicator of the volcano’s future activity (Chester et al. 1985; Behncke and Neri 2003; Behncke et al. 2005; Cappello et al. 2013). The following section considers the appropriateness of this assumption to the Mt. Etna data in Table 2.

The distribution of eruption duration between 1600 and 1669 is dominated by long duration eruptions, three of which are longer than any subsequent eruption (Fig. 2b). During this time, erupted lavas were rich in plagioclase phenocrysts and believed to have been stored in a shallow magma reservoir within the volcanic edifice prior to eruption. However, directly following the 1669 eruption Mt. Etna experienced a sharp decrease in productivity and a reduction in the phenocryst content of erupted lavas, which has been attributed to the draining of a shallow magma reservoir within the volcanic edifice during the seventeenth century (Hughes et al. 1990; Behncke and Neri 2003). It is possible that the shallow magma chamber existing at this time promoted longer duration eruptions.

After 1669 eruption durations range from 0.5 to 473 days and there has been a general increase in eruption frequency with time that is not an artefact of reporting (Behncke and Neri 2003; Behncke et al. 2005; Branca and Del Carlo 2005; Cappello et al. 2013). In particular, dramatic increases in eruption frequency and output rate have been recognised following 1971 (Andronico and Lodato 2005; Behncke et al. 2005; Branca and Del Carlo 2005; Smethurst et al. 2009; Cappello et al. 2013). A similar trend can be observed in our data (Table 2), with 20 flank eruptions in the past 38 years (1971–2010), as opposed to only 7 in the 41 years before it (1930–1971) (Fig. 2, Table 2). The increased frequency of eruptions following 1971 is accompanied by a reduction in short duration eruptions, with reported eruption durations of less than 6 days being absent after this time (Fig. 2b). Median eruption durations for these three time periods are 190 days (1600 to 1669), 24 days (1670–1971) and 50 days (1972–2010).

Figure 4 shows empirical survivor function curves for the eruption durations of these three time periods. The 1670 to 1971 and 1972 to 2010 datasets diverge at durations less than 10 days (Fig. 4). If such variation in eruption duration distribution is significant, it could indicate a change in the dynamics of the volcanic system at c. 1971 in such a way that discourages short duration eruptions, thus reducing their likelihood in the future. This implies that using the whole dataset of post-1669 eruptions would be an unrealistic representation of future activity, and that it might be more appropriate to use the 1972–2010 subset of the data. However, a Mantel–Haenszel Logrank test (Appendix A and (Machin et al. 2006)) indicates that the curves are not statistically different at the 0.05 level and it cannot be concluded that they derive from different distributions (test statistic = 2 on 1 degree of freedom). For forecasting future eruption durations on the basis of past eruptions this implies that restricting the input data to eruptions from 1972 to 2010 is currently unnecessary.

In contrast, the empirical survivor function curve for the 1600–1669 dataset is entirely offset from the 1670–1971 and 1972–2010 curves (Fig. 4) and a Mantel–Haenszel Logrank test (Appendix A and (Machin et al. 2006)) indicates that this offset is statistically significant at the 0.05 level (test statistic = 7 and 5.3 on 1 degree of freedom, respectively). This clear difference and the evidence for a different plumbing system beneath Mt. Etna prior to 1670 may indicate that a future eruption of this scale and duration is unlikely and therefore that we should only use eruptions after 1669 as the basis of any forecasting models. However, the 1600–1669 time period has previously been interpreted as the culminating phase of a century-scale cycle in eruptive activity at Mt. Etna, with the next cycle still continuing today (Behncke and Neri 2003; Tanguy et al. 2003; Cappello et al. 2013). Recent investigations into the plumbing system of Mt. Etna indicate increasing magma accumulation beneath the volcano (Behncke and Neri 2003; Patané et al. 2003; Allard et al. 2006). This, along with the trend of increasing eruption frequency and output rate, may indicate a gradual return to the style of activity that was typical in the early seventeenth century which Behncke and Neri (2003) ascribed to the ending of a century-scale cycle of activity. By excluding the 1600–1669 data, the model would be unable to account for the possibility that future activity at Mt. Etna could become more voluminous and potentially hazardous in the future. We will compare forecasting models using both the 1600–2010 and 1670–2010 datasets later.

Sectoral variation in eruption duration

Previous investigations into the location of historical flank eruptions at Mt. Etna have highlighted three regions of high vent density on the North-Eastern, Southern and Western flanks of the volcano interpreted as three rift zones where eruptions are common (Duncan et al. 1981; Chester et al. 1985; Behncke et al. 2005; Neri et al. 2011; Proietti et al. 2011). To assess whether the distribution of eruption duration varies between each rift zone, we have split the volcano into three sectors. Unlike Proietti et al. (2011), our sectors are not evenly distributed or positioned so that one boundary is directed North. Instead, we have used similar sectors to Behncke et al. (2005) whereby each sector contains one of the three identified rift zones along with any vents which appear closely associated with it. Using a point centred above the summit, these are between (A) 347 ° and 104 °, (B) 104 ° and 226 ° and (C) 226 ° and 347 ° (Fig. 1), and include the North-Eastern, Southern and Western rift zones, respectively.

The boundary between sectors A and B cuts through the Valle del Bove. Eruptions within this area are common and, since 1971, many lava flows from the summit’s South East crater enter this valley making the resurfacing rate high such that identifying vents and fissures within this area can be difficult. The precise positions of the 1955 and 1802 fissures (#13 and #19, Table 2) are unknown but reported to be close to Rocca Mussarra and are therefore considered here as part of sector A. Other fissures and vents within the Valle del Bove have been located using the sources previously discussed and assigned to sectors A or B accordingly.

The majority of eruptive vents and fissures outside of the Valle del Bove fall clearly within one of the three sectors (Fig. 1). The March 1981 eruption (#51, Table 2) was the result of a long fissure which crosses the boundary between sectors A and C. The eruption is most probably a result of the North–East rift zone and is therefore considered part of sector A (Fig. 1). Similarly, the eruptive fissure of the May 2008 eruption (#62, Table 2) crosses the boundary between sectors A and B. The lower portion of this fissure was active throughout the eruption and thus the eruption is attributed here to sector B (Fig. 1).

Empirical survivor function curves plotted for the 1600 to 2010 eruptions in sectors A, B and C are displayed in Fig. 5. The small sample size of sector C (n = 6) results in a crude empirical survivor function curve and any differences between its eruption duration distribution and that of sectors A and B is difficult to discern. The sample sizes of sectors A and B are higher (n = 23 and n = 29, respectively) and while the tails of their distributions overlap, the central portions diverge, with median durations of 18 days (sector A) and 84 days (sector B) (Fig. 5). To assess whether these differences are significant, Mantel–Haenszel Logrank tests have been performed on all possible combinations of sector pairs (i.e. A–B, A–C and B–C) and the results are summarised in Table 3. Despite the median duration of sector B (84 days) being higher than that for sectors A and C (18 and 19.5 days, respectively), the distributions cannot be considered statistically different at the 0.05 level. For sector pair A–B, a Mann–Whitney test and t test (applied to the logs of the data) were also performed, with similar results (p value results are 0.213 and 0.371, respectively). It can therefore be concluded that despite the observable differences in the central portion of the empirical survivor function curves (Fig. 5), we cannot reject the null hypothesis that there is no difference between the shapes of the eruption duration distribution of sectors A and B. This is likely to be due to the relatively small numbers of eruptions in statistical terms in each sector.

Table 3 Mantel–Haenszel Logrank test results for all possible sector pairs

Full size table

Forecasting the duration of future flank eruptions

Description of the statistical model

When duration data are modelled using theoretical distributions, survival analysis can be used to estimate the probability that a future eruption will exceed a given length of time. The probabilistic forecasts are based on best-fit parametric statistical models of empirical survivor functions. The two-parameter log-logistic and the three-parameter Burr type XII distributions have been considered and their survivor functions are

$$\hat{F}(x)\,_{\mathrm{(Log-logistic)}} = \frac{1}{1+(x/\sigma)^{\beta}} $$

(2)

$$\hat{F}(x)\,_{\mathrm{(Burr\,XII)}} = \frac{1}{\{1+(x/\sigma)^{\beta}\}^{\alpha/\beta}} $$

(3)

To identify the best-fit log-logistic and Burr type XII survivor functions, their parameters (α, β and σ) have been found by maximum likelihood estimation and their goodness of fit to the observed duration data tested using a Kolmogorov–Smirnov test. If the Kolmogorov–Smirnov test results indicate that the observed duration data could have been derived from either distribution, a likelihood ratio chi-squared test is used to assess whether there is any benefit in employing the more complicated Burr type XII distribution or whether the simpler log-logistic distribution provides an equally good fit to the data. Additional information on these methods can be found in Appendix B.

The best-fit survivor function can be used to make probabilistic forecasts about the duration of future and on-going volcanic eruptions. Three types of forecast are made in this investigation. The first is the probability of exceeding a specified duration x according to the survivor function given in Eq. 2 or 3. The second is a variation on the survivor function, adapted for on-going eruptions, wherein the residual life function is used to find the probability of exceeding a specified total duration x, having already reached duration t and is given by

$$\hat{F}_{t}(x)\,_{\mathrm{(Log-logistic)}} = \frac{\sigma^{\beta}+t^{\beta}}{\sigma^{\beta}+x^{\beta}} $$

(4)

$$\hat{F}_{t}(x)\,_{\mathrm{(Burr\,XII)}} = \left(\frac{\sigma^{\beta} + t^{\beta}}{\sigma^{\beta} + x^{\beta}} \right)^{\alpha/\beta} $$

(5)

Finally, the quantile function given by

$$x_{p}\,_{\mathrm{(Log-logistic)}} = \sigma\left( \frac{p}{1-p} \right)^{1/\beta} $$

(6)

$$x_{p}\,_{\mathrm{(Burr\,XII)}} = \sigma \left\{\frac{1}{(1-p)^{\beta/\alpha}} -1 \right\}^{1/\beta} $$

(7)

enables the user to find the duration associated with a stated quantile p, that is, the duration that has probability 1 − p of being exceeded. For each forecast, the 95 and 80 % confidence intervals have been calculated using the methods given in Appendix C.

Application of the model to Mt. Etna

The above investigations have shown that differences in the distribution of eruption duration before and after 1971 and differences in the distribution of eruption duration on different sectors of Mt. Etna’s flanks are not statistically significant at the 0.05 level. This indicates that the eruption durations recorded between 1670 and 2010 could have all derived from the same distribution, and therefore it is acceptable to use this data in the forecasting model presented below. We have also demonstrated that the distribution of eruption duration between 1600 and 1669 is dominated by long duration eruptions which may have been the result of a shallow magma reservoir existing beneath Mt. Etna at this time. A gradual return to this type of activity in the future has been proposed by Behncke and Neri (2003) so we have made eruption duration forecasts on two different datasets: 1600–2010 and 1670–2010.The 1600–2010 dataset allows us to account for the very long eruption durations that may occur in the future if a shallow magma reservoir were to be re-established. It contains a total of 58 observed eruption durations ranging from less than 1 day to 3,653 days with a median duration of 34.5 days (Table 2). The 1670–2010 dataset may give a more realistic forecast of eruption durations in the near future. This dataset contains 51 observed eruption durations ranging from less than 1 day to 473 days with a median duration of 26 days (Table 2).

For both the 1600–2010 and 1670–2010 datasets, the Kolmogorov–Smirnov goodness of fit test suggests that the observed durations could have been derived from either a log-logistic or Burr type XII distribution. Additional chi-squared tests indicate that there is no benefit in applying the Burr type XII distribution over the log-logistic distribution. The best fit log-logistic survivor functions have estimated parameter values of 0.94 and 40.56 (1600–2010) and 1.00 and 33.00 (1670–2010) for β and σ, respectively. The resultant survivor function curves are displayed graphically alongside their empirical survivor curves (Emp_SF) in Fig. 6.

Table 4 contains the results of seven forecasts made from the 1600–2010 and 1670–2010 datasets; three using the survivor function (a and b in Table 4), two using the residual life function where t is 14 days (c and d in Table 4) and two using the quantile function (e and f in Table 4). The values displayed in the first column of each table represent the scenario being forecast, e.g. the probability of an eruption exceeding 7 days or the duration associated with a p value of 0.34. The final two columns in each table represent the 95 and 80 % confidence intervals that have been calculated. When discussed in the text, 80 % confidence intervals are quoted.

Table 4 Forecast results for the 1600–2010 and 1670–2010 datasets

Full size table

The shape and position of the two empirical survivor function curves in Fig. 6 are similar. The greatest difference is the prominent long duration tail of the empirical survivor function curve in Fig. 6a (1600–2010) which is absent in Fig. 6b (1670–2010). This is a result of the long duration eruptions which occurred between 1600 and 1669. The effect of this on the forecasting model results is that the probability of exceeding a given duration is consistently lower for the 1670–2010 dataset than the 1600–2010 dataset and that this difference is slightly greater when forecasting longer duration eruptions (Table 4). For example, when the 1600–2010 dataset is considered, results show an 84 % (± 5 %) probability of exceeding 1 week (7 days) and a 57 % (± 7 %) probability of exceeding 1 month (30 days). These probabilities are reduced to 82 and 52 % when the 1670–2010 dataset is considered (a and b in Table 4). A similar trend is also present in the results of the residual life function (c and d in Table 4).

The survivor function and residual life function both give the probability of exceeding stated durations. Perhaps more useful is the quantile function, allowing the user to identify durations associated with specific probabilities. Furthermore, the assignment of qualitative terms such as ‘likely’ and ‘unlikely’ to sensible probabilities make the model results accessible to a wider audience. Here, we consider a ‘likely’ result as having a probability of 66 % or more, and an ‘unlikely’ result as having a probability of 33 % or less (following the approach taken in communicating climate change scenarios; (Budescu et al. 2009; Mastrandrea et al. 2010)). These equate to values of p of 0.34 and 0.67, respectively. The results of such forecasts are shown in e and f of Table 4. Using the 1600–2010 dataset results show a 66 % probability of exceeding 20 days (± 7 days) and a 33 % probability of exceeding 86 days (± 29 days) (e in Table 4), therefore it can be concluded that a future flank eruption on Mt. Etna is likely to exceed 20 days but unlikely to exceed 86 days. When the dataset is restricted to eruptions since 1669, these durations are reduced to 17 days (± 6 days) and 67 days (± 22 days), respectively (f in Table 4).

Conclusions

We have introduced a probabilistic model for forecasting the duration of future and on-going eruptions using a new dataset of historical flank eruption durations from Mt. Etna. The model shows great potential for future use as a forecasting tool and could greatly benefit emergency response planning both prior to and during volcanic crises. It is not specific to Mt. Etna and can easily be adapted for use on other highly active, well-documented volcanoes or for different duration data such as the duration of explosive episodes or the duration of repose periods between eruptions. The model uses datasets of historical eruption durations and thus relies on past eruptions being a good indicator of future activity. It is therefore limited to use on volcanoes with well-documented historic eruptions and data must firstly be assessed for reporting biases and any changes in eruption duration with time or location.

Critical assessment of documented flank eruptions from Mt. Etna resulted in a reliable dataset of reported eruption durations between the years 1600 and 2010 containing 58 eruptions with reported durations ranging from less than 1 day to 3,653 days. Eruptions between the years 1600 and 1669 include the three longest duration flank eruptions reported at Mt. Etna. As a result, this time period is statistically different from that following it. Although usually this would be the cause to exclude this data, a return to eruptions of this scale and duration in the future is conceivable. Other temporal variations in eruption duration were assessed but not found to be statistically significant. Furthermore, significant differences in the distribution of eruption duration from the prevailing three rift zones on Mt. Etna (NE, S and W) were also not found. However, there are indications of possible differences between NE and S sectors that future data and/or other information might strengthen.

We chose to run the forecasting model on two datasets: 1600–2010 and 1670–2010, allowing us to assess the effect of including the longer duration 1600–1669 eruptions. Results indicate that the probability of exceeding a given duration is consistently less for the 1670–2010 dataset; however, the degree to which this is the case is slight, especially where short durations are involved. When using the 1600–2010 dataset of historical flank eruption durations and by assigning the terms ‘likely’ and ‘unlikely’ to probabilities of 66 % or more and 34 % or less, respectively, the forecasting model was used to indicate that a future flank eruption on Mt. Etna would be likely to exceed 20 days (± 7 days) and unlikely to exceed 86 days (± 29 days).

References

Acocella V, Neri M (2003) What makes flank eruptions? The 2001 Etna eruption and its possible triggering mechanisms. Bull Volcanol 65(7):517–529
Article Google Scholar
Allard P, Behncke B, D’Amico S, Neri M, Gambino S (2006) Mount Etna 19932005: anatomy of an evolving eruptive cycle. Earth Sci Rev 78(1-2):85–114
Article Google Scholar
Andronico D, Lodato L (2005) Effusive activity at Mount Etna volcano (Italy) during the 20th Century: a contribution to volcanic hazard assessment. Nat Hazards 36:407–443
Article Google Scholar
Bebbington MS, Lai CD (1996) Statistical analysis of New Zealand volcanic occurrence data. J Volcanol Geoth Res 74(1-2):101–110
Article Google Scholar
Behncke B, Falsaperla S, Pecora E (2009) Complex magma dynamics at Mount Etna revealed by seismic, thermal, and volcanological data. J Geophys Res 114:1–17
Google Scholar
Behncke B, Neri M (2003) Cycles and trends in the recent eruptive behaviour of Mount Etna (Italy). Can J Earth Sci 40(10):1405–1411
Article Google Scholar
Behncke B, Neri M, Nagay A (2005) Lava flow hazard at Mount Etna (Italy): new data from a GIS-based. In: Manga M, Ventura G (eds) Kinematics and dynamics of lava flows. GSAMSP369, Boulder, Colorado, pp 189–208
Chapter Google Scholar
Benoit JP, McNutt SR (1996) Global volcanic earthquake swarm database and preliminary analysis of volcanic earthquake swarm duration. Ann Geofis XXXIX:221–229
Google Scholar
Bisson M, Behncke B, Fornaciai A, Neri M (2009) LiDAR-based digital terrain analysis of an area exposed to the risk of lava flow invasion: the Zafferana Etnea territory, Mt. Etna Italy. Nat Hazards 50(2):321–334
Article Google Scholar
Bonaccorso A, Bonforte A, Calvari S, Del Negro C, Di Grazia G, Ganci G, Neri M, Vicari A, Boschi E (2011a) The initial phases of the 2008—2009 Mount Etna eruption: a multidisciplinary approach for hazard assessment. J Geophys Res 116:1–19
Article Google Scholar
Bonaccorso A, Cannata A, Corsaro RA, Di Grazia G, Gambino S, Greco F, Miraglia L, Pistorio A (2011b) Multidisciplinary investigation on a lava fountain preceding a flank eruption: the 10 May 2008 Etna case. Geochem Geophys Geosyst 12(7):1–21
Article Google Scholar
Branca S, Coltelli M, De Beni E, Wijbrans J (2008) Geological evolution of Mount Etna volcano (Italy) from earliest products until the first central volcanism (between 500 and 100 ka ago) inferred from geochronological and stratigraphic data. Int J Earth Sci 97(1):135–152
Article Google Scholar
Branca S, Coltelli M, Groppelli G, Lentini F (2011) Geological map of Etna volcano, 1 : 50, 000 scale. Ital J Geosci 130(3):265–291
Google Scholar
Branca S, Del Carlo P (2004) Eruptions of Mt. Etna during the past 3200 years: a revised compilation integrating the historical and stratigraphical records. In: Bonaccorso A, Calvari S, Coltelli M, Negro CD, Falsaperla S (eds) Mt. Etna: Volcano Laboratory. Am Geophys Union, Washington, pp 1–22
Chapter Google Scholar
Branca S, Del Carlo P (2005) Types of eruptions of Etna volcano AD 1670–2003: implications for short-term eruptive behaviour. Bull Volcanol 67(8):732–742
Article Google Scholar
Budescu DV, Broomell S, Por HH (2009) Improving communication of uncertainty in the reports of the Intergovernmental panel on climate change. Psychol Sci 20(3):299–308
Article Google Scholar
Burton MR, Neri M, Andronico D, Branca S, Caltabiano T, Calvari S, Corsaro RA, Del Carlo P, Lanzafame G, Lodato L, Miraglia L, Salerno G, Spampinato L (2005) Etna 2004–2005: an archetype for geodynamically-controlled effusive eruptions. Geophys Res Lett 32(9):1–4
Article Google Scholar
Cappello A, Bilotta G, Neri M, Negro CD (2013) Probabilistic modeling of future volcanic eruptions at Mount Etna. J Geophys Res: Sol Ea 118(5):1925–1935
Article Google Scholar
Cappello A, Neri M, Acocella V, Gallo G, Vicari A, Del Negro C (2012) Spatial vent opening probability map of Etna volcano (Sicily, Italy). Bull Volcanol 74(9):2083–2094
Article Google Scholar
Chester DK, Duncan AM, Dibben C, Guest JE, Lister PH (1999) Mascali, Mount Etna region Sicily: an example of fascist planning during the 1928 eruption and its continuing legacy. Nat Hazards 19(1):29–46
Article Google Scholar
Chester DK, Duncan AM, Guest JE, Kilburn CRJ (1985) Mount Etna The anatomy of a volcano. Chapman and Hall, London
Google Scholar
Chester DK, Duncan AM, Sangster H (2012) Human responses to eruptions of Etna (Sicily) during the late-pre-industrial era and their implications for present-day disaster planning. J Volcanol Geoth Res 225-226:65–80
Article Google Scholar
Coltelli M, Proietti C, Branca S, Marsella M, Andronico D, Lodato L (2007) Analysis of the 2001 lava flow eruption of Mt. Etna from three-dimensional mapping. J Geophys Res 112:1–18
Article Google Scholar
Corsaro R, Miraglia L (2009) Dynamics of magma in the plumbing system of Mt. Etna volcano, Sicily, Italy: a contribution from petrologic data of volcanics erupted from 2007 to 2009. In: American Geophysical Union, Fall Meeting, abstract # V51C-169
Crisci GM, Avolio MV, Behncke B, D’Ambrosio D, Di Gregorio S, Lupiano V, Neri M, Rongo R, Spataro W (2010) Predicting the impact of lava flows at Mount Etna, Italy. J Geophys Res 115(B4):B04203
Google Scholar
Duncan AM, Chester DK, Guest JE (1981) Mount Etna volcano: environmental impact and problems of volcanic prediction. The Geogr J 147(2):164–178
Article Google Scholar
Frazzetta G, Romano R (1978) Approccio di studio per la stesura di una carta del rischio vulcanico (Etna-Sicilia). Mem. Soc. Geol. Ital. 19:691–697
Google Scholar
Guerra I, Lo Bascio A, Luongo G, Scarpa R (1976) Seismic activity accompanying the 1974 eruption of Mt. Etna. J Volcanol Geoth Res 1(4):347–362
Article Google Scholar
Guest JE, Chester DK, Duncan AM (1984) The Valle del Bove Mount Etna: its origin and relation to the stratigraphy and structure of the volcano. J Volcanol Geoth Res 21:1–23
Article Google Scholar
Guest JE, Murray JB (1979) An analysis of hazard from Mount Etna volcano. J Geol Soc London 136:347–354
Article Google Scholar
Harris AJL, Favalli M, Wright R, Garbeil H (2011) Hazard assessment at Mount Etna using a hybrid lava flow inundation model and satellite-based land classification. Nat Hazards 58(3):1001–1027
Article Google Scholar
Harris AJL, Murray JB, Aries SE, Davies MA, Flynn LP, Wooster MJ, Wright R, Rothery DA (2000) Effusion rate trends at Etna and Krafla and their implications for eruptive mechanisms. J Volcanol Geoth Res 102:237–269
Article Google Scholar
Hughes JW, Guest JE, Duncan AM (1990) Changing styles of effusive eruption on Mount Etna since AD 1600, Magma transport and storage. Wiley, New York, pp 385–406
Google Scholar
Le Guern F (1972) Etudes dynamiques sur la phase gazeuse éruptive. Technical report, Commissariat à l’Energie Atomique, France
Google Scholar
Machin D, Cheung Y, Parmar MKB (2006) Survival analysis: a practical approach, 2nd edn. Wiley
Marzocchi W, Bebbington M (2012) Probabilistic eruption forecasting at short and long time scales. Bull Volcanol 74(8):1777–1805
Article Google Scholar
Mastin L, Guffanti M, Servranckx R, Webley P, Barsotti S, Dean K, Durant A, Ewert J, Neri A, Rose W, Schneider D, Siebert L, Stunder B, Swanson G, Tupper A, Volentik A, Waythomas C (2009) A multidisciplinary effort to assign realistic source parameters to models of volcanic ash-cloud transport and dispersion during eruptions. J Volcanol Geoth Res 186(1-2):10–21
Article Google Scholar
Mastrandrea MD, Field CB, Stocker TF, Edenhofer O, Ebi K, Frame D, Held H, Kriegler E, Mach K, Matschoss P, Plattner G, Yohe G, Zwiers F (2010) Guidance note for lead authors of the IPCC fifth assessment report on consistent treatment of uncertainties Technical report, Intergovernmental Panel on Climate Change (IPCC)
Mulargia F, Tinti S, Boschi E (1985) A statistical analysis of flank eruptions on Etna volcano . J Volcanol Geoth Res 23(3-4):263–272
Article Google Scholar
Neri M, Acocella V (2006) The 2004–2005 Etna eruption: implications for flank deformation and structural behaviour of the volcano. J Volcanol Geoth Res 158(1):195–206
Article Google Scholar
Neri M, Acocella V, Behncke B, Giammanco S, Mazzarini F, Rust D (2011) Structural analysis of the eruptive fissures at Mount Etna (Italy). Ann Geophys 65(5):464–479
Google Scholar
Passarelli L, Sansò B, Sandri L, Marzocchi W (2010) Testing forecasts of a new Bayesian time-predictable model of eruption occurrence. J Volcanol Geoth Res 198(1-2):57–75
Article Google Scholar
Patané D, Gori PD, Chiarabba C, Bonaccorso A (2003) Magma ascent and the pressurization of Mount Etna’s volcanic system. Science 299(5615):2061–2063
Article Google Scholar
Pinkerton H, Sparks RSJ (1976) The 1975 sub-terminal lavas, Mount Etna: a case history of the formation of a compound lava field. J Volcanol Geoth Res 1(2):167–182
Article Google Scholar
Proietti C, De Beni E, Coltelli M, Branca S (2011) The flank eruption history of Etna (1610–2006) as a constraint on lava flow hazard. Ann Geophys 54(5):480–490
Google Scholar
Romano R, Sturiale C, Lentini F (1979) Geological Map of Mt. Etna. CNR, Progetto Finalizzato Geodynamica, Instituto Internazionale di Vulcanologia (Catania). 1:50.000 scale.
Salvi F, Scandone R, Palma C (2006) Statistical analysis of the historical activity of Mount Etna, aimed at the evaluation of volcanic hazard. J Volcanol Geoth Res 154:159–168
Article Google Scholar
Siebert L, Simkin T, Kimberley P (2010) Volcanoes of the world. 3rd edn. Smithsonian Institution, Washington
Simkin T (1993) Terrestrial volcanism in space and time. Earth Planet Sc Lett 21:427–452
Google Scholar
Smethurst L, James MR, Pinkerton H, Tawn JA (2009) A statistical analysis of eruptive activity on Mount Etna, Sicily. Geophys J Int 179(1):655–666
Article Google Scholar
Sparks RSJ, Aspinall WP (2004) Volcanic activity : frontiers and challenges in forecasting, prediction and risk assessment. In: Sparks, RSJ, Hawkesworth, CJ (eds) The state of the planet: frontiers and challenges in geophysics, IUGG/AGU, pp 359–373
Stieltjes L, Moutou P (1989) A statistical and probabilistic study of the historic activity of Piton de la Fournaise, Reunion Island, Indian Ocean. J Volcanol Geoth Res 36(1-3):67–86
Article Google Scholar
Tanguy J (1981) Les éruptions historiques de l’Etna: chronologie et localisation. Bull Volcanol 44(3):585–640
Article Google Scholar
Tanguy J-C, Condomines M, Le Goff M, Chillemi V, La Delfa S, Patanè G (2007) Mount Etna eruptions of the last 2,750 years: revised chronology and location through archeomagnetic and ²²⁶Ra- ²³⁰Th dating. Bull Volcanol 70(1):55–83
Article Google Scholar
Tanguy J-C, Le Goff M, Principe C, Arrighi S, Chillemi V, Paiotti A, La Delfa S, Patanè G (2003) Archeomagnetic dating of Mediterranean volcanics of the last 2100 years: validity and limits. Earth Planet Sc Lett 211(1-2):111–124
Article Google Scholar
Tanguy JC, Tazieff H, Cristofolini R (1973) The 1971 Etna Eruption: petrography of the Lavas [and discussion]. Philos T Roy Soc A 274(1238):45–53
Article Google Scholar
Venzke E, Wuderman RW, McClelland L, Simkin T, Luhr JF, Siebert L, Mayberry G, Sennert S (2002) Global volcanism program digital information series. http://www.volcano.si.edu/reports/. Last Accessed: 1 Aug 2012
Wadge G (1976) Deformation of Mount Etna, 1971–1974. J Volcanol Geoth Res 1(3):237–263
Article Google Scholar
Wadge G, Guest JE (1981) Steady-state magma discharge at Etna 1971-81. Nature 294(5841):548–550
Article Google Scholar

Download references

Acknowledgments

LSG is supported by a Natural Environment Research Council PhD studentship. We thank John Murray for sharing his invaluable knowledge of Mt. Etna’s volcanic activity and Peter Fawdon for his use of Arc GIS to help identify flank vents and fissures. We would also like to thank Sonia Calvari, Gilda Currenti and Marco Neri for their useful comments on an earlier version of this paper.

Author information

Authors and Affiliations

Department of Environment, Earth and Ecosystems, The Open University, Walton Hall, Milton Keynes, MK7 6AA, UK
L. S. Gunn, S. Blake & H. Rymer
Department of Mathematics and Statistics, The Open University, Walton Hall, Milton Keynes, MK7 6AA, UK
M. C. Jones

Authors

L. S. Gunn
View author publications
You can also search for this author in PubMed Google Scholar
S. Blake
View author publications
You can also search for this author in PubMed Google Scholar
M. C. Jones
View author publications
You can also search for this author in PubMed Google Scholar
H. Rymer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to L. S. Gunn.

Additional information

Editorial responsibility: S. Calvari

Appendices

Appendix A: Mantel–Haenszel Logrank test for comparing empirical survivor functions

A Logrank test has been used to assess the significance of any differences between the empirical survivor functions of two groups of duration data (g ₁ and g ₂). The method and equations outlined below are based on the information within (Machin et al. 2006).

Firstly, the observed durations (x) are placed in rank order irrespective of their original group and the expected number of eruptions ending from each group is then estimated at each duration interval (i) using

$$E_{\{g_{1}, i\}} = \frac{r_{i}T_{\{g_{1}, i\}}}{N_{i}} \qquad \text{and} \qquad E_{\{g_{2}, i\}} = \frac{r_{i}T_{\{g_{2}, i\}}}{N_{i}}. $$

(8)

Here, r _i is the total number of observed eruptions with duration i (irrespective of group), T _i is the total number of eruptions in the specified group (g ₁ or g ₂) with durations longer than or equal to i and N _i is the total number of observations in both groups with durations longer than or equal to i. The total number of observations in each group ($O_{g_{1}}$ and $O_{g_{2}}$) and the total expected number of eruptions ending in each group ($E_{g_{1}}$ and $E_{g_{1}}$) are calculated. For better treatment of tied data, where two or more observed eruptions are of equal duration, the Mantel–Haenszel version of the Logrank test is employed, involving the calculation of the hypergeometric variance V at each duration interval:

$$V_{i} = \frac{T_{\{g_{1}, i\}}T_{\{g_{2}, i\}}r_{i}s_{i}}{N_{i}^{2}(N_{i}-1)} $$

(9)

where s _i is the total number of observed eruptions with durations longer than i (irrespective of group). We then sum the individual V _i values obtained from Eq. 9 to get V and the $\chi _{MH}^{2}$ Logrank statistic is calculated by either:

$$\chi_{MH}^{2} = \frac{(O_{g_{1}} - E_{g_{1}})^{2}}{V} \qquad or \qquad \chi_{MH}^{2} = \frac{(O_{g_{2}} - E_{g_{2}})^{2}}{V} $$

(10)

The null hypothesis of the logrank test is that the datasets being compared all have the same survival experience, and thus any variation between their empirical survivor functions can be attributed purely to chance (Machin et al. 2006). The resultant test statistic is compared to the 95 % χ ² distribution quantile with degrees of freedom equal to one less than the number of groups being compared, and the null hypothesis is rejected if the test statistic is larger than this quantile.

A variation of this test can be used to compare three or more empirical survivor functions allowing the user to establish whether the differences are statistically significant; however, it does not provide information about where these differences occur. For this reason, we have chosen not to use this modified test, but to run the Logrank test outlined above on pairs of empirical survivor functions to assess where significant differences lie.

Appendix B: Modelling using appropriate statistical distributions

In order to make probabilistic forecasts of future eruption durations, empirical survivor function curves are modelled using a theoretical distribution. The log-logistic and Burr type XII distributions are tested in this study, and the survivor functions and related equations are shown in Eqs. 2 to 7, where x is duration, σ a scale parameter and both α and β are shape parameters. In both distributions, the duration is the only known quantity and all parameters have been estimated using maximum likelihood. Early stages of this investigation also tested the fit of exponential and Weibull distributions; however, these have provided insufficiently good fits to all duration datasets studied.

A Kolmogorov–Smirnov (KS) goodness of fit test has been used to determine whether the distributions provide a good fit to the observed duration data. This test is based on comparisons between the empirical distribution function (F _n) of the observed data and the cumulative distribution function (F ₀) of an assumed theoretical distribution. These equate to the inverse of the empirical survivor function (1) or theoretical distribution’s survivor function (2 and 3), respectively. Graphically, the KS test statistic D identifies the maximum vertical displacement between F _n and F ₀ and thus is obtained by computing the maximum absolute difference between F _n and F ₀ at all values of x:

$$D = \underset{x}{Max} | F_{n}(x) - F_{0}(x) | $$

(11)

The null hypothesis of this test is that the observed sample can be said to have derived from the theoretical distribution being tested. It can be accepted when the KS test statistic is lower than the critical value for that sample size (N) and appropriate significance level. Here, we test at a 5 % significance level where the critical value is given by $ \frac {1.36}{\sqrt {N}} $.

Some degree of approximation has been introduced to this method due to the parameters of the theoretical distributions being estimated from the observed duration data and the presence of tied data in the low duration region of the dataset. These are considered to have a negligible effect on the final test result.

Where both distributions satisfy the criteria to accept the null hypothesis, a further test is used to determine whether it is worthwhile applying the more complex Burr type XII distribution or whether the simpler log-logistic distribution provides an adequate fit to the data. To determine this, the difference between the maximised values of the log-likelihood associated with each distribution is doubled, and the resultant value compared to the χ ²distribution quantile on 1 degree of freedom at the 5 % significance level (3.84). If the calculated value is greater than this critical value, then the null hypothesis, that there is no difference between the two distributions is rejected and the Burr type XII distribution is used to model the observed duration data.

Appendix C: Calculating 95 and 80 % confidence intervals on model results

The results of the forecasting models presented so far are ‘point estimates’ for the specific value of interest (x or p for the survivor/residual life function and quantile function models, respectively). In each case 95 and 80 % confidence intervals are given in the form of

$$\mathrm{`point~estimate'}\quad \pm~~ 1.96 \, \sqrt{\hat{V}}$$

and

$$\mathrm{`point~estimate'}\quad \pm~~ 1.28 \, \sqrt{\hat{V}}$$

respectively, where V̂ is the estimated variance for the formula being used in the model. The calculation of V̂ is specific to the theoretical distribution and is based on standard asymptotic theory for maximum likelihood estimation. The equations involved are displayed in Table 5. There, the Cs are elements of the asymptotic covariance matrix associated with the maximum likelihood estimates β̂ and σ̂ of β and σ, respectively; specifically, C[1,1] is the asymptotic variance of β̂, C[2,2] that of σ̂ and C[1,2] is the asymptotic covariance between β̂ and σ̂.

Table 5 Equations involved in calculating variance (V̂) for the Log-logistic distribution in the survivor function (F̂(x)), residual life function (F̂ _t) and quantile function (x _p) models

Full size table

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Gunn, L.S., Blake, S., Jones, M.C. et al. Forecasting the duration of volcanic eruptions: an empirical probabilistic model. Bull Volcanol 76, 780 (2014). https://doi.org/10.1007/s00445-013-0780-8

Download citation

Received: 17 September 2013
Accepted: 31 October 2013
Published: 05 December 2013
DOI: https://doi.org/10.1007/s00445-013-0780-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Forecasting the duration of volcanic eruptions: an empirical probabilistic model

Abstract

Similar content being viewed by others

Forecasting eruptions from long-quiescent volcanoes