Extreme weather and climate events are of great societal interest as they significantly affect people and property—usually adversely. They are also of public interest since they are unusual natural phenomena, which have scientific stories behind them. Just as weather is a topic of daily conversation, extreme weather events (including longer-duration climate extremes such as drought) provide a universal talking point. Whilst their proximate cause is meteorological, it is now inevitable that the question will be asked, “Was this event due to climate change?” This is a perfectly natural question to ask given that climate change is a reality and that in many cases, climate change will be felt most directly through its impact on extremes. (For example, sea level rise will generally impact society through storm surges leading to coastal inundation.) People relate to what they have experienced, so if extremes are the sharp edge of climate change, then it becomes important, from the standpoint of both communication and risk reduction, to address this question scientifically.
It is sometimes said that anthropogenic warming of the climate system will increase the energy of the atmosphere, which will lead to more storminess and thus more extreme behaviour. It is true that a warmer atmosphere can hold more moisture, which can provide more latent heat release in a convectively driven extreme event, and more precipitation in general. But atmospheric motions are driven by energy differences [1], not by energy itself, and the polar amplification that is a fundamental characteristic of global warming [2] will tend to reduce the pole-to-equator temperature gradient. Also, in a warming world, one would tend to expect a reduction in cold extremes. Thus, whilst climate change has undoubtedly affected weather and climate extremes, both the sign and the magnitude of the effect need to be assessed on a case-by-case basis. Although there may be general expectations based on global aspects of climate change, there can be local departures from that behaviour, and over any given time period, multi-decadal variability can also play a role in changes in extremes [3].
It must also be recognized that an unprecedented event does not, by itself, imply that the climate has changed. Weather and climate records are of finite length, and as a record lengthens, new record-breaking events will continue to occur even under stationary statistics; climate change is instead reflected in deviations from that expected behaviour [4]. A further pitfall is the temporal inhomogeneity that results from defining a reference period at the beginning of a time series: when the reference period is short and the variability is normalized by that of the reference period, spurious trends in extremes can result [5], and several high-profile papers have fallen prey to this error, e.g. [6].
Weather concerns the instantaneous state of the atmosphere, but climate is generally understood to comprise its averaged behaviour (including higher-order statistics represented in probability distributions) over some period of time. The attribution of changes in the observed statistics of extremes is clearly a climate science question, which can be addressed using well-established detection-attribution methods [7]. In contrast, the role of climate change in a particular extreme event concerns only a single observed event, and thus involves no observed change, nor any averaging over observed events. This takes it out of the traditional domain of climate science and places it more within the domain of weather science or (for the longer-duration extremes) seasonal prediction. If a weather or climate event is truly extreme in the present climate, then perforce it requires unusual meteorological conditions, which means that climate change is at most only a contributing factor. (As noted by [8••], the failure to recognize this fact can lead to apparently contradictory conclusions concerning the same event.) The issues involved are illustrated for the extreme northern winter of 2013/2014 in Fig. 1. However, even a small contributing factor can have enormous consequences in the context of an extreme event, because the impacts are generally highly nonlinear in the hazard. The scientific question is then to determine that contribution.
In general, there seem to be two basic (and at first sight orthogonal) approaches for determining the impact of one factor on an effect involving multiple factors. One is what will be called the ‘risk-based’ approach, where the change in likelihood of the effect arising from the presence of that factor is estimated. It is understood that the attribution is only probabilistic, much as smoking increases the risk of lung cancer but is neither a necessary nor a sufficient cause of lung cancer in any particular individual. This approach to extreme event attribution was introduced to the climate science community by [11] and applied by [12] to the European heat wave of 2003. The second is what will be called the ‘storyline’ approach, where the causal chain of factors leading to the event is identified, and the role of each assessed. This approach is exemplified in [13••]’s study of the 2011 Texas drought/heat wave.
In considering the effects of climate change, there is a striking difference between those associated with purely thermodynamic aspects of the climate system and those also involving dynamical aspects [14•]. The former—which include continental- or basin-scale averages of quantities such as sea level, surface air temperature, sea-ice extent, snow cover, or upper-ocean heat content—exhibit changes that are generally robust in observations, in theory, and in models. However, regional aspects of climate change, including regional patterns of precipitation, generally involve dynamical aspects related to the atmospheric and oceanic circulation, and these are not robust in observations, in theory, or in models. This distinction is reflected in the strength of the various findings in the latest Summary for Policymakers of the IPCC [15•]. The reasons include the comparatively small signal-to-noise of forced changes in dynamically related quantities, the poor understanding of the mechanisms behind them, and the sensitivity of model behaviour to parameterized processes. Figure 2 illustrates the issue for the case of annual-mean precipitation. To the extent that extreme weather and climate events involve dynamical processes—and most of course do—these uncertainties must be confronted when assessing the role of climate change in the event. Trenberth et al. [16••] have recently argued that in some cases, these uncertainties may prevent a reliable application of the risk-based approach and thus that the storyline approach is to be preferred.
The Risk-Based Approach
The risk-based approach to extreme event attribution is fundamentally probabilistic and requires creating two sample populations, a ‘factual’ (the world as it is) and a ‘counter-factual’ (the world as it would have been without climate change). The conceptual framework is illustrated in Fig. 3a, b, for the case of small and large shifts in the mean (for simplicity, the distribution shape is not altered). Given a factual event, the effect of climate change can be expressed in terms of either the altered frequency of occurrence of an event of that magnitude (the intercepts of p0 and p1 with the vertical grey line in Fig. 3a) or the altered magnitude of an event having that frequency in the factual climate (the intercepts of p0 and p1 with the horizontal grey line in Fig. 3a). It is evident that the relative role of climate change compared to natural variability may be quite different when viewed in terms of frequency or in terms of magnitude (cf. [8••]). Yet both perspectives are clearly valid; the magnitude perspective is typical in a regulatory context, e.g. the need to protect against a 200-year event.
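To make the two perspectives concrete, the following is a minimal synthetic sketch in Python; the Gaussian distributions, the 0.8-unit mean shift, and the event threshold are illustrative assumptions, not values from any cited study.

```python
# A minimal synthetic sketch of the risk-based framework of Fig. 3a.
# All numbers are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
counterfactual = rng.normal(loc=0.0, scale=1.0, size=1_000_000)  # samples from p0
factual = rng.normal(loc=0.8, scale=1.0, size=1_000_000)         # samples from p1

threshold = 2.5  # event definition: exceedance of this anomaly

# Frequency perspective: how much more likely is an event of this magnitude?
p0 = (counterfactual > threshold).mean()
p1 = (factual > threshold).mean()
print(f"probability ratio p1/p0 = {p1 / p0:.1f}")

# Magnitude perspective: how much larger is the event of the same
# (factual) frequency than its counter-factual counterpart?
q = 1.0 - p1  # non-exceedance probability of the factual event
dmag = np.quantile(factual, q) - np.quantile(counterfactual, q)
print(f"magnitude change at fixed frequency = {dmag:.2f}")
```

For this pure mean shift, the magnitude change simply recovers the shift, whereas the probability ratio depends strongly on where the threshold sits in the tail.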
Implementing this approach involves several steps, which have both practical and philosophical implications. The first step is the event definition. The observed extreme event is unique, so it must be abstracted to a class of event amenable to statistical analysis. This requires a choice of physical variable and the spatial and temporal averaging used to define the event. There is obviously considerable freedom in this choice, yet any particular choice can have a strong effect on the result; in Fig. 3a, different choices would correspond to different locations of the grey lines, and the p1/p0 ratios will be quite sensitive to this choice. See [17•] for an explicit example.
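This sensitivity is easily demonstrated within the same synthetic set-up as above (again, purely illustrative numbers): as the threshold defining the event class is moved outward, the probability ratio grows by an order of magnitude.

```python
# Sensitivity of the p1/p0 ratio to the event definition, i.e. to the
# position of the vertical grey line in Fig. 3a. Synthetic illustration.
import numpy as np

rng = np.random.default_rng(0)
counterfactual = rng.normal(0.0, 1.0, 1_000_000)  # samples from p0
factual = rng.normal(0.8, 1.0, 1_000_000)         # samples from p1

for threshold in (1.5, 2.0, 2.5, 3.0, 3.5):
    ratio = (factual > threshold).mean() / (counterfactual > threshold).mean()
    print(f"threshold {threshold:.1f}: p1/p0 = {ratio:.1f}")
```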
The second step is the construction of the factual likelihood distribution p1. This will generally be done with a climate model. The fundamental challenge is that in order to estimate the likelihood of an extreme event, one needs to perform many years of simulation—the more extreme the event, the larger the number of years. Yet, in order to do so, the model must be computationally cheap to run, which means that it may not be able to simulate credible facsimiles of the event in question. Even for cases where attribution seems easy, such as large-scale heat waves, land-surface feedbacks may involve mesoscale processes that are not adequately represented in models, and deficiencies in the representation of precipitation, let alone precipitation extremes, in coarse-resolution climate models are legion [18]. Therefore, the appropriateness of the model for the study in question needs to be carefully assessed.
The third step is the construction of the counter-factual likelihood distribution p0. All the issues of model fidelity discussed above of course apply here as well, with the additional complication that counter-factual observations, against which one might evaluate the model, do not exist. One might use historical observations instead, but those will be highly limited and perhaps nonexistent for the extreme of interest, and the assumption needs to be made that observed climate change is identical to anthropogenic climate change. If the climate model is coupled, then the attribution of differences between the factual and counter-factual climates is clear (assuming the imposed greenhouse gas changes are entirely anthropogenic), but to speed up computations, often the sea-surface temperatures (SSTs) are imposed in an atmosphere-only model. The typical choice is to use observed SSTs for the factual and define the counter-factual SSTs by subtracting an SST anomaly taken from coupled model simulations of climate change. Sensitivity to the choice of the latter must be assessed. Moreover, if the observed SSTs were important for inducing the particular extreme in question, then the attribution is conditional on this situation, and that too must be accounted for. See [17•] for an explicit example.
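Schematically, the construction of the counter-factual boundary conditions reduces to subtracting a model-derived warming pattern from the observed SSTs. The sketch below is a hypothetical outline only (the function and array names are invented); the essential point is the loop over warming patterns from different coupled models, which is how the sensitivity to that choice would be assessed.

```python
# A schematic sketch of counter-factual SST construction for an
# atmosphere-only attribution experiment. All names are hypothetical;
# real studies use gridded observed SSTs and coupled-model change patterns.
import numpy as np

def counterfactual_sst(observed_sst: np.ndarray,
                       warming_pattern: np.ndarray) -> np.ndarray:
    """Observed SSTs minus an anthropogenic SST-change pattern
    estimated from coupled-model simulations of climate change."""
    return observed_sst - warming_pattern

# Sensitivity to the choice of pattern must be assessed, e.g. by
# generating one counter-factual per coupled model:
# counterfactuals = [counterfactual_sst(obs_sst, p) for p in model_patterns]
```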
There are also philosophical issues. The risk-based approach uses concepts developed in epidemiology. In that context, attribution involves analysis of a population, and the question is asked whether the observed data are more consistent with an outbreak of an infectious disease (say) than with noise. That corresponds to the classic detection-attribution question in climate science. But if the attribution question concerns a single event, then the analogy with epidemiology no longer holds. Moreover, the observed event is only used to motivate the choice of event class, and confrontation with observations is not an intrinsic part of the analysis, as it is with detection-attribution. (Observations may be used to establish confidence in the climate model, but are not explicitly used for hypothesis testing.) The results therefore pertain very much to ‘model world’, and their physical connection to the actual event is not immediate. If there is a reliable long-term data record, this issue can be addressed by couching the event attribution within a more traditional detection-attribution framework, as illustrated by [19] for annual-mean Central England Temperature. However, this will generally constrain the spatiotemporal footprint of the event and will be limited to situations where such long-term records exist and exhibit attributable trends.
There is furthermore the question of interpretation. Classically, there are two kinds of causation: necessary and sufficient. These concepts have probabilistic analogues [20]. Necessary causation means the effect could not have occurred without the factor in question, but it may be that other factors were also necessary. As already noted, this is generally going to be the case with extremes, because extreme meteorological variability is usually required in order to be in the tail of the PDF—as in Fig. 3a. An important point is that with only necessary causation, there is no predictive power for single events; if the factor in question recurs, then the effect may not recur because it depends on the presence of other factors. (Strictly speaking, there is predictive power for nonevents in the counter-factual world, but that is not particularly useful information.) This situation may be contrasted with sufficient causation, where the factor in question is enough to make the effect occur irrespective of other factors. The latter situation is illustrated in Fig. 3b; here, what is extreme in the counter-factual world is normal in the factual world, and perhaps should not even be called an extreme at all. There is moreover predictive power for single events, because one can expect these so-called extremes (relative to the counter-factual) to recur frequently. This is increasingly the case with summertime continental extreme temperatures, as shown in Fig. 4. Confusion will ensue if the distinction between the two kinds of causation is not recognized, but commonly used extreme-event attribution measures such as the fraction of attributable risk (FAR) only reflect necessary causation and do not distinguish between the two [22••].
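These distinctions can be made quantitative. In the probabilistic causal framework of [20] (see also [22••]), the probabilities of necessary causation (PN) and sufficient causation (PS) have simple expressions in terms of p0 and p1 under exogeneity and monotonicity assumptions. The sketch below evaluates them for Gaussian stand-ins for the small-shift and large-shift cases of Fig. 3a, b; all numbers are illustrative.

```python
# Probabilities of necessary (PN) and sufficient (PS) causation for
# Gaussian stand-ins for Fig. 3a (small shift) and Fig. 3b (large shift).
# Illustrative numbers only; formulas assume exogeneity and monotonicity.
from scipy.stats import norm

def causation_measures(p0: float, p1: float):
    far = 1.0 - p0 / p1                            # fraction of attributable risk
    pn = max(0.0, 1.0 - p0 / p1)                   # necessary causation
    ps = max(0.0, 1.0 - (1.0 - p1) / (1.0 - p0))   # sufficient causation
    return far, pn, ps

threshold = 2.0
for shift, label in [(0.5, "small shift (Fig. 3a)"), (4.0, "large shift (Fig. 3b)")]:
    p0 = norm.sf(threshold)           # counter-factual exceedance probability
    p1 = norm.sf(threshold - shift)   # factual exceedance probability
    far, pn, ps = causation_measures(p0, p1)
    print(f"{label}: FAR = {far:.2f}, PN = {pn:.2f}, PS = {ps:.2f}")
```

In this illustration, PS is near zero for the small shift but large for the large shift, whereas FAR coincides with PN in both cases, consistent with the point that FAR reflects only necessary causation.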
Dynamic and Thermodynamic Mechanisms
As discussed earlier, there is a striking difference between the robustness of the purely thermodynamic aspects of climate change and that of the dynamic aspects involving the atmospheric or oceanic circulation. The former are quite certain, the latter highly uncertain [14•]. At the regional scale, the thermodynamic aspects are strongly modulated by the dynamic aspects, so the latter must be taken into account. Part of the issue is the relatively small signal-to-noise of the circulation changes expected from models [23••]—although there are regional exceptions [24]—and part is the general non-robustness of the circulation response in models (Fig. 4 of [14•], [25]). The difficulty is compounded by the fact that the forced circulation response can be expected to project on the modes of variability [26], so is difficult to separate from the noise using fingerprinting methods, and is not well constrained theoretically [27].
Given this situation, a number of researchers have attempted to separate the thermodynamic from the dynamic aspects in explaining the behaviour of observed extremes. [28••] examined the cold European winter of 2010 and argued that once one accounted for the anomalous circulation regime, including record persistence of a negative North Atlantic Oscillation (NAO) index, the winter was anomalously warm, in line with a warming climate. The results are illustrated in Fig. 5. Diffenbaugh et al. [29•] examined the recent California drought and showed that whilst there was no apparent change in observed precipitation, the systematic warming over the past century meant that dry years were now almost invariably also warm years (thereby increasing the propensity for drought), whereas in the past, the combination of the two conditions was less common. California precipitation is controlled by dynamical processes related to the storm track, and its future evolution is therefore highly uncertain [30]. Thus, even without a clear prediction of precipitation changes, [29•] argue that the risk of drought in California is increasing. In both cases, the authors regard the thermodynamic aspects of the observed changes as certain, and the dynamic aspects as uncertain and probably best interpreted as natural variability.
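A minimal sketch of the circulation-analogue calculation that underlies such dynamical conditioning is given below: the temperature ‘expected’ for a given circulation is estimated from the closest historical circulation analogues, and the residual is interpreted as the thermodynamic contribution. The array shapes and the Euclidean pattern distance are simplifying assumptions; studies such as [28••] use gridded reanalysis fields and more careful analogue selection.

```python
# A minimal circulation-analogue sketch: estimate the temperature
# expected for the event's circulation from its nearest historical
# analogues. Hypothetical inputs; real studies use reanalysis fields.
import numpy as np

def analogue_temperature(event_slp: np.ndarray,   # (space,) flattened SLP field
                         hist_slp: np.ndarray,    # (time, space) historical fields
                         hist_temp: np.ndarray,   # (time,) co-located temperatures
                         n_analogues: int = 20) -> float:
    # Euclidean distance between the event circulation and each
    # historical circulation pattern
    dist = np.linalg.norm(hist_slp - event_slp, axis=1)
    nearest = np.argsort(dist)[:n_analogues]
    return float(hist_temp[nearest].mean())

# Thermodynamic residual = observed event temperature minus the
# temperature expected for that circulation:
# residual = event_temp - analogue_temperature(event_slp, hist_slp, hist_temp)
```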
If an extreme event was mainly caused by purely thermodynamic processes, then the risk-based analysis using a climate model is probably reliable and a strong attribution statement can be made. If, on the other hand, an extreme event was caused in part by extreme dynamical conditions, then any risk-based analysis using a climate model also has to address the question of whether the simulated change in the likelihood or severity of such conditions is credible. Without attributed observed changes, or a theoretical understanding of what to expect, or a robust prediction from climate models, this would seem to be an extremely challenging prospect. And if plausible uncertainties are placed on those changes, then the result is likely to be ‘no effect detected’. This is indeed what tends to be concluded in event attribution studies of dynamically driven extremes [31]. But absence of evidence is not evidence of absence. Can we do better?
The Storyline Approach
Since climate change is an accepted fact [15•], it should no longer be necessary to detect climate change; rather, the question (for extreme event attribution) is what is the best estimate of the contribution of climate change to the observed event. In this case, effect size is the more relevant question than statistical significance [32]. Trenberth et al. [16••] argue that a physical investigation of how the event unfolded, and how the different contributing factors might have been affected by known thermodynamic aspects of climate change, is the more effective approach when the risk-based approach yields a highly uncertain outcome. This storyline approach, which is analogous to accident investigation (where multiple contributing factors are generally involved and their roles are assessed in a conditional manner), was employed by [13••] to investigate the 2011 Texas drought/heat wave. Although [13••] emphasized the dominance of natural variability, specifically the precipitation deficit associated with anomalous Pacific SSTs, they estimated that about 0.7 °C (20 %) of the heat-wave magnitude relative to the 1981–2010 mean was attributable to anthropogenic climate change. Thus, the storyline approach can quantify the magnitude of the anthropogenic effect, but only for that particular event. This could be useful for liability, or for planning if historical events are used as benchmarks for resilience. (It may be difficult to convince people to invest in defences against a hypothetical risk, but easier to do so if an event has previously occurred so clearly could occur again, but potentially with more impact.)
A limitation of this approach is that it is only a partial attribution, in that it does not address the potential change in likelihood of the dynamical situation leading to the event. The counter-argument is that it is useful to distinguish between the dynamical and purely thermodynamic factors leading to the extreme event, as they have very different levels of uncertainty. Recognizing that distinction allows the risk-based and the storyline approaches to be cast within a common framework. If the extreme event was mainly the result of a dynamical situation conducive to that extreme, then one can represent the probability of the event in a conditional manner:
$$ P(E) = P(E \mid D)\,P(D) + P(E \mid \mathrm{ND})\,P(\mathrm{ND}) \qquad (1) $$
where E is the extreme event, D is the dynamical situation, and ND is not the dynamical situation (i.e. the complement of D). For small changes, the change in probability from climate change is then
$$ \delta P(E) = P(D)\,\delta P(E \mid D) + P(E \mid D)\,\delta P(D) + \delta\left[P(E \mid \mathrm{ND})\,P(\mathrm{ND})\right] = P(D)\,\delta P(E \mid D)\left\{1 + \frac{\delta P(D)/P(D)}{\delta P(E \mid D)/P(E \mid D)}\right\} + \delta\left[P(E \mid \mathrm{ND})\,P(\mathrm{ND})\right]. \qquad (2) $$
The risk-based approach estimates δP(E), or sometimes δP(E,D), the change in the joint occurrence of E and D (e.g. the combination of high temperature and anti-cyclonic circulation anomaly used by [8••] for the 2010 Russian heat wave). The dynamically conditioned attribution, in contrast, estimates δP(E|D); this is equivalent to the ‘circulation analogues’ approach described earlier and illustrated in Fig. 5, where anthropogenic changes in temperature for a particular extreme winter were estimated after conditioning on the circulation regime. The signal-to-noise of this estimate can be expected to be large, since the conditioning (especially if on a specific synoptic situation) eliminates most of the dynamical variability; the concept is illustrated in Fig. 3c. The product P(D)δP(E|D) is simply the change in probability of the extreme event, assuming no change in the occurrence of the dynamical situation that led to the event. The justification for the dynamically conditioned approach is that this change in occurrence, δP(D), is highly uncertain and is best assumed to be zero unless there are strong grounds for assuming otherwise [16••]. In any case, its impact on δP(E) depends on the ratio of the fractional change in the dynamical factor, δP(D)/P(D), to that in the thermodynamic factor, δP(E|D)/P(E|D). Since this ratio can generally be expected to be small, the neglect of this term is not unreasonable. There is also the last term in eq. 2, but assuming that D was a necessary condition for the occurrence of the extreme, it will be negligible. Since P(E,D) = P(E|D)P(D), neglect of the last term in eq. 2 is implicit in the risk-based approach when an event is defined by the joint occurrence P(E,D).
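A back-of-envelope numeric illustration of eq. 2 may help fix ideas. All probabilities and increments below are invented for illustration; δP(D) is set to zero following [16••], and the last term is neglected on the assumption that D is (nearly) necessary for E.

```python
# Back-of-envelope evaluation of eq. 2. All numbers are invented
# for illustration; D = dynamical situation, E = extreme event.
P_D = 0.05            # climatological occurrence of the dynamical situation
P_E_given_D = 0.20    # chance of the extreme, given that situation
dP_E_given_D = 0.08   # conditioned (thermodynamic) change: relatively certain
dP_D = 0.0            # dynamical change: assumed zero, following [16]
dP_last = 0.0         # last term: negligible if D is (nearly) necessary for E

dP_E = P_D * dP_E_given_D + P_E_given_D * dP_D + dP_last
print(f"delta P(E) = {dP_E:.4f}")  # 0.0040, entirely from the conditioned term
```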
It may be noted that this approach is analogous to specifying the meteorology and quantifying the impact of a chemical change on atmospheric composition, which assumes that the composition change is too small to appreciably affect the meteorology. This was used by [33] to quantify the contribution of changes in ozone-depleting substances to the observed total-ozone record on a year-by-year basis, i.e. deterministically, rather than only statistically as would be the case with a free-running model.
For a weather extreme that is predictable, the dynamically conditioned approach can be implemented within a weather model that is capable of simulating the extreme in question. This is one of the great advantages of the approach: one can obtain a credible estimate of δP(E|D). The main uncertainty probably lies in the specification of the counter-factual thermodynamic environment, but that is an issue for any attribution study. The concept is illustrated in Fig. 6, which shows the impact of cooler SSTs on re-forecasts of hurricane Sandy. Of course, since the atmosphere is chaotic, any small difference in conditions will lead to a difference in the outcome, and if the observed outcome was extreme, then one might generically expect a weakened extreme from any perturbation. This potential pitfall can easily be guarded against by also making a perturbation in the opposite direction (see the toy sketch following this paragraph). Lackmann [35] applied this approach to Sandy, finding that the hurricane’s intensity would have been slightly weaker had it occurred in 1900, but would be substantially greater if it re-occurred in 2100. Another application of this approach is that of [36•], who ran a nested convection-resolving model, constrained by the large-scale circulation, to simulate the 2012 Krymsk precipitation extreme. Remarkably, they identified a bifurcation whereby the mesoscale system leading to the extreme could only occur for Black Sea temperatures above a certain threshold (which they argued was of anthropogenic origin).
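The opposite-sign check can be illustrated with a toy Monte Carlo; the ‘intensity model’ below is an invented stand-in for re-forecasts with a real weather model. A genuine SST effect produces opposite-signed intensity shifts about the control, whereas generic chaotic divergence from an observed extreme would weaken the event for perturbations of either sign.

```python
# Toy Monte Carlo of the two-sided perturbation check. The linear
# 'intensity' response and all numbers are invented for illustration.
import numpy as np

rng = np.random.default_rng(1)

def reforecast_intensity(delta_sst: float, n: int = 1000) -> np.ndarray:
    # invented response: intensity increases with SST, plus chaotic noise
    return 50.0 + 8.0 * delta_sst + rng.normal(0.0, 5.0, size=n)

cooler = reforecast_intensity(-0.5).mean()   # counter-factual-style cooling
control = reforecast_intensity(0.0).mean()
warmer = reforecast_intensity(+0.5).mean()   # opposite-sign perturbation
print(f"cooler: {cooler:.1f}  control: {control:.1f}  warmer: {warmer:.1f}")
# Opposite-signed shifts about the control indicate a real SST effect
# rather than mere regression to the mean from an extreme realization.
```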
As illustrated by Fig. 3c, conditioning on the dynamical situation leading to the event can convert necessary causation to sufficient causation, in which case even a single event can distinguish between alternative hypotheses. For example, [37] argued that the exceptionally warm European fall/winter of 2006/2007 could not have occurred, in conjunction with the observed circulation anomaly, without anthropogenic warming. This then ties the attribution directly to the observed event, rather than being only probabilistic. The direct confrontation with data as an essential component of the attribution is a very attractive feature of this approach, as is its emphasis on a physically based causal narrative.
It may be that circulation changes are expected to be important in the future occurrence of an extreme. An example is provided in Fig. 7, where the spread in future cold-season Mediterranean drying—a model prediction with enormous socioeconomic implications for Europe—across the CMIP5 models is almost entirely explained by the spread in the circulation response [38]. In this case, eq. 2 is still informative because it allows one to separately estimate the uncertainty associated with the thermodynamic and dynamic aspects of climate change. Particular choices of δP(D) could be considered as plausible storylines.
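In that spirit, plausible storylines of circulation change can be screened by repeating the eq. 2 arithmetic over a range of assumed δP(D); the numbers below are again invented, continuing the illustrative sketch given after eq. 2.

```python
# Screening hypothetical storylines of dynamical change via eq. 2.
# All numbers are invented for illustration.
P_D, P_E_given_D, dP_E_given_D = 0.05, 0.20, 0.08

for dP_D in (-0.02, 0.00, +0.02):  # assumed storylines of circulation change
    dP_E = P_D * dP_E_given_D + P_E_given_D * dP_D
    print(f"storyline dP(D) = {dP_D:+.2f}: delta P(E) = {dP_E:+.4f}")
```

Even modest assumed changes in P(D) can rival the thermodynamic term, which is precisely why separating the two sources of uncertainty is informative.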