Skip to main content

Advertisement

Log in

DADA: data assimilation for the detection and attribution of weather and climate-related events

  • Published:
Climatic Change Aims and scope Submit manuscript

Abstract

We describe a new approach that allows for systematic causal attribution of weather and climate-related events, in near-real time. The method is designed so as to facilitate its implementation at meteorological centers by relying on data and methods that are routinely available when numerically forecasting the weather. We thus show that causal attribution can be obtained as a by-product of data assimilation procedures run on a daily basis to update numerical weather prediction (NWP) models with new atmospheric observations; hence, the proposed methodology can take advantage of the powerful computational and observational capacity of weather forecasting centers. We explain the theoretical rationale of this approach and sketch the most prominent features of a “data assimilation–based detection and attribution” (DADA) procedure. The proposal is illustrated in the context of the classical three-variable Lorenz model with additional forcing. The paper concludes by raising several theoretical and practical questions that need to be addressed to make the proposal operational within NWP centers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Allen MR (2003) Liability for climate change. Nature 421:891–892

    Article  Google Scholar 

  • Baum LE, Petrie T, Soules G, Weiss N (1970) A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann Math Stat 41(1):164–171

    Article  Google Scholar 

  • Balmaseda MA, Alves OJ, Arribas A, Awaji T, Behringer DW, Ferry N, Fujii Y, Lee T, Rienecker M, Rosati T, Stammer D (2009) Ocean initialization for seasonal forecasts. Oceanography Special Issue 22(3)

  • Bengtsson L, Ghil M, Källén E (1981) Dynamic meteorology: Data assimilation methods. Springer-Verlag, New York

    Book  Google Scholar 

  • Bhend J, Franke J, Folini D, Wild M, Brönnimann S (2012) An ensemble-based approach to climate reconstructions. Clim Past 8:963–976

    Article  Google Scholar 

  • Bocquet M, Pires CA, Wu L (2010) Beyond Gaussian statistical modeling in geophysical data assimilation. Mon Wea Rev 138:2997–3023

    Article  Google Scholar 

  • Bocquet M (2012) Parameter-field estimation for atmospheric dispersion: application to the Chernobyl accident using 4D-Var. Quart J Roy Meteor Soc 138:664–681

    Article  Google Scholar 

  • Bucklew JA (2004) Introduction to rare event simulation. Springer

  • Carrassi A, Vannitsem S (2010) Model error and variational data assimilation: A deterministic formulation. Mon Wea Rev 138:3369–3386

    Article  Google Scholar 

  • Carrassi A, Ghil M, Trevisan A, Uboldi F (2008) Data assimilation as a nonlinear dynamical systems problem: Stability and convergence of the prediction-assimilation system. Chaos: An Interdisciplinary Journal of Nonlinear Science 18(2):023–112

    Article  Google Scholar 

  • Chekroun MD, Simonnet E, Ghil M (2011) Stochastic climate dynamics: Random attractors and time-dependent invariant measures. Phys D 240(21):1685–1700. doi:10.1016/j.physd.2011.06.005

    Article  Google Scholar 

  • Chevallier F (2013) On the parallelization of atmospheric inversions of CO2 surface fluxes within a variational framework. Geosci Model Dev Discuss 6:37–57

    Article  Google Scholar 

  • Christidis N, Stott PA, Scaife A A, Arribas A, Jones G S, Copsey D, Knight J R, Tennant W J (2013) A New HadGEM3-A-Based System for Attribution of Weather- and Climate-Related Extreme Events. J Clim 26(9):2756–2783

    Article  Google Scholar 

  • Cosme E, Brankart JM, Verron J, Brasseur P, Krysta M (2006) Implementation of a reduced-rank, square-root smoother for ocean data assimilation. Ocean Model 33:87–100

    Article  Google Scholar 

  • Dalcher A, Kalnay E, Hoffman RN (1988) Medium-range lagged average forecasts. Mon Wea Rev 116:402–416. doi:10.1175/1520-0493. 1988116<0402:MRLAF>2.0.CO;2.

    Article  Google Scholar 

  • Del Moral P, Garnier J (2005) Genealogical particle analysis of rare events. Ann Appl Probab 15(4):2496–2534

    Article  Google Scholar 

  • Evensen G (2003) The ensemble Kalman filter: theoretical formulation and practical implementation. Ocean Dyn 53:343–367

    Article  Google Scholar 

  • Gardiner C (2004) Handbook of stochastic methods for physics, Chemistry and the natural sciences. Publisher. Pls.; no web tonite

  • Gelb A (1974) Applied optimal estimation. M.I.T. Press, Cambridge

    Google Scholar 

  • Ghil M, Childress S (1987) Topics in geophysical fluid dynamics: Atmospheric dynamics, dynamo theory and climate dynamics. Springer-Verlag, New York, p 485

    Book  Google Scholar 

  • Ghil M, Malanotte-Rizzoli P (1991) Data assimilation in meteorology and oceanography. Adv Geophys 33:141–266

    Article  Google Scholar 

  • Ghil M, Cohn S, Tavantzis J, Bube K, Isaacson E (1981). In: Bengtsson L, Ghil M, Källén E (eds) Applications of estimation theory to numerical weather prediction. In: Dynamic meteorology: Data assimilation methods. Springer Verlag, pp 139–224

  • Gini C (1921) Measurement of inequality of incomes. Econ J 31(121):124–126. doi:10.2307/2223319

    Article  Google Scholar 

  • Greenland S, Rothman KJ (1998) Measures of effect and measures of association, Chapter 4. In: Rothman K J, Greenland S (eds) Modern Epidemiology, 2nd edn.,Lippincott-Raven, Philadelphia, USA

  • Hannart A, Pearl J, Otto FEL, Naveau P, Ghil M (2015) Counterfactual causality theory for the attribution of weather and climate-related events. Bull Am Meteorol Soc. in press

  • Harris T E, Kahn H (1951) Estimation of particle transmission by random sampling. Natl Bur Stand Appl Math Ser 12:27–30

    Google Scholar 

  • Heidelberg P (1995) Fast simulation of rare events in queueing and reliability models. ACM Trans Models Comput Simul 5:43–85

    Article  Google Scholar 

  • Hewitt C, Mason S, Walland D (2012) The global framework for climate services. Nat Clim Change 2:831–832

  • Hoffman RN, Kalnay E (1983) Lagged average forecasting, an alternative to Monte Carlo forecasting. Tellus 35A:100–118. doi:10.1111/j.1600-0870.1983.tb00189.x

    Article  Google Scholar 

  • Houtekamer PL, Mitchell HL, Pellerin G, Buehner M, Charron M (2005) Atmospheric data assimilation with an ensemble Kalman filter: Results with real observations. Mon Wea Rev 133:604–620

    Article  Google Scholar 

  • Hürzeler M, Künsch HR (2001). In: Doucet A, De Freitas JFG, Gordon NJ (eds) Approximation and maximising the likelihood for a general state-space model. In: Sequential Monte Carlo Methods Practice. Springer-Verlag, New York

  • Ide K, Courtier P, Ghil M, Lorenc A (1997) Unified notation for data assimilation: Operational, sequential and variational. J Meteor Soc Japan 75:181–189

    Google Scholar 

  • Ihler AT, Kirshner S, Ghil M, Robertson AW, Smyth P (2007) Graphical models for statistical inference and data assimilation. Phys D 230:72–87

    Article  Google Scholar 

  • Jazwinski AH (1970) Stochastic and filtering theory. Mathematics in sciences and engineering series 64 :376

  • Kalman RE (1960) A new approach to linear filtering and prediction problems. J Basic Eng 82D:33– 45

    Google Scholar 

  • Kantas N, Doucet A, Singh SS, Maciejowski JM (2009) An overview of sequential Monte Carlo methods for parameter estimation. In: General state-space models, IFAC System Identification, no. Ml

  • Kondrashov D, Sun CJ, Ghil M (2008) Data assimilation for a coupled ocean-atmosphere model. Part II: Parameter estimation. Mon Wea Rev 136:5062–5076. doi:10.1175/2008MWR2544.1

    Article  Google Scholar 

  • Kondrashov D, Shprits Y, Ghil M (2011) Log-normal Kalman filter for assimilating phase-space density data in the radiation belts. Space Weather 9:S11006. doi:10.1029/2011SW000726

    Article  Google Scholar 

  • Lee TCK, Zwiers FW, Tsao M (2008) Evaluation of proxy-based millennial reconstruction methods. Climate Dyn 31:263–281

    Article  Google Scholar 

  • Lorenz EN (1963) Deterministic non-periodic flow. J Atmos Sci 20:130–141

    Article  Google Scholar 

  • Lorenz MO (1905) Methods of measuring the concentration of wealth. Publications of the American Statistical Association 9(70):209–219. doi:10.2307/2276207

    Article  Google Scholar 

  • Martin MJ, et al. (2014) Status and future of data assimilation in operational oceanography. J of Oper Ocean. in press

  • Massey N, Jones R, Otto FEL, Aina T, Wilson S, Murphy JM, Hassell D, Yamazaki YH, Allen MR (2014) weather@home — development and validation of a very large ensemble modelling system for probabilistic event attribution. Q J R Meteorol Soc. doi:10.1002/qj.2455

    Google Scholar 

  • Otto FEL, Boyd E, Jones RG, Cornforth RJ, James R, Parker HR, Allen MR (2015) Attribution of extreme weather events in Africa: a preliminary exploration of the science and policy implications. Climatic Change

  • Palmer TN (1999) A non-linear dynamical perspective on climate prediction. J Clim 12:575–591

    Article  Google Scholar 

  • Pearl J (2000) Causality: Models, reasoning and inference. Cambridge University Press, Cambridge

    Google Scholar 

  • Pitt MK (2002) Smooth particle filters for likelihood evaluation and maximisation. Warwick Economic Research Papers, No. 651

  • Robert C, Blayo E, Verron J (2006) Comparison of reduced-order sequential, variational and hybrid data assimilation methods in the context of a Tropical Pacific ocean model. Ocean Dyn 56:624–633

    Article  Google Scholar 

  • Roques L, Chekroun MD, Cristofol M, Soubeyrand S, Ghil M (2014) Parameter estimation for energy balance models with memory. Proc R Soc A 470:20140349

    Article  Google Scholar 

  • Ruiz J, Pulido M, Miyoshi T (2013) Estimating model parameters with ensemble-based data assimilation: A review. JMSJ 91(2):79–99

    Article  Google Scholar 

  • Sakov P, Counillon F, Bertino L, Lister KA, Oke PR, Korablev A (2012) TOPAZ4: an ocean-sea ice data assimilation system for the North Atlantic and Arctic. Ocean Sci 8:633–656. doi:10.5194/os-8-633-2012

    Article  Google Scholar 

  • Stone DA, Allen MR (2005) The end-to-end attribution problem: from emissions to impacts. Clim Change 71:303–318

    Article  Google Scholar 

  • Stott PA, et al. (2013). In: Asrar GR, Hurrell J W (eds) Attribution of weather and climate-related events, in: Climate Science for Serving Society: Research, Modelling and Prediction Priorities. Springer. in press

  • Stott PA, Stone DA, Allen MR (2004) Human contribution to the European heatwave of 2003. Nature 432:610–614

    Article  Google Scholar 

  • Talagrand O (1997) Assimilation of observations, an introduction. J Meteor Soc Japan 75(1B):191–209

    Google Scholar 

  • Tandeo P, Pulido M, Lott F (2014) Offline parameter estimation using EnKF and maximum likelihood error covariance estimates: Application to a subgrid-scale orography parametrization. Q J R Meteorol Soc. doi:10.1002/qj.2357

    Google Scholar 

  • Trenberth K E, Fasullo J T, Shepherd T G (2015) Attribution of climate extreme events. Nature Clim Change 5:725–730

    Article  Google Scholar 

  • Wiener N (1949) Extrapolation, Interpolation and smoothing of stationary time series, with engineering applications. M.I.T. Press, Cambridge, p 163

    Google Scholar 

  • Wouters J, Bouchet F (2015) Rare event simulation of the chaotic Lorenz 96 dynamical system. Geophysical Research Abstracts, EGU General Assembly 2015 Vol. 17, EGU2015-10421-1

Download references

Acknowledgements

It is a pleasure to thank Fredi Otto and Dáithí Stone, who provided careful and constructive reviews of the original paper. This work has been supported by grant DADA from the Agence Nationale de la Recherche (ANR, France: AH and all co-authors) and by the Multi-University Research Initiative (MURI) N00014-12-1-0911 from the the U.S. Office of Naval Research (MG).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Hannart.

Appendices

Appendix A Illustration of the computational benefit of the DADA approach

To illustrate the computational benefit, let Y be for instance a d-variate autoregressive process defined by Y t+1 = A Y t +w t , where w t is an i.i.d. noise having known PDF g(⋅) and where A has the usual properties that insure stationarity (Gardiner 2004). We then have:

$$\begin{array}{@{}rcl@{}} && f(\mathbf{y}) = \prod\limits_{t=1}^{T} g(\mathbf{y}_{t}-\mathbf{A}\mathbf{y}_{t-1})\pi(\mathbf{y}_{0})\,, \end{array} $$
(7a)
$$\begin{array}{@{}rcl@{}} && P(\phi(\mathbf{Y})\geq u) = {\int}_{\phi(\mathbf{y})\geq u} \prod\limits_{t=1}^{T} g(\mathbf{y}_{t}-\mathbf{A}\mathbf{y}_{t-1}) \pi(\mathbf{y}_{0}) \mathrm{d}y_{1,0} {\ldots} \mathrm{d}y_{d,0} {\ldots} \mathrm{d}y_{d,T}\,, \end{array} $$
(7b)

with π(⋅) the prior PDF on the initial state Y 0. Equation 7a shows that f(y) can be easily computed using a closed-form expression, while P(ϕ(Y)≥u) in Eq. 7b is an integral on d×T+1 dimensions which must instead be evaluated by using, for instance, a computationally quite costly Monte-Carlo (MC) simulation.

Appendix B Data Assimilation

The state-estimation problem for the system given by Eq. 4a and 4b has an exact solution given by the following sequential Kalman filter (KF) equations:

$$\begin{array}{@{}rcl@{}} {\mathbf{x}^{a}_{t}} &=& {\mathbf{x}^{f}_{t}} +\mathbf K(\mathbf{y}_{t}-\mathbf H {\mathbf{x}^{f}_{t}}) \,, \end{array} $$
(8a)
$$\begin{array}{@{}rcl@{}} \mathbf{P^{a}_{t}} &=& (\mathbf I -\mathbf K\mathbf H) \mathbf{P^{f}_{t}} \,, \end{array} $$
(8b)
$$\begin{array}{@{}rcl@{}} \mathbf{x}^{f}_{t+1} &=& \mathbf M {\mathbf{x}^{a}_{t}} \,, \end{array} $$
(8c)
$$\begin{array}{@{}rcl@{}} \mathbf P^{f}_{t+1} &=& \mathbf M \mathbf{P^{a}_{t}}\mathbf M^{\prime} + \mathbf Q \,. \end{array} $$
(8d)

where \(^{\prime }\) denotes the transpose operation. Here Eq. 8a and 8b are referred to as the analysis step and denoted by a superscript a, while the forecast step is given by Eq. 8c and 8d, and is denoted by a superscript f (Ide et al. 1997). The vector \({\mathbf {x}^{a}_{t}}\) and the matrix \(\mathbf {P^{a}_{t}}\) are the mean and covariance of X t conditional on (Y 1,...,Y t )=(y 1,...,y t ); \(\mathbf K=\mathbf {P^{f}_{t}}\mathbf H^{\prime }(\mathbf H\mathbf {P^{f}_{t}}\mathbf H^{\prime }+\mathbf R)^{-1}\) is the so-called Kalman gain matrix; while Q and R are the covariances associated with v t and w t , respectively. Following (Wiener 1949), one distinguishes between filtering, in which \({\mathbf {x}^{a}_{t}}\) and \(\mathbf {P^{a}_{t}}\) are conditioned only on the previous and current observations (y 0,...,y t ), and smoothing, in which they are conditioned on the entire sequence, 0≤tT. Furthermore, the sequential algorithm needs to be initialized at time t=0 with \({\mathbf {x}_{0}^{f}}\) and \(\mathbf {P}_{0}^{f}\), which thus represent the a priori mean and covariance of X 0, respectively, and have to be prescribed by the user.

Appendix C Derivation of the model evidence

In this appendix, we outline the derivation of model evidence within a general Bayesian framework, and we apply the latter to the narrower KF context to obtain Eq. 5. Consider two consecutive cycles of a DA run, the first with state vector x t and observation vector y t at instant t and the subsequent one with state vector x t+1 and observation vector y t+1 at instant t+1. We plan to find a tractable expression for the model evidence p(y t ,y t+1).

The model evidence provided by the full sequence of observations y=(y 0,...,y T ) will be inferred by recursion, using the results of this two-observation setting. In order to decouple the two cycles, one first has to spell out the Bayesian inference p(y t ,y t+1) = p(y t )p(y t+1|y t ). We look for a tractable expression for p(y t+1|y t ) by further introducing the states x t+1 and x t as intermediate random variables:

$$\begin{array}{@{}rcl@{}} \begin{array}{lll} p(\mathbf{y}_{t+1} | \mathbf{y}_{t}) &=& {\int}_{\mathbf{x}_{t+1}} \! p(\mathbf{y}_{t+1} | \mathbf{y}_{t}, \mathbf{x}_{t+1}) p(\mathbf{x}_{t+1} | \mathbf{y}_{t})\,\mathrm{d}\mathbf{x}_{t+1} \\ & = & {\int}_{\mathbf{x}_{t+1}} \! p(\mathbf{y}_{t+1} | \mathbf{x}_{t+1}) \left\{{\int}_{\mathbf{x}_{t}} \! p(\mathbf{x}_{t+1} | \mathbf{x}_{t})\, p(\mathbf{x}_{t} | \mathbf{y}_{t})\,\mathrm{d}\mathbf{x}_{t}\right\} \mathrm{d}\mathbf{x}_{t+1}\,, \end{array} \end{array} $$
(9)

where p(y t+1|x t+1) is the likelihood of the observation vector y t+1 conditional on the state vector x t+1 and it is known from Eq. 4b. The conditional PDF p(x t |y t ) of x t on y t at instant t — which appears on the right-hand side of the above equation — is referred to as the analysis PDF in the DA literature, where it is denoted by a superscript a (Ide et al. 1997), and it constitutes the main DA output. The integral \({\int }_{\mathbf {x}_{t}} \! p(\mathbf {x}_{t+1} | \mathbf {x}_{t}) p(\mathbf {x}_{t} | \mathbf {y}_{t})\,\mathrm {d}\mathbf {x}_{t}=p(\mathbf {x}_{t+1} | \mathbf {y}_{t})\), in which p(x t+1|x t ) is known from the model dynamics given by Eq. 4a, propagates this analysis PDF further in time, to instant t+1. Hence, the result of this integration coincides with the forecast PDF, denoted by superscript f in the DA literature (Ide et al. 1997). It follows that this decomposition is tractable using a DA scheme that is able to estimate the conditional and forecast PDFs.

Next, let us apply the general Bayesian inference (9) to the case in which all the PDFs involved are Gaussian; this requires, in turn, that both the dynamics and observation models M and H be linear, and that the input statistics all be Gaussian. In this case, the Kalman filter allows for the exact computation of the PDFs mentioned in Eq. 9, which turn out to be Gaussian.

In the following, \({\mathcal {N}}(\overline {\mathbf {x}},\mathbf {P})\) designates the Gaussian PDF of mean \(\overline {\mathbf {x}}\) and covariance matrix P. In this context, the analysis PDF at instant t is \({\mathcal {N}}({\mathbf {x}^{a}_{t}},{\mathbf {P}^{a}_{t}})\), where \({\mathbf {x}^{a}_{t}}\) and \({\mathbf {P}^{a}_{t}}\) are the analysis state and error covariance matrix at instant t. As a result of the linearity assumptions, the forecast PDF at instant t+1 is given by a Gaussian distribution \({\mathcal {N}}(\mathbf {x}^{f}_{t+1},\mathbf {P}^{f}_{t+1})\), where \(\mathbf {x}^{f}_{t+1}\) and \(\mathbf {P}^{f}_{t+1}\) are the forecast state and error covariance matrix at instant t+1. Further, the integration on x t+1 in Eq. 9 can readily be performed under these circumstances, with the outcome that p(y t+1|y t ) is distributed as \({\mathcal {N}}(\mathbf {H}\,\mathbf {x}^{f}_{t+1},\mathbf {R}+\mathbf {H}\,\mathbf {P}^{f}_{t+1}\,\mathbf {H}^{\prime })\).

The desired model evidence f(y) can then be computed by recursion on successive time steps as:

$$\begin{array}{@{}rcl@{}} f(\mathbf{y}) = p(\mathbf{y}_{0})\,\prod\limits_{t=1}^{T} (2\pi)^{-\frac{d}{2}}|\boldsymbol{\Sigma}_{t}|^{-\frac{1}{2}} \exp \left\{ -\frac{1}{2}(\mathbf{y}_{t}-{\mathbf{H}\mathbf{x}^{f}_{t}})'\boldsymbol{\Sigma}_{t}^{-1}(\mathbf{y}_{t}-{\mathbf{H}\mathbf{x}^{f}_{t}}) \right\}\,; \end{array} $$
(10)

here p(y 0) represents the prior PDF of the initial state, \(\boldsymbol {\Sigma }_{t} = \mathbf {R}+{\mathbf {H}\mathbf {P}^{f}_{t}}\mathbf {H}^{\prime }\), and this expression coincides with Eq. 5 and can be evaluated with the help of any DA method that yields the forecast states and forecast error covariance matrices, such as the KF or the EnKF. Note that the traditional standard Kalman smoother would give the same result as the KF, since they share the same forecasts.

Finally, Eqs. 9 and 10 above show that the likelihood f(y) may be obtained as a by-product of the inference on the state vector x, which usually is the main purpose in numerical weather prediction. This idea may actually be highlighted in even greater generality by considering the equality:

$$ {f(\mathbf{y}) = \frac{p(\mathbf{y}\vert\mathbf{x}) p(\mathbf{x})}{p(\mathbf{x}\vert\mathbf{y})}\,.} $$
(11)

While Eq. 11 is a direct consequence of Bayes theorem, it also illustrates a point that is arguably not so intuitive. The likelihood f(y) is obtained here as the ratio of two quantities: a numerator p(y|x)p(x) that is a model premise inherently postulated by Eq. 4a and 4b, and a denominator p(x|y) that may be viewed as the end result of the primary inferxence on x. In other words, estimating f(y) requires only a straightforward division, provided x has been previously inferred.

Equation 11 thus expresses with great clarity and simplicity a fundamental idea buttressing our proposal, as it provides a general theoretical justification for the suggestion of deriving the likelihood from an inferential treatment that focuses on x. To put it succintly, this equation basically says, “He who can do more can do less.” In the context of DA, whose end purpose is to infer the state vector x out of an observation y — i.e., the more part — it is possible to obtain the likelihood as a by-product thereof — i.e., the less part — and thus almost for free.

Appendix D PDF of the state vector

We associate a label ω∈Ω with each realization of the random process v t that drive the model given by Eq. 6. The PDF of the state vector x t can be obtained as the numerical solution of the corresponding Fokker-Planck equation, and it is the mean over Ω of the sample measures obtained for each realization ω of the noise v t and (Chekroun et al. 2011, and references therein). Each sample measure is supported on a random attractor that may have very fine structure and be time-dependent (Chekroun et al. 2011, Figs. 1–3 and supplementary material), but the PDF is supported smoothly, in the counterfactual world in which λ 0=0, on a “thickened” version of the fairly well-known strange attractor of the original L63 model. The latter PDF represents its attractor in dynamic system’s terminology.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hannart, A., Carrassi, A., Bocquet, M. et al. DADA: data assimilation for the detection and attribution of weather and climate-related events. Climatic Change 136, 155–174 (2016). https://doi.org/10.1007/s10584-016-1595-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10584-016-1595-3

Keywords

Navigation