1 Introduction

Forecasting and incorporating future climate conditions is key to improving community resilience and promoting engineering adaptation to climate change (ASCE-CACC 2018; Moss et al. 2019; Wright et al. 2019). Future climate information can be obtained from global climate models (GCMs; USGCRP (2017)), although their projections exhibit large uncertainty (Knutti and Sedláček 2013; Nissan et al. 2019) and are challenging to incorporate into engineering practice (Douglas et al. 2017). Substantial progress has been made in improving and employing climate models (Eyring et al. 2016), including the recently released results from the Coupled Model Intercomparison Project phase 6 (CMIP6; Eyring et al. (2016)) with Shared Socioeconomic Pathways (SSPs; O’Neill et al. (2017)). However, practical applications of climate model projections (especially in engineering) remain challenging (Douglas et al. 2017; Moss et al. 2019; Wright et al. 2021) and are subject to limitations in temporal and spatial precision (Nissan et al. 2019) and to large uncertainty (Steinschneider et al. 2015a; Cook et al. 2020; Lopez-Cantu et al. 2020; Helmrich and Chester 2022; Lai et al. 2022).

Flexible engineering adaptation strategies have been proposed and assessed in recent studies (Pozzi et al. 2017; Hui et al. 2018; Fletcher et al. 2019; Cohen and Herman 2021) as an alternative to traditional, rigid methods. Compared to the traditional approach of making fixed, long-term decisions on adaptation efforts (many types of infrastructure have design lives of over 100 years (ASCE-CACC 2015)), flexible strategies introduce flexibility into engineering decision-making (Chester and Allenby 2019), e.g., by providing future expansion and modification options. These strategies allow engineers to delay decisions or revisit adaptation sequentially, leverage the anticipated reduction of climate uncertainty in the future, and increase the benefit-to-cost ratios of infrastructure investments. Flexible strategies have been applied in engineering with respect to other forms of uncertainty, such as transportation demand (Fawcett et al. 2015) and economic development (Guma et al. 2009). With respect to climate uncertainty, flexible strategies have been studied in water resource management as optimal control problems (Herman et al. 2020) and in economic, adaptive decision-making, e.g., using real-option analyses (Guthrie 2019; Ginbo et al. 2021; Kim et al. 2022). Similar concepts have been named “adaptive” or “dynamic” (Hui et al. 2018; Herman et al. 2020); here, this strategy is referred to as “flexible” adaptation (in contrast to traditional, rigid approaches).

Probabilistic estimation of climate variables – quantifying the value of information (Memarzadeh and Pozzi 2016) from additional observations – is needed to facilitate flexible adaptation. Described as “learning scenarios” in Völz and Hinkel (2023), such probabilistic estimations – in contrast to existing, “static” projections such as those provided in the Intergovernmental Panel on Climate Change (IPCC) Assessment Reports (ARs) – can be made at future moments in time (e.g., in 2050) with additional observations, to assess the corresponding reduction of uncertainty. While other factors, such as scientific advances, can also contribute to the reduction of uncertainty, this work focuses on quantifying the reduction of uncertainty from observing climate change.

The main objective of this work is to develop a probabilistic modeling framework – based on climate science and reasonably simplified to improve efficiency – to investigate the expected reduction of climate uncertainty based on future temperature observations and to facilitate flexible adaptation strategies. As discussed in Völz and Hinkel (2023), many existing methods used to generate climate learning scenarios for flexible adaptation, such as real-option analyses, are simplified approaches that are inadequate for representing climate science; one promising option is to combine Bayesian approaches with climate models or with statistical approximation models. Time series models – which have been applied or developed for climate studies (Mudelsee 2010), such as the ARIMA-based (autoregressive integrated moving average) approaches in Lai and Dzombak (2020, 2021) – are combined in this work with a Bayesian method (which, similarly, has been used in studies such as Tebaldi and Sansó (2009), Steinschneider et al. (2015b), Hui et al. (2018), and Fletcher et al. (2019)) to provide such a probabilistic framework.

This framework models physical parameters to describe the global mean temperature anomaly considering multiple sources of uncertainty. Simplified energy-balance equations, describing the global temperature response to radiative forcing (Gregory et al. 2004; Lewis and Curry 2015; Meehl et al. 2020), serve as the basis for a parametric form of the state-space model (SSM) (Cummins et al. 2020). Similar physical equations have been used in integrated assessment models (Calel and Stainforth 2017) and for estimating parameters such as the climate sensitivity of different GCMs (Cummins et al. 2020). Importantly, these energy-balance equations, as described in the subsequent section, provide the foundation for modeling the uncertainty from several key sources, such as climate sensitivity, ocean heat uptake (Webster et al. 2008), and aerosol forcing (Myhre et al. 2013; Rotstayn et al. 2015). The inclusion of physical parameters in the modeling framework facilitates the assessment of these sources of uncertainty via Bayesian inference with informative priors. It should also be noted that the uncertainty from socioeconomic development and mitigation efforts (e.g., among SSP scenarios) is incorporated in the framework as individual radiative forcing time series; this work models these SSP scenarios separately, however, and does not address the reduction of uncertainty among SSP scenarios.

An SSM – integrated with physical parameters and calibrated via Bayesian inference – is consequently developed in this work to analyze time series of global mean temperature anomaly. The change of global mean temperature is an important climate change indicator that has been used in policy documents such as the Paris Agreement and linked with different levels of regional impacts (Arnell et al. 2019; He et al. 2022). The methodology and results of this work focus on the global mean temperature anomaly, although the SSM framework can be further modified to model changes in regional variables, e.g., based on pattern scaling (Tebaldi and Arblaster 2014) and on linear relationships modeling local response and variability (Beusch et al. 2020).

In addition to historical observations of the global mean temperature anomaly, the simulations from GCMs are used in this work to: (a) calibrate the SSM and (b) serve as synthetic observations (or “pseudo-observations” (Eyring et al. 2019)) to investigate the relative accuracy and the reduction of uncertainty in the future (in 2050 and 2080). The overall approach is described in Section 2. The results from the probabilistic inference of the physical parameters of GCMs using the SSM are discussed in Section 3. The posterior parameter distributions and the updated climate projections (using both pseudo- and historical observations) are discussed in Section 4. Summary, conclusions, and recommendations are provided in Section 5.

2 Methodology

The methodology of this work consists of three main components: the physical equations and parameters for describing global mean temperature anomaly, the selected parametric form of the SSM, and the application of Bayesian inference. These three components are described in the following sub-sections.

2.1 Physical modeling of global mean temperature anomaly

The time series of global mean temperature anomaly \(T\) responding to radiative forcing \(F\) can be described using a simplified, two-layer energy-balance model. The heat flux into the climate system, which is largely absorbed by the ocean, can be expressed as \(F-\lambda T\) (Gregory et al. 2002), where \(\lambda\) is a climate feedback parameter representing the sensitivity of the temperature response (Gregory and Andrews 2016). The first layer of the two-layer model represents the air and surface ocean, whereas the second layer represents the deep ocean which, due to its high heat capacity, responds to heat exchange more slowly than the surface layer (Calel and Stainforth 2017). A one-layer model (without an explicit deep ocean layer) or models with more layers can also be applied (as in Cummins et al. (2020)), modeling the energy exchange less or more accurately at a lower or higher computational cost, respectively. In this work, the two-layer model is adopted because it is more accurate than the one-layer model (further details on the one-layer model can be found in the Supplemental Materials), includes a reasonably small number of parameters, and can be efficiently calibrated via Bayesian inference.

The two-layer model involves two temperature anomaly series, i.e., the anomalies of the surface layer and of the deep ocean layer. The global mean surface air temperature anomaly is used for the surface layer, assuming that the heat exchange between the air and the surface ocean occurs instantaneously and uniformly. Heat exchange between the surface layer and the deep ocean layer is incorporated into the two-layer model. The temperature anomalies of these two layers at time \(t\) are therefore modeled as (Calel and Stainforth 2017):

$$\begin{array}{c}{C}_{1}\frac{{\mathrm{d}}T}{{\mathrm{d}}t}=F-\lambda T-\beta (T-{T}_{\mathrm{LO}})\\ {C}_{2}\frac{{\mathrm{d}}{T}_{\mathrm{LO}}}{{\mathrm{d}}t}=\beta (T-{T}_{\mathrm{LO}})\end{array}$$
(1)

where \(T\) and \({T}_{\mathrm{LO}}\) (in °C or K) are the temperature anomalies of the surface layer and the deep ocean (i.e., Lower Ocean) layer, respectively, \({C}_{1}\) and \({C}_{2}\) (in W m−2 K−1 yr) are the heat capacities of the surface and deep ocean layers, respectively, \(\beta\) and \(\lambda\) (in W m−2 K−1) are the heat exchange coefficient and the climate feedback coefficient, respectively, and \(F\) (in W m−2) is the radiative forcing.

To integrate these equations, a simple finite-difference scheme is used to approximate the solution (as also adopted by some integrated assessment models (Calel and Stainforth 2017)):

$$\begin{array}{c}{C}_{1}\frac{T\left(t+\Delta t\right)-T(t)}{\Delta t}\simeq F(t)-\lambda T(t)-\beta [T\left(t\right)-{T}_{\mathrm{LO}}\left(t\right)]\\ {C}_{2}\frac{{T}_{\mathrm{LO}}\left(t+\Delta t\right)-{T}_{\mathrm{LO}}(t)}{\Delta t}\simeq \beta [T\left(t\right)-{T}_{\mathrm{LO}}\left(t\right)]\end{array}$$
(2)

hence, the evolution of surface and the deep ocean temperature anomalies can be expressed as:

$$\begin{array}{c}T\left(t+\Delta t\right)\simeq \frac{{C}_{1}-\lambda\Delta t-\beta\Delta t}{{C}_{1}}T\left(t\right)+\frac{\beta\Delta t}{{C}_{1}}{T}_{\mathrm{LO}}\left(t\right)+\frac{\Delta t}{{C}_{1}}F(t)\\ {T}_{\mathrm{LO}}\left(t+\Delta t\right)\simeq \frac{\beta\Delta t}{{C}_{2}}T\left(t\right)+\frac{{C}_{2}-\beta\Delta t}{{C}_{2}}{T}_{\mathrm{LO}}\left(t\right)\end{array}$$
(3)

where \(\Delta t\) is the time step size.
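As a concrete illustration of Eq. (3), a minimal Python sketch of the two-layer stepping scheme is given below; the function names and the equilibrium initial condition (zero anomalies) are illustrative choices, not part of the model description above.

```python
import numpy as np

def step_two_layer(T, T_LO, F, lam, beta, C1, C2, dt=1.0):
    """One finite-difference step of the two-layer model (Eq. (3))."""
    T_next = ((C1 - lam * dt - beta * dt) / C1) * T \
        + (beta * dt / C1) * T_LO + (dt / C1) * F
    T_LO_next = (beta * dt / C2) * T + ((C2 - beta * dt) / C2) * T_LO
    return T_next, T_LO_next

def simulate(forcing, lam, beta, C1, C2, dt=1.0):
    """Integrate the model over a forcing series, starting from equilibrium
    (zero anomalies); returns the surface temperature anomaly series."""
    T, T_LO = 0.0, 0.0
    out = np.empty(len(forcing))
    for k, F in enumerate(forcing):
        out[k] = T
        T, T_LO = step_two_layer(T, T_LO, F, lam, beta, C1, C2, dt)
    return out
```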

The variability among the GCM simulations of future global mean temperature anomaly can be largely attributed to the different values of the parameters in Eq. (1) or Eq. (3), including \(\lambda\), \({C}_{1}\), \({C}_{2}\) and function \(F(t)\) (Geoffroy et al. 2013).

Parameter \(\lambda\) indicates the sensitivity of the temperature response to the imposed radiative forcing and can be calculated from the Equilibrium Climate Sensitivity (ECS) as \(\lambda ={F}_{2\times {{\mathrm{CO}}}_{2}}/{\mathrm{ECS}}\), where \({F}_{2\times {{\mathrm{CO}}}_{2}}\) is the forcing for a doubled CO2 concentration relative to the pre-industrial level, around 3.7 W/m2 (Lewis and Curry 2015). As shown in Eq. (1), the temperature anomaly at a steady state equals \(F/\lambda\); \(1/\lambda\) therefore represents the equilibrium temperature change per unit radiative forcing. The ECS is defined as the equilibrium temperature change under a doubled CO2 concentration (Meehl et al. 2020), i.e., under the radiative forcing \({F}_{2\times {{\mathrm{CO}}}_{2}}\).

The Transient Climate Response (TCR) is another commonly used climate sensitivity parameter: it represents the temperature change at the time the CO2 concentration doubles after increasing by 1% per year over a 70-year period (Richardson et al. 2016). As the system does not reach equilibrium by the end of the 70-year period, the TCR is lower than the ECS and is affected by the rate of heat exchange, which in this model is related to parameters \({C}_{1}\), \({C}_{2}\), and \(\beta\).
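As a hedged example, the TCR implied by a given parameter set can be computed by driving the scheme above (reusing the `simulate` function from the previous sketch) with the forcing of a 1% annual CO2 increase, \(F(t)=F_{2\times {\mathrm{CO}}_2}\,\log_{2}(1.01^{t})\); the parameter values below are placeholders roughly in the ranges discussed in Section 3.1, not calibrated results.

```python
import numpy as np

F_2X = 3.7                                           # W/m2, forcing for doubled CO2
years = np.arange(71)                                # year 0 to year 70
forcing = F_2X * years * np.log(1.01) / np.log(2.0)  # F_2X * log2(1.01**t)

ECS = 3.0                                            # K, illustrative value
lam = F_2X / ECS                                     # climate feedback parameter
temps = simulate(forcing, lam=lam, beta=0.73, C1=7.3, C2=106.0)
tcr = temps[70]                                      # anomaly at the doubling year
```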

Radiative forcing \(F\) is a major source of uncertainty in climate projections because of the uncertainty related to greenhouse gas (GHG) emissions and aerosol forcing (Andreae et al. 2005; Hawkins and Sutton 2009). Different climate change scenarios (such as SSP2-4.5 and SSP5-8.5) represent different trajectories of future GHG emissions (Hawkins and Sutton 2009). Aerosol forcing also contributes greatly to climate change uncertainty (Andreae et al. 2005): an underestimated present-day aerosol cooling effect could lead to an underestimation of climate sensitivity and, consequently, to an underestimation of temperature increases when the future aerosol cooling effect decreases (Myhre et al. 2013). Given that the objective of this work is to assess climate projection uncertainty including forcing uncertainty, an average forcing series for each SSP scenario, with an additional consideration of its uncertainty, is used instead of a fixed forcing series (such as the forcing under a quadrupled CO2 concentration used in Cummins et al. (2020)). The average forcing series of the different SSP scenarios estimated and provided in IPCC AR6 (Smith et al. 2021) are used in this work.

Two linear scaling coefficients (\({\upgamma }_{1}\) and \({\upgamma }_{2}\)) are used to model the uncertainty of the GHG and aerosol forcings:

$$F\left(t\right)={\gamma }_{1}{F}_{(t)}^{\mathrm{GHG}}+{\gamma }_{2}{F}_{(t)}^{\mathrm{aerosol}}+{F}_{(t)}^{\mathrm{other}}$$
(4)

where \({F}_{(t)}^{\mathrm{GHG}}\) models the contributions from GHG forcing, \({F}_{(t)}^{\mathrm{aerosol}}\) those of anthropogenic aerosol forcing, and \({F}_{(t)}^{\mathrm{other}}\) those of other sources, including natural forcing and the forcing from land use change.

2.2 The physical-parameter-based SSM

The two-layer model presented in Eq. (3) and (4) is subsequently developed into a parametric SSM. An SSM consists of a state transition function and a measurement function (Shumway and Stoffer 2017). The state transition function describes the evolution of the latent variable vector \({\varvec{x}}\left(t\right)\), comprising the surface and deep ocean temperature anomalies \(T\left(t\right)\) and \({T}_{\mathrm{LO}}\left(t\right)\) at time \(t\). The measurement function describes the relation between the latent and the measured temperatures, including noise and error terms that explain the discrepancies between the latent temperature and the GCM simulations or historical observations of the global mean temperature anomaly.

Based on Eq. (3) and (4), the transition function is:

$$\begin{array}{c}{\varvec{x}}\left(t+\Delta t\right)={\varvec{A}}{\varvec{x}}\left(t\right)+{\varvec{B}}{\varvec{F}}\left(t\right)+{\varvec{\omega}}(t)\\ {\varvec{x}}\left(t\right)=\left[\begin{array}{c}T\left(t\right)\\ {T}_{\mathrm{LO}}\left(t\right)\end{array}\right]; {\varvec{A}}=\left[\begin{array}{cc}1-\frac{{F}_{2\times {{\mathrm{CO}}}_{2}}}{{C}_{1}{\mathrm{ECS}}}\Delta t-\frac{\beta }{{C}_{1}}\Delta t& \frac{\beta }{{C}_{1}}\Delta t\\ \frac{\beta }{{C}_{2}}\Delta t& 1-\frac{\beta }{{C}_{2}}\Delta t\end{array}\right]\\ {\varvec{B}}=\frac{\Delta t}{{C}_{1}}\left[\begin{array}{ccc}{\upgamma }_{1}& {\upgamma }_{2}& 1\\ 0& 0& 0\end{array}\right]; {\varvec{F}}\left(t\right)=\left[\begin{array}{c}{F}_{\left(t\right)}^{\mathrm{GHG}}\\ {F}_{\left(t\right)}^{\mathrm{aerosol}}\\ {F}_{\left(t\right)}^{\mathrm{other}}\end{array}\right];{\varvec{\omega}}\left(t\right)=\left[\begin{array}{c}{\omega }_{1}\left(t\right)\\ {\omega }_{2}\left(t\right)\end{array}\right]\end{array}$$
(5)

where vector \({\varvec{\omega}}\left(t\right)\) is a noise term including independent, zero-mean normal noises \({\omega }_{1}\left(t\right)\) and \({\omega }_{2}\left(t\right)\). For the analyses carried out in this work, \(\Delta t\) is fixed to one year.

Natural climate variability can affect temperature series by introducing additive noise to the long-term climate change trend (Lai and Dzombak 2019). The GCM simulations and historical observations are modeled as measurement variable vector \({\varvec{y}}\left(t\right)\) with additional noise as:

$$\begin{array}{c}{\varvec{y}}\left(t\right)={\varvec{D}}{\varvec{x}}\left(t\right)+{\varvec{\nu}}(t)\\ \begin{array}{cc}{\varvec{D}}=\left[\begin{array}{cc}1& 0\\ 0& 1\end{array}\right];{\varvec{\nu}}\left(t\right)=\left[\begin{array}{c}{\nu }_{1}\left(t\right)\\ {\nu }_{2}\left(t\right)\end{array}\right]& (\mathrm{if\;deep\;ocean\;temperature\;measurements\;are\;processed})\end{array}\\ \begin{array}{cc}{\varvec{D}}=\left[\begin{array}{cc}1& 0\end{array}\right];{\varvec{\nu}}\left(t\right)={\nu }_{1}\left(t\right)& (\mathrm{if\;deep\;ocean\;temperature\;measurements\;are\;not\;used})\end{array}\end{array}$$
(6)

where \({\nu }_{1}\left(t\right)\) and \({\nu }_{2}\left(t\right)\) are independent, zero-mean normal noise terms affecting the surface and the deep ocean temperature anomalies, respectively.
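A minimal sketch of assembling the matrices of Eq. (5) and (6) from the physical parameters is shown below; treating \({F}_{2\times {{\mathrm{CO}}}_{2}}\) as a fixed constant and defaulting to the surface-only measurement case mirrors the choices described in the text, but the function itself is illustrative.

```python
import numpy as np

def build_ssm(ECS, C1, C2, beta, gamma1, gamma2, dt=1.0,
              F_2X=3.7, use_ocean_obs=False):
    """Assemble the transition matrices A, B (Eq. (5)) and the
    measurement matrix D (Eq. (6)) from the physical parameters."""
    A = np.array([
        [1.0 - (F_2X / (C1 * ECS)) * dt - (beta / C1) * dt, (beta / C1) * dt],
        [(beta / C2) * dt, 1.0 - (beta / C2) * dt],
    ])
    B = (dt / C1) * np.array([[gamma1, gamma2, 1.0],
                              [0.0,    0.0,    0.0]])
    # Observe both layers, or the surface layer only (the case used here).
    D = np.eye(2) if use_ocean_obs else np.array([[1.0, 0.0]])
    return A, B, D
```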

As indicated in Eq. (6), the SSM can be applied with or without processing deep ocean temperature measurements. The analyses presented subsequently do not use ocean temperature measurements, while some results from using GCM ocean temperature simulations are presented in the Supplemental Materials. One reason for excluding deep ocean temperature measurements is the relatively limited amount of historical observations of deep ocean temperature (Abraham et al. 2013). Additionally, the ocean temperature simulations from GCMs are commonly provided in separate ocean layers, and additional assumptions and procedures are needed to aggregate the multiple layers for the two-layer model, which can lead to challenges and to high sensitivity of the parameter posterior distributions (as described in the Supplemental Materials).

It is also worth noting that, although not further assessed in this work, Eq. (6) provides a flexible parametric form to model regional variables. For example, matrix \({\varvec{D}}\) can be modified to include linear scaling factors for modeling regional responses and variables; \({\varvec{\nu}}(t)\) can also be modified to include additional regional variability and noise (e.g., following a framework similar to the one used in Beusch et al. (2020)). Such alternative modeling approaches are also described in the Supplemental Materials.

Combining Eq. (5) and (6), a parametric SSM form is developed with ten parameters \({\theta }_{1}\) to \({\theta }_{10}\), as presented in Table 1. Vector \({\varvec{\Theta}}\) lists all ten parameters, representing a linear transformation of the natural logarithm of the physical parameters: \({\varvec{\Theta}}={\varvec{H}}\,{\mathrm{log}}({[{\mathrm{ECS}}\;\;{C}_{1}\;\;{C}_{2}\;\;\beta\;\;{\upgamma }_{1}\;\;{\upgamma }_{2}\;\;{q}_{1}\;\;{q}_{2}\;\;{r}_{1}\;\;{r}_{2}]}^{\mathrm{T}})\), where the 10-by-10 matrix \({\varvec{H}}\) can be deduced from the definitions in Table 1 (the matrix and its inverse are explicitly reported in the Supplemental Materials) and \({q}_{1}\), \({q}_{2}\), \({r}_{1}\), and \({r}_{2}\) are the standard deviations of the zero-mean white noises \({\omega }_{1}(t)\), \({\omega }_{2}(t)\), \({\nu }_{1}(t)\), and \({\nu }_{2}(t)\), respectively.

Table 1 Summary of the ten SSM parameters and their representation of the physical parameters in Eq. (5) and (6)

2.3 A two-step Bayesian inference procedure based on the SSM

Following the Bayesian method, posterior distributions of the parameters are obtained by integrating prior distributions and likelihood functions. Bayes’ formula is applied twice in this work: first when processing GCM simulations, and subsequently when processing pseudo- or historical observations. When processing GCM simulations, broad prior distributions informed by the literature are used (this literature prior is discussed in Section 3). The posterior distributions obtained from this first step are then integrated into a new distribution (i.e., the GCM-informed prior distribution) for processing observations.

This additional step of processing GCM simulations leverages long-term simulations from GCMs (up to 2099 in this work) to inform the SSM parameters before processing observations. Compared to distributions based on values taken from the literature, the parameters informed by the GCM simulations better characterize some features of parameter uncertainty, e.g., the correlation among parameters.

2.3.1 Processing GCM simulations

In general, an approximate posterior distribution of the SSM parameters is obtained in this work by applying the Laplace approximation to the product of the prior distribution and the model likelihood function computed by the Kalman Filter (Shumway and Stoffer 2017). Additional technical details for this section, including the Kalman Filter, are provided in the Supplemental Materials.

The posterior distribution is obtained from Bayes’ formula when processing the GCM-simulated temperature series \({\varvec{y}}\):

$$\begin{array}{c}p\left({\varvec{\Theta}}|{\varvec{y}}\right)=\frac{p\left({\varvec{y}}|{\varvec{\Theta}}\right)p\left({\varvec{\Theta}}\right)}{\int p\left({\varvec{y}}|{\varvec{\Theta}}\right)p\left({\varvec{\Theta}}\right){\mathrm{d}}{\varvec{\Theta}}}\\ {\varvec{y}}|{\varvec{\Theta}}\sim \mathrm{KF(}{\varvec{\Theta}}\mathrm{)}\\ p\left({\varvec{\Theta}}\right)\propto \mathcal{N}\left({\varvec{\Theta}},{{\varvec{\mu}}}_{0},{{\varvec{\Sigma}}}_{0}\right){f}_{\mathrm{TCR}}\left({\varvec{\Theta}}\right); {f}_{\mathrm{TCR}}\left({\varvec{\Theta}}\right)\propto \mathcal{N}\left(\mathrm{TCR(}{\varvec{\Theta}}\mathrm{)}, {\mu }_{\mathrm{TCR}},{\upsigma }_{\mathrm{TCR}}^{2}\right)\end{array}$$
(7)

where \(p\left({\varvec{\Theta}}|{\varvec{y}}\right)\) is the posterior distribution of the parameters; \(p\left({\varvec{y}}|{\varvec{\Theta}}\right)\) is the likelihood function of the time series, computed from the Kalman Filter; \(\mathrm{KF(}{\varvec{\Theta}}\mathrm{)}\) indicates the process of applying the Kalman Filter with parameter vector \({\varvec{\Theta}}\); \(p\left({\varvec{\Theta}}\right)\) is the parameter prior, proportional to the product of a function \({f}_{\mathrm{TCR}}\) and a normal distribution with mean vector \({{\varvec{\mu}}}_{0}\) and covariance matrix \({{\varvec{\Sigma}}}_{0}\); and symbol \(\mathcal{N}\) indicates the normal distribution. Moments \({{\varvec{\mu}}}_{0}\) and \({{\varvec{\Sigma}}}_{0}\) are selected based on values reported in the literature. \(\mathrm{TCR(}{\varvec{\Theta}}\mathrm{)}\) is calculated deterministically from Eq. (5) (i.e., assuming zero noise variance) under an annual 1% increase of CO2 forcing over a 70-year period. Function \({f}_{\mathrm{TCR}}\) is a likelihood related to a nominal TCR value \({\mu }_{\mathrm{TCR}}=1.8{\mathrm{K}}\) with standard deviation \({\upsigma }_{\mathrm{TCR}}=0.5{\mathrm{K}}\) (selected based on the reported range of TCR), and it is included to align parameters \({\varvec{\Theta}}\) with reasonable values of TCR.

The Kalman Filter allows for inferring the hidden state and for computing the likelihood of an entire time series (Rings et al. 2012). In this work, the term “prediction” refers specifically to the prediction step of the Kalman Filter, and “projection” refers to the future temperature projections obtained from the original GCM simulations and from the SSM (these terms can have different meanings in the literature (Merryfield et al. 2020)).
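The likelihood \(p\left({\varvec{y}}|{\varvec{\Theta}}\right)\) can be accumulated from the one-step-ahead prediction errors of the Kalman Filter; a self-contained sketch for the surface-only measurement case is given below. The noise covariances \(Q={\mathrm{diag}}({q}_{1}^{2},{q}_{2}^{2})\) and \(R={r}_{1}^{2}\) follow the definitions in Section 2.2, while the initial state moments `x0` and `P0` are user-supplied assumptions.

```python
import numpy as np

def kf_loglik(y, forcings, A, B, D, Q, R, x0, P0):
    """Log-likelihood of an observed temperature series under the SSM,
    computed from the Kalman Filter innovations."""
    x, P = np.asarray(x0, float), np.asarray(P0, float)
    loglik = 0.0
    for obs, F in zip(y, forcings):
        # Prediction step (Eq. (5)).
        x = A @ x + B @ F
        P = A @ P @ A.T + Q
        # Innovation and its covariance (Eq. (6)).
        e = np.atleast_1d(obs - D @ x)
        S = D @ P @ D.T + R
        loglik -= 0.5 * (e.size * np.log(2.0 * np.pi)
                         + np.log(np.linalg.det(S))
                         + e @ np.linalg.solve(S, e))
        # Update step.
        K = P @ D.T @ np.linalg.inv(S)
        x = x + K @ e
        P = (np.eye(len(x)) - K @ D) @ P
    return loglik
```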

To apply the Laplace’s approximation, the Maximum A Posteriori (MAP) is identified by a numerical optimization algorithm applied to the unnormalized posterior (i.e., \(p\left({\varvec{y}}|{\varvec{\Theta}}\right)p\left({\varvec{\Theta}}\right)\) in Eq. (7)). In this work, the “L-BFGS-B” method (Byrd et al. 1995) is used. The Hessian matrix is subsequently estimated at the MAP point, also by the L-BFGS-B algorithm to approximate the posterior with a Gaussian distribution (Nocedal and Wright 2006; Barber 2011).

Additionally, the posterior distribution related to a specific GCM is obtained by processing multiple ensemble members (i.e., different simulation runs) of that GCM (if available). Let \({\mathbf{Y}}_{i}=\{{{\varvec{y}}}_{i,1}, {{\varvec{y}}}_{i,2},{{\varvec{y}}}_{i,3},\dots \}\) list the ensemble members (\({{\varvec{y}}}_{i,1}\) being ensemble member 1, for example) of GCM \(i\), characterized by parameter value \({{\varvec{\Theta}}}_{i}\). The different ensemble members of a GCM are assumed to be independent of each other conditionally on \({{\varvec{\Theta}}}_{i}\) (hence the ensemble members of GCM \(i\) share the same parameter value \({{\varvec{\Theta}}}_{i}\) but not the hidden variables \(\mathbf{x}(t)\)). From Laplace’s approximation applied to Eq. (7), the posterior distribution is:

$${\boldsymbol{\Theta }}_{i}|{{\varvec{Y}}}_{i} \stackrel{approx}{\sim } \mathcal{N}\left({\widehat{{\varvec{\mu}}}}_{i},{\widehat{\varvec{\Sigma }}}_{i}\right)$$
(8)

where \({\widehat{{\varvec{\mu}}}}_{i}\) and \({\widehat{{\varvec{\Sigma}}}}_{i}\) are the posterior mean vector and covariance matrix estimated by Laplace’s approximation, respectively.

2.3.2 Processing pseudo- or historical observations

The GCMs are considered as different realizations of the Earth climate system and are represented by the posterior distributions of the parameters obtained in the previous step; the prior distribution used subsequently is informed by these intermediate posterior distributions. Specifically, the posterior distributions identified for individual GCMs in the previous step are integrated into a single, unified prior distribution (i.e., the GCM-informed prior distribution) for processing pseudo- or historical observations. Note that simulated time series from a GCM can also be used as pseudo-observations in this work; if a GCM is used to provide pseudo-observations, it is excluded from the set of GCMs used to obtain the GCM-informed prior distribution.

Let \({\varvec{Y}}=\{{\mathbf{Y}}_{1}, {\mathbf{Y}}_{2},\dots ,{\mathbf{Y}}_{m}\}\) list all simulations from \(m\) GCMs. By leveraging Eq. (8), an integrated posterior distribution conditional on \({\varvec{Y}}\) can be expressed as a mixture of Gaussians:

$$\begin{array}{c}p\left({\varvec{\Theta}}|{\varvec{Y}}\right)\simeq \sum_{i=1}^{m}{f}_{i}\left({\varvec{\Theta}}\right){P}_{i}\\ {f}_{i}\left({\varvec{\Theta}}\right)=\mathcal{N}\left({\varvec{\Theta}},{\widehat{{\varvec{\mu}}}}_{i},{\widehat{{\varvec{\Sigma}}}}_{i}\right)\end{array}$$
(9)

where \({f}_{i}\) is the posterior distribution related to the simulations of GCM \(i\) and \({P}_{i}\) is the prior probability of this GCM; a non-informative, uniform distribution \({P}_{i}=1/m\) is adopted in this work. The integrated distribution can be approximated by a single normal distribution, by matching moments:

$$\begin{array}{c}{\varvec{\Theta}}|\mathbf{Y}\stackrel{{\mathrm{approx}}}{\sim } \mathcal{N}\left(\widehat{{\varvec{\mu}}},\widehat{{\varvec{\Sigma}}}\right)\\ \widehat{{\varvec{\mu}}}=\sum_{i=1}^{m}{P}_{i}{\widehat{{\varvec{\mu}}}}_{i}; \widehat{{\varvec{\Sigma}}}=\sum_{i=1}^{m}{P}_{i}\left[{\widehat{{\varvec{\Sigma}}}}_{i}+{\left({\widehat{{\varvec{\mu}}}}_{i}-\widehat{{\varvec{\mu}}}\right)\left({\widehat{{\varvec{\mu}}}}_{i}-\widehat{{\varvec{\mu}}}\right)}^{\mathrm{T}}\right]\end{array}$$
(10)

where \(\widehat{{\varvec{\mu}}}\) and \(\widehat{{\varvec{\Sigma}}}\) are the estimated moments.
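A small numpy sketch of this moment matching (Eq. (10)) is given below, with uniform weights \({P}_{i}=1/m\) as the default.

```python
import numpy as np

def match_moments(mus, sigmas, weights=None):
    """Collapse a Gaussian mixture (Eq. (9)) into a single Gaussian (Eq. (10)).

    mus: (m, d) component means; sigmas: (m, d, d) component covariances."""
    mus, sigmas = np.asarray(mus), np.asarray(sigmas)
    m = len(mus)
    w = np.full(m, 1.0 / m) if weights is None else np.asarray(weights)
    mu = w @ mus                                 # weighted mean
    dev = mus - mu                               # component deviations
    sigma = np.einsum("i,ijk->jk", w, sigmas) \
        + np.einsum("i,ij,ik->jk", w, dev, dev)  # within- plus between-component spread
    return mu, sigma
```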

As an alternative to integrating into a single unified prior distribution, the Supplemental Materials also describe an approach that assigns and estimates posterior probabilities of individual GCMs by using the posterior distributions identified previously for each GCM (although this alternative approach is sensitive to the particular set of GCMs included in the analyses).

To process pseudo- or historical observations \({{\varvec{y}}}_{h}\), Laplace’s approximation is implemented a second time, with the same “L-BFGS-B” method to identify the MAP. Differently from Eq. (7), the likelihood function related to TCR, which has already been used to inform the prior distributions in Eq. (7), is not included in this step. The posterior distribution \(p\left({\varvec{\Theta}}|{\varvec{Y}},{{\varvec{y}}}_{h}\right)\) is obtained by Bayes’ formula:

$$\begin{array}{c}p\left({\varvec{\Theta}}|{\varvec{Y}},{{\varvec{y}}}_{h}\right)\propto p\left({{\varvec{y}}}_{h}|{\varvec{\Theta}}\right)p\left({\varvec{\Theta}}|{\varvec{Y}}\right)\\ {{\varvec{y}}}_{h}|{\varvec{\Theta}}\sim {\mathrm{KF}}({\varvec{\Theta}})\end{array}$$
(11)

where \(p\left({{\varvec{y}}}_{h}|{\varvec{\Theta}}\right)\) is the likelihood function related to the observations, computed by the Kalman Filter.

The projections of the future temperature anomaly series \({{\varvec{y}}}^{*}\) are obtained as:

$$\begin{array}{c}p\left({{\varvec{y}}}^{*}|{{\varvec{y}}}_{h},{\varvec{Y}}\right)=\int p\left({{\varvec{y}}}^{*}|{\varvec{\Theta}}, {{\varvec{y}}}_{h}\right)p\left({\varvec{\Theta}}|{\varvec{Y}},{{\varvec{y}}}_{h}\right){\mathrm{d}}{\varvec{\Theta}} \\ {{\varvec{y}}}^{*}|{\varvec{\Theta}}, {{\varvec{y}}}_{h}\sim {\mathrm{KF}}({{\varvec{y}}}_{h}, {\varvec{\Theta}})\end{array}$$
(12)

where \(p\left({{\varvec{y}}}^{\boldsymbol{*}}|{{\varvec{y}}}_{h},{\varvec{Y}}\right)\) is the updated, probabilistic future temperature anomaly projection given the observations, and \({\mathrm{KF}}({{\varvec{y}}}_{h},{\varvec{\Theta}})\) indicates the Kalman Filter with parameters \({\varvec{\Theta}}\) and observations \({{\varvec{y}}}_{h}\) up to the present time.

Projections in Eq. (12) are based on \({{\varvec{y}}}^{*}\) (i.e., the future series of “measurement” variables in the Kalman Filter) instead of the latent variable \({{\varvec{x}}}^{*}\), as the former incorporates the additional noise related to natural variability (although the distribution \(p\left({{\varvec{x}}}^{*}|{{\varvec{y}}}_{h},{\varvec{Y}}\right)\) can also be easily obtained from the Kalman Filter).

To account for parameter uncertainty, a sampling and simulation procedure is applied following Eq. (12). Specifically, samples of parameters \({\varvec{\Theta}}\) are generated from the posterior distribution \(p\left({\varvec{\Theta}}|{\varvec{Y}},{{\varvec{y}}}_{h}\right)\), and the Kalman Filter is then used to fit the observations \({{\varvec{y}}}_{h}\) and simulate future series \({{\varvec{y}}}^{*}\). The different generated \({{\varvec{y}}}^{*}\) series are intended as possible realizations of the future temperature anomaly and indicate trajectories of future change that incorporate the uncertainty in the projection moments.
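A minimal sketch of this sampling-and-simulation procedure is shown below; `simulate_future` is a hypothetical helper that, for one parameter sample, runs the Kalman Filter on the observations and then simulates one future trajectory, including transition and measurement noise.

```python
import numpy as np

def sample_projections(mu_post, sigma_post, n_samples, simulate_future, seed=0):
    """Draw parameter samples from the Laplace posterior (Eq. (11)) and
    generate one future series y* per sample (Eq. (12))."""
    rng = np.random.default_rng(seed)
    thetas = rng.multivariate_normal(mu_post, sigma_post, size=n_samples)
    return np.stack([simulate_future(theta, rng) for theta in thetas])
```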

3 Results of probabilistic estimation of physical parameters of GCMs

3.1 The literature prior

As discussed previously, an identical prior distribution, informed by the physical parameter values reported in the literature, is used to process the GCM simulations. Specifically, as reported in the recent IPCC AR6 (IPCC 2021), ECS has a range of 2 to 5 K (90% confidence level), whereas TCR is reported with a range of 1.2 to 2.4 K; these ranges are used to determine the distributions of ECS and TCR. Geoffroy et al. (2013) analyzed different GCMs of CMIP5 based on a similar two-layer model and suggested that, on average among GCMs, \({C}_{1}\) is around 7.3 Wm−2K−1yr (corresponding approximately to 80 m of ocean depth), \({C}_{2}\) is around 106 Wm−2K−1yr (corresponding approximately to 1100 m of ocean depth), and \(\beta\) is around 0.73 Wm−2K−1. The forcing time series estimated in IPCC AR6 (Smith et al. 2021) are used in this work, and the distributions of the two scaling coefficients \({\upgamma }_{1}\) and \({\upgamma }_{2}\) are selected to represent the reported uncertainty of the GHG and aerosol forcings. For example, the IPCC AR6 estimates of GHG forcing and aerosol forcing at 2019 are 3.4 to 4.4 Wm−2 and −1.94 to −0.06 Wm−2 (90% intervals), respectively (Smith et al. 2021). The selected prior distributions of the first six physical parameters (along with the TCR generated from this prior) are presented in Fig. 1. For the four parameters related to noise, the empirical standard deviations of the GCM temperature simulations are adopted: empirical standard deviation values are derived from the simulations of each GCM, and log-normal distributions are then calibrated from the values derived for the different GCMs. Further discussion is provided in the Supplemental Materials, together with the means and variances selected for these parameters.
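As a hedged illustration of how a reported range can be converted into a prior, the snippet below fits a log-normal distribution to the AR6 ECS range of 2–5 K (90% level); the actual prior moments used in this work are those reported in the Supplemental Materials, so the numbers here are illustrative.

```python
import numpy as np
from scipy import stats

lo, hi = 2.0, 5.0                                 # AR6 ECS range in K (90% level)
z90 = stats.norm.ppf(0.95)                        # ~1.645 for a symmetric 90% interval
mu_log = 0.5 * (np.log(lo) + np.log(hi))          # mean of log(ECS)
sd_log = (np.log(hi) - np.log(lo)) / (2.0 * z90)  # std. dev. of log(ECS)
# Sanity check: the implied 90% interval should recover roughly [2, 5] K.
interval = np.exp(mu_log + np.array([-z90, z90]) * sd_log)
```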

Fig. 1

The literature prior distribution selected for the six physical parameters and for the TCR generated from this prior distribution. The GHG and aerosol forcings are estimated using the probability densities of the two scaling coefficients γ1 and γ2 and using the forcing values at year 2019 and (or) 2099. The TCR density is estimated using the simulations from the Kalman Filter based on the prior probabilities of other parameters (and assuming zero noise variance). The three vertical dotted lines present the medians and 95% confidence intervals

Compared to ECS and TCR, which have been calculated and evaluated in a number of existing studies (e.g., IPCC AR6 (IPCC 2021) and Meehl et al. (2020)), parameters such as \({C}_{1}\), \({C}_{2}\), and \(\beta\) are derived in this work specifically for the two-layer model; a wide prior distribution is therefore selected for these parameters, and it may differ slightly from the estimates of similar parameters in other studies. For example, the selected prior distribution of \({C}_{1}\) has a greater mean and larger uncertainty than the values estimated in Geoffroy et al. (2013).

3.2 Parameters inferred from GCM simulations

Processing GCM simulations from 1850 to 2099 under the SSP5-8.5 scenario, the posterior distributions of the parameters are obtained for the different GCMs of CMIP6, and the estimated ECS and TCR values are compared with the reported values in Fig. 2. The SSP5-8.5 scenario is used because the temperature increase is greater under this scenario (i.e., it provides larger signal-to-noise ratios).

Fig. 2

Estimated ECS and TCR (from processing GCM simulations via the SSM) vs. the corresponding reported values. The estimation is conducted under the SSP5-8.5 scenario, and the reported values are obtained from Meehl et al. (2020). Only GCMs with reported ECS or TCR values in Meehl et al. (2020) are presented. The point estimates based on the MAP are indicated by black dots, and the 95% confidence intervals by grey error bars

According to Fig. 2, the estimates of ECS and TCR are generally in good agreement with the reported values, with the ECS results more accurate than the TCR results. The reported values fall inside the 95% posterior intervals in 21 out of 29 cases for ECS and 17 out of 29 cases for TCR. The greater errors in estimating ECS and TCR for some GCMs are likely caused by the limitations of the two-layer model (e.g., the limited number of layers, the one-year time step, and the simplified modeling of temperature response).

The parameter inference for one GCM (CNRM-CM6-1) is further used as an example to assess the difference between the GCM simulations and new simulations generated by the SSM (calibrated using the simulations from this GCM). The SSM simulations are obtained by generating random parameter values from the posterior distribution and generating random noise. The comparison is presented in Fig. 3.

Fig. 3

Comparisons of the original CNRM-CM6-1 simulations and the surface-layer temperature anomalies from the SSM under (a) SSP2-4.5 and (b) SSP5-8.5. The first column presents five original CNRM-CM6-1 ensemble members and the forcing time series used in this work; the middle column presents the latent temperature \({\varvec{x}}\left(t\right)\) series obtained from the SSM for each ensemble member, using the Kalman Filter and the point estimates of the parameters at the MAP; the third column presents five new simulations generated from the SSM using parameter values resampled from the posterior distribution. The original SSP5-8.5 simulations are used to estimate the SSM parameter posterior distribution, which is subsequently used to generate new simulations with the SSP2-4.5 and SSP5-8.5 forcing series in the third column. The temperature anomalies are calculated relative to the 1850–1879 period

Figure 3 suggests that the SSM can be used to project future temperature with the inferred physical parameters. The latent series \({\varvec{x}}\left(t\right)\) for the surface-layer temperature anomaly, presented in the middle column of Fig. 3, exhibit less noise than the original GCM simulations in the first column, supporting the assumption that the \({\varvec{x}}\left(t\right)\) values represent an underlying, smoothed temperature response to the applied forcings. The new series simulated by the SSM in the third column exhibit long-term temperature trends comparable to the original simulations in the first column (although the end-of-the-century temperature level of the original simulations under SSP2-4.5 may be slightly higher than that of the SSM-simulated series). Consequently, the results of Fig. 3 suggest that GCMs can be reasonably represented by the SSM when adopting the posterior distributions of the parameters inferred for these GCMs.

4 Results from processing pseudo- and historical observations

4.1 Posterior distributions of ECS and TCR

The posterior distributions of ECS and TCR after processing GCM simulations as pseudo-observations are presented and discussed in this section. As TCR is not among the SSM parameters, its posterior distribution is derived from Kalman Filter simulations. Using one realization series each from CNRM-CM6-1 and GFDL-ESM4 as pseudo-observations (selected because CNRM-CM6-1 has a relatively large reported ECS value, whereas GFDL-ESM4 has a small one), the posterior distributions of ECS and TCR – from processing different amounts of pseudo-observations – are presented in Fig. 4.

Fig. 4

Posterior probabilities of ECS and TCR using one realization from CNRM-CM6-1 and GFDL-ESM4 as pseudo-observations (under SSP5-8.5): (a) probability density functions when pseudo-observations are available up to 2020 and 2080 and (b) posterior means and 95% confidence intervals given observations available up to a specific time. Vertical dotted lines in part (a) indicate the reported ECS and TCR values from Meehl et al. (2020)

The results of Fig. 4 suggest that the SSM method can provide reasonable estimates of the physical parameters and that additional pseudo-observations can reduce the uncertainty, although the results depend on the particular pseudo-observation series assessed. Similar to the results in Fig. 2, the ECS results align more closely with the reported values than the TCR results. Additionally, the uncertainty of the ECS and TCR estimated for the GFDL-ESM4 pseudo-observation series reduces largely as observations increase. The results for CNRM-CM6-1 in Fig. 4 show a less pronounced reduction of uncertainty and an uptick of ECS starting around 2050. Such results indicate a relatively high sensitivity of the ECS and TCR estimation to the specific pseudo-observation series used. These features (e.g., the uptick of ECS for CNRM-CM6-1 starting from 2050) are further investigated and are generally related to the average forcing time series used in this work. Specifically, the forcings and the simulated temperature response in a GCM can exhibit temporal changes different from the ones assumed and used in this work, causing the ECS and TCR posterior probabilities to be temporally inconsistent and dependent on the pseudo-observation series (more detailed discussions are offered in the Supplemental Materials). The forcing uncertainty is modeled in this work with \({\upgamma }_{1}\) and \({\upgamma }_{2}\), which are time-independent; this can be a limitation of the SSM-based approach. Additional uncertainty in the forcing series can be incorporated into the SSM, although this extension is not further investigated.

4.2 Future temperature projections using pseudo-observations

Future temperature projections with the processing of increasing amounts of pseudo-observations are examined in this section. As examples, two pseudo-observation series from the two GCMs (CNRM-CM6-1 and GFDL-ESM4) are processed to project future temperature in Fig. 5, assuming that the pseudo-observations are available from 1850 to 2020, 2050, and 2080.

Fig. 5

The projections of global mean temperature anomaly using one realization from (a) CNRM-CM6-1 and (b) GFDL-ESM4 as pseudo-observations, which are assumed to be available up to 2020, 2050, and 2080

Figure 5 shows how the uncertainty of future projections is reduced when more observations are processed. Compared to the projections with observations up to 2020, the projections with observations up to 2050 and 2080 exhibit gradually smaller uncertainty in each case of Fig. 5. This reduction is the result of reduced posterior uncertainty, as presented previously in Fig. 4.

Further quantitative analyses are carried out to evaluate the reduction of uncertainty and the projection accuracy. The simulations from each of 36 (for SSP2-4.5) or 37 (for SSP5-8.5) GCMs are used as pseudo-observations, with the series up to 2020, 2050, and 2080 used separately to obtain and compare future projections. The results among the 36/37 pseudo-observation series are summarized in Fig. 6. In addition to assessing projection accuracy with absolute errors, the continuous ranked probability score (CRPS; Gneiting and Raftery (2007)) – the higher the score, the lower the forecast performance – is also used. Uncertainty ranges of the 95% prediction intervals (i.e., calculated as the upper bound minus the lower bound) are also summarized in Fig. 6. For example, using pseudo-observations up to 2020 under SSP5-8.5, the end-of-the-century 95% prediction intervals have a 2.7 °C range on average among the 37 pseudo-observation series, with 3.2 °C and 2.1 °C as the upper and lower quartiles.
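For reference, the CRPS has a closed form for a Gaussian predictive distribution (Gneiting and Raftery 2007), sketched below; for the simulated ensembles of this work, sample-based CRPS estimators can be used instead.

```python
import numpy as np
from scipy import stats

def crps_gaussian(y, mu, sigma):
    """Closed-form CRPS of a Gaussian forecast N(mu, sigma^2) against
    observation y; lower scores indicate better forecasts."""
    z = (y - mu) / sigma
    return sigma * (z * (2.0 * stats.norm.cdf(z) - 1.0)
                    + 2.0 * stats.norm.pdf(z) - 1.0 / np.sqrt(np.pi))
```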

Fig. 6

Time series of absolute errors, CRPS, and uncertainty ranges (of the 95% prediction intervals) for the SSM projections using pseudo-observations up to 2020, 2050, and 2080 under (a) SSP2-4.5 and (b) SSP5-8.5. The x-axes indicate the projected future years, starting from 2021, 2051, or 2081. The bold lines and shaded areas indicate the medians and the empirical lower and upper quartiles (quartiles are presented because of the sample sizes, i.e., 36 pseudo-observation time series for SSP2-4.5 and 37 for SSP5-8.5)

The results of Fig. 6 suggest that, for a specific future year, additional observations (30 years of them in this case) can largely reduce projection errors and uncertainty, whereas for projections at a specific lead time (e.g., 20 years ahead), the uncertainty is reduced moderately and the projection errors are similar. For example, for the end-of-the-century projections under SSP2-4.5, the absolute errors are on average around 0.5 °C when observations are available up to 2020, and they are reduced to 0.4 °C and to less than 0.2 °C when the observations extend to the end of 2050 and 2080, respectively. More noticeably, the uncertainty ranges decrease substantially for the end-of-the-century projections in parts (a3) and (b3): for example, increasing the availability of pseudo-observations from 2020 to 2050 decreases the uncertainty ranges from 1.9 °C on average to 1.0 °C (SSP2-4.5) and from 2.7 °C to 1.2 °C (SSP5-8.5); an additional 30 years of observations (up to 2080) further reduces the 95% prediction interval ranges to 0.6–0.7 °C for the two SSP scenarios.

4.3 Results from processing historical observation records

Historical observations of the global mean temperature anomaly (obtained from NOAA Climate at a Glance, as listed in the Data Availability Statement; the anomaly relative to the 1850–1900 period is calculated and used in this case) are further assessed in this section, and the results are presented in Fig. 7.

Fig. 7

The results from the SSM with the processing of historical observations of global mean temperature anomaly (relative to the 1850–1900 average): (a) the comparisons between the original GCM simulations (including different ensemble members of the same GCMs) and the SSM projections made at 2022 (using the GCM-informed prior); (b) the posterior 95% confidence intervals of ECS and TCR given different amounts of observations; and (c) the posterior joint distributions of ECS and aerosol forcing estimated using observations up to 2022

As presented in part (a) of Fig. 7, after processing historical observations, the future temperature projections from the SSM already exhibit reduced uncertainty compared to the original GCM projections. For example, the end-of-the-century empirical 95% intervals among the realizations of the GCMs are around 2.2–4.6 °C (SSP2-4.5) and 3.9–7.9 °C (SSP5-8.5), whereas the corresponding 95% prediction intervals from the SSM are 2.0–3.8 °C (SSP2-4.5) and 3.2–5.7 °C (SSP5-8.5). This reduction of projection uncertainty is consistent with the results of previous studies such as Ribes et al. (2021), which suggests end-of-the-century 90% prediction intervals of 2.3–3.7 °C (SSP2-4.5) and 3.8–6.0 °C (SSP5-8.5). The 95% uncertainty ranges of 1.8 °C for SSP2-4.5 and 2.5 °C for SSP5-8.5 in Fig. 7 also align with the results from using pseudo-observations in Fig. 6.

After processing historical observations, the SSM temperature projections in Fig. 7 are notably lower than the original GCM projections. This is likely related to the “hot model” problem (Hausfather et al. 2022) of some GCMs in CMIP6. As presented previously in Fig. 2, many GCMs in CMIP6 have ECS values greater than 4.7 K; in comparison, no GCM had an ECS value greater than 4.7 K in the previous CMIP5 phase (Meehl et al. 2020), and these large ECS values do not align well with other evidence (Hausfather et al. 2022). Consequently, the CMIP6 GCMs with large climate sensitivity simulate larger temperature increases and raise the upper bound of the original GCM projections in Fig. 7, compared to the SSM projections, which are constrained by the historical observations.

Posterior probabilities of ECS, TCR, and aerosol forcing are subsequently examined in Fig. 7. Part (b1) of Fig. 7 suggests some reduction of uncertainty, especially when observations after 2000 are used. Because the GCM-informed prior is based on the GCMs of CMIP6 (which include many GCMs with large ECS values, as described above), the ECS posterior probability using the GCM-informed prior in part (b1) of Fig. 7 (1.6–6.2 K at the 95% level) has a greater range than the 2–5 K (although at a 90% level) given by IPCC AR6 (IPCC 2021). To further investigate these results, the literature prior (i.e., based on the parameter values reported in the literature, as mentioned previously) is also used to process the historical observations and assess ECS and TCR; the results are presented in parts (b2) and (c2) of Fig. 7. Using the literature prior, the ECS posterior probability (2–4.5 K as summarized by the 95% interval) has a smaller uncertainty range, although the mean ECS value updated at 2022 is comparable to the mean ECS in part (b1). Additionally, the joint posterior distributions of ECS and aerosol forcing, presented in part (c) of Fig. 7, suggest a correlation between ECS and aerosol forcing. Such a correlation is consistent with the expectation that, if the present-day negative aerosol forcing is stronger (i.e., the net forcing combining aerosol and GHG forcings is smaller), then the Earth climate system should have a greater climate sensitivity (i.e., a greater ECS) given the observed temperature change (Andreae et al. 2005).

5 Summary, conclusions, and recommendations

To facilitate flexible decision-making in climate adaptation, a physical-parameter-based SSM (using a two-layer energy-balance model) with Bayesian inference is developed and assessed in this work, serving as a modeling framework – consistent with climate science and computationally efficient – to investigate the reduction of projection uncertainty of the global mean temperature anomaly from additional observations. The method involves two steps: (1) leveraging long-term simulations (up to the year 2099 in this work) from GCMs to obtain the distributions of the SSM parameters for each GCM, which are then integrated into a unified, GCM-informed prior distribution; (2) processing pseudo- or historical observations with the GCM-informed prior distribution, obtaining parameter posterior distributions conditional on the observations, and projecting future temperature change.

The integration of the two-layer energy-balance model and Bayesian inference allows the SSM to model different sources of climate change uncertainty (including natural variability, climate sensitivity, ocean heat uptake, and aerosol forcing uncertainty) and to relate these sources of uncertainty to the temperature projections. As described in previous studies (Andreae et al. 2005; Webster et al. 2008), a given temperature change can be related to different combinations of climate sensitivity, ocean heat uptake, and aerosol forcing; these various sources of uncertainty make the task of reducing projection uncertainty difficult. This issue is addressed in this work by integrating the energy-balance equations into the SSM and explicitly including parameters related to the different sources of uncertainty.

Using GCM simulations as pseudo-observations, the SSM method is assessed with respect to the posterior probabilities of the physical parameters, the projections of the global mean temperature anomaly, and the reduction of projection uncertainty with additional observations. The SSM provides reasonable estimates of the physical parameters: for example, the estimated ECS and TCR values for each GCM are generally consistent with the reported values. The uncertainty of parameters such as ECS and TCR reduces as further observations are processed. Analyzing the projections sequentially made with pseudo-observations up to 2020, 2050, and 2080, the reduction of uncertainty is evident: e.g., the end-of-the-century uncertainty range (95% prediction interval) of the global mean temperature anomaly decreases on average from 1.9 °C (when projected in 2020) to 1.0 °C (when projected in 2050), and further to 0.6 °C (when projected in 2080) under SSP2-4.5; under SSP5-8.5, the average uncertainty range reduces from 2.7 °C in 2020 to 1.2 °C in 2050, and further to 0.7 °C in 2080. Such analyses illustrate how the future reduction of climate change uncertainty can be predicted.

The state-space representation can also facilitate additional study objectives through adjustments to its parametric form. One example is to further use the SSM to investigate the posterior distributions of different physical parameters under observational constraints (e.g., as Fig. 7 suggests, using the GCM-informed prior distribution based on the GCMs of CMIP6 leads to an ECS uncertainty larger than that obtained directly from the literature prior distribution). Additionally, the method can be extended to the assessment of regional impacts based on the findings of studies such as Arnell et al. (2019) and He et al. (2022), and using additional methods such as pattern scaling (Tebaldi and Arblaster 2014) and the modeling of regional climate responses (Beusch et al. 2020).

The SSM method presented in this work can support engineering decision-making for climate change adaptation by projecting temperature and assessing the uncertainty of such projections. The results presented in this work, particularly the uncertainty reduction for projecting global mean temperature anomaly from 2020 to 2050 and from 2050 to 2080, highlight the predicted learning of climate change due to the progressive processing of observations and underscore the potential benefits of flexible adaptation strategies.