Introduction

As the average global temperature continues to rise and extreme weather events become more frequent, the impact of climate change on our lives is increasingly apparent1,2. However, the rate of warming varies across regions, as does the frequency of such extreme events3,4,5,6. Reliable and accurate regional climate information is therefore crucial for dealing with local climate change and its impacts. Although much international effort has been devoted to developing high-resolution global climate models (GCMs) in recent decades7,8, most GCMs used to assess century-scale climate change still operate with grid spacings of 100 km or more because of limitations in computational resources9,10,11. Such coarse spatial resolutions have proved inadequate for evaluating climate change at the scale relevant to local communities12. This challenge has given rise to downscaling techniques, which are now widely employed to bridge the gap between global climate projections and the specific climate information needs of local communities9,13,14,15,16.

These downscaling techniques, encompassing both dynamical and statistical methods, have been developed to generate high-resolution climate data13,14,15,16,17,18,19,20. Among them, Regional Climate Models (RCMs) are dynamical models that use topography and the circulation conditions provided by GCMs to generate regional climate information13,14,15. However, RCM downscaling results depend on the large-scale circulation conditions supplied by the GCMs, so systematic biases of the GCMs propagate into the downscaled results and increase their uncertainties21. Furthermore, dynamical models incur significant computational expense and require substantial data storage and processing resources13,14,15. In contrast, statistical downscaling methods can provide high-resolution outputs comparable to dynamical downscaling at a fraction of the computational cost. They rely on mathematical techniques, including deep learning and traditional statistical approaches, to establish statistical relationships between low-resolution and high-resolution climate data, enabling the derivation of detailed local-scale information17,18,19,20,22. Currently, both dynamical and statistical downscaling techniques are widely used in studies of climate change, climate variability, hydro-climate extremes, and impact assessments at regional scales, particularly within sectors such as agriculture, energy, and water resources23,24,25,26.

Unfortunately, both dynamical and statistical downscaling methods currently prioritize deterministic modeling, which often overlooks the inherent uncertainties in the data and the ill-posed nature of the downscaling problem27,28. These uncertainties have become a growing concern, as precise estimates of climate change and robust assessment methods are crucial for a comprehensive understanding of climate change, and many groups are now exploring innovative technologies and approaches to tackle these challenges29,30,31,32. Here, we introduce the Diffusion Probabilistic Downscaling Model (DPDM), a data-driven approach designed to simulate the probability distribution function of high-resolution climate conditioned on the corresponding low-resolution data. The DPDM incorporates a conditional probability method that accounts for external factors, such as topography, during the downscaling process. Compared with deterministic downscaling, DPDM derives probability distribution functions and generates a large number of ensemble members33, which not only recovers accurate local details but also allows more robust estimates of uncertainty. Additionally, we apply DPDM to the NOAA 20th Century Reanalysis34 to provide detailed information on the surface climate of East Asia from 1836 to 2015, which significantly enhances our understanding of local-scale climate change since the late 19th century.

Results

Climate downscaling via the DPDM

For monthly climate downscaling, one of the main limitations of deep learning methods is the lack of long-term high-resolution observational data. To mitigate this, we use the readily available ERA5-Land dataset35, which provides surface meteorological variables at a high resolution of 0.1°. Furthermore, for a more robust estimation of downscaling uncertainty, we use the lower-resolution (1°) ERA5 dataset as input rather than relying on interpolation36. ERA5 and ERA5-Land are produced by different versions of the Integrated Forecasting System, with ERA5-Land using an improved land surface model that better represents processes such as surface-atmosphere interactions, soil moisture dynamics, and vegetation compared to the one used in ERA5. In addition, to ensure sufficient training data, we assume that the mapping between large-scale fields and small-scale details in downscaling is consistent across time scales. Consequently, we use 6-hourly data spanning the period from 1961 to 2015 for model training (details in Table S1 and “Methods”).

Recognizing that many of the intricate details in high-resolution data are shaped by topography, we focus on the East Asia region (65°E-135°E, 15°N-55°N), which features some of the most complex and diverse terrain conditions. To comprehensively account for the impact of varying terrain, we add terrain and land-sea boundary data to the input of DPDM. Furthermore, we employ a training strategy that divides an input image into a series of patches and randomly selects a patch as the training input. This strategy enhances the diversity of the training data and reduces training complexity. A DPDM trained with this strategy demonstrates robust generalization and can easily be adapted to regions beyond East Asia, or to smaller areas, through simple fine-tuning (details in “Methods”). In terms of model architecture, we draw inspiration from the well-established SR3 model37. Additionally, to improve the efficiency of data generation, we adopt the sampling process of the Denoising Diffusion Implicit Model (DDIM38), which allows some denoising steps to be skipped and thereby trades a small loss of quality for generation speed; 250 steps were found to be sufficient for our purposes (ref. 39, Fig. S1 and Table S2).
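To illustrate the accelerated sampling, the following minimal sketch (our own illustration in Python; the linear variance schedule and function names are assumptions, not the exact configuration used here) subsamples a 1000-step training schedule to 250 DDIM inference steps and performs one deterministic update.

```python
import numpy as np

T_TRAIN = 1000   # diffusion steps used during training
T_SAMPLE = 250   # reduced number of denoising steps at inference time

betas = np.linspace(1e-4, 0.02, T_TRAIN)   # assumed linear variance schedule
alphas_bar = np.cumprod(1.0 - betas)

# Evenly spaced subset of timesteps visited by the accelerated sampler
ddim_timesteps = np.linspace(0, T_TRAIN - 1, T_SAMPLE, dtype=int)

def ddim_step(x_t, eps_pred, t, t_prev):
    """One deterministic DDIM update (eta = 0) from timestep t to t_prev."""
    a_t = alphas_bar[t]
    a_prev = alphas_bar[t_prev] if t_prev >= 0 else 1.0
    x0_pred = (x_t - np.sqrt(1.0 - a_t) * eps_pred) / np.sqrt(a_t)  # predicted clean field
    return np.sqrt(a_prev) * x0_pred + np.sqrt(1.0 - a_prev) * eps_pred
```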

Table 1 highlights the superiority of DPDM over other statistical downscaling methods. These include deterministic models such as the Enhanced Deep Residual Network for Super-Resolution with a Generative Adversarial framework (EDSR-GAN), known for its advantage over traditional statistical techniques40,41,42,43, as well as the linear interpolation (Lerp) widely used in meteorology44,45. The consistent results across a variety of meteorological variables show that DPDM outperforms the other models on key metrics, including the Anomaly Correlation Coefficient (ACC), Peak Signal-to-Noise Ratio (PSNR), Structure Similarity Index Measure (SSIM), and Root Mean Square Error (RMSE). This superiority is particularly evident for precipitation and temperature downscaling, which matter most for society in a warming climate. Furthermore, a distinct advantage of DPDM is its capacity to generate multiple ensemble members by sampling from the probability distribution. We compare these metrics for different ensemble sizes, including ensemble means with 33 and 100 members, and find that increasing the ensemble size enhances downscaling performance. It is worth noting that multi-member ensembles increase the inference time of DPDM relative to deterministic models (Fig. S2). Nevertheless, compared with dynamical regional climate models (RCMs), the computational cost of DPDM remains acceptable.

Table 1 Evaluation of the downscaling performance for five surface variables from 2016 to 2021 using Root Mean Square Error (RMSE), Anomaly Correlation Coefficient (ACC), Structure Similarity Index Measure (SSIM) and Peak Signal-to-Noise Ratio (PSNR), based on three downscaling methods: linear interpolation (Lerp), a deterministic model (EDSR-GAN) and the Diffusion Probabilistic Downscaling Model (DPDM) with different numbers of members

To assess their performance in reproducing the spatial distribution, we compare the patterns of climatological mean precipitation and its variance in the test dataset for the different downscaling methods (Fig. 1). The Lerp yields overly smooth results and fails to reproduce local details (Figs. 1a4–d4). In contrast, both the deterministic model and the DPDM capture the spatial details of the precipitation distribution and its temporal variability, as evidenced by improved spatial pattern correlation and reduced spatially averaged Normalized Root Mean Square Error (NRMSE; see Figs. 1 and S4).

Fig. 1: The downscaling results for climatological mean total precipitation and its standard deviation based on different methods.

a Observed and downscaled climatological mean precipitation during 2016–2021 based on the three different methods. b As in (a), but for the standard deviation of monthly total precipitation. c Bias of the downscaling results relative to the true high-resolution precipitation based on the different methods. d As in (c), but for the bias in variability (standard deviation), between the downscaling results and the high-resolution data. e Total precipitation distribution histograms based on the high-resolution (black curve), EDSR-GAN (yellow curve), DPDM with 100 members (red curve) and Lerp (blue curve).

However, we note that the deterministic method exhibits lower correlation in mid-to-high latitudes and plateau areas. This divergence can be attributed to the complex and variable land surface characteristics of such terrain, which produce different local details even under the same large-scale background conditions. In addition, the deterministic approach (EDSR-GAN) sometimes introduces spurious features46,47 (Figs. 1c2, 1d2). Interestingly, DPDM, especially with multiple members, effectively addresses these issues. This conclusion parallels the findings from the distribution histograms, where the Lerp struggles to represent the range of data variation and EDSR-GAN tends toward overestimation (see Fig. 1e and Fig. S2). For the downscaling of other meteorological variables, DPDM shows similar advantages over the deterministic model (Figs. S5 and S6).

To evaluate whether the downscaled results effectively capture high-resolution details, we employ an objective assessment based on R squared (R2) and blurriness. Blurriness serves as a crucial metric for objectively assessing whether a model is excessively smooth or sharp, avoiding subjective visual judgements20. We use the absolute difference between the high-resolution data and the Lerp results to quantify the detail contained in the high-resolution data (Fig. 2c1–c5), because the Lerp results are sufficiently smooth (Fig. S7). If the data fall predominantly to the left of the diagonal, the downscaled fields contain less information than the true high-resolution data, indicating a bias toward smoothing. Conversely, if the data lean toward the right of the diagonal, the results are excessively sharp, introducing information that deviates from reality. Notably, our DPDM with 100 members exhibits a better model fit than the deterministic model. The DPDM effectively preserves true local details in the downscaled results, although some smoothing is visible for precipitation. This evaluation highlights DPDM’s capability to reconstruct high-resolution details, even with multiple ensemble members, achieving similar or superior performance to the other methods (Fig. S8) while mitigating the excessive smoothing or sharpness observed in other models.
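As an illustration of this diagnostic, the sketch below (hypothetical variable names; not the evaluation code used for Fig. 2) measures the detail of a field as its absolute departure from the Lerp baseline and fits the downscaled detail against the true high-resolution detail.

```python
import numpy as np
from scipy.stats import linregress

def detail(field, lerp_baseline):
    """Absolute difference from the Lerp result, used as a proxy for local detail."""
    return np.abs(field - lerp_baseline)

def blurriness_fit(hr_truth, downscaled, lerp_baseline):
    """Points left of the 1:1 line indicate over-smoothing; right, spurious sharpness."""
    x = detail(downscaled, lerp_baseline).ravel()   # detail of the downscaled field
    y = detail(hr_truth, lerp_baseline).ravel()     # detail of the true high-resolution field
    slope, intercept, r, _, _ = linregress(x, y)
    return slope, r ** 2
```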

Fig. 2: Evaluation of R squared (R2) and blurriness of the downscaling results for different variables.

a1–a5 Scatter plots between the high-resolution observations and the downscaling results with the Lerp. The red line is the fitted line; the slope and R2 are shown in the legend. b1–b5 As in (a1–a5), but for the results of DPDM with 100 members. c1–c5 Two-dimensional kernel density estimation for DPDM with 100 members, used to assess the blurriness of the downscaled results. The x axis is the absolute difference between the downscaling outputs and the Lerp, and the y axis is the absolute difference between the high-resolution observations and the Lerp. The black dashed diagonal line represents the identity relation.

Uncertainties of DPDM

The DPDM can generate multi-member downscaling results through probability-distribution sampling. We therefore evaluate in detail how the number of ensemble members affects downscaling performance and why the members can be used to assess uncertainty (Fig. 3).

Fig. 3: The role of different members in the DPDM.

a Impact of ensemble size on the RMSE, ACC and PSNR of precipitation downscaling as the number of ensemble members increases. Each subset is based on non-repetitive random sampling of ‘n’ members from the existing pool of 100 members. To mitigate random errors, this resampling is repeated 20 times. b Taylor plots of different variables comparing individual members with the multi-member ensemble mean. Colors denote different meteorological variables, crosses individual members, and squares multi-member ensemble means. c Regional mean precipitation from different DPDM members in the Yangtze River Basin (24°-35°N, 90°-123°E) in July 2020. The red line represents the true high-resolution result, the black dashed line the Lerp, the blue histogram the probability density distribution of the 100 members, and the blue curve its fit. d As in (c), but for regional mean precipitation in South China (18°-26.5°N, 90°-123°E) in July 2020.

As the number of members increases, the ability of DPDM to capture true details improves in all respects48. An interesting feature is that the improvement in downscaling skill from 30 to 100 members is relatively small. This is reminiscent of the behaviour of dynamical models that use large ensembles for prediction. Part of the reason is that approximately 30 members are sufficient to represent most of the high-resolution details and to cover the uncertainty space of the downscaling. The ensemble scheme of dynamical models is similar to that of DPDM in that both involve sampling multiple realizations, although the sampling methods differ: dynamical models use randomly perturbed parameterizations or physical processes to generate multiple results, whereas DPDM samples multiple members from the probability distribution.

We find considerable variety among the members, as shown in the spatial pattern of precipitation in July 2020 in Fig. S9. Figure 3b highlights the considerable uncertainty in the model’s downscaling output, but it is worth noting that averaging over multiple members further improves the downscaling skill. Furthermore, we conduct a specific assessment of extreme rainfall events. In July 2020, severe floods occurred in the Yangtze River Basin in East Asia, and drought occurred in southern China (Figs. 3c and 3d). In both extreme cases, the multi-member ensemble mean is consistently closer to the true values than the Lerp and single-member results, whether dealing with sparse rainfall during drought or heavy rainfall during floods. The distribution of the multi-member results closely approximates a normal distribution, which may better measure the uncertainty of the atmospheric processes that determine the local details. The downscaling of other atmospheric variables shows similar results (Fig. S10). Additionally, we have evaluated the spread of DPDM with 100 members (Figs. S11 and S12). Unlike many other downscaling models, which tend to underestimate the spread, DPDM generates sufficient ensemble spread, even overestimating it for each variable. This overestimation suggests that DPDM can accommodate additional sources of uncertainty beyond the downscaling process itself.
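A minimal sketch of how such member statistics can be summarized is given below (illustrative names and a simple cos-latitude weighting; not the analysis code behind Fig. 3c, d): the regional mean is computed for each member and a normal distribution is fitted to the resulting sample.

```python
import numpy as np
from scipy.stats import norm

def regional_mean(field, lat, lon, lat_bounds, lon_bounds):
    """Area-weighted mean of a (lat, lon) field over a rectangular box."""
    box = ((lat[:, None] >= lat_bounds[0]) & (lat[:, None] <= lat_bounds[1]) &
           (lon[None, :] >= lon_bounds[0]) & (lon[None, :] <= lon_bounds[1]))
    weights = np.cos(np.deg2rad(lat))[:, None] * np.ones(lon.size)[None, :]
    return np.average(field[box], weights=weights[box])

def ensemble_distribution(member_means):
    """Fit a normal distribution to the regional means of all ensemble members."""
    mu, sigma = norm.fit(member_means)   # ensemble mean and spread
    return mu, sigma
```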

Application: downscaling a 180-year surface climate dataset

With the success of DPDM, we now apply the method to reconstruct high-resolution historical climate data for East Asia over the past 180 years. Additionally, we explore several potential application scenarios for this high-resolution dataset. These scenarios not only demonstrate the model’s capability for small-scale reconstruction but also address several climate change-related questions of broad academic interest.

We select the NOAA-20C dataset as the low-resolution input because it is the only global dataset covering nearly two centuries, from 1836 to 2015, encompassing the entire industrialization period. It also provides multiple temporal resolutions and circulation data, which not only provides the basis for constructing high-resolution surface data at six-hourly intervals but also allows the reconstructed high-resolution data to be combined with the circulation fields to explore mechanisms, especially the attribution of extreme events.

To ensure the reliability of the reconstructed data, we evaluate it against the widely used CRU station dataset. Given the inherent credibility issues of reanalysis datasets and the fact that the observation stations provide only precipitation and temperature, we conduct a relative-error analysis of the DPDM and Lerp results against these two variables. This is admittedly a crude way to evaluate our results, but no better option exists in the absence of high-resolution observational data for such a long period (1836 to 2015). Figures 4a, b reveal a noticeable reduction in relative error at most stations when DPDM is employed, indicating that the reconstructed data are closer to the station observations.

Fig. 4: Applicable scenarios for high-resolution datasets over the past 180 years using DPDM.

a Comparison of the relative errors of the Lerp and DPDM relative to station observations. Red points indicate that DPDM has a lower error than the Lerp in precipitation downscaling, and blue points indicate the opposite. b As in (a), but for 2-m temperature. c Distribution of drought area in the mid-to-high latitudes of East Asia. The shading represents the temporal average of the Aridity Index (AI) from 2005 to 2015 at low resolution. The blue solid line represents the AI = 0.65 contour for 1845 to 1855 and the blue dashed line the 11-year running mean AI = 0.65 contour from 1855 to 2005, both at low resolution. The green line represents the AI = 0.65 contour for 2005 to 2015 at high resolution. d Comparison of regional averages based on the high- and low-resolution datasets. The upper panel shows the proportion of dry area in the middle and high latitudes of East Asia (33°-55°N, 65°-135°E) at the two resolutions. The lower panel shows the seasonal mean precipitation in the rainy season (May-September) and the precipitation trend during the past 40 years in northwest China (35°-50°N, 70°-110°E) at the two resolutions. The red line indicates the ensemble-mean result of DPDM, and the red shading gives the uncertainty among the DPDM members. The blue line represents the result of the low-resolution NOAA-20C. e Frequency of compound hot-dry events in North China over the past 180 years from the high-resolution reconstructed data. f As in (e), but for NOAA-20C. g Trend of wind power in Xinjiang from 1960 to 2015 based on the high-resolution reconstructed data. h As in (g), but for NOAA-20C. i Differences in the wind power trends in Xinjiang between the two resolutions from 1960 to 2015.

To assess changes in aridity over the past centuries with the high-resolution reconstructed dataset, we compute the Aridity Index (AI; see Methods), a commonly applied metric with a dryness threshold of 0.65. In Fig. 4c, the shading represents the temporal average of AI from 2005 to 2015. The blue line, comprising both solid and dashed segments, depicts the changes in the 11-year running mean AI = 0.65 contour at low resolution, while the green line shows the AI = 0.65 contour for 2005 to 2015 at high resolution. The findings suggest that, in the context of global warming, the low-resolution data underestimate the expansion of aridity in the mid-to-high latitudes of East Asia and northern China. Figure 4d further quantifies the areal proportion of drought regions (65°E-135°E, 33°N-55°N) on a decade-by-decade basis. Remarkably, even when accounting for the uncertainty among ensemble members, the low-resolution data persistently underestimate the drought area by approximately 3%.
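The decade-by-decade dry-area fraction in Fig. 4d can be sketched as follows (an illustrative reimplementation with assumed array layouts, using simple cos-latitude area weighting and the AI < 0.65 threshold).

```python
import numpy as np

def dry_area_fraction(ai, lat, threshold=0.65):
    """Fraction of the domain area with Aridity Index below the dryness threshold.

    ai  : 2-D array (lat, lon) of decadal-mean AI over 33-55N, 65-135E
    lat : 1-D array of grid latitudes in degrees
    """
    weights = np.cos(np.deg2rad(lat))[:, None] * np.ones(ai.shape[1])[None, :]
    dry = (ai < threshold).astype(float)
    return float(np.sum(dry * weights) / np.sum(weights))

# Usage (hypothetical inputs): compare high- and low-resolution estimates per decade
# frac_hr = dry_area_fraction(ai_hr_decade, lat_hr)
# frac_lr = dry_area_fraction(ai_lr_decade, lat_lr)
```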

Regarding the recent warming and wetting trend in northwestern China, we examine precipitation changes in the region (75°E-110°E, 30°N-50°N). The analysis reveals an increasing trend in precipitation since 1970, but the low-resolution data overestimate this trend at 0.79 mm/day per decade. In contrast, the high-resolution data offer a finer estimate together with an uncertainty range for assessing credibility, indicating a trend of 0.69 mm/day per decade within a range of 0.65-0.78 mm/day per decade. In addition, the precipitation distribution histograms for the rainy season (MJJAS) show that the high-resolution data correctly capture the differences in climate characteristics between regions, yielding less precipitation in dry regions and more in humid regions than the low-resolution data (Fig. S13).

High-resolution data can provide important details for assessing extreme events. We therefore conduct a simple evaluation of compound hot-dry extreme events (Figs. 4e and 4f; Methods). In North China, the high-resolution reconstructed data clearly provide greater detail, detecting more extreme events and offering a finer characterization of the areas prone to them, while also augmenting the available samples for subsequent attribution and composite analyses. Although wind speed lacks observational records, the climate statistics indicate that the DPDM results likewise yield more detailed features (Fig. S14). In previous studies49, the trend in wind power under global warming has often been estimated from low-resolution data; the high-resolution data, however, reveal several regions undergoing faster change (Fig. 4g–i).

In summary, the high-resolution data generated by DPDM not only show a reasonable level of credibility but also enhance our understanding of climate details. This high-resolution dataset, covering nearly two centuries, may provide important details for an improved understanding of historical climate change.

Discussion

In this study, we have introduced a novel probabilistic downscaling model, DPDM, for climate downscaling. DPDM not only accurately simulates the probability distribution function of high-resolution data, but also generates a large number of members to quantify the uncertainty of the downscaled information. The latter is important because small-scale conditions under a given large-scale background are never deterministic. In addition, the downscaling framework of DPDM has great potential in medium-range weather forecasting, climate prediction and future scenario projections. For instance, by generating an adequate number of ensemble members, it can emulate traditional approaches such as single-model initial-condition large ensembles for identifying and robustly sampling extreme events50,51.

It is undeniable that DPDM still has room for improvement. Introducing additional circulation conditions and external forcings could enhance the model with more physical constraints52. The approach also holds promise for bias correction and downscaling of dynamical model predictions. As for the high-resolution climate dataset spanning the past centuries, while it offers valuable insights and applications, it is limited in the number of available variables and in temporal resolution. With sufficient computing resources, the temporal resolution could be increased to 6-h intervals and other surface or upper-atmosphere variables could be downscaled to enable more comprehensive analyses. Additionally, NOAA-20CR provides corresponding ensemble spread data, which quantify the uncertainty in the initial conditions. In the future, we could sample from this spread to generate ensemble members that incorporate data uncertainty and then apply DPDM to each member separately. This would produce a larger ensemble distribution that incorporates not only the uncertainties of the downscaling process but also those of the initial conditions. These extensions are left for future study.

In addition, the probabilistic essence and robust mathematical foundation of the diffusion model open up a wealth of possibilities for practical applications in climate science. Its applicability extends far beyond downscaling; it holds potential for forecasting, assimilation, data reconstruction, model bias correction, sensitivity experiments, scientific inquiry, and even causal analysis. We believe the time has come to explore applications of the diffusion model, tackling intriguing scientific questions and contributing to the advancement of climate science.

Methods

Diffusion probabilistic downscaling model

Denoising diffusion probabilistic models have become increasingly influential in recent years. They are a class of generative models inspired by nonequilibrium thermodynamics53 that define a bidirectional Markov chain of length T. The forward diffusion process gradually adds Gaussian noise to the original input data \(x_0\), generating a sequence of data \(x_1\ldots x_T\). The reverse diffusion process iteratively removes noise from the noisy image \(x_t\) by sampling from \(p\left(x_{t-1} \mid x_t\right)\). To generate a new data sample, a Gaussian noise map \(x_T\) is drawn from the standard Gaussian distribution \(N\left(0, I\right)\) as the input of the model. Then, at each diffusion step t, the model \(\epsilon_\theta\) predicts the noise \(\epsilon_\theta\left(x_t, t\right)\) from the current state \(x_t\) and gradually removes it, thus reversing the diffusion process.

DPDM is a variant of the conditional diffusion probabilistic model, with the low-resolution input, topography, and land-sea mask serving as conditions, collectively denoted \(\widetilde{x}\). DPDM removes noise from the noisy image \(x_t\) by sampling from the conditional probability \(p\left(x_{t-1} \mid x_t, \widetilde{x}\right)\). The model is trained with the L2 loss between the predicted and actual noise, \({\left|\left|\epsilon -\epsilon_{\theta}\left(x_t, t, \widetilde{x}\right)\right|\right|}_{2}^{2}\). Here, we give a brief introduction to the forward and reverse processes of DPDM (more details in Ho et al.33,38,39,54).
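A minimal PyTorch sketch of one training step under this objective is shown below (the network `model`, its call signature and the tensor names are placeholders; only the noise injection of Eq. (3) and the L2 noise loss follow the text).

```python
import torch
import torch.nn.functional as F

def training_step(model, x0, x_tilde, alphas_bar):
    """x0: high-resolution target patch (B, 1, H, W);
    x_tilde: conditioning fields (Lerp'd low-res, topography, land-sea mask),
    concatenated along the channel dimension."""
    b = x0.shape[0]
    t = torch.randint(0, alphas_bar.shape[0], (b,), device=x0.device)  # random timestep
    a_bar = alphas_bar[t].view(b, 1, 1, 1)
    eps = torch.randn_like(x0)                                          # true injected noise
    x_t = torch.sqrt(a_bar) * x0 + torch.sqrt(1.0 - a_bar) * eps        # forward process, Eq. (3)
    eps_pred = model(torch.cat([x_tilde, x_t], dim=1), t)               # conditional noise prediction
    return F.mse_loss(eps_pred, eps)                                    # L2 loss on the noise
```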

Forward diffusion process

The forward process of DPDM iteratively constructs a mapping from the high-resolution data distribution to a pure Gaussian distribution using a Markov chain of total length T. More specifically, given a sample from the real high-resolution data distribution \(x_0 \sim q\left(x\right)\), the forward diffusion process gradually adds Gaussian noise to \(x_0\) over T steps, with the noise level at each step t controlled by a variance schedule \(\beta_1\ldots \beta_T\), producing a sequence of increasingly noisy samples \(x_1\ldots x_T\):

$$q\left(x_t \mid x_{t-1}\right)=N\left(x_t;\,\sqrt{1-\beta_t}\,x_{t-1},\,\beta_t I\right)$$
(1)

If the magnitude \(\beta_t\) of the noise added at each step is small enough and the total number of steps T is large enough (in our experiments, T is set to 1000), then \(x_T\) is equivalent to an isotropic Gaussian distribution, \(x_T \sim N\left(0, I\right)\). With the help of the Markov property, we thereby connect the high-resolution data distribution \(q\left(x\right)\) with the Gaussian distribution \(N\left(0, I\right)\), and the full forward trajectory can be written as:

$$q\left(x_{1:T} \mid x_0\right)=\prod_{t=1}^{T}q\left(x_t \mid x_{t-1}\right)=\prod_{t=1}^{T}N\left(x_t;\,\sqrt{1-\beta_t}\,x_{t-1},\,\beta_t I\right)$$
(2)

To obtain \(x_t\) without iterating (for fast training), we can further expand \(x_t\) with the help of the reparameterization trick and the additive property of Gaussian distributions.

Letting \(\alpha_t=1-\beta_t\) and \(\bar{\alpha}_t=\prod_{i=1}^{t}\alpha_i\), we obtain \(x_t\) and \(q\left(x_t \mid x_0\right)\):

$$x_t=\sqrt{\alpha_t}\,x_{t-1}+\sqrt{1-\alpha_t}\,\epsilon_{t-1}=\sqrt{\bar{\alpha}_t}\,x_0+\sqrt{1-\bar{\alpha}_t}\,\epsilon,\quad \epsilon \sim N\left(0,\,I\right)$$
(3)
$$q\left(x_t \mid x_0\right)=N\left(\sqrt{\bar{\alpha}_t}\,x_0,\,\left(1-\bar{\alpha}_t\right)I\right)$$
(4)
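A quick numerical check of this property (with an assumed linear beta schedule, since the schedule itself is not specified in the text) confirms that \(\bar{\alpha}_T\) becomes vanishingly small for T = 1000, so \(x_T\) in Eq. (4) is effectively standard Gaussian noise.

```python
import numpy as np

T = 1000
betas = np.linspace(1e-4, 0.02, T)      # assumed variance schedule beta_1 ... beta_T
alpha_bar = np.cumprod(1.0 - betas)

print(alpha_bar[-1])                     # ~4e-5: the original signal is almost fully destroyed

x0 = np.random.rand(128, 128)            # a dummy high-resolution patch
eps = np.random.randn(128, 128)
x_T = np.sqrt(alpha_bar[-1]) * x0 + np.sqrt(1 - alpha_bar[-1]) * eps   # Eq. (3) at t = T
```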

Reverse diffusion process

Now we can conveniently sample \(x_t\) at an arbitrary timestep t of the forward process. At timestep T, we sample a Gaussian noise map \(x_T\) from the Gaussian distribution \(N\left(0, I\right)\). We then need to estimate \(q\left(x_{t-1} \mid x_t\right)\) to remove the Gaussian noise added in the forward process of DPDM. Unfortunately, we cannot estimate \(q\left(x_{t-1} \mid x_t\right)\) directly because it would require the entire dataset. Therefore, we train the DPDM model \(p_\theta\), conditioned on the state \(x_t\) at timestep t and on the conditional data \(\widetilde{x}\), to approximate these conditional probabilities and obtain the next state \(x_{t-1}\).

$$p_\theta\left(x_{t-1} \mid x_t,\,\widetilde{x}\right)=N\left(x_{t-1};\,\mu_\theta\left(x_t, t,\,\widetilde{x}\right),\,\Sigma_\theta\left(x_t, t\right)\right)$$
(5)

Through Bayes' rule, we can obtain Eqs. (6) and (7). Based on the Markov assumption, Eq. (8) follows from (7). Finally, by substituting the Gaussian distributions of the forward process, all components of Eq. (8) can be expressed in closed form, simplifying it to Eq. (9):

$$q\left(x_{t-1} \mid x_t\right)=\frac{q\left(x_t \mid x_{t-1}\right)q\left(x_{t-1}\right)}{q\left(x_t\right)}$$
(6)
$$q\left(x_{t-1} \mid x_t, x_0\right)=\frac{q\left(x_t \mid x_{t-1}, x_0\right)q\left(x_{t-1} \mid x_0\right)}{q\left(x_t \mid x_0\right)}$$
(7)
$$q\left(x_{t-1} \mid x_t, x_0\right)=\frac{q\left(x_t \mid x_{t-1}\right)q\left(x_{t-1} \mid x_0\right)}{q\left(x_t \mid x_0\right)}$$
(8)
$$q\left(x_{t-1} \mid x_t, x_0\right)=N\left(x_{t-1};\,\tilde{\mu}_t\left(x_t, x_0\right),\,\tilde{\beta}_t I\right),\quad \tilde{\beta}_t=\frac{\left(1-\bar{\alpha}_{t-1}\right)\beta_t}{1-\bar{\alpha}_t}$$
(9)

Following the standard Gaussian distribution function, the mean and variance can be parameterized as follows:

$$\mu_\theta\left(x_t, t,\,\widetilde{x}\right)=\frac{1}{\sqrt{\alpha_t}}\left(x_t-\frac{1-\alpha_t}{\sqrt{1-\bar{\alpha}_t}}\,\epsilon_\theta\left(x_t, t,\,\widetilde{x}\right)\right),\qquad \Sigma_\theta\left(x_t, t\right)=\frac{\beta_t\left(1-\bar{\alpha}_{t-1}\right)}{1-\bar{\alpha}_t}I \approx \beta_t I$$
(10)

According to Eqs. (9) and (10), we obtain the sampling rule (11), from which \(x_{t-1}\) is drawn, thereby realizing the reverse denoising process:

$$x_{t-1}=\frac{1}{\sqrt{\alpha_t}}\left(x_t-\frac{1-\alpha_t}{\sqrt{1-\bar{\alpha}_t}}\,\epsilon_\theta\left(x_t, t,\,\widetilde{x}\right)\right)+\sqrt{\beta_t}\,\epsilon,\quad \epsilon \sim N\left(0,\,I\right)$$
(11)
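The reverse process can be sketched as the following sampling loop implementing Eq. (11) (a simplified illustration assuming a single output channel; `epsilon_model` and the conditioning tensor stand in for the trained DPDM network and its inputs).

```python
import torch

@torch.no_grad()
def reverse_sample(epsilon_model, x_tilde, betas):
    """x_tilde: (B, C, H, W) conditioning fields; returns a downscaled field (B, 1, H, W)."""
    alphas = 1.0 - betas
    alpha_bar = torch.cumprod(alphas, dim=0)
    x_t = torch.randn_like(x_tilde[:, :1])                       # x_T ~ N(0, I)
    for t in reversed(range(betas.shape[0])):
        t_batch = torch.full((x_t.shape[0],), t, device=x_t.device)
        eps_pred = epsilon_model(torch.cat([x_tilde, x_t], dim=1), t_batch)
        mean = (x_t - (1 - alphas[t]) / torch.sqrt(1 - alpha_bar[t]) * eps_pred) \
               / torch.sqrt(alphas[t])
        noise = torch.randn_like(x_t) if t > 0 else torch.zeros_like(x_t)
        x_t = mean + torch.sqrt(betas[t]) * noise                 # Eq. (11)
    return x_t
```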

Training details and model structure

To obtain a sufficient volume of training data, we employ a dataset with 6-h time resolution rather than the monthly dataset. We establish a connection between the ERA5 product (1° spatial resolution) and the high-resolution ERA5-Land dataset (0.1° spatial resolution). These reanalyses are provided by the Copernicus Climate Change Service at ECMWF; they combine a wide range of satellite- and land-based observations with high-resolution model simulations through state-of-the-art data assimilation, and span the period from 1961 to 2021.

In this study, we split the ERA5-Land and ERA5 datasets into a training period (1961–2015) and a test period (2016–2021). To account for the effect of terrain, we use topography, the land-sea mask and the low-resolution data as inputs to DPDM during training. We randomly crop 128×128 patches from the original low-resolution data and take the corresponding high-resolution data as targets, instead of training on the full low- and high-resolution fields. This strategy not only increases the diversity of the data used to learn the mapping from low to high resolution, but also reduces the consumption of computing resources such as graphics memory and computational cost. In an ablation study (Fig. S15), we investigated the effect of adding terrain information to the patch input and explored the impact of different patch sizes (32, 64, 128, 256) on model performance. Incorporating terrain information led to faster loss convergence, a patch size of 128 yielded the best performance, and a patch size of 256 resulted in a non-converging loss.
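The random-patch strategy can be sketched as follows (an illustration with assumed array layouts on the 0.1° target grid; the conditioning fields are those named in the text).

```python
import numpy as np

def random_patch(hr, lerp_lr, topo, lsm, size=128, rng=np.random):
    """Crop a random size x size patch from the target field and its conditions.

    hr      : high-resolution target on the 0.1-degree grid
    lerp_lr : low-resolution field interpolated (Lerp) onto the same grid
    topo    : standardized topography;  lsm : 0/1 land-sea mask
    """
    ny, nx = hr.shape
    j = rng.randint(0, ny - size + 1)
    i = rng.randint(0, nx - size + 1)
    crop = np.s_[j:j + size, i:i + size]
    condition = np.stack([lerp_lr[crop], topo[crop], lsm[crop]])   # channel-first conditions
    return condition, hr[crop]
```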

Through the patch strategy, the model can process low-resolution data dynamically to generate high-resolution data for any region and condition, rather than fitting only a specific region, which improves its generalization ability. For example, it can produce high-resolution results outside the target region without retraining, albeit with limited effectiveness, as seen in the Gulf of Mexico (Fig. S16) and northern Australia (Fig. S17). Note that DPDM also uses the patch strategy for inference rather than processing the entire low-resolution field at once: the input image is divided into patches, each patch is processed individually, and the results are combined to obtain the final output. To ensure continuity at the boundaries, we introduce overlap between patches. Some quality degradation may still occur at patch boundaries; increasing the overlap or employing more advanced boundary-restoration techniques can yield more consistent results, but requires additional computational resources. For the different surface variables, we adopt different normalization strategies to facilitate rapid loss convergence and training stability. Precipitation is transformed by adding one and taking the logarithm; other variables are standardized and scaled to the range between 0 and 1. We also standardize the topography data and convert the land-sea mask to a matrix containing only 0 and 1.
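The per-variable normalization can be sketched as below (the min-max constants are assumptions of the example, since the exact scaling values are not stated).

```python
import numpy as np

def normalize_precip(p):
    """Precipitation: add one, then take the logarithm."""
    return np.log(p + 1.0)

def denormalize_precip(x):
    return np.exp(x) - 1.0

def normalize_minmax(v, vmin, vmax):
    """Other surface variables: scale to the range [0, 1]."""
    return (v - vmin) / (vmax - vmin)

def binarize_land_sea_mask(lsm, threshold=0.5):
    """Reduce the land-sea mask to a matrix containing only 0 and 1."""
    return (lsm >= threshold).astype(np.float32)
```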

The DPDM architecture is similar to SR3, a U-Net-like architecture (Fig. S1). At each time step t, the model's input is the concatenation of two datasets: the conditional data \(\widetilde{x}\) and the noisy data \(x_t\), both with the same dimensions as the high-resolution data \(x_0\). The conditional data \(\widetilde{x}\) comprise the Lerp-interpolated low-resolution field, topography, and land-sea mask information. Concatenation is a simple and effective way to feed the conditional data to the model. A convolutional layer with 64 kernels is then used to extract features from the input. The downsampling modules of DPDM consist of several residual blocks and self-attention layers55. Based on empirical experience, we use only 3 downsampling modules, reducing the spatial dimension from 128 to 16. The upsampling modules are symmetric to the downsampling modules. All convolutions in the model use a 3×3 kernel and a stride of 1. To encode the timestep t, we use a sinusoidal positional encoder55 followed by two fully connected layers with a sigmoid linear unit (SiLU) activation between them. The encoded timestep feature is then added to the intermediate feature maps after the group normalization operator in each residual block.
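The timestep encoding described above can be sketched as follows (a PyTorch illustration; the embedding width of 64 is an assumption made for the example).

```python
import math
import torch

class TimestepEmbedding(torch.nn.Module):
    """Sinusoidal positional encoding of t, followed by two fully connected
    layers with a SiLU activation in between."""
    def __init__(self, dim=64):
        super().__init__()
        self.dim = dim
        self.mlp = torch.nn.Sequential(
            torch.nn.Linear(dim, dim), torch.nn.SiLU(), torch.nn.Linear(dim, dim))

    def forward(self, t):
        half = self.dim // 2
        freqs = torch.exp(-math.log(10000.0) * torch.arange(half, device=t.device) / half)
        angles = t.float()[:, None] * freqs[None, :]
        emb = torch.cat([torch.sin(angles), torch.cos(angles)], dim=-1)
        return self.mlp(emb)   # added to feature maps after group normalization
```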

Climate index and compound events

An aridity index (AI56) is a numerical indicator of the degree of dryness of the climate at a given location.

$${AI}=\frac{{\rm{P}}}{{\rm{PET}}}$$
(12)

where PET is the potential evapotranspiration, calculated with the Climate Indices python package (https://climate-indices.readthedocs.io/en/latest/), and \({\rm{P}}\) is the annual average precipitation.
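Equation (12) amounts to a simple element-wise ratio; a minimal sketch (with PET assumed to be precomputed, e.g. with the package above) is:

```python
import numpy as np

def aridity_index(annual_precip, annual_pet):
    """AI = P / PET; values below 0.65 mark dry climates in the analysis above."""
    return np.asarray(annual_precip) / np.asarray(annual_pet)
```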

Compound hot-dry events are defined as the co-occurrence of a high mean temperature anomaly (above the 90th percentile) and a low mean precipitation anomaly (below the 10th percentile) over the warm season51 (i.e., the three consecutive months with the highest mean temperature during 1836–2015) at each grid point. The centennial trends of the two variables are removed beforehand.
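A minimal sketch of this definition for a single grid point is given below (a simple linear detrending stands in for the trend removal; the yearly warm-season input arrays are assumptions of the example).

```python
import numpy as np

def compound_hot_dry_events(t_warm, p_warm):
    """t_warm, p_warm: 1-D arrays of yearly warm-season mean temperature and precipitation."""
    years = np.arange(t_warm.size)
    t_anom = t_warm - np.polyval(np.polyfit(years, t_warm, 1), years)   # detrended anomalies
    p_anom = p_warm - np.polyval(np.polyfit(years, p_warm, 1), years)
    hot = t_anom > np.percentile(t_anom, 90)
    dry = p_anom < np.percentile(p_anom, 10)
    return hot & dry   # True where a compound hot-dry event occurs
```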

Wind power57 is a typical measure of wind energy potential, defined as follows:

$${Wind\; power}=\frac{1}{2}\rho {W}_{h}^{3}$$
(13)

where ρ represents the air density, assumed to be a constant \(1.213\ {kg}\,{m}^{-3}\) under standard atmospheric conditions, and \({W}_{h}\) is approximated by the wind speed at a height of 10 m.

Metrics

Based on the downscaling results, we compute several metrics, namely the Anomaly Correlation Coefficient (ACC), Peak Signal-to-Noise Ratio (PSNR), Structure Similarity Index Measure (SSIM), Root Mean Square Error (RMSE), and Normalized Root Mean Square Error (NRMSE), defined as follows:

$${ACC}=\mathop{\sum }\limits_{m=1}^{12}\frac{{\sum }_{y=s}^{e}\left({O}_{y,m}-{\overline{{O}_{m}}}\right)\left({D}_{y,m}-{\overline{{D}_{m}}}\right)}{\sqrt{{{\sum }_{y=s}^{e}\left({O}_{y,m}-{\overline{{O}_{m}}}\right)}^{2}{{\sum }_{y=s}^{e}\left({D}_{y,m}-{\overline{{D}_{m}}}\right)}^{2}}}$$
(14)
$${PSNR}=\frac{1}{n}\mathop{\sum }\limits_{i=1}^{n}10* {\log }_{10}\left(\frac{{{MaxValue}}^{2}}{{MSE}}\right)$$
(15)
$${MSE}=\frac{\mathop{\sum }\nolimits_{j=1}^{{N}_{{lat}}}\mathop{\sum }\nolimits_{k=1}^{{N}_{{lon}}}{({D}_{j,k}-{O}_{j,k})}^{2}}{{N}_{{lat}}\times {N}_{{lon}}}$$
(16)
$${SSIM}=\frac{1}{n}\mathop{\sum }\limits_{i=1}^{n}\frac{\left(2{\mu }_{D,i}{\mu }_{O,i}+{C}_{1}\right)\left(2{\sigma }_{D,i}{\sigma }_{O,i}+{C}_{2}\right)}{\left({\mu }_{D,i}^{2}{+\mu }_{O,i}^{2}+{C}_{1}\right)\left({\sigma }_{D,i}^{2}{+\sigma }_{O,i}^{2}+{C}_{2}\right)}$$
(17)
$${RMSE}=\sqrt{\frac{1}{n}\mathop{\sum }\limits_{i=1}^{n}\frac{\mathop{\sum }\nolimits_{j=1}^{{N}_{{lat}}}\mathop{\sum }\nolimits_{k=1}^{{N}_{{lon}}}{({D}_{i,j,k}-{O}_{i,j,k})}^{2}}{{N}_{{lat}}\times {N}_{{lon}}}}$$
(18)
$${NRMSE}=\frac{\sqrt{\frac{1}{n}\mathop{\sum }\nolimits_{i=1}^{n}{({O}_{i}-{D}_{i})}^{2}}\,}{\sqrt{\frac{1}{n}\mathop{\sum }\nolimits_{i=1}^{n}{({O}_{i}-\bar{O})}^{2}}\,}$$
(19)

Here, \(O\) and \(D\) denote the observed and downscaled results, respectively. \({\overline{{O}_{m}}}\) and \({\overline{{D}_{m}}}\) denote the climatologies of the observed and downscaled results for each calendar month m (from 1 to 12). The label \(y\) denotes the target year, and \(s\) and \(e\) denote the earliest (2016) and latest (2021) years of the validation, respectively. \({MaxValue}\) denotes the maximum value of the normalized data (that is, 1). \({\mu }_{D,i}\) and \({\sigma }_{D,i}\) represent the spatial mean and standard deviation of the downscaled results, while \({\mu }_{O,i}\) and \({\sigma }_{O,i}\) represent those of the observations. \({C}_{1}\) and \({C}_{2}\) are constants that avoid numerical instability when the denominator approaches zero.
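For reference, a minimal sketch of the RMSE, PSNR and NRMSE definitions for a single time step on a (lat, lon) grid is given below (inputs assumed to be normalized to [0, 1], so MaxValue = 1).

```python
import numpy as np

def mse(downscaled, observed):
    return np.mean((downscaled - observed) ** 2)

def rmse(downscaled, observed):
    return np.sqrt(mse(downscaled, observed))                           # Eq. (18) for one time step

def psnr(downscaled, observed, max_value=1.0):
    return 10.0 * np.log10(max_value ** 2 / mse(downscaled, observed))  # Eq. (15)

def nrmse(downscaled, observed):
    return rmse(downscaled, observed) / np.std(observed)                # Eq. (19)
```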