1 Introduction

Global Warming Levels (GWLs) provide a simple climate change dimension for assessing the regional responses of climatic variables and indices as a function of global warming (typically at + 1.5, 2, 3 and 4 °C levels). The use of GWLs became widespread after the Paris Agreement goal of keeping global temperature increase well below 2 °C with respect to the pre-industrial period (UNFCCC 2016). As a result, many recent climate change impact and adaptation studies use GWLs as a convenient and policy-relevant climate dimension (IPCC 2022), as an alternative to the use of fixed periods (typically near-, mid- and far-future) across different scenarios.

GWLs are defined using Global Climate Model (GCM) projections to calculate mean climatic (e.g. 20-year averages) changes in Global Surface Air Temperature (GSAT) along the twenty-first century, relative to a pre-industrial period, typically 1850–1900 (Seneviratne et al. 2021; Cross-Chapter Box 11.1). For a particular warming level (e.g. + 2 °C), the periods when each model first reaches this warming threshold are annotated (different periods for different models) and then used to compute the corresponding regional changes as differences between the variable values in these periods and the baseline. There seems to be a consensus on the fact that GWL responses are consistent across different scenarios for many climate variables (Pendergrass et al. 2015; Seneviratne et al. 2016; Seneviratne and Hauser 2020; Wartenburger et al. 2017), even though there are studies showing small but significant dependencies of the scaling pattern on emission scenarios, especially at middle and high latitudes of the Northern Hemisphere (Maule et al. 2017; Ishizaki et al. 2012). Therefore, it is common practice to pool across models and scenarios the regional change signals for each particular level. GWL information is typically displayed in the form of maps representing the spatial change patterns of variables or indices for specific GWLs (see e.g. Gutiérrez et al. 2021) or as regional climate sensitivity plots, displaying changes of the regionally aggregated variable of interest versus the global warming level (Seneviratne and Hauser 2020). The latter can be expressed as the rate of change of a variable over a region (e.g. degrees change of temperature or percent change of precipitation) per degree increase in GSAT. This approach implies a linear relationship between patterns of regional climate change and the average global temperature change, which was already found for several variables in Europe (Matte et al. 2019).

GWL products are typically obtained using multi-model ensembles of centennial global climate projections produced in the Coupled Model Intercomparison Project (CMIP). The two latest ensembles available are CMIP5 (Taylor et al. 2011) and CMIP6 (Eyring et al. 2016), with typical GCM resolutions of around 200 and 100 km, respectively. Moreover, GWLs have been extended to work also with Regional Climate Models (RCMs). RCMs provide higher resolution results operating over a limited region as compared to the GCM outputs that drive them at the boundaries (Gutowski et al. 2020; Jacob et al. 2020; Giorgi et al. 2022). In particular, the Coordinated Regional Climate Downscaling Experiment (CORDEX, https://cordex.org) provides multi-model ensembles of regional climate projections driven by CMIP5 model outputs over 14 continental domains for a variety of GCM and RCM combinations (Giorgi and Gutowski 2015; Gutowski et al., 2016). These regional projections are highly demanded for regional impact and adaptation studies and their resolution ranges from 50 to 10 km, depending on the domain. GWLs have been extensively used in the latest IPCC report to assess regional climate change (IPCC 2021a). In particular, the Atlas Chapter and the Interactive Atlas (Gutiérrez et al. 2021) provide GWL results for both global CMIP5 and CMIP6 and regional CORDEX multi-model ensembles, facilitating the analysis of regional climate sensitivity to global warming for global and regional climate simulations.

The main problem to extend the GWL methodology to RCMs is that they operate on regional domains and, therefore, they do not provide global GSAT information to compute the periods when GWLs are reached. To overcome this problem, the standard practice has been to use the corresponding periods from the driving GCM as a proxy (Déqué et al. 2017; Nikulin et al. 2018; Ciarlo` et al. 2021), under the assumption that these proxy periods are suitable for estimating the regional climate sensitivity of RCMs as a function of global warming. Assessing the consistency of regional climate change sensitivity between RCM simulations and their driving GCMs across broad climatic regions would shed some light on this assumption and is key for further research exploring the added value of regional vs global projections for regional GWL studies.

Here, we explore this problem by analyzing the consistency between CMIP5 and CORDEX for different GWLs, using the data available from the IPCC-WGI Atlas repository (Iturbide et al. 2021, 2022) over the IPCC AR6 reference regions (Iturbide et al. 2020). In particular, we analyze the regional responses to global warming using GWL scaling plots (Seneviratne and Hauser 2020). These plots represent regionally aggregated changes (for mean temperature and precipitation in this work) vs GSAT changes for successive decadal periods along the twenty-first century. We investigate the existence of significant linear relationships (scaling) supporting the use of GWLs as an informative climate dimension in these cases (e.g. using the amount of regional change per unit of global warming, e.g. per degree of warming). Moreover, we analyze the influence of the emission scenario in the results. And finally, we analyze the robustness of the regional response, calculated for every decade of the twenty-first century, annotating the mean GWL where the regional climate change signal emerges consistently from natural model variability. Here, we combine the GWL plots with the robustness approach defined in the IPCC (Gutiérrez et al. 2021; Cross-ChapterBoxAtlas.1). The goal is assessing the consistency of CMIP and CORDEX by comparing the results of both approaches (scaling relationships and the robustness in the emergence of the climate change signal) in the reference regions.

The paper is organized as follows. Section 2 presents the data and methods used. Results are described in Sec. 3. Finally, Sect. 4 provides the main conclusions of this work.

2 Data and methods

2.1 CORDEX and CMIP5 regionally-aggregated datasets

We use the regionally aggregated dataset available from the IPCC-WGI Atlas repository for CMIP5 and CORDEX simulations (Iturbide et al. 2021, 2022). This dataset contains monthly means of temperature and precipitation spatially averaged over the IPCC AR6 reference regions (Iturbide et al. 2020), which represent subcontinental climatologically consistent areas. Results are available for land, sea and land-sea grid boxes separately. For the present analysis, we consider the results aggregated over all grid boxes (land-sea dataset) for the historical experiment and the RCP8.5 scenario. The RCP8.5 scenario is chosen because 1) it allows exploring higher GWLs (e.g. 4 ºC) and 2) this is the scenario with the highest number of simulations available, thus reducing the uncertainty related to small ensemble sizes (Dosio et al. 2019).

The Atlas repository provides aggregated information for CMIP5 (28 GCMs, including those used to drive all CORDEX simulations) and CORDEX (33 RCMs, considering all different configurations and versions). For the latter, the dataset includes 11 out of the total of 14 official CORDEX domains covering almost entirely the continental regions of the world with a large number of simulations (Diez-Sierra et al. 2022). The number of available RCMs varies for each CORDEX domain resulting in different ensemble sizes, being Europe the domain with the highest number of RCMs (12) and the Antarctic domain with the lowest (3). The simulations available for each CORDEX domain are accessible through the IPCC WGI AR6 Annex II (IPCC 2021b: Annex II) and the IPCC-Atlas repository (Iturbide et al. 2021, 2022). Following the methodology applied in the IPCC-WGI Atlas (Gutiérrez et al. 2021), the mosaic ensemble approach is used to build a world-wide CORDEX dataset by selecting the CORDEX domain that best fits each AR6 reference region (Diez-Sierra et al. 2022, see their Fig. 1b).

Fig. 1
figure 1

Source: IPCC WGI AR6 Interactive Atlas (Gutiérrez et al. 2021)

Regional precipitation responses to global warming for the Mediterranean region obtained from CMIP5 for the RCP8.5 scenario. The period 1986–2005 is selected as the baseline to compute the regional changes for the twenty-first century. The Mediterranean region is shown in Fig. 4.

Here, we analyze the results for 42 out of the total of 46 land reference regions. Four Asian regions (EEU, WSB, ESB and RFE, grayed out in Fig. 4) were discarded since the CORDEX domains used in the work do not properly cover them (less than 80% overlap). The Arctic region (ARO) was the only oceanic region included in the analysis. For each region we used the corresponding set of CORDEX RCM simulations and the subset of CMIP5 GCMs driving them. The latter were weighted according to the frequency each GCM was used as boundary conditions in the corresponding CORDEX domain, following Boé et al., (2020). This means that, if one specific GCM drives e.g. two different RCMs, then this GCM will weigh twice in the analysis (i.e. to calculate the slope and the robustness). Despite this weighting, note that there are typically several RCMs driven by the same GCM, which tends to enlarge the uncertainty of the CORDEX ensemble with respect to the CMIP5 one.

2.2 Regional responses to global warming: GWL plots

In order to analyze the regional responses of mean temperature and precipitation to different GWLs, we use regional climate sensitivity plots (Seneviratne and Hauser 2020), which are called GWL scaling plots in the IPCC Interactive Atlas (Gutiérrez et al. 2021; http://interactive-atlas.ipcc.ch). The latter are computed in a slightly different way to avoid the redundant information arising from running means that can overlap for different GWLs. Namely, for a given climate variable or derived climate index and a region of interest, the GWL plot is a scatter plot representing the regionally aggregated changes vs the GWL; each point in the plot represents both magnitudes for a particular period (e.g. 10-years) along the twenty-first century. GWLs and regional changes are typically computed using the pre-industrial baseline period 1850–1900.

CORDEX historical simulations do not cover the pre-industrial baseline period and, therefore, an alternative baseline corresponding to modern climate is used in some cases for calculating regional changes. In this study we use the AR5 1986–2005 reference modern period to calculate regional changes, and the pre-industrial baseline period 1850–1900 to calculate the GSAT increase with CMIP5. This modern reference period approach can also be selected in the IPCC Interactive Atlas to compute GWL plots for all available indices for CMIP5 and 6. As an illustrative example, Fig. 1 shows a GWL plot generated by the Interactive Atlas for the Mediterranean (MED) precipitation from CMIP5. The points represent individual model values calculated for every decade of the twenty-first century. On the x-axis, this plot displays the global warming levels computed as changes of CMIP5 mean GSAT relative to the 1850–1900 pre-industrial baseline period, from + 0 to + 6 °C. On the y-axis, regional (MED region) relative changes computed as relative regional changes of precipitation, relative to the 1986–2005 baseline, are displayed. Note that different baselines can be selected in the Interactive Atlas. In these figures, global warming values are calculated using a total of 28 CMIP5 model projections. Every point corresponds to future change values for every model and decade of the twenty-first century (280 points), with different decades in different colors. Red points show the decadal ensemble means.

In the Supplementary Material (Fig. S1), we compare this approach to the location of time periods corresponding to preset GWL values on the x-axis, which are then used to compute the regional average response for that particular period (y-axis). Results are qualitatively very similar and are based on the same background information. However, the decadal means shown in this study and in the Interactive Atlas represent independent samples of the regional response to global warming, while the running mean approach produces redundant information arising from overlapping running windows and also drops valuable information from simulations not reaching high GWLs (see also Fig. SM2).

2.3 Linear scaling and robustness of regional responses

Building on the GWL plots described in the previous section, we investigate the existence of significant linear relationships between the regional climate signals and the GWL across variables and regions. For that, we perform a significance test against the null hypothesis that the linear slope of the corresponding GWL plot is zero, using a Wald t-test with a 99% confidence (p-value below 0.01). Significant relationships indicate scaling of the regional climate change signal with global warming and supports the use of GWLs as an informative climate dimension in these cases (e.g. using the amount of regional change per unit of global warming, e.g. per degree of warming).

These significant scaling relationships are typically found for variables and regions with robust climate change signals (Sect. 3). Therefore, we also calculate the robustness of the regional signals along the decades of the twenty-first century, building on the GWL plot. For each decade, we compute the robustness of the multi-model mean signal using the advanced approach defined in IPCC-WGI AR6 (Gutiérrez et al. 2021, Cross-ChapterBoxAtlas.1). This methodology uses the individual GCM signals to classify the multi-model mean signal into three categories: (1) robust signal, (2) conflicting signal, and (3) no change or no robust signal. Categories 1 and 2 correspond to ensembles with a likely robust signal, characterized by over 2/3 of the models exhibiting robust signals (see below). Model agreement on the sign of change leads then to a robust (category 1) or conflicting (category 2) signal. Category 3 indicates no change (signal close to zero) or no robust change (less than 1/3 of the models exhibiting robust signals). Robust model signals are interpreted as (decadal) signals emerging from natural (decadal) variability. In practice, this is computed using a threshold \(\gamma =1.645\sqrt{2/10}\) σ1yr, corresponding to a 90% confidence interval for decadal interannual variability, where σ1yr is the interannual standard deviation computed from the linearly detrended baseline period 1970–2005 (which is used as an approximation when a long historical period is not available, as in CORDEX). For consistency, and following Gutiérrez et al. 2021 (Cross-ChapterBoxAtlas.1), we use the same period to calculate the threshold \(\gamma\) for CORDEX and CMIP5 historical simulations.

In this work, the GWL plot used in the Interactive Atlas (Fig. 1) is expanded by annotating robustness information for the ensemble mean decadal results, indicating with different symbols the above-mentioned categories for robustness. This allows estimating the mean GWL value for which the regional signal emerges consistently from natural model variability. This is inspired in the concept of temperature of emergence (ToE), defined in previous studies by applying different methodologies to estimate the threshold under which the climate change signal emerges consistently from natural variability (Kirchmeier-Young et al. 2019; Seneviratne and Hauser 2020). Here, we perform a gross estimation of the ToE (defined here as mean GWL emergence) by combining the GWL plots with the robustness approach defined in the IPCC (Gutiérrez et al. 2021, Cross-ChapterBoxAtlas.1; Doblas-Reyes et al. 2021, Sec. 10.4.3.2). Note that, for the sake of comparison between CORDEX and CMIP5, the robustness information calculated in this work are based on regional changes relative to a modern period, and therefore the results are offset as compared to a pre-industrial baseline.

3 Results

We compare the results of the regional response to global warming from CMIP5 and CORDEX for every reference region, variable (temperature and precipitation) and season. GWL plots are used to visualize and test the existence of a linear scaling of the regional response to global warming and to display the robustness of the multi-model decadal regional change signals along the twenty-first century, estimating the mean GWL emergence as defined in Sect. 2.

As an example, we first analyze the GWL plots for two different regions with significant regional signals and responses to global warming (Figs. 2 and 3, corresponding to temperature in the Arctic and precipitation in the Mediterranean, respectively). These figures are illustrative examples of the analysis carried out over the 43 reference regions, for both temperature and precipitation, and for annual and seasonal (DJF, MAM, JJA and SON) temporal aggregations for the regional responses. Note that GWLs are always computed from annual GSAT values.

Fig. 2
figure 2

GWL plots displaying regional air temperature responses to global warming over the Arctic Ocean (ARO) for CMIP (left) and CORDEX (right). Top- and bottom-side panels show the regional response for summer (JJA) and winter (DJF), respectively. Coloured points represent model changes (regional changes vs global warming values) for every decade of the twenty-first century with respect to the baseline; note that different baselines are used for calculating the GWLs (1850–1900) and regional response (1986–2005). Black open dots in the left column indicate results for CMIP5 models which do not drive any CORDEX model for this particular region. β denotes the slope of the linear fit (black line) using all available projections or points (except for the black open dots). The asterisks indicate the significance of the slope –two asterisks (**) for p-value \(p\le 0.01\), one asterisk (*) for \(0.01<p\le 0.05\) and no asterisks for \(p>0.05\)–. Squares represent the decadal ensemble mean values and robustness (red, hatched or crossed-out squares) for every decade of the twenty-first century. Note that regional changes (y-axis) are relative to a modern reference and therefore are offset as compared to a pre-industrial baseline

Fig. 3
figure 3

As Fig. 2, but for regional precipitation responses (relative change, as %) to global warming for the MED region

Figure 2 shows the regional climate response in the Arctic Ocean (ARO) region. Black open dots on the left two panels of the figure correspond to CMIP5 models which did not drive any CORDEX model for this region. These models are not used in the analysis and are only included in the plot to show the dispersion of the entire CMIP5 GCM ensemble, as compared to the GCM selection used to drive CORDEX RCMs (in colors, according to the decade; see legend). Squares represent, for every decade of the twenty-first century, the ensemble mean decadal results and robustness (see legend). As can be seen in Fig. 2, there is a significant relationship (p-value < 0.01, represented by the asterisks in the slope value) between regional temperature changes and GWL. This is true for both CMIP5 and CORDEX (in columns) and for both seasons (in rows), with a very high regional response in winter (DJF), when there is a regional to global warming factor above 4.5. Similar values of the slopes are obtained for CMIP5 and CORDEX, indicating high consistency between both global and regional projections. Regional climate signals are robust (decadal means in red) for all decades in winter, indicating that the climate change signal has already emerged in these model ensembles. In summer, all decades except for the first one(s) are also robust, indicating a mean GWL emergence around 1.5° for CMIP5 and 2° for CORDEX. Note that the first decades are very close to the baseline periods used to compute the climate changes (1986–2005) and to determine the threshold for the robustness analysis (1970–2005). Therefore, the results for these first decades are susceptible to being affected by the internal variability of the models.

Figure 3 shows the regional climate responses of precipitation change in the Mediterranean (MED) region. The decreasing trends are significant in all cases with summer (JJA) changes around -10% per ºC for CMIP5 and -8% per ºC for CORDEX. Winter (DJF) trends are around -3% per ºC for both ensembles. Figure 3 allows distinguishing those CORDEX simulations driven by the same GCM, since they share the same GWL value on the x-axis and, therefore, they are spread only vertically. Robustness is also similar in winter with a mean GWL emergence around 3.5ºC (although CMIP5 results show an interrupted emergence signal), but slightly different for summer, with a GWL emergence between 2.5 and 3.5 degrees, according to CMIP5 and CORDEX, respectively.

Up to now we have assumed that the emission scenario does not influence the regional response to global warming. To test this hypothesis, we compare the regional scaling response (i.e. the slopes β in Figs. 2 and 3) for the different IPCC reference regions for the RCP4.5 and RCP8.5 scenarios. Figure 4 assesses the impact of the emission scenario in the regional scaling responses for temperature (top four panels) and precipitation (bottom four panels). Regional scaling responses, and their differences between CMIP5 and CORDEX, are also shown to compare both sources of uncertainty. Upper left panels show the slope of the reference regions for CORDEX. Upper right panels show the slope differences between CMIP5 and CORDEX. Bottom panels show the slope differences between the scenarios RCP8.5 and RCP4.5 for CMIP5 (bottom-left) and CORDEX (bottom-right). For the bottom panels (scenario assessment) only the simulations available from both scenarios were selected. Moreover, the slopes were calculated with a limit GWL of 3 ºC, which is approximately the maximum GWL reached for RCP4.5. Otherwise, using all the information available for the XXI century, the slopes for the different scenarios can easily show significant differences due to the different sampling uncertainty. Only those regions with significant linear scaling to the 0.01 level were displayed; hatching indicates non-significant slopes in the upper left panels.

Fig. 4
figure 4

Regional scaling response for the different IPCC reference regions for temperature (top four panels) and precipitation (bottom four panels). Regions with significant linear scaling to the 0.01 level were displayed (hatching indicates non-significant slopes in the upper left panels). Upper left panels show the slope of the reference regions for CORDEX. Upper right panels show the slope differences between CMIP5 and CORDEX. Bottom panels show the slope differences between the scenarios RCP8.5 and RCP4.5 for CMIP5 (bottom-left) and CORDEX (bottom-right). For the bottom panels (scenario assessment) only the simulations available for both scenarios were selected. Slopes were calculated with a limit GWL of 3 °C

Figure 4 shows, in general, small differences between the scenarios RCP8.5 and RCP4.5. The differences are of the same order of magnitude as the differences between CMIP5 and CORDEX. For temperature, the average relative difference between global and regional models is around 5%, while the relative difference between scenarios varies from 4 to 6% for CMIP5 and CORDEX. The largest relative differences between scenarios take place in North America, northern Russia and Antarctica (~ 0.3 ºC/ºC). However, the small sample size for the Antarctic CORDEX domain might affect the results in this region. For precipitation, the average difference between projects is around 0.7%, while the difference between scenarios varies from 1.2% to 0.7% for CMIP5 and CORDEX. The largest differences between scenarios take place in Greenland and India for CMIP5 (~ 3%/ºC).

Figure 4 highlights the impact of the ensemble size in the results. The small number of simulations available for both scenarios translates into poor ensemble sizes for some of the domains. This leads to non-significant linear scaling for most regions for precipitation, for which only 11–14 regions show significant linear scaling relationships in the scenario assessment.

For an exhaustive assessment of the consistency of the scaling results obtained for CMIP5 and CORDEX, we use heat maps (Fig. 5) summarizing the regional scaling (i.e. the slopes β in Figs. 23) for the different IPCC reference regions (in rows) and for all seasons (in columns). Regions are sorted according to the magnitude of CMIP5 global warming, with higher values at the top of the figure. In order to facilitate the comparison of the results, each cell is divided in two, representing the results for CMIP5 in the upper triangle and for CORDEX in the lower triangle. As in the annual case (Fig. 4), CMIP5 and CORDEX slopes are remarkably similar for most of the regions and for both variables.

Fig. 5
figure 5

Heat map for the slope of the regional temperature (left panel, units: °C/°C) and precipitation (right panel, units: %/ºC) change with respect to the GWL. Cells division represents the results for CMIP5 (top triangle) and CORDEX (bottom triangle). Different seasons are shown in columns and IPCC reference regions are in rows. Simple hatching hides non-significant slopes to the 0.01 level (double hatching for those not significant to the 0.05 level)

The scaling relationships show high variability across seasons, particularly for precipitation, with significant slopes found typically for particular seasons (hatching in Fig. 5 indicates non-significant slopes). Temperature trends are significant with 99% confidence for the entire subset of regions and seasons analyzed. Regions located in high latitudes of the North hemisphere (ARO, RAR, NEN, NWN and GIC; in rows 1–5) show remarkably higher slopes for temperature in the Boreal winter (DJF) than in the rest of the seasons. Contrarily, Antarctic regions (WAS and EAN, rows 20 and 27) experience the highest regional response in the Boreal summer. The ARO region (row 1) experiences the highest regional warming as well as the highest relative increase in precipitation. The Arctic Amplification, produced by local positive feedbacks (e.g. temperature and surface albedo feedbacks, among others), plays an important role in the region (Liang et al. 2022). Finally, regions with a high percentage of ocean areas (SCA, SWS, NAU, SEA, MDG, CAR, SAU, NZ and SSA, on rows 35–43) present slopes lower than 1 for temperature, which just confirm the milder increase in temperature over oceanic regions as compared to the continental ones.

For temperature, the largest differences (0.3º) between CORDEX and CMIP5 are found in the Arctic Ocean (ARO, on row 1), Russian Arctic (RAR, on row 2), East Central Asia (ECA, on row 6) and Western and Central Europe (WCE, on row 18). The former two (ARO and RAR) are regions with high signal and thus smaller relative differences than in other regions. Regions with seasonal snow cover, such as ECA (row 6), show higher inconsistencies between CMIP5 and CORDEX. Seasonal snow cover is handled by the land surface model and is likely to differ between the RCM and the driving GCM; this deserves further investigation since other discrepancies, such as aerosol treatment, could also play a significant role.

Precipitation shows higher variability than temperature. However, most of the regions show high consistency between CMIP5 and CORDEX. The largest inconsistencies are found in dry regions (e.g. Sahara region, SAH, on row 14), where small changes in precipitation lead to great relative changes. In addition, the driving GCMs usually exhibit large biases over dry regions, which are generally reduced by the RCM simulations (Teichmann et al. 2021). Regions with no significant trends tend to show lower values of slopes β and less consistency between CMIP5 and CORDEX. North and central South America regions (SAM, NSA and NES, in rows 7, 10 and 30) exhibit one of the major deviations between CMIP5 and CORDEX.

Figure 6 shows the mean GWL emergence for both CMIP5 and CORDEX. Darker colors indicate regional climate signals emerging for lower GSAT values, with white cells indicating that the regional climate signal does not emerge consistently during the twenty-first century using the RCP8.5 scenario. Overall, Fig. 6 shows similar values of GWL emergence for temperature and precipitation for CMIP5 and CORDEX. It is important to highlight that all the cells which reach a value of GWL emergence (non-white cells) exhibit significant linear GWL scaling relationships, which demonstrate the consistency of both approaches.

Fig. 6
figure 6

As Fig. 5, but showing the mean GWL value for which the regional signal emerges consistently (°C). White cells indicate no emergence, as obtained from the RCP8.5 scenario for the twenty-first century

For temperature, the inconsistencies found in Fig. 5 do not hold in Fig. 6 for most of the cases. Note that for temperature, CMIP5 and CORDEX reach the mean GWL emergence during the first decades (low GWL) of the twenty-first century, even for those regions where the slope between CMIP5 and CORDEX is different. Western and Central European and Western Antarctic regions (WCE and WAN, in rows 18 and 20) exhibit the largest discrepancies for winter (DJF). However, in both cases, these discrepancies seem to be due to the internal variability of the models that may play an important role in the first decades of the twenty-first century; in fact, the slope β is consistent in both cases (see Fig. 5).

Precipitation shows higher variability than temperature; however, the results of GWL emergence are still remarkably similar for both datasets. Some discrepancies are also found for some regions with seasonal snow cover (Russian Arctic and Greenland/Iceland; RAR and GIC, rows 2 and 5). East Central Asia, Northern South America and East Asia regions (ECA, NSA and EAS, on rows 6, 10 and 22) also show some inconsistencies between CMIP5 and CORDEX for some of the seasons. Tibetan Plateau and North-Western South America regions (TIB and NWS, on rows 8 and 23) show the highest discrepancies for summer (JJA) and winter (DJF) seasons, respectively.

There are some reasons that can be identified to cause the discrepancies found between CMIP5 and CORDEX. It is well known that the change in resolution and improved representation of land surface characteristics significantly affects the representation of the local processes and the feedbacks (land–atmosphere, land-ice, sea-ice, ice-albedo), which might has an impact at regional scale (Goosse et al. 2018; Hohenegger et al. 2009; Iles et al. 2020; Müller et al. 2021; Seneviratne et al. 2006). In addition, several studies agree with the major deviations found here. For the Western and Central European region (WCE, on row 18), for example, CMIP5 is known to project a higher increase in temperature changes than CORDEX for summer, mainly caused by GCM/RCM differences in aerosols representation and atmospheric physics (Boé et al. 2020; Taranu et al. 2022). This also affects evapotranspiration, with likely impact clouds and precipitation, and thus explaining the differences shown in Fig. 5. Another example is the investigation carried out by Falco et al. (2019) in South America. They demonstrate the added value of the RCMs in reproducing the summer climatology for different South American regions situated in tropical and subtropical latitudes. Their results are aligned with the deviations found here for precipitation for South America Monsoon, Northern South America and North-Eastern South America regions (SAM, NSA and NES; in rows 7, 10 and 30; see Fig. 5).

In the supplementary material (Fig. SM3), we show the ToE calculated with the usual approach (see SM for more details). Results are qualitatively very similar for both approaches, however, the decadal means shown in this study can be interpreted as a gross estimation of the ToE shown in Fig. SM3. Note that the ToE is independent of time.

4 Conclusions

We analyzed the consistency of the regional response to global warming between CORDEX and CMIP5 projections. Regional climate change signals for temperature and precipitation as a function of global surface air temperature (GSAT) change were studied for 43 out of 61 reference regions used in the IPCC-WGI AR6, including most of the land regions plus the Arctic Ocean. Furthermore, we assess the influence of the emission scenarios in the results. The results were represented in the form of GWL plots, and their slopes and mean GWL values for which the regional signal emerges consistently were summarized for all regions and seasons.

Our results demonstrate high consistency in the regional response to global warming between global and regional climate projections. CORDEX and CMIP5 datasets show remarkably similar regional scaling relationships with respect to the global warming level for most regions analyzed. For temperature, significant and positive linear trends are found between the regional response and global warming for all regions and seasons analyzed. For precipitation, most regions show significant linear trends in at least one of the seasons. The analysis of robustness reveals similar GWL values for which regional signal emerges consistently between CORDEX and CMIP5. Robust signals were found for all the regions and seasons for temperature after the first decades, which indicates that the GWL emergence has already been exceeded for all the cases. For precipitation, similar GWL emergence values were also found for both datasets, although for some of the regions analyzed the robust climate signal emerged intermittently and therefore the temperature of emergence is not reached in these cases. Those regions in which the robustness analysis emerges consistently also show significant linear scaling relationships between the regional response and global warming; thus indicating that both analyses are consistent.

Regions with large seasonal snow covered areas, mostly located in the Russian Arctic and in North America, show some inconsistencies for the winter months for CORDEX and CMIP5. The largest inconsistencies for precipitation are found in dry regions (e.g. Sahara region) since in these regions small changes of precipitation often lead to great relative changes. Although, overall, we showed the high consistency between CMIP5 and CORDEX, it is key to identify the reasons behind the different response of GCMs and RCMs for some of the regions. For example, Taranu et al. (2022) demonstrate that wrong selection in GCM/RCM pairs (i.e. different configuration in aerosols and atmospheric physics) can explain the inconsistencies found for the Western and Central European region. The lack of plant physiological response to increased CO2 levels in many RCMs has also been identified as a key source of discrepancy over central and northern Europe (Schwingshackl et al. 2019), but these kind of analyses need to be extended to the rest of the regions identified in this study (i.e. regions in which the GCMs and RCMs climate signals differ) in order to understand the mechanisms behind these discrepancies and discern between genuine added value and inconsistent model setup.

Our results show that the influence of the emission scenario (RCP4.5 or RCP8.5) is low, and of the same order of magnitude as the influence of the project (CMIP5 or CORDEX). For temperature, the average relative difference between regional and global models is around 5%, while the difference between scenarios varies from 4% (CMIP5) to 6% (CORDEX). For precipitation, the average difference between regional and global models is around 0.7%, while the difference between scenarios varies from 1.2% (CMIP5) to 0.7% (CORDEX). Our results are consistent with the conclusions obtained by Ishizaki et al. (2012) -i.e. the emission scenario has small but higher influence in middle and high latitudes of the Northern Hemisphere-.

Our study demonstrates that CMIP5 and CORDEX are fairly consistent to assess the regional response to global warming at the large-scale analyzed here. Regional models are expected to provide added value when considering climate at smaller scales and/or for extremes (Giorgi et al. 2016). Some of the inconsistencies found have been linked to inconsistencies between GCMs and nested RCMs that may be reduced in the upcoming CORDEX-CMIP6, currently under development. As an example, the use of evolving aerosols consistent with the driving GCM is strongly encouraged by the new CORDEX-CMIP6 experiment design. This will likely tend to align even further the large-scale regional responses of both initiatives.