1 Introduction

Although the increase of the global mean temperature has slowed down since the late 1990s (Kosaka and Xie 2013), terrestrial hot extremes characterized by extreme high surface air temperature (SAT) have become more frequent in recent decades (Perkins-Kirkpatrick and Lewis 2020). Since the twenty-first century, severe summer hot extremes over the northern extratropics have been frequently observed, such as the ones in Europe in 2003 (Schär and Jendritzky 2004), Russia in 2010 (Coumou and Rahmstorf 2012), United States in 2011 (Luo and Zhang 2012), and China in 2013 and 2015 (Sun et al. 2016; Ma et al. 2017). These hot extremes resulted in severe damages to ecosystems and human society, including forest fires, decreasing agricultural production, and loss of human life (e.g., Ciais et al. 2005; Coumou and Rahmstorf 2012). Indeed, some hot extremes were concomitant with droughts due to strong anti-correlation between temperature and precipitation during summer over the extratropics (Coumou et al. 2018). Due to disastrous impacts of hot extremes and accompanying droughts, there is an urgent need for better understanding the dynamics and physics linked to these extremes.

Hot extremes are usually linked to large-scale circulation anomalies driving subsidence of air mass (Röthlisberger and Martius 2019; Tian et al. 2020), cloud changes that modulate the surface radiation (Jaeger and Seneviratne 2011; Sousa et al. 2018; Tian et al. 2020), and/or enhanced warm air advection (Sousa et al. 2018). For example, a circum-global teleconnection (CGT) pattern, characterized by a zonal wavenumber-5 structure from an empirical orthogonal function (EOF) analysis, results from internal atmospheric dynamics and provides the major source of climate variability and predictability in northern mid-latitudes, especially for U.S. hot extremes (Teng et al. 2013). Ding and Wang (2005) suggested that the maintenance of the CGT pattern depends on interactions between the CGT (i.e., Rossby wave train patterns) and Indian summer monsoon (related to the west-central Asian high). Such CGT pattern was also identified through a self-organizing map (SOM) analysis and tends to be linked to the increasing hot extremes over the Northern Hemisphere (NH) (Lee et al. 2017).

These proximate drivers can be connected to atmospheric chemical composition and sea surface temperature (SST) (e.g., Lenderink et al. 2007; Trenberth et al. 2015; Xie et al. 2015; Horton et al. 2016; Baker et al. 2018). For example, the decadal change of the daytime, nighttime, and compound hot extremes over China is attributed to effects of anthropogenic forcing, such as those related to greenhouse gas concentrations and anthropogenic aerosol emissions (Su and Dong 2019). The change in global SAT from 1970 to 2000s is associated with anthropogenic forcing, in combination with the effect of surface–atmosphere interactions and large-scale circulation changes (Tian et al. 2020). Furthermore, the increase of summer hot extremes over the NH land is linked to the Atlantic Multidecadal Oscillation (AMO) (or an AMO-like SST pattern) (Johnson et al. 2018; Gao et al. 2019), but the Atlantic SST forced trend in daily maximum SAT (over land) during 1979–2012 is much weaker in magnitude than that observed (Johnson et al. 2018). SST effects on hot extremes (or global SAT) have also been documented in other studies (e.g., Kosaka and Xie 2013; Watanabe et al. 2014; Dai et al. 2015; Stolpe et al. 2017; Dai and Bloecker 2019).

Many studies have applied an atmospheric general circulation model (AGCM) to understand roles of atmospheric composition (e.g., CO2 concentration) and SST in the change of atmospheric circulations. For example, Deser and Phillips (2009) applied the Community Atmosphere Model version 3 (CAM3) model to examine the relative roles of direct atmospheric radiative forcing (e.g., changes in well-mixed greenhouse gases) and observed SST change in global atmospheric circulation trends during boreal winter for 1950–2000. Their results showed that atmospheric radiative changes slightly contribute to circulation trends over the Atlantic–Eurasian sector in the NH, while the SST change largely contributes to the intensification of the Aleutian low. In addition, some studies compared circulation responses to quadrupled atmospheric CO2 concentration and to 4 K SST warming using Atmospheric Model Intercomparison Project (AMIP) runs (e.g., Grise and Polvani 2014; Shaw and Voigt 2015, 2016). For example, during summer, CO2 concentration rising and SST warming can result in opposite effects on atmospheric circulation change (Shaw and Voigt 2015). This tug of war on circulations highlights the importance of understanding the effect of CO2 concentration and SST on hot extremes, which are closely tied to circulations.

Based on previous studies that applied the AGCM to examine the roles of atmospheric composition and SST in atmospheric circulations, our study adopts a similar approach to understand their individual and combined effects on terrestrial hot extremes during boreal summer over the northern extratropics. We conduct numerical experiments using CAM version 5 (CAM5) model to investigate how observed CO2 change and observed SST change influence the change of hot extremes in recent decades.

2 Data and Methodology

2.1 Data

Daily mean observed geopotential height, wind, total-cloud cover, and surface downward shortwave and longwave radiations (hereafter SW and LW, respectively) are from the European Centre for Medium Range Weather Forecasts Reanalysis 5 (ERA5) on 2.5° × 2.5° spatial grids (Copernicus Climate Change Service 2017). Daily mean 2-m air temperatures is from ERA5 with a resolution of 1.0° and from the National Centers for Environmental Prediction (NCEP)–National Center for Atmospheric Research (NCAR) reanalysis with a Gaussian grid (Kalnay et al. 1996). The ERA5 and NCEP–NCAR data cover the boreal summer (June–August) for the period of 1979–2018 and 1950–2018, respectively. The monthly mean SST and sea ice concentration (SIC) come from the Hadley Centre Sea Ice and Sea Surface Temperature datasets (HadISST) on 1.0° × 1.0° spatial grids for the period of 1950–2018 (Rayner et al. 2003).

2.2 Observed Hot Extremes

At each grid point over the northern extratropics (20°–80°N, 0°–360°; over land), we define an anomalous hot day when the daily 2-m temperature at the grid point is above 90th percentile for that calendar day over the period of 1979–2018 (Zhao et al. 2017, 2020). Such classifications are performed for all days and the total number of days of hot extremes in summer is computed for each year. We also use another method to define anomalous hot days by comparing the daily 2-m temperature anomaly (deviation from long-term daily climatology) with the 1.5 standard deviation in daily temperature for that calendar day. The composite anomalies for an early/late period (which will be defined later) are calculated by subtracting the summer climatology from the average of the annual data over that period, with the Student’s t test applied to examine the significance.

2.3 CAM5 Experiments

To investigate the individual and combined effects of CO2 concentration and SST on hot extremes, we apply the NCAR CAM5 model (Neale et al. 2012), with a grid of 1.9° × 2.5° (gx1v6) for atmosphere (ocean) and 30 vertical levels in the atmosphere. The control experiment is forced with the climatological SST and CO2 concentration (Table 1). Other factors (e.g., SIC) that may contribute to hot extremes are set as climatology. The model has been validated by comparing the climatological SAT over land in the Control Exp with that observed. Three sets of sensitivity experiments are conducted with each set including an “early” run, a “transition” run, and a “late” run. The transition run is forced with the climatological SST and CO2 concentration. The early and late runs are forced with different combinations of SST and CO2, similar to the setting in Deser and Phillips (2009). For example, in the FULL Exp, the SST for the early and late runs is the SST climatology plus monthly mean NH SST anomaly (10°–70°N, 0°–360°) averaged for early (1979–1995) and late (2002–2018) periods, respectively; and the CO2 for the early and late runs is the averaged CO2 concentration in the early and late periods, respectively.

Table 1 List of main CAM5 experiments. There are three sets of the sensitivity experiments. Each experiment includes an early run, a transition run, and a late run. “Clm” denotes climatology of 1979 − 2018

To be comparable to the observation, the early and late runs are integrated forward for 22 years with the first 5 years discarded as spin-up, and the transition run is integrated forward for 11 years with the first 5 years discarded (Fernando et al. 2016). Thus, each set of sensitivity experiments includes 40 members, with 17 ensembles in early runs (corresponding to the 17 years in the early period), 17 in late runs (for the late period), and 6 in the transition run (for the transition period). The definition of hot extremes in each set of sensitivity experiments is similar to that observed, i.e., by comparing the daily SAT with 90th percentile for that calendar day among all members (total of 40 in each set). The composite anomaly for an early (late) run in each set is calculated by subtracting its 40-member average from the average of the 17 ensembles of the early (late) run in that set.

3 Results

3.1 Decadal Change of Hot Extremes

Figure 1a shows the linear trend of the total number of days (using the ERA5 dataset and the definition of exceeding the 90th percentile) of hot extremes over the northern extratropics during the period of 1979 − 2018. The linear trend has the largest magnitudes over five key regions: Europe (35°–60°N, 10 W°–50°E), northern Africa–western Asia (20°–35°N, 18 W°–65°E), eastern Asia (20°–60°N, 95°–135°E), western North America (20°–45°N, 95°–125°W), and Greenland (60°–80°N, 15°–70°W). Such a trend bears a strong resemblance to the observed trend in daily maximum SAT in Johnson et al. (2018). The domain-averaged anomalies of hot extremes days and SAT (over land) have a positive trend (significant at the 0.01 level), with the positive anomaly of hot extremes days from 2002 (Fig. 1b). The increase of the extremes is not sensitive to either the definition of extremes or dataset applied. Hence, the ERA5 dataset and the definition of exceeding the 90th percentile are used in the following analyses. The early period and late period are defined as 1979–1995 and 2002–2018, respectively, with a transition period during 1996 − 2001. There are more hot extremes in the late period and this difference is statistically significant at the 0.05 level over the five key regions (Fig. 2a).

Fig. 1
figure 1

a Linear trend of the total number of days of hot extremes (shading; days per year) for the period of 1979 − 2018. Black boxes indicate the five key regions. b The domain averaged (20°–80°N, 0°–360°; over land) anomaly of the total number of days of hot extremes defined by 90th percentile using ERA5 dataset, 1.5 standard deviation using ERA5 dataset, and 90th percentile using NCEP–NCAR dataset, and the domain averaged (over land) summer mean SAT (°C) using ERA5 from 1979 to 2018. c Difference in summer mean SST (°C) between the late and early periods in the observation. Stippling in a and c indicates regions significant at the 5% level according to the Student’s t-test

Fig. 2
figure 2

a Difference in the total number of days of hot extremes (shading; days) between the late and early periods in the observation. b–d Same as in a, but between the late and early run in three sets of CAM5 experiments. Stippling indicates regions significant at the 5% level according to the Student’s t-test. Black boxes indicate the five key regions

3.2 Role of CO2 Change

First, we conduct the CAM5 experiment by changing the CO2 only with climatological SST (CO2 Exp). The spatial distribution of the extremes is either spatially shifted over the eastern Asia and western North America or insignificant over Europe, northern Africa–western Asia, and Greenland (Fig. 2b). The domain averaged anomaly of hot extremes days in the CO2 Exp is obviously smaller than that observed for the northern extratropics and five key regions (Fig. 3a), consistent with the smaller magnitude of the anomaly in the spatial map.

Fig. 3
figure 3

a Observed and simulated anomalies of total number of days of hot extremes (days) averaged over the northern extratropics and five key regions (over land). b–f Same as in a, but for geopotential height at 300 hPa (m), synoptic EKE at 300 hPa (m2 s–2), total-cloud cover, SW (W m–2), and LW (W m–2), respectively. The 5% significant level according to the Student’s t-test is denoted by error bars. nEX, northern extratropics; EU, Europe; nAF–wAS, northern Africa–western Asia; eAS, eastern Asia; wNA, western North America; GL, Greenland

We further examine dynamic and thermodynamic factors associated with hot extremes. In the observation, the summer mean flow shows a positive geopotential height anomaly at 300 hPa, with high-pressure centers over the eastern Europe, mid-latitude eastern Asia, eastern Siberia, west coast of North America, and Greenland (Fig. 4a; shading). The hemispheric-scale circulation anomaly with the zonal wavenumber-5 structure bears some similarity with the CGT pattern identified in Teng et al. (2013) and Lee et al. (2017). This result indicates the dominant role of the CGT pattern in the summertime temperature variability over the northern extratropics. Observed synoptic eddy kinetic energy (EKE; Zhao et al. 2020) based on 2.5–6-day bandpass-filtered daily winds shows a reduction over mid- and high-latitudes (Fig. 4a; contours), indicating a weakening of fast-moving Rossby waves (Coumou et al. 2015, 2018). The relationship between eddy and mean flows, i.e., high pressure anomaly versus low EKE anomaly, can be understood via a non-modal instability analysis (Zhao et al. 2018, 2020; Zhao and Deng 2020; Mak et al. 2021). Figures 5a and 6a show observed anomalies of total-cloud cover, SW and LW, which can act as feedbacks to circulation changes.

Fig. 4
figure 4

a Difference in geopotential height (shading; m) and synoptic EKE (contours; m2 s–2) anomalies at 300 hPa between the late and early periods in the observation. b–d Same as in a, but between the late and early run in three sets of the CAM5 simulations. Red solid and blue dashed lines represent the positive (1.5 and 4.0 m2 s–2) and negative values (–1.5 and –4.0 m2 s–2), respectively

Fig. 5
figure 5

a–d Same as in Fig. 4, but for total-cloud cover (shading) and SW (contours; W m–2). Red solid and blue dashed lines represent the positive (4 and 10 W m–2) and negative values (–4 and –10 W m–2), respectively

Fig. 6
figure 6

a–d Same as in Fig. 4, but for LW (shading; W m–2)

In the CO2 Exp, the magnitude of geopotential height anomaly over the northern extratropics is underestimated (Fig. 4b), accompanied by a misrepresentation of total-cloud cover, SW and LW anomalies, especially over the five key regions (Figs. 5b and 6b). For example, over Europe, the magnitude of upper-level geopotential height and synoptic EKE anomalies is much smaller compared to that in the observation. The weak high pressure and strong synoptic activity (characterized by positive synoptic EKE anomaly) lead to more frequent maritime air masses movement, creating a wet and cool condition over Europe. The total-cloud cover and surface downward radiations anomalies are also small in magnitude. The large difference between the observation and CO2 Exp can be also seen in domain averaged fields (Fig. 3). The magnitude of the domain averaged variables (blue bars) is generally smaller than that in the observation for the northern extratropics and five key regions. The reason for the difference is probably because the SST is set as climatology in the CO2 Exp. As we know, the majority of excess heat in the Earth system due to rising greenhouse gases (e.g., CO2) concentration ends up in oceans (e.g., von Schuckmann et al. 2020).

3.3 Role of the NH SST Change

In this section, we investigate the role of SST in hot extremes. Figure 1c shows that the largest SST anomaly (difference between the late and early periods) occurs over the NH (10°–70°N, 0°–360°), especially for 15°–50°N and 40°–65°N over the Pacific and Atlantic, respectively. It is interesting to note that the early period (1979–1995) corresponds to the positive and negative phases of the Pacific Decadal Oscillation (PDO) and AMO, respectively, while the late period (2002–2018) is associated with the opposite phases. Moreover, the role of SST over this region (i.e., 10°–70°N, 0°–360°) in hot extremes is not as known as that over the entire Pacific or Atlantic (e.g., Johnson et al. 2018) and tropics (e.g., Kosaka and Xie 2013). Thus, we focus on the impact of the NH SST on the hot extremes.

We conduct an experiment by changing SST over 10°–70°N, 0°–360° with constant CO2 concentration (NH-SST Exp). The NH-SST Exp exhibits a positive hot extremes anomaly over northern mid-latitudes (20°–60°N) (Fig. 2c), especially over four key regions (except for Greenland). Compared to the result in the CO2 Exp, the pattern of hot extremes anomaly in the NH-SST Exp is much closer to the observed pattern. The domain averaged hot extremes anomaly in the NH-SST Exp is also closer to the observation (Fig. 3a). It is also noted that the hot extremes anomaly over mid-latitudes (e.g., Europe, eastern Asia, and western North America) is better simulated compared to that over high-latitudes (e.g., Greenland).

Furthermore, the NH-SST Exp generally well simulates the mean and eddy flows over mid-latitudes, despite some discrepancies over the Pacific–North American sector (Fig. 4c). In particular, the high pressure and low EKE anomalies over Europe and eastern Asia are well captured by the model. Varying degrees of accuracy are shown for clouds and surface downward radiations (Figs. 5c and 6c). For example, the negative anomaly of the total-cloud over the eastern Asia and western North America, and positive anomaly over the northern Africa–western Asia are well simulated, but the large magnitude of negative anomaly over Europe is underestimated (Fig. 5c). As SW is linked to total-cloud cover, similar representation could be found for SW. The observed positive LW anomaly over the northern Africa–western Asia is also well captured by the NH-SST Exp. For the domain averaged variables, their signs are generally same as those in the observation despite their magnitudes are smaller (Fig. 3). These results suggest that the NH SST (mainly over 10°–70°N) is able to trigger hot extremes anomaly similar to that observed, as well as a good representation of dynamic and thermodynamic factors.

3.4 Combined Effects of the NH SST and CO2

The experiment with both NH SST and CO2 concentration changes (FULL Exp) shows a more similar pattern of hot extremes change compared to that of the single-change experiments, especially over high-latitudes (e.g., Greenland) (Fig. 2d). Figure 3a shows that the domain averaged anomaly of hot extremes days in the FULL Exp (5.5 ± 2.7 days) is very close to that observed (5.2 ± 2.4 days) over the northern extratropics. This result indicates the effectiveness of the decomposition of the total contribution into CO2 change and SST change. Compared to Johnson et al. (2018) that applies a coupled GCM with nudged SST, the FULL Exp with the combined SST and CO2 changes shows a more similar pattern to the observed hot extremes change over the northern Africa–western Asia and Greenland. Such similarity is likely to be reflected by prescribing the observed SST change over the NH oceans (where the largest SST anomaly occurs) in the AGCM.

Consistent with the result of the hot extremes, the geopotential height anomaly is well simulated in the FULL Exp (except for Greenland), as well as the synoptic EKE anomaly (Fig. 4d). The circulation anomaly over the Aleutian Islands is well represented due to the incorporation of the CO2 change. Over Europe, eastern Asia, and western North America, the observed positive SW anomaly is linked to negative total-cloud cover anomaly (Figs. 5a and 6a), due to cloud albedo effect (NASA Facts 1999). The negative total-cloud cover and positive SW anomalies over these three key regions are well captured by the FULL Exp (Fig. 5d). In addition, the observed positive total-cloud cover anomaly over the northern Africa–western Asia and Green land leads to the large LW anomaly, due to cloud greenhouse effect (NASA Facts 1999). The large magnitude of LW anomaly over the northern Africa–western Asia could largely explain the hot extremes change there and such LW anomaly is well simulated by the FULL Exp (Fig. 6a, d). For the domain averaged variables, both signs and magnitudes are very similar to those in the observation (Fig. 3), further confirming effectiveness of the decomposition of the total contribution into CO2 change and NH SST change. Overall, our results show that the combined effects of the NH SST and CO2 can lead to more hot extremes in recent decades, similar to those observed. If it were not for the oceans’ capacity to store the excess heat due to the rising CO2 concentration, the positive trend in hot extremes days would be substantially reduced (i.e., the results in the CO2 Exp).

3.5 Role of other Possible Factors

The aforementioned analyses suggest that SST over the NH (especially 10°–70°N) plays a significant role in hot extremes under global warming. To further understand individual roles of the North Pacific and North Atlantic, we conduct two experiments by changing SST over the North Pacific (10°–70°N, 110°E–100°W) and the North Atlantic (10°–70°N, 80 W°–360°), respectively (Table 2). The result shows that neither basin solely can capture the hot extremes change over mid-latitudes (Fig. 7a, b). For completeness, we further investigate the impact of the Arctic sea ice loss, which was found to be important for hot extremes over Europe (Zhang et al. 2020). From 1979 to 2018, there was a significant downward trend in the Arctic sea ice and such trend is linked to Arctic amplification (Coumou et al. 2018). We conduct the experiment by changing the SIC only (Table 2). The result shows that the SIC forcing well captures the increasing hot extremes over Europe but misrepresents changes over other key regions (Fig. 7c). In addition, the influence of the tropical SST on northern extratropical SAT has been documented (e.g., Kosaka and Xie 2013). We conduct the experiment by changing SST over the tropics (20°S–20°N, 0°–360°) (Table 2). Figure 7d shows that the increasing hot extremes over the northern Africa–western Asia, western North America (lower latitudes), eastern Asia, and Greenland are well captured by the model, while the magnitude of such change over the last two regions is much smaller than that in the observation. Overall, the additional experiments suggest that the hot extremes change simulated by changing North Pacific (or Atlantic) or Arctic SIC is not as significant as that in the observation, while tropical SST is also an important factor for the hot extremes change.

Table 2 List of five additional CAM5 experiments. Each experiment includes an early run, a transition run, and a late run. “Clm” denotes climatology of 1979 − 2018 and 1950 − 2018 for the first four experiments and the fifth experiment, respectively
Fig. 7
figure 7

a–d Difference in the total number of days of hot extremes (shading; days) between the late and early run in four sets of additional CAM5 experiments (for the period of 1979–2018). Stippling indicates regions significant at the 5% level according to the Student’s t-test. Black boxes indicate the five key regions

As mentioned in previous sections, we focus on and prescribe the SST anomaly over the NH (especially 10°–70°N) because 1) the largest SST anomaly (i.e., difference between the late and early periods) occurs over this region (Fig. 1a) and 2) the role of SST over this region is not as known as that over the entire Pacific or Atlantic (e.g., Johnson et al. 2018) and tropics (e.g., Kosaka and Xie 2013). Despite the significant role of the NH SST warming, the source for the SST warming is still unclear. Previous studies have suggested that the change in SST is related to both anthropogenic forcing (e.g., CO2 concentration rising) and natural variability (Meehl et al. 2009, 2013; Chan and Wu 2015; Hua et al. 2018; Liguori et al. 2020). In the current studying period (1979–2018), the early period corresponds to the positive and negative phases of the PDO and AMO, respectively, while the late period is linked to the opposite phases. It is uncertain whether the hot extremes change is caused by the phase change of the oceanic modes or the warming trend of SST (due to anthropogenic forcing).

To better estimate the effect of anthropogenic forcing (i.e., warming trend) versus natural variability (i.e., phase change of PDO and AMO), we analyze observational data and conduct an experiment for a longer period (1950–2018). The early and late periods are defined as 1950–1979 and 1989–2018, respectively, with a transition period during 1980 − 1988. Note that the period of 1950–1979 or 1989–2018 is not associated with a single phase of those oceanic modes (e.g., PDO and AMO) and thus we may better understand the role of anthropogenic forcing (i.e., warming trend). In the observation, in contrast to Fig. 2a, there are less hot extremes in the late period (1989–2018) over the northern Africa–western Asia and eastern Asia (Fig. 8a). Then, we conduct an experiment by changing SST over 10°–70°N, 0°–360° for 1950–1979 and 1989–2018 (Table 2). The hot extremes pattern for the NH-SST Exp with the longer period shares some similarities with that in the observation, especially less hot extremes during the late period over the northern Africa–western Asia and eastern Asia (Fig. 8b). Without the influence of the phase change of oceanic modes, the increasing hot extremes over these two keys regions is not shown for the analysis with the longer period. This result indicates that the increasing hot extremes over these two regions (shown in Fig. 2a, c) are probably associated with the certain time period (i.e., 1979–2018) and the influence of the phase change of oceanic modes during this period. For other key regions, both anthropogenic forcing (warming trend) and natural variability (phase change of PDO and AMO) play a role in the increasing hot extremes.

Fig. 8
figure 8

a Difference in the total number of days of hot extremes (shading; days) between the late (1989–2018) and early (1950–1979) period in the observation (using NCEP–NCAR reanalysis data) for the period of 1950–2018. b Same as in a, but between the late and early run in CAM5 NH-SST experiments (for the period of 1950–2018). Stippling indicates regions significant at the 5% level according to the Student’s t-test. Black boxes indicate the five key regions

4 Concluding Remarks

This study investigates possible causes for the increasing hot extremes over the northern extratropics during summer. The past four decades have seen an increase of hot extremes, accompanied by the NH SST warming and CO2 concentration rising. Through a series of modeling experiments with the CAM5 model, we show that the increase of hot extremes is largely due to these two factors. However, the rising CO2 concentration alone cannot trigger the observed change of hot extremes days, as well as mean and eddy flows, clouds, and surface downward radiations, due to the lack of SST adjustment. Due to the oceans’ capacity to store a large amount of heat in the Earth system, the incorporation of the NH SST change strengthens terrestrial warming over the northern extratropics, as shown by capturing anomalies of hot extremes days and those dynamic and thermodynamic factors. Specifically, over Europe, eastern Asia, and western North America, the experiment with both NH SST and CO2 concentration changes (FULL Exp) well simulates SW radiation, which is modulated by total-cloud cover and eddy-mean flows. Over the northern Africa–western Asia and Greenland, the FULL Exp faithfully captures LW radiation, which is responsible for the increasing hot extremes over these two regions. Finally, our study discusses potential roles of natural variability (e.g., PDO and AMO) and other possible factors (e.g., Arctic sea ice and tropical SST).

One caveat of the current study is that the modeling setting in the CAM5 experiments cannot really disentangle the individual roles of the NH SST and CO2 because the change in SST is related to both anthropogenic forcing and natural variability (Meehl et al. 2009, 2013; Chan and Wu 2015; Hua et al. 2018; Liguori et al. 2020). To fully exclude the effects of CO2 on SST, one might need to design an experiment where SST evolves freely and CO2 concentration remains fixed. Such an experiment is part of our ongoing and future works. Despite the caveat, this study truly highlights the importance of the NH oceans (especially over the extratropics) to the northern extratropical hot extremes under global warming. In addition, this study only analyzes outputs of the CAM5 model and model dependency may influence the results. Conducting modeling experiments using different models is encouraged for future studies.